%A Nigam,Deepti %A Saxena,Swati %A Ramakrishna,G. %A Singh,Archana %A Singh,N. K. %A Gaikwad,Kishor %D 2017 %J Frontiers in Molecular Biosciences %C %F %G English %K Cajanus scarabaeoides (L.) Thouars.,cultivated wild relatives (CWR),de novo assembly,Illumina sequencing,SSRs %Q %R 10.3389/fmolb.2017.00048 %W %L %M %P %7 %8 2017-July-12 %9 Data Report %+ Kishor Gaikwad,Indian Council of Agricultural Research-National Research Centre on Plant Biotechnology,New Delhi, India,kish2012@nrcpb.org %# %! %* %< %T De novo Assembly and Characterization of Cajanus scarabaeoides (L.) Thouars Transcriptome by Paired-End Sequencing %U https://www.frontiersin.org/articles/10.3389/fmolb.2017.00048 %V 4 %0 JOURNAL ARTICLE %@ 2296-889X %X Pigeonpea [Cajanus cajan (L.) Millsp.] is a heat and drought resilient legume crop grown mostly in Asia and Africa. Pigeonpea is affected by various biotic (diseases and insect pests) and abiotic stresses (salinity and water logging) which limit the yield potential of this crop. However, resistance to all these constraints is not readily available in the cultivated genotypes and some of the wild relatives have been found to withstand these resistances. Thus, the utilization of crop wild relatives (CWR) in pigeonpea breeding has been effective in conferring resistance, quality and breeding efficiency traits to this crop. Bud and leaf tissue of Cajanus scarabaeoides, a wild relative of pigeon pea were used for transcriptome profiling. Approximately 30 million clean reads filtered from raw reads by removal of adaptors, ambiguous reads and low-quality reads (3.02 gigabase pairs) were generated by Illumina paired-end RNA-seq technology. All of these clean reads were pooled and assembled de novo into 1,17,007 transcripts using the Trinity. Finally, a total of 98,664 unigenes were derived with mean length of 396 bp and N50 values of 1393. The assembly produced significant mapping results (73.68%) in BLASTN searches of the Glycine max CDS sequence database (Ensembl). Further, uniprot database of Viridiplantae was used for unigene annotation; 81,799 of 98,664 (82.90%) unigenes were finally annotated with gene descriptions or conserved protein domains. Further, a total of 23,475 SSRs were identified in 27,321 unigenes. This data will provide useful information for mining of functionally important genes and SSR markers for pigeonpea improvement.