New Hypervariable SSR Markers for Diversity Analysis, Hybrid Purity Testing and Trait Mapping in Pigeonpea [Cajanus cajan (L.) Millspaugh]

Draft genome sequence in pigeonpea offers unprecedented opportunities for genomics assisted crop improvement via enabling access to genome-wide genetic markers. In the present study, 421 hypervariable simple sequence repeat (SSR) markers from the pigeonpea genome were screened on a panel of eight pigeonpea genotypes yielding marker validation and polymorphism percentages of 95.24 and 54.11%, respectively. The SSR marker assay uncovered a total of 570 alleles with three as an average number of alleles per marker. Similarly, the mean values for gene diversity and PIC were 0.44 and 0.37, respectively. The number of polymorphic markers ranged from 39 to 89 for different parental combinations. Further, 60 of these SSRs were assayed on 94 genotypes, and model based clustering using STRUCTURE resulted in the identification of the two subpopulations (K = 2). This remained in close agreement with the clustering patterns inferred from genetic distance (GD)-based approaches i.e., dendrogram, factorial and principal coordinate analysis (PCoA). The AMOVA accounted majority of the genetic variation within groups (89%) in comparison to the variation existing between the groups (11%). A subset of these markers was implicated for hybrid purity testing. We also demonstrated utility of these SSR markers in trait mapping through association and bi-parental linkage analyses. The general linear (GLM) and mixed linear (MLM) models both detected a single SSR marker (CcGM03681) with R2 = 16.4 as associated with the resistance to Fusarium wilt variant 2. Similarly, by using SSR data in a segregating backcross population, the corresponding restorer-of-fertility (Rf) locus was putatively mapped at 39 cM with the marker CcGM08896. However, The marker-trait associations (MTAs) detected here represent a very preliminary type and hence demand deeper investigations for conclusive evidence. Given their ability to reveal polymorphism in simple agarose gels, the hypervariable SSRs are valuable genomic resource for pigeonpea research community, particularly in South Asia and East Africa where pigeonpea is primarily grown.

Draft genome sequence in pigeonpea offers unprecedented opportunities for genomics assisted crop improvement via enabling access to genome-wide genetic markers. In the present study, 421 hypervariable simple sequence repeat (SSR) markers from the pigeonpea genome were screened on a panel of eight pigeonpea genotypes yielding marker validation and polymorphism percentages of 95.24 and 54.11%, respectively. The SSR marker assay uncovered a total of 570 alleles with three as an average number of alleles per marker. Similarly, the mean values for gene diversity and PIC were 0.44 and 0.37, respectively. The number of polymorphic markers ranged from 39 to 89 for different parental combinations. Further, 60 of these SSRs were assayed on 94 genotypes, and model based clustering using STRUCTURE resulted in the identification of the two subpopulations (K = 2). This remained in close agreement with the clustering patterns inferred from genetic distance (GD)-based approaches i.e., dendrogram, factorial and principal coordinate analysis (PCoA). The AMOVA accounted majority of the genetic variation within groups (89%) in comparison to the variation existing between the groups (11%). A subset of these markers was implicated for hybrid purity testing. We also demonstrated utility of these SSR markers in trait mapping through association and bi-parental linkage analyses. The general linear (GLM) and mixed linear (MLM) models both detected a single SSR marker (CcGM03681) with R 2 = 16.4 as associated with the resistance to Fusarium wilt variant 2. Similarly, by using SSR data in a segregating backcross population, the corresponding restorer-of-fertility (Rf) locus was putatively mapped at 39 cM with the marker CcGM08896. However, The marker-trait associations (MTAs) detected here represent a very preliminary type and hence demand deeper investigations for conclusive evidence. Given their ability to reveal polymorphism in simple agarose gels, the hypervariable SSRs are valuable genomic resource for pigeonpea research community, particularly in South Asia and East Africa where pigeonpea is primarily grown.
Keywords: pigeonpea, SSR, diversity, hybrid, polymorphism, genome INTRODUCTION Pigeonpea [Cajanus cajan (L.) Millspaugh] is an important grain-legume crop grown in 7.03 mha area with a total production of 4.89 mt from tropical and subtropical regions of the world (FAOSTAT, 2014). India is the largest producer of pigeonpea, contributing 67.3% to the global production, followed by Myanmar, Malawi, and Kenya (FAOSTAT, 2014). Owing to its inherent capacity to grow in low input and rainfed conditions coupled with its multiple uses as food, feed, forage and fuel wood, pigeonpea serves as a valued cash crop for small scale and marginal farmers (Sameer Kumar et al., 2016). A deep and extensive root system of pigeonpea contributes to improve structural and physical properties of the soil, a feature that could be harnessed to check soil erosion (Krauss, 1936). From the 32 species known under the sub-tribe Cajaninae, C. cajan is the only cultivated species (Van der Maesen, 1990;Odeny, 2007). The breeding efforts aimed at improving pigeonpea led to the development and release of more than 100 improved varieties during last 50 years in India . However, the genetic gains from conventional breeding remained limited over same period of time (Varshney et al., 2013). This implies toward an urgent need to strengthen pigeonpea breeding program with the modern tools to improve their efficacy.
According to Temnykh et al. (2001) SSRs with ≥20 nucleotides and <20 nucleotides are referred to as Class I (hypervariable) and Class II, respectively. The significance of hyper-variable SSRs in pigeonpea has been established owing to their ease of scoring in simple agarose gel (Singh et al., 2012;Dutta et al., 2013). Recently, decoding of the whole genome sequence of Asha (ICPL 87119) by Varshney et al. (2012) has enabled access to more than 23,000 primer pairs for SSRs (CcGM series). The present study aims to validate a set of 421 SSRs of this CcGM series. We further demonstrated the usefulness of these SSRs through investigating the genetic diversity and population structure of cultivated pigeonpea. The potential of these SSR markers in hybridity testing was also assessed on agarose gel. Also, we developed a partial linkage map of a backcross population using these SSRs and the Rf locus possibly responsible for A 2 -cytoplasmic male sterility (CMS) restoration was mapped. In parallel, these SSRs were also employed in association mapping to detect putative marker trait association (MTA) for Fusarium (Fusarium udum) wilt (FW) resistance in pigeonpea.

Plant Material
The experimental material comprised of 94 pigeonpea genotypes, which included 59 varieties, released during 1975-2015, 23 breeding lines/accessions, four exotic lines, four landraces, three fertility restorers, and two CMS lines ( Table 1). These genotypes are relevant to pigeonpea breeding objectives owing to their suitability to different agro-climatic zones and having several traits of interests such as high yield and disease resistance etc (Supplementary Table 1). A subset of these genotypes (40) was screened against Fusarium wilt (FW) variant 2 during 2014-15 in the wilt sick field at IIPR, Kanpur (Supplementary Table  2). The 40 genotypes were selected because of their breeding value (extensively used by the breeders) for the improvement of resistance to FW. Few genotypes, e.g., IPA 8F, IPA 9F, and IPA 16F were registered with the National Bureau of Plant Genetic Resources (NBPGR), New Delhi as donors for FW resistance . Similarly, the genotypes like BDN 2, C11, MAL 13, and ICP 8863 are being used as differential set to characterize virulence of the wilt pathogen. The experiment was conducted in randomized block design (RBD) with two replications. The FW incidence was recorded on plants from November to February as the weather conditions during this period favor wilt incidence. Percent incidence was calculated using the formula: % Disease incidence = (Number of infected plants/Total number of plants screened) × 100.
Arcsine transformation was used to normalize the data.

Genomic DNA Extraction
Genomic DNA was extracted from 94 pigeonpea genotypes for diversity analysis. Initially eight genotypes (Type 7, ICP 8863, PA 163A, ICPA 2089, AK 261322R, AK 261354R, AK 250189R, AK 250173R) were selected to assess the amplification status of SSR markers. For hybrid purity testing, DNA was extracted from 10 individuals each of three CMS hybrids viz. IPAH 16-06, IPAH 16-07, and IPH 15-03. To perform genetic linkage analysis, DNA from 102 individuals of a backcross population [ICPL 88039A × (ICPL 88039A × AK 250189R)] was also extracted. Also, to gain insights about within genotype variability in landraces DNA was isolated from 15 individuals each of six genotypes (two varieties and four landraces). Genomic DNA was isolated from young leaves according to Cuc et al. (2008). The quantity and quality of DNA was estimated through electrophoresis using 0.8% agarose gel (Sigma-Aldrich, St. Louis, USA).

SSR Analysis
For amplification of genomic DNA, a reaction mixture of 10 µl volume was prepared using 5.9 µl of sterilized distilled water,  1.0 µl template DNA (25 ng), 0.5 µl of forward and 0.5 µl of reverse primer (5 µM), 1.0 µl 10 × PCR buffer (10 mM Tris-HCl, 50 mM KCl, pH 8.3), 1.00 µl dNTP mix (0.2 mM each of dATP, dGTP, dCTP, and dTTP), and 0.1 µl Taq polymerase (5 U/µl) (Thermo Scientific, Mumbai, India). The amplification was carried out in G-40402 thermo cycler (G-STORM, Somerset, UK) in a touchdown PCR profile, in which the following condition was set: Initial denaturation at 94 • C for 5 min followed by 10 cycles of touchdown 55-45 • C, 20 s at 94 • C, annealing for 20 s at 55 • C (the annealing temperature for each cycle being reduced by 1 • C per cycle) and extension for 30 s at 72 • C. This was accompanied by 40 cycle of denaturation at 94 • C for 30 s, annealing at 50 • C for 30 s, elongation at 72 • C for 45 s, and 10 min of final extension at 72 • C. Amplified products were resolved in 3% agarose gel using 0.5× TBE running buffer and images were analyzed in Quantity one software (Bio-Rad, CA 94547, USA).

Genetic Diversity and Population Structure Analysis
The genetic diversity parameters such as major allele frequency, polymorphic information content (PIC), heterozygosity and alleles per locus were computed using PowerMarker v. 3.25 (Liu and Muse, 2005). Nei's gene diversity (h) and Shannon information index (I) were estimated in the sample using POPGENE v. 1.32. The model-based Bayesian approach was used to infer population structure with STRUCTURE v. 2.3.4 (Pritchard et al., 2000). The project was run with the admixture model and correlated allele frequency using burn in period of 20,000 and 200,000 Markov chain Monte Carlo (MCMC) replications. Five independent runs were performed with each K value ranging from 1 to 10. Evanno's k value was calculated by using STRUCTURE HARVESTER program by processing the STRUCTURE results (Evanno et al., 2005). Concerning distance based clustering, DARwin v. 6.0.13 (Perrier and Jacquemoud-Collet, 2006) was employed to generate genetic distance (GD) matrix, which was then used to create dendrogram using unweighted neighbor joining. Factorial analysis was also performed with GD matrix created using DARwin software. We performed analysis of molecular variance (AMOVA) among and within subpopulations (assigned by STRUCTURE), implemented in GenAlEx (Peakall and Smouse, 2012). GenAlEx was also employed to cluster the

Trait Mapping
The MTA for FW resistance was detected using TASSEL v. 2.1 (Bradbury et al., 2007) as described by Iqbal and Rahman (2017). The general linear model (GLM) and mixed linear model (MLM) based on Q matrix and Q+K matrix, respectively were employed to discover SSR markers associated with FW resistance. The structure analysis generated the Q matrix, while the relative kinship matrix was calculated by TASSEL software. Significant MTAs were declared at P ≤ 0.05 with corresponding R 2 indicating the phenotypic variation explained by the MTA. In parallel, a "coarse" linkage map was developed for the backcross population using IciMapping v. 4.1 with a minimum logarithm of odds (LOD) value of 2.5 (Meng et al., 2015). Segregation data was assembled for 75 SSR markers. By using 1% acetocarmine solution, segregation data on pollen fertility and sterility were reported earlier in this backcross population (Bohra et al., 2017). Kosambi mapping function was used to calculate genetic distances. Linkage map was drawn with MapChart v. 2.5 (Voorrips, 2002).

Validation of Hypervariable SSRs from Pigeonpea Genome
A set of 23,410 primer pairs was designed out of 309,052 SSRs identified in pigeonpea genome (Varshney et al., 2012). In the current analysis, 421 SSRs were selected based on length of the SSR tract (Class I or hypervariable) for amplification in pigeonpea lines. These 421 SSR markers were assayed on eight pigeonpea genotypes viz. Type 7, ICP 8863, PA 163A, ICPA 2089, AK 261322R, AK 261354R, AK 250189R, AK 250173R that are parents of different mapping populations (F 2 and backcross) segregating for important traits such as Fusarium wilt and fertility restoration. Of the total 421 markers, 401 provided scorable amplicons and 217 markers generated polymorphic fragments. Twenty SSR markers failed to show amplification in any of the eight genotypes (Supplementary Table 3). In perfect SSR category (as defined by Weber, 1990), tetra-(NNNN), and penta-nucleotide (NNNNN) repeats showed 60% polymorphism followed by tri (NNN) and hexa-nucleotide (NNNNNN) repeats ( Table 2).
The polymorphism information content (PIC) of these 217 SSRs was in the range of 0.11-0.71 with an average 0.38 (Table 3). A total of 570 alleles were detected and the number of alleles per marker ranged from 2 to 6 with an average of 3. Similarly, the range of gene diversity lied between 0.12 and 0.75 with a mean of 0.44. The highest PIC value was shown by the marker CcGM20721, while the marker CcGM21506 generated maximum number of alleles across eight genotypes. We also attempted to find out a relation between the length of the SSR tract and the PIC value with the dataset comprising only perfect SSRs. However, a weak positive relationship (r = 0.1427 and r 2 = 0.0204) could be inferred from the dataset.
In the case of pairwise polymorphism among parental lines of mapping populations a total 179 SSR markers showed polymorphism at least within one crossing combination (Supplementary Table 3). The number of polymorphic markers for parental combinations ranged from 39 (PA 163A × AK 261322R) to 89 (Type 7 × ICP 8863).

The Genetic Diversity and Population Structure of Cultivated Pigeonpea
Keeping in view the high quality marker profiling patterns and PIC values, a subset of 60 markers was selected from the 217 polymorphic markers for analyzing 94 genotypes. As a result, a total of 233 alleles were obtained across the genotypes. The average values for PIC and number of alleles per marker were 0.50 and 3.88, respectively. A representative image illustrating the SSR fingerprints of the 94 genotypes using the marker CcGM22990 has been shown in Figure 1. Though only four landraces were used in the current analysis, we analyzed 15 plants of these landraces and two varieties (IPA 203 and Narendra Arhar 1) using CcGM18684 to evaluate the within-genotype variability. No within genotype variability and heterozygosity was observed in the varieties. By contrast, the three landraces showed two alleles each and variable level of heterozygosity [0.06 (Banda Palera and Allahabad Local) and 0.2 (JBP-13)] (Supplementary Figure 1).
To examine the genetic variations identified through 60 SSR markers across 94 genotypes, model-as well as distancebased approaches were used. In model based clustering, five independent runs were performed using STRUCTURE programme with K values ranging from 1 to 10. The maximum delta K (ad-hoc quantity) was reached at K = 2 suggesting presence of two subpopulations in the collection (Figure 2). Of the total 94 genotypes, 28 belonged to subpopulation 1, while remaining 66 corresponded to subpopulation 2 in STRUCTURE analysis. With a few exceptions, the subpopulation 2 contained short-and medium-duration pigeonpea, whereas the genotypes with longer maturity duration were in subpopulation 1. The mean Nei's gene diversity was found to be 0.50 and 0.56 for subpopulation 1 and 2, respectively. Similarly, mean values for Shannon information index for subpopulation 1 and 2 were 0.91 and 1.04, respectively. Overall, the average values of Nei's gene diversity and Shannon information index were 0.58 and 1.089, respectively.
A neighbor-joining tree was constructed based on genetic distance matrix, which suggested existence of four clusters (Figure 3). The factorial analysis was also undertaken to offer an overall representation of the diversity in the studied panel. Interestingly, a close agreement was observed between the results arising from STRUCTURE and factorial analysis. The subpopulation 2 of STRUCTURE was contained primarily in quadrants II and III of factorial analysis (Figure 4). On the other hand, quadrant I contained genotypes that were assigned to subpopulation 1 in STRUCTURE analysis. Similarly, two clusters were also obtained in PCoA with the PC1 and PC2 explaining 10.47 and 7.98% of the variance, respectively (Figure 5). Further, the total genetic variation was partitioned by using AMOVA based on PhiPT-values, which accounted majority of the genetic variation to within groups (89%) in FIGURE 2 | (A) Population structure inferred from 60 SSR markers. Each genotype is represented by a vertical bar carrying K colored segments which indicates the estimated membership proportion to the K cluster (K = 2). Serial numbers are assigned according to Supplementary Table 1. (B) The true value of K was obtained following the delta K method of Evanno et al. (2005). comparison to the variation observed between the groups (11%) ( Table 4).

Hybridity Testing of CMS Based Hybrids
CMS technology has emerged as a promising means to deliver noticeable yield gains in pigeonpea. The CMS-based hybrid breeding involves three parental genotypes: male sterile (A) line, its isogenic line (B) line and fertility restorer (R) line. Ten SSR markers were tested in three CMS hybrids (IPAH 16-06, IPAH 16-07, and IPH 15-03) and respective parents i.e., A, B, and R lines. These hybrids are being tested in evaluation trails at different locations in India under all India coordinated research projects on pigeonpea (AICRPP) and consortium research platform on hybrid technology (CRPHT) schemes. The SSR markers that generated monomorphic fragments between A and B lines and polymorphic fragments between the A and R lines were selected for testing the genetic purity of the respective FIGURE 3 | Neighbor joining tree based on GD matrix of 94 pigeonpea genotypes. The two colors correspond to the two subgroups assigned by the STRUCTURE.

Association Analysis
The wilt incidence data and genotyping data of 60 SSRs on 40 pigeonpea genotypes were analyzed to detect SSR markers associated with the FW resistance. Interestingly, only one SSR marker (CcGM03681) could be declared as linked with the FW resistance using both GLM and MLM. In both models, the phenotypic variance (R 2 ) accounted to the SSR CcGM03681 was found to be 16.4%.

Genetic Analysis of a Backcross Population and Mapping of Rf Locus
Marker screening of a total of 824 SSRs (421 CcGMs, 261 HASSRs, 60 CcMs, 12 CZs, 10 CCBs, and 60 ASSRs) between the mapping parents (ICPL 88039A and AK 250189R) provided 115 polymorphic markers. Segregation data were assembled for 75 SSR markers and subjected to goodness-of-fit test. As a result, 55 SSRs showed Mendelian segregation of 1:1, while remaining 20 markers deviated from the ratio (Supplementary Table 4). The phenotypic segregation ratio of 1 (fertile): 1 (sterile) as evident from pollen fertility assay in this population showed possibility of one dominant gene for fertility restoration (Bohra et al., 2017). We could place 35 markers on to the coarse linkage map, and the Rf locus could be assigned at 39 cM with the marker CcGM08896 located on Scaffold130507 in the pigeonpea genome (Supplementary Figure 2).

DISCUSSION
Whole genome sequencing of Asha (ICPL 87119) has offered access to genome wide genetic variants such as SSRs and SNPs for their widespread applications in genomics and breeding. More recently, clustering patterns emerged by applying genome-wide SSR markers in rice showed closer agreement with the pedigree when compared with the SNP markers, thus highlighting the benefits associated with SSR markers (Gonzaga et al., 2015). The relevance of hypervariable SSR markers to plant breeding is well described in various crops including rice (Singh et al., 2010;Narshimulu et al., 2011).
We selected a set of 421 hyper-variable SSR markers for the present work. The validation success rate of 95.24% of the current investigation was comparable to earlier reported by Bohra et al. (2011) using BAC end sequence (BES)-derived SSR markers (96.48%), however the percentage was greater than reported in previous SSR-based studies in pigeonpea (Raju et al., 2010;Saxena et al., 2010a,b;Dutta et al., 2011;Singh et al., 2012). In a similar way, the percent polymorphism (54.11%) reported here was also greater than found earlier in pigeonpea. For instance, the percent polymorphism shown by SSR markers was reported to be 15% (Raju et al., 2010), 28.4% (Bohra et al., 2011), 47.94% (Odeny et al., 2009), and 48.71% . Considering only hypervariable SSRs, the level of DNA polymorphism detected here was higher than found previously in pigeonpea (40.8%: Singh et al., 2012, 41.1%: Dutta et al., 2013. Considering an earlier study (Singh et al., 2012) where complex type SSR markers showed least polymorphism, present study focused majorly on the perfect SSRs (88%). This underscores the significance of the SSR markers experimentally validated in the present investigation. As can be inferred from Table 7, the average PIC value of the current analysis corroborates with previously reported mean PIC values for SSR markers in pigeonpea, however the allele count was somewhat lower. The reason that explains smaller mean value for alleles per marker is possibly that the current study considers only C. cajan genotypes for diversity estimation. Incorporation of genotypes in the panel belonging to different species might lead to an increased number of alleles given by a particular DNA marker (Odeny et al., 2009;Singh D. et al., 2016). However, the present panel offers realistic PIC values in concern with pigeonpea breeding in India.
A lower genetic diversity in the cultivated pigeonpea as illustrated through Nei's gene diversity is not surprising, and  1975-1985 1986-1995 1996-2005 2005-2015 1975-1985 -1986-1995 0.0313 -1996-2005  is congruent with previous SSR-based diversity studies in pigeonpea. In addition to SSR marker, the narrow genetic base of the domesticated pigeonpea was also evident from analyses based on other DNA marker systems such as RAPD (Ratnaparkhe et al., 1995), RFLP (Nadimpalli et al., 1993), AFLP (Panguluri et al., 2006), DArT (Yang et al., 2006), ISR  (Kudapa et al., 2012), and SNP (Kassa et al., 2012). The SSR analysis of the 15 plants from landraces and varieties suggested existence of within-genotype variability in landraces. This in turn highlights the need for pooling DNA from 10 to 15 plants of a single genotype in diversity analysis particularly in case of landraces that harbor greater level of genetic variation/heterogeneity (Brondani et al., 2006). Preference for uniform varieties in place of landraces has played important Limited studies have been performed so far in pigeonpea that examine the population structure of the C. cajan. The existence of two major subpopulations in the sample was strongly supported by a correspondence between results arising from both model and distance based clustering methods. Further, the AMOVA based partitioning of the total variance also found resemblance with the patterns seen earlier in pigeonpea (Kassa et al., 2012) and also in other food legume crops such as lentil (Khazaei et al., 2016).
With a slight deviation, the subgroup I harbored long duration pigeonpea genotypes that are grown exclusively in north eastern plain zone (NEPZ) in India, particularly states like Bihar and Uttar Pradesh. A careful examination of pedigree of the genotypes contained in subgroup I revealed a popular variety Bahar as one of the parents . The pattern of grouping of genotypes, in particular the genotypes with long maturity duration showed similarity with the previous findings (Choudhury et al., 2008;Singh et al., 2013). We also attempted to dissect the genetic diversity among Indian cultivars based on their release year and area of adaptation. Unlike major staple crops like rice (Choudhary et al., 2013) and wheat (Mir et al., 2011), no significant trends in genetic diversity could emerge from the analysis of Indian pigeonpea varieties. However, a near constant (albeit moderate) genetic diversity among Indian varieties over the last 50 years demands excavation and recruitment of diverse alleles from the exotic or wild germplasm pool.
In addition to varietal improvement, harnessing hybrid vigor using CMS technology has emerged as a promising means to overcome the problem of yield stagnation in pigeonpea . Supply of genetically pure seeds remains central to the successful hybrid breeding programme (Saxena et al., 2010c). Molecular markers, particularly SSRs, are immensely useful in order to ensure the genetic purity of the seeds of hybrids and its parents (Saxena et al., 2010c;Bohra et al., 2015). Here we identify sets of robust SSR markers that offer a rapid and cost-effective genomic tools for genetic purity testing of the representative lot. Such DNA markers would serve as a great supplement to the seed certification program and hybrid identification. In parallel, we also attempted to show the utility of these SSR markers in trait mapping using association as well as bi-parental linkage analyses. As a result of association mapping one SSR marker showed significant association with the FW resistance, while bi-parental linkage analysis established association of the Rf locus (restoring A 2 -CMS) with the marker CcGM08896. Earlier, Singh et al. (2013) analyzed 36 pigeonpea genotypes using SSR markers and found MTAs explaining the phenotypic variation in the range of 23-56%. To best of our knowledge, this represents the first study to report molecular mapping of Rf locus responsible for fertility restoration in A 2 -CMS. Earlier, QTLs for A 4 -CMS restoration were reported based on the analysis of three segregating F 2 populations (Bohra et al., 2012). However, the MTAs detected in the present study are of very preliminary type and need extensive investigations including validation in diverse genetic backgrounds and multi-environmental evaluation to reach valid conclusion.
In summary, here we provide a set of 401 validated SSR markers and implicate them in the assessment of genetic variation and population structure of pigeonpea collection, genetic purity testing of CMS hybrids and bi-parental linkage and association analyses. A greater polymorphism percentage coupled with the consistent amplification patterns renders these SSR markers highly suitable for genotyping pigeonpea using simple lab equipment. These are important molecular tools that will certainly help pigeonpea research community for various molecular applications including marker-assisted selection. The clustering patterns resulting from model and distance based approaches will guide pigeonpea breeders for selection of the most diverse parental lines in future breeding programmes. Once validated in diverse genetic backgrounds, the MTAs detected here will pave the way for fine mapping/MAS of the loci controlling FW resistance and A 2 -CMS fertility restoration. Similarly, the genome wide SSR markers would facilitate background selection while practicing marker assisted back crossing. Also, the genetic populations reported here represent valuable genetic resources that will allow trait mapping and subsequent targeted trait improvement in pigeonpea.

AUTHOR CONTRIBUTIONS
AB and RKV conceived the idea and planned experiments; RJ, GP, and AM performed genotyping; AB, GP, PGP, and DS analyzed the data; IPS, FS, and RKM contributed germplasm; AB, RKV, RKS, and NPS participated in writing and finalizing the manuscript. All authors have read the manuscript.