Original Research ARTICLE
Construction of High-Density Genetic Map and Mapping Quantitative Trait Loci for Growth Habit-Related Traits of Peanut (Arachis hypogaea L.)
- 1North China Key Laboratory for Crop Germplasm Resources of Education Ministry, College of Agronomy, Hebei Agricultural University, Baoding, China
- 2Department of Crop, Soil and Environmental Sciences, Auburn University, Auburn, AL, United States
Plant growth habit is an important and complex agronomic trait and is associated with yield, disease resistance, and mechanized harvesting in peanuts. There are at least two distinct growth habits (erect and prostrate) and several intermediate forms existing in the peanut germplasm. A recombinant inbred line population containing 188 individuals was developed from a cross of “Jihua 5” and “M130” for genetically dissecting the architecture of the growth habit. A new high-density genetic linkage map was constructed by using specific locus amplified fragment sequencing technology. The map contains 2,808 single-nucleotide polymorphism markers distributed on 20 linkage groups with a total length of 1,308.20 cM and an average inter-marker distance of 0.47 cM. The quantitative trait locus (QTL) analysis of the growth habit-related traits was conducted based on phenotyping data from seven environments. A total of 39 QTLs for growth habit-related traits was detected on 10 chromosomes explaining 4.55–27.74% of the phenotypic variance, in which 6 QTLs were for lateral branch angle, 8 QTLs were for extent radius, 7 QTLs were for the index of plant type, 11 QTLs were for main stem height, and 7 QTLs were for lateral branch length. Among these QTLs, 12 were co-localized on chromosome B05 spanning an approximately 0.17 Mb physical interval in comparison with the allotetraploid reference genome of “Tifrunner.” Analysis of the co-localized genome region has shown that the putative genes are involved in light and hormones and will facilitate peanut growth habit molecular breeding and study of peanut domestication.
Peanut, or groundnut, is one of the most important oilseed crops and plays an important role in satisfying the demands for oil and protein throughout the world. The architecture of plants in nature is viewed as the vegetative and reproductive branching patterns, the relative arrangement of lateral organs, and differential lengths of the stem (Gallavotti, 2013). Plant architecture is associated with yield, disease resistance, and adaptation to mechanized harvesting. Donald (1968) coined the term “ideotype” which is described as optimum plant architecture. He defined the ideotype as plants that have comprehensive agronomic traits known to influence photosynthesis, growth habit, and grain yield (in cereals). In order to find the ideotype of peanut, many researchers have studied plant architecture including main stem height (MSH), first lateral branch length (LBL), and total branch number (Hake et al., 2017; Li Y. et al., 2017; Lv et al., 2018; Wang et al., 2018). There are at least two distinct growth habits (erect and prostrate) and several intermediate forms existing in the peanut germplasm (Coffelt, 1974). Erect genotypes have compact plant types and the pods are mainly concentrated at the bottom of plant. This type is most suitable for high-density planting. In contrasting, prostrate genotypes have larger spreading areas and the lateral branches of prostrate species creep along the ground, and so is better for pegging into soil. The branching habit trait was classified on a continuous scale from 1 (prostrate) to 6 (erect) according to the ratio of the length of the creeping to the length of the first lateral branch, and the angle between the main stem and the first lateral branch (Jiang et al., 2006). For domesticated peanut, plant types are readily classified into four types: prostrate, spreading, bunch, and erect (Kayam et al., 2017). The index of plant type (IOPT) (the ratio of the longest branch of the first pair lateral branches to the height of the main stem) is also used for quantifying the classification of peanut growth habit into three peanut plant types. The index for creeping plant type is around 2.0 and that of semi-creeping plant type is around 1.5. The erect plant type index is about 1.1–1.2 (Cao and Wang, 2013). Despite so many ways to define the growth habit, it is still difficult to distinguish those categories. Unlike other crops, peanuts bloom above ground and develop fruitage underground, so the growth pattern of lateral branches is an important characterization of peanut growth habit. Not only does it affect the peg penetration into the soil to form pods smoothly, but also closely relate to planting density. Hence, the angle between the erect main stem and the first lateral branch is becoming an important indicator of descriptive characteristics of plant architecture in peanut. In addition, MSH and the first LBL of peanut are important agronomic traits affecting growth habit and the number of the pods (Lan et al., 2011; Li Y. et al., 2017).
Peanut (Arachis hypogaea L., 2n = 4x = 40, genome AABB) probably derived from a hybridization event between two diploid species, Arachis duranensis (AA) and Arachis ipaënsis (BB) (Halward et al., 1992). The total genome size of AABB-type is around 2.7 GB, with an estimated repetitive content of 64% (Bertioli et al., 2016). A legacy of the “domestication bottleneck” is that cultivated peanut contains narrow genetic diversity, in which heritable variation is less than 13% (Varshney et al., 2009b). To date, there have been a large number of genetic maps constructed with various types of molecular markers, including restriction fragment length polymorphism (RFLP), amplified fragment length polymorphism (AFLP), cleaved amplified polymorphism sequences (CAPSs), and simple sequence repeat (SSR) markers. Recently, a large number of AhMITE1 markers were developed from the whole-genome re-sequencing data in peanut, which can be used for trait mapping with a high degree of polymorphism (Gayathri et al., 2018). However, most of these maps are low-density genetic maps because of the low throughput of molecular markers. The low-density genetic maps have limited the efficiency and accuracy of quantitative trait locus (QTL) mapping. Therefore, high-density genetic maps are the foundation for mapping QTL, marker-assisted selection (MAS), and map-based cloning. With the advance of high-throughput sequencing technologies, several effective methods for obtaining thousands of single-nucleotide polymorphism (SNP) or insertion/deletions (InDel) markers at relatively low costs make it possible to construct a high-density genetic map, such as restriction site-associated sequencing (RAD-seq) (Miller et al., 2007), double digest RAD-seq (Peterson et al., 2012), and genotyping by sequencing (GBS) (Poland et al., 2012). Sun et al. (2013) developed a specific length amplified fragment sequencing (SLAF-seq) method to detect large numbers of SNPs. SLAF-seq is based on reduced representation library (RRL) and high-throughput sequencing, which involves fragment length selection but not through random interruption. Therefore, its repeatability and accuracy are better than RAD-seq and GBS (Qi et al., 2014; Xu Y. et al., 2015). By contrast, the method of SLAF-seq not only provides an economical and efficient approach to large-scale genotyping, but can also explore a large number of SNP and InDel markers at the same time. SLAF-seq technology has been used to construct high-density genetic maps for cotton (Keerio et al., 2018), cucumber (Xu X. et al., 2015), soybean (Li B. et al., 2017), and peanut (Hu et al., 2018). Moreover, many peanut SNP genetic linkage maps using different sequencing methods have been constructed. Zhou et al. (2014) and Han et al. (2018) developed genetic maps using ddRAD-seq and GBS, respectively. Wang et al. (2018) and Hu et al. (2018) detected a large number of SNPs through SLAF-seq and constructed a high-resolution map, respectively.
Peanuts are a dicotyledonous plant, and the molecular genetic basis of growth habit has not been clearly elucidated. In the early years, researchers analyzed the genetic regularity of peanut growth habit using Mendelian Law based on the different mode of inheritance between the runner and bunch types. Patel et al. (1936), Silvestre (1961), and Gan et al. (1984) indicated that growth habit was controlled by a single gene, but Hayes (1933) stated that two genes controlled the trait based on the separation ratio of 15:1 in the F2 progeny. Higgins (1938) demonstrated that the growth habit of peanut was a complicated trait with multiple genes involved. According to hybridization and reciprocal crosses, it was showed that nuclear and cytoplasmic interaction controls growth habit in peanuts (Ashri, 1964, 1968). On the contrary, Kayam et al. (2017) revealed that the branching habit was controlled by a single gene without cytoplasmic effect by reciprocal cross experiments. These different results are due to from different classification for plant type evaluation. In previous studies, many researchers have also mapped the growth habit-related traits in peanuts. Fonceka et al. (2012) detected 14 QTLs for plant type using chromosome segment substitution lines (CSSLs) derived from a cross between the wild synthetic allotetraploid (A. ipaënsis × A. duranensis)4× and the cultivated “Fleur11.” Kayam et al. (2017) used an F2:3 population derived from Virginia-type peanut cultivars with spreading and bunch growth habit and found a single QTL on chromosome B05. Shirasawa et al. (2012) reported three QTLs for MSH with 4.80–19.20% phenotypic variation explained (PVE) and two QTLs for LBL with 14.2–21.1% PVE using F2 populations. Huang et al. (2015) detected three QTLs for MSH on A3, B4, and B7 in an F2:3 population. Li Y. et al. (2017) detected 11 QTLs for MSH and 16 QTLs for LBL and the QTL, ARS376-SEQ4G02, on linkage group 14 (LG14) influenced both traits. Lv et al. (2018) developed two different genetic background recombinant inbred line (RIL) populations to detect possible consistent QTLs for plant height and eventually found the QTLs on A09, B03, and B04.
With respect to domestication, the biggest change from wild ancestors to cultivated peanut was the transition of growth habit from prostrate growth to the erect (Muñoz et al., 2017). “Jihua 5” is an erect growth habit, while the “M130” is a spreading-type. A RIL population containing 188 individuals was developed from a cross of “Jihua 5” and “M130.” Therefore, the main objectives of this study were to develop a high-density genetic linkage map using SLAF-seq technology and identify QTLs controlling the growth habit in peanuts.
Materials and Methods
A cross between “Jihua 5” and “M130” was used to develop an F8 population of 321 RILs using single seed decent (SSD) method. “Jihua 5” is an erect growth habit peanut genotype with a lateral branch angle (LBA) around 40°. “M130,” a spreading-type, has a LBA of around 90° (Figure 1). The 188 RILs were randomly selected from 321 individuals and their parents were planted in seven different environments. They were grown in Hai-Nan (HN) experimental station (N18°59′ and E109°11′) and Bao-Ding (BD) experimental station (N38°40′ and E115°30′) in 2016, which were referred to as 16HN and 16BD, respectively. In 2017, the experiments were conducted in Han-Dan (HD) station (N35°57′ and E115°09′) and BD, which were referred to as 17HD and 17BD, respectively. During 2018 growing seasons, the parents and population were planted in Tang-Shan (TS) station (N39°99′ and E118°70′), HD, and BD, which were referred to as 18TS, 18HD, and 18BD, respectively. Each plot contained one row with 1.5 m length, and the distance between each plant was around 17 cm, with a row spacing of 50 cm. Approximately 10 plants were planted in each row. Field management was performed under normal conditions. A randomized complete block design with two replicates was adopted in these four locations.
Figure 1. The different phenotypes in the parents. (A) The female parent “Jihua 5” with scale bar of 10 cm. (B) The male parent “M130” with scale bar of 10 cm.
Trait Measurements and Data Analysis
Three typical plants from each plot were harvested at the mature stage for LBA, MSH, LBL, extent radius (ER), and IOPT measurements. The LBA is defined as the angle between the main stem and the first lateral branch (Supplementary Figure S1). The MSH is the length of internode from the meristematic place of the first pair of the lateral branch on the main stem to parietal lobe (Supplementary Figure S1). The LBL is the length from the junction with the main stem to parietal lobe of the longest first lateral branch (Supplementary Figure S1). The ER is the longest distance between the main stem and the first lateral branch (Supplementary Figure S1). The IOPT is the ratio of the longest branch of the first pair lateral branches to MSH. The LBA was measured using an electronic protractor. The other traits were measured using a measuring tape.
The broad-sense heritability for growth habit-related traits in seven environments was calculated using the equation h2 = σg2/(σg2 + σge2/n + σe2/nr), where σg2, σge2, and σe2 represent the genetic variance component, genotype–environment interaction variance component, and the random error variance component, respectively. The n and r were defined as the number of environments and replications, respectively. The SPSS 25.0 was used for the analysis of each variance component, variation coefficient, and descriptive statistics (IBM Corp, 2017).
DNA Extraction and SLAF Library Construction
Genomic DNA was extracted from young leaf tissues of each sample using TaKaRa MiniBEST Plant Genomic DNA Extraction kit (TaKaRa, Dalian) and SLAF-seq was used to genotype a total of 188 individuals and the two parents, as previously described (Sun et al., 2013), with a few modifications. An improved SLAF-seq strategy was utilized in the experiment. Firstly, the reference genome of “Tifrunner” was used to design marker discovery experiments by simulating in silico the number of markers produced by different enzymes (Bertioli et al., 2019). Next, a SLAF pilot experiment was performed, and the SLAF library was conducted in accordance with the predesigned scheme. For the RIL population, genomic DNA was digested by two enzymes [Hpy166II and EcoRV-HF®, New England Biolabs (NEB), United States]. A single nucleotide (A) overhang was added subsequently to the digested fragments using Klenow fragment (3′→5′ exo−) (NEB) and dATP at 37°C. The duplex tag-labeled sequencing adapters (PAGE-purified, Life Technologies, United States) were then ligated to the A-tailed fragments using T4 DNA ligase. Polymerase chain reaction (PCR) was performed using the diluted restriction–ligation DNA samples, dNTP, Q5® high-fidelity DNA polymerase, and PCR primers (forward primer: 5′-AATGATACGGCGACCACCGA-3′, reverse primer: 5′-CAAGCAGAAGACGGCATACG-3′) (PAGE-purified, Life Technologies, United States). The PCR products were then purified using Agencourt AMPure XP beads (Beckman Coulter, High Wycombe, United Kingdom) and pooled. The pooled samples were separated by 2% agarose gel electrophoresis. Fragments ranging from 364 to 414 bp (with indexes and adaptors) in size were excised and purified using a QIA Quick Gel Extraction Kit (Qiagen, Hilden, Germany). Gel-purified products were then diluted. Paired-end sequencing with reads length 125 bp was performed using an Illumina HiSeq 2500 system (Illumina, Inc., San Diego, CA, United States) according to the manufacturer’s recommendations.
Sequence Data Grouping and Genotyping
Specific length amplified fragment marker identification and SNP genotyping were performed using procedures described by Sun et al. (2013). Low-quality reads (quality score < 20e) were removed and then raw reads were sorted to188 individuals according to duplex barcode sequences. After the barcodes and the terminal 5-bp positions were trimmed from each high-quality read, clean reads from the same sample were mapped onto the “Tifrunner” genome sequence using SOAP software (Li et al., 2008). The reads mapping on the same position were defined as one SLAF locus (Zhang et al., 2015). SNP loci of each SLAF locus were then detected between parents using the software GATK (DePristo et al., 2011), and SLAFs with more than three SNPs were filtered out first. Alleles of each SLAF locus were then defined according to parental reads with sequence depth >40-fold, while for each offspring the reads with sequence depth >12.37-fold were used to define alleles. The SLAF loci with more than four alleles were defined as repetitive SLAFs and subsequently discarded. Only SLAFs with two to four alleles were identified as polymorphic and considered potential markers. All polymorphism SLAFs loci were genotyped with consistency in the parental and offspring SNP loci. Polymorphic markers were classified into eight segregation patterns (ab × cd, ef × eg, hk × hk, lm × ll, nn × np, aa × bb, ab × cc, and cc × ab). Only the aa × bb pattern markers were used for construction of a linkage map.
Linkage Map Construction
In order to reduce the genotyping errors and deletions due to the next-generation sequencing, high map strategy was used to construct a high-density linkage map for the RIL population (Liu et al., 2014). All high quality of SLAF markers were distributed into 20 LGs based on their locations on chromosomes. A detailed MST map algorithm was used to order SLAF markers (Wu et al., 2008) and the SMOOTH algorithm (van Os et al., 2005) was used to correct genotyping errors following marker ordering. The four procedures were followed to construct all LGs: (1) the SLAF markers were partitioned into LGs using a single-linkage clustering algorithm based on a pairwise modified independent LOD score for the recombination frequency; (2) using a combination of Gibbs sampling, spatial sampling, and the simulated annealing algorithm, the ordering module sequenced SLAF markers, and estimated linkage distances; (3) the error correction strategy of SMOOTH was then conducted according to parental contribution of genotypes (van Os et al., 2005), and a k-nearest neighbor algorithm was applied to impute missing genotypes (Huang et al., 2012). Skewed markers were then added into this map by applying a multi-point method of maximum likelihood; and (4) map distances were estimated using the Kosambi mapping function (Kosambi, 1944).
QTL Analysis and Candidate Genes Prediction for Hot Spot Regions
Quantitative trait locus IciMapping version 4.0 (Wang et al., 2014) was performed to identify the QTLs of each trait in different environments by using inclusive composite interval mapping of additive (ICIM-ADD). To claim significant QTLs, the LOD score was set at 2.5 to detect significant QTLs. QTLs were named with initial letter “q” followed by the trait name and LG. A numeric was added if two or more QTLs were identified in the same LG. For example, two QTLs for LBA were detected on LGB04, and then they were named qLBAB04.1 and qLBAB04.2. The QTLs with more than 10% PVE were considered as major QTLs, while other QTLs were considered as minor QTLs (Luo et al., 2017). The MapChart 2.3 software (Voorrips, 2002) was used to graph the genetic linkage map and the QTLs. QTLs of the growth habit-related traits co-localized in the same genomic region were defined as the hot spot region. The genes in the hot spot regions were predicted as candidates based on the “Tifrunner” genome sequence (Bertioli et al., 2019). The functional annotations from the Gene Ontology (GO), the Kyoto Encyclopedia of Genes and Genomes (KEGG), and the Non-Redundant Protein Database (NR) at NCBI were extracted to categorize the candidate genes.
Phenotypic Variation of Growth Habit-Related Traits in RILs and Their Parents
The parents “Jihua 5” and “M130” showed distinct differences in LBA, MSH, LBL, ER, and IOPT in all environments (Table 1 and Supplementary Figure S2). Large variations and transgressive segregations were observed in RILs in the seven environments for all traits except IOPT in 16BD, 18BD, and 18TS (Table 1). Continuous distributions were observed in Figure 2 and the Shapiro–Wilk (w) test indicated that the phenotypic data of MSH and LBL were normally distributed except in 16HN, 17BD, and 18HD. The data of ER were normally distributed in 18BD, but not for other locations. The genotypic coefficient of variation (GCV) and broad-sense heritability of measured traits ranged from 6.60 to 22.32% and 86 to 92%, respectively (Table 2). The ANOVA results indicated significant differences in genotype, environment, and genotype × environment for all traits except genotype × environment for LBA (Table 2). The LBA had a significant positive relationship with ER (r = 0.843) and IOPT (r = 0.689), while LBA had a significant negative association with MSH (r = −0.384) (Table 3). MSH and ER were also negatively correlated, with a correlation coefficient of −0.283 in the environment of 17HD, whereas MSH had a significant positive correlation with LBL (r = 0.681) (Table 3).
Figure 2. Phenotypic distributions of growth habit-related traits in the RIL population under seven environments.
High-Throughput SLAF Sequencing and Genotyping
A total of 472.94 GB of raw data containing 1,576.48 M reads were obtained for the SLAF library. The guanine–cytosine (GC) content was 39.13% in average and the Q30 ratio (quality score of at least 30, indicating a 1% chance of error) was 94.63%. For the paternal inbred line (“M130”), 1,021,241 SLAFs were generated from 50,864,965 reads with an average coverage of 48.83-fold for each SLAF. For the maternal line (“Jihua5”), 1,176,582 SLAFs were produced from 53,171,750 reads and the average coverage of each SLAF was 41.09-fold. For the analysis of the RIL mapping population, 604,234 SLAFs were screened from the 7,832,134 reads in each genotype with an average coverage of 12.37-fold (Table 4).
After filtering or discarding heterozygous SNPs, the same homozygous SNPs presented in both parents were removed. A total of 16,112 SNPs were identified and were successfully encoded and grouped into eight segregation patterns: ab × cd, ef × eg, ab × cc, cc × ab, hk × hk, lm × ll, nn × np, and aa × bb according to the genotype encoding rule. Only aa × bb segregation pattern of 6,978 SNPs was used for construction of the linkage map (Figure 3). Raw data have been submitted to the NCBI SRA database and the SRA accession number is PRJNA5177391.
Figure 3. Numbers of SLAFs markers in each segregation types. aa × bb indicated the parents were homozygous; ef × eg, hk × hk, and ab × cd indicated the parents were heterozygosis; lm × ll, nn × np, ab × cc, and cc × ab indicated one of the parents was homozygous and the other was heterozygosis.
High-Density Genetic Map Construction
All 2,808 high-quality markers were successfully assigned onto 20 LGs compared with the reference genome (Figure 4). The final molecular linkage map was 1,308.20 cM in total length with an average inter-marker distance of 0.47 cM (Table 5). B08 was the largest LG containing 300 markers, 122.60 cM in length, and an average distance of 0.41 cM. B07 was the smallest LG, which only had 39 markers, with the length of 44.07 cM and an average distance of 1.16 cM. The largest gap of linkage between markers was 17.1 cM located in B06. The gap with <5 cM was counted for 98.50% on the linkage map (Table 5).
Figure 4. High density genetic map with the 2,808 SNP markers distributed on 20 LGs of peanut. The x-axis represents the number of LGs and the y-axis represents the marker’s location in genetic map.
Visualization and Evaluation of the Genetic Map
The haplotype and heat maps were used to assess the quality of the peanut genetic map. Haplotype map reflected the ratio of double crossover for the population and implied the genotyping errors or hot spots region with genomic recombination. The higher the ratio of double crossovers, the more problems exist in the genotyping and marker ordering. The haplotype maps were generated for each individual of the 188 RILs as well as the parental lines using 2,808 markers. As shown in Supplementary Figure S3, there were a few double crossovers or deletions in the 20 LGs indicating that the genetic map was of high quality. The heat maps were used to analyze and evaluate the frequency of recombination between markers from each LG. The heat maps also showed that most LGs were of high quality (Supplementary Figure S4). The collinearity analysis was performed between the genetic map and the peanut reference genome (Tifrunner), and all of the SLAF markers were anchored to the peanut reference genome. The results from the analysis showed relatively high collinearity between LGs and corresponding chromosomes (Supplementary Figure S5).
QTL Mapping for Growth Habit-Related Traits
A total of 39 QTLs associated with growth habit-related traits were detected using ICIM-ADD based on IciMapping (Supplementary Table S1). For LBA, a total of six QTLs were identified and explained 5.02–21.87% of the phenotypic variance. qLBAB05.1 has the largest additive effect QTL (a = −5.80 ) and contributes 21.87% to PVE in 17HD. Another major QTL was qLBAB05.2 located on B05 and detected in two environments. The minor QTLs (qLBA04.1, qLBAB04.2, qLBAA04, and qLBAA05) were mapped on A04, A05, and B04 and can explain 5.02–7.85% of the phenotypic variance, respectively (Supplementary Table S1). For ER and IOPT, eight and seven QTLs were detected, respectively. Among them, the two most significant QTLs (qERB05.1 and qIOPTB05.1) were mapped on B05. One minor QTL (qERA05) related to ER trait on A05 was identified in three environments (16HN, 17BD, and 17HD) (Supplementary Table S1). The QTLs qIOPTB05.1 and qIOPTB05.3 located on B05 were the most significant for IOPT with about 14% PVE, while one minor QTL (qIOPTA04.1) associated with IOPT on A04 was detected in two environments (16HN and 18HD) (Supplementary Table S1). A total of 11 QTLs for MSH were detected and distributed on seven chromosomes, with the PVE of 5.67–17.68% and the additive effects of −2.24 to 3.11. Among these QTLs, one QTL (qMSHB09) on B09 was consistently detected in six out of a total of seven environments with the PVE of 5.67–13.73%. The QTL (qMSHB05.2) located on B05 in a single environment (17HD) can explain 17.68% of the phenotypic variance (Supplementary Table S1). For LBL, seven QTLs (qLBLA08, qLBLB05.1, qLBLB05.2, qLBLB05.3, qLBLA05, qLBLB09.1, and qLBLB09.2) were detected in various environments (Supplementary Table S1). It is interesting that some major QTLs were common loci for the different traits. For example, the QTLs qLBAB05.1, qMSHB05.2, qLBLB05.2, qLBLB05.3, qERB05.3, and qIOPTB05.1 were co-localized on B05 on the region of 60.50–64.50 cM underlying LBA, MSH, LBL, ER, and IOPT (Figure 5). Similarly, the QTLs found on A04, A05, A08, and B09 can control two or more traits (Figure 5).
Candidate Genes Identification for Hot Region on B05
Twelve important QTLs associated with peanut growth habit-related traits on B05 were verified and mapped the flanking markers on the 168,048 bp physical region from 159,819,755 to 159,987,803 bp of cultivated peanut physical genome (Supplementary Dataset S1), in which 19 putative genes were annotated. By GO annotation, there were 12 genes assigned at least one GO term. The 12 genes were divided into three GO categories, namely “cellular component” (15), “molecular function” (2), and “biological process” (25). These genes were involved in the biological processes: cytokinin-activated signaling pathway (GO:0009736); positive regulation of long-day photoperiodism (GO:0048578); responding to blue light (GO:0009637); responding to far red light (GO:0010218); responding to light intensity (GO:0009642); and temperature compensation of the circadian clock. These genes have highly promising functions of regulating the growth habit (Supplementary Dataset S2). Through KEGG analyses, SZWP5S.1 and SZWP5S.2 encoding GIGANTEA-like protein were assigned to circadian rhythm-plant pathways. Using the NR database, the number of 6, 12, and 1 homologous genes were identified from Glycine soja, Glycine max, and Cicer arietinum, respectively.
A high-density genetic linkage map plays a crucial role in many fields of fundamental and applied research, such as map-based cloning of genes, whole-genome sequencing for an assembly sequence, marker-assisted breeding, and QTL analysis. The first peanut SSR-based genetic map was developed with 135 loci onto 22 LGs using a RIL population (Varshney et al., 2009a). Subsequently, more SSR markers have been developed, and more SSR-based maps have been constructed (Hong et al., 2008, 2010; Huang et al., 2015). Although such SSR markers are available to construct the genetic map, the density of the map was restricted. The development of high-throughput sequencing technologies and low price are the prerequisites for detecting abundant SNPs markers throughout the genome. SLAF-seq, a relatively efficient experimental method, is different from others in that sequence-specific restriction fragment length was sequenced with better repeatability and accuracy, rather than sequencing random restriction fragments, like RAD-seq. Since the reference genome sequences of two diploid ancestors, A. duranensis (AA, 2n = 2x = 20) and A. ipaensis (BB, 2n = 2x = 20), were published, some researchers used the peanut diploid reference genome to genotype the SNP markers and to perform the collinearity analyses by comparing the positions of markers between the genetic and physical maps (Gupta et al., 2015; Agarwal et al., 2018; Hu et al., 2018; Wang et al., 2018). They also utilized the diploid reference to detect candidate genes in the QTL regions. Although the reference genome sequences of A. duranensis and A. ipaensis provided important genomic information of peanut, all the cultivated peanut cultivars are allotetraploids (Bertioli et al., 2016). Hence, the genome sequence of allotetraploids peanut ensures the accuracy of genetic map construction and candidate genes identification. The reference genome of allotetraploids peanut “Tifrunner” (Bertioli et al., 2019) was used in this study and constructed the linkage map that contains 2,808 SNP markers with the average distance of inter-marker of 0.47 cM. To our knowledge, the high density of markers in this map is sufficient to detect QTLs.
A total of 39 QTLs for LBA, MSH, LBL, ER, and IOPT distributed on 10 chromosomes were identified across the seven environments by IciMapping 4.0 software, with LOD scores ranging from 2.57 to 13.94, and 4.55 to 27.74% of the PVE (Figure 5 and Supplementary Table S1). Growth habit is a complex and comprehensive trait highly influenced by the domestication process. In the study, rather than using naked eyes, the angle between the main stem and the first lateral branch, the ER, and the IOPT was measured to represent the classification of the plant type. Interestingly, some QTLs (qLBAB05.1, qLBAB05.2, qERB05.1, qERB05.2, qERB05.3, qIOPTB05.1, and qIOPTB05.2) were mapped in the common genome region on B05 for all of these three traits across different environments, spanning approximately 0.17 Mb physical interval between 1,59,819,755 and 1,59,987,803. This physical position is different from the previous report on B05 (1,45,553,897–1,46,649,943) (Kayam et al., 2017). MSH is an important agronomic trait affecting photosynthesis efficiency, lodging, and yield. The first LBL is also an important trait for peanut breeding because it can affect planting density and yield. Both of these are quantitative traits. Previous studies showed that MSH and LBL were positively correlated. However, no consistent QTLs from different populations were detected for MSH or LBL because these QTLs could be specific for the populations (Shirasawa et al., 2012; Huang et al., 2015; Li Y. et al., 2017). In this study, the data showed a strong significant positive correction (r = 0.681) between MSH and LBL (Table 3). Eleven novel QTLs on A04, A06, A09, B05, B06, B08, and B09 with 5.67–17.68% PVE for MSH and 6 QTLs on A05, A08, B05, and B09 with 5.90–10.05% PVE for LBL were mapped. It was noteworthy that qMSHB05.2, qLBLB05.1, and qLBLB05.2 were co-localized on B05. The region from 57.50 to 64.50 cM on B05 was the hot QTL spot for all growth habit-related traits (LBA, MSH, LBL, ER, and IOPT), suggesting that multiple genes or genes with pleiotropic effects control these traits. This also can be verified by the significant correlation between these traits. For these overlapping QTLs, a positive additive effect indicated the allele derived from the female parent “Jihua 5” improving the value of MSH, while a negative additive effect indicated that the allele for increasing the value of LBA, ER, IOPT, and LBL came from the male parent “M130.”
The evolved architectural traits during domestication include the extent of the vegetative shoot and inflorescence branching, branch angle, as well as internode elongation (Teichmann and Muhr, 2015). In this study, a total of 19 candidate genes for growth habit were identified on the co-localized genome region on B05, which spanned 0.17 Mb physical intervals and 19 genes were annotated by at least one category of either GO, KEGG, or NR databases. GO annotations indicated that SZWP5S.1 and SZWP5S.2 play a role in light signal functions. Meanwhile, through KEGG analysis, SZWP5S.1 and SZWP5S.2 are predicted to have a role in the circadian rhythm-plant pathway. H1TMWI.1 and H1TMWI.2 have a role in a cytokinin-activated signaling pathway responding to light intensity. One strategy for improving yield is to increase planting density; however, a key factor that limits planting density in the practical application of agriculture is the plant’s shade-avoidance response. The shade-avoidance syndrome consists of increased plant height and reduced branching and early flowering that are triggered by changes in the quality of light (Casal, 2012; Xie et al., 2017). Elongation of the stem is increased under shade conditions because of the increased free indole acetic acid (IAA) due to the competition for the light with neighboring plants (Gallavotti, 2013). Annotations in the NR database implied that six genes (FXXJ78.1, IS4878.1, IS4878.3, FXXJ78.2, H1TMWI.1, and IS4878.2) were related to tryptophan aminotransferase-related protein 4 (G. soja). Tryptophan aminotransferase which is encoded by the TAA gene converts tryptophan into indole pyruvic acid (IPA) and subsequently converts IPA into IAA by monooxygenase enzymes which are encoded by the YUCCA gene family (Stepanova et al., 2008; Tao et al., 2008; Mashiguchi et al., 2011; Won et al., 2011). IAA is a common auxin which can affect plant growth and development. These genes, which are involved in light and hormones, need to be verified. The annotated genes could contribute to a better understanding of the genetic mechanism behind the growth habit-related traits. Although the plant type is visible, it is time-consuming for breeders to assess the performances of genotypes in changeable environments. Therefore, it is necessary to screen the markers linked with the growth habit-related traits to improve breeding efficiency. These presumptive genes could provide a potential for MAS in peanut breeding for growth habit-related traits.
A high-density genetic map of cultivated peanuts was constructed using SLAF-seq techniques. It consists of 2,808 markers on the 20 LGs with a total length of 1,308.20 cM and an average inter-marker distance of 0.47 cM. A total of 39 QTLs associated with growth habit-related traits were identified using a RIL population under seven environments in four locations. Furthermore, multiple QTLs were co-localized on the chromosome B05 under at least two environments. The identified genomic regions and putative genes that involved light and hormones will support the further development of new strategies at the molecular genetic level to promote the application of MAS in peanut breeding for improved growth habit.
The datasets generated for this study can be found in the NCBI SRA database and the SRA accession number is PRJNA517739 (https://www.ncbi.nlm.nih.gov/sra/PRJNA517739).
LFL and CC designed the experiments. XY, SC, and MJH developed the mapping population. LL, XM, and MYH performed the growth habit-related traits measurements of the plant materials. LL and XLY constructed the genetic linkage map and performed QTL analysis. LL analyzed the result and wrote the manuscript. LFL, CC, GM, and HZ revised the manuscript. All authors contributed to the manuscript and approved the final version of the manuscript to be published.
This study was supported by the National Natural Science Foundation of China (31701459 and 31771833); the Natural Science Foundation of Hebei Province (C2017204101); the China Agriculture Research System (CARS-13) and the Earmarked Fund for Hebei Oil Innovation Team of Modern Agro-industry Technology Research System (HBCT2018090202); the Science and Technology Supporting Plan Project of Hebei Province (16226301D); and the fund project for training of graduate student innovation ability of Hebei Province (CXZZBS2018115).
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
We thank Dr. David Weaver, Professor Emeritus of Department of Crop, Soil and Environmental Sciences, Auburn University for critical reviewing the manuscript.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2019.00745/full#supplementary-material
Agarwal, G., Clevenger, J., Pandey, M. K., Wang, H., Shasidhar, Y., and Chu, Y. (2018). High density genetic map using whole genome resequencing for fine mapping and candidate gene discovery for disease resistance in peanut. Plant Biotechnol. J. 6, 1954–1967. doi: 10.1111/pbi.12930
Bertioli, D. J., Cannon, S. B., Froenicke, L., Huang, G., Farmer, A. D., Cannon, E. K., et al. (2016). The genome sequences of Arachis duranensis and Arachis ipaensis, the diploid ancestors of cultivated peanut. Nat. Genet. 48, 438–446. doi: 10.1038/ng.3517
Bertioli, D. J., Jenkins, J., Clevenger, J., Dudchenko, O., Gao, D., Seijo, G., et al. (2019). The genome sequence of segmental allotetraploid peanut Arachis hypogaea. Nat. Genet. 51, 877–884. doi: 10.1038/s41588-019-0405-z
DePristo, M. A., Banks, E., Poplin, R., Garimella, K. V., Maguire, J. R., Hartl, C., et al. (2011). A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498. doi: 10.1038/ng.806
Fonceka, D., Tossim, H. A., Rivallan, R., Vignes, H., Lacut, E., and De Bellis, F. (2012). Construction of chromosome segment substitution lines in peanut (Arachis hypogaea L.) using a wild synthetic and QTL mapping for plant morphology. PLoS One 7:e48642. doi: 10.1371/journal.pone.0048642
Gayathri, M., Shirasawa, K., Varshney, R. K., Pandey, M. K., and Bhat, R. S. (2018). Development of AhMITE1 markers through genome-wide analysis in peanut (Arachis hypogaea L.). BMC Res. Notes 11:10. doi: 10.1186/s13104-017-3121-8
Gupta, S. K., Baek, J., Carrasquilla-Garcia, N., and Penmetsa, R. V. (2015). Genome-wide polymorphism detection in peanut using next-generation restriction-site-associated DNA (RAD) sequencing. Mol. Breed. 35:145. doi: 10.1007/s11032-015-0343-0
Hake, A. A., Shirasawa, K., Yadawad, A., Sukruth, M., Patil, M., and Nayak, S. N. (2017). Mapping of important taxonomic and productivity traits using genic and non-genic transposable element markers in peanut (Arachis hypogaea L.). PLoS one 12:e0186113. doi: 10.1371/journal.pone.0186113
Halward, T., Stalker, T., LaRue, E., and Kochert, G. (1992). Use of single-primer DNA amplifications in genetic studies of peanut (Arachis hypogaea L.). Plant Mol. Biol. 18, 315–325. doi: 10.1007/BF00034958
Han, S., Yuan, M., Clevenger, J. P., Li, C., Hagan, A., Zhang, X., et al. (2018). A SNP-based linkage map revealed QTLs for resistance to early and late leaf spot diseases in peanut (Arachis hypogaea L.). Front. Plant Sci. 9:1012. doi: 10.3389/fpls.2018.01012
Hong, Y., Chen, X., Liang, X., Liu, H., Zhou, G., Li, S., et al. (2010). A SSR-based composite genetic linkage map for the cultivated peanut (Arachis hypogaea L.) genome. BMC Plant Biol. 10:17. doi: 10.1186/1471-2229-10-17
Hong, Y. B., Liang, X. Q., Chen, X. P., Liu, H. Y., Zhou, G. Y., Li, S. X., et al. (2008). Construction of genetic linkage map based on SSR markers in peanut (Arachis hypogaea L.). Agric. Sci. China 7, 915–921. doi: 10.1016/S1671-2927(08)60130-3
Hu, X. H., Zhang, S. Z., Miao, H. R., Cui, F. G., Shen, Y., Yang, W. Q., et al. (2018). High-density genetic map construction and identification of QTLs controlling oleic and linoleic acid in peanut using SLAF-seq and SSRs. Sci. Rep. 8:5479. doi: 10.1038/s41598-018-23873-7
Huang, L., He, H., Chen, W., Ren, X., Chen, Y., Zhou, X., et al. (2015). Quantitative trait locus analysis of agronomic and quality-related traits in cultivated peanut (Arachis hypogaea L.). Theor. Appl. Genet. 128, 1103–1115. doi: 10.1007/s00122-015-2493-1
Huang, X., Zhao, Y., Li, C., Wang, A., Zhao, Q., Li, W., et al. (2012). Genome-wide association study of flowering time and grain yield traits in a worldwide collection of rice germplasm. Nat. Genet. 44, 32–39. doi: 10.1038/ng.1018
Kayam, G., Brand, Y., Faigenboim-Doron, A., Patil, A., Hedvat, I., Hovav, R., et al. (2017). Fine-mapping the branching habit trait in cultivated peanut by combining bulked segregant analysis and high-throughput sequencing. Front. Plant Sci. 8:467. doi: 10.3389/fpls.2017.00467
Keerio, A. A., Shen, C., Nie, Y., Ahmed, M. M., Zhang, X., and Lin, Z. (2018). QTL mapping for fiber quality and yield traits based on introgression lines derived from Gossypium hirsutum × G. tomentosum. Int. J. Mol.Sci. 19:243. doi: 10.3390/ijms19010243
Li, B., Fan, S., Yu, F., Chen, Y., Zhang, S., Han, F., et al. (2017). High-resolution mapping of QTL for fatty acid composition in soybean using specific-locus amplified fragment sequencing. Theor. Appl. Genet. 130, 1467–1479. doi: 10.1007/s00122-017-2902-8
Li, Y., Li, L., Zhang, X., Zhang, K., Ma, D., Liu, J., et al. (2017). QTL mapping and marker analysis of main stem height and the first lateral branch length in peanut (Arachis hypogaea L.). Euphytica 213:57. doi: 10.1007/s10681-017-1847-y
Liu, D., Ma, C., Hong, W., Huang, L., Liu, M., Liu, H., et al. (2014). Construction and analysis of high-density linkage map using high-throughput sequencing data. PLoS One 9:e98855. doi: 10.1371/journal.pone.0098855
Luo, H., Xu, Z., Li, Z., Li, X., Lv, J., Ren, X., et al. (2017). Development of SSR markers and identification of major quantitative trait loci controlling shelling percentage in cultivated peanut (Arachis hypogaea L.). Theor. Appl. Genet. 130, 1635–1648. doi: 10.1007/s00122-017-2915-3
Lv, J., Liu, N., Guo, J., Xu, Z., Li, X., Li, Z., et al. (2018). Stable QTLs for plant height on chromosome A09 identified from two mapping populations in peanut (Arachis hypogaea L.). Front. Plant Sci. 9:684. doi: 10.3389/fpls.2018.00684
Mashiguchi, K., Tanaka, K., Sakai, T., Sugawara, S., Kawaide, H., Natsume, M., et al. (2011). The main auxin biosynthesis pathway in Arabidopsis. Proc. Natl. Acad. Sci. U.S.A. 108, 18512–18517. doi: 10.1073/pnas.1108434108
Miller, M. R., Dunham, J. P., Amores, A., Cresko, W. A., and Johnson, E. A. (2007). Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers. Genome Res. 17, 240–248. doi: 10.1101/gr.5681207
Peterson, B. K., Weber, J. N., Kay, E. H., Fisher, H. S., and Hoekstra, H. E. (2012). Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species. PLoS One 7:e37135. doi: 10.1371/journal.pone.0037135
Poland, J. A., Brown, P. J., Sorrells, M. E., and Jannink, J. L. (2012). Development of high-density genetic maps for barley and wheat using a novel two-enzyme genotyping-by-sequencing approach. PLoS One 7:e32253. doi: 10.1371/journal.pone.0032253
Qi, Z., Huang, L., Zhu, R., Xin, D., Liu, C., Han, X., et al. (2014). A high-density genetic map for soybean based on specific length amplified fragment sequencing. PLoS One 9:e104871. doi: 10.1371/journal.pone.0104871
Shirasawa, K., Koilkonda, P., Aoki, K., Hirakawa, H., Tabata, S., Watanabe, M., et al. (2012). In silico polymorphism analysis for the development of simple sequence repeat and transposon markers and construction of linkage map in cultivated peanut. BMC Plant Biol. 12:80. doi: 10.1186/1471-2229-12-80
Stepanova, A. N., Robertson-Hoyt, J., Yun, J., Benavente, L. M., Xie, D. Y., and Doležal, K. (2008). TAA1-mediated auxin biosynthesis is essential for hormone crosstalk and plant development. Cell 133, 177–191. doi: 10.1016/j.cell.2008.01.047
Sun, X., Liu, D., Zhang, X., Li, W., Liu, H., Hong, W., et al. (2013). SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing. PLoS One 8:e58700. doi: 10.1371/journal.pone.0058700
Tao, Y., Ferrer, J. L., Ljung, K., Pojer, F., Hong, F., Long, J. A., et al. (2008). Rapid synthesis of auxin via a new tryptophan-dependent pathway is required for shade avoidance in plants. Cell 133, 164–176. doi: 10.1016/j.cell.2008.01.049
van Os, H., Stam, P., Visser, R. G., and van Eck, H. J. (2005). SMOOTH: a statistical method for successful removal of genotyping errors from high-density genetic linkage data. Theor. Appl. Genet. 112, 187–194. doi: 10.1007/s00122-005-0124-y
Varshney, R. K., Bertioli, D. J., Moretzsohn, M. D. C., Vadez, V., Krishnamurthy, L., and Aruna, R. (2009a). The first SSR-based genetic linkage map for cultivated groundnut (Arachis hypogaea L.). Theor. Appl. 118, 729–739. doi: 10.1007/s00122-008-0933-x
Varshney, R. K., Hoisington, D. A., Nayak, S. N., and Graner, A. (2009b). “Molecular plant breeding: methodology and achievements,” in Plant Genomics-Methods and Protocols, eds D. J. Somers, P. Langridge, and J. P. Gustafson (New York, NY: Humana Press), 283–304. doi: 10.1007/978-1-59745-427-8_15
Wang, Z., Huai, D., Zhang, Z., Cheng, K., Kang, Y., Wan, L., et al. (2018). Development of a high-density genetic map based on specific length amplified fragment sequencing and its application in quantitative trait loci analysis for yield-related traits in cultivated peanut. Front. Plant Sci. 9:827. doi: 10.3389/fpls.2018.00827
Won, C., Shen, X., Mashiguchi, K., Zheng, Z., Dai, X., Cheng, Y., et al. (2011). Conversion of tryptophan to indole-3-acetic acid by tryptophan aminotransferases of Arabidopsis and yuccas in Arabidopsis. Proc. Natl. Acad. Sci. U.S.A. 108, 18518–18523. doi: 10.1073/pnas.1108436108
Wu, Y., Bhat, P. R., Close, T. J., and Lonardi, S. (2008). Efficient and accurate construction of genetic linkage maps from the minimum spanning tree of a graph. PLoS Genet. 4:e1000212. doi: 10.1371/journal.pgen.1000212
Xie, Y., Liu, Y., Wang, H., Ma, X., Wang, B., Wu, G., et al. (2017). Phytochrome-interacting factors directly suppress MIR156 expression to enhance shade-avoidance syndrome in Arabidopsis. Nat. Commun. 8:348. doi: 10.1038/s41467-017-00404-y
Xu, X., Xu, R., Zhu, B., Yu, T., Qu, W., Lu, L., et al. (2015). A high-density genetic map of cucumber derived from specific length amplified fragment sequencing (SLAF-seq). Front. Plant Sci. 5:768. doi: 10.3389/fpls.2014.00768
Xu, Y., Huang, L., Ji, D., Chen, C., Zheng, H., and Xie, C. (2015). Construction of a dense genetic linkage map and mapping quantitative trait loci for economic traits of a doubled haploid population of Pyropia haitanensis (Bangiales, Rhodophyta). BMC Plant Biol. 15:228. doi: 10.1186/s12870-015-0604-4
Zhang, J., Zhang, Q., Cheng, T., Yang, W., Pan, H., Zhong, J., et al. (2015). High-density genetic map construction and identification of a locus controlling weeping trait in an ornamental woody plant (Prunus mume Sieb. et Zucc). DNA Res. 22, 183–191. doi: 10.1093/dnares/dsv003
Zhou, X., Xia, Y., Ren, X., Chen, Y., Huang, L., Huang, S., et al. (2014). Construction of a SNP-based genetic linkage map in cultivated peanut based on large scale marker development using next-generation double-digest restriction-site-associated DNA sequencing (ddRADseq). BMC Genomics 15:351. doi: 10.1186/1471-2164-15-351
Keywords: cultivated peanut (Arachis hypogaea L.), high-density genetic map, SLAF-seq, QTL mapping, plant growth habit
Citation: Li L, Yang X, Cui S, Meng X, Mu G, Hou M, He M, Zhang H, Liu L and Chen CY (2019) Construction of High-Density Genetic Map and Mapping Quantitative Trait Loci for Growth Habit-Related Traits of Peanut (Arachis hypogaea L.). Front. Plant Sci. 10:745. doi: 10.3389/fpls.2019.00745
Received: 12 March 2019; Accepted: 20 May 2019;
Published: 12 June 2019.
Edited by:Takaki Yamauchi, PRESTO, Japan Science and Technology Agency, Japan
Reviewed by:Ramesh S. Bhat, University of Agricultural Sciences, Dharwad (ICAR), India
Xiaohua Qi, Yangzhou University, China
Copyright © 2019 Li, Yang, Cui, Meng, Mu, Hou, He, Zhang, Liu and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
†These authors have contributed equally to this work