Original Research ARTICLE
Fine-Mapping the Branching Habit Trait in Cultivated Peanut by Combining Bulked Segregant Analysis and High-Throughput Sequencing
- Department of Field Crops, Plant Science Institute, Agricultural Research Organization, Bet-Dagan, Israel
The growth habit of lateral shoots (also termed “branching habit”) is an important descriptive and agronomic character of peanut. Yet, both the inheritance of branching habit and the genetic mechanism that controls it in this crop remain unclear. In addition, the low degree of polymorphism among cultivated peanut varieties hinders fine-mapping of this and other traits in non-homozygous genetic structures. Here, we combined high-throughput sequencing with a well-defined genetic system to study these issues in peanut. Initially, segregating F2 populations derived from a reciprocal cross between very closely related Virginia-type peanut cultivars with spreading and bunch growth habits were examined. The spreading/bunch trait was shown to be controlled by a single gene with no cytoplasmic effect. That gene was named Bunch1 and was significantly correlated with pod yield per plant, time to maturation and the ratio of “dead-end” pods. Subsequently, bulked segregant analysis was performed on 52 completely bunch, and 47 completely spreading F3 families. In order to facilitate the process of SNP detection and candidate-gene analysis, the transcriptome was used instead of genomic DNA. Young leaves were sampled and bulked. Reads from Illumina sequencing were aligned against the peanut reference transcriptome and the diploid genomes. Inter-varietal SNPs were detected, scored and quality-filtered. Thirty-four candidate SNPs were found to have a bulk frequency ratio value >10 and 6 of those SNPs were found to be located in the genomic region of linkage group B5. Three best hits from that over-represented region were further analyzed in the segregating population. The trait locus was found to be located in a ~1.1 Mbp segment between markers M875 (B5:145,553,897; 1.9 cM) and M255 (B5:146,649,943; 2.25 cM). The method was validated using a population of recombinant inbreed lines of the same cross and a new DNA SNP-array. This study demonstrates the relatively straight-forward utilization of bulk segregant analysis for trait fine-mapping in the low polymeric and heterozygous germplasm of cultivated peanut and provides a baseline for candidate gene discovery and map-based cloning of Bunch1.
Peanut (Arachis hypogaea L.) is an economically significant crop grown throughout the world. It is the second-most important cultivated grain legume and the fourth largest edible oilseed crop (Faostat, 1998). It is an unusual legume plant in that its flowers are borne aboveground, but the fruits develop underground. The plant is an indeterminate, annual herbaceous bush that is 15–70 cm tall and is comprised of an erect main shoot and a number of lateral shoots (branches) that begin at the base of the plant.
The growth angle of the lateral shoots (commonly referred to as “growth habit” or “branching habit”) is one of the most important descriptive characteristics of peanut (Pittman, 1995). The wild polyploid peanut species A. monticola usually has a spreading phenotype, in which lateral branches strike or partially strike the ground. In domesticated peanut, four different, and easily distinguishable categories of branching habit are known: prostrate, spreading, bunch, and erect (Table 1). The description of a new peanut variety, especially one that is of the Virginia marketing-type, will almost always begin with the definition of its growth habit (e.g., bunch or spreading). In addition to assisting breeders and other researchers in identifying accessions for specific traits, branching habit has a great impact on peanut physiology, productivity, and crop management. Since fruiting in peanut occurs underground, the distance between the flowering buds and the ground is an important factor. Pegs of bunch/erect plants that do not reach the ground will not produce pods on time. However, the pods of bunch/erect plants develop at the same time, promoting early maturation. The growth habit of peanut affects the implementation of agrotechnology such as mechanical cultivation and disease management (Butzler et al., 1998).
Table 1. The four main types of growth habit in peanut (from Pittman, 1995).
Despite the agronomic importance of the growth habit of peanut, both the inheritance of this trait and the genetic mechanism that controls branching habit in this crop are not clear. The trait was studied in detail during the 1960s and 1970s and, during that period, two distinct phenotypic groups were usually considered: the runner growth habit wherein the side branches are prostrate, always growing peripherally from the main axis, trailing on the ground except for the tips, which may be somewhat ascending and the bunch growth habit, in which the laterals are also erect or ascending. Initially, a two-gene model for the control of growth habit was suggested, with the runner habit dominant to the erect habit (Hull, 1933; Patel et al., 1936; Coffelt, 1974), but investigators had difficulty classifying the intermediate and/or abnormal growth habits of F2 progeny of crosses between plants exhibiting different growth habits. A fundamental set of experiments conducted by Ashri (1964, 1968) indicated the existence of a genic-cytoplasmic interaction that controls growth habit in peanut. In a project involving a series of reciprocal crosses, differences in growth habit were recorded. A few more nuclear and cytoplasmic genes were later identified by the same group and those researchers concluded that cytoplasmic inheritance has a major effect on the branching habit of peanut (Ashri, 1975; Ashri and Levi, 1975). In more recent studies, the branching habit trait was genetically characterized and mapped by using inter-specific crossing system with an amphidiploid species (Fonceka et al., 2012a,b). The branching habit trait was phenotyped quantitatively by using a continuous scale from 1 (procumbent) to 6 (erect). The trait showed a wide range of morphologies, ranging from completely prostrate to totally erect, and several QTLs were found to control the trait, with the most significant located on linkage groups a07, b05, a10, and b10.
Like many polyploid species, cultivated peanut has experienced a genetic bottleneck, which, together with the effects of domestication, has greatly narrowed its genetic diversity and limited DNA polymorphism among subsequently derived Arachis lines (Kochert et al., 1991; Moretzsohn et al., 2013). As a result, peanut has a low degree of polymorphism among cultivated varieties. This limited polymorphism has hindered the development of molecular and genomic tools for use in domesticated peanut. With the introduction of the genome sequences of peanut ancestors Arachis duranensis and Arachis ipaensis (Bertioli et al., 2016), peanut is now “the orphan legume genome whose time has come” (Ozias-Akins, 2013). These genome sequences provide the resources necessary to move peanut genomics to the next level, facilitating the development of SNP-based marker technologies. In the past, the most widely used molecular markers were simple-sequence repeats (SSRs). Despite their widespread use on the intra-species cultivated level (e.g., Selvaraj et al., 2009), the utility of SSR studies of peanut is limited by their apparent low frequency of across the genome and the relatively low-throughput method of analysis. The use of high-throughput markers like SNPs is necessary for efficient use of genomic data for marker-assisted selection, quantitative trait locus mapping and genomic selection. Recently, several platforms have been developed to facilitate the use of SNP markers for gene-mapping in peanut, including genotyping by sequencing (Zhou et al., 2014) and genome resequencing-based SNP arrays (Clevenger et al., 2016; Pandey et al., 2017). However, despite these advances, these platforms are usually highly efficient only for homozygous populations, like recombinant inbreed lines (RILs) and introgression lines (ILs), which are relatively tedious and expensive to construct in peanut. These methods are usually less effective for trait-mapping in heterozygous genetic populations (e.g., F2 and F3 generations) due to the allopolyploid nature of the peanut genome. This is particularly true in cases of genetic populations that are based upon a cross between closely related parental genotypes, which are occasionally needed for better genetic dissection of specific traits.
In this study, we used a well-defined genetic system to further investigate the genetic nature of the branching habit trait of peanut. Initially, segregating F2 populations derived from a reciprocal cross between very closely-related Virginia marketing-type cultivars were analyzed. Against this particular background, the spreading/bunch trait, a well-known characteristic of the Virginia varieties, was found to be controlled by a single gene with no cytoplasmic effect. Subsequently, a combination of bulked segregant analysis and deep sequencing was developed to facilitate the SNP detection process and the fine-mapping of this gene. The processes were validated using a RIL population derived from the same cross and a new SNP array (Pandey et al., 2017). The relatively straight-forward utilization of this technique in ultra-low polymorphic and highly heterozygous peanut germplasm is demonstrated.
Materials and Methods
Plant Material and Data Collection
Segregating F2 and F3 populations derived from a reciprocal cross between very closely-related Virginia-type peanut cultivars were studied. The parental lines were cv. “Hanoch,” a late-maturing spreading-type cultivar, and cv. “Harari,” a medium-maturing bunch-type cultivar. The two parental lines share substantial genetic background, since cv. Harari was developed from an initial cross between cv. Hanoch and cv. Shulamit and an additional back-cross of cv. Hanoch with cv. Hillah (the outcome of Hanoch × Shulamit). In 2013, 314, and 252 F2 individuals from reciprocal Hanoch × Harari and Harari × Hanoch crosses, respectively, were grown under field conditions. The plot consisted of two rows, 75 cm apart, with 40 cm spacing between plants within each row. The experimental plots were sown alongside commercial plots under full irrigation. All agricultural practices were carried out according to local growing protocols as described previously (Gupta et al., 2014).
Growth habit was recorded at 80 days after sowing. At the end of the season, pods were harvested on an individual-plant basis. For each sample, the total pod yield, net pod yield (where immature and unhealthy pods are excluded), number of pods, total seed weight, “dead-end” ratio (relative number of pods with the remote seed aborted) and seed ratio (net seed weight per plant divided by the net pod weight per plant) traits were recorded as well. From each population, approximately 150 F3 families were grown in the subsequent season, with 16 seeds from each F2 individual sown. Branching habit was recorded at the family level (spreading/bunch/segregating) at 80 days after sowing. Plant maturity index was determined based on three random plants in the homozygous F3 families, as the number of fully matured pods out of the total number of pods at 140 days after sowing. To validate the bulk segregant analysis, 94 RILs (F6:F8), which originated from the same Hanoch × Harari cross by single seed descent, were analyzed. In 2016, 16 randomly-selected plants from each RIL were grown under field conditions and the growth habit of each plant was recorded 70 days after sowing.
RNA Isolation, Preparation of Libraries, and High-Throughput Sequencing
Bulked segregant analysis was performed on the F3 families that were found to be homozygous for the spreading or bunch growth habit. In total, 52 completely bunch and 47 completely spreading families were sampled. Young leaves were collected from all 16 individuals in each family. In each phenotypic group (spreading/bunch), all tissues from the families were bulked for the RNA extraction. Working on the RNA level was preferable to working on the DNA genomic level due to the large and relatively complex peanut genome and also facilitated the detection of candidate genes. Samples were taken of each of the ground tissues (400 mg each) and were used for RNA extraction using the hot-borate method, as described by Brand and Hovav (2010). The total RNA was used to prepare two RNA-Seq libraries, using TruSeq RNA Sample Preparation Kit v2 (Illumina) following the manufacturer's protocol as described previously (Gupta et al., 2016). Libraries were validated using DNA Screen Tape D1000 and the Tapestation 2200 (Agilent). RNA-Seq libraries were sequenced using an Illumina HiSeqTM2000 (single lane) at the sequencing center at the Technion in Haifa, Israel.
Data analyses followed the general guidelines for bulk segregant analysis using next-generation sequencing (Magwene et al., 2011) and the specific guidelines for polyploids (Trick et al., 2012), with several modifications. Raw reads were subjected to a cleaning procedure using the FASTX Toolkit (http://hannonlab.cshl.edu/fastx_toolkit/ index.htm) including: (1) trimming read-end nucleotides with quality scores <30 using fastq_quality_trimmer and (2) removing reads with less than 70% base pairs with quality score ≤ 30 using fastq_quality_filter. The sequences were mapped against the 4X tetraploid peanut transcript assembly reference (http://www.peanutbase.org/) and against two Arachis diploid genomes (A. duranensis and A. ipaensis; Bertioli et al., 2016; peanutbase.org) using Bowtie2 aligner (Langmead and Salzberg, 2012). The genome Analysis Toolkit (GATK) Unified Genotyper software version 2.5.2 (McKenna et al., 2010; DePristo et al., 2011) was used for the detection of SNPs. A custom Perl script was used to derive the symmetric difference of the two SNP sets. Polymorphisms between homologous genomes generate the same doubled code and should be common to both SNP sets. Yet, differences in the SNPs between cv. Hanoch and cv. Harari (varietal-specific SNPs) should generate doubled code for only one bulk and, therefore, be unique to the corresponding SNP set. In this manner, ~13,000 varietal-specific SNPs were retrieved between the two bulks. These SNPs were further filtered according to the number of reads for each SNP > 50, GATK quality value >100 and BFR >3. Also, genes with SNP densities higher than 5 SNPs/kb were eliminated to avoid possible paralogue SNPs.
Validation of the SNP Markers and the Bulk Segregant Analysis
For further validation of the SNPs, DNA was collected from the parental lines and 20 F3 progeny of the cross cv. Hanoch × cv. Harari using a DNA Easy kit (SIGMA Aldrich). The same leaves that were used for the RNA study were used for DNA extraction, but the DNA analysis was conducted on a single-plant basis instead of with bulk samples. To validate the three best SNP markers, the following primers were used: M35: F-TCTCTCTCTCTCACAGTCAC; R-CTTGCCGGCAAATAGAGCAT. M255: F--CAGATATGCAAGGCCTAACT; R-TGCCAGAGCAAGGAACATGT. M875: F-CCATCTGCAGTGAGAGTCAA; R-GTGATTCCTGCGTTCAAGTC. These primers were also used for further mapping of the trait in 182 F4 individuals derived from one F3 family segregating for the branching habit trait.
The fine-mapping of the branching habit gene carried out using the bulk-segregant approach was further validated by a custom Affymetrix Axiom SNP array (Pandey et al., 2017). For that analysis, DNA was collected from the two parental lines and each of 94 Recombinant Inbreed Lines (RILs) derived from the same cv. Hanoch × cv. Harari cross. Young leaves were collected from 12 random plants from each RIL and DNA was extracted with a specific kit (GenElute™; Sigma). DNA was quantified by Qubit (Invitrogene LTD) and diluted to 30 ng/uL according to the Affymetrix guidelines (http://www.affymetrix.com). The chip array calls were subjected to cluster-quality filtering, carried out according to Affymetrix guidelines, and additional filtering to select only SNPs that were polymorphic between the parental lines and segregated in a 1:1 ratio in the RILs.
Branching Habit Is Controlled by a Single Gene
The segregation patterns of the branching habit trait in the F1, F2, and F3 generations of the “Hanoch” X “Harari” cross are presented in Table 2. As shown, in this genetic background, the spreading/bunch pattern appears to correspond to a single-gene model of inheritance with no cytoplasmic effect. The allele that confers the spreading phenotype is dominant over the one that confers the bunch habit, as demonstrated by the spreading phenotype of all the F1 hybrids, the 3:1 segregation ratio among the F2 progeny and the 1:2:1 segregation ratio among the F3 families. The gene was named bunch1 and the classification of its corresponding phenotype was very easy and clear-cut, even as early as 50 days after sowing (Figure 1).
Table 2. Segregation pattern of the spreading/bunch trait in several generations derived from closely related peanut varieties.
Figure 1. Bunch and spreading phenotypes among (A) F2 individuals and (B) F3 families grown under field conditions, t 50 days after sowing.
Bunch1 Is Associated with Several Important Agronomic Traits
In addition to growth habit, other traits with agronomic importance were examined in the segregating populations. The associations between the bunch1 phenotype and each of these traits are presented in Figure 2. The bunch phenotype of bunch1 was significantly associated with a lower dead-end ratio. The bunch1 phenotype was also found to have a small, but significant [Prob (t) = 0.0022] effect on early maturation. On the other hand, the BUNCH1 phenotype (spreading) was significantly associated with higher total pod weight and a greater number of pods per plant.
Figure 2. The difference between the bunch (bunch1) and the spreading (BUNCH1) phenotypes of the bunch1 gene in terms of several agronomic traits. Phenotypes that are not labeled with the same letter are significantly different from one another (p < 0.05). (A) Net pod weight, (B) total pod weight, (C) seed content, (D) number of pods per plant, (E) dead-end ratio, and (F) percentage of mature pods at 10 days after sowing (DAS).
Identifying SNP Markers that Are Linked to Bunch1
In order to map bunch1 on the peanut genome, a bulk segregant analysis was performed. For that analysis, 52 completely bunch and 47 completely spreading F3 families were bulked RNA was extracted from each bulk and converted into two libraries suitable for Illumina sequencing. After a cleaning procedure, 72 million reads per library (on average) were aligned to a 4X transcript assembly (peanutbase.org) that contains 120,364 peanut transcripts from both the A and B genomes [60,814 transcripts represent the A genome (Arachis duranensis) while 59,551 transcripts represent the B genome (Arachis ipaensis)]. With about 98% of reads mapped to the reference assembly, the expression levels of 117,957 peanut genes were measured.
Pipelines for the SNP discovery and the analysis of bulk frequency ratio were constructed according to the general scheme that was previously suggested for polyploid wheat (Trick et al., 2012). After initial filtering, ~13,000 SNPs were found to be polymorphic by the two bulks. Subsequently, the bulk frequency ratio was determined for each SNP by calculating the frequency of each nucleotide of the SNP in each bulk and then dividing one bulk by another. If the SNP is a result of false-positive call of a homoeologus SNP, or if it is not linked to the trait, then both of the SNP nucleotides will be equally represented in the two bulks. However, if the SNP is linked to the gene, the frequencies of the SNP nucleotides in one bulk will be with significantly higher frequencies than the other bulk. In that manner, ~1,200 SNPs were found to have bulk frequency ratios of > 3, while 34 had bulk frequency ratios >10 (Figure 3A).
Figure 3. Bulk frequency ratio (BFR) analysis to identify SNPs and genome locations that are linked to Bunch1. (A) The distribution of BFR in ~13,000 varietal-specific SNPs. (B) The genomic locations of the 34 SNPs with the highest BFR (>10), indicating one over-represented region at the end of linkage group 5B (red).
The genomic location of the 34 SNPs with bulk frequency ratios >10 within the peanut genome was recorded (Figure 3B, Supplemental Table 1). One region at the end of linkage group 5B was found to be over-represented in this SNP group; six of the 34 were located between 5B:135,963,343 and 5B147,304,662, including the SNP with the highest bulk frequency ratio [M875 (EZ721696.1); bulk frequency ratio = 23]. Two of these 5B linkage group and another few hypothetical SNPs with high BFR ratio from different linkage groups were further analyzed for SNP classification with Sanger sequencing. The purpose of this step was to roughly validate the location of bunch1. Therefore, samples from the parental lines and 6 F2 individual plants (from which 3 spreading and 3 bunch F3 families were derived) were selected (Supplemental Table 1). In this initial analysis, the SNPs from B5 linkage group were found almost perfectly segregating with trait, while the SNPs from the other genomic locations found to be either homoeologus SNPs (and not varietal) or didn't segregate with the trait (Supplemental Table 1), indicating for relatively high false positive ratio for the BSA by GBS technique in this system.
SNP marker M875 and other two SNPs from the same genomic location that had high bulk frequency ratios [M35 (EZ721381.1), M255 (EZ748922.1)] were further analyzed (Figure 4A). Samples from 20 homozygous F3 families were checked (10 spreading and 10 bunch; Figure 4B). In this initial analysis, M875 was found to be completely linked to bunch1, while the two others, M35 and M255, were also linked to bunch1, but not completely. In the next phase, 182 F4 individuals that originated from heterozygous segregating F3 families were genotyped and phenotyped using markers M255 and M875. Bunch1 was found to be located in a ~1.1 Mbp segment between markers M875 (B5:145,553,897; 1.9 cM) and M255 (B5:146,649,943; 2.25 cM).
Figure 4. SNP validation and linkage analysis with Bunch1. (A) An example of Sanger sequencing validation of two SNP markers. (B) Analysis of linkage between the top three SNP markers and the phenotype of Bunch1. Samples from 10 completely spreading (BUNCH1/BUNCH1) and 10 completely bunch (Bunch1/Bunch1) F3 families were analyzed.
Further Validation of the Bulk Segregant Analysis Using a Peanut SNP Array
Final confirmation of the fine-mapping of the Bunch1 gene was obtained using a new Affymetrix Axiom SNP array (Pandey et al., 2017). Since the chip technology is not efficient enough to distinguish between the heterozygous and homozygous states in the polyploid, a RIL population, which was advanced from the same cv. Hanoch × cv. Harari hybridization (F6:8), was used. Ninety-four RILs and the two parental lines were used for the analysis. Genomic DNA was extracted and applied to the 58,233 SNP clusters of the chip. Out of all of these SNPs, which were designed based on a wide spectrum of diploid and tetraploid peanut species, only 615 passed through the filtering pipeline, including significant differences between the parental lines and 1:1 segregation among the 94 RILs. The genetic analysis of these SNPs and the phenotype of Bunch1 gene are presented in Figure 5A. Ten SNP markers from the array significantly (p < 0.01) co-segregated with the phenotype of Bunch1 (Figure 5A). The best-linked SNP marker (AXX147251194) had only 1 recombinant RIL out of the 94 checked RILs (p = e−50; R2 = 0.92). All significant SNP markers were located in one region at linkage 5B, indicating once again that a single locus is controlling the branching habit trait in this background. These SNPs were located in very close proximity to the three SNPs that were derived from the bulk segregant analysis (Figure 5B). Interestingly, none of the SNPs from the bulk segregant analysis were detected in the chip array and vice versa, indicating that more SNPs could possibly be found by bulk segregant analysis and used in future SNP-array designs for cultivated peanut.
Figure 5. Validation and further fine-mapping of Bunch1 using an Affymetrix Axiom SNP array. (A) Manhattan plot for the analysis of linkage between 615 Affymetrix Axiom SNPs and the Bunch1 phenotype. 1–10 = genome A; 11–20 = genome B (e.g., 15 = linkage group 5B). (B) Integrative map for the bulk segregant analysis and SNP-array analyses of the peanut linkage group B5 (from PeanutBase.org). Markers derived from the bulk segregant analysis are indicated in red. Yellow–gene models. Green–ESTs of genes. Pink–syntheny of this region with A. duranensis (Genome A).
The genetic/molecular mechanism that controls growth angle in plants has been the subject of several studies, mainly involving monocotyledons, particularly rice. Several abnormal tiller-angle mutants and their corresponding genes have been reported in rice,) such as LA1 (Li et al., 2007) and PIN2 (Chen et al., 2012). Two additional genes with opposite effects on tiller angle, Tiller Angle Control 1 (TAC1) and Prostrate Growth 1 (PROG1), have also been identified in rice (Yu et al., 2007; Tan et al., 2008). These genes have played critical roles in the domestication of rice. There are several reports regarding the molecular biology of the growth angle of lateral shoots of dicot species. Roychoudhry et al. (2013) described a model in which the set point angle of lateral branches of higher plants is controlled by an auxin-dependent antigravitropic mechanism. The molecular basis for the spatial pattern of tree branches was also studied in peach, resulting in the discovery of a new ortholog of the TAC1 gene, which controls the “pillar” tree phenotype (Dardick et al., 2013).
We explored the branching habit in the leguminous crop Arachis hypogaea and fine-mapped a major gene that controls this trait. The bunch1 gene was mapped to a relatively small genomic region that includes ~70 ORFs for gene models. Interestingly, BlastX analysis showed that none of the above-mentioned genes that control the growth angle in either monocots or dicots were present in the peanut genome or mapped in approximation to bunch1. The genetic controller of bunch1 may therefore be novel. Several candidate genes involved in plant hormone metabolism and light reception are located within that region and have been identified as possibly controlling bunch1. One of these may be a FAR1-Related sequence (B05:146200756.146203528) that encodes a family of proteins that are essential for phytochrome A-controlled far-red responses in Arabidopsis (Arabidopsis thaliana; (Lin and Wang, 2004). Another putative candidate gene is the 1-aminocyclopropane-1-carboxylate oxidase-like protein (ACC-oxidase; B05:146236653.146238358), which catalyzes the last step in ethylene biosynthesis. Ethylene biosynthesis may play an important role in determining peanut growth angle. Applying a relatively small amount of EPCA (an ethylene-releasing compound) caused the horizontal branches of runner-type plants to become erect (Ziv et al., 1976). Yet, these and other candidate genes must, of course, be further examined in light- and plant hormone-targeted studies, as well as subjected to verification by positional cloning and transformation.
The Bunch1 gene had strong associations with several traits in the segregating F2 and F3 populations. Plants with the bunch phenotype had, on average, earlier maturity values and fewer dead-end pods. Plants with the spreading phenotype had on average more pods per plant, but many of those pods were actually undeveloped. In the bunch type, especially when a wide planting spacing is used (as in our experiments), many flowers are too far from the ground and cannot reach to the soil. For that manner, only the pods that are close to the root will develop. However, those pods that reach the soil develop in a more synchronized manner among the bunch types than among the spreading types. This promotes uniform maturation, better pod filling and, eventually, fewer dead-end pods. Indeed, many of the dead-end pods in the spreading types (like cv. Hanoch) are from distal parts of the branches, where pods develop late in the season and do not fully mature. In a subset of the RIL population, the bunch phenotype of the branching gene is significantly correlated with greater resistance to white mold (caused by S. sclerotiorum; data not shown; submitted for publication). Therefore, it is suggested that the phenotype of the Bunch1 gene has an important agricultural role in Virginia-type peanuts.
As we explored the segregation patterns of several other crosses between Virginia-type related cultivars with different branching habits, we noticed that this model of the bunch1 gene is relatively common within Israeli peanut breeding germplasm. Crosses between cv. Hanoch (spreading) and cv. Hillah/Shulamit (bunch) resulted in a 3:1 spreading:bunch ratio (data not shown), tracing the origin of this trait back to the early 1970s. Crosses between cv. Harari (bunch) and a runner-type peanut line GK-7-Ol (spreading) also resulted in a 3:1 ratio (data not shown), indicating that the single-gene model for this trait is not confined to the Virginia-type germplasm. Yet, in crosses with Valencia-type peanut germplasm (plants with an erect branching habit), this system of single-gene inheritance was not found andthe branching habit was therefore relatively hard to classify. We conclude that the bunch1 phenotype is confined to the A. hypogaea ssp. hypogaea gene pool. However, other allelic variations of this gene may exist in the A. hypogaea ssb. fastigiata germplasm, since some of the bunch-habit Virginia-type lines in Israel (e.g., cvs. Shulamit, Hillah, etc.) have A. fastigiata origins. Moreover, the branching habit trait of peanut was analyzed in a previous study by Fonceka et al. (2012a), which involved mapping traits in a cross between amphidiploid A. ipaensis /A. duranensis and a Spanish-type cultivar (A. hypogaea ssb. fastigiata var vulgaris) with an erect growth habit. There was one significant QTL for the branching habit at the same location as Bunch1 (at the end of linkage group B5), explaining 16.2 of the total variation of the trait in the population. It is very likely that the reported QTL for branching habit and the locus of Bunch1 are the same. This demonstrates once again that the origin of the Bunch1 phenotype may be beyond the A. hypogaea ssp. hypogaea genetic background.
We have demonstrated the relatively straight-forward and easy utilization of bulk segregant analysis for the fine-mapping of a monogenic trait in the low-polymorphic and heterozygous germplasm of cultivated peanut. Also, although the bunch/spreading trait is very easy to classify, there may be some new uses for these new markers in peanut breeding, particularly in the validation of successful F1 hybrids (when the female of the cross is the dominant spreading type) and the selection of homozygous spreading families in early breeding generations. The fine-mapping of this trait also provides a baseline for the cloning of Bunch1, presumably one of the first map-based positional cloning ventures in current genetic research of peanut.
GK and YB were responsible for the molecular work; AF was responsible for the bioinformatic analysis; AP was responsible for RIL population analysis; IH conducted the field trials; RH managed the study and wrote the manuscript.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
This study was funded by the Israel Peanut Production and Marketing Board. The authors wish to thank Mr. Oren Buchshtab and other workers of Dod Moshe LTD for their assistance with the field trials.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/article/10.3389/fpls.2017.00467/full#supplementary-material
Supplementary Table 1. List of the 34 hypothetical SNPs with the highest (>10) BFR. The table indicates the location of the SNP, the change of the nucleotide and the statistic score for the change. Also, the table presents further Sanger sequencing analysis of six SNPs with high BFR ratio. For this, six F2 individual plants were sampled, from which three spreading (green) and three bunch (yellow) F3 families were derived.
Bertioli, D. J., Cannon, S. B., Froenicke, L., Huang, G. D., Farmer, A. D., Cannon, E. K., et al. (2016). The genome sequences of Arachis duranensis and Arachis ipaensis, the diploid ancestors of cultivated peanut. Nat. Genet. 48, 438–444. doi: 10.1038/ng.3517
Brand, Y., and Hovav, R. (2010). Identification of suitable internal control genes for quantitative Real-Time PCR expression analyses in peanut (Arachis hypogaea). Peanut Sci. 37, 12–19. doi: 10.3146/PS09-014.1
Butzler, T. M., Bailey, J., and Beute, M. K. (1998). Integrated management of Sclerotinia blight in peanut: utilizing canopy morphology, mechanical pruning, and fungicide timing. Plant Dis. 82, 1312–1318. doi: 10.1094/PDIS.1918.104.22.1682
Chen, Y. N., Fan, X., Song, W., Zhang, Y., and Xu, G. (2012). Over-expression of OsPIN2 leads to increased tiller numbers, angle and shorter plant height through suppression of OsLAZY1. Plant Biotechnol. J. 10, 139–149. doi: 10.1111/j.1467-7652.2011.00637.x
Clevenger, J., Chu, Y., Chavarro, C., Agarwal, G., Bertioli, D. J., Leal-Bertioli, S. C., et al. (2016). Genome-wide SNP genotyping resolves signatures of selection and tetrasomic recombination in peanut. Mol. Plant 10, 309–322. doi: 10.1016/j.molp.2016.11.015
Dardick, C., Callahan, A., Horn, R., Ruiz, K. B., Zhebentyayeva, T., Hollender, C., et al. (2013). PpeTAC1 promotes the horizontal growth of branches in peach trees and is a member of a functionally conserved gene family found in diverse plants species. Plant J. 75, 618–630. doi: 10.1111/tpj.12234
DePristo, M. A., Banks, E., Poplin, R., Garimella, K. V., Maguire, J. R., Hartl, C., et al. (2011). A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498. doi: 10.1038/ng.806
Faostat (1998). Available online at: http://faostat.fao.org/default.aspx, last update 2016.
Fonceka, D., Tossim, H. A., Rivallan, R., Vignes, H., Faye, I., Ndoye, O., et al. (2012a). Fostered and left behind alleles in peanut: interspecific QTL mapping reveals footprints of domestication and useful natural variation for breeding. BMC Plant Biol. 12:26. doi: 10.1186/1471-2229-12-26
Fonceka, D., Tossim, H. A., Rivallan, R., Vignes, H., Lacut, E., de Bellis, F., et al. (2012b). Construction of chromosome segment substitution lines in peanut (Arachis hypogaea L.) using a wild synthetic and QTL mapping for plant morphology. PLoS ONE 7:e48642. doi: 10.1371/journal.pone.0048642
Gupta, K., Buchshtab, O., and Hovav, R. (2014). The effects of irrigation level and genotype on pod-filling related traits in peanut (Arachis hypogaea). J. Agric. Sci. 7, 169–181. doi: 10.5539/jas.v7n1p169
Gupta, K., Hedvat, I., Faigenboim-Doron, A., Clevenger, J., Ozias-Akins, P., and Hovav, H. (2016). Transcriptome profiling of peanut developing seed with a focus on duplicate oil related pathways. Front. Plant Sci. 7:1491. doi: 10.3389/fpls.2016.01491
Kochert, G., Halward, T., Branch, W. D., and Simpson, C. E. (1991). Rflp variability in peanut (Arachis-Hypogaea L) cultivars and wild-species. Theor. Appl. Genet. 81, 565–570. doi: 10.1007/BF00226719
Lin, R., and Wang, H. (2004). Arabidopsis FHY3/FAR1 gene family and distinct roles of its members in light control of arabidopsis development. Plant Physiol. 136, 4010–4022. doi: 10.1104/pp.104.052191
McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K., Kernytsky, A., et al. (2010). The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303. doi: 10.1101/gr.107524.110
Moretzsohn, M. C., Gouvea, E. G., Inglis, P. W., Leal-Bertioli, S. C., Valls, J. F., and Bertioli, D. J. (2013). A study of the relationships of cultivated peanut (Arachis hypogaea) and its most closely related wild species using intron sequences and microsatellite markers. Ann. Bot. Lond. 111, 113–126. doi: 10.1093/aob/mcs237
Ozias-Akins, P. (2013). The orphan legume genome whosetime has come: symposium highlights from the American Peanut Research & Education Society annual meeting. Peanut Sci. 40, 66–69. doi: 10.3146/PS13-14.1
Pandey, M., Agarwal, G., Kale, S., Clevenger, J., Nayak, S., Sriswathi1, M., et al. (2017). Development and evaluation of a high density genotyping ‘Axiom_Arachis’ array with 58 K SNPs for accelerating genetics and breeding in groundnut. Sci. Rep. 7:40577. doi: 10.1038/srep40577
Roychoudhry, S., Del Bianco, M., Kieffer, M., and Kepinski, S. (2013). Auxin controls gravitropic setpoint angle in higher plant lateral branches. Curr. Biol. 23, 1497–1504. doi: 10.1016/j.cub.2013.06.034
Selvaraj, M. G., Narayana, M., Schubert, A. M., Ayers, J. L., Baring, M. R., and Burow, M. D. (2009). Identification of QTLs for pod and kernel traits in cultivated peanut by bulked segregant analysis. Electron. J. Biotechnol. 12. doi: 10.2225/vol12-issue2-fulltext-13
Trick, M., Adamski, N. M., Mugford, S. G., Jiang, C. C., Febrer, M., and Uauy, C. (2012). Combining SNP discovery from next-generation sequencing data with bulked segregant analysis (BSA) to fine-map genes in polyploid wheat. BMC Plant Biol. 12:14. doi: 10.1186/1471-2229-12-14
Zhou, X., Xia, Y., Ren, X., Chen, Y., Huang, L., Huang, S., et al. (2014). Construction of a SNP-based genetic linkage map in cultivated peanut based on large scale marker development using next-generation double-digest restriction-site-associated DNA sequencing (ddRADseq). BMC Genom. 15:351. doi: 10.1186/1471-2164-15-351
Keywords: peanut, branching habit, bulked segregant analysis, fine mapping
Citation: Kayam G, Brand Y, Faigenboim-Doron A, Patil A, Hedvat I and Hovav R (2017) Fine-Mapping the Branching Habit Trait in Cultivated Peanut by Combining Bulked Segregant Analysis and High-Throughput Sequencing. Front. Plant Sci. 8:467. doi: 10.3389/fpls.2017.00467
Received: 28 December 2016; Accepted: 16 March 2017;
Published: 04 April 2017.
Edited by:Maria Carlota Vaz Patto, Instituto de Tecnologia Quimica e Biologica and Universidade Nova de Lisboa, Portugal
Reviewed by:Daniela Marone, Centre of Cereal Research-CREA-CER-Foggia, Italy
Teresa Millan, Universidad de Cordoba, Spain
Daniel Fonceka, Agricultural Research Centre For International Development, France
Copyright © 2017 Kayam, Brand, Faigenboim-Doron, Patil, Hedvat and Hovav. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ran Hovav, email@example.com