ORIGINAL RESEARCH article
Validation of Candidate Gene-Based Markers and Identification of Novel Loci for Thousand-Grain Weight in Spring Bread Wheat
- 1Department of Bioscience, CIMMYT, Texcoco, Mexico
- 2Departamento de Genética, Escuela Técnica Superior de Ingeniería Agronómica y de Montes, Edificio Gregor Mendel, Campus de Rabanales, Universidad de Córdoba, Córdoba, Spain
- 3Recursos Genéticos y Productividad, Colegio de Postgraduados, Montecillo, Mexico
Increased thousand-grain weight (TGW) is an important breeding target for indirectly improving grain yield (GY). Fourteen reported candidate genes known to enhance TGW were evaluated in two independent and existing datasets of wheat at CIMMYT, the Elite Yield Trial (EYT) from 2015 to 2016 (EYT2015-16) and the Wheat Association Mapping Initiative (WAMI) panel, to study their allele effects on TGW and to verify their suitability for marker-assisted selection. Of these, significant associations were detected for only one gene (TaGs3-D1) in the EYT2015-16 and two genes (TaTGW6 and TaSus1) in WAMI. The reported favorable alleles of TaGs3-D1 and TaTGW6 genes decreased TGW in the datasets. A haplotype-based genome wide association study was implemented to identify the genetic determinants of TGW on a large set of CIMMYT germplasm (4,302 lines comprising five EYTs), which identified 15 haplotype blocks to be significantly associated with TGW. Four of them, identified on chromosomes 4A, 6A, and 7A, were associated with TGW in at least three EYTs. The locus on chromosome 6A (Hap-6A-13) had the largest effect on TGW and additionally GY with increases of up to 2.60 g and 258 kg/ha, respectively. Discovery of novel TGW loci described in our study expands the opportunities for developing diagnostic markers and for multi-gene pyramiding to derive new allele combinations for enhanced TGW and GY in CIMMYT wheat.
Of all yield components, grain weight is the most stable and heritable trait, hence an important selection target for the genetic improvement of grain yield (GY) (Cooper et al., 2012; Xiao et al., 2012). In wheat, a number of genes related to increased thousand grain weight (TGW) have been cloned in the past decade taking advantage of the synteny between wheat and rice (Jiang et al., 2011; Su et al., 2011; Ma et al., 2012; Yang et al., 2012; Chang et al., 2013; Guo et al., 2013; Dong et al., 2014; Qin et al., 2014; Zhang et al., 2014; Jiang et al., 2015; Hanif et al., 2016; Hu et al., 2016; Ma et al., 2016; Simmonds et al., 2016;Wang et al., 2016; Zhang et al., 2012; Zhang et al., 2015; Zhang et al., 2017; Zheng et al., 2014). This has enriched the understanding of this trait and concomitantly gene-based single nucleotide polymorphism (SNP) markers have become available that are suggested to be used to enhance TGW through marker-assisted selection (MAS). Scrutiny of the published literature, however, has revealed that the effects of most of these genes have not been validated in diverse sets of wheat germplasm.
Recently, Mohler et al. (2016) investigated associations of 13 candidate genes (CGs) with TGW and kernel size traits in a European winter wheat panel. Favorable alleles of nine CGs were present in very low (<5%) or very high (>95%) frequencies in the panel and hence were unsuitable for analysis by CG association approach. Only TaCwi-A1 showed association with TGW; the reported favorable allele TaCwi-A1a (Ma et al., 2012) was associated with increased TGW but decreased GY thus pointing to compensatory effects of the TaCwi-A1 gene on TGW and GY (Mohler et al., 2016). Similarly, varying effects of observed haplotypes for the gene TaGW2 have been reported. Su et al. (2011) and Han et al. (2011)suggested the haplotype Hap-6A-A to be favorable for selection of genotypes with higher grain width and TGW, while Zhang et al. (2013) reported the haplotype Hap-6A-G as the favorable allele. Inconclusive results were also obtained in a recent study on CIMMYT germplasm, where Hap-6A-G was found superior in Mexico and the second allele, i.e., Hap-6A-A, in Nepal and India (Sukumaran et al., 2018). These examples suggest that besides compensatory effects on GY, the genetic effects of the reported CGs can be environment- and/or germplasm-dependent. Hence, validation of their effects in diverse germplasm is a prerequisite for their effective deployment in MAS. To accomplish this, we investigated allelic effects of 14 CGs on TGW in two independent and existing panels at CIMMYT; both panels represent CIMMYT wheat germplasm and have different compositions and diversities. One of the panels is an elite yield trial (EYT) composed of spring wheat lines that formed the entries of multi-environment trials during the 2015–2016 growing season. The other panel is the well-known WAMI (Wheat Association Mapping Initiative) that includes historical cultivars and synthetic-derived lines (Lopes et al., 2012; Lopes et al., 2015).
The specific objectives of this study were 1) to test the frequencies of alleles of 14 CGs in two independent panels of wheat, 2) to explore the allele effect of each CG on TGW and GY by implementing CG-based association study, and 3) to identify additional genetic determinants of TGW using a haplotype-based genome wide association study (GWAS) in five recent CIMMYT EYTs.
Materials and Methods
Plant Materials and Phenotyping
For CG-based association study, two bread wheat datasets, EYT2015-16 and the WAMI, were used. The EYT2015-16 dataset comprised 829 spring bread wheat breeding lines, which formed the entries of multi-environment trials during the 2015–2016 growing season. The WAMI panel is genetically a more diverse collection and comprised 294 lines with a reduced range of variation in phenology and height (for details on WAMI, refer to Lopes et al., 2012). For haplotype-based GWAS, 4,302 lines that formed the EYT2011-12, EYT2012-13, EYT2013-14, EYT2014-15, and EYT2015-16 and comprised 643, 905, 983, 942, and 829 lines, respectively, were used (Supplementary Table S1).
All EYTs were phenotyped at the Norman E. Borlaug Experimental Research Station (CENEB) in Ciudad Obregon, Mexico, during the respective growing seasons. For each EYT, five environments, including optimum and stress environments, were induced at the same site by modulating planting date, field management, and irrigation. The experimental design was an alpha lattice design with three replications. Each experiment included 28 lines and 2 checks. Small size units (30 plots with 28 entries and 2 checks) are used to minimize the field variation, which simplifies selection. TGW was measured in one of the five environments, optimally irrigated as single replications using a Seed Counter Contador (http://www.pfeuffer.com/contador.html?&L=5). Unbroken seeds were put in the Seed Counter and 1,000 seeds were counted by the counter and weighed to reveal TGW in grams. The counted seeds were put in an oven for 48 h at 72°C and TGW was reweighed to get moisture estimates. Adjusted means were calculated as described in Sehgal et al. (2017).
The WAMI population with 294 lines was evaluated for 3 years (2010, 2011, and 2013) under optimal management at CENEB. The trials were timely sown with full irrigation applied through gravity fled-irrigation. The experimental design was an alpha lattice with two replications. Blocks were arranged based on the heading date of the materials. TGW on two replications was measured as described in the manual “Physiological breeding II: a field guide to wheat phenotyping” (Pask et al., 2012). In brief, a random sample of whole grains was cleaned carefully to remove all broken and aborted grains before putting in counting machine. Two hundred grains were counted, re-dried, and weighed (DW_200_G). TGW was then calculated by multiplying DW_200_G by five. In each case, two replicates were measured but if the values of the two replicates differ by more than 10%, then a third sample was taken. Best Unbiased Linear Predictors (BLUPs) were calculated using META-R (Vargas et al., 2013) and were used for analysis.
Gene-Based and Genome-Wide Genotyping
For the study, we selected 14 CGs that have a direct influence on TGW rather than on grain morphometric traits. Genotyping with the selected CGs was performed using published Kompetitive allele-specific PCR (KASP) assays detailed in CerealsDB (http://www.cerealsdb.uk.net/cerealgenomics/CerealsDB/kasp_download.php?URL=) and in Rasheed et al. (2016). KASP assays for genes TaCwi-A1 (2A) and TaGW2 (6A) were designed in-house. Table S2 enlists each of the 14 CGs and the corresponding SNP markers, and Table S3 enlists the details of KASP primers designed in-house. For whole genome genotyping, genotyping-by-sequencing (GBS) was applied as described in Rutkoski et al. (2016). Briefly, the genotypes were sequenced at 192-plexing on Illumina HiSeq2000 with 1 × 100 bp reads. SNPs were called across all lines using the TASSEL GBS pipeline as described in Glaubitz et al. (2014) and anchored to the genome assembly of Chinese Spring (International Wheat Genome Sequencing Consortium, 2018).
Haplotype Block Construction
GBS data of all 4,302 lines were considered together to construct haplotype blocks. Markers with more than 40% missing data were excluded from haplotype construction. Since the haplotype blocks were created on all lines together, a MAF ≥ 0.15 was applied so that in each nursery a MAF ≥0.05 could be achieved. From an initial set of 20,794 SNP markers obtained on the 4,302 samples, a set of 8,443 filtered SNPs was used for haplotype block construction. The blocks were generated based on the linkage disequilibrium (LD) parameter D’ using the R script, but modified as described in Gabriel et al. (2002). We calculated D’ 95% confidence intervals between SNPs and each comparison was categorized as “strong LD,” “inconclusive,” or “strong recombination.” A haplotype block was created if 95% of the comparisons in one block were in “strong LD.” For two or more SNPs to be classified in “strong LD,” the minimum lower and upper confidence interval values were set to 0.70 and 0.98, respectively. The haplotype blocks were named as combinations of the prefix “Hap” for the haplotype block followed by chromosome, and then a number, which represents incrementing number of the haplotype blocks along the chromosome.
CG and Haplotype-Based GWAS
Population structure was investigated in the five EYTs by principal component analysis (PCA) using the function PRCOMP from the STATS package in R. For the WAMI panel, reported subpopulation structure from Lopes et al. (2015), based on the 1B.1R translocation, was utilized for the analysis. The kinship matrix was calculated by the VanRaden algorithm (VanRaden, 2008) in the GAPIT package (Lipka et al., 2012). For association analysis, a mixed linear model (MLM) was applied with PCA as a fixed variate and kinship as random. Bayesian information criterion (BIC; Schwarz, 1978) was used to assess the appropriate number of principal components for analysis, i.e., PC with the largest BIC value was used as a fixed variate. Candidate gene-based association mapping was conducted in TASSEL v4.0 and haplotype-based GWAS was conducted in Plink v1.07 (Purcell et al., 2007). The allelic effect of the CGs and associated haplotypes was estimated as the difference between the mean value of lines with and without favorable allele and was presented as box plots.
Epistatic Interactions, Stepwise Regression and In Silico Analysis
Two- and three-locus interactions were studied using an in-house script executed in R as described in Sehgal et al. (2017). Forward stepwise regression was conducted using the LEAPS package in R. In silico analysis of the significant loci was conducted in Ensembl Plants software and database system (https://plants.ensembl.org/Triticum_aestivum/Info/Index).
Phenotypic Variation, CG Allele Frequencies and Association With TGW in EYT2015-16 and WAMI Datasets
Figure S1 shows the distribution of TGW in the EYT and WAMI panels. TGW ranged from 33.5 to 57.8 g, 39.2 to 61.3 g, 36.2 to 57.0 g, 34.5 to 57.6 g, and 34.9 to 55.5 g in EYT2011-12, EYT2012-13, EYT2013-14, EYT2014-15, and EYT2015-16, respectively. In the WAMI panel, TGW was, in general, lower than in the EYTs and ranged from 33.8 to 54.7 g, 32.2 to 54.8 g, and 31.7 to 52.8 g in year 2010, 2011, and 2013, respectively. Pearson’s correlations of TGW with GY varied across EYTs (years) and induced environments between negative values (-0.18, P < 0.000) and positive values (0.39, P < 0.000) (Supplementary Table S4).
The frequencies of the 14 CGs in the two wheat datasets are shown in Table 1. The favorable alleles of the three genes, TaSus2-2B, TaCKX6-D1, and TaGASR7-A1, were absent or had very low frequencies (<5%) in both datasets. The favorable allele of the gene TaCwi-A1 had low frequency in EYT2015-16 (0.6%) while that of TaTPP-6AL gene was present in 96.5% of lines. The favorable alleles of the remaining 10 genes were present in moderate to high frequencies in EYT2015-16 (19.5% to 85.2%). In WAMI, the favorable allele of the gene TaCwi-5D was present in 99% of lines while those of the remaining genes were in low (6.4%) to high (87.7%) frequencies. Since both alleles of the genes TaGW2-6A and TaSus1-7A are reported to be favorable, we considered both allele frequencies in our datasets. The haplotype Hap-A-A of TaGW2-6A was moderately frequent (30% to 32%), whereas Hap-A-G was highly frequent (63% to 66%) in both datasets. Similarly, for TaSus1-7A, haplotype Hap 1 was highly frequent in both datasets (62.9% to 71.9%), while haplotype Hap 2 was moderately frequent (21.9% to 32%).
Table 1 Frequencies in percent of reported favorable alleles in 14 candidate genes in two CIMMYT datasets.
The CGs were initially mapped on chromosomes by blind association analysis, where the marker score was used as a phenotype (described in Lopes et al., 2015), in order to confirm the chromosome positions of the CGs. We obtained peaks at the respective chromosome except for TaTGW6. The gene was reported on chromosomes 3A and 4A in two different publications (Hanif et al., 2016; Hu et al., 2016). Blind association analysis mapped TaTGW6 on chromosome 3A (Figure S2). In the EYT2015-16 dataset, only one gene (TaGs3-D1) showed significant association with TGW, while in WAMI, two genes (TaTGW6 and TaSus1) showed associations (Table 2). The reported favorable alleles of the two genes, TaGs3-D1a (Zhang et al., 2014) and TaTGW6_A1a (Hanif et al., 2016), decreased TGW in our sets (Figure 1A). When their effects on GY were investigated, we found that in EYT2015-16, TaGs3-D1a increased GY, and in WAMI, TaTGW6_A1a decreased GY in 2010 and had no effect on GY in 2011 and 2013 (Figure 1B). Both alleles of TaSus1-7A are known to be favorable (Hou et al., 2014). In WAMI, Hap 2 was favorable increasing TGW and GY across the 3 years.
Figure 1 Allelic effects of two alleles of the associated genes TaGS3-D1, TaTGW6-A1, and TaSus-1 in EYT2015-16 and WAMI, respectively, on TGW (A) and GY (B). The reported favorable alleles in publications are underlined.
Haplotype Variation and Haplotype-Based GWAS for TGW in EYTs
Genome-wide, 501 haplotype blocks were constructed with a range from two to nine SNPs per block (Supplementary Table S5). The A, B, and D sub-genomes showed 197, 268, and 36 haplotype blocks and 506, 656, and 78 haplotypes, respectively.
PCA in each of the five EYTs identified a moderate population structure in all panels: two to three groups (Supplementary Figure S3). Haplotype-based GWAS identified 15 haplotype blocks significantly associated with TGW. Four of them were associated with TGW in at least three EYTs and had large effects. These were classified as major associations (Table 3, Figure 2 and 3). Effects on GY of the four major TGW loci were investigated under multiple environments to investigate locus-specific tradeoffs between the two traits (Table 3). Two of the haplotype blocks were identified on chromosome 4A, Hap-4A-14 and Hap-4A-19. Hap-4A-14 had three haplotypes, CC, CT, and TT, of which TT was the favorable haplotype resulting in an increase of TGW by 1.39 to 2.74 g (Table 3). The GY advantage with this haplotype ranged from 0.9% to 6.7% across eight environments. The second block on chromosome 4A (Hap-4A-19) had two haplotypes: CA and TG. The favorable haplotype CA resulted in an increase of TGW by 1.22 to 2.40 g and GY advantage in five environments of 0.8% to 5.7%. The haplotype block identified on chromosome 6A (Hap-6A-13) showed three haplotypes, CA, CG, and TG, of which TG resulted in a 1.91 to 2.58 g increase in TGW and 1.4% to 9.4% yield advantage across seven environments. Hap-7A-15 on chromosome 7A revealed two haplotypes, CG and TC, of which CG was favorable with advantages in TGW and GY of 0.70 to 2.04 g and 0.5% to 2.4%, respectively. The frequencies of the favorable haplotypes at the four major TGW loci across EYTs are shown in Figure 4. The frequency of favorable haplotype Hap-4A-14-TT was almost stable across EYTs. Of the remaining three, Hap-4A-19-CA showed an increase while Hap-6A-13-TG and Hap-7A-15-CG showed a decreasing trend in frequency. Currently, we are designing KASP (Kompetitive Allele Specific PCR) assays for these four major loci for their validation.
Table 3 Four major TGW loci and their effects on GY across multiple environments and across EYTs. The favorable allele is shown in bold and is italicized.
Figure 2 Manhattan plot created by combining haplotype blocks combined with single marker data. The haplotype blocks on 4A and 6A chromosomes had higher P value than their corresponding single SNPs (shown here) while haplotype block on 7A did not show any difference and not shown.
Figure 3 Allelic effects of haplotype in blocks Hap-4A-14 (A–C), Hap-4A-19 (D–F), Hap-6A-13 (G–I), and Hap-7A-15 (J–L) on TGW across three EYTs.
Eleven minor TGW loci were identified on chromosomes 1B, 2A, 2B, 3A, 3B, 5D, 6A, and 6B (Supplementary Table S6). These were classified as minor because they were identified in only one or two EYTs and had small effects on TGW (less than 1.0 g increase in TGW). Squared correlation coefficient (r2), measure of LD, between the CGs and the associated haplotypes (Table 4) was very low varying from 0.004 to 0.183.
Table 4 Squared correlation coefficient between the identified TGW loci and candidate genes investigated in the present study.
Epistatic Interactions Among TGW Loci and Stepwise Regression of Major Loci
Of the four major loci, three (Hap-4A-14, Hap-4A-19, and Hap-7A-15) were significantly involved in two or three locus interactions among themselves or with minor loci (Supplementary Table S7, Figure S4). The Hap-7A-15 was the main epistatic locus interacting with other identified loci contributing to additional variation of 4.4% to 11.6%.
Figure 4 Allele frequencies (number shown in the center of the bars) of the four favorable haplotypes in five EYTs. The dotted line shows the trend observed in frequency of each of the favorable haplotype from one year to the other.
All four major loci were regressed on TGW in each individual EYT to identify the best combination, which would lead to increased TGW. A general pattern that emerged is summarized in Figure 5A. The highest percentage variation was contributed by a combination of the three haplotype blocks, Hap-6A-13, Hap-7A-15, and Hap-4A-14. Addition of the fourth locus, i.e., Hap-4A-19, decreased percentage variation. The three-locus combination resulted in a 3% to 16% and 0.7% to 8.2% increase in TGW and GY, respectively, in the EYTs (Figure 5B).
Figure 5 Stepwise regression of four major TGW loci (A) and percent increase in TGW and GY obtained in each EYT by combining the three best combination identified from stepwise regression (B).
The allelic effects of CGs reported to increase TGW and/or grain morphometric traits have not been investigated comprehensively in diverse germplasm panels, thereby having limited impact in breeding. For instance, allele effects of CGs TaGASR7-A, TaGW2, TaGS3-D1, TaCwi (Dong et al., 2014; Qin et al., 2014; Zhang et al., 2014; Jiang et al., 2015), and recently TaSnRK2.3 (Miao et al., 2017) have been examined exclusively in Chinese wheat. Zanke et al. (2015) and Mohler et al. (2016) tested the effects of 9 and 13 CGs on kernel traits in European winter wheat varieties. Both studies reported association of one or two genes with TGW in their datasets (TaGW2-6A in the former study and TaSus-1-7A and TaCwi-A1 in the latter).
We investigated 14 CGs in two independent CIMMYT wheat datasets in order to validate their allele effects on TGW and to verify their suitability for MAS. We obtained moderate to high frequencies of the alleles/haplotypes for most genes except TaSus2-2B, TaCKX6-D1, and TaGASR7-A1. In congruence with Zanke et al. (2015) and Mohler et al. (2016), we obtained association of only one and two genes with TGW (TaGs3-D1 in EYT2015-16 and TaTGW6 and TaSus1-7A in WAMI). The reported favorable alleles of the two genes, TaGs3-D1 and TaTGW6 (Zhang et al., 2014; Hanif et al., 2016), decreased TGW in CIMMYT germplasm. Additionally, opposite and quite variable effects of the alleles on GY were observed. These results reinforce the previous findings that the currently known allelic effects of reported CGs related to TGW vary with genetic background and/or environment (Sukumaran et al., 2018), and it will be challenging to use the associated SNPs for MAS across individual breeding programs. Both favorable alleles were frequent in the two datasets, hence were not necessarily selected against by breeders. The opposing effect of TaGs3-D1a allele on TGW and GY in EYT2015-16 was similar to that reported by Mohler et al. (2016) for the TaCwi-A1a allele in a European winter wheat panel.
Some of CGs investigated have known function and have proven beneficial in Chinese wheat germplasm such as sucrose synthase 2 orthologous gene (TaSus2), putative cytokinin oxidase genes (TaCKX2.1 and TaCKX2.2), cell wall invertase gene (TaCwi), etc. Cell wall invertase (CWI) is one of the key enzymes involved in establishing sink strength in various sink tissues. In maize, lack of expression of the endosperm-specific cell wall-invertase gene Incw2 in the pedicel and endosperm caused interruption of photoassimilate transport into developing kernels, resulting in one-fifth of the normal seed weight (Miller and Chourey, 1992; Cheng et al., 1996; Kang et al., 2009). We investigated three wheat CWI genes reported on chromosomes 2A, 4A, and 5D, but none of them showed association with TGW in our panels. A similar case was observed with the sucrose synthase genes, playing an important role in the conversion of sucrose to starch, and the genes coding transcription and growth regulators, i.e., TaTPP-6AL (an important growth regulator involved in starch accumulation). On chromosome 6A, one of the first cloned grain width genes, TaGW2, showed effects with up to 3 g on TGW in Chinese wheat (Su et al., 2011). TaGW2 is an orthologue of the OsGW2 gene in rice that influences grain width and weight, and two haplotypes, Hap-6A-A and Hap-6A-G, have so far been reported (Su et al., 2011). Both haplotypes have been observed to be favorable (Su et al., 2011; Zhang et al., 2013). In the present study, although both alleles were present in moderate (Hap-6A-A) to high (Hap-6A-G) frequency, none showed association with TGW. The homeolog of TaGW2 on 6B (TaGW2-6B) has been reported to have a stronger influence on TGW than TaGW2-6A (Qin et al., 2014), but none of the alleles of TaGW2-6B showed significant association with TGW in the present study. Thus, in the CIMMYT wheat breeding program it will be difficult to apply the investigated gene-based markers in MAS unless re-tested in specific populations to identify beneficial genetic backgrounds. The reported favorable alleles have mainly been identified from SNP(s) or INDELs observed in promotors or intron sequences of genes in a narrow set of Chinese wheat lines. For example, published haplotypes (Hap-6A-A and Hap-6A-G)-based markers for the grain width gene TaGW2-6A are based on 593A/G SNP out of 8 SNPs identified in 2.9 kb sequenced promotor region in 34 Chinese wheat accessions (Su et al., 2011). Zanke et al. (2015) reported a novel allele for the TaGs3-D1 gene, named as TaGs3-D1c, in European winter wheat, which was absent in Chinese wheat. The existence of additional alleles yet unobserved for the CGs, together with significant genotype × genotype and/or genotype × environment effects, might be likely explanation for the varying results among studies. Allele mining of promotors and intron regions of the CGs in diverse set of lines should be a priority research area to identify novel alleles associated with TGW.
In order to explore the genetic determinants of TGW in CIMMYT germplasm, we adopted a haplotype-based GWAS approach (N’ Diaye et al., 2017; Singh et al., 2018) and identified four stable loci across EYTs. On chromosome 4A, the two identified loci, Hap-4A-14 and Hap-4A-19, were neither in LD (r2 = 0.02) with each other nor with the gene TaCwi-4A. The two loci identified on chromosome 4A are therefore independent and likely novel. The major locus on chromosome 6A (Hap-6A-13) had the largest effect on both TGW and GY with increases of up to 2.60 g and 258 kg/ha in TGW and GY, respectively. Hap-6A-13 was not in LD with TaGW2-6A and is located on 6AL based on BLAST analysis (https://plants.ensembl.org/Triticum_aestivum/Info/Index), while TaGW2-6A is mapped on 6AS near the centromere (Su et al., 2011). Sukumaran et al. (2018) identified a locus on 6A in the WAMI panel having positive association for GY and TGW with allelic substitution effects of 1.7% and 4% on TGW and GY, respectively. In the present study, the favorable allele at the 6A locus (“TG”) contributed to allelic effects of up to 5.6% and 9.4% on TGW and GY, respectively. The LD between 6A locus in WAMI and Hap-6A-13, on an average, is 0.05, suggesting these two as different loci.
Three TGW genes (TaAGP-S1, TaTGW, and TaSus-1) are known on chromosome 7A in wheat (Hou et al., 2014; Hu et al., 2016; Hou et al., 2017). Of these, TaSus-1 was investigated in the current research and it showed association with TGW in the WAMI panel. The r2 values between Hap-7A-15 and TaSus1-7A was 0.004, which eliminates the possibility of Hap-7A-15 being TaSus1-7A. BLAST analysis showed that TaTGW and TaAGP-S1 are ∼125 and ∼262 Mb upstream of Hap-7A-15, respectively, which also precludes the possibility of Hap-7A-15 as one of these genes. Hap-7A-15 was in fact located in the middle of two genes within 1 Mb region; TraesCS7A02G124500 is ∼0.9 Mb downstream and TraesCS7A02G124600 is ∼0.1 Mb upstream. TraesCS7A02G124500 is implicated in transferase activity (transferring acyl groups other than amino-acyl groups) and TraesCS7A02G124600 in serine-type endopeptidase activity. In model crop rice, role of serine-type peptidase encoding genes in determining grain size and grain filling has been demonstrated (Li et al., 2011; Li et al., 2016). For example, OsGS5 (encoding a putative serine carboxypeptidase) was identified to promote cell division by regulating cell cycle genes resulting in large grain size by an increased cell number (Li et al., 2011). Hence, we speculate that Hap-7A-15 could be linked to such cell cycle-related genes.
TGW and grain number (GN) are the two primary determinants of GY in wheat. While the association between GY and GN is generally positive, the positive association of GY with TGW is less profound and sometimes even negative as was also observed in the present study (Supplementary Table S4). Hence, identification of QTL that increases TGW without decreasing GY or alternatively QTL displaying a low trade-off between the two traits could be beneficial for improving wheat GY. Many previous studies have reported stable QTLs for TGW on chromosomes 2B, 2D, 4B, 6A, and 7D using bi-parental populations (Simmonds et al., 2014; Lopes et al., 2013; Yan et al., 2017; Tahmasebi et al., 2017) and on 2B, 4A, 4B, and 6A using GWAS (Guan et al., 2018; Sukumaran et al., 2018; Mangini et al., 2018). However, compensatory effect of TGW enhancing QTL on GY has not been addressed much (Zhai et al., 2018; Guan et al., 2018; Sukumaran et al., 2018). The positive allele effects of all four identified loci on TGW and GY in the current research brings in opportunities to generate new combination of TGW alleles, which might increase overall GY when introgressed in lines with high GN. It is noteworthy that contributions of these loci to GY became even more important at higher levels of stress (e.g., mild drought, severe drought, or heat stress) (Table 3). Forward stepwise regression identified a superior combination of three haplotype blocks conferring high TGW and GY in EYTs (3% to 16% and 0.7% to 8.2%, respectively).
The significance of epistasis in the genetic architecture of TGW has been investigated in wheat using bi-parental populations (Wu et al., 2012; Lv et al., 2014; Wei et al., 2014). Using GWAS, such studies are rare (Sehgal et al., 2016). Unravelling QTL with epistatic effects using GWAS will have more potential compared to linkage mapping because fixation of QTL alleles will be less likely in a naturally segregating population, and the power of GWAS will be considerably larger (Sehgal et al., 2016, Sehgal et al., 2017). This was reflected in the current study, where the cumulative contribution from epistatic effects was higher than that from main effects (Table S4). The locus Hap-7A-15 on chromosome 7A interacted with other major and minor loci on chromosomes 3A, 3B, 4A, 6A, and 6B in two- and three-locus interactions contributing to up to 8.9% and 11.6% variation, respectively. Bi-parental designs, on the other hand, revealed variation of 1.8% to 3.8% (Lv et al., 2014; Wei et al., 2014).
The findings of the study provide a useful foundation for more detailed investigations on genetic background dependence and environment sensitivity of the known CGs for TGW. The current available gene-based markers are hardly deployable in CIMMYT wheat germplasm, unless re-tested in specific genetic backgrounds. The direct use of gene-based markers reported in the literature is therefore in general questionable especially for more complex trait. More and better marker validation is required, which is currently lacking and has not received adequate attention. We have identified four stable and novel TGW loci in CIMMYT wheat germplasm. The positive allelic effects of the identified loci on GY and TGW make them attractive targets for MAS at CIMMYT.
Data Availability Statement
All datasets for this study are included in the manuscript and the Supplementary Files.
DS and SD conceived the manuscript. SD and CF generated candidate genes-based marker data. DS analyzed the data and wrote the manuscript. SM, RS, and CG generated phenotypic data. GGB validated KASP assays designed for major loci. All authors reviewed the manuscript.
The work was supported by funding from CRP WHEAT and the Delivering Genetic Gains in Wheat (DGGW) project (OPPGD1389) funded by the Bill & Melinda Gates Foundation (BMGF), the UK Department for International Development (DFID), and the US Agency for International Development (USAID) Feed the Future Innovation Lab for Applied Wheat Genomics (Cooperative Agreement No. AID-OAA-A-13-00051).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2019.01189/full#supplementary-material
Chang, J. J., Zhang, X., Mao, A., Li, J., Jia, J., Jing, R. (2013). Polymorphism of TaSAP1-A1 and its association with agronomic traits in wheat. Planta 237 (6), 1495–1508. doi: 10.1007/s00425-013-1860-x
Cheng, W. H., Taliercio, E. W., Chourey, P. S. (1996). The Miniature1 seed locus of maize encodes a cell wall invertase required for normal development of endosperm and maternal cells in the pedicel. Plant Cell, (6), 971–983. doi: 10.1105/tpc.8.6.971
Cooper, J. K., Ibrahim, A. M. H., Rudd, J., Malla, S., Hays, D. B., Baker, J. (2012). Increasing hard winter wheat yield potential via synthetic wheat: I. Path-coefficient analysis of yield and its components. Crop Sci. 201252, 2014–2022. doi: 10.2135/cropsci2011.07.0382
Dong, L., Wang, F., Liu, T., Dong, Z., Li, A., Jing, R., et al. (2014). Natural variation of TaGASR7-A1 affects grain length in common wheat under multiple cultivation conditions. Mol. Breed. 34, 937–947. doi: 10.1007/s11032-014-0087-2
Gabriel, S. B., Schaffner, S. F., Nguyen, H., Moore, J. M., Roy, J., Blumenstiel, B., et al. (2002). The structure of haplotype blocks in the human genome. Science 296, 2225–2229. doi: 10.1126/science.1069424
Glaubitz, J. C., Casstevens, T. M., Lu, F., Harriman, J., Elshire, R. J., Sun, Q., et al. (2014). TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline. PLoS One 9, e90346. doi: 10.1371/journal.pone.0090346
Guan, P., Lu, L., Jia, L., Kabir, M. R., Zhang, J., Lan, T., et al. (2018). Global QTL analysis identifies genomic regions on chromosomes 4A and 4B harboring stable loci for yield-related traits across different environments in wheat (Triticum aestivum L.). Front. Plant Sci 9, 529. doi: 10.3389/fpls.2018.00529
Guo, Y., Sun, J. J., Zhang, G. Z., Wang, Y. Y., Kong, F. M., Zhao, Y., et al. (2013). Haplotype, molecular marker and phenotype effects associated with mineral nutrient and grain size traits of TaGS1 a in wheat. Field Crops Res. 154, 119–125. doi: 10.1016/j.fcr.2013.07.012
Han, L., Yang, F., Xia, X., Yan, J., Zhang, Y., Qu, Y., et al. (2011). Distribution of genes related to plant height, kernel weight and disease resistance in wheat cultivars from major countries in the world. J. Triticeae Crops, 31 (5), 824–831.
Hanif, M., Gao, F. M., Liu, J. D., Wen, W. E., Zhang, Y. J., Rasheed, A., et al. (2016). TaTGW6-A1, an ortholog of rice TGW6, is associated with grain weight and yield in bread wheat. Mol. Breed. 36, 1. doi: 10.1007/s11032-015-0425-z
Hou, J., Jiang, Q., Hao, C., Wang, Y., Zhang, H., Zhang, X. (2014). Global selection on sucrose synthase haplotypes during a century of wheat breeding. Plant Physiol. 164, 1918–1929. doi: 10.1104/pp.113.232454
Hou, J., Li, T., Wang, Y., Hao, C., Liu, H., Zhang, X. (2017). ADP-glucose pyrophosphorylase genes, associated with kernel weight, underwent selection during wheat domestication and breeding. Plant Biotechnol. J. 15, 1533–1543. doi: 10.1111/pbi.12735
Hu, M. J., Zhang, H. P., Cao, J. J., Zhu, X. F., Wang, S. X., Jiang, H., et al. (2016). Characterization of an IAAglucose hydrolase gene TaTGW6 associated with grain weight in common wheat (Triticum aestivum L.). Mol. Breed. 36, 1–11. doi: 10.1007/s11032-016-0449-z
International Wheat Genome Sequencing Consortium. (2018). Shifting the limits in wheat research and breeding using a fully annotated reference genome by the international wheat genome sequencing consortium (IWGSC). Sci. 361 (6403), eaar7191. doi. 10.1126/science.aar7191
Jiang, Q., Hou, J., Hao, C., Wang, L., Ge, H., Dong, Y., et al. (2011). The wheat (T. aestivum) sucrose synthase 2 gene (TaSus2) active in endosperm development is associated with yield traits. Funct. Integr. Genom. 11, 49–61. doi: 10.1007/s10142-010-0188-x
Jiang, Y. M., Jiang, Q. Y., Hao, C. Y., Hou, J., Wang, L. F., Zhang, H. N., et al. (2015). A yield-associated gene TaCWI, in wheat: its function, selection and evolution in global breeding revealed by haplotype analysis. Theor. Appl. Genet. 128, 131–143. doi: 10.1007/s00122-014-2417-5
Kang, B. H., Xiong, Y., Williams, D. S., Pozueta-Romero, D., Chourey, P. S. (2009). Miniature1-encoded cell wall invertase is essential for assembly and function of wall-in-growth in the maize endosperm transfer cell. Plant Physiol. 151 (3), 1366–1376. doi: 10.1104/pp.109.142331
Li, Y., Fan, C., Xing, Y., Jiang, Y., Luo, L., Sun, L., et al. (2011). Natural variation in GS5 plays an important role in regulating grain size and yield in rice. Nat. Genet. 43, 1266–1269. doi: 10.1038/ng.977
Li, Z., Tang, L., Qiu, J., Zhang, W., Wang, Y., Tong, X., et al. (2016). Serine carboxypeptidase 46 regulates grain filling and seed germination in rice (Oryza sativa L.). PLoS One 11 (7), e0159737. doi: 10.1371/journal.pone.0159737
Lopes, M. S., Dreisigacker, S., Peña, R. J., Sukumaran, S., Reynolds, M. P. (2015). Genetic characterization of the wheat association mapping initiative (WAMI) panel for dissection of complex traits in spring wheat. Theor. Appl. Genet. 128, 453–464. doi: 10.1007/s00122-014-2444-2
Lopes, M. S., Reynolds, M. P., Jalal-Kamali, M. R., Moussa, M., Feltaous, Y., Tahir, S. A., et al. (2012). The yield correlations of selectable physiological traits in populations of advanced spring wheat lines grown in warm and drought environments. Field Crops Res. 128, 129–136. doi: 10.1016/j.fcr.2011.12.017
Lopes, M. S., Reynolds, M. P., McIntyre, C. L., Mathews, K. L., Kamali, M. J., Mossad, M., et al. (2013). QTL for yield and associated traits in the Seri/Babax population grown across several environments in Mexico, in the West Asia, North Africa, and South Asia regions. Theor. Appl. Genet. 126 (4), 971–984. doi: 10.1007/s00122-012-2030-4
Lv, C., Song, Y., Gao, L., Yao, Q., Zhou, R., Xu, R., et al. (2014). Integration of QTL detection and marker assisted selection for improving resistance to fusarium head blight and important agronomic traits in wheat. Crop J. 2, 70–78. doi: 10.1016/j.cj.2013.10.004
Ma, D. Y., Yan, J., He, Z. H., Wu, L., Xia, X. C. (2012). Characterization of a cell wall invertase gene TaCwi-A1 on common wheat chromosome 2A and development of functional markers. Mol. Breed. 29, 43–52. doi: 10.1007/s11032-010-9524-z
Ma, L., Li, T., Hao, C. Y., Wang, Y. Q., Chen, X. H., Zhang, X. Y. (2016). TaGS5-3A, a grain size gene selected during wheat improvement for larger kernel and yield. Plant Biotechnol. J. 14, 1269–1280. doi: 10.1111/pbi.12492
Mangini, G., Gadaleta, A., Colasuonno, P., Marcotuli, I., Signorile, A. M., Simeone, R., et al. (2018). Genetic dissection of the relationships between grain yield components by genome-wide association mapping in a collection of tetraploid wheats. PloS One. 13 (1), e0190162. doi: 10.1371/journal.pone.0190162
Miao, L., Mao, X., Wang, J., Liu, Z., Zhang, B., Li, W., et al. (2017). Elite haplotypes of a protein kinase gene TaSnRK2.3 associated with important agronomic traits in common wheat. Front. Plant Sci. 8, 368. doi: 10.3389/fpls.2017.00368
Miller, M. E., Chourey, P. S. (1992). The maize invertase-deficient miniature-1 seed mutation is associated with aberrant pedicel and endosperm development. Plant Cell. 4 (3), 297–305. doi: 10.1105/tpc.4.3.297
Mohler, V., Albrecht, T., Castell, A., Diethelm, M., Schweizer, G., Hartl, L. (2016). Considering causal genes in the genetic dissection of kernel traits in common wheat. J. Appl. Genet. 57 (4), 467–476. doi: 10.1007/s13353-016-0349-2
N’ Diaye, A., Haile, J. K., Cory, A. T., Clark, F. R., Clark, J. M., Knox, R. E., et al. (2017). Single marker and haplotype-based association analysis of semolina and pasta colour in elite durum wheat breeding lines using a high-density consensus map. PLoS One 12 (1), e0170941. doi: 10.1371/journal.pone.0170941
Purcell, S., Neale, B., Brown, K. T., Thomas, L., Ferreira, M. A. R., Bender, D., et al. (2007). PLINK: a toolset for whole-genome association and population-based linkage analysis. Am. J. Hum. Genet. 81 (3), 559–575. doi: 10.1086/519795
Qin, L., Hao, C., Hou, J., Wang, Y., Li, T., Wang, L., et al. (2014). Homologous haplotypes, expression, genetic effects and geographic distribution of the yield gene TaGW2. BMC Plant Biol. 14, 107. doi: 10.1186/1471-2229-14-107
Rasheed, A., Wen, W., Gao, F., Zhai, S., Jin, H., Liu, J., et al. (2016). Development and validation of KASP assays for genes underpinning key economic traits in bread wheat. Theor. Appl. Genet. 129, 1843–1860. doi: 10.1007/s00122-016-2743-x
Rutkoski, J., Poland, J., Mondal, S., Autrique, E., Perez, G. L., Crossa, J., et al. (2016). Canopy temperature and vegetation indices from high-throughput phenotyping improve accuracy of pedigree and genomic selection for grain yield in wheat. Genes Genomes Genet. 6, 2799–2808. doi: 10.1534/g3.116.032888
Sehgal, D., Autrique, E., Singh, R., Ellis, M., Singh, S., Dreisigacker, S. (2017). Identification of genomic regions for grain yield and yield stability and their epistatic interactions. Sci. Rep. 7, 41578. doi: 10.1038/srep41578
Sehgal, D., Dreisigacker, S., Belen, S., Küçüközdemir, Ü., Mert, Z., Özer, E., et al. (2016). Mining centuries old in situ conserved Turkish wheat landraces for grain yield and stripe rust resistance genes. Front. Genet. 7, 201. doi: 10.3389/fgene.2016.00201
Simmonds, J., Scott, P., Brinton, J., Mestre, T. C., Bush, M., Del, B. A., et al. (2016). A splice acceptor site mutation in TaGW2-A1 increases thousand-grain weight in tetraploid and hexaploid heat through wider and longer grains. Theor. Appl. Genet. 129, 1099–1112. doi: 10.1007/s00122-016-2686-2
Singh, S., Vikram, P., Sehgal, D., Burguano, J., Sharma, A., Singh, S. K., et al. (2018). Harnessing genetic potential of wheat germplasm banks through impact-oriented-prebreeding for future food and nutritional security. Nat. Sci. Rep. 8, 12527. doi: 10.1038/s41598-018-30667-4
Su, Z. Q., Hao, C. Y., Wang, L. F., Dong, Y. C., Zhang, X. Y. (2011). Identification and development of a functional marker of TaGW2 associated with grain weight in bread wheat (Triticum aestivum L.). Theor. Appl. Genet. 122, 211–223. doi: 10.1007/s00122-010-1437-z
Sukumaran, S., Lopes, M., Dreisigacker, S., Reynolds, M. (2018). Genetic analysis of multi-environmental spring wheat trials identifies genomic regions for locus-specific trade-offs for grain weight and grain number. Theor. Appl. Genet. 131 (4), 985–998. doi: 10.1007/s00122-017-3037-7
Tahmasebi, S., Heidari, B., Pakniyat, H., McIntyre, C. L. (2017). Mapping QTLs associated with agronomic and physiological traits under terminal drought and heat stress conditions in wheat (Triticum aestivum L.). Genome 60 (1), 26–45. doi: 10.1139/gen-2016-0017
Vargas, M., Combs, E., Alvarado, G., Atlin, G., Mathews, K., Crossa, J. (2013). Meta: a suite of SAS programs to analyze multienvironment breeding trials. Agron. J. 105 (1), 11–19. doi: 10.2134/agronj2012.0016
Wang, S., Yan, X., Wang, Y., Liu, H., Cui, D., Chen, F. (2016). Haplotypes of the TaGS5-A1 gene are associated with thousand-kernel weight in Chinese bread wheat. Front. Plant Sci. 7, 783. doi: 10.3389/fpls.2016.00783
Xiao, Y. G., Qiang, Z. G., Wu, K., Liu, J. J., Xia, X. C., Ji, W. Q., et al. (2012). Genetic gains in grain yield and physiological traits of winter wheat in Shandong province, China, from 1969 to 2006. Crop Sci. 52, 44–56. doi: 10.2135/cropsci2011.05.0246
Yan, L., Liang, F., Xu, H., Zhang, X., Zhai, H., Sun, Q., et al. (2017). Identification of QTL for grain size and shape on the D genome of natural and synthetic allohexaploid wheats with near-identical AABB genomes. Front. Plant. Sci. 8, 1705. doi: 10.3389/fpls.2017.01705
Yang, Z. B., Bai, Z. Y., Li, X. L., Wang, P., Wu, Q. X., Yang, L., et al. (2012). SNP identification and allelic-specific PCR markers development for TaGW2, a gene linked to wheat kernel weight. Theor. Appl. Genet. 125, 1057–1068. doi: 10.1007/s00122-012-1895-6
Zanke, C. D., Ling, J., Plieske, J., Kollers, S., Ebmeyer, E., Korzun, V., et al. (2015). Analysis of main effect QTL for thousand grain weight in European winter wheat (Triticum aestivum L.) by genome-wide association mapping. Front. Plant. Sci. 6, 644. doi: 10.3389/fpls.2015.00644
Zhai, H., Feng, Z., Du, X., Song, Y., Liu, X., Qi, Z., et al. (2018). A novel allele of TaGW2-A1 is located in a finely mapped QTL that increases grain weight but decreases grain number in wheat (Triticum aestivum L.). Theor. Appl Genet. 131 (3), 539–553. doi: 10.1007/s00122-017-3017-y
Zhang, P., He, Z., Tian, X., Gao, F., Xu, D., Liu, J., et al. (2017). Cloning of TaTPP-6AL1 associated with grain weight in bread wheat and development of functional marker. Mol. Breed. 37, 78. doi: 10.1007/s11032-017-0676-y
Zhang, X. Y., Chen, J. S., Shi, C. L., Chen, J. N., Zheng, F. F., Tian, J. C. (2013). Function of TaGW2-6A and its effect on grain weight in wheat (Triticum aestivum L.). Euphytica 192, 347–357. doi: 10.1007/s10681-012-0858-y
Zhang, Y., Liu, J., Xia, X., He, Z. (2014). TaGS-D1, an ortholog of rice OsGS3, is associated with grain weight and grain length in common wheat. Mol. Breed. 34, 1097–1107. doi: 10.1007/s11032-014-0102-7
Zhang, B., Liu, X., Xu, W. N., Chang, J. Z., Li, A., Mao, X. G., et al. (2015). Novel function of a putative MOC1 ortholog associated with spikelet number per spike in common wheat. Sci. Rep. 5, 12211. doi: 10.1038/srep12211
Zhang, L., Zhao, Y. L., Gao, L. F., Zhao, G. Y., Zhou, R. H., Zhang, B. S., et al. (2012). TaCKX6-D1, the ortholog of rice OsCKX2, is associated with grain weight in hexaploid wheat. New Phytol. 195, 574–584. doi: 10.1111/j.1469-8137.2012.04194.x
Zheng, J., Liu, H., Wang, Y. Q., Wang, L. F., Chang, X. P., Jing, R. L., et al. (2014). TEF-7A, a transcript elongation factor gene, influences yield-related traits in bread wheat (Triticum aestivum L.). J. Exp. Bot. 65, 5351–5365. doi: 10.1093/jxb/eru306
Keywords: Triticum aestivum, thousand-grain weight, genotyping-by-sequencing, haplotypes, elite yield trial
Citation: Sehgal D, Mondal S, Guzman C, Garcia Barrios G, Franco C, Singh R and Dreisigacker S (2019) Validation of Candidate Gene-Based Markers and Identification of Novel Loci for Thousand-Grain Weight in Spring Bread Wheat. Front. Plant Sci. 10:1189. doi: 10.3389/fpls.2019.01189
Received: 02 June 2019; Accepted: 29 August 2019;
Published: 26 September 2019.
Edited by:Anna Maria Mastrangelo, Council for Agricultural and Economics Research, Italy
Reviewed by:Jingjuan Zhang, Murdoch University, Australia
Yong Zhang, Chinese Academy of Agricultural Sciences, China
Copyright © 2019 Sehgal, Mondal, Guzman, Garcia Barrios, Franco, Singh and Dreisigacker. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Susanne Dreisigacker, firstname.lastname@example.org