ORIGINAL RESEARCH article

Front. Plant Sci., 07 July 2022

Sec. Plant Breeding

Volume 13 - 2022 | https://doi.org/10.3389/fpls.2022.945471

SoyMAGIC: An Unprecedented Platform for Genetic Studies and Breeding Activities in Soybean

  • Department of Plant Agriculture, Ontario Agriculture College, University of Guelph, Guelph, ON, Canada

Article metrics

View details

14

Citations

3,8k

Views

1,4k

Downloads

Abstract

Multi-Parent Advanced Generation Inter-Cross (MAGIC) populations are emerging genetic platforms for high-resolution and fine mapping of quantitative traits, such as agronomic and seed composition traits in soybean (Glycine max L.). We have established an eight-parent MAGIC population, comprising 721 recombinant inbred lines (RILs), through conical inter-mating of eight soybean lines. The parental lines were genetically diverse elite cultivars carrying different agronomic and seed composition characteristics, including amino acids and fatty acids, as well as oil and protein concentrations. This study aimed to introduce soybean MAGIC (SoyMAGIC) population as an unprecedented platform for genotypic and phenotypic investigation of agronomic and seed quality traits in soybean. The RILs were evaluated for important seed composition traits using replicated field trials during 2020 and 2021. To measure the seed composition traits, near-infrared reflectance (NIR) was employed. The RILs were genotyped using genotyping-by-sequencing (GBS) method to decipher the genome and discover single-nucleotide polymorphic (SNP) markers among the RILs. A high-density linkage map was constructed through inclusive composite interval mapping (ICIM). The linkage map was 3,770.75 cM in length and contained 12,007 SNP markers. Chromosomes 11 and 18 were recorded as the shortest and longest linkage groups with 71.01 and 341.15 cM in length, respectively. Observed transgressive segregation of the selected traits and higher recombination frequency across the genome confirmed the capability of MAGIC population in reshuffling the diversity in the soybean genome among the RILs. The assessment of haplotype blocks indicated an uneven distribution of the parents’ genomes in RILs, suggesting cryptic influence against or in favor of certain parental genomes. The SoyMAGIC population is a recombined genetic material that will accelerate further genomic studies and the development of soybean cultivars with improved seed quality traits through the development and implementation of reliable molecular-based toolkits.

Introduction

Since the 1920s, soybean [Glycine max (L.) Merr.] has been one of the major sources of protein and oil for human food and livestock feed in Canada (Singh and Hymowitz, 1999). Demand for this “king of beans” has been steadily increasing year-over-year due to its nutritional values for human and livestock, as well as industrial applications (Thrane et al., 2017). This growing demand has created a significant market for varieties with increased seed quality and yield, along with a range of improved agronomic traits. However, one of the main challenges for soybean breeders is the complexity associated with accumulating many of the desired quantitative traits in new cultivars. Many of these traits are regulated by multiple genes, located in different genomic regions, and tend to be dynamically regulated by a range of environmental, molecular, and biochemical factors (Whiting et al., 2020). A crucial step toward overcoming this challenge is deciphering the genetic structure of these quantitative traits, which can provide a prospect for plant breeders on how to select and develop cultivars with accumulated required traits.

Producing genetically recombinant crops through crossing two genetically diverse parents, so-called bi-parental crosses, has been one of the most important and common approaches for genetic studies and cultivar developments by plant geneticists and breeders. Genetic variation of the parental lines provides the opportunity to decipher and map genomic regions, quantitative trait loci (QTL), which are associated with the trait of interest (Miles et al., 2008). A wide range of genetic studies have been conducted to date to identify QTL regions associated with soybean seed quality traits using bi-parental populations (Eskandari et al., 2013; Pei et al., 2018; Chen et al., 2021). Nevertheless, bi-parental populations despite having strong mapping power suffer from insufficiency of recombination events and genetic diversity for a given locus, which results from genetic segregation of loci coming from only two parents (Diouf and Pascual, 2021). In addition, in respect to soybean seed quality traits, as each QTL has a smaller effect on the trait (Diers et al., 1992; Hu et al., 2021), achieving higher mapping resolution, i.e., “fine mapping,” for developing more durable molecular markers, can be challenging using this type of populations.

To address these limitations, various strategies have been proposed, including Advanced Intercrossed Lines (AILs), and Genome-Wide Association Studies (GWAS; Darvasi and Soller, 1995; Ozaki et al., 2002). However, AILs suffer from a low degree of genetic variation as a result of the presence of only two parents, and GWAS efficiency is also limited because of undetermined pedigree, missing parental information, and obtaining some false positive responses (Tam et al., 2019). A novel approach called “Multi-parent Advanced Generation Inter Crosses (MAGIC),” which was introduced by Kover et al. (2009), can to some extent address the above issues. In this approach, MAGIC populations resolve the issues associated with bi-parental analyses, and have a greater overall power in terms of genetic diversity, population structure, and mapping resolution (Huang et al., 2015; Diouf and Pascual, 2021). Developing MAGIC populations in self-pollinated crops includes crossing multiple genetically diverse inbred parental lines for several cycles, followed by single-seed descent selection process to produce recombinant inbred lines (RILs) carrying a mosaic of genome blocks from all parents (Scott et al., 2020). So far, the successful establishment of MAGIC population has been presented for several strategic crops such as maize (Jiménez-Galindo et al., 2019), barley (Novakazi et al., 2020), rice (Ponce et al., 2018), soybean (Shivakumar et al., 2018), and wheat (Stadlmeier et al., 2018). Scientific research publications in which MAGIC populations are used as the platform is showing a 250% increase in the last 10 years (Diouf and Pascual, 2021). The latter is facilitated by cost-effective, continuing, and reliable advances in high-throughput genotyping and phenotyping technologies that facilitated the establishment and evaluation of MAGIC populations with a large number of RILs along with well-developed phenotypic datasets.

The objective of this study was to develop and establish Soy MAGIC, an 8-founder soybean MAGIC population carrying various agronomic and seed composition traits, which can be used by researchers as an everlasting platform for deciphering and fine mapping of QTL associated with their target traits, and also to develop new value-added cultivars. Here, we present the process of SoyMAGIC development, high-density genetic linkage map construction as well as genetic features and validation of the population as a new genetic tool in soybean. The SoyMAGIC population with hundreds of RILs, each with a unique genetic combination of the eight parents and phenotypic performance, delivers a broad genetic resource for improving genetic gains of important traits in breeding programs as well as allowing for high precision QTL mapping of complex traits in soybean.

Materials and Methods

Development of Soybean MAGIC Population

To develop the SoyMAGIC population, the following eight elite soybean lines were used as the founders: (A) OAC Prosper (Eskandari et al., 2017), (B) OAC 13-55C-HL, (C) OAC 07-78C-LL, (D) AC X790P (Poysa and Buzzell, 2001), (E) RG 46, (F) RG 22, (G) RG 11, and (H) RG 23 (Figure 1). These genetically diverse parental lines were selected based on their diverse phenotypic performance for important agronomic and seed quality traits (Table 1). Parental lines were inter-crossed in the form of conical crosses, consisting of eight parents and three cycles of crosses (Figure 1). In the first cycle, for each cross, the F1 seeds of eight 2-way mating combinations of the eight parents were generated in a way that each parent was used once as the female parent and once as the male parent. In the second cycle, F1 seeds of eight 4-way crosses, executed between the 2-way F1 plants, were generated such that each founding parent is present only once as the female and once as the male. Following the same pattern, the F1 seeds of eight 8-way crosses were generated by crossing the 4-way F1 plants. The plants resulting from the advanced inter-crossing stage were progressed four generations by single seed decent (SSD) to create 721 homozygous recombinant inbreed individuals.

Figure 1

Table 1

Parent IDParent nameComment/characteristic
AOAC ProsperHigh protein, high yield, large seed, semi-determinate
BOAC 13-55C-HLHigh protein, high yield, and high linoleic
COAC 07-78C-LLHigh protein, high yield, and low linoleic
DAC X790PAverage yield, very high protein, large seed size
ERG 46High protein and oleic acid
FRG 22High oil, low stearic, and high linoleic acids
GRG 11High oil, high oleic, and low linolenic acids
HRG 23High protein, low stearic, and high oleic acids

Descriptive characteristics of the parental lines for establishing the SoyMAGIC population.

Experimental Design and Phenotyping

The RIL population was propagated in Ridgetown, Ontario, Canada (42°26′55.32″ N, 81°52′41.49″ W), during 2020 and 2021. The experiment was set up as a randomized complete block design (RCBD) with nearest neighbor adjustment and two replicates. Each plot consisted of five rows, 4.2 m long, with a row spacing of 43 cm. The rows were trimmed to 3.8 m in length after emergence, and the inside three rows were harvested. In each plot, 500 soybean seeds were planted to reach a plant density of 54 seeds per square meter (m−1). The plots were managed using conventional standard tillage, standard pest, and weed management treatments. Plants in three middle rows were harvested after reaching full maturity.

The total chemical composition of soybean seed (30 g) was measured using Perten DA 7250 SD Near-Infrared Reflectance (NIR) spectrometer (Perten Instruments, Hägersten, Sweden). Seed samples were placed in a 9 mm diameter clear glass bottle at 4 mm height for the NIR spectrometer. Evaluation of seeds was performed for chemical components concentration as intact (without any treatment) using calibrations provided by Perten Instruments, as reported by Whiting et al. (2020). Three technical replications were applied for each measurement. Statistical analysis and visualization of the phenotype data were completed using R software packages including ggplot2, heatmaply, pastecs, and plotly.

DNA Extraction and High-Throughput Genotyping

Young leaves were collected from each individual RILs and parental lines and stored at −80° C after lyophilization. Afterward, DNA was extracted using the Macherey-Nagle NuceloSpin II DNA kit (MACHEREY-NAGEL, Germany) according to the manufacturer’s instructions. DNA quality and quantity were assessed through Nano-drop spectrophotometer ND-1000 (Nanodrop Technologies, Inc., Wilmington, DE, United States) along with a Qubit v2.0 Fluorometer (Thermo Fisher Scientific Inc., United States), respectively. DNA quality of parental lines was verified using 1% agarose gel (Voltage) and stained with ethidium bromide prior to imaging on a GelDoc system (Supplementary Figure S1).

To genotype the RILs, sequencing libraries were prepared based on the genotyping by sequencing (GBS) protocol as explained by Elshire et al. (2011) except for the use of selective primers, which is described by Sonah et al. (2013) at the Plateforme d’analyses ge’nomiques (IBIS, Universite´ Laval). Normalized DNA concentrations of 10 ng/ml and restriction endonuclease of “ApeKI” were used in library preparation. Parental lines were genotyped by whole genome sequencing to obtain comprehensive genetic information as well as enough material for further investigations. Sequencing reads of parental lines were aligned to the reference genome, “William 82.” For the RILs, the variant call format (VCF) file was filtered out via VCFtools.1 After removing markers with more than 80% missing rate 183,482 SNPs remained out of 2,797,528 SNP markers. After individual level filtering, out of 760 individuals, 721 remained. Only bi-allelic SNPs remained. SNP imputation for the missing genotypes was carried out based on the haplotype structure of parental lines.

Physical map investigation and visualization were completed using rMVP and ggplot2 packages, R software (Wickham, 2017; Yin et al., 2021). Allelic contribution of parental lines in each chromosome was measured using “calc.genoprob” function with an error probability of 0.01 in the qtl2 package, R software (Broman et al., 2019).

Population Structure

Principal component analysis (PCA) was carried out using TASSEL V5.2 to calculate the patterns of multi-locus variation (Bradbury et al., 2007). To illustrate the dispersion of the RILs in the population, the first two principal components (PCs) were used. According to the method of (Nei and Li, 1979), pairwise similarity coefficients were determined for all pairwise combinations of the RILs. To explore and visualized the familial relatedness among RILs, a Kinship matrix was also calculated using Genome Associated Prediction Integrated Tool (GAPIT) package in R (Lipka et al., 2012; Supplementary Figures S2 and S3).

Construction of Genetic Linkage Map

Genetic linkage map constriction of SoyMAGIC population was conducted using the inclusive composite interval mapping (ICIM-ADD) method in GAPL V1.2 software (Zhang et al., 2019). Before running the map construction, quality of the genotypic data was checked by the software. First, “SNP data conversion” function was used to convert the genetic dataset to the format of the software. Non-polymorphed markers either in parents or progenies and markers which were missing in one or more parents were filtered out. Afterward, identification and filtering of redundant markers was applied to remove the markers with a missing rate of ≥10%, while the markers with the minimum missing rate were set to present the co-localized markers. In a particular population, a set of co-localized markers was defined as one bin. Markers with heterozygosity of more than 12.5% were discarded.

“Map construction in multi-parent derived pure-line populations” function was used to construct the genetic linkage map of SoyMAGIC population. Anchoring of markers with known chromosome ID on the physical map was the first step. Then, a grouping of markers was accomplished through anchored marker information and a threshold of marker recombination frequency (REC) of 0.3 for unanchored markers. For marker ordering, the two-optTSP and nearest-neighbor algorithms were used (Lin and Kernighen, 1973). Eventually, a window size of five-SNP was used as the rippling standard to measure the sum of adjacent recombination frequencies. Kosambi’s mapping function was used to convert the recombination frequency into map distance and the visualization of the genetic map was carried out using LinkageMapView package in R software (Ouellette et al., 2018).

Results

Population Development and Genotyping

A set of 721 soybean MAGIC RILs was produced through three and four generations of advanced inter-crossing and self-pollination, respectively (Figure 1). GBS of RILs resulted in a total of 183,342 SNPs that were polymorphic between the eight parents and RILs. The RILs were on average 87.9% homozygous and appeared highly diverse and clustered uniformly relative to their eight parents, among which RG11, RG22, and RG23 were closer to each other than the other parent-to-parent relationships (Figure 2).

Figure 2

Genomic Features and Recombination Frequency of SoyMAGIC

After discarding markers with a MAF ≤0.05 and heterozygous rate ≤0.13 from the 183,342 polymorphic SNPs and 721 individuals, 716 individuals with 122,747 SNPs remained, which were distributed across the whole soybean genome with an average spacing of 0.915 kb. Marker distribution varied among and within 20 chromosomes of soybean (Figure 3A). In the physical map, the largest and smallest numbers of markers were observed in Chromosomes 18 and 11 with 13,476 and 1,644 SNPs, respectively (Figure 3B). The mean genome-wide SNP number was recorded as 6,317 per chromosome (Figure 3B). Comparison of detected chromosome-wide markers with a gene density of G. max cultivar “William 82, genome assembly version 4” (Schmutz et al., 2010) demonstrated higher SNP frequency in the centromeric region of chromosome 2, 4, 18, and 20.2

Figure 3

The distribution of average major allele frequency (AF), minor AF, and proportion of heterozygotes is illustrated in Figure 4. The average proportion of heterozygotes was 0.121 and 0.034 in the RILs and the parental lines, respectively. Average minor AF was 0.268 in parental lines and 0.188 among RILs, while the average major allele frequency was 0.732 and 0.812 in parental lines and RILs, respectively. The results indicated that the average MAF of the RILs was ranged from 0.101 on chromosome 19 to 0.337 on chromosome 14. This suggests that the SoyMAGIC RILs have higher average MAF and adequate polymorphism than the threshold (MAF < 0.05) for further genomic studies.

Figure 4

Additionally, genome-wide and chromosome-wide assessment of parent’s allelic probability suggested that some parents contributed more to the SoyMAGIC RILs than others. Parents A and B with an average contribution of 19.3% and 14.2%, respectively, were more influential than the others (Figure 5). In contrast, parents D and E with an average contribution of 9.6% and 9.3%, respectively, were the least influential ones. Chromosomes 5 and 15 were recorded as the most unbalanced chromosomes with a maximum representation of parents A and G, respectively, and a minimum representation of parent F in both chromosomes.

Figure 5

Phenotypic Variation in SoyMAGIC

The normal distribution of phenotypic data was verified and confirmed by Shapiro Wilk test after removing outliers. As illustrated in Table 2, descriptive statistics of phenotypic data for RILs and parental lines were calculated. Almost all the selected seed composition traits showed lower minimums and higher maximums for RILs than parental lines. Moreover, the mean value of the protein and oil concentration was recorded higher in RILs than in parental lines. In terms of the fatty acids, the mean value of oleic, palmitic, and stearic acids decreased, whereas the mean value of linolenic and linoleic acids increased in RILs as compared to the parental lines. Amino acids such as histidine, alanine, tryptophan, phenylalanine, tyrosine, and proline had higher mean values, whereas others had a lower mean for the RILs than the parental lines. Pearson’s correlation coefficient analysis of the seed quality traits was also measured among both parental lines and RILs. A positive correlation between all measured amino acids and seed protein concentration (r > 0.9) was observed. However, negative correlation was observed between the amino acids and fatty acids. In addition, as was expected, oleic acid showed a significant negative correlation with linoleic and linolenic acids (Figure 6).

Table 2

TraitsRILsParents
MinMaxMeanSDMinMaxMeanSD
Protein38.3549.4944.161.42441.2748.3943.911.998
Oil18.4524.4721.550.95818.2023.2721.241.453
Palmitic acid9.9614.0212.140.49111.5412.8912.330.443
Stearic acid2.805.454.140.3643.375.544.400.651
Oleic acid15.1040.3426.723.54317.7539.6127.776.053
Linoleic acid39.1857.5648.562.86237.3153.9847.635.331
Linolenic acid3.4210.496.790.9423.839.066.541.484
Alanine1.662.021.850.0451.751.981.830.064
Arginine2.783.813.330.1382.963.613.270.183
Cysteine0.440.630.530.0330.520.610.560.034
Glycine1.672.051.850.0511.772.011.850.071
Histidine1.021.271.140.0351.061.251.130.054
Isoleucine1.822.232.060.0561.922.192.010.081
Leucine2.983.763.410.1013.183.663.370.139
Lysine2.283.022.760.0782.522.982.730.122
Methionine0.470.600.540.0190.520.590.560.021
Phenylalanine1.992.542.290.0712.122.492.250.107
Proline1.872.332.120.0671.962.282.080.101
Serine1.632.101.880.0711.752.081.870.091
Threonine1.461.761.600.0441.511.731.590.065

Quantitative statistics for seed composition traits of parents and RILs in SoyMAGIC population.

Figure 6

Genetic Linkage Map

After filtering out missing and low-quality markers using GAPL V1.2, 12,007 polymorphic SNPs were grouped into 20 linkage groups (LGs) with a total genome size of 3,770.75 centiMorgans (cM; Table 3). The highest and lowest map length was observed in LG18 and LG12 with 341.15 and 71.01 cM, respectively. The average length, across the LGs, was 188.54 cM. The number of markers for each linkage group ranged from 237 to 1,422 with an average of 600.35 marker. Additionally, the average marker interval was 0.37 cM. LG4 with an average distance of 0.15 cM was recorded as the densest LG, whereas LG7 had the largest average interval distance of 0.60 cM. The maximum and minimum interval distances were observed in LG19 and LG20 with 20.03 and 2.57 cM, respectively.

Table 3

Linkage groupMap lengthNo. of markersNo. of binsInterval distance
MaxAverage
1151.083771106.170.4
2165.74211163.760.39
3180.187092155.550.25
4212.031,4224846.030.15
5153.773911314.180.39
6215.754641675.40.46
7180.69303955.670.6
8182.654961636.550.37
9273.476552295.80.42
10183.133521137.620.52
11137.45266985.70.52
1271.01247773.090.29
13179.053811434.790.47
14263.071,1584937.520.23
15230.971,1103785.240.21
16203.096092337.120.33
17189.055931996.540.32
18341.151,1263225.650.3
19137.372376320.030.58
20120.16902322.570.17
Total3770.7512,0074,06120.030.37

Information on linkage map of the SoyMAGIC population.

Discussion

MAGIC populations are exceptional genetic resources for improving the recombination frequency of resultant RILs and discovering marker-trait relationships with high accuracy and resolution accordingly (Scott et al., 2020). Multiple parents with greater phenotypic and genetic variation, as well as multiple rounds of inter-crossing and selfing, enhance the number of recombination events and therefore maximize mapping accuracy (Huang et al., 2015). Through inter-crossing diverse parents for a particular trait, the genetic variability in the final RILs increases, which is a decisive advantage of developing these types of populations for genetic studies (Scott et al., 2020). Several studies have previously exploited MAGIC populations for investigating genetic control of important trait in strategic crops such as maize (Jiménez-Galindo et al., 2019), rice (Ponce et al., 2018) and wheat (Stadlmeier et al., 2018). Here, we report the establishment of a soybean MAGIC (SoyMAGIC) population developed by combining eight parental lines that were genetically and phenotypically diverse for several agronomic and seed quality traits (Table 1, Supplementary Table S1, and Figure 1).

In plant breeding programs, a large population size is one of the necessary factors to maximize the mapping resolution (Beavis, 1998; Rosenthal and Borschbach, 2014). The SoyMAGIC population was maintained reasonably large at 721 RILs, to accumulate a wider range of recombination events, using a reciprocal conical design (Figure 1). To capture the maternal cytoplasmic genetic variance of parents (Morgan, 2013), the reciprocal conical crossing strategy was used during population development.

Soybean seed compositions, particularly oil and protein concentrations, are among the most studied traits in soybean due to their economic importance in the food and feed industries (Kumawat et al., 2016). Phenotypically, larger standard deviations, maximum and minimum values of the selected traits of RILs compared to parental lines (Table 2), confirmed the transgressive segregations and indicated the capability of SoyMAGIC population in reshuffling the genome in RILs. In fact, intensification of the genetic variation across the genome of RILs was because of the way that the population is developed. Similar results were reported for multi-parent populations of other plant crops such as rice, maize, cowpea, and eggplant, confirming the competence of multi-parent populations in reshuffling of genome and improving the recombination level (Dell’Acqua et al., 2015; Huynh et al., 2018; Ponce et al., 2020; Mangino et al., 2022). Since the eight parents were all completely inbred lines, the plants in each F1 set were homogeneously heterozygous. Theoretically, the F1s resulting from the four-way crosses, on the other hand, segregate and show substantial heterogeneity (Figure 1). This heterozygosity and heterogeneity generated individuals with recombined genotypes and phenotypes. Furthermore, using four generations of SSD selection, in which we did not apply any targeted selective pressure for any of the target traits, a genetically and phenotypically diverse RIL population consisting of 721 was generated and established as the SoyMAGIC population.

To discriminate genotypes for their genetic diversity in plant genetic and breeding activities, GBS has already been confirmed to be an exceptionally efficient and cost-effective approach for the genotyping of large multi and bi-parental populations (He et al., 2014; Kishor et al., 2021). WGS of parental lines has also been reported as a highly effective genotyping strategy in multiparent plant breeding programs, which can be employed in further genetic investigations such as QTL mapping and identification of candidate genes (Islam et al., 2016; Thyssen et al., 2019). Detection of 183,342 SNP markers across the genome, confirmed that GBS of RILs, imputed using WGS of the parental lines, could be a suitable method for generating a high-resolution map for soybean multiparent genotyping. In this study, higher number of SNP markers was observed around telomeric regions of most of the chromosomes, whereas chromosome 5, 7, 12 and 13 exhibited higher SNP density around centromeric area (Figure 3). These results reflect the strength of SoyMAGIC population in reshuffling alleles across the genome and providing a highly recombined genomic platform suitable for discovering QTL/candidate genes associated with complex traits. Theoretically, in an 8-parent MAGIC population, each of the parental lines should contribute 12.5%. However, certain paternal lines contributed more to the SoyMAGIC population than others (Figure 5). The observed variance in the contribution of founders might be caused by a variety of genotypic or environmental factors such as fertility reduction or male sterility due to environmental conditions (Brauner-Otto, 2014; Li et al., 2019).

It has been shown that SNP discovery in soybeans is a challenging and time-consuming process (Wu et al., 2010). Limited sequence variation in currently cultivated varieties as well as the complicated nature of the soybean genome are two critical factors causing the complications (Choi et al., 2007). Considering these challenges, we have constructed a new and high-density genetic linkage map that contains 12,007 SNP markers with a genome length of 3,770.75 cM by employing an eight-parent RIL population. Compare to the previous studies on soybean genetic linkage maps of bi-parental populations (Hyten et al., 2010; Song et al., 2016), the current map demonstrated a greater number of distinct sites, comparable genome length, and shorter average bin size (Table 3, Figure 7). In comparison to bi-parental populations (Hyten et al., 2010), the SoyMAGIC population displayed a significantly higher number of marker alleles at each locus, which reflects the capacity of SoyMAGIC for enhancing genetic variation and recombination frequency in the population.

Figure 7

Establishing genetic linkage map is an important step for the dissection of genome regions associated with important agronomic and quality traits through identifying the location of quantitative trait loci (QTL; Williams, 2018). Through improving genetic recombination in RILs, SoyMAGIC has provided a desired platform for discovering marker’s location across the genome and constructing a high-density genetic linkage map, which, in turn, provided a strong platform for further marker-trait association investigations. So far, several MAGIC population-derived RILs have been developed to dissect the genome of many crops using different mapping strategies (Scott et al., 2020). For instance, Huynh et al. (2018) used linkage map in an eight parent cowpea MAGIC population with 305 RILs, leading to the successful detection of four QTL underlying flowering time. Huang et al. (2021) using genome-wide association mapping in an 8-way upland cotton MAGIC population, discovered 177 SNPs strongly associated with nine agronomic traits in multiple environments. SoyMAGIC population will provide researchers with immortal diverse plant materials that can be tested across a wide range of environments with different types of biotic and abiotic stresses for discovering environment-specific effects of genomic regions associated with traits. Genotypic and phenotypic data generated for these studies will be stored and made available to breeders for improving their selection criteria and establishing efficient breeding strategies.

Conclusion

In addition to serving as an immortal genetic resource for precise marker-traits association studies and precise QTL mapping, SoyMAGIC will support breeding programs in the long run by offering valuable pre-breeding resources. The preliminary phenotypic data collected on agronomic and seed quality traits along with the SNP data set showed large phenotypic and genetic diversity among the lines within the population, which indicate the potential benefits and advantages of using this diverse germplasm in genetic studies and breeding activities by the soybean community. SoyMAGIC has been established by inter-crossing eight founders using reciprocal conical crosses in order to maintain maternal genetic materials and high recombination rate in the RILs. The population represents a valuable plant germplasm resource, which consists of 721 highly recombined RILs with a large degree of phenotypic variation. We have developed the first high-density genetic linkage map of an eight-parent MAGIC population in soybean that allows efficient discovery of gene-trait associations and QTL mapping of quantitatively inherited traits.

Funding

This project was funded in part through the Ontario Regional Priorities Partnership Program (ON-RP3), a collaborative initiative between the Agricultural Adaptation Council, Ontario Genomics, the Government of Canada through Genome Canada, and SeCan.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Statements

Data availability statement

The original contributions presented in the study are publicly available. This data can be found here: https://github.com/SeyedMH/SoyMAGIC.

Author contributions

ME: conceptualization. SH: validation, data curation, visualization, and writing. SH and GP: formal analysis. SH, ME, IR, and GP: review and editing. ME and SH: project administration. All authors have read and agreed to the published version of the manuscript.

Acknowledgments

The authors acknowledge Robert Brandt and Lin Liao for managing field trails and making crosses during the development of the SoyMAGIC population and Sepideh Torabi for sharing insights on soybean genomics experiments and research.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.945471/full#supplementary-material

References

  • 1

    BeavisW. D. (1998). QTL Analyses: Power, Precision, and Accuracy.1st Edn.Boca Raton, FL: CRC Press.

  • 2

    BradburyP. J.ZhangZ.KroonD. E.CasstevensT. M.RamdossY.BucklerE. S. (2007). TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics23, 26332635. doi: 10.1093/bioinformatics/btm308

  • 3

    Brauner-OttoS. R. (2014). Environmental quality and fertility: the effects of plant density, species richness, and plant diversity on fertility limitation. Popul. Environ.36, 131. doi: 10.1007/s11111-013-0199-3

  • 4

    BromanK. W.GattiD. M.SimecekP.FurlotteN. A.PrinsP.SenS.et al. (2019). R/qtl2: software for mapping quantitative trait loci with high-dimensional data and multiparent populations. Genetics211, 495502. doi: 10.1534/genetics.118.301595

  • 5

    ChenH.PanX.WangF.LiuC.WangX.LiY.et al. (2021). Novel QTL and meta-QTL mapping for major quality traits in soybean. Front. Plant Sci.12, 122. doi: 10.3389/fpls.2021.774270

  • 6

    ChoiI. Y.HytenD. L.MatukumalliL. K.SongQ.ChakyJ. M.QuigleyC. V.et al. (2007). A soybean transcript map: gene distribution, haplotype and single-nucleotide polymorphism analysis. Genetics176, 685696. doi: 10.1534/genetics.107.070821

  • 7

    DarvasiA.SollerM. (1995). Advanced intercross lines, an experimental population for fine genetic mapping. Genet. Soc. Am.141, 11991207.

  • 8

    Dell’AcquaM.GattiD. M.PeaG.CattonaroF.CoppensF.MagrisG.et al. (2015). Genetic properties of the MAGIC maize population: a new platform for high definition QTL mapping in Zea mays. Genome Biol.16:167. doi: 10.1186/s13059-015-0716-z

  • 9

    DiersB. W.KeimP.FehrW. R.ShoemakerR. C. (1992). RFLP analysis of soybean seed protein and oil content. Theor. Appl. Genet.83, 608612. doi: 10.1007/BF00226905

  • 10

    DioufI.PascualL. (2021). Multiparental population in crops: methods of development and dissection of genetic traits. Methods Mol. Biol.2264, 1332. doi: 10.2135/1983.cropbreeding

  • 11

    ElshireR. J.GlaubitzJ. C.SunQ.PolandJ. A.KawamotoK.BucklerE. S.et al. (2011). A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One6, 110. doi: 10.1371/journal.pone.0019379

  • 12

    EskandariM.AblettG. R.RajcanI.FischerD.StirlingB. T. (2017). OAC prosper soybean. Can. J. Plant Sci.97, 337339. doi: 10.1139/cjps-2016-0210

  • 13

    EskandariM.CoberE. R.RajcanI. (2013). Genetic control of soybean seed oil: II. QTL and genes that increase oil concentration without decreasing protein or with increased seed yield. Theor. Appl. Genet.126, 16771687. doi: 10.1007/s00122-013-2083-z

  • 14

    HeJ.ZhaoX.LarocheA.LuZ. X.LiuH. K.LiZ. (2014). Genotyping-by-sequencing (GBS), an ultimate marker-assisted selection (MAS) tool to accelerate plant breeding. Front. Plant Sci.5, 18. doi: 10.3389/fpls.2014.00484

  • 15

    HuQ.ZhangY.MaR.AnJ.HuangW.WuY.et al. (2021). Genetic dissection of seed appearance quality using recombinant inbred lines in soybean. Mol. Breed.41:72. doi: 10.1007/s11032-021-01262-9

  • 16

    HuangC.ShenC.WenT.GaoB.ZhuD.LiD.et al. (2021). Genome-wide association mapping for agronomic traits in an 8-way upland cotton MAGIC population by SLAF-seq. Theor. Appl. Genet.134, 24592468. doi: 10.1007/s00122-021-03835-w

  • 17

    HuangB. E.VerbylaK. L.VerbylaA. P.RaghavanC.SinghV. K.GaurP.et al. (2015). MAGIC populations in crops: current status and future prospects. Theor. Appl. Genet.128, 9991017. doi: 10.1007/s00122-015-2506-0

  • 18

    HuynhB. L.EhlersJ. D.HuangB. E.Muñoz-AmatriaínM.LonardiS.SantosJ. R. P.et al. (2018). A multi-parent advanced generation inter-cross (MAGIC) population for genetic analysis and improvement of cowpea (Vigna unguiculata L. Walp.). Plant J.93, 11291142. doi: 10.1111/tpj.13827

  • 19

    HytenD. L.ChoiI. Y.SongQ.SpechtJ. E.CarterT. E.ShoemakerR. C.et al. (2010). A high density integrated genetic linkage map of soybean and the development of a 1536 universal soy linkage panel for quantitative trait locus mapping. Crop Sci.50, 960968. doi: 10.2135/cropsci2009.06.0360

  • 20

    IslamM. S.ThyssenG. N.JenkinsJ. N.ZengL.DelhomC. D.McCartyJ. C.et al. (2016). A MAGIC population-based genome-wide association study reveals functional association of GhRBB1_A07 gene with superior fiber quality in cotton. BMC Genomics17, 903. doi: 10.1186/s12864-016-3249-2

  • 21

    Jiménez-GalindoJ. C.MalvarR. A.ButrónA.SantiagoR.SamayoaL. F.CaicedoM.et al. (2019). Mapping of resistance to corn borers in a MAGIC population of maize. BMC Plant Biol.19, 431417. doi: 10.1186/s12870-019-2052-z

  • 22

    KishorD. S.LeeH. Y.AlavilliH.YouC. R.KimJ. G.LeeS. Y.et al. (2021). Identification of an allelic variant of the CsOr gene controlling fruit endocarp color in cucumber (Cucumis sativus L.) using genotyping-by-sequencing (GBS) and whole-genome sequencing. Front. Plant Sci.12, 113. doi: 10.3389/fpls.2021.802864

  • 23

    KoverP. X.ValdarW.TrakaloJ.ScarcelliN.EhrenreichI. M.PuruggananM. D.et al. (2009). A multiparent advanced generation inter-cross to fine-map quantitative traits in Arabidopsis thaliana. PLoS Genet.5:e1000551. doi: 10.1371/journal.pgen.1000551

  • 24

    KumawatG.GuptaS.RatnaparkheM. B.MarannaS.SatputeG. K. (2016). QTLomics in soybean: a way forward for translational genomics and breeding. Front. Plant Sci.7:1852. doi: 10.3389/fpls.2016.01852

  • 25

    LiJ.NadeemM.SunG.WangX.QiuL. (2019). Male sterility in soybean: occurrence, molecular basis and utilization. Plant Breed.138, 659676. doi: 10.1111/pbr.12751

  • 26

    LinS.KernighenB. W. (1973). An effective heuristic algorithm for the traveling-salesman problem. Oper. Res.21, 498516. doi: 10.1287/opre.21.2.498

  • 27

    LipkaA. E.TianF.WangQ.PeifferJ.LiM.BradburyP. J.et al. (2012). GAPIT: genome association and prediction integrated tool. Bioinformatics28, 23972399. doi: 10.1093/bioinformatics/bts444

  • 28

    ManginoG.ArronesA.PlazasM.PookT.ProhensJ.GramazioP.et al. (2022). Newly developed MAGIC population allows identification of strong associations and candidate genes for anthocyanin pigmentation in eggplant. Front. Plant Sci.13, 115. doi: 10.3389/fpls.2022.847789

  • 29

    MilesB. C. M.PhD.WayneM.EducationP. D. N. (2008). Quantitative trait locus (QTL) analysis. Nat. Educ.1, 17.

  • 30

    MorganT. H. (2013). Reciprocal Cross.AmsterdamElsevier Inc.

  • 31

    NeiM.LiW. H. (1979). Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc. Natl. Acad. Sci. U. S. A.76, 52695273. doi: 10.1073/pnas.76.10.5269

  • 32

    NovakaziF.KrusellL.JensenJ. D.OrabiJ.JahoorA.BengtssonT.et al. (2020). You had me at “MAGIC”!: four barley MAGIC populations reveal novel resistance QTL for powdery mildew. Genes (Basel)11:1512. doi: 10.3390/genes11121512

  • 33

    OuelletteL. A.ReidR. W.BlanchardS. G.BrouwerC. R. (2018). LinkageMapView-rendering high-resolution linkage and QTL maps. Bioinformatics34, 306307. doi: 10.1093/bioinformatics/btx576

  • 34

    OzakiK.OhnishiY.IidaA.SekineA.YamadaR.TsunodaT.et al. (2002). Functional SNPs in the lymphotoxin-α gene that are associated with susceptibility to myocardial infarction. Nat. Genet.32, 650654. doi: 10.1038/ng1047

  • 35

    PeiR.ZhangJ.TianL.ZhangS.HanF.YanS.et al. (2018). Identification of novel QTL associated with soybean isoflavone content. Crop J.6, 244252. doi: 10.1016/j.cj.2017.10.004

  • 36

    PonceK. S.YeG.ZhaoX. (2018). QTL identification for cooking and eating quality in indica rice using multi-parent advanced generation intercross (MAGIC) population. Front. Plant Sci.9, 19. doi: 10.3389/fpls.2018.00868

  • 37

    PonceK.ZhangY.GuoL.LengY.YeG. (2020). Genome-wide association study of grain size traits in indica rice multiparent advanced generation intercross (MAGIC) population. Front. Plant Sci.11:395. doi: 10.3389/fpls.2020.00395

  • 38

    PoysaV.BuzzellR. I. (2001). AC X790P soybean. Can. J. Plant Sci.81, 447448. doi: 10.4141/P00-186

  • 39

    RosenthalS.BorschbachM. (2014). Impact of population size, selection and multi-parent recombination within a customized NSGA-II and a landscape analysis for biochemical optimization. Int. J. Adv. Life Sci.6, 310324.

  • 40

    SchmutzJ.CannonS. B.SchlueterJ.MaJ.MitrosT.NelsonW.et al. (2010). Genome sequence of the palaeopolyploid soybean. Nature463, 178183. doi: 10.1038/nature08670

  • 41

    ScottM. F.LadejobiO.AmerS.BentleyA. R.BiernaskieJ.BodenS. A.et al. (2020). Multi-parent populations in crops: a toolbox integrating genomics and genetic mapping with breeding. Heredity (Edinb).125, 396416. doi: 10.1038/s41437-020-0336-6

  • 42

    ShivakumarM.KumawatG.GireeshC.RameshS. V.HusainS. M. (2018). Soybean MAGIC population: a novel resource for genetics and plant breeding. Curr. Sci.114, 906908. doi: 10.18520/cs/v114/i04/906-908

  • 43

    SinghR. J.HymowitzT. (1999). Soybean genetic resources and crop improvement. Genome42, 605616. doi: 10.1139/g99-039

  • 44

    SonahH.BastienM.IquiraE.TardivelA.LégaréG.BoyleB.et al. (2013). An improved genotyping by sequencing (GBS) approach offering increased versatility and efficiency of SNP discovery and genotyping. PLoS One8, e54603e54609. doi: 10.1371/journal.pone.0054603

  • 45

    SongQ.JenkinsJ.JiaG.HytenD. L.PantaloneV.JacksonS. A.et al. (2016). Construction of high resolution genetic linkage maps to improve the soybean genome sequence assembly Glyma1.01. BMC Genomics17:33. doi: 10.1186/s12864-015-2344-0

  • 46

    StadlmeierM.HartlL.MohlerV. (2018). Usefulness of a multiparent advanced generation intercross population with a greatly reduced mating design for genetic studies in winter wheat. Front. Plant Sci.9, 112. doi: 10.3389/fpls.2018.01825

  • 47

    TamV.PatelN.TurcotteM.BosséY.ParéG.MeyreD. (2019). Benefits and limitations of genome-wide association studies. Nat. Rev. Genet.20, 467484. doi: 10.1038/s41576-019-0127-1

  • 48

    ThraneM.PaulsenP. V.OrcuttM. W.KriegerT. M. (2017). Soy Protein: Impacts, Production, and Applications.Amsterdam: Elsevier Inc.

  • 49

    ThyssenG. N.JenkinsJ. N.McCartyJ. C.ZengL.CampbellB. T.DelhomC. D.et al. (2019). Whole genome sequencing of a MAGIC population identified genomic loci and candidate genes for major fiber quality traits in upland cotton (Gossypium hirsutum L.). Theor. Appl. Genet.132, 989999. doi: 10.1007/s00122-018-3254-8

  • 50

    WhitingR. M.TorabiS.LukensL.EskandariM. (2020). Genomic regions associated with important seed quality traits in food-grade soybeans. BMC Plant Biol.20, 485414. doi: 10.1186/s12870-020-02681-0

  • 51

    WickhamH. (2017). ggplot2 – elegant graphics for data analysis (2nd Edn.). J. Stat. Softw.77, 35. doi: 10.18637/jss.v077.b02

  • 52

    WilliamsK. L. (2018). “Gene mapping,” in Encyclopedia of Bioinformatics and Computational Biology: ABC of Bioinformatics, RanganathanS.GribskovM.NakaiK.SchönbachC.GaetaB.Cambridge, MA: Academic Press. 242250.

  • 53

    WuX.RenC.JoshiT.VuongT.XuD.NguyenH. T. (2010). SNP discovery by high-throughput sequencing in soybean. BMC Genomics11:469. doi: 10.1186/1471-2164-11-469

  • 54

    YinL.ZhangH.TangZ.XuJ.YinD.ZhangZ.et al. (2021). rMVP: a memory-efficient, visualization-enhanced, and parallel-accelerated tool for genome-wide association study. Genom. Proteom. Bioinforma.19, 619628. doi: 10.1016/j.gpb.2020.10.007

  • 55

    ZhangL.MengL.WangJ. (2019). Linkage analysis and integrated software GAPL for pure-line populations derived from four-way and eight-way crosses. Crop J.7, 283293. doi: 10.1016/j.cj.2018.10.006

Summary

Keywords

soybean (Glycine max L.), genetic linkage map, genotyping by sequencing, multi-parent advanced generation inter-crosses, Seed composition/quality

Citation

Hashemi SM, Perry G, Rajcan I and Eskandari M (2022) SoyMAGIC: An Unprecedented Platform for Genetic Studies and Breeding Activities in Soybean. Front. Plant Sci. 13:945471. doi: 10.3389/fpls.2022.945471

Received

16 May 2022

Accepted

17 June 2022

Published

07 July 2022

Volume

13 - 2022

Edited by

Kazuo N. Watanabe, University of Tsukuba, Japan

Reviewed by

Giriraj Kumawat, ICAR Indian Institute of Soybean Research, India; Milind B. Ratnaparkhe, ICAR Indian Institute of Soybean Research, India

Updates

Copyright

*Correspondence: Milad Eskandari,

This article was submitted to Plant Breeding, a section of the journal Frontiers in Plant Science

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics