ORIGINAL RESEARCH article

Front. Genet., 01 September 2021

Sec. Statistical Genetics and Methodology

Volume 12 - 2021 | https://doi.org/10.3389/fgene.2021.637133

Impact of Genomic Prediction Model, Selection Intensity, and Breeding Strategy on the Long-Term Genetic Gain and Genetic Erosion in Soybean Breeding

  • 1. Department of Agronomy, Universidade Estadual do Centro-Oeste, Guarapuava, Brazil

  • 2. Department of Biostatistics, Corteva AgriscienceTM, Johnston, IA, United States

  • 3. Department of Agronomy, Purdue University, West Lafayette, IN, United States

Abstract

Genomic-assisted breeding has become an important tool in soybean breeding. However, the impact of different genomic selection (GS) approaches on short- and long-term gains is not well understood. Such gains are conditional on the breeding design and may vary with a combination of the prediction model, family size, selection strategies, and selection intensity. To address these open questions, we evaluated various scenarios through a simulated closed soybean breeding program over 200 breeding cycles. Genomic prediction was performed using genomic best linear unbiased prediction (GBLUP), Bayesian methods, and random forest, benchmarked against selection on phenotypic values, true breeding values (TBV), and random selection. Breeding strategies included selections within family (WF), across family (AF), and within pre-selected families (WPSF), with selection intensities of 2.5, 5.0, 7.5, and 10.0%. Selections were performed at the F4 generation, where individuals were phenotyped and genotyped with a 6K single nucleotide polymorphism (SNP) array. Initial genetic parameters for the simulation were estimated from the SoyNAM population. WF selections provided the most significant long-term genetic gains. GBLUP and Bayesian methods outperformed random forest and provided most of the genetic gains within the first 100 generations, being outperformed by phenotypic selection after generation 100. All methods provided similar performances under WPSF selections. A faster decay in genetic variance was observed when individuals were selected AF and WPSF, as 80% of the genetic variance was depleted within 28–58 cycles, whereas WF selections preserved the variance up to cycle 184. Surprisingly, the selection intensity had less impact on long-term gains than did the breeding strategies. The study supports that genetic gains can be optimized in the long term with specific combinations of prediction models, family size, selection strategies, and selection intensity. A combination of strategies may be necessary for balancing the short-, medium-, and long-term genetic gains in breeding programs while preserving the genetic variance.

Introduction

Soybean [Glycine max (L.)] is the most important source of protein for animal feed and an important source of oil for human consumption, biofuel, and other industrial applications. Soybeans are cultivated globally, and the largest producers include Brazil, United States, Argentina, Paraguay, and China (). Soybeans are bred for several traits, but grain yield is considered as the most important.

Genome-wide prediction is a key tool in soybean breeding. It is utilized for faster and more accurate selection of superior individuals (Meuwissen et al., 2001). Methodologically, genomic models recreate the framework utilized for pedigree analysis, but using genomic relationships instead (VanRaden, 2008; ; VanRaden et al., 2011). Other factors that may have contributed to the increasing adoption of genomic selection (GS) in plants include the decreasing cost of genotyping and the availability of software tools and computing power to analyze large datasets.

Studies involving GS in plants have been mostly focused on prediction for advancement purposes, hence restricted to the evaluation of genetic gain within a single generation (Schmutz et al., 2010; Sonah et al., 2013; ; Xavier et al., 2016, 2018a,b; ; Smallwood et al., 2019). Studies of long-term gains based on GS are expensive and time-consuming; consequently, the literature is scarce (Wray and Goddard, 1994; ; Yabe et al., 2016; ; ). In addition, evaluation with real data from breeding programs faces additional challenges, such as the ongoing changes in breeding pipelines driven by business decisions, changes in the genotyping technology, and annual changes in resources. Conversely, the deployment of simulations has become an instrumental decision tool in plant breeding. It enables the assessment of genetic gain under different scenarios. In part, the increasing popularity of simulations is due to the quantity and flexibility of software made available (; Pook et al., 2019; Toledo et al., 2019). For instance, breeders are now capable of simulating entire breeding programs with the intent of tuning the breeding parameters to maximize genetic gains in the short and long term (; ), along with the best allocation of resources for a given budget.

By assessing predictive models and contrasting selection strategies, this study envisioned analyzing the influence of a set of variables on long-term genetic gains based on a simulated soybean breeding program and providing insight into the best practices for optimizing genetic gains.

Materials and Methods

Simulated Populational Parameters

The founder breeding population contained 200 individuals. Those were simulated based on the genomic parameters using the Markovian Coalescent Simulator (MaCS; ), which recreates the evolutionary process with multiple cycles of drift, mutation, and selection. The genomic parameters for the simulations reproduce the soybean genome with detailed information (Schmutz et al., 2010). We considered a genetic map architecture of 20 chromosomes with 115 cM average length, which collectively spanned 950 Mb. For each chromosome, 1,000 segregating sites were assigned.

Our study focused on the simulation of grain yield (in tons per hectare) as the primary trait of interest. The genetic architecture of the simulated trait was assumed to be infinitesimal with 70% of all segregating sites, which were not necessarily utilized as markers, having a non-zero effect sampled from a normal distribution. The genotype-by-environment variance provided a non-heritable variation attributed to the season. Residual variance remained constant throughout the simulation, causing a reduction in heritability overtime as the genetic variance decreased. Simulations began assuming an average yield of 3.00 t ha–1. The function addTraitAEG from the AlphaSimR package was utilized for the simulation of the phenotypic values. All simulation code is available on GitHub.1

Additive genetic effects, genotype-by-environment interaction, and residuals were simulated from Gaussian distribution using variance components estimated from the SoyNAM dataset (; Xavier et al., 2018a) as , , , and h2 = 0.12. The parameter estimation from the SoyNAM dataset was based on a multivariate genomic best linear unbiased prediction (GBLUP) model with unstructured genetic covariance and diagonal residual covariance, fitting grain yield from all 18 environments as response variables and using as explanatory variables the overall mean (fixed) and a polygenic term (random). The final estimates of the variance components for , , and h2 were obtained as averages across the 18 environments, whereas was computed as the average off-diagonal of the variance–covariance matrix.

The main simulation settings followed a soybean breeding program with 300 families per cycle and with 50 individuals per family, producing a total of 15,000 individuals per cycle. After crossing, the populations were inbred via single seed descendent (SSD) until F2:4, as shown in Figure 1, where lines were evaluated in field trials and genotyped with a single nucleotide polymorphism (SNP) array similar to the Soybean 6K SNP chip (). Individuals were then selected to become parents of the upcoming breeding cycle using the phenotypic and genotypic information. The calibration of genomic prediction leveraged data from the previous three breeding cycles, thus leveraging information from up to 45,000 individuals per model. The processes of selecting and crossing were repeated for 200 cycles to capture the theoretical plateau of genetic gains across all simulated parameters. Each breeding scenario was reproduced 60 times with different computational random seeds.

FIGURE 1

A second simulation with 100 breeding cycles was performed with varying numbers of families and offspring, where five combinations that use the same number of resources were chosen—300 × 50, 250 × 60, 200 × 75, 150 × 100, and 100 × 150—where the combinations correspond to the number of families and individuals per family, respectively. Each breeding scenario was reproduced 45 times with different random seeds.

Genotypic and phenotypic data were simulated with the R package AlphaSimR (), reproducing the previous methodological framework (). The software was utilized to simulate the founder population, perform selection, fingerprint individuals with the specified SNP chip, make crosses, generate offspring, inbred individuals, and simulate phenotypic values. All simulations and subsequent statistical analyses of the results were performed using R software (R Core Team, 2020). The code was run in parallel by distributing the multiple breeding scenarios over 960 cores, requiring approximately 10 h of computation per run. The R package doParallel (Ooi et al., 2019) was utilized to parallelize the runs.

Evaluation of Simulated Scenarios

Evaluation of the breeding strategies, selection intensities, and selection models was based on previous studies (). The evaluation criteria included the population mean across breeding cycles, genetic variance, and accuracy. Analyses were performed within a generation, combining the data from the repeated simulation runs. The statistical model for the analysis of simulated data was the following:

where y is the vector of the random variable of the simulated population; μ is the model intercept; X represents the incidence matrix, which is further divided to accommodate the three factors under evaluation (Xm, Xs, Xi, and Xp); m for the selection model; s for the breeding strategy; i for the selection intensity; p for the population design, as combinations of the number of families and individuals per family; and ε is the vector of residuals, assumed to be distributed as . The statistical test of multiple comparison was based on Tukey’s range test with 5% probability of error fit using the built-in R function TukeyHSD. This model was used to generate Figure 2.

FIGURE 2

Selection Models

The following selection models are evaluated: (1) True breeding values (TBV)—true breeding value, which serves as the upper limit of the achievable prediction power; (2) Random—random selection of individual, as the worst-case scenario; (3) Pheno—phenotypic-based selection without the use of genomic information; (4) GBLUP—the genomic best linear unbiased predictor fitted with REML (restricted maximum likelihood) variance components (Nejati-Javaremi et al., 1997; ); (5) BayesA—Bayesian shrinkage regression that assigns a t prior to marker effects (Meuwissen et al., 2001); (6) BayesB—an extension of BayesA with variable selection (Meuwissen et al., 2001); (7) FLM—fast Laplace model (Xavier, 2019), an empirical Bayes model with a double exponential prior for marker effects; and (8) RF—random forest regression (), a common machine learning procedure based on bootstrapping aggregation of multiple decision trees. The models GBLUP, BayesA, BayesB, and FLM were fitted using the R package bWGR (Xavier et al., 2019) and solved via expectation–maximization (EM). The model RF was fitted using the R package ranger (Wright et al., 2020) with default settings.

As a brief description of the GS model, these models in function on genomic information can be written in terms of the linear model:

where y is the vector of phenotypic values; X is the incidence matrix of the environment term treated as a fixed effect; b is a vector of environmental means; f (M) is the function of markers that describe the genetic merit of individuals; and ε is a random vector of residuals, assumed to be distributed as . The genetic function of markers, f (M), varied from model to model. For GBLUP, BayesA, and FLM, the function was linear and the marker effects were strictly additive; thus, the function of markers was f (M) = Mβ. The distinction of the models was the prior assigned to the distribution of marker effects, being normal for GBLUP, distributed as Student’s t for BayesA, and distributed as a double exponential for FLM. The function describing BayesB was f (M) = Mβγ, which is also linear, but with a variable selection term (γ) that caused further shrinkage to the Student’s t prior assigned to the marker effects. The only non-linear model under evaluation was random forest, in which case the genetic function is a linear ensemble of multiple independent regression trees (T): f (M) = n−1T (mM).

Breeding Strategy

The breeding strategies were based on soybean breeding designs previously described in the literature (; Sebastian et al., 2010; ; ; Smallwood et al., 2019). The following approaches were considered in this study:

AF: across-family selection. Genotypes are selected across families based on their estimated genetic merit, without regard for their family structure or any constraint for selecting multiple individuals from the same pedigree.

WF: within-family selection. In this strategy, all families were equally represented in the advancements. The best genotypes from each family are selected to become parents in the upcoming generations.

WPSF: within the pre-selected family. This strategy comprises two steps. Firstly, the family level selection is performed to identify the best-performing families (top 30%). Secondly, the selection of individuals occurs within the family. With fewer families to select from, more individuals per family will be parenting the upcoming generation compared to WF.

Selection Intensity

Four levels of selection intensity were considered: 2.5, 5.0, 7.5, and 10.0%. These values represent the percentages of individuals selected to be used as parents of the next generation. The selection of parental combinations was performed at random; thus, it is possible that not all selected individuals served as parents.

Results

Genetic Gains

The simulation results presented in Figure 3 summarize the population means over the course of 200 cycles. Supplementary Table 1 provides the population means for all combinations of treatments under evaluation in breeding cycles 10, 100, and 200. Across all scenarios, the population mean of random selection is anchored at the starting point. Selection of TBV represents the upper boundary of each scenario; hence, these are particularly useful to contrast the potential of the different scenarios. The highest long-term population means from selection on TBV occurred WF with loose selection intensities (7.5–10%). Genetic gains were generally closer to those from TBV when selections were performed WPSF.

FIGURE 3

Phenotypic selection outperformed GS over the course of 200 breeding cycles. Selection using random forest provided poor predictive performance in all scenarios, possibly due to the non-additive nature of the regression trees fitting a strictly additive genetic architecture. All linear genomic models (BayesA, BayesB, FLM, and GBLUP) provided similar outcomes. When conditioning for all other varying parameters, BayesA and FLM were the best-performing models within the first 100 breeding cycles (Figure 2).

After 10 cycles of selection, the highest gains were attained at the highest selection intensity (2.5%), which characterizes the short-term gain benefit from a higher selection pressure while the genetic variance is still abundant. After 100 breeding cycles, the genetic gains are affected by the combination of selection intensity and breeding strategy. For example, selection performed AF using BayesA provided the highest gains with a selection intensity of 10%, whereas, under WF, the highest gains occurred with a selection intensity of 2.5%. Such discrepancy is attributed to the amount of genetic variance left for long-term selection.

The highest long-term gains were reached when selections were performed WF. The maximum attainable, as benchmarked by selection upon TBV, resulted in a grain yield of 54 t ha–1 (WF), being 35% higher than AF selections and 46% higher than WPSF (Supplementary Table 2). The overall trend for long-term gains using GS followed the order WF > WPSF > AF. When the selections were based on phenotypic values, the genetic gains outpaced the GS run for all strategies (AF, WPSF, and WF), whereas that was not observed within the first 100 cycles (Figure 1). In fact, phenotypic selection WF was the third highest performing model, behind AF and WF selections performed on TBVs. The impact of each factor on the prediction accuracy over 200 breeding cycles is provided in Supplementary Figure 1.

Figure 2 summarizes the results of the simulation performed within 100 cycles, where different family sizes were an additional variable under evaluation. Within 10 breeding cycles, the scenario of 100 families with 150 individuals displayed the highest average, although the differences were negligible. Over the course of 50 and 100 breeding cycles, the number of families and the family sizes displayed significant differences in the genetic gains, with larger differences as generations progressed. The overall trend was that a greater number of families increase the gain in the long term.

Diversity Loss

The decay in genetic variance overtime is presented in Figure 4. The number of cycles to exhaust 80% of the genetic variance is provided in Supplementary Table 3. The study simulates closed populations without the inflow of external variation, the existing genetic variance consumed overtime as selection takes place. Overall, a fast decay in genetic variance is observed under a higher selection pressure, whereas a lower selection pressure preserved more genetic variance in the long term. When selection was performed at random, over 80% of the initial genetic variance remained after 200 breeding cycles. The interaction between the selection intensity and selection strategy was significant (p < 0.01) across all selection models.

FIGURE 4

Within-family selection preserved the genetic variance for more cycles (Figure 4). Selection WF based on TBVs exhausted 80% of the genetic variance within 48–69 breeding cycles, whereas AF and WPSF selections on TBVs exhausted 80% of the diversity between 25 and 42 cycles (Supplementary Table 3). Depletion of genetic variance was more pronounced with GS. Under the selection intensity of 10%, BayesA selection WF exhausted 80% of the variance after 184 cycles, whereas selections AF and WPSF display the same diversity loss after 54 and 58 cycles, respectively.

Diversity loss attributed to genetic drift is presented in Figure 5. These results assess the impact of bottlenecking the population through the various combinations of breeding strategy and selection intensity, utilizing random selections to avoid the confounding effect of directional selection. Higher rates of drift occurred under a higher selection pressure (2.5%). Strategy-wise, losses were highest for selection WPSF, with little difference across the selection intensities, ranging from −0.325 to −0.353%. The lowest rate of drift was observed under WF selection, with the rate of losses ranging from −0.199 to −0.136%.

FIGURE 5

Discussion

Genomic prediction has become an important tool for selection and breeding in agriculture as it can enhance the rate of genetic gain in comparison to pedigree and phenotype-based selection by leveraging information on relationship and the linkage disequilibrium between the marker and the quantitative trait locus (QTL; Meuwissen et al., 2001; ; ; , ; ; ). In soybean, the value of genomic prediction has been assessed and described in recent years (; Xavier et al., 2016, 2018a,b; ; Matei et al., 2018; Xavier and Rainey, 2020). These studies agreed that adequate composition of the training data is imperative to successful and accurate prediction. The definition of an optimized training set entails (1) maximizing the genetic relationship between the training and target populations and (2) collecting phenotypic information from year–location combinations that represent the target population of environments. Whereas factors that affect genomic predictions for short-term gains have been well characterized, it is unclear which factors affect long-term genetic gains. The answer for that would come from long-term simulations, such as the present study. Primarily, simulations enable the optimization of the modern breeding program in animal and plant species (Yu et al., 2005; ; , ; ; Muleta et al., 2018) by enabling the assessment of the breeding conditions that increase the rate of genetic gains, the conservation of useful genetic diversity, and the best allocation of breeding resource, such as the number of field plots, genotyping density, number of crosses, and population size (; ; ,).

Simulations indicate that linear models outperformed random forest for complex traits controlled by additive genetics and additive genotype-by-environment interactions. Under different scenarios, other studies found machine learning methods to display similar performances (Li et al., 2018; ). The discrepancy in the results is likely due to the nature of trait and population under evaluation, as machine learning predictions could be suitable for more structured populations and with some degree of epistatic control (Xavier, 2019; ). We also acknowledge that random forest was run with default settings in this study, and parameter tuning would benefit its predictive performance.

Selection factors provided a similar outcome to the findings in other studies (; Santantonio and Robbins, 2020), where the authors assessed balancing short- and long-term sustainable gains in plant breeding. Their results indicate that higher population sizes provide higher long-term gains. An alternative framework for the maximization of long-term response to selection is proposed by based on the use of selection indexes that account for allele frequency aiming to account for the value of rare loci and in short- and long-term gains. Under limited resources, our simulations indicate that a lower selection pressure generally contributes to long-term gains at the cost of compromising short-term gains. Across breeding strategies, WPSF appears to provide reasonable gains in both the short and the long term while having the range of gains being less influenced by selection pressure. WPSF is an intermediate between AF and WF, and the results are, in fact, intermediary between the short-term gains provided by AF selections and the long-term gains provided by WF selection.

The real-life trend of genetic gains in soybeans is positive, but variable across geographies. In North America, the rates of genetic gain have been estimated to be 23.4 kg ha–1 year–1 (), 26.5 kg ha–1 year–1 (Koester et al., 2014), and 16.8 kg ha–1 year–1 (Rogers et al., 2015). In the southern regions of Brazil, the rates of genetic gains were estimated to be 71.5 kg ha–1 year–1 (Lange and Federizzi, 2009) and 40.0 kg ha–1 year–1 (Todeschini et al., 2019); in Argentina, the rate has been reported to be 44.3 kg ha–1 year–1 (). These reports provide insight from the perspective of traditional breeding progress before the deployment of GS and, in most cases, with lengthy breeding cycles with the choice of parents taking place in advanced generations and commercial products. Our simulations provided higher annual gains than what has been reported; however, with the advent of earlier evaluations and increasing trust in genomic prediction, it is likely that annual genetic gains will be progressively and iteratively optimized for multiple factors, including those evaluated in the present study (model, selection intensity, family size, and breeding strategy).

The selection of unproven parents from earlier generations is often interpreted as gambling with high risk and high rewards, even though much of the risk is mitigated with the use of genomic information with robust statistical models calibrated with phenotypic data from multiple years. In addition to advancements, more opportunities arise with the use of genomics to predict and select the best combinations for crossing that further increase the probability of generating elite offspring. Previous studies have evaluated population-level selection strategies in further detail (; ; ; ; Ma et al., 2016; ; Matei et al., 2018) with the goal of preserving the segregation of low-frequency haplotypes for long-term gains (). Balancing the number of families and the family size can be a fundamental part of the strategy to continue the steady gains overtime (Figure 2), and, whereas the difference is not perceived in the short term, the magnitude of grain increases significantly overtime. Yet, multiple factors should be taken into account when allocating resources in terms of the number of families and family size (Lindgren et al., 1997; ).

Scenarios simulated as provided herein were based on the parental selection at the F4 stage, which is commonly perceived as an early generation for recycling as the quality and the quantity of phenotypic data are still scarce, of doubtful quality, and in many cases, without replication. Nevertheless, early recycling is a promising framework for speeding up the rate of genetic gain by shortening the length of the breeding cycles. In fact, shortening the breeding cycles while inducing multiple cycles a year reproduces a framework referred to as “speed breeding” (; Nagatoshi and Fujita, 2019; ). Recent studies often support recombination in the early stages of inbred development (), more so as the accuracy of selection in the early stages benefits greatly from the GS. Another important aspect of parental selection regards the management of genetic diversity in modern plant breeding, which is largely ignored and not always adequately measured (). Our results indicate that the multiple factors in the breeding design can affect the rate of diversity loss, mainly selection pressure and selection strategy (Supplementary Table 2), and that one must consider to balance these factors to attain the desired gain in the short term without compromising long-term gains. That is particularly the case for soybeans, whose germplasm-wise genetic diversity is considered low when compared to that of other species (Martin, 1982). Some Canadian soybean breeding programs have maintained diversity through decades of breeding while fixing maturity genes (). In the United States, soybean population structures and diversity varied by maturity group (Vaughn and Li, 2016), which suggests that new sources of variation could be obtained through the introgression of material from different regions.

The diversity available in breeding programs affects the accuracy of breeding values by dictating the amount of existing genetic signals to select upon an effective population size (Meuwissen, 2009). With restricted diversity, the genotyping density and marker distribution can be optimized to capture the existing variation in the target population with the goal of increasing genomic prediction accuracy (Ma et al., 2016). Of course, the long-term impacts of selection on genetic variance also vary depending on the genetic variance of interest, as the prominence of additive and non-additive variances is not the same over multiple cycles of selection (Paixão and Barton, 2016).

In soybeans, the management of diversity is necessary to ensure useful variability for future breeding objectives, such as yield performance under drought or waterlogging (Valliyodan et al., 2017), the seed oil and protein content profiles (Stewart-Brown et al., 2019), and disease resistance (). Monitoring genetic diversity in the genomic era can be performed through tracking overtime changes in allele frequencies (; ; Meuwissen et al., 2020). We showed that selection could quickly exhaust genetic diversity under closed breeding systems, and breeding systems can benefit from balancing short gains to preserve diversity and assure long-term gains. Such balance had been the focal point of recent studies (; ; Ru and Bernardo, 2019, 2020; Santantonio and Robbins, 2020) seeking for avenues to extend genetic resources with genomic tools, including the selection of material from germplasm collection to expend the genetic basis of elite programs. In addition to germplasm introgression, increases in genetic diversity in soybeans have been done in the past through mutagenic agents (; Khan, 2013; ; ) and more recently, through genome editing techniques based on CRISPR-Cas9 (, ,; ; Sun et al., 2015; Zheng et al., 2020) and target recombination for directional backcrossing (Ru and Bernardo, 2019, 2020).

The simulations performed in our study indicate that GS enables higher rates of genetic gain in the short and medium term compared with phenotype selection, but also led to faster extinction of the genetic variance. Thus, genomic prediction and selection must be applied mindfully with the purpose of maximizing gains while maintaining genetic variance. We found that a breeding strategy that balances selection at the family level, and within and across family at the individual level, can mitigate losses in genetic variance while providing satisfying genetic gains in the short term. Simulation is a powerful and inexpensive tool to test hypotheses, and for future studies, we envision addressing the importance of other important breeding parameters. Namely, future studies should focus on investigating (1) the optimal generation to select the parents and its trade-off with the accuracy of selection; (2) the influence of non-additive and non-infinitesimal genetic architecture and how machine learning would perform in such conditions; (3) the long-term effect of different models designed to select parental combinations; (4) the impact of different island models where new sources of variation are constantly infused into the main breeding panel; and (5) what would be the potential benefit of breeding hybrid soybeans assuming there are variable levels of dominance.

Conclusion

Long-term gains were influenced by the interaction among GS models, breeding strategy, and selection intensity. Adequate handling of these factors will aid breeding programs to ensure genetic gains in short, medium, and long term. Therefore, the breeding strategy is the most influential factor and, therefore, is a key criterion to conserve genetic variance and obtain the highest population mean overtime. The absolute impact of the selection intensity is lower than that of the breeding strategy and GS model. The benefits of balancing family size and the number of families were not perceived on short-term gains. Additive GS models (BayesA, BayesB, FLM, and GBLUP) have similar behaviors in selecting the best individuals, whereas RF has poor predictive performance when implemented with default settings. In summary, a combination of strategies may be necessary for balancing the short-, medium-, and long-term genetic gains in breeding programs while preserving genetic variance.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://github.com/Ederdbs/GenomicSelection.

Author contributions

ÉS and AX implemented the research, contributed with ideas to the algorithms, and wrote the manuscript. MF implemented the research, contributed ideas, and wrote the manuscript. All authors approved the final version to be published and agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.

Acknowledgments

The authors are grateful to the Universidade Estadual do Centro-Oeste (UNICENTRO), because this manuscript is part of the thesis of the ÉS and to Corteva Agriscience for support in computational resources. The authors also thank the reviewers for their helpful comments.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2021.637133/full#supplementary-material

References

  • 1

    Abdollahi-ArpanahiR.GianolaD.PeñagaricanoF. (2020). Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes.Genet. Sel. Evolut.52:12. 10.1186/s12711-020-00531-z

  • 2

    AkondM.LiuS.SchoenerL.AndersonJ. A.KantartziS. K.MeksemK.et al (2013). A SNP-Based genetic linkage map of soybean using the SoySNP6K Illumina infinium BeadChip genotyping array.Plant Genet. Genomics Biotechnol.18089. 10.5147/pggb.v1i3.154

  • 3

    AliM.ZhangL.DeLacyI.AriefV.DietersM.PfeifferW. H.et al (2020). Modeling and simulation of recurrent phenotypic and genomic selections in plant breeding under the presence of epistasis.Crop J. 8, 866877. 10.1016/j.cj.2020.04.002

  • 4

    AllierA.LehermeierC.CharcossetA.MoreauL.TeyssèdreS. (2019a). Improving short- and long-term genetic gain by accounting for within-family variance in optimal cross-selection.Front. Genet.10:1006. 10.3389/fgene.2019.01006

  • 5

    AllierA.TeyssèdreS.LehermeierC.ClaustresB.MalteseS.MelkiorS.et al (2019b). Assessment of breeding programs sustainability: application of phenotypic and genomic indicators to a North European grain maize program.Theor. Appl. Genet.13213211334. 10.1007/s00122-019-03280-w

  • 6

    BackesR. L.ReisM. S.CruzC. D.SediyamaT.SediyamaC. S. (2003). Correlation estimates and assessment of selection strategies in five soybean populations.CBAB3107116. 10.12702/1984-7033.v03n02a03

  • 7

    BernardoR. (2010). Genomewide selection with minimal crossing in self-pollinated crops.Crop Sci.50624627. 10.2135/cropsci2009.05.0250

  • 8

    BeukelaerH. D.BadkeY.FackV.MeyerG. D. (2017). Moving beyond managing realized genomic relationship in long-term genomic selection.Genetics20611271138. 10.1534/genetics.116.194449

  • 9

    BreimanL. (2001). Random forests.Mach. Learn.45532. 10.1023/A:1010933404324

  • 10

    BruceR. W.TorkamanehD.GraingerC.BelzileF.EskandariM.RajcanI. (2019). Genome-wide genetic diversity is maintained through decades of soybean breeding in Canada.Theor. Appl. Genet.13230893100. 10.1007/s00122-019-03408-y

  • 11

    CaiY.ChenL.LiuX.GuoC.SunS.WuC.et al (2018a). CRISPR/Cas9-mediated targeted mutagenesis of GmFT2a delays flowering time in soya bean.Plant Biotechnol. J.16176185. 10.1111/pbi.12758

  • 12

    CaiY.ChenL.SunS.WuC.YaoW.JiangB.et al (2018b). CRISPR/Cas9-mediated Deletion of large genomic fragments in soybean.Int. J. Mol. Sci.19:3835. 10.3390/ijms19123835

  • 13

    CaiY.ChenL.LiuX.SunS.WuC.JiangB.et al (2015). CRISPR/Cas9-mediated genome editing in soybean hairy roots.PLoS One10:e0136064. 10.1371/journal.pone.0136064

  • 14

    ChenG. K.MarjoramP.WallJ. D. (2009). Fast and flexible simulation of DNA sequence data.Genome Res.19136142. 10.1101/gr.083634.108

  • 15

    CowlingW. A.GaynorR. C.AntolínR.GorjancG.EdwardsS. M.PowellO.et al (2020). In silico simulation of future hybrid performance to evaluate heterotic pool formation in a self-pollinating crop.Sci. Rep.10:4037. 10.1038/s41598-020-61031-0

  • 16

    CowlingW. A.LiL.SiddiqueK. H. M.HenryonM.BergP.BanksR. G.et al (2017). Evolving gene banks: improving diverse populations of crop and exotic germplasm with optimal contribution selection.J. Exp. Bot.6819271939. 10.1093/jxb/erw406

  • 17

    CowlingW. A.StefanovaK. T.BeeckC. P.NelsonM. N.HargreavesB. L. W.SassO.et al (2015). Using the animal model to accelerate response to selection in a self-pollinating crop.G3514191428. 10.1534/g3.115.018838

  • 18

    CrossaJ.BeyeneY.KassaS.PérezP.HickeyJ. M.ChenC.et al (2013). Genomic prediction in maize breeding populations with genotyping-by-sequencing.G3319031926. 10.1534/g3.113.008227

  • 19

    CrossaJ.Pérez-RodríguezP.CuevasJ.Montesinos-LópezO.JarquínD.de los CamposG.et al (2017). Genomic selection in plant breeding: methods, models, and perspectives.Trends Plant Sci.22961975. 10.1016/j.tplants.2017.08.011

  • 20

    CurtinS. J.ZhangF.SanderJ. D.HaunW. J.StarkerC.BaltesN. J.et al (2011). Targeted mutagenesis of duplicated genes in soybean with zinc-finger nucleases.Plant Physiol.156466473. 10.1104/pp.111.172981

  • 21

    da SilvaF. M.De MatosPereiraE.ValB. H. P.PerecinD.MauroA. O. D.Unêda-TrevisoliS. H.et al (2018). Strategies to select soybean segregating populations with the goal of improving agronomic traits.Acta Scientiarum. Agronomy40:39324. 10.4025/actasciagron.v40i1.39324

  • 22

    DaetwylerH. D.CalusM. P. L.Pong-WongR.de los CamposG.HickeyJ. M. (2013). Genomic prediction in animals and plants: simulation of data, validation, reporting, and benchmarking.Genetics193347365. 10.1534/genetics.112.147983

  • 23

    DaetwylerH. D.HaydenM. J.SpangenbergG. C.HayesB. J. (2015). Selection on optimal haploid value increases genetic gain and preserves more genetic diversity relative to genomic selection.Genetics20013411348. 10.1534/genetics.115.178038

  • 24

    de Azevedo PeixotoL.MoellersT. C.ZhangJ.LorenzA. J.BheringL. L.BeavisW. D.et al (2017). Leveraging genomic prediction to scan germplasm collection for crop improvement.PLoS One12:e0179191. 10.1371/journal.pone.0179191

  • 25

    de Cássia PereiraF.BruziA. T.de MatosJ. W.RezendeB. A.PradoL. C.NunesJ. A. R. (2017). Implications of the population effect in the selection of soybean progeny.Plant Breed.136679687. 10.1111/pbr.12512

  • 26

    de Castro LaraL. A.PocrnicI.GaynorR. C.GorjancG. (2020). Temporal and genomic analysis of additive genetic variance in breeding programmes.bioRxiv[Preprint].10.1101/2020.08.29.273250

  • 27

    de FelipeM.GerdeJ. A.RotundoJ. L. (2016). Soybean Genetic gain in maturity Groups III to V in argentina from 1980 to 2015.Crop Sci.5630663077. 10.2135/cropsci2016.04.0214

  • 28

    de Los CamposG.HickeyJ. M.Pong-WongR.DaetwylerH. D.CalusM. P. L. (2013). Whole-genome regression and prediction methods applied to plant and animal breeding.Genetics193327345. 10.1534/genetics.112.143313

  • 29

    DemorestZ. L.CoffmanA.BaltesN. J.StoddardT. J.ClasenB. M.LuoS.et al (2016). Direct stacking of sequence-specific nuclease-induced mutations to produce high oleic and low linolenic soybean oil.BMC Plant Biol.16:225. 10.1186/s12870-016-0906-1

  • 30

    DiersB. W.SpechtJ.RaineyK. M.CreganP.SongQ.RamasubramanianV.et al (2018). Genetic architecture of soybean yield and agronomic traits.G3833673375. 10.1534/g3.118.200332

  • 31

    FAO (2021). FAO Global Statistical Yearbook, FAO Regional Statistical Yearbooks. Available online at: http://www.fao.org/faostat/en/#data/QC(accessed January 6, 2021).

  • 32

    FauxA.-M.GorjancG.GaynorR. C.BattaginM.EdwardsS. M.WilsonD. L.et al (2016). AlphaSim: software for breeding program simulation.Plant Genome9114. 10.3835/plantgenome2016.02.0013

  • 33

    FoxC. M.CaryT. R.ColgroveA. L.NafzigerE. D.HaudenshieldJ. S.HartmanG. L.et al (2013). Estimating soybean genetic gain for yield in the Northern United States—Influence of cropping history.Crop. Sci.5324732482. 10.2135/cropsci2012.12.0687

  • 34

    FuY.-B. (2015). Understanding crop genetic diversity under modern plant breeding.Theor. Appl. Genet.12821312142. 10.1007/s00122-015-2585-y

  • 35

    GaynorC.GorjancG.WilsonD.HickeyJ.MoneyD. (2020). AlphaSimR: Breeding Program Simulations. Available online at: https://CRAN.R-project.org/package=AlphaSimR(accessed July 6, 2020).

  • 36

    GaynorR. C.GorjancG.BentleyA. R.OberE. S.HowellP.JacksonR.et al (2017). A two-part strategy for using genomic selection to develop inbred lines.Crop Sci.5723722386. 10.2135/cropsci2016.09.0742

  • 37

    GoddardM. (2009). Genomic selection: prediction of accuracy and maximisation of long term response.Genetica136245257. 10.1007/s10709-008-9308-0

  • 38

    GoiffonM.KusmecA.WangL.HuG.SchnableP. S. (2017). Improving response in genomic selection with a population-based selection strategy: optimal population value selection.Genetics20616751682. 10.1534/genetics.116.197103

  • 39

    GonenS.Ros-FreixedesR.BattaginM.GorjancG.HickeyJ. M. (2017). A method for the allocation of sequencing resources in genotyped livestock populations.Genet. Sel. Evol.49:47. 10.1186/s12711-017-0322-5

  • 40

    GorjancG.BattaginM.DumasyJ.-F.AntolinR.GaynorR. C.HickeyJ. M. (2017a). Prospects for cost-effective genomic selection via accurate within-family imputation.Crop Sci.57216228. 10.2135/cropsci2016.06.0526

  • 41

    GorjancG.DumasyJ.-F.GonenS.GaynorR. C.AntolinR.HickeyJ. M. (2017b). Potential of low-coverage genotyping-by-sequencing and imputation for cost-effective genomic selection in biparental segregating populations.Crop Sci.5714041420. 10.2135/cropsci2016.08.0675

  • 42

    GorjancG.GaynorR. C.HickeyJ. M. (2018). Optimal cross selection for long-term genetic gain in two-part programs with rapid recurrent genomic selection.Theor. Appl. Genet.13119531966. 10.1007/s00122-018-3125-3

  • 43

    GorjancG.HickeyJ. M. (2018). AlphaMate: a program for optimizing selection, maintenance of diversity and mate allocation in breeding programs.Bioinformatics3434083411. 10.1093/bioinformatics/bty375

  • 44

    HabierD.FernandoR. L. (2009). Genomic selection using low-density marker panels.Genetics182343353. 10.1534/genetics.108.100289

  • 45

    HabierD.FernandoR. L.DekkersJ. C. M. (2007). The impact of genetic relationship information on genome-assisted breeding values.Genetics17723892397. 10.1534/genetics.107.081190

  • 46

    HabierD.FernandoR. L.KizilkayaK.GarrickD. J. (2011). Extension of the bayesian alphabet for genomic selection.BMC Bioinformatics12:186. 10.1186/1471-2105-12-186

  • 47

    HaunW.CoffmanA.ClasenB. M.DemorestZ. L.LowyA.RayE.et al (2014). Improved soybean oil quality by targeted mutagenesis of the fatty acid desaturase 2 gene family.Plant Biotechnol. J.12934940. 10.1111/pbi.12201

  • 48

    HeffnerE. L.LorenzA. J.JanninkJ.-L.SorrellsM. E. (2010). Plant breeding with genomic selection: gain per unit time and cost.Crop Sci.5016811690. 10.2135/cropsci2009.11.0662

  • 49

    HickeyJ. M.DreisigackerS.CrossaJ.HearneS.BabuR.PrasannaB. M.et al (2014). Evaluation of genomic selection training population designs and genotyping strategies in plant breeding programs using simulation.Crop Sci.5414761488. 10.2135/cropsci2013.03.0195

  • 50

    HickeyL. T.HafeezN. A.RobinsonH.JacksonS. A.Leal-BertioliS. C. M.TesterM.et al (2019). Breeding crops to feed 10 billion.Nat. Biotechnol.37744754. 10.1038/s41587-019-0152-9

  • 51

    JacobsT. B.LaFayetteP. R.SchmitzR. J.ParrottW. A. (2015). Targeted genome modifications in soybean with CRISPR/Cas9.BMC Biotechnol.15:16. 10.1186/s12896-015-0131-2

  • 52

    JähneF.HahnV.WürschumT.LeiserW. L. (2020). Speed breeding short-day crops by LED-controlled light schemes.Theor. Appl. Genet.13323352342. 10.1007/s00122-020-03601-4

  • 53

    JanninkJ.-L. (2010). Dynamics of long-term genomic selection.Genet. Sel. Evol.42:35. 10.1186/1297-9686-42-35

  • 54

    JarquínD.KocakK.PosadasL.HymaK.JedlickaJ.GraefG.et al (2014). Genotyping by sequencing for genomic prediction in a soybean breeding population.BMC Genomics15:740. 10.1186/1471-2164-15-740

  • 55

    JarquinD.SpechtJ.LorenzA. (2016). Prospects of genomic prediction in the USDA soybean germplasm collection: historical data creates robust models for enhancing selection of accessions.G3623292341. 10.1534/g3.116.031443

  • 56

    KemperK. E.BowmanP. J.PryceJ. E.HayesB. J.GoddardM. E. (2012). Long-term selection strategies for complex traits using high-density genetic markers.J. Dairy Sci.9546464656. 10.3168/jds.2011-5289

  • 57

    KhanH. (2013). A review on induced mutagenesis in soybean.J. Cereals Oilseeds41925. 10.5897/JCO10.004

  • 58

    KoesterR. P.SkoneczkaJ. A.CaryT. R.DiersB. W.AinsworthE. A. (2014). Historical gains in soybean (Glycine max Merr.) seed yield are driven by linear increases in light interception, energy conversion, and partitioning efficiencies.J. Exp. Bot.6533113321. 10.1093/jxb/eru187

  • 59

    LangeC. E.FederizziL. C. (2009). Estimation of soybean genetic progress in the South of Brazil using multi-environmental yield trials.Sci. Agric.66309316. 10.1590/S0103-90162009000300005

  • 60

    LiB.ZhangN.WangY.-G.GeorgeA. W.ReverterA.LiY. (2018). Genomic prediction of breeding values using a subset of snps identified by three machine learning methods.Front. Genet.9:237. 10.3389/fgene.2018.00237

  • 61

    LindgrenD.WeiR.-P.LeeS. J. (1997). How to calculate optimum family number when starting a breeding program.For. Sci.43206212. 10.1093/forestscience/43.2.206

  • 62

    MaY.ReifJ. C.JiangY.WenZ.WangD.LiuZ.et al (2016). Potential of marker selection to increase prediction accuracy of genomic selection in soybean (Glycine max L.).Mol. Breed.36:113. 10.1007/s11032-016-0504-9

  • 63

    MartinS. K. S. (1982). Effective population size for the soybean improvement program in maturity groups 00 to IV1.Crop Sci.22151152. 10.2135/cropsci1982.0011183X002200010035x

  • 64

    MateiG.WoyannL. G.MilioliA. S.de Bem OliveiraI.ZdziarskiA. D.ZanellaR.et al (2018). Genomic selection in soybean: accuracy and time gain in relation to phenotypic selection.Mol Breed.38:117. 10.1007/s11032-018-0872-4

  • 65

    MeuwissenT. H. E. (2009). Accuracy of breeding values of “unrelated” individuals predicted by dense SNP genotyping.Genet. Sel. Evol.41:35. 10.1186/1297-9686-41-35

  • 66

    MeuwissenT. H. E.HayesB. J.GoddardM. E. (2001). Prediction of total genetic value using genome-wide dense marker maps.Genetics15718191829.

  • 67

    MeuwissenT. H. E.SonessonA. K.GebregiwergisG.WoolliamsJ. A. (2020). Management of genetic diversity in the era of genomics.Front. Genet.11:880. 10.3389/fgene.2020.00880

  • 68

    MuletaK. T.PressoirG.MorrisG. P. (2018). Optimizing genomic selection for a sorghum breeding program in haiti: a simulation study.G3, 9391401. 10.1534/g3.118.200932

  • 69

    NagatoshiY.FujitaY. (2019). Accelerating Soybean Breeding in a CO2-Supplemented Growth Chamber.Plant Cell Physiol.607784. 10.1093/pcp/pcy189

  • 70

    Nejati-JavaremiA.SmithC.GibsonJ. P. (1997). Effect of total allelic relationship on accuracy of evaluation and response to selection.J. Anim. Sci.7517381745. 10.2527/1997.7571738x

  • 71

    OoiH.CorporationM.WestonS.TenenbaumD. (2019). doParallel: Foreach Parallel Adaptor for the “parallel” Package (Version 1.0.16). Available online at: https://CRAN.R-project.org/package=doParallel(accessed July 2, 2020).

  • 72

    PaixãoT.BartonN. H. (2016). The effect of gene interactions on the long-term response to selection.PNAS11344224427. 10.1073/pnas.1518830113

  • 73

    PookT.SchlatherM.SimianerH. (2019). MoBPS - modular breeding program simulator.bioRxiv[Preprint].10.1101/829333

  • 74

    R Core Team (2020). R: A Language and Environment for Statistical Computing.Vienna: R Foundation for Statistical Computing.

  • 75

    RogersJ.ChenP.ShiA.ZhangB.ScabooA.SmithS. F.et al (2015). Agronomic performance and genetic progress of selected historical soybean varieties in the southern USA.Plant Breed.1348593. 10.1111/pbr.12222

  • 76

    RuS.BernardoR. (2019). Targeted recombination to increase genetic gain in self-pollinated species.Theor. Appl. Genet.132289300. 10.1007/s00122-018-3216-1

  • 77

    RuS.BernardoR. (2020). Predicted genetic gains from introgressing chromosome segments from exotic germplasm into an elite soybean cultivar.Theor. Appl. Genet.133605614. 10.1007/s00122-019-03490-2

  • 78

    SantantonioN.RobbinsK. (2020). A hybrid optimal contribution approach to drive short-term gains while maintaining long-term sustainability in a modern plant breeding program.bioRxiv[Preprint].10.1101/2020.01.08.899039

  • 79

    SchmutzJ.CannonS. B.SchlueterJ.MaJ.MitrosT.NelsonW.et al (2010). Genome sequence of the palaeopolyploid soybean.Nature463178183. 10.1038/nature08670

  • 80

    SebastianS. A.StreitL. G.StephensP. A.ThompsonJ. A.HedgesB. R.FabriziusM. A.et al (2010). Context-specific marker-assisted selection for improved grain yield in elite soybean populations.Crop Sci.5011961206. 10.2135/cropsci2009.02.0078

  • 81

    SmallwoodC. J.SaxtonA. M.GillmanJ. D.BhandariH. S.WadlP. A.FallenB. D.et al (2019). Context-specific genomic selection strategies outperform phenotypic selection for soybean quantitative traits in the progeny row stage.Crop Sci.595467. 10.2135/cropsci2018.03.0197

  • 82

    SonahH.BastienM.IquiraE.TardivelA.LégaréG.BoyleB.et al (2013). An Improved Genotyping by Sequencing (GBS) approach offering increased versatility and efficiency of SNP discovery and genotyping.PLoS One8:e54603. 10.1371/journal.pone.0054603

  • 83

    Stewart-BrownB. B.SongQ.VaughnJ. N.LiZ. (2019). Genomic selection for yield and seed composition traits within an applied soybean breeding program.G3922532265. 10.1534/g3.118.200917

  • 84

    SunX.HuZ.ChenR.JiangQ.SongG.ZhangH.et al (2015). Targeted mutagenesis in soybean using the CRISPR-Cas9 system.Sci. Rep.5:10342. 10.1038/srep10342

  • 85

    TodeschiniM. H.MilioliA. S.RosaA. C.DallacorteL. V.PanhoM. C.MarcheseJ. A.et al (2019). Soybean genetic progress in South Brazil: physiological, phenological and agronomic traits.Euphytica215:124.

  • 86

    ToledoF. H.Pérez-RodríguezP.CrossaJ.BurgueñoJ. (2019). isqg: a binary framework for in silico quantitative genetics.G3924252428. 10.1534/g3.119.400373

  • 87

    ValliyodanB.YeH.SongL.MurphyM.ShannonJ. G.NguyenH. T. (2017). Genetic diversity and genomic strategies for improving drought and waterlogging tolerance in soybeans.J. Exp. Bot.6818351849. 10.1093/jxb/erw433

  • 88

    VanRadenP. M. (2008). Efficient methods to compute genomic predictions.J. Dairy Sci.9144144423. 10.3168/jds.2007-0980

  • 89

    VanRadenP. M.O’ConnellJ. R.WiggansG. R.WeigelK. A. (2011). Genomic evaluations with many more genotypes.Genet. Sel. Evol.43:10. 10.1186/1297-9686-43-10

  • 90

    VaughnJ. N.LiZ. (2016). Genomic signatures of North American soybean improvement inform diversity enrichment strategies and clarify the impact of hybridization.G3626932705. 10.1534/g3.116.029215

  • 91

    WrayN.GoddardM. (1994). Increasing long-term response to selection.Genet. Sel. Evol.26431451. 10.1186/1297-9686-26-5-431

  • 92

    WrightM. N.WagerS.ProbstP. (2020). ranger: A Fast Implementation of Random Forests. Available online at: https://CRAN.R-project.org/package=ranger(accessed July 2, 2020).

  • 93

    XavierA. (2019). Efficient estimation of marker effects in plant breeding.G3938553866. 10.1534/g3.119.400728

  • 94

    XavierA.JarquinD.HowardR.RamasubramanianV.SpechtJ. E.GraefG. L.et al (2018a). Genome-wide analysis of grain yield stability and environmental interactions in a multiparental soybean population.G38519529. 10.1534/g3.117.300300

  • 95

    XavierA.MuirW. M.RaineyK. M. (2016). Assessing predictive properties of genome-wide selection in soybeans.G3626112616. 10.1534/g3.116.032268

  • 96

    XavierA.MuirW. M.RaineyK. M. (2019). bWGR: bayesian whole-genome regression.Bioinformatics.3619571959. 10.1093/bioinformatics/btz794

  • 97

    XavierA.RaineyK. M. (2020). Quantitative genomic dissection of soybean yield components.G310665675. 10.1534/g3.119.400896

  • 98

    XavierA.ThapaR.MuirW. M.RaineyK. M. (2018b). Population and quantitative genomic properties of the USDA soybean germplasm collection.Plant Genet. Resour.16513523. 10.1017/S1479262118000102

  • 99

    YabeS.YamasakiM.EbanaK.HayashiT.IwataH. (2016). Island-model genomic selection for long-term genetic improvement of autogamous crops.PLoS One11:e0153945. 10.1371/journal.pone.0153945

  • 100

    YuJ.ArbelbideM.BernardoR. (2005). Power of in silico QTL mapping from phenotypic, pedigree, and marker data in a hybrid breeding program.Theor. Appl. Genet.11010611067. 10.1007/s00122-005-1926-7

  • 101

    ZhengN.LiT.DittmanJ. D.SuJ.LiR.GassmannW.et al (2020). CRISPR/Cas9-based gene editing using egg cell-specific promoters in Arabidopsis and Soybean.Front. Plant Sci.11:800. 10.3389/fpls.2020.00800

Summary

Keywords

long-term gains, soybean breeding, genomic selections, selection intensity, genomic prediction

Citation

Silva ÉDB, Xavier A and Faria MV (2021) Impact of Genomic Prediction Model, Selection Intensity, and Breeding Strategy on the Long-Term Genetic Gain and Genetic Erosion in Soybean Breeding. Front. Genet. 12:637133. doi: 10.3389/fgene.2021.637133

Received

02 December 2020

Accepted

05 August 2021

Published

01 September 2021

Volume

12 - 2021

Edited by

Waseem Hussain, International Rice Research Institute (IRRI), Philippines

Reviewed by

Yongkang Kim, University of Colorado Boulder, United States; Nicholas B. Larson, Mayo Clinic, United States

Updates

Copyright

*Correspondence: Éder David Borges da Silva,

This article was submitted to Statistical Genetics and Methodology, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics