Abstract
The last decade witnessed an unprecedented increase in the adoption of genomic selection (GS) and phenomics tools in plant breeding programs, especially in major cereal crops. GS has demonstrated the potential for selecting superior genotypes with high precision and accelerating the breeding cycle. Phenomics is a rapidly advancing domain to alleviate phenotyping bottlenecks and explores new large-scale phenotyping and data acquisition methods. In this review, we discuss the lesson learned from GS and phenomics in six self-pollinated crops, primarily focusing on rice, wheat, soybean, common bean, chickpea, and groundnut, and their implementation schemes are discussed after assessing their impact in the breeding programs. Here, the status of the adoption of genomics and phenomics is provided for those crops, with a complete GS overview. GS’s progress until 2020 is discussed in detail, and relevant information and links to the source codes are provided for implementing this technology into plant breeding programs, with most of the examples from wheat breeding programs. Detailed information about various phenotyping tools is provided to strengthen the field of phenomics for a plant breeder in the coming years. Finally, we highlight the benefits of merging genomic selection, phenomics, and machine and deep learning that have resulted in extraordinary results during recent years in wheat, rice, and soybean. Hence, there is a potential for adopting these technologies into crops like the common bean, chickpea, and groundnut. The adoption of phenomics and GS into different breeding programs will accelerate genetic gain that would create an impact on food security, realizing the need to feed an ever-growing population.
Introduction
Classical plant breeding has evolved considerably during the last century. This can be attributed to the combined action of molecular markers, improved experimental designs, statistical methods, understanding of the concepts of population and quantitative genetics, and integration of other disciplines such as entomology, pathology, soil science, engineering, agronomy, and physiology (Lopes et al., 2012; Ray et al., 2012). The evolution and adoption of all these techniques and tools has pushed the annual genetic gain of grain yield approximately 1% for major cereals like maize (Zea mays L.), rice (Oryzae sativa L.), and wheat (Triticum aestivum L.) (Lopes et al., 2012; Masuka et al., 2017a; Cobb et al., 2019). However, the rate of genetic gain in these crops is insufficient to cope with a 2% annual increase in the human population, which is expected to reach 9.8 billion by 2050 (Ray et al., 2012, 2013). Plant breeders and scientists are under pressure to develop new varieties and crops having higher yield, higher nutritional value, climate resilience, and disease and insect resistance. The solution requires the merging of new techniques like next-generation sequencing, genome-wide association studies, genomic selection, high throughput phenotyping, speed breeding, and CRISPR mediating gene editing with previously used tools and breeder’s skills (Varshney et al., 2021).
Since the 1980s, various molecular marker systems such as restriction fragment length polymorphism (RFLP), amplified fragment length polymorphism (AFLP), randomly amplified fragment polymorphic DNA (RAPD), simple sequence repeats (SSR), and single nucleotide polymorphism (SNP) have been developed and led to the identification of several quantitative trait loci (QTL) by linkage mapping in most crops (Zhu et al., 2008; Buerstmayr et al., 2009). This was further supported by the development of high throughput genotyping tools like diversity array technology (DArT), genotyping by sequencing (GBS), SNP array platform (for instance in wheat, several high-density SNP arrays are available including the Illumina Wheat 9K iSelect, Wheat 15K SNP array, 35K Axiom array developed from an 820K array, 55K SNP array developed from 660 arrays, Illumina 90K iSelect SNP array, and the Axiom wheat 660K SNP array), and next-generation sequencing, all of which provide tremendous amounts of marker information for utilization in mapping studies (Poland and Rife, 2012; Wang et al., 2014; Cui et al., 2017). Linkage mapping started with great hype for deciphering each trait’s genetic architecture and improving traits. This hype was later unrealized and attributed to low mapping resolution, QTL by genotype interaction, QTL by environment interaction, and QTL specific to a particular segregating population. However, there are some success stories utilizing linkage mapping for cultivar development, such as identification of Sub1 QTL for submergence tolerance in rice, Fhb1 QTL for providing tolerance to fusarium head blight in wheat, and QTL for providing resistance to cyst nematodes in soybean (Glycine max L.) (Concibido et al., 2004; Anderson et al., 2007; Septiningsih et al., 2009).
As the excitement about linkage mapping began to fade in the early 2000s, association mapping emerged as a new technique for studying marker-trait associations (Lander and Botstein, 1989; Breseghello and Sorrells, 2006; Yu et al., 2006). Association mapping has two main advantages over linkage mapping. Firstly, it saves the time, cost, and effort required to create a mapping population, as it uses a collection of germplasm, which can be easily assembled. Secondly, QTL can be mapped with higher resolution due to multiple historical recombination in the germplasm (Korte and Ashley, 2013). Several statistical models were developed, which varied from single locus to multi-locus models and multivariate models, including genotype by environment interaction, dominance, and epistasis components depending upon the associated crop’s nature (Huang et al., 2018; Tibbs Cortes et al., 2021). It was later realized that association mapping suffers from several limitations and has not shown the same potential as linkage mapping. The main reason for its low success was that it detects variants common in the mapping panel and thus has low power for detecting the rare variant. These rare variants could be identified by linkage mapping with segregation of alleles in the mapping population, which will provide higher power to detect rare QTL. Furthermore, several nested association mapping (NAM) and multi-parent advanced generation intercross (MAGIC) populations have been developed in most of the crop species discussed in this review for marker trait associations (MTAs) with high power and resolution during mapping studies (Yu et al., 2008; Diaz et al., 2021; Sandhu et al., 2021d).
By the late 2000s, plant breeders realized that they needed a technique that can not only identify associated QTL, but provides enough information to improve complex quantitative traits, for which previous mapping techniques had failed. Bernardo (Bernardo, 1994) achieved the earliest success for predicting breeding values by replacing pedigree based matrix with a marker based kinship using RFLP markers in maize. The term genomic selection (GS) was first coined in 2001 and uses whole genome-wide markers for predicting genomic-estimated breeding values (GEBVs) of individuals (Meuwissen et al., 2001; Bassi et al., 2015). GS is a technique that is not a design approach to create a cultivar with a specific QTL combination but uses a predictive approach to identify the line with the best breeding values using whole genome wide markers. It uses hundreds to thousands of genome-wide markers and previous years phenotypic data to build the GS model and predict the performance of new lines for quantitative traits (Isidro et al., 2015). If a marker is in linkage disequilibrium (LD) with the associated QTL, it will capture a large proportion of the genetic variance for predicting that trait’s performance. The interest of GS in plant breeding started after it was reported in maize in 2007 (Bernardo and Yu, 2007), and subsequently, several studies followed up utilizing this technique in different crop species (Crossa et al., 2014; Sun et al., 2017b). Plant breeders are rapidly adopting GS for selecting the parents of new crosses, removing poorly performing lines, predicting the performance of lines in untested environments, predicting quantitative traits early in the breeding pipeline (which was previously difficult due to less seed availability), and predicting the performance of traits that were not expressed in a particular environment owing to weather conditions (such as disease incidence) (Mohammadi et al., 2015; Millet et al., 2019; Cui et al., 2020; Krause et al., 2020).
Techniques like linkage and association mapping, marker-assisted selection (MAS), and GS need accurate phenotyping information for obtaining the desired results. GS requires phenotypic information for building models, and MAS requires phenotypic information for validating that a particular marker is associated with a trait (Kaur et al., 2021). In a large-scale breeding program, especially institutes such as the international maize and wheat improvement center (CIMMYT), international crops research institute for the semi-arid tropics (ICRISAT), international center for tropical agriculture (CIAT), and many breeding programs, approximately one hundred thousand breeding lines are screened every year at multiple locations, and the ability to accurately collect phenotyping data from this many lines and locations is challenging (Araus and Cairns, 2014; Araus et al., 2018; Zhang et al., 2020b; Juliana et al., 2020). Until now, advancements in phenotyping have not able to keep pace with developments in the field of genomics. However, the period from 2010 to 2019 witnessed the development and adoption of various phenomics tools in plant breeding under controlled and field conditions. Phenomics has unlocked the potential for phenotyping in plants for various traits like biotic (disease, insects, pests, viruses, and weeds) and abiotic stresses (drought, salinity, nutrient deficiency, flood, and other environmental factors), physiological (water use efficiency, photosynthesis mechanisms and different pigments), and agronomic traits (plant height, ear count and yield estimation) (Sankaran et al., 2015a; Zaman-Allah et al., 2015; Araus et al., 2018; Zhang et al., 2019). Merging phenomics with current genomics methods have improved progress in increasing the rate of genetic gain in many plant breeding programs (Masuka et al., 2017a, 2017b; Araus et al., 2018).
Several ground-based and aerial sensing platforms are being used with multiple sensors for measuring various traits in plants at different growth stages accurately, rapidly, and precisely (Sandhu et al., 2021e). The advancements in imaging sensors in plants varied from remote sensing to advanced autonomous vehicles equipped with RGB (red, green, and blue), near and far infrared, hyperspectral, light detection and ranging (LIDAR), 3D laser scanning, fluorescence, thermal, and spectro-radiometry imaging (Mewes et al., 2011; Atieno et al., 2017; Duan et al., 2018; Jimenez-Berni et al., 2018). Advanced autonomous platforms include ground robots, unmanned aerial vehicles (UAVs), and moving carts, which can take real-time data from several plots multiple times in a day to cover the whole season, generating enormous data for the plant breeders (Sankaran et al., 2019; Pattanashetti et al., 2020). Data generated from these sensors are longitudinally distributed in time and space, thus requiring skills from mathematics, statistics, data science, and machine learning for obtaining useful results, which could be merged with the genomic datasets and field breeding notes to make the best selections (Sun et al., 2017b; Sun et al., 2019).
The main objectives of this review are to 1) provide current status and overview about the advancements in genomics and phenomics for rice, wheat, soybean, common bean (Phaseolus vulgaris L.), chickpea (Cicer arietinum L.), and groundnut (Arachis hypogaea L). These six crops are chosen after considering the different rate of development during the last decade and importance in the human diet and crops were chosen separately from each cereal, legume and oilseed category; 2) offer an overview of GS and its implementation in cereal, legume, and oilseed breeding programs; 3) present developments in phenotyping platforms and imaging sensors for collecting phenotypic data; 4) discuss the status of below ground phenotyping techniques in plant breeding; and 5) discuss the merging of GS, machine learning, and phenomics information for increasing the genetic gain of breeding programs. This review is unique as it combines GS and phenomics in several important crops and will assist upcoming plant breeders understand the progress of this technology.
Overview of Six Crops Used in This Study
This review focuses on six important crops: rice, wheat, soybean, common bean, chickpea, and groundnut, as described above. Average productivity and area harvested from these crops are provided in Figure 1 from 1961 to 2019 (FAO 2019) (https://www.fao.org/statistics/en/). The average productivity increased from 1.9 to 4.7 ton/ha in rice, 1.1–3.5 ton/ha in wheat, 1.1–2.8 ton/ha in soybean, 0.5–0.9 ton/ha in common bean, 0.6–1.0 ton/ha in chickpea, and 0.8–1.6 ton/ha in groundnut from 1961 to 2019 (Figure 1A). There was an approximately three-fold increase in rice, wheat, and soybean productivity due to breeding and agronomic efforts. However, in common bean, chickpea, and groundnut, similar gains have not been observed (Figure 1A). Total area harvested for rice, wheat, and soybean constantly increased from 1961 to 2019 compared to common bean, chickpea, and groundnut (Figure 1B). Organizations like CIMMYT, ICRISAT, and CIAT are working on collaborative projects to increase the crop’s yield and awareness among farmers to use better agronomic practices in these crops (Pandey et al., 2020b; Thudi et al., 2020). Figure 2 shows the productivity of these six crops across continents from 1961 to 2019. The green revolution has resulted in the highest increase in productivity of rice and wheat in Asia, but since the last 2 decades, the rate of increase is linear, which won’t be sufficient for the current increasing population, thus, demanding additional scientific and technological breakthroughs (Ray et al., 2013).
FIGURE 1

The trend for yield and area harvested for the six crops, namely, rice, wheat, soybean, common bean, chickpea, and groundnut, staring from 1961 to 2019. (A) shows the yield trend and (B) shows the total area harvested for each crop since 1961. Source FAO, 2019 dated 02/20/2021.
FIGURE 2

The average productivity of the six crops, namely, rice (A), wheat (B), soybean (C), common bean (D), chickpea (E), and groundnut (F), across the continents starting from 1961 to 2019. These trends show huge potential for improving the crops using genomics and high throughput phenomics approaches in the coming years. Source FAO, 2019 dated 02/20/2021.
Rice is a major staple food consumed by more than one third of the world’s populations and meets up to 80% of the daily calorie intake for a vast majority of the Asian population (Kearney, 2010). Rice is a diploid species and has the smallest genome among the crops of economic importance, which assisted in its genome sequence in early 2002 (Sun et al., 2017a). Currently, several landraces, cultivar’s and wild relatives of rice have been sequenced, providing novel insights into the genome evolution of the crop and enhancing knowledge of new genes for rice breeding programs (Sun et al., 2017a). Due to its ease of transformation, abundant genetic and genomic resources (including mutants, cultivated landraces, and wild species), compact genome, and collinearity with other cereal crops, rice has become a model plant for crop genetic studies (Chen et al., 2014; Sun et al., 2017a). Rice was one of the crops which benefited from next generation sequencing due to its relatively modest level of repetitive sequences, making it easy to accurately align small reads to its reference genome (Abe et al., 2012; Takagi et al., 2013). Great success has been seen in rice for releasing cultivars having disease resistance, stress tolerance, improved nutritional value, and higher yield using CRISPR and other genome editing tools compared to the other five crops studied in this review (Mishra et al., 2018). The individual timeline for the genomics breakthrough in rice are depicted in Figure 3A.
FIGURE 3

Timeline for advancement of genomics in rice (A), wheat (B), soybean (C), common bean (D), chickpea (E), and groundnut (F).
Wheat is one of the three most consumed cereal crops globally, providing one-fifth of the total caloric input. It is grown on approximately 200 M ha globally and has widespread adaptation from 45 S in Argentina to 67 N in Scandinavia, including some high-altitude regions in the tropics and subtropics. Wheat went through two green revolution events, one in the late 1960s and another during the 1980s. During these green revolutions, the amount of gain for grain yield was approximately 3% in Asia, but has now declined to <0.9% annually, causing concern for breeders (Pingali, 2012). In spite of its hexaploid nature (2n = 6x = 42), wheat is one of the most widely studied crops at the genetic and cytogenetic level (Chhabra et al., 2021). The hexaploid nature of wheat has allowed the creation of major numerical and structural changes in chromosome constitution, that was made possible due to the efforts of Ernie Sears Sears et al, (1993). Sears et al. Sears et al, (1993) created aneuploid stocks of wheat, which were later used for several mapping and genome sequence studies. The last 3 decades witnessed a profound improvement in understanding wheat genomics and genetics due to the rapid adoption of DNA-based molecular markers such as RFLP, SSR, AFLP, DArT and SNPs from the early 1990s (Saini et al., 2022). These molecular markers have aided in conducting several QTL mapping studies using interval mapping, single-marker analysis, and GWAS (Muhu-Din Ahmed et al., 2020). Several development events in wheat, such as the first QTL mapping study, map-based cloning, first consensus map, adoption of high throughput genotyping arrays, translational genomics, gene editing, GS, and pangenome sequence are listed in Figure 3B to compare the development of genomics among the six crops (Poland et al., 2012; Rutkoski et al., 2016; Montenegro et al., 2017). Recently, the wheat pangenome sequence was released, with an average of 128,656 genes in each cultivar used, providing insights into genomic assisted crop improvement (Montenegro et al., 2017; Khan et al., 2020).
Soybean is a unique legume and oilseed crop consumed by humans, livestock, and poultry worldwide, as it is a rich source of protein, oil, essential amino acids, and metabolizable energy. The total protein and oil content is important for soybean, as 60% of its value comes from its meal and the remaining 40% from its oil (Warrington et al., 2015). A minimum of 47.5% protein content is required in soybean meal to develop livestock and poultry properly (Hurburgh et al., 1990). Although the domestication of soybean started in Asia, it found a welcomed home in the United States and Brazil. Brazil led production in 2019 (37%), closely followed by the United States (28%), Argentina (16%) and China (5%) (http://soystats.com/). Advancement of genomics started after 2010 in soybean with the genome sequence of cultivated soybean variety Williams 82 (Wm82) in the United States (Schmutz et al., 2010). In addition to the genome sequence of Wm82, several other accessions/lines were sequenced by China and Japan. The genome sequence was the base point for developing millions of SNP markers and thousands of SSR markers (Deshmukh et al., 2014). The development of next-generation sequencing and complexity reduction methods, namely GBS, restriction site-associated DNA (RAD) sequencing, and reduced representation libraries (RRL), are being routinely used. Technology advances have resulted in the development of several SNP arrays such as Illumina Infinium BeadChip (50K), Affymetrix Axiom (355K), Illumina Infinium BeadChip (8K), and Affymetrix Axiom (180), with many more routinely used for genotyping soybean plant introduction lines (Xu et al., 2013; Deshmukh et al., 2014). Recently, whole-genome assemblies released from 26 different soybean varieties and lead to the structuring of the soybean pangenome and the sequences of previously cultivated lines in the United States, China, and Japan (Liu et al., 2020). QTLs have been mapped for many quality, biotic and abiotic stress, and agronomic traits in soybean using QTL mapping and GWAS (Merry et al., 2019; Qin et al., 2019; Ravelombola et al., 2020; Shook et al., 2021). The complete details about the adoption of various genomic tools is presented as a timeline in Figure 3C.
Common bean is an important cultivated legume crop consumed worldwide, especially in developing countries in the tropics. It’s seed is rich in protein and other micronutrients like zinc and iron and provides a cheap energy source to millions of people in Africa, South Asia, and Latin America, where per capita consumption can reach up to 65 kg annually (Keller et al., 2020). Until now, the main hindrance in reaching the maximum threshold in bean is challenging environmental conditions. The important biotic and abiotic stresses affecting their performance include drought, low phosphorus, and diseases. Drought and low phosphorus have resulted in up to 70 and 50% yield loss and are the main focus for the common bean breeding programs worldwide (Beebe et al., 2008). Another important breeding objective is to reduce cooking time, as it retains the minerals and proteins which usually get lost with long cooking time. Less cooking time also saves energy and time for other tasks (Diaz et al., 2021). Mesoamerican and Andean have been described as two gene pools in common bean, with greater diversity present in the Mesoamerican pool. More progress for improving yield, disease resistance, and quality traits is reported in the Mesoamerican pool, but moving of genes/QTLs from this pool to the Andean pool has been challenging, especially due to linkage drag and incompatibility (Schmutz et al., 2014). Furthermore, with the sequencing of 100 landraces and 60 wild relatives, it is confirmed that there were two independent domestication events for common bean (Schmutz et al., 2014). MTAs have been performed for different disease traits, quality attributes, and yield traits for both pools in different studies (Giovannoni et al., 1991; Berry et al., 2020; Diaz et al., 2021). The timeline for the adoption of several genomic tools in common bean is provided in Figure 3D.
Chickpea is an important food legume crop grown on 13.72 M ha in 55 countries globally, producing 14.25 M tons (FAO 2019). Chickpea can produce 3.0–4.0 tons/ha, but currently it is restricted to ∼1 ton/ha due to limited work on biotic and abiotic stresses (Roorkiwal et al., 2018b). Total production of chickpea increased from 1961 at a slow pace due to the use and reuse of limited germplasm/donor parents (Varshney et al., 2013). Important abiotic stresses include drought and heat, while biotic stresses include ascochyta blight (Ascochyta rabiei), collar rot (Sclerotium rolfsii), dry root rot (Rhizoctonia bataticola), botrytis grey mold (Botrytis cinerea), and fusarium wilt (Fusarium oxysporum) that reduce crop yield. Chickpea is a rich source of dietary protein, minerals, carbohydrates, and essential nutrients, thus has the potential for improving malnutrition problems in south Asia and sub-Saharan Africa, where it is mostly grown (Varshney et al., 2013; Pandey et al., 2016; Roorkiwal et al., 2018b). The last couple of years have witnessed the adoption of several whole-genome sequencing and resequencing projects for sequencing several cultivars and landraces to explore genetic diversity (Verma et al., 2015; Varshney et al., 2019). The adoption of these next-generation sequencing methods in this decade has witnessed a shift from maker-based genotyping to sequenced based genotyping of diversified germplasm and breeding lines (Jaganathan et al., 2015; Li et al., 2018b). The development of chickpea varieties is further strengthened by the adoption of GS and speed breeding methods. The timeline for adopting several genomic tools in chickpea is provided in Figure 3E for comparison with other crops.
Groundnut or peanut is a nutritious oilseed and legume crop grown on 29.5 M ha in more than 100 countries globally, with a total productivity of 48.8 tons during 2019. Africa (55%) and Asia (40.3%) together have more than 95% of the groundnut cultivation area, account for 31.5 and 59.6% of the total production, respectively (FAO 2019). All parts of groundnut are a nutrition source for humans and animals. Groundnut plays an important role in fighting malnutrition as 80% of its seed consists of nutritious fats and proteins; furthermore, the crop can improve soil fertility and break the disease cycle when grown under rotation with cereal crops (Pandey et al., 2020b). Previously, groundnut was used as an edible crop in western countries, while in Asia and Africa, it was mainly used for oil production. The development of high oleic acid groundnut lines and awareness about its nutritional value has resulted in the rapid adoption of this crop as a primary food source across the globe. Genomic studies in groundnut gained momentum after the first SSR based genetic map was developed in 2009 (Varshney et al., 2009). Several MAGIC and NAM populations were developed for deciphering the genetic architecture of complex traits like aflatoxin contamination, oleic acid content, drought, and disease tolerance (Pandey et al., 2016; Chu et al., 2018). The last decade was the golden era for developing genomics in groundnut and several resources, such as a reference genome for cultivated tetraploid and progenitors, high density genotyping, genome-wide genetic markers, gene expression atlases, and MAGIC and NAM populations, were developed, with a timeline shown in Figure 3F (Akohoue et al., 2020; Pandey et al., 2020b, 2020a). Still, this crop has many other priorities for coming years like reference genome sequence for wild diploids, functional genomics, and high throughput genotyping assays, which might improve breeding for groundnut.
Genomic Selection and Its Implementation in the Breeding Program
As mentioned in the introduction, GS is a technique for predicting GEBVs using training and testing populations (Bhandari et al., 2019; Crossa et al., 2019). GS has been efficiently applied in wheat, rice, and soybean; however, in crops like chickpea, common bean, and groundnut, its progress is slow. Figure 4 summarizes the trends for GS studies conducted from 2011–20, and it is clear that GS was rapidly adopted in wheat, and other crops are following the trend at a slower pace. The slow rate of adoption in chickpea, common bean, and groundnut is due to the recent advancement of genomics tools, genome sequences, assembly of the core collection, pangenome, and whole-genome resequencing (Verma et al., 2015; Roorkiwal et al., 2018a; Pandey et al., 2020b). Thus, the coming years will see efforts in the adoption of GS and other new genomics tools to improve the genetic gain for these globally important crops.
FIGURE 4

Trends in publications mentioning/discussing the six crops and genomic selection since last decade (2011–2020). Search was made using associated crop and genomic selection keywork in the abstract. Source PubMed dated 02/25/2021.
Several factors affect the performance of GS models. They have been explored in multiple studies during the last decade, ranging from training population size, relatedness between training and testing population, cross-validation strategy, marker density, heritability of the trait, population structure, and prediction model (Yabe et al., 2018; Frouin et al., 2019; Huang et al., 2019). It is observed that a certain population size is required for model training to avoid model overfitting (Liu et al., 2018). A large training population size results in higher prediction accuracy; however, a smaller than desirable size is often used due to the costs associated with their phenotyping and genotyping (Heffner et al., 2011). In wheat, it was observed that prediction accuracy constantly increased when training population size was increased from 24 to 96 (Heffner et al., 2011). Similarly, another study in wheat showed the same trend when population size was increased from 250 to 2000 (Heslot et al., 2012). Relatedness between genotypes in the training and testing sets significantly affects prediction accuracy (Lozada et al., 2019). More related lines share common ancestors in a small number of prior generations, have fewer recombination events, and conserve marker and QTL linkage phases (Heslot et al., 2012). The effect of training population size is not observed on prediction accuracy when individuals are closely related in the training and testing set (Mujibi et al., 2011).
Since GS uses genome-wide markers, proper genotyping is required. To date, several genotyping platforms like RFLP, AFLP, SSR, DArT, and SNP chips have been explored for GS; however, since 2012, with the emergence of the GBS platform, it has dominated all previous platforms due to the low cost, genome-wide coverage, and reduced sampling bias compared to SNP chips (Poland and Rife, 2012; Poland, 2015). It has been seen that large marker density results in model overfitting, causing lower independent prediction accuracies (Werner et al., 2018). However, larger marker density is favored as it increases the probability of LD between the QTL and marker. Lower LD combined with a larger training population and higher marker density largely improves prediction accuracy (Crossa et al., 2014; Norman et al., 2018). Heritability and population size plays an important role in prediction, as they determine the amount of genetic variation that the associated prediction model could capture (Guo et al., 2014). A strong correlation is observed between the GS model’s prediction accuracy and the trait’s heritability in the training population (Edwards et al., 2019). Various parametric and non-parametric machine and deep learning models have been explored for GS in all the mentioned crops (Table 1). Until now, none of the models have significantly demonstrated superiority for all traits in all crops (Liu et al., 2019; Ravelombola et al., 2020; Sandhu et al., 2021b). Breeders should explore various models in their programs for different traits and use the best performing model final predictions after considering accuracy, error and computational burden (Wang et al., 2018). Table 1 provides information about models explored for GS, with their associated characteristics and links to the source codes, for breeders, if they want to explore them in their crop of interest.
TABLE 1
| Model | Model type | Characteristics | Codes | References |
|---|---|---|---|---|
| Ridge regression best linear unbiased prediction (RRBLUP) | Mixed model/parametric model | Ridge regression is equivalent to traditional BLUP, which assumes each marker has a small effect with constant variance and obtain this by using ridge regression parameter | https://github.com/cran/rrBLUP | Endelman (2011) |
| Genomic best linear unbiased prediction (GBLUP) | Mixed model/parametric model | GBLUP uses the relationship between genotypes for predicting their performance | https://github.com/gdlc/BGLR-R/blob/master/inst/md/GBLUP.md | Bernardo and Yu (2007) |
| Bayes A | Bayesian model/parametric model | Marker effects are obtained assuming a scaled inverted chi-square distribution of variance parameters, and all the markers are assumed to have an effect | https://github.com/gdlc/BGLR-R/blob/master/inst/md/Validation.md | Meuwissen et al. (2001) |
| Bayes B | Bayesian model/parametric model | It losses the restrictions of Bayes A and allows some markers to have zero effect | https://github.com/ShiuLab/GenomicSelection/blob/master/working/predict_BGLR.R | Meuwissen et al. (2001) |
| Bayes C | Bayesian model/parametric model | Bayes C uses the scaled-t mixture with a point mass at zero with scaled-t distribution | https://github.com/cma2015/G2P/blob/master/R/GSEnsemble.R | Pérez and De Los Campos (2014) |
| Bayes Cpi | Bayesian model/parametric model | Bayes Cpi is a special case of Bayes B but it assumes a constant variance for markers with non-zero effect | https://github.com/gdlc/BGLR-R/blob/master/inst/md/BayesianAlphabet.md | Pérez and De Los Campos (2014) |
| Bayes D | Bayesian model/parametric model | Bayes D uses the scaled-t distribution by estimating scale parameter from the datasets | https://github.com/gdlc/BGLR-R/blob/master/inst/md/BayesianAlphabet.md | Pérez and De Los Campos (2014) |
| Bayes Lasso (BL) | Bayesian model/parametric model | BL assumes a fixed set of markers have zero effect, and the remaining follow the double exponential distribution for variance components | https://github.com/Sandhu-WSU/Genomic-Selection-tutorial/blob/master/GSworkshop_InProg.R | Tishbirani (1996) |
| Elastic net (EN) | Parametric model | EN is the intermediate between ridge regression and Lasso using an average weight penalty for marker effect estimation | https://datadryad.org/stash/dataset/doi:10.5061/dryad.7f138 | Crain et al. (2018) |
| Bayesian threshold GBLUP (TGBLUP) | Bayesian model/parametric model | TGBLUP is a threshold models for ordinal and categorical data | https://github.com/gdlc/BGLR-R | Montesinos-López et al. (2019a) |
| Bayesian multi-trait and multi-environment model (BMTME) | Bayesian model/parametric model | BMTME is the multi-trait version of the Bayesian models | https://www.g3journal.org/content/9/5/1355#app-1 | Montesinos-López et al. (2018) |
| Reproducing kernel Hilbert space (RKHS) | Bayesian kernel-based/semi-parametric model | RKHS uses the kernel functions on the set of distances among markers to estimate the relationship matrix between the individuals and assumes the absence of linearity assumption | https://github.com/gdlc/BGLR-R/blob/master/inst/md/RKHS.md | Pérez and De Los Campos (2014) |
| Reaction norm | Mixed model/parametric model | Reaction norm model the interaction between the markers and environmental covariates using covariate functions | https://github.com/gdlc/BGLR-R/blob/master/inst/md/BayesianAlphabet.md | Jarquín et al. (2014) |
| Support vector machine (SVM) | Machine learning/semi-parametric model | SVM is another semiparametric model that uses kernel function, and its cost function is sensitive to residuals coefficient | https://github.com/afiliot/Kernel-Methods-For-Genomics | Ravelombola et al. (2020) |
| Random forests (RF) | Machine learning/non-parametric model | RF uses a network of the tree with varying number of nodes, mtry, and depth for building the final forest for predictions | https://github.com/xuanxu/nimbus | Guo et al. (2020a) |
| Gradient boost machine (GBM) | Machine learning/non-parametric model | GBM is an ensemble learning model and is similar to tree-based models used to reduce the subset the SNPs using linkage disequilibrium for obtaining higher prediction accuracy | https://cran.r-project.org/web/packages/gbm/index.html | Li et al. (2018a) |
| Functional B spline | Machine learning/non-parametric model | Functional B splines use the piecewise polynomial of degree n-1 in a variable x. Different spline functions are tried at a given degree for predicting the output | Montesinos-López et al. (2017) | |
| Partial least square regression (PLSR) | Machine learning/non-parametric model | PLSR is a dimensional reduction approach which uses latent variables derived from predictors to link with the response variables | https://datadryad.org/stash/dataset/doi:10.5061/dryad.7f138 | Crain et al. (2018) |
| Multilayer perceptron (MLP) | Deep learning/non-parametric model | MLP uses the combination of input, multiple hidden and output layers using a large number of neurons for building the relationship between the predictors and output | https://github.com/saeedkhaki92/Yield-Prediction-DNN | Sandhu et al. (2021b) |
| Convolutional neural network (CNN) | Deep learning/non-parametric model | CNN employs convolutional, flattening, pooling, and dense layer for prediction using filers and kernels to reduce the excess predictors from the model | https://github.com/Sandhu-WSU/DL_Wheat | Sandhu et al. (2021b) |
| Dual CNN | Deep learning/non-parametric model | Dual CNN uses two parallel streams of CNN and sums up layer is used for predictions | https://github.com/kateyliu/DL_gwas | Liu et al. (2019) |
| Arc-cosine kernel (AK) | Deep learning/non-parametric model | AK estimates the stepwise covariance matrix by adding more hidden layer in model training | https://www.frontiersin.org/articles/10.3389/fgene.2019.01168/full#h7 | Crossa et al. (2019) |
| DeepGS | Deep learning/non-parametric model | DeepGS uses deep CNN consisting of one input, one convolutional, one sampling, two fully connected and one output layer for building a relationship | https://github.com/cma2015/DeepGS | Ma et al. (2018) |
| Recurrent neural network (RNN) | Deep learning/non-parametric model | RNN is best for predictions under the presence of longitudinal or time-series data, as it uses the memory state to retains the information from previous data and update its prediction with new information | https://figshare.com/s/5cd5e5e4eaeef55b721f?file=24963563 | Maldonado et al. (2020) |
The detailed information about various models explored for genomic selection in different crops, with associated model type, characteristics, and links to the source codes that could be easily implemented in various breeding programs.
GS is being applied with two approaches in the plant breeding program. Firstly, it is applied at the early generation (F1) or (F2:3) for a rapid generation cycle with a short interval. This selection is used to predict the breeding values and helps the researchers select parents for new crosses or remove inferior performing lines earlier in the pipeline (Bassi et al., 2015; Gaynor et al., 2017). Therefore, linear additive models are sufficient for predicting at this stage. The second approach involves predicting the plant’s total genetic value by considering additive, dominance, epistasis, and environmental effects (Monteverde et al., 2019; Francki et al., 2020; Guo et al., 2020b). Genetic values are predicted for most environments using different combinations of environment, genotype by environment, and weather parameters in the GS models (Monteverde et al., 2019; Francki et al., 2020). Rapid progress is happening in the second approach for predicting traits in an untested environment with better prediction accuracy (Jarquín et al., 2017; Gill et al., 2021) (Table 2). We provided an outline of the implementation of GS in a plant breeding program for self-pollinated crops, where GS could be either applied within the cycle selection, across cycles, with multi-location selection and the inclusion of genotype by environment interactions, and utilization of phenomics datasets for improving prediction accuracies for complex traits (Figure 5). In this outline, it is assumed that a single generation is possible in a year until speed breeding is used to reach homozygosity (Watson et al., 2018).
TABLE 2
| Crop | Trait | Population size | Validation strategy | Marker intensity | Model | Single or multi-trait analysis | Accuracy | Country | References |
|---|---|---|---|---|---|---|---|---|---|
| Rice | Arsenic content | 228 accessions | CV and IV | 22,370 SNPs | GBLUP, Bayes A, RKHS | Single trait | 0.43–0.48 | France | Frouin et al. (2019) |
| Days to heading | 112 cultivars | CV and IV | 408,372 SNPs | GBLUP | Single and multi-trait models | 0.93–0.98 | Japan | Jarquin et al. (2020) | |
| Drought tolerance | 280 accessions | CV | 215,000 SNPs | GBLUP and RKHS | Single and multi-environment models | 0.22–0.80 | France | Bhandari et al. (2019) | |
| Grain weight distribution | 128 cultivars | CV | 42,508 SNPs | GBLUP and PLS | Single trait models | 0.28–0.53 | Japan | Yabe et al. (2018) | |
| Grain yield and quality traits | 327 & 320 breedinglines | CV and IV | 92,430 and 44,598 SNPs | GBLUP, PLS, and reaction norm mode | Single and multi-environment models | 0.11–0.82 | Uruguay | Monteverde et al. (2019) | |
| Rice blast | 161 and 162 accessions | CV and IV | 66,109 and 29,030 SNPs | GBLUP, Bayes A, Bayes C and MLP | Single and multi-trait models | 0.15–0.72 | United States | Huang et al. (2019) | |
| Ten agronomic traits | 1,495 hybrid rice | CV and IV | 232,935 SNPs | GBLUP, additive and dominance model | Single trait | 0.54–0.92 | China | Cui et al. (2020) | |
| Wheat | Anther extrusion | 603 lines | CV and IV | 7,649 SNPs | Reaction norm model | Single trait models | -0.03–0.74 | CIMMYT | Adhikari et al. (2020) |
| Days to heading | 286 accessions | CV | 9,047 SNPs | RRBLUP, BA, BB, BC, BL, and BRR | Single trait models | −0.04–0.45 | Iran | Shabannejad et al. (2020) | |
| Days to heading and plant height | 3,486 lines | CV | 2,083 SNPs | MTDL | Multi-trait models | 0.39–0.62 | CIMMYT | Montesinos-López et al. (2019b) | |
| End-use quality traits | 401 lines | CV and IV | 4,598 SNPs | RRBLUP | Single trait models | 0.38–0.63 | Austria | Michel et al. (2018) | |
| End-use quality traits | 1912 lines | CV and IV | 21,210 | GBLUP | Single and multi-trait models | 0.28–0.69 | France | Ben-Sadoun et al. (2020) | |
| End-use quality traits | 179 lines | CV | 16,383 SNPs | RRBLUP | Single trait models | 0.10–0.48 | Spain | Mérida-García et al. (2019) | |
| Fusarium head blight and Septoria tritici blotch | 642 lines | CV and IV | 8,398 SNPs | RRBLUP | Single trait models | −0.41–0.88 | Germany | Herter et al. (2019) | |
| Grain yield | 1,325 lines | CV and IV | 9,290 SNPs | GBLUP | Single and multi-trait models | 0.18–0.31 | Denmark | Tsai et al. (2020) | |
| Grain yield | 1716 lines | CV and IV | 15,853 SNPs | RRBLUP | Single trait | −0.05–0.20 | United States of America | Lozada et al. (2020b) | |
| Grain yield, days to heading, and plant height | 270 lines | CV and IV | 14,163 SNPs | GBLUP and DL | Single and multi-environment models | 0.02–0.91 | CIMMYT | Montesinos-López et al. (2019c) | |
| Grain yield and end-use quality traits | 282 lines | CV and IV | 7,426 SNPs | BL, RF, RKHS, and RRBLUP | Single trait | 0.07–0.68 | United States | Hu et al. (2019) | |
| Grain yield and protein content | 480 lines | CV | 7,300 SNPs | GBLUP | Single and multi-trait models | −0.60–0.74 | Austria | Michel et al. (2019a) | |
| Leaf and stripe rust | 1744 single crosses | CV | 15K SNPs | GBLUP | Single trait | 0.16–0.50 | Germany | Beukert et al. (2020) | |
| Powdery mildew | 467 lines | CV | 34,095 SNPs | RRBLUP | Single trait model | 0.36–0.67 | United States of America | Sarinelli et al. (2019) | |
| Septoria tritici blotch | 175 lines | CV | 6,097 SNPs | RRBLUP | Single trait model | 0.47–0.62 | Sweden | Odilbekov et al. (2019) | |
| Snow mold | 753 lines | CV and IV | 12,681 SNPs | RRBLUP, GBLUP and RKHS | Single trait | −0.09–0.92 | United States | Lozada et al. (2019) | |
| Winter hardiness and frost tolerance | 504 lines | CV and IV | 1,413 SNPs | GBLUP | Single traits | −0.02–0.58 | Austria | Michel et al. (2019b) | |
| Soybean | Amino acids | 249 lines | CV | 23,279 SNPs | RRBLUP and GBLUP | Single trait model | 0.18–0.85 | United States | Qin et al. (2019) |
| Chlorophyll content tolerance | 172 lines | CV | 4,089 SNPs | RRBLUP, GBLUP, BL, RF and SVM | Single trait model | 0.31–0.76 | United States | Ravelombola et al. (2019) | |
| Soybean cyst nematode | 234 lines | CV | 3,782 SNPs | RRBLUP, GBLUP, BL, RF and SVM | Single trait model | 0.05–0.53 | United States | Ravelombola et al. (2020) | |
| Yield, protein content, oil and height | 5,000 lines | CV | 4,236 SNPs | DualCNN, deepGS, singleCNN and RRBLUP | Single trait model | 0.23–0.47 | United States | Liu et al. (2019) | |
| Yield | 5,600 lines | CV | 4,600 SNPs | GBLUP | Single trait model | 0.27–0.60 | United States | Howard and Jarquin (2019) | |
| Groundnut | Leaflet length, 100 seed weight, days to maturity and total yield | 281 accessions | CV | 493 SNPs | RRBLUP | Single trait model | 0.02–0.62 | South Africa | Akohoue et al. (2020) |
| Seed weight, oleic acid content, total yield, and days to maturity | 340 lines | CV and IV | 13,355 SNPs | Reaction norm model | Single and multi-environment models | 0.19–0.89 | India | Pandey et al. (2020a) | |
| Chickpea | Grain yield, podding time, emergence score and seed number | 132 lines | CV | 144,777 SNPs | BL and BRR | Single trait model | 0.22–0.81 | Australia | Li et al. (2018b) |
| Seed weight, biomass, harvest index, and seed yield | 320 breeding lines | CV and IV | 89,000 SNPs | Reaction norm models | Single and multi-environment models | −0.01–0.94 | India | Roorkiwal et al. (2018b) | |
| Common bean | Cooking time, and water absorption capacity | 922 lines | CV and IV | 5,738 SNPs | RKHS, BA, BC and BL | Single trait model | 0.22–0.55 | Colombia | Diaz et al. (2021) |
| Grain yield and days to maturity | 481 breeding lines | CV and IV | 5,820 SNPs | GBLUP, BL, BA, BB, and RKHS | Single and multi-environment models | 0.6–0.8 | Colombia | Keller et al. (2020) |
Genomic selection studies covering important breeding traits conducted in 2019 and 2020 for rice, wheat, soybean, chickpea, common bean, and groundnut throughout the world. Complete detail about the population size, validation strategy, marker density, model type, the accuracy obtained and country where the study was conducted is provided.
FIGURE 5

The standard breeding scheme outline for self-pollinating crops with the implementation of genomic selection and phenomics information for predicting various traits earlier in the pipeline in different selection cycles. Three columns show the three separate breeding cycles starting from the cross initiation to the variety release. The yellow arrows represent how genomic selection can be used on datasets from previous years to predict phenotype in F2-F4 stage and early-stage testing stages. The red arrows show the stages where selection is imposed for low and high heritable traits in traditional breeding; however, with genomic selection, decisions can be performed for low heritable traits earlier in the pipeline. Here it is assumed that a single generation is planted in a year. The DH represents the double haploid, PYT is preliminary yield trial, AYT is advanced yield trial, and EYT is elite yield trial.
Figure 5 provides the outline for a breeding cycle for wheat and few modifications can be made in this scheme in order to adjust for other crops. In the first year two different parents are crossed with subsequent chromosome doubling in the second year using double haploid (DH) or any other technique for reaching 100% homozygosity (i.e., speed breeding, single seed descent, rapid generation advance, shuttle breeding or tissue culture). These early stage testing lines are evaluated in the third year, and selection is made for high heritable traits, like pod type in groundnut and soybean cyst nematode resistance (Akohoue et al., 2020; Ravelombola et al., 2020). Each set of early-stage testing progenies has a specific set of genes, and the breeder aims to identify the best combination for advancing to the next generation and seed multiplication trial. The measurement of several agronomic traits, such as grain yield and aflatoxin content in groundnut, quality attributes in rice, common bean and wheat, for which a large amount of seed is required, is not possible at this stage (Battenfield et al., 2016; Pandey et al., 2020a). Seeds from the selected lines are multiplied at a single location known as a preliminary yield trial (PYT), and spectral information could be collected using phenomics tools like unmanned aerial vehicles (UAVs), remote sensing, handheld scanners, or tractor-mounted instruments (Rutkoski et al., 2016; Sandhu et al., 2021c). The information generated with these phenomics tools provides a secondary source of trait information for selecting complex traits by understanding G by E interaction, field variation, and explanation of various physiological processes occurring in the plants. Furthermore, these phenomics tools have been used to measure several agronomic traits and disease severity more efficiently and effectively. The lines selected from the PYT are later planted for 1 year at various locations with different replications depending upon the seed generated in the PYT and constitutes the advanced yield trials (AYT). Spectral information can be collected in a similar way as done during PYT to increase selection efficiency. After AYT, breeders keep reducing the population’s size, owing to limited resources and space, and selected lines are continually planted at multiple locations for measuring more quantitative traits. This step is repeated for two-three years depending upon the trait and constitutes elite yield trials (EYT) (Bassi et al., 2015; Gaynor et al., 2017).
Across cycles, predictions are possible at early stages, when seed is limited, to measure quantitative traits like grain yield, end-use quality traits in rice and wheat, and protein content in chickpea and common bean (Jernigan et al., 2018; Diaz et al., 2021). Figure 5 shows that GS and phenomics data sets collected at PYT and AYT from the previous cycle could be used to predict quantitative traits for the F2-F4 population and early-stage testing lines in a new selection cycle. Similarly, in the subsequent years, data from previous cycles and the same cycle can predict AYT performance at multiple locations (Montesinos-López et al., 2017; Crain et al., 2018). Phenomics information provides a significant advantage for within cycle and across cycle prediction in multi-trait GS models. Spectral reflectance indices (SRI) derived from these phenomics measurements have increased prediction accuracy in various GS studies in wheat (Rutkoski et al., 2016; Crain et al., 2018; Sandhu et al., 2021c). Higher prediction accuracies are obtained for grain yield due to lower heritability and higher genetic correlation with SRI. Utilization of these SRI in multi-trait GS models, and as a covariate in the GS models, increases the capture of total variation for a particular trait and helps explain various physiological phenomena that are difficult to observe under field conditions (Rutkoski et al., 2016; Lozada and Carter, 2019). We were not able to find any GS study which used phenomics information in GS models in chickpea, common bean, and groundnut. Table 3 provides the studies that have used GS and phenomics information for predictions in wheat, and the improvement in the model’s performances are provided. There is a significant advantage of including phenomics datasets in GS models due to observed increase in prediction accuracy, suggesting that merging these two techniques can assist in increasing the yield of these crops in the coming decade.
TABLE 3
| Trait | Population size | Model | Phenomics traits used | Physiological trait explained | Effect on prediction accuracy with inclusion of phenomic information | Country/institute | References |
|---|---|---|---|---|---|---|---|
| Grain yield | 1,092 lines | RRBLUP | GNDVI, RNDVI and CT | Canopy size, greenness, chlorophyll content, and temperature | 70% increase in prediction accuracy | CIMMYT | Rutkoski et al. (2016) |
| Grain yield | 1,092 lines | GBLUP, EN, and PLSR | CT and NDVI | Canopy temperature and greenness | 7% increase in prediction accuracy | United States | Crain et al. (2018) |
| Grain yield | 3,282 lines | RRBLUP | CT and GNDVI | Canopy temperature and nitrogen status | 46% increase in prediction accuracy | United States | Sun et al. (2019) |
| Grain yield | 456 lines | Recommender system & GBLUP | NDVI, NWI, and SR | Biomass, greenness, and water status | 19% increase in prediction accuracy | United States | Lozada and Carter (2020) |
| Grain protein content | 650 lines | RRBLUP | NDVI, GNDVI, NWI, WI, ARI, and PRI | Biomass, chlorophyll, nitrogen, water, anthocyanin, and photochemical pigments status | 20% increase in prediction accuracy | United States | Sandhu et al. (2021c) |
| Grain yield | 4,368 lines | GBLUP | GNDVI | Biomass and greenhouse | 11–23% increase in prediction accuracy | CIMMYT | Juliana et al. (2019) |
| Grain yield | 242 lines | GBLUP | CT, SPAD, SGT, NDVI | Canopy temperature, chlorophyll content, stay green and senescence traits | 63% increase in prediction accuracy | United States | Guo et al. (2020b) |
| Grain yield | 1716 | RRBLUP | NDRE, NDVI, and SR | Biomass, vegetation, and water status | 13% increase in prediction accuracy | United States | Lozada et al. (2020a) |
| Grain yield | 771 lines | GBLUP | Reflectance bands | Whole spectrum from visible to infra-red were used | 10–16% increase in prediction accuracy | CIMMYT | Krause et al. (2019) |
| Grain yield | 630 lines | Random regression and GBLUP | CT and NDVI | Canopy temperature and biomass | 70% increase in prediction accuracy | United States | Sun et al. (2017b) |
Genomic selection studies that have used phenomics information in wheat is summarized. The traits or spectral information derived from the phenomics data sets and the physiological parameters which they explain is provided with information about their effect on the final prediction accuracies when included in the genomic selection model is added to show their potential.
Development in Phenotyping Platforms and Imaging Sensors
The last three-decades witnessed an unprecedented increase in the adoption and development of genomics in plant breeding programs, leading to a rise in genetic advances in the major cereal crops (Thudi et al., 2020). However, genetic gain has stagnated in major cereal crops globally, which requires the need to raise the efficiency of breeding programs. It is perceived that limitations in the progress and development of phenotyping tools and platforms contribute to lower efficiency in breeding (Rincent et al., 2018). With this in mind, several phenomic initiatives and facilities have been launched at regional, national, and international levels; still, breeders are skeptical about the application of these tools (Atieno et al., 2017; Duan et al., 2018). Breeders are concerned that results obtained from phenotyping platforms under controlled conditions are not indicative of field performance for complex traits, especially under large environmental variability (Atieno et al., 2017; Duan et al., 2018). Moreover, the high throughput platform’s extensive phenotyping is onerous and not cost-efficient compared to the benefits achieved so far. Lastly, data generated from these tools results in data management and big data problems, causing an issue for making a legitimate conclusion for decision-making without understanding data science and machine learning models (Singh et al., 2016). In spite of these challenges, several phenomics platforms, tools, and sensors have been developed, and their improvement and adoption rate is fairly high with the hope of breaking this stagnated genetic advance (Ashourloo et al., 2014; Dobbels and Lorenz, 2019). The next one or 2 decades have considerable potential for phenomics to reach the stage where genomics is today, allowing collection of a large amount of data, gaining understanding from previously unknown traits, and making valid conclusions based on those.
Imagers and sensors have allowed collection of multidimensional and high-resolution datasets from plants to quantify crop growth, yield, biotic or abiotic stress, and other physiological processes under both fields and controlled conditions (Cai et al., 2016; Sankaran et al., 2019). These sensors can measure spectral reflectance ranging from radio waves to gamma waves of the electromagnetic spectrum and create an abundance of information to select from. The resulting imaging sensors varies from LIDAR, X-ray computed tomography (CT), time-of-flight based systems, positron emission tomography, thermal, visible to near-infrared, multispectral, hyperspectral, fluorescence, and stereovision (Kobayashi et al., 2001; Zhang et al., 2018). The field-based platforms range from Internet-of-Things (IoT) based sensor systems, field mounted system (e.g., tower), tractor/sprayer modified systems (manually operated), small autonomous systems, scanning platforms, UAVs, aircraft, and more recently, low orbiting satellite systems (Sangjan et al., 2021). In general, most of the phenotyping systems in controlled environment are commercial systems developed by the private industry. Recently, there has been interest in the development of IoT based systems for customized operation in controlled environment (Sangjan et al., 2021). The commonly used sensors in the phenotyping platforms used in plant breeding are RGB, multispectral, hyperspectral, thermal, and fluorescence sensors employed on ground-based or aerial platforms. These can cover large numbers of plots at a time by measuring absorption, reflection, and refraction information from the plant canopy. RGB sensors are most often used owing to their cost and simplicity (Ashourloo et al., 2014). All these remote sensing tools provide information about several physiological parameters related to crop yield by considering the plant’s nutrient, water, radiation, pigment contents, resource allocation, and biomass partition (Duan et al., 2018; Dobbels and Lorenz, 2019). Most imagers and sensors are equipped on ground-based platforms, mainly stationary in the field or on phenomobiles at experimental facilities to develop new applications and require specialized training and considerations for their use (Cai et al., 2016; Jimenez-Berni et al., 2018). The increase in resolution and miniaturization has lowered their cost and could be easily purchased by small scale labs. The main success in plant phenotyping has come with higher resolution and miniaturization of the sensors coupled with UAVs for covering a large number of plots in a limited time frame and is preferred over the ground-based platforms in many programs (Sankaran et al., 2015b; Gracia-Romero et al., 2019).
Figure 6 provides the studies using high throughput phenotyping (HTP) in these six crops for the last decade (2011–2020). An observed 3–4 fold increase in the number of studies that are using HTP for rice and wheat can be found, but for chickpea, common bean, groundnut, and soybean, there is no improvement observed in this regard (Zhang et al., 2020a; Zhang et al., 2021). Fewer number of studies using HTP in chickpea, common bean, and groundnut might be attributed to the recent adoption of genomics technology (Pandey et al., 2020b). These crops can still benefit from the use of HTP technology to better evaluate various agronomic, biotic, and abiotic stress-related traits. Table 4 shows recent studies conducted for these six crops where different phenotyping platforms and imaging sensors were used for various agronomic, biotic, and abiotic stress studies. In general, most of the studies used RGB or multispectral imaging due to their lower cost, easy management of data, and avoidance of problems related to big data. Furthermore, UAVs have relatively high adoption rates over ground-based platforms by utilizing the same imaging sensors with better resolution and throughput in collecting data from large plots.
FIGURE 6

Trends in publications mentioning/discussing the six crops and high throughput phenotyping since last decade (2011–2020). Search was conducted using associated crop and high throughput phenotyping keywords in the abstract. Source PubMed dated 02/20/2021.
TABLE 4
| Crop | Trait | Platform | Sensor/imager | Discrimination | References |
|---|---|---|---|---|---|
| Rice | Rice blast | Hand-held, simulating aircraft imagery | Multispectral imaging | Reflectance values in the visible and near-infrared regions were used to link with a disease severity rating | Kobayashi et al. (2001) |
| Rice sheath blight | UAVs | RBG and multispectral imaging | Derived vegetation indices from multispectral images and percentage of infected leaf areas with RGB were used for disease detection | Zhang et al. (2018) | |
| Drought stress | Greenhouse automated system at the Rice Automatic Phenotyping (RAP) facility in Germany | RGB imaging | Stay green values were used to assess the stress tolerance ability of genotypes | Duan et al. (2018) | |
| Wheat | Powdery mildew | German Aerospace Centre | Hyperspectral imaging | Powdery mildew was detected, and the best hyperspectral bands were identified for detecting this fungal disease for application in breeding programs | Mewes et al. (2011) |
| Leaf rust | Hand-held ground-based sensing | RGB imaging and multispectral (spectroradiometer) sensors | Vegetation indices from multispectral imaging and percentage of infected leaves from RGB imaging were used for the classification of leaf rust | Ashourloo et al. (2014) | |
| Plant biomass and height | Phenomobile portable buggy | 3D imaging with LIDAR | Plant height, biomass, and canopy cover was measured in a labor-intensive way | Jimenez-Berni et al. (2018) | |
| Soybean | Seed yield | UAVs | RGB imaging | Average canopy cover obtained at an earlier stage was used as a covariate in yield prediction models | Moreira et al. (2019) |
| Iron deficiency | UAVs | Multispectral imaging | Image processing and unsupervised classification models were used for classifying the iron-deficient plots | Dobbels and Lorenz (2019) | |
| Seed yield | UAVs | Hyperspectral imaging | Feature selection approach was used to identify best bands for predicting seed yield with different ML models | Yoosefzadeh-Najafabadi et al. (2021) | |
| Chickpea | Salinity tolerance | Plant accelerator installed at University of Adelaide | RGB imaging | The plant growth rate was monitored throughout the growth stages to study the effect of salinity | Atieno et al. (2017) |
| Progression of senescence | Camera established on a stand | RGB imaging | Color distortion correction algorithms were applied on time series data to quantify the onset and progression of senescence | Cai et al. (2016) | |
| Common bean | Seed yield and biomass | UAVs | Multispectral imaging | Derived vegetation indices showed a strong relationship with seed yield and biomass | Sankaran et al. (2019) |
| Root architecture | Root excavation, ground-based | RGB imaging and traits estimation with DIRT | Genotypes were differentiated for their root traits | Burridge et al. (2016) | |
| Groundnut | Iron deficiency | Chlorophyll meter SPAD, Hand-held | Infrared sensor | Genetic loci associated with increasing iron deficiency were identified | Pattanashetti et al. (2020) |
Important studies conducted using phenomic tools in the last decade for the six crops explored in this study. Information about the trait, phenotyping platform, sensor and study description is provided.
RGB and multispectral imaging have shown a tremendous adoption rate during the last decade for studying biotic and abiotic stresses in crops. Rice sheath blight (Rhizoctonia solani) and blast (Magnaporthe oryzae) are devastating diseases of rice observed worldwide, and accurate detection and management are the focus of several breeding programs. RGB and multispectral imaging sensors on UAVs are an affordable and user-friendly option for disease detection and rating (Kobayashi et al., 2001; Zhang et al., 2018). Color space transformation and color feature extraction have been used to select the diseased varieties or qualitatively detect the infected portions; however, estimation of disease quantitatively was less effective. Vegetation indices extracted from multispectral images showed high accuracy for quantitatively predicting these diseases (Kobayashi et al., 2001; Zhang et al., 2018). Hyperspectral imaging covers a broader region of the electromagnetic spectrum (400–2,500 nm) with a narrow bandwidth, non-destructively explaining various biochemical and physiological changes occurring in the plant due to environmental conditions. For example, in wheat, hyperspectral imaging has been used to detect powdery mildew severity and infection using feature selection algorithms (Mewes et al., 2011). As hyperspectral imaging provides information about various spectral bands, most of which are unnecessary, feature selection is required. This became possible due to the adoption of machine learning models by plant breeders. Here, Mewes et al. (2011) used support vector machine and spectral angle mapper classification methods for feature selection to identify the most important spectral band. Later, those selected bands showed higher prediction accuracy for powdery mildew.
Phenomics aids in the collection of high-quality data earlier in the breeding pipeline from thousands of breeding plots with high temporal and spatial resolution (Krause et al., 2020). Data collected at earlier stages in the growth cycle has shown an advantage in soybean breeding, where canopy coverage during vegetative growth stages have high heritability and genetic correlation with seed yield (Moreira et al., 2019). UAVs are commonly used for collecting canopy coverage with RGB cameras, which is later used for predicting seed yield from multiple plots (Moreira et al., 2019). In a recent study, Yoosefzadeh-Najafabadi et al. (2021) used hyperspectral imaging collected at vegetative stages in soybean and feature selection with machine learning models and demonstrated 93% prediction accuracy for seed yield prediction. There are various other examples where phenomics is used in soybean for studying biotic stresses (powdery mildew, phomopsis seed decay, and target spot), abiotic stresses (nutrient deficiency, drought, and waterlogging), and agronomic traits (seed yield, pod number and biomass estimation) (Mo et al., 2015; Moreira et al., 2019; Yoosefzadeh-Najafabadi et al., 2021). Multiple vegetation indices [normalized difference vegetation index (NDVI), normalized water index (NWI), photochemical reflectance index (PRI)] derived from multispectral imaging were used to find the best time point for predicting the above ground mass and seed yield using correlation and regression analysis (Sankaran et al., 2019). Furthermore, thermal sensors were used to obtain the mean plot temperature and showed a high correlation with plant biomass (Sankaran et al., 2019). A couple of studies have shown the potential of multispectral imaging using UAVs for common bean to predict the seed yield and biomass, but the total number studies are limited when compared to wheat, rice, and soybean (Figure 6) (Burridge et al., 2016; Sankaran et al., 2019).
Ascochyta blight is a devastating disease in chickpea, and remote sensing has shown opportunities for its monitoring in the field (Zhang et al., 2019). Multispectral and thermal sensors deployed on UAVs were used to extract canopy area, percentage of cover, and vegetation indices for predicting disease severity and seed yield in chickpea. The study showed the potential for timely management of the disease by monitoring the crop with remote sensing techniques (Zhang et al., 2019). In a different study, two hundred forty-five chickpea accessions were evaluated using image-based phenotyping to study genetic variation for salt tolerance (Atieno et al., 2017). Pod abortion and pod filling inhibition are the main effects of salinity, and imaging sensors were used to identify the accessions with salt tolerance by phenotyping pod number and seed density (Atieno et al., 2017). In groundnut, iron deficiency occurs when plants are grown on neutral and alkaline soils, reducing the availability of Fe2+ in plants. Infrared sensors were used in groundnut for measuring chlorophyll and iron deficiency chlorosis systems (Pattanashetti et al., 2020). The adoption of phenomics for groundnut in high production countries like India and Ethiopia offers an advantage for reducing yield gaps by understanding various physiological and biochemical process, along with genomic technologies, to improve yield performance.
Going Underground, a Challenge for a Breeder
Although genomics and phenomics tools have helped plant breeders study above-ground traits in great detail, limited work has been done on belowground root systems, which play a vital part in a plants affecting overall grain yield potential. Figure 7 shows the trend for publications using HTP and root phenomics, and it can be concluded that root phenotyping studies are lagging behind other phenomics studies. Roots play an essential role by directly influencing plant growth by regulating water and nutrient uptake, regulating drought stress, resisting soil-borne diseases, and maintaining the crop’s yield and quality (Seck et al., 2020; Wu et al., 2021). The study of root system architecture (RSA) is challenging in situ compared to above-ground phenotyping. Several 2D transparent growth mediums are available that allow sequential capturing of RGB imaging to study growth dynamics and root hair development, such as PlaRoM, Rhizoslides, RootPainter, SNAP, Rhizovision, Rhizoponics, RADIX, and RhizoTubes (Le Marié et al., 2014; Mathieu et al., 2015; Falk et al., 2020; Smith et al., 2020). Various open-source image analysis tools like SmartRoot, RootNav, RootTrace, and EZ-Root-VIS are available to study RSA (French et al., 2009; Pound et al., 2013). To study 3D RSA, a gel-based cylinder can be used to study up to 16 roots traits (Iyer-Pascuzzi et al., 2010). Other 3D image reconstruction and image analysis tools are RootReader3D and GiaRoots (Iyer-Pascuzzi et al., 2010). All these platforms work under lab conditions.
FIGURE 7

Trends in publications mentioning/discussing root phenomics and high throughput phenotyping since last decade (2011–2020). Search was conducted using root phenomics and high throughput phenotyping keywords in the abstract. Source PubMed dated 02/20/2021.
The above-mentioned transparent media does not entirely mimic field conditions. GROWSCREEN-Rhizo, an intelligent mechanized root phenotyping platform, was developed to phenotype roots and shoots simultaneously in transparent soil-filled rhizotrons (Bodner et al., 2018). In a separate study, these Rhizotrons were equipped with thermal and hyperspectral cameras for measuring the temperature and root chemical components like lignin change, water content, and mineral observation capacity (Pound et al., 2013; Le Marié et al., 2014). The difference in the X-ray attenuation capacity of roots and soils is utilized to visualize the inner 3D structure in the X-ray CT. Open-source tools like RootViz3D and RooTrak are used for analyzing different X-ray attenuation capacity to reconstruct the 3D RSA (Mairhofer et al., 2015). However, X-ray CT suffers from some limitations, which vary from the impact of soil type, compaction, and homogeneity of soil particles on X-ray attenuation values. Furthermore, high doses of X-ray affect plant and microbial growth in the soil, and lastly, scanning resolution and volume increase the time of data collection for large pots, limiting the frequency of data acquisition (Metzner et al., 2015).
Other root phenotyping approaches include positron emission tomography (PET), magnetic resonance imaging (MRI), thermal neutron tomography, and neutron radiography. MRI uses the absorption and re-emission of electromagnetic radiation from the nuclei to determine its root architecture and functional attributes (Courtois et al., 2013; Beyer et al., 2019). But MRI is highly sensitive to moisture content and is only applicable if the root diameter is more than 1 mm. Similarly, PET uses the radiotracer distribution for non-invasively studying root attributes. PET has been used to scan the roots up to 85 mm deep non-invasively, and used to monitor carbohydrate transportation assimilates over a more extended period (Garbout et al., 2012). X-ray CT, MRI, and PET have been used differently and have their own strengths and limitations, and hence used interchangeably. For instance, 1) PET has lower signal deterioration by water content and soil structure compared to CT and MRI; furthermore, high water content affects the performance of CT more than MRI (Garbout et al., 2012); 2) CT is more effective for providing high-resolution information from small pots; however, when pot size is large, MRI provides more information about root structure than CT (Pflugfelder et al., 2017); 3) MRI and CT provide higher spatial resolution than PET, but PET provides better contrast between roots and soil owing to gamma radiation; and 4) PET and MRI scanning requires a large amount of time compared to CT, and is problematic for genetic studies where a large number of samples need to be screened (Metzner et al., 2015). In regard to the in-situ root phenotyping in field conditions, there has been great interest in utilizing ground penetrating radar (GPR) (Atkinson et al., 2019). But similar to other techniques, there are limitations associated with influence of soil type and condition on data quality. Table 5 provides information about various other root phenotyping techniques. Advancements in root phenotyping in recent years shows the potential for improving below ground traits in all the crop species by understanding traits better. Further reading about the below-ground phenotyping can be found in other review articles (Paez-Garcia et al., 2015; Wasaya et al., 2018).
TABLE 5
| Root phenotyping technique | Growth media | Description | References |
|---|---|---|---|
| Shovelomics | Soil (field based) | Involved excavation of root samples from the soils to visually score various attributes. The pipeline involves digging of sample, soaking and rinsing, picture collection and finally scoring the characteristics | Garbout et al. (2012) |
| Digital imaging | Liquid media (lab) | Roots are scanned in a liquid media for length, diameter, topology, and branching patterns | Piñeros et al. (2016) |
| Digital imaging | Growth pouch system | Roots are scanned in a growth pouch medium for length, diameter, topology, and branching patterns | Falk et al. (2020) |
| Soil coring | Soil (field based) | It uses tractor mounted hydraulic soil corer for digging steel alloy sampling tubes into soil and assist in phenotyping roots | Iyer-Pascuzzi et al. (2010) |
| Minirhizotrons | Soil (field-based) | A transparent tube is permanently inserted into the ground and growth of shoot and root is continuously monitored throughout the growth stages | Le Marié et al. (2014) |
| Rhizolysimeters | Soil (field-based) | It uses underground concrete pipes, silos and corridor to house soil containing cores for constant observation of root traits | Bodner et al. (2018) |
| Rhizoponics | Liquid media (lab) | It is combination of rhizotrons and hydroponics, where set up is immersed in tank filled with media. Non-destructive 2D imaging of roots and shoots is performed | Mathieu et al. (2015) |
| X-ray CT | Soil (greenhouse and lab) | X-ray CT non-destructively measures the attenuating ionizing radiations for assessing the root structure and constructing the 3D image of RSA | Metzner et al. (2015) |
| Ground penetrating radar | Soil (field-based) | It is mostly used for tree roots and uses electromagnetic pulse system for determining root diameter, biomass, and other attributes | Garbout et al. (2012) |
| Positron emission tomography | Liquid media (lab) | It uses the functional and molecular imaging for tracing the radio tracer distribution in the plant non-invasively | Garbout et al. (2012) |
| Magnetic resonance imaging | Soil (greenhouse and lab) | This study the magnetic moment of atomic particles using strong magnetic and radio frequency | Pflugfelder et al. (2017) |
Description of the important root phenotyping techniques and associated growth media’s for studying the root system architecture.
Developing crop varieties which remain productive on marginal soils and under water deficit is the main aim of several breeding programs, especially in Asia and Africa, owing to climate change (Pattanashetti et al., 2020). Breeding programs maintain yield by selecting combinations of traits like increased harvest index, increased shoot biomass, resistance against insects and pests, and altering the duration of the growing season (Mathieu et al., 2015; Atieno et al., 2017). However, these traits might be linked to root traits, but are not explored to such an extent. This could be achieved using several root phenotyping techniques under field, greenhouse, and laboratory conditions (Iyer-Pascuzzi et al., 2010). Various QTLs were identified controlling RSA for assistance in genomic assisted breeding (Li et al., 2017; Zhao et al., 2019; Seck et al., 2020). QTLs were identified controlling root branching, root length, root hair, and other root traits in certain crops. Identification of QTLs or genes controlling these traits requires accurate and reproducible phenotyping information (Li et al., 2017; Seck et al., 2020). Although several QTLs have been identified for these RSA traits, information is still lacking, such as their mechanism, effect under different genetic backgrounds, and role under different environments and soil types. Most of the roots traits identified so far are polygenic and demonstrate a tremendous potential for utilization of GS for predicting RSA by building reliable training sets for the crops (Li et al., 2017; Seck et al., 2020).
In a recent study, two hundred wheat lines were screened for root dry matter, root diameter, seminal axis root length, root dry matter, and branching pattern in seedling growth over the hydroponic system for performing MTAs (Beyer et al., 2019). From this study, 63 QTLs were identified to control these RSA traits and have a minor effect on phenotypes, suggesting the polygenic nature of these five traits in wheat (Beyer et al., 2019). A root phenotyping study was conducted on 529 rice accessions under controlled and drought conditions to identify MTAs for 21 traits. Researchers identified 264 QTLs controlling all 21 traits, and most of them were already reported in previous studies in rice, further validating the genetic architecture of root traits (Courtois et al., 2013; Li et al., 2017). Similarly, in soybean, GWAS has been performed in various studies to explore RSA trait’s genetic architecture. A recent study using 137 soybean lines grown under rhizoboxes and phenotyped with two-dimensional imaging identified 10 QTLs controlling 15–20% variation for primary root diameter and total root length (Seck et al., 2020). As common bean is mainly grown under drought conditions, 196 QTLs were identified in 438 accessions for various root traits such as root length/weight, lateral root length, taproot length, root volume, root surface area, average root diameter, and lateral root number under drought conditions (Wu et al., 2021). This study provided the genetic basis for roots traits under drought conditions, which will ultimately improve common bean (Wu et al., 2021). There was no major finding related to the study of genetic architecture for RSA traits for chickpea and groundnut, providing opportunities for adoption of root phenotyping in the coming years. We were also not able to find any study using GS for predicting root traits. This will be an emerging research area in coming decades due to rapid progress in root phenotyping that will help understand the genetic architecture of root traits, creating datasets for training GS models, and ultimately helping the breeder select multiple traits simultaneously.
Merging of Genomic Selection, Phenomics and Machine Learning in Breeding
As discussed previously, GS aids in predicting GEBVs and in increasing genetic gain by reducing variety development time and cost per cycle and increasing selection accuracy. Phenomics allows generation of high-quality quantitative data and effectively characterizes large breeding populations (Araus et al., 2018). It has been seen that there is potential for combining GS and phenomics for increasing efficiency and precision while minimizing labor and lowering costs. This will aid in increasing the selection intensity and accuracy within breeding programs and subsequently the selection response (Sun et al., 2017b; Sandhu et al., 2021c). Until now, data from phenomics tools have been used as secondary traits for evaluating disease and pest resistance, abiotic stresses, end-use quality traits, and ultimately grain yield. Furthermore, phenomics datasets are collected in a longitudinal framework that helps select individuals with a specific spectral trajectory during a particular growing stage and helps predict temporal breeding values for specific periods (Moreira et al., 2020). Table 3 provides most of the studies that have used phenomics datasets in multi-trait GS models to predict grain yield in wheat and observed the improvement in the prediction accuracy, either by using single indices or multiple indices in the models.
Most of the GS studies conducted so far use a single trait (univariate) statistical model to predict one trait at a time and do not benefit from the genetic correlation among two or more traits (Jia and Jannink, 2012; Galán et al., 2020). However, multi-trait (multivariate) GS has demonstrated increased prediction accuracy, reduced selection trait bias, high statistical power, and increased parameter estimation accuracy (Sandhu et al., 2021a). Multi-trait GS models have more advantages for traits with low heritability values, like grain yield and end-use quality traits, where secondary traits correlated with high heritability values aid in increasing prediction accuracy (Crain et al., 2018; Lozada and Carter, 2019; Sandhu et al., 2021c). Recently, several studies from CIMMYT have demonstrated an increase in prediction accuracy for grain yield in wheat when secondary longitudinal data collected from phenomics is included as a covariate or in multi-trait GS models (Sun et al., 2019; Lozada et al., 2020a). Furthermore, secondary traits extracted from phenomics aid in selecting earlier in plant growth stages for quantitative traits, allowing earlier program resource allocation to the best individuals. In addition to increasing prediction accuracy, selection response, and intensity, longitudinal phenomics data can explain the various biological process underlying plant growth, not limited to water status, biomass accumulation, chlorophyll content, and photosynthetic efficiency. Primarily SRI are extracted from these longitudinal phenomics data which indirectly explain important physiological processes and stresses in the plants and are mainly used in multi-trait GS models.
Rutkoski et al. (2016) used SRI extracted from phenomics datasets and included them into pedigree and GS models for predicting grain yield in wheat. Doing this in earlier stages of the breeding pipeline is advantageous to remove poorly performing lines, but GS is sometimes not possible at this stage owing to genotyping cost. They showed that pedigree information could also be used with SRI for predicting grain yield earlier to enhance genetic gain. Pedigree information removed the cost and effort of genotyping a large number of plants, and their use also satisfies Mendelian sampling. Rutkoski et al. (2016) observed a 56 and 70% improvement in prediction accuracy for grain yield for within environment predictions using pedigree and genomic relationship matrices when including SRI in the models. The indices used in the study were canopy temperature and green normalized difference vegetation index (GNDVI), which provided information about canopy temperature and biomass and were phenotypically and genetically correlated to grain yield (Rutkoski et al., 2016). In another study, Sun et al. (2017a) used NDVI and canopy temperature in a multi-trait, random regression, and repeatability model for predicting grain yield in wheat and observed a 70% increase in prediction accuracy compared to the single trait GS model. Furthermore, the multi-trait model’s average improvement in predictability was highest, followed by random regression and repeatability model. Various other studies obtained similar results by the inclusion of secondary traits in wheat (Sun et al., 2017b; Crain et al., 2018).
Campbell et al. (2018) used longitudinal phenomics data for fitting random regression models to predict shoot growth trajectories in rice using pedigree and genomic relationships by fitting a second-order Legendre polynomial. A random regression model with longitudinal phenomics data demonstrated improvement in prediction compared to a single data point in traditional mixed linear models. They also showed the future growth predictions could be performed with high accuracy by using a genomic random regression model by having a subset of early phenomics measurements (Campbell et al., 2018). Similarly, another study in rice used random regression models by fitting B-spline and second-order Legendre polynomials to predict the projected shoot area under water-limited and controlled conditions and demonstrated that random regression models performed better than the baseline multi-trait models (Campbell et al., 2019). Furthermore, B-spline models fit a better curve compared to Legendre polynomials (Campbell et al., 2019). Therefore, we have seen that predominantly random regression models are used in rice for fitting or predicting growth curves. In contrast, in the case of wheat, multi-trait GS models have shown the advantage to predict quantitative traits using longitudinal phenomics datasets, which results in significant improvements compared to traditional models. Additional studies suggest canopy cover plays an important role in predicting the seed yield for soybean. Jarquin et al. (2018) modeled the genetic covariance between canopy cover collected by phenomics tools and seed yield using various cross-validation schemes and molecular markers to predict the seed yield. The prediction ability was highest when both canopy cover and molecular markers were included in the prediction models compared to only molecular marker and canopy information (Jarquin et al., 2018).
Owing to the ability of phenomics to collect a large amount of data due to its high spatial and temporal variation, it sometimes creates the big data problem, where feature selection needs to be performed, and complex machine and deep learning models are needed to build the relationship between features and predictors. Several machine and deep learning models, such as random forests, ensemble-based methods, support vector machine, multilayer perceptron, convolutional neural network, and recurrent neural network, are often employed for analyzing phenomics data and predicting traits with GS models. The main interests for these models in plant breeding are useful due to their powerful ability to learn the complex/hidden non-linear relationship in the data to predict complex traits and usually result in higher prediction accuracy than a mixed linear GS model. Ma et al. (2018) and Sandhu et al. (2021e) have shown the potential of deep learning models for predicting grain yield in wheat and observed higher prediction accuracies than the previous BLUP based models and open up a new class of models that could be explored. Table 1 provides the various machine and deep learning models, and their source code links, which have been explored for GS so far. In the coming years, an active area of research is merging machine and deep learning models with phenomics datasets and molecular markers to predict the breeding program’s complex traits.
Concluding Remarks
We explored six important self-pollinated crops consumed by 90% of the world population. Most of the advancements in genomics and phenomics over the last decade have been observed in wheat and rice. The genome sequencing of other crops and the adoption of high throughput genotyping tools have paved the way for understanding various underlying genetic mechanisms. These crops can utilize phenomics in coming years after seeing the progress and benefits achieved in wheat and rice. Several GS models varying from traditional BLUP based model to machine/deep learning models have been explored for prediction. Furthermore, the inclusion of genotype by environment interaction in these models has delivered good prediction accuracy for predicting untested lines in new environments. All the GS models discussed in this study, including genotype and environment interaction, will assist the plant breeder in making improved selection decisions. Multi-trait GS models also indicate their success for predicting low heritable traits and will be explored in future years for prediction under multi-environment scenarios, with the inclusion of phenomics datasets, for understanding genotype by environment interactions.
The yield trends of crops across different continents is very diverse, and it is evident that in Asia and Africa, use of advanced genomic and phenomic technologies can improve/enhance grain yield. Furthermore, public breeding programs play a predominant role in these crops. To translate the advantage of GS and phenomics in their programs, low-cost genotyping and phenotyping needs to be developed and used. In this context, easy to handle, reliable, and affordable low throughput platforms pave the way, and among such tools, RGB cameras make good candidates. Below ground phenotyping is tedious for a plant breeder and is being ignored by most programs. However, several field and lab-based root phenotyping tools were launched in the last decade and their potential is being realized. Further refinement and throughput will pave a new way to better understand root traits in field crops. This is extremely important for continuously increasing drought, salinity, aluminium, and heavy metal sensitivity to plants. As the total number of studies for phenotyping the roots traits increase, this will ultimately aid in predicting new genotypes using GS once enough data are collected for each crop under the different breeding programs.
This review highlights the advantages of combining genomics and phenomics, especially in wheat and rice. There is a need to merge and adopt these two disciplines at a fast pace in other crops to increase their genetic gain. GS has been shown to increase genetic gain by increasing selection accuracy and intensity with reduction of cycle time, which can be further enhanced by using phenomics, and machine/deep learning models in the breeding programs due to big data sets. These tools could aid in screening large number of lines with less phenotyping cost and efforts, allowing better exploration of the genetic diversity of particular crops for various traits. Phenomics is assisting plant breeders in integrating physiological breeding in addition to using molecular and genetic tools for selection. Thus, future studies in breeding will focus on merging all these tools and domains to reach the required rate of genetic gain for grain yield.
Statements
Author contributions
KS and AC: conceptualized the idea and designed the study; KS: wrote the first draft of the manuscript; LM, SS, ZZ, and AC: reviewed, edited, and provided consistent feedback to improve the final draft of the manuscript; SS, ZZ and AC: supervision of the study; AC: resources and funding. All authors read and approved the final manuscript.
Funding
This project was supported by the Agriculture and Food Research Initiative Competitive Grant 2017-67007-25939 (WheatCAP) and 2016-68004-24770 from the USDA National Institute of Food and Agriculture and Hatch project 1014919.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1
AbeA.KosugiS.YoshidaK.NatsumeS.TakagiH.KanzakiH. (2012). Genome Sequencing Reveals Agronomically Important Loci in rice Using MutMap. Nat. Biotechnol.30, 174–178. 10.1038/nbt.2095
2
AdhikariA.BasnetB. R.CrossaJ.DreisigackerS.CamarilloF.BhatiP. K.et al (2020). Genome-wide Association Mapping and Genomic Prediction of Anther Extrusion in Cimmyt Hybrid Wheat Breeding Program via Modeling Pedigree, Genomic Relationship, and Interaction with the Environment. Front. Genet.11, 586687. 10.3389/fgene.2020.586687
3
AkohoueF.Achigan-DakoE. G.SnellerC.Van DeynzeA.SibiyaJ. (2020). Genetic Diversity, SNP-Trait Associations and Genomic Selection Accuracy in a West African Collection of Kersting’s Groundnut. PLoS One15, e0234769. 10.1371/journal.pone.0234769
4
AndersonJ. A.ChaoS.LiuS. (2007). Molecular Breeding Using a Major QTL for fusarium Head Blight Resistance in Wheat. Crop Sci.47–112. 10.2135/CROPSCI2007.04.0006IPBS
5
ArausJ. L.CairnsJ. E. (2014). Field High-Throughput Phenotyping: The New Crop Breeding Frontier. Trends Plant Sci.19, 52–61. 10.1016/j.tplants.2013.09.008
6
ArausJ. L.KefauverS. C.Zaman-AllahM.OlsenM. S.CairnsJ. E. (2018). Translating High-Throughput Phenotyping into Genetic Gain. Trends Plant Sci.23, 451–466. 10.1016/j.tplants.2018.02.001
7
AshourlooD.MobasheriM.HueteA. (2014). Evaluating the Effect of Different Wheat Rust Disease Symptoms on Vegetation Indices Using Hyperspectral Measurements. Remote Sens6, 5107–5123. 10.3390/rs6065107
8
AtienoJ.LiY.LangridgeP.DowlingK.BrienC.BergerB.et al (2017). Exploring Genetic Variation for Salinity Tolerance in Chickpea Using Image-Based Phenotyping. Sci. Rep.7, 1–11. 10.1038/s41598-017-01211-7
9
AtkinsonJ. A.PoundM. P.BennettM. J.WellsD. M. (2019). Uncovering the Hidden Half of Plants Using New Advances in Root Phenotyping. Curr. Opin. Biotechnol.55, 1–8. 10.1016/j.copbio.2018.06.002
10
BassiF. M.BentleyA. R.CharmetG.OrtizR.CrossaJ. (2015). Breeding Schemes for the Implementation of Genomic Selection in Wheat (Triticum spp). Plant Sci.242, 23–36. 10.1016/j.plantsci.2015.08.021
11
BattenfieldS. D.GuzmánC.Chris GaynorR.SinghR. P.PeñaR. J.DreisigackerS.et al (2016). Genomic Selection for Processing and End-Use Quality Traits in the CIMMYT spring Bread Wheat Breeding Program. Plant Genome9, 1. 10.3835/plantgenome2016.01.0005
12
BeebeS. E.RaoI. M.CajiaoC.GrajalesM. (2008). Selection for Drought Resistance in Common Bean Also Improves Yield in Phosphorus Limited and Favorable Environments. Crop Sci.48, 582–592. 10.2135/CROPSCI2007.07.0404
13
Ben-SadounS.RincentR.AuzanneauJ.OuryF. X.RollandB.HeumezE.et al (2020). Economical Optimization of a Breeding Scheme by Selective Phenotyping of the Calibration Set in a Multi-Trait Context: Application to Bread Making Quality. Theor. Appl. Genet.133, 2197–2212. 10.1007/s00122-020-03590-4
14
BernardoR. (1994). Prediction of maize Single‐cross Performance Using RFLPs and Information from Related Hybrids. Crop Sci.34, 20–25. 10.2135/cropsci1994.0011183X003400010003x
15
BernardoR.YuJ. (2007). Prospects for Genomewide Selection for Quantitative Traits in maize. Crop Sci.47, 1082–1090. 10.2135/cropsci2006.11.0690
16
BerryM.IzquierdoP.JefferyH.ShawS.Nchimbi-MsollaS.CichyK. (2020). QTL Analysis of Cooking Time and Quality Traits in Dry Bean (Phaseolus vulgaris L). Theor. Appl. Genet.133, 2291–2305. 10.1007/s00122-020-03598-w
17
BeukertU.ThorwarthP.ZhaoY.LonginC. F. H.SerflingA.OrdonF.et al (2020). Comparing the Potential of Marker-Assisted Selection and Genomic Prediction for Improving Rust Resistance in Hybrid Wheat. Front. Plant Sci.11, 1650. 10.3389/fpls.2020.594113
18
BeyerS.DabaS.TyagiP.BockelmanH.Brown-GuediraG.MohammadiM. (2019). Loci and Candidate Genes Controlling Root Traits in Wheat Seedlings—A Wheat Root GWAS. Funct. Integr. Genomics19, 91–107. 10.1007/s10142-018-0630-z
19
BhandariA.BartholoméJ.Cao-HamadounT. V.KumariN.FrouinJ.KumarA.et al (2019). Selection of Trait-specific Markers and Multi-Environment Models Improve Genomic Predictive Ability in rice. PLoS One14, 1. 10.1371/JOURNAL.PONE.0208871
20
BodnerG.NakhforooshA.ArnoldT.LeitnerD. (2018). Hyperspectral Imaging: A Novel Approach for Plant Root Phenotyping. Plant Methods14, 1. 10.1186/s13007-018-0352-1
21
BreseghelloF.SorrellsM. E. (2006). Association Mapping of Kernel Size and Milling Quality in Wheat (Triticum aestivum L.) Cultivars. Genetics172 (2), 1165–1177. 10.1534/genetics.105.044586
22
BuerstmayrH.BanT.AndersonJ. A. (2009). QTL Mapping and Marker-Assisted Selection for Fusarium Head Blight Resistance in Wheat: A Review. Plant Breed128, 1–6. 10.1111/j.1439-0523.2008.01550.x
23
BurridgeJ.JochuaC. N.BuckschA.LynchJ. P. (2016). Legume Shovelomics: High-Throughput Phenotyping of Common Bean (Phaseolus vulgaris L.) and Cowpea (Vigna Unguiculata Subsp, Unguiculata) Root Architecture in the Field. F. Crop Res.192, 21–32. 10.1016/j.fcr.2016.04.008
24
CaiJ.OkamotoM.AtienoJ.SuttonT.LiY.MiklavcicS. J. (2016). Quantifying the Onset and Progression of Plant Senescence by Color Image Analysis for High Throughput Applications. PLoS One11, e0157102. 10.1371/journal.pone.0157102
25
CampbellM.MomenM.WaliaH.MorotaG. (2019). Leveraging Breeding Values Obtained from Random Regression Models for Genetic Inference of Longitudinal Traits. Plant Genome12, 180075. 10.3835/plantgenome2018.10.0075
26
CampbellM.WaliaH.MorotaG. (2018). Utilizing Random Regression Models for Genomic Prediction of a Longitudinal Trait Derived from High-Throughput Phenotyping. Plant Direct2, 1. 10.1002/pld3.80
27
ChenH.XieW.HeH.YuH.ChenW.LiJ.et al (2014). A High-Density Snp Genotyping Array for rice Biology and Molecular Breeding. Mol. Plant7, 541–553. 10.1093/mp/sst135
28
ChhabraB.TiwariV.GillB. S.DongY.RawatN. (2021). Discovery of a Susceptibility Factor for Fusarium Head Blight on Chromosome 7A of Wheat. Theor. Appl. Genet.1, 3. 10.1007/s00122-021-03825-y
29
ChuY.HolbrookC. C.IsleibT. G.BurowM.CulbreathA. K.TillmanB.et al (2018). Phenotyping and Genotyping Parents of Sixteen Recombinant Inbred Peanut Populations. Peanut Sci.45, 1–11. 10.3146/ps17-17.1
30
CobbJ. N.JumaR. U.BiswasP. S.ArbelaezJ. D.RutkoskiJ.AtlinG.et al (2019). Enhancing the Rate of Genetic Gain in Public-Sector Plant Breeding Programs: Lessons from the Breeder’s Equation. Theor. Appl. Genet.132. 10.1007/s00122-019-03317-0
31
ConcibidoV. C.DiersB. W.ArelliP. R. (2004). A Decade of QTL Mapping for Cyst Nematode Resistance in Soybean. Crop Sci.44, 1121–1131. 10.2135/CROPSCI2004.1121
32
CourtoisB.AudebertA.DardouA.RoquesS.Ghneim- HerreraT.DrocG.et al (2013). Genome-wide Association Mapping of Root Traits in a Japonica rice Panel. PLoS One8, e78037. 10.1371/journal.pone.0078037
33
CrainJ.MondalS.RutkoskiJ.SinghR. P.PolandJ. (2018). Combining High-Throughput Phenotyping and Genomic Information to Increase Prediction and Selection Accuracy in Wheat Breeding. Plant Genome11, 1–14. 10.3835/plantgenome2017.05.0043
34
CrossaJ.MartiniJ. W. R.GianolaD.Pérez-rodríguezP.JarquinD.JulianaP.et al (2019). Deep Kernel and Deep Learning for Genome-Based Prediction of Single Traits in Multienvironment Breeding Trials. Front. Genet.10, 1–13. 10.3389/fgene.2019.01168
35
CrossaJ.PérezP.HickeyJ. B.OrnellaL.Cerón-RojasJ.ZhangX.et al (2014). Genomic Prediction in CIMMYT maize and Wheat Breeding Programs. Heredity (Edinb).112, 48–60. 10.1038/hdy.2013.16
36
CuiF.ZhangN.FanX. L.ZhangW.ZhaoC. H.YangL. J.et al (2017). Utilization of a Wheat660K SNP Array-Derived High-Density Genetic Map for High-Resolution Mapping of a Major QTL for Kernel Number. Sci. Rep.7, 1–12. 10.1038/s41598-017-04028-6
37
CuiY.LiR.LiG.ZhangF.ZhuT.ZhangQ.et al (2020). Hybrid Breeding of rice via Genomic Selection. Plant Biotechnol. J.18, 57–67. 10.1111/pbi.13170
38
DeshmukhR.SonahH.PatilG.ChenW.PrinceS.MutavaR.et al (2014). Integrating Omic Approaches for Abiotic Stress Tolerance in Soybean. Front. Plant Sci.5, 244. 10.3389/fpls.2014.00244
39
DiazS.Ariza-SuarezD.RamdeenR.AparicioJ.ArunachalamN.HernandezC.et al (2021). Genetic Architecture and Genomic Prediction of Cooking Time in Common Bean (Phaseolus vulgaris L). Front. Plant Sci.11, 2257. 10.3389/fpls.2020.622213
40
DobbelsA. A.LorenzA. J. (2019). Soybean Iron Deficiency Chlorosis High Throughput Phenotyping Using an Unmanned Aircraft System. Plant Methods15, 97. 10.1186/s13007-019-0478-9
41
DuanL.HanJ.GuoZ.TuH.YangP.ZhangD.et al (2018). Novel Digital Features Discriminate between Drought Resistant and Drought Sensitive rice under Controlled and Field Conditions. Front. Plant Sci.9, 492. 10.3389/fpls.2018.00492
42
EdwardsS. M. K.BuntjerJ. B.JacksonR.BentleyA. R.LageJ.ByrneE.et al (2019). The Effects of Training Population Design on Genomic Prediction Accuracy in Wheat. Theor. Appl. Genet.132, 1943–1952. 10.1007/s00122-019-03327-y
43
EndelmanJ. B. (2011). Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP. Plant Genome4, 250–255. 10.3835/plantgenome2011.08.0024
44
FalkK. G.JuberyT. Z.MirnezamiS. V.ParmleyK. A.SarkarS.SinghA.et al (20202020). Computer Vision and Machine Learning Enabled Soybean Root Phenotyping Pipeline. Plant Methods161 (16), 1–19. 10.1186/S13007-019-0550-5
45
FranckiM. G.WalkerE.McMullanC. J.MorrisW. G. (2020). Multi-location Evaluation of Global Wheat Lines Reveal Multiple Qtl for Adult Plant Resistance to Septoria Nodorum Blotch (SNB) Detected in Specific Environments and in Response to Different Isolates. Front. Plant Sci.11, 771. 10.3389/fpls.2020.00771
46
FrenchA.Ubeda-TomásS.HolmanT. J.BennettM. J.PridmoreT. (2009). High-throughput Quantification of Root Growth Using a Novel Image-Analysis Tool. Plant Physiol.150, 1784–1795. 10.1104/pp.109.140558
47
FrouinJ.LabeyrieA.BoisnardA.SacchiG. A.AhmadiN. (2019). Genomic Prediction Offers the Most Effective Marker Assisted Breeding Approach for Ability to Prevent Arsenic Accumulation in rice Grains. PLoS One14, e0217516. 10.1371/journal.pone.0217516
48
GalánR. J.Bernal-VasquezA. M.JebsenC.PiephoH. P.ThorwarthP.SteffanP.et al (2020). Integration of Genotypic, Hyperspectral, and Phenotypic Data to Improve Biomass Yield Prediction in Hybrid rye. Theor. Appl. Genet.133, 11. 10.1007/s00122-020-03651-8
49
GarboutA.MunkholmL. J.HansenS. B.PetersenB. M.MunkO. L.PajorR. (2012). The Use of PET/CT Scanning Technique for 3D Visualization and Quantification of Real-Time Soil/plant Interactions. Plant Soil352, 113–127. 10.1007/s11104-011-0983-8
50
GaynorR. C.GorjancG.BentleyA. R.OberE. S.HowellP.JacksonR.et al (2017). A Two-Part Strategy for Using Genomic Selection to Develop Inbred Lines. Crop Sci.57, 2372–2386. 10.2135/cropsci2016.09.0742
51
GillH. S.HalderJ.ZhangJ.BrarN. K.RaiT. S.HallC.et al (2021). Multi-trait Multi-Environment Genomic Prediction of Agronomic Traits in Advanced Breeding Lines of winter Wheat. Front. Plant Sci.0, 1619. 10.3389/FPLS.2021.709545
52
GiovannoniJ. J.WingR. A.GanalM. W.TanksleyS. D. (1991). Isolation of Molecular Markers from Specific Chromosomal Intervals Using DNA Pools from Existing Mapping Populations. Nucleic Acids Res.19, 6553–6568. 10.1093/nar/19.23.6553
53
Gracia-RomeroA.KefauverS. C.Fernandez-GallegoJ. A.Vergara-DíazO.Nieto-TaladrizM. T.ArausJ. L. (2019). UAV and Ground Image-Based Phenotyping: a Proof of Concept with Durum Wheat. Remote Sens11, 1244. 10.3390/rs11101244
54
GuoJ.KhanJ.PradhanS.ShahiD.KhanN.AvciM.et al (2020a). Multi-trait Genomic Prediction of Yield-Related Traits in US Soft Wheat under Variable Water Regimes. Genes (Basel)11, 1–26. 10.3390/genes11111270
55
GuoJ.PradhanS.ShahiD.KhanJ.McbreenJ.BaiG.et al (2020b). Increased Prediction Accuracy Using Combined Genomic Information and Physiological Traits in a Soft Wheat Panel Evaluated in Multi-Environments. Sci. Rep.10, 1–12. 10.1038/s41598-020-63919-3
56
GuoZ.TuckerD. M.BastenC. J.GandhiH.ErsozE.GuoB.et al (2014). The Impact of Population Structure on Genomic Prediction in Stratified Populations. Theor. Appl. Genet.127, 749–762. 10.1007/s00122-013-2255-x
57
HeffnerE. L.JanninkJ. L.IwataH.SouzaE.SorrellsM. E. (2011). Genomic Selection Accuracy for Grain Quality Traits in Biparental Wheat Populations. Crop Sci.51, 2597–2606. 10.2135/cropsci2011.05.0253
58
HerterC. P.EbmeyerE.KollersS.KorzunV.WürschumT.MiedanerT. (2019). Accuracy of within- and Among-Family Genomic Prediction for Fusarium Head Blight and Septoria Tritici Blotch in winter Wheat. Theor. Appl. Genet.132, 1121–1135. 10.1007/s00122-018-3264-6
59
HeslotN.YangH. P.SorrellsM. E.JanninkJ. L. (2012). Genomic Selection in Plant Breeding: A Comparison of Models. Crop Sci.52, 146–160. 10.2135/cropsci2011.06.0297
60
HowardR.JarquinD. (2019). Genomic Prediction Using Canopy Coverage Image and Genotypic Information in Soybean via a Hybrid Model. Evol. Bioinform. Online15, 1176934319840026. 10.1177/1176934319840026
61
HuX.CarverB. F.PowersC.YanL.ZhuL.ChenC. (2019). Effectiveness of Genomic Selection by Response to Selection for winter Wheat Variety Improvement. Plant Genome12, 180090. 10.3835/plantgenome2018.11.0090
62
HuangM.BalimponyaE. G.MgonjaE. M.McHaleL. K.Luzi-KihupiA.WangG. L.et al (2019). Use of Genomic Selection in Breeding rice (Oryza Sativa L.) for Resistance to rice Blast (Magnaporthe Oryzae). Mol. Breed.39, 1–16. 10.1007/s11032-019-1023-2
63
HuangM.LiuX.ZhouY.SummersR. M.ZhangZ. (2018). BLINK: A Package for the Next Level of Genome-wide Association Studies with Both Individuals and Markers in the Millions. Gigascience8, 1–12. 10.1093/gigascience/giy154
64
HurburghC. R.BrummT. J.GuinnJ. M.HartwigR. A. (1990). Protein and Oil Patterns in U.S. And World Soybean Markets. J. Am. Oil Chem. Soc.67, 966–973. 10.1007/BF02541859
65
IsidroJ.JanninkJ. L.AkdemirD.PolandJ.HeslotN.SorrellsM. E. (2015). Training Set Optimization under Population Structure in Genomic Selection. Theor. Appl. Genet.128, 145–158. 10.1007/s00122-014-2418-4
66
Iyer-PascuzziA. S.SymonovaO.MileykoY.HaoY.BelcherH.HarerJ.et al (2010). Imaging and Analysis Platform for Automatic Phenotyping and Trait Ranking of Plant Root Systems. Plant Physiol.152, 1148–1157. 10.1104/pp.109.150748
67
JaganathanD.ThudiM.KaleS.AzamS.RoorkiwalM.GaurP. M.et al (2015). Genotyping-by-sequencing Based Intra-specific Genetic Map Refines a ‘‘QTL-Hotspot” Region for Drought Tolerance in Chickpea. Mol. Genet. Genomics290, 559–571. 10.1007/s00438-014-0932-3
68
JarquínD.LemesC.GaynorR. C.PolandJ.FritzA.HowardR.et al (2017). Increasing Genomic-Enabled Prediction Accuracy by Modeling Genotype ´ Environment Interactions in kansas Wheat. Plant Genome.10 (2), 1. 10.3835/plantgenome2016.12.0130
69
JarquínD.CrossaJ.LacazeX.Du CheyronP.DaucourtJ.LorgeouJ.et al (2014). A Reaction Norm Model for Genomic Selection Using High-Dimensional Genomic and Environmental Data. Theor. Appl. Genet.127, 595–607. 10.1007/s00122-013-2243-1
70
JarquinD.HowardR.XavierA.ChoudhuryDasS. (2018). Increasing Predictive Ability by Modeling Interactions between Environments, Genotype and Canopy Coverage Image Data for Soybeans. Agronomy8, 1. 10.3390/agronomy8040051
71
JarquinD.Kajiya-KanegaeH.TaishenC.YabeS.PersaR.YuJ.et al (2020). Coupling Day Length Data and Genomic Prediction Tools for Predicting Time-Related Traits under Complex Scenarios. Sci. Rep.10, 1. 10.1038/s41598-020-70267-9
72
JerniganK. L.GodoyJ. V.HuangM.ZhouY.MorrisC. F.Garland-CampbellK. A.et al (2018). Genetic Dissection of End-Use Quality Traits in Adapted Soft white winter Wheat. Front. Plant Sci.9, 1–15. 10.3389/fpls.2018.00271
73
JiaY.JanninkJ. L. (2012). Multiple-trait Genomic Selection Methods Increase Genetic Value Prediction Accuracy. Genetics192, 1513–1522. 10.1534/genetics.112.144246
74
Jimenez-BerniJ. A.DeeryD. M.Rozas-LarraondoP.CondonA.TonyG.RebetzkeG. J.et al (2018). High Throughput Determination of Plant Height, Ground Cover, and Above-Ground Biomass in Wheat with LiDAR. Front. Plant Sci.9, 237. 10.3389/fpls.2018.00237
75
JulianaP.Montesinos-LópezO. A.CrossaJ.MondalS.González PérezL.PolandJ.et al (2019). Integrating Genomic-Enabled Prediction and High-Throughput Phenotyping in Breeding for Climate-Resilient Bread Wheat. Theor. Appl. Genet.132, 177–194. 10.1007/s00122-018-3206-3
76
JulianaP.SinghR. P.BraunH.-J.Huerta-EspinoJ.Crespo-HerreraL.GovindanV.et al (2020). Genomic Selection for Grain Yield in the Cimmyt Wheat Breeding Program—Status and Perspectives. Front. Plant Sci.11, 1. 10.3389/fpls.2020.564183
77
KaurB.SandhuK. S.KamalR.KaurK.SinghJ.RöderM. S. (2021). Omics for the Improvement of Abiotic, Biotic and Agronomic Traits in Major Cereals: Applications, Challenges, and Prospects. Plants10, 1989. 10.3390/plants10101989
78
KearneyJ. (2010). Food Consumption Trends and Drivers. Philos. Trans. R. Soc. B Biol. Sci.365, 2793. 10.1098/RSTB.2010.0149
79
KellerB.Ariza-SuarezD.de la HozJ.AparicioJ. S.Portilla-BenavidesA. E.BuendiaH. F.et al (2020). Genomic Prediction of Agronomic Traits in Common Bean (Phaseolus vulgaris L.) under Environmental Stress. Front. Plant Sci.11, 1001. 10.3389/fpls.2020.01001
80
KhanA. W.GargV.RoorkiwalM.GoliczA. A.EdwardsD.VarshneyR. K. (2020). Super-pangenome by Integrating the Wild Side of a Species for Accelerated Crop Improvement. Trends Plant Sci.25, 148–158. 10.1016/j.tplants.2019.10.012
81
KobayashiT.KandaE.KitadaK.IshiguroK.TorigoeY. (2001). Detection of rice Panicle Blast with Multispectral Radiometer and the Potential of Using Airborne Multispectral Scanners. Phytopathology91, 316–323. 10.1094/PHYTO.2001.91.3.316
82
KorteA.AshleyF. (2013). The Advantages and Limitations of Trait Analysis with GWAS : a Review. Plant Methods9, 29. 10.1186/1746-4811-9-29
83
KrauseM. R.González-PérezL.CrossaJ.Pérez-RodríguezP.Montesinos-LópezO.SinghR. P.et al (2019). Hyperspectral Reflectance-Derived Relationship Matrices for Genomic Prediction of Grain Yield in Wheat. G3 Genes, Genomes, Genet.9, 1231–1247. 10.1534/g3.118.200856
84
KrauseM. R.MondalS.CrossaJ.SinghR. P.PintoF.HaghighattalabA.et al (2020). Aerial High-Throughput Phenotyping Enables Indirect Selection for Grain Yield at the Early Generation, Seed-Limited Stages in Breeding Programs. Crop Sci.60, 3096–3114. 10.1002/csc2.20259
85
LanderE. S.BotsteinD. (1989). Mapping Mendelian Factors Underlying Quantitative Traits Using RFLP Linkage Maps. Genetics121, 185–199. 10.1093/genetics/121.1.185
86
Le MariéC.KirchgessnerN.MarschallD.WalterA.HundA. (2014). Rhizoslides: Paper-Based Growth System for Non-destructive, High Throughput Phenotyping of Root Development by Means of Image Analysis. Plant Methods10, 1–16. 10.1186/1746-4811-10-13
87
LiB.ZhangN.WangY.-G.GeorgeA. W.ReverterA.LiY. (2018a). Genomic Prediction of Breeding Values Using a Subset of Snps Identified by Three Machine Learning Methods. Front. Genet.9, 237. 10.3389/fgene.2018.00237
88
LiX.GuoZ.LvY.CenX.DingX.WuH.et al (2017). Genetic Control of the Root System in rice under normal and Drought Stress Conditions by Genome-wide Association Study. Plos Genet.13, e1006889. 10.1371/journal.pgen.1006889
89
LiY.RuperaoP.BatleyJ.EdwardsD.KhanT.ColmerT. D.et al (2018b). Investigating Drought Tolerance in Chickpea Using Genome-wide Association Mapping and Genomic Selection Based on Whole-Genome Resequencing Data. Front. Plant Sci.9, 190. 10.3389/fpls.2018.00190
90
LiuX.WangH.WangH.GuoZ.XuX.LiuJ.et al (2018). Factors Affecting Genomic Selection Revealed by Empirical Evidence in maize. Crop J.6, 341–352. 10.1016/J.CJ.2018.03.005
91
LiuY.DuH.LiP.ShenY.PengH.LiuS.et al (2020). Pan-genome of Wild and Cultivated Soybeans. Cell182, 162–176. e13. 10.1016/J.CELL.2020.05.023
92
LiuY.WangD.HeF.WangJ.JoshiT.XuD. (2019). Phenotype Prediction and Genome-wide Association Study Using Deep Convolutional Neural Network of Soybean. Front. Genet.10, 1091. 10.3389/fgene.2019.01091
93
LopesM. S.ReynoldsM. P.ManesY.SinghR. P.CrossaJ.BraunH. J. (2012). Genetic Yield Gains and Changes in Associated Traits of CIMMYT spring Bread Wheat in a “Historic” Set Representing 30 Years of Breeding. Crop Sci.52, 1123–1131. 10.2135/cropsci2011.09.0467
94
LozadaD.GodoyJ. V.MurrayT. D.WardB. P.CarterA. H.CarterA. H. (2019). Genetic Dissection of Snow Mold Tolerance in US pacific Northwest winter Wheat through Genome-wide Association Study and Genomic Selection. Front. Plant Sci.10, 1–15. 10.3389/fpls.2019.01337
95
LozadaD. N.CarterA. H. (2019). Accuracy of Single and Multi-Trait Genomic Prediction Models for Grain Yield in Us pacific Northwest winter Wheat. Crop Breeding, Genet. Genomics2019, 1. 10.20900/cbgg20190012
96
LozadaD. N.CarterA. H. (2020). Genomic Selection in winter Wheat Breeding Using a Recommender Approach. Genes11, 779. 10.3390/genes11070779
97
LozadaD. N.GodoyJ. V.WardB. P.CarterA. H. (2020a). Genomic Prediction and Indirect Selection for Grain Yield in US pacific Northwest winter Wheat Using Spectral Reflectance Indices from High-Throughput Phenotyping. Int. J. Mol. Sci.21, 1. 10.3390/ijms21010165
98
LozadaD. N.WardB. P.CarterA. H. (2020b). Gains through Selection for Grain Yield in a winter Wheat Breeding Program. PLoS One15, e0221603. 10.1371/journal.pone.0221603
99
MaW.QiuZ.SongJ.LiJ.ChengQ.ZhaiJ.et al (2018). A Deep Convolutional Neural Network Approach for Predicting Phenotypes from Genotypes. Planta248, 1307–1318. 10.1007/s00425-018-2976-9
100
MairhoferS.SturrockC. J.BennettM. J.MooneyS. J.PridmoreT. P. (2015). Extracting Multiple Interacting Root Systems Using X-ray Microcomputed Tomography. Plant J.84, 1034–1043. 10.1111/tpj.13047
101
MaldonadoC.Mora-PobleteF.Contreras-SotoR. I.AhmarS.ChenJ.-T.do Amaral JúniorA. T.et al (2020). Genome-wide Prediction of Complex Traits in Two Outcrossing Plant Species through Deep Learning and Bayesian Regularized Neural Network. Front. Plant Sci.11, 1808. 10.3389/fpls.2020.593897
102
MasukaB.AtlinG. N.OlsenM.MagorokoshoC.LabuschagneM.CrossaJ.et al (2017a). Gains in maize Genetic Improvement in Eastern and Southern Africa: I. CIMMYT Hybrid Breeding Pipeline. Crop Sci.57, 168–179. 10.2135/CROPSCI2016.05.0343
103
MasukaB.MagorokoshoC.OlsenM.AtlinG. N.BänzigerM.PixleyK. V.et al (2017b). Gains in maize Genetic Improvement in Eastern and Southern Africa: II. CIMMYT Open-Pollinated Variety Breeding Pipeline. Crop Sci.57, 180–191. 10.2135/CROPSCI2016.05.0408
104
MathieuL.LobetG.TocquinP.PérilleuxC. (2015). “Rhizoponics”: A Novel Hydroponic Rhizotron for Root System Analyses on Mature Arabidopsis thaliana Plants. Plant Methods11, 1–8. 10.1186/s13007-015-0046-x
105
Mérida-GarcíaR.LiuG.HeS.Gonzalez-DugoV.DoradoG.GálvezS.et al (2019). Genetic Dissection of Agronomic and Quality Traits Based on Association Mapping and Genomic Selection Approaches in Durum Wheat Grown in Southern Spain. PLoS One14, e0211718. 10.1371/journal.pone.0211718
106
MerryR.ButenhoffK.CampbellB. W.MichnoJ.-M.WangD.OrfJ. H.et al (2019). Identification and fine-mapping of a Soybean Quantitative Trait Locus on Chromosome 5 Conferring Tolerance to Iron Deficiency Chlorosis. Plant Genome12, 190007. 10.3835/PLANTGENOME2019.01.0007
107
MetznerR.EggertA.van DusschotenD.PflugfelderD.GerthS.SchurrU.et al (2015). Direct Comparison of MRI and X-ray CT Technologies for 3D Imaging of Root Systems in Soil: Potential and Challenges for Root Trait Quantification. Plant Methods11, 17. 10.1186/s13007-015-0060-z
108
MeuwissenT. H. E.HayesB. J.GoddardM. E. (2001). Prediction of Total Genetic Value Using Genome-wide Dense Marker Maps. Genetics157, 1819–1829. 10.1093/genetics/157.4.1819
109
MewesT.FrankeJ.MenzG. (2011). Spectral Requirements on Airborne Hyperspectral Remote Sensing Data for Wheat Disease Detection. Precis. Agric.12, 795–812. 10.1007/s11119-011-9222-9
110
MichelS.KummerC.GalleeM.HellingerJ.AmetzC.AkgölB.et al (2018). Improving the Baking Quality of Bread Wheat by Genomic Selection in Early Generations. Theor. Appl. Genet.131, 477–493. 10.1007/s00122-017-2998-x
111
MichelS.LöschenbergerF.AmetzC.PachlerB.SparryE.BürstmayrH. (2019a). Combining Grain Yield, Protein Content and Protein Quality by Multi-Trait Genomic Selection in Bread Wheat. Theor. Appl. Genet.132, 2767–2780. 10.1007/s00122-019-03386-1
112
MichelS.LöschenbergerF.HellingerJ.StrasserV.AmetzC.PachlerB.et al (2019b). Improving and Maintaining winter Hardiness and Frost Tolerance in Bread Wheat by Genomic Selection. Front. Plant Sci.10, 1195. 10.3389/fpls.2019.01195
113
MilletE. J.KruijerW.Coupel-LedruA.Alvarez PradoS.Cabrera-BosquetL.LacubeS.et al (2019). Genomic Prediction of maize Yield across European Environmental Conditions. Nat. Genet.516 (51), 952–956. 10.1038/s41588-019-0414-y
114
MishraR.JoshiR. K.ZhaoK. (2018). Genome Editing in rice: Recent Advances, Challenges, and Future Implications. Front. Plant Sci.9, 1361. 10.3389/fpls.2018.01361
115
MoC.KimM. S.KimG.CheongE. J.YangJ.LimJ. (2015). Detecting Drought Stress in Soybean Plants Using Hyperspectral Fluorescence Imaging. J. Biosyst. Eng.40, 335–344. 10.5307/jbe.2015.40.4.335
116
MohammadiM.TiedeT.SmithK. P. (2015). Popvar: A Genome-wide Procedure for Predicting Genetic Variance and Correlated Response in Biparental Breeding Populations. Crop Sci.55, 2068–2077. 10.2135/cropsci2015.01.0030
117
MontenegroJ. D.GoliczA. A.BayerP. E.HurgobinB.LeeH.ChanC.-K. K.et al (2017). The Pangenome of Hexaploid Bread Wheat. Plant J.90, 1007–1013. 10.1111/tpj.13515
118
Montesinos-LópezO. A.Martín-VallejoJ.CrossaJ.GianolaD.Hernández-SuárezC. M.Montesinos-LópezA.et al (2019a). A Benchmarking between Deep Learning, Support Vector Machine and Bayesian Threshold Best Linear Unbiased Prediction for Predicting Ordinal Traits in Plant Breeding. G3 Genes, Genomes, Genet.9, 601–618. 10.1534/g3.118.200998
119
Montesinos-LópezO. A.Martín-VallejoJ.CrossaJ.GianolaD.Hernández-SuárezC. M.Montesinos-LópezA.et al (2019b). New Deep Learning Genomic-Based Prediction Model for Multiple Traits with Binary, Ordinal, and Continuous Phenotypes. G3 Genes, Genomes, Genet.9, 1545–1556. 10.1534/g3.119.300585
120
Montesinos-LópezO. A.Montesinos-LópezA.CrossaJ.GianolaD.Hernández-SuárezC. M.Martín-VallejoJ. (2018). Multi-trait, Multi-Environment Deep Learning Modeling for Genomic-Enabled Prediction of Plant Traits. G3 Genes, Genomes, Genet.8, 3829–3840. 10.1534/g3.118.200728
121
Montesinos-LópezO. A.Montesinos-LópezA.CrossaJ.los CamposG.AlvaradoG.SuchismitaM.et al (2017). Predicting Grain Yield Using Canopy Hyperspectral Reflectance in Wheat Breeding Data. Plant Methods13, 1–23. 10.1186/s13007-016-0154-2
122
Montesinos-LópezO. A.Montesinos-LópezA.TuberosaR.MaccaferriM.SciaraG.AmmarK.et al (2019c). Multi-trait, Multi-Environment Genomic Prediction of Durum Wheat with Genomic Best Linear Unbiased Predictor and Deep Learning Methods. Front. Plant Sci.10, 1311. 10.3389/fpls.2019.01311
123
MonteverdeE.GutierrezL.BlancoP.Pérez de VidaF.RosasJ. E.BonnecarrèreV.et al (2019). Integrating Molecular Markers and Environmental Covariates to Interpret Genotype by Environment Interaction in rice (Oryza Sativa L.) Grown in Subtropical Areas. G3 (Bethesda)9 (5), 1519–1531. 10.1534/g3.119.400064
124
MoreiraF. F.HearstA. A.CherkauerK. A.RaineyK. M. (2019). Improving the Efficiency of Soybean Breeding with High-Throughput Canopy Phenotyping. Plant Methods15, 139. 10.1186/s13007-019-0519-4
125
MoreiraF. F.OliveiraH. R.VolenecJ. J.RaineyK. M.BritoL. F. (2020). Integrating High-Throughput Phenotyping and Statistical Genomic Methods to Genetically Improve Longitudinal Traits in Crops. Front. Plant Sci.11, 681. 10.3389/fpls.2020.00681
126
Muhu-Din AhmedH. G.SajjadM.ZengY.IqbalM.Habibullah KhanS.UllahA.et al (2020). Genome-wide Association Mapping through 90k Snp Array for Quality and Yield Attributes in Bread Wheat against Water-Deficit Conditions. Agriculture10, 392. 10.3390/agriculture10090392
127
MujibiF. D. N.NkrumahJ. D.DurunnaO. N.StothardP.MahJ.WangZ.et al (2011). Accuracy of Genomic Breeding Values for Residual Feed Intake in Crossbred Beef Cattle. J. Anim. Sci.89, 3353–3361. 10.2527/JAS.2010-3361
128
NormanA.TaylorJ.EdwardsJ.KuchelH. (2018). Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy. G3 Genes, Genomes, Genet.8, 2889–2899. 10.1534/g3.118.200311
129
OdilbekovF.ArmonienéR.KocA.SvenssonJ.ChawadeA. (2019). GWAS-assisted Genomic Prediction to Predict Resistance to Septoria Tritici Blotch in Nordic winter Wheat at Seedling Stage. Front. Genet.10, 1224. 10.3389/fgene.2019.01224
130
Paez-GarciaA.MotesC.ScheibleW.-R.ChenR.BlancaflorE.MonterosM. (2015). Root Traits and Phenotyping Strategies for Plant Improvement. Plants4, 334–355. 10.3390/plants4020334
131
PandeyM. K.ChaudhariS.JarquinD.JanilaP.CrossaJ.PatilS. C.et al (2020a). Genome-based Trait Prediction in Multi- Environment Breeding Trials in Groundnut. Theor. Appl. Genet.133, 3101–3117. 10.1007/s00122-020-03658-1
132
PandeyM. K.PandeyA. K.KumarR.NwosuC. V.GuoB.WrightG. C.et al (2020b). Translational Genomics for Achieving Higher Genetic Gains in Groundnut. Theor. Appl. Genet.133, 1679–1702. 10.1007/s00122-020-03592-2
133
PandeyM. K.RoorkiwalM.SinghV. K.RamalingamA.KudapaH.ThudiM.et al (2016). Emerging Genomic Tools for Legume Breeding: Current Status and Future Prospects. Front. Plant Sci.7, 455. 10.3389/fpls.2016.00455
134
PattanashettiS. K.PandeyM. K.NaiduG. K.VishwakarmaM. K.SinghO. K.ShasidharY.et al (2020). Identification of Quantitative Trait Loci Associated with Iron Deficiency Chlorosis Resistance in Groundnut ( Arachis hypogaea ). Plant Breed139, 790–803. 10.1111/pbr.12815
135
PérezP.De Los CamposG. (2014). Genome-wide Regression and Prediction with the BGLR Statistical Package. Genetics198, 483–495. 10.1534/genetics.114.164442
136
PflugfelderD.MetznerR.DusschotenD.ReichelR.JahnkeS.KollerR. (2017). Non-invasive Imaging of Plant Roots in Different Soils Using Magnetic Resonance Imaging (MRI). Plant Methods13, 102. 10.1186/s13007-017-0252-9
137
PiñerosM. A.LarsonB. G.ShaffJ. E.SchneiderD. J.FalcãoA. X.YuanL.et al (2016). Evolving Technologies for Growing, Imaging and Analyzing 3D Root System Architecture of Crop Plants. J. Integr. Plant Biol.58, 230–241. 10.1111/jipb.12456
138
PingaliP. L. (2012). Green Revolution: Impacts, Limits, and the Path Ahead. Proc. Natl. Acad. Sci.109, 12302–12308. 10.1073/PNAS.0912953109
139
PolandJ. A.BrownP. J.SorrellsM. E.JanninkJ. L. (2012). Development of High-Density Genetic Maps for Barley and Wheat Using a Novel Two-Enzyme Genotyping-By-Sequencing Approach. PLoS One7, 1. 10.1371/journal.pone.0032253
140
PolandJ. A.RifeT. W. (2012). Genotyping-by-sequencing for Plant Breeding and Genetics. Plant Genome5, 92–102. 10.3835/plantgenome2012.05.0005
141
PolandJ. (2015). Breeding-assisted Genomics. Curr. Opin. Plant Biol.24, 119–124. 10.1016/j.pbi.2015.02.009
142
PoundM. P.FrenchA. P.AtkinsonJ. A.WellsD. M.BennettM. J.PridmoreT. (2013). RootNav: Navigating Images of Complex Root Architectures. Plant Physiol.162, 1802–1814. 10.1104/pp.113.221531
143
QinJ.ShiA.SongQ.LiS.WangF.CaoY.et al (2019). Genome Wide Association Study and Genomic Selection of Amino Acid Concentrations in Soybean Seeds. Front. Plant Sci.10, 1445. 10.3389/fpls.2019.01445
144
RavelombolaW. S.QinJ.ShiA.NiceL.BaoY.LorenzA.et al (2019). Genome-wide Association Study and Genomic Selection for Soybean Chlorophyll Content Associated with Soybean Cyst Nematode Tolerance. BMC Genomics20, 904. 10.1186/s12864-019-6275-z
145
RavelombolaW. S.QinJ.ShiA.NiceL.BaoY.LorenzA.et al (2020). Genome-wide Association Study and Genomic Selection for Tolerance of Soybean Biomass to Soybean Cyst Nematode Infestation. PLoS One15, e0235089. 10.1371/journal.pone.0235089
146
RayD. K.MuellerN. D.WestP. C.FoleyJ. A. (2013). Yield Trends Are Insufficient to Double Global Crop Production by 2050. PLoS One8, e66428. 10.1371/journal.pone.0066428
147
RayD. K.RamankuttyN.MuellerN. D.WestP. C.FoleyJ. A. (2012). Recent Patterns of Crop Yield Growth and Stagnation. Nat. Commun.3, 1–7. 10.1038/ncomms2296
148
RincentR.CharpentierJ. P.Faivre-RampantP.PauxE.Le GouisJ.BastienC.et al (2018). Phenomic Selection Is a Low-Cost and High-Throughput Method Based on Indirect Predictions: Proof of Concept on Wheat and poplar. G3 Genes, Genomes, Genet.8, 3961–3972. 10.1534/g3.118.200760
149
RoorkiwalM.JainA.KaleS. M.DoddamaniD.ChitikineniA.ThudiM.et al (2018a). Development and Evaluation of High-Density Axiom ® CicerSNP Array for High-Resolution Genetic Mapping and Breeding Applications in Chickpea. Plant Biotechnol. J.16, 890–901. 10.1111/pbi.12836
150
RoorkiwalM.JarquinD.SinghM. K.GaurP. M.BharadwajC.RathoreA.et al (2018b). Genomic-enabled Prediction Models Using Multi-Environment Trials to Estimate the Effect of Genotype × Environment Interaction on Prediction Accuracy in Chickpea. Sci. Rep.8, 11701. 10.1038/s41598-018-30027-2
151
RutkoskiJ.PolandJ.MondalS.AutriqueE.PérezL. G.CrossaJ.et al (2016). Canopy Temperature and Vegetation Indices from High-Throughput Phenotyping Improve Accuracy of Pedigree and Genomic Selection for Grain Yield in Wheat. G3 Genes, Genomes, Genet.6, 2799–2808. 10.1534/g3.116.032888
152
SainiD. K.ChopraY.SinghJ.SandhuK. S.KumarA.BazzerS.et al (2022). Comprehensive Evaluation of Mapping Complex Traits in Wheat Using Genome-wide Association Studies. Mol. Breed.42, 1. 10.1007/s11032-021-01272-7
153
SandhuK. S.AounM.MorrisC. F.CarterA. H. (2021a). Genomic Selection for End-Use Quality and Processing Traits in Soft white winter Wheat Breeding Program with Machine and Deep Learning Models. Biol10, 689. 10.3390/BIOLOGY10070689
154
SandhuK. S.LozadaD. N.ZhangZ.PumphreyM. O.CarterA. H. (2021b). Deep Learning for Predicting Complex Traits in spring Wheat Breeding Program. Front. Plant Sci.11, 613325. 10.3389/fpls.2020.613325
155
SandhuK. S.MihalyovP. D.LewienM. J.PumphreyM. O.CarterA. H. (2021c). Combining Genomic and Phenomic Information for Predicting Grain Protein Content and Grain Yield in spring Wheat. Front. Plant Sci.12, 170. 10.3389/fpls.2021.613300
156
SandhuK. S.MihalyovP. D.LewienM. J.PumphreyM. O.CarterA. H. (2021d). Genomic Selection and Genome-wide Association Studies for for Grain Protein Content Stability in a Nested Association Mapping Population of spring Wheat. Agronomy11, 2528. 10.3390/agronomy11122528
157
SandhuK. S.PatilS. S.PumphreyM. O.CarterA. H. (2021e). Multi-trait Machine and Deep Learning Models for Genomic Selection Using Spectral Information in a Wheat Breeding Program. Plant Genome14 (3), e20119. 10.1002/tpg2.20119
158
SangjanW.CarterA. H.PumphreyM. O.JitkovV.SankaranS. (2021). Development of a Raspberry Pi-Based Sensor System for Automated In-Field Monitoring to Support Crop Breeding Programs. Invent6, 42. 10.3390/INVENTIONS6020042
159
SankaranS.KhotL. R.CarterA. H. (2015a). Field-based Crop Phenotyping: Multispectral Aerial Imaging for Evaluation of winter Wheat Emergence and spring Stand. Comput. Electron. Agric.118, 372–379. 10.1016/j.compag.2015.09.001
160
SankaranS.KhotL. R.EspinozaC. Z.JarolmasjedS.SathuvalliV. R.VandemarkG. J.et al (2015b). Low-altitude, High-Resolution Aerial Imaging Systems for Row and Field Crop Phenotyping: A Review. Eur. J. Agron.70, 112–123. 10.1016/j.eja.2015.07.004
161
SankaranS.QuirósJ. J.MiklasP. N. (2019). Unmanned Aerial System and Satellite-Based High Resolution Imagery for High-Throughput Phenotyping in Dry Bean. Comput. Electron. Agric.165, 104965. 10.1016/j.compag.2019.104965
162
SarinelliJ. M.MurphyJ. P.TyagiP.HollandJ. B.JohnsonJ. W.MergoumM.et al (2019). Training Population Selection and Use of Fixed Effects to Optimize Genomic Predictions in a Historical USA winter Wheat Panel. Theor. Appl. Genet.132, 1247–1261. 10.1007/s00122-019-03276-6
163
SchmutzJ.CannonS. B.SchlueterJ.MaJ.MitrosT.NelsonW.et al (2010). Genome Sequence of the Palaeopolyploid Soybean. Nature463, 178–183. 10.1038/nature08670
164
SchmutzJ.McCleanP. E.MamidiS.WuG. A.CannonS. B.GrimwoodJ.et al (2014). A Reference Genome for Common Bean and Genome-wide Analysis of Dual Domestications. Nat. Genet.46, 707–713. 10.1038/ng.3008
165
SearsE. R.GustafsonJ. P. (1993). Use of Radiation to Transfer Alien Chromosome Segments to Wheat. Crop Sci.33, 897–901. 10.2135/cropsci1993.0011183X003300050004x
166
SeckW.TorkamanehD.BelzileF. (2020). Comprehensive Genome-wide Association Analysis Reveals the Genetic Basis of Root System Architecture in Soybean. Front. Plant Sci.11, 590740. 10.3389/fpls.2020.590740
167
SeptiningsihE. M.PamplonaA. M.SanchezD. L.NeerajaC. N.VergaraG. V.HeuerS.et al (2009). Development of Submergence-Tolerant rice Cultivars: the Sub1 Locus and beyond. Ann. Bot.103, 151–160. 10.1093/AOB/MCN206
168
ShabannejadM.BihamtaM. R.Majidi-HervanE.AlipourH.EbrahimiA. (2020). A Simple, Cost-Effective High-Throughput Image Analysis Pipeline Improves Genomic Prediction Accuracy for Days to Maturity in Wheat. Plant Methods16, 146. 10.1186/s13007-020-00686-2
169
ShookJ. M.ZhangJ.JonesS. E.SinghA.DiersB. W.SinghA. K. (2021). Meta-GWAS for Quantitative Trait Loci Identification in Soybean. G3 Genes, Genomes, Genet.11, 1. 10.1093/G3JOURNAL/JKAB117
170
SinghA.GanapathysubramanianB.SinghA. K.SarkarS. (2016). Machine Learning for High-Throughput Stress Phenotyping in Plants. Trends Plant Sci.21, 110–124. 10.1016/j.tplants.2015.10.015
171
SmithA. G.HanE.PetersenJ.AlvinN.OlsenF.GieseC.et al (2020). RootPainter: Deep Learning Segmentation of Biological Images with Corrective Annotation. bioRxiv NA16, 044461. 10.1101/2020.04.16.044461
172
SunC.HuZ.ZhengT.LuK.ZhaoY.WangW.et al (2017a). RPAN: Rice Pan-Genome Browser for ∼3000 rice Genomes. Nucleic Acids Res.45, 597–605. 10.1093/nar/gkw958
173
SunJ.PolandJ. A.MondalS.CrossaJ.JulianaP.SinghR. P.et al (2019). High-throughput Phenotyping Platforms Enhance Genomic Selection for Wheat Grain Yield across Populations and Cycles in Early Stage. Theor. Appl. Genet.132, 1705–1720. 10.1007/s00122-019-03309-0
174
SunJ.RutkoskiJ. E.PolandJ. A.CrossaJ.JanninkJ.SorrellsM. E. (2017b). Multitrait, Random Regression , or Simple Repeatability Model in High-Throughput Phenotyping Data Improve Genomic Prediction for Wheat Grain Yield. Plant Genome10, 1. 10.3835/plantgenome2016.11.0111
175
TakagiH.AbeA.YoshidaK.KosugiS.NatsumeS.MitsuokaC.et al (2013). QTL-seq: Rapid Mapping of Quantitative Trait Loci in rice by Whole Genome Resequencing of DNA from Two Bulked Populations. Plant J.74, 174–183. 10.1111/tpj.12105
176
ThudiM.PalakurthiR.SchnableJ. C.ChitikineniA.DreisigackerS.MaceE.et al (2020). Genomic Resources in Plant Breeding for Sustainable Agriculture. J. Plant Physiol.257, 153351. 10.1016/j.jplph.2020.153351
177
Tibbs CortesL.ZhangZ.YuJ. (2021). Status and Prospects of Genome-wide Association Studies in Plants. Plant Genome14, e20077. 10.1002/tpg2.20077
178
TishbiraniR. (1996). Regression Shrinkage and Selection via the Lasso. J. R. Stat. Soc. Ser. B58, 267–288. 10.1111/j.2517-6161.1996.tb02080.x
179
TsaiH.-Y.CericolaF.EdrissV.AndersenJ. R.OrabiJ.JensenJ. D.et al (2020). Use of Multiple Traits Genomic Prediction, Genotype by Environment Interactions and Spatial Effect to Improve Prediction Accuracy in Yield Data. PLoS One15, e0232665. 10.1371/journal.pone.0232665
180
VarshneyR. K.BertioliD. J.MoretzsohnM. C.VadezV.KrishnamurthyL.ArunaR.et al (2009). The First SSR-Based Genetic Linkage Map for Cultivated Groundnut (Arachis hypogaea L). Theor. Appl. Genet.118, 729–739. 10.1007/s00122-008-0933-x
181
VarshneyR. K.BohraA.RoorkiwalM.BarmukhR.CowlingW. A.ChitikineniA.et al (2021). Trends in Genetics Fast-Forward Breeding for a Food-Secure World. Trends Genet.37, 1–13. 10.1016/j.tig.2021.08.002
182
VarshneyR. K.SongC.SaxenaR. K.AzamS.YuS.SharpeA. G.et al (2013). Draft Genome Sequence of Chickpea (Cicer Arietinum) Provides a Resource for Trait Improvement. Nat. Biotechnol.31, 240–246. 10.1038/nbt.2491
183
VarshneyR. K.ThudiM.RoorkiwalM.HeW.UpadhyayaH. D.YangW.et al (2019). Resequencing of 429 Chickpea Accessions from 45 Countries Provides Insights into Genome Diversity, Domestication and Agronomic Traits. Nat. Genet.51, 857–864. 10.1038/s41588-019-0401-3
184
VermaS.GuptaS.BandhiwalN.KumarT.BharadwajC.BhatiaS. (2015). High-density Linkage Map Construction and Mapping of Seed Trait QTLs in Chickpea (Cicer Arietinum L.) Using Genotyping-By-Sequencing (GBS). Sci. Rep.5, 1–14. 10.1038/srep17512
185
WangJ.ZhouZ.ZhangZ.LiH.LiuD.ZhangQ.et al (2018). Expanding the BLUP Alphabet for Genomic Prediction Adaptable to the Genetic Architectures of Complex Traits. Heredity (Edinb).121, 648–662. 10.1038/s41437-018-0075-0
186
WangS.WongD.ForrestK.AllenA.ChaoS.HuangB. E.et al (2014). Characterization of Polyploid Wheat Genomic Diversity Using a High-Density 90 000 Single Nucleotide Polymorphism Array. Plant Biotechnol. J.12, 787–796. 10.1111/pbi.12183
187
WarringtonC. V.Abdel-HaleemH.HytenD. L.CreganP. B.OrfJ. H.KillamA. S.et al (2015). QTL for Seed Protein and Amino Acids in the Benning × Danbaekkong Soybean Population. Theor. Appl. Genet.128, 839–850. 10.1007/s00122-015-2474-4
188
WasayaA.ZhangX.FangQ.YanZ. (2018). Root Phenotyping for Drought Tolerance: A Review. Agronomy8, 241. 10.3390/agronomy8110241
189
WatsonA.GhoshS.WilliamsM. J.CuddyW. S.SimmondsJ.ReyM. D.et al (2018). Speed Breeding Is a Powerful Tool to Accelerate Crop Research and Breeding. Nat. Plants4, 23–29. 10.1038/s41477-017-0083-8
190
WernerC. R.Voss-FelsK. P.MillerC. N.QianW.HuaW.GuanC.-Y.et al (2018). Effective Genomic Selection in a Narrow-Genepool Crop with Low-Density Markers: Asian Rapeseed as an Example. Plant Genome11, 170084. 10.3835/PLANTGENOME2017.09.0084
191
WuL.ChangY.WangL.WuJ.WangS. (2021). Genetic Dissection of Drought Resistance Based on Root Traits at the Bud Stage in Common Bean. Theor. Appl. Genet.1, 3. 10.1007/s00122-020-03750-6
192
XuX.ZengL.TaoY.VuongT.WanJ.BoermaR.et al (2013). Pinpointing Genes Underlying the Quantitative Trait Loci for Root-Knot Nematode Resistance in Palaeopolyploid Soybean by Whole Genome Resequencing. Proc. Natl. Acad. Sci. U. S. A.110, 13469–13474. 10.1073/pnas.1222368110
193
YabeS.YoshidaH.Kajiya-KanegaeH.YamasakiM.IwataH.EbanaK.et al (2018). Description of Grain Weight Distribution Leading to Genomic Selection for Grain-Filling Characteristics in rice. PLoS One13, e0207627. 10.1371/journal.pone.0207627
194
Yoosefzadeh-NajafabadiM.EarlH. J.TulpanD.SulikJ.EskandariM. (2021). Application of Machine Learning Algorithms in Plant Breeding: Predicting Yield from Hyperspectral Reflectance in Soybean. Front. Plant Sci.11, 2169. 10.3389/fpls.2020.624273
195
YuJ.HollandJ. B.McmullenM. D.BucklerE. S. (2008). Genetic Design and Statistical Power of Nested Association Mapping in maize. Genetics178, 539–551. 10.1534/genetics.107.074245
196
YuJ.PressoirG.BriggsW. H.BiI. V.YamasakiM.DoebleyJ. F.et al (2006). A Unified Mixed-Model Method for Association Mapping that Accounts for Multiple Levels of Relatedness. Nat. Genet.38, 203–208. 10.1038/ng1702
197
Zaman-AllahM.VergaraO.ArausJ. L.TarekegneA.MagorokoshoC.Zarco-TejadaP. J.et al (2015). Unmanned Aerial Platform-Based Multi-Spectral Imaging for Field Phenotyping of maize. Plant Methods11, 35. 10.1186/s13007-015-0078-2
198
ZhangC.ChenW.SankaranS. (2019). High-throughput Field Phenotyping of Ascochyta Blight Disease Severity in Chickpea. Crop Prot.125, 104885. 10.1016/j.cropro.2019.104885
199
ZhangC.CraineW. A.McGeeR. J.VandemarkG. J.DavisJ. B.BrownJ.et al (2020a). Image-based Phenotyping of Flowering Intensity in Cool-Season Crops. Sensors (Switzerland)20, 1–17. 10.3390/s20051450
200
ZhangC.McGeeR. J.VandemarkG. J.SankaranS. (2021). Crop Performance Evaluation of Chickpea and Dry Pea Breeding Lines across Seasons and Locations Using Phenomics Data. Front. Plant Sci.12, 1–13. 10.3389/fpls.2021.640259
201
ZhangD.ZhouX.ZhangJ.LanY.XuC.LiangD. (2018). Detection of rice Sheath Blight Using an Unmanned Aerial System with High-Resolution Color and Multispectral Imaging. PLoS One13, e0187470. 10.1371/journal.pone.0187470
202
ZhangQ.XuM.XiaX.KomatsudaT.VarshneyR. K.ShiK. (2020b). Crop Genetics Research in Asia: Improving Food Security and Nutrition. Theor. Appl. Genet.133, 1339–1344. 10.1007/s00122-020-03597-x
203
ZhaoY.JiangC. H.RehmanR. M. A.ZhangH. L.LiJ.LiZ. C. (2019). Genetic Analysis of Roots and Shoots in rice Seedling by Association Mapping. Genes and Genomics41, 95–105. 10.1007/S13258-018-0741-X/FIGURES/4
204
ZhuC.GoreM.BucklerE. S.YuJ. (2008). Status and Prospects of Association Mapping in Plants. Plant Genome J.1, 5–20. 10.3835/plantgenome2008.02.0089
Summary
Keywords
genetic gain, genomics, high throughput phenotyping, machine and deep learning, plant breeding, root phenomics
Citation
Sandhu KS, Merrick LF, Sankaran S, Zhang Z and Carter AH (2022) Prospectus of Genomic Selection and Phenomics in Cereal, Legume and Oilseed Breeding Programs. Front. Genet. 12:829131. doi: 10.3389/fgene.2021.829131
Received
05 December 2021
Accepted
31 December 2021
Published
21 January 2022
Volume
12 - 2021
Edited by
Muhammad Sajjad, COMSATS University, Islamabad Campus, Pakistan
Reviewed by
Sachin Kumar, Chaudhary Charan Singh University, India
Chengsong Zhu, University of Texas Southwestern Medical Center, United States
Updates
Copyright
© 2022 Sandhu, Merrick, Sankaran, Zhang and Carter.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Karansher S. Sandhu, k.sandhu@wsu.edu
This article was submitted to Evolutionary and Population Genetics, a section of the journal Frontiers in Genetics
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.