- 1Plant Breeding and Genetics Section, School of Integrative Plant Science, College of Agriculture and Life Sciences, Cornell University, Ithaca, NY, United States
- 2Cassava Breeding Program, International Institute of Tropical Agriculture (IITA), Ibadan, Nigeria
- 3Plant and Soil Sciences, College of Agriculture Food and Environment, University of Kentucky, Lexington, KY, United States
- 4Crops for Nutrition and Health, Alliance Bioversity and International Center for Tropical Agriculture (CIAT), Nairobi, Kenya
- 5Global Food Systems Institute, University of Florida, Gainesville, FL, United States
- 6Department of Agricultural and Biological Engineering, University of Florida, Gainesville, FL, United States
- 7Robert W. Holley Center for Agriculture and Health, United States Department of Agriculture, Agricultural Research Service (USDA-ARS), Ithaca, NY, United States
Cassava (Manihot esculenta Crantz) is a critical food security crop for sub-Saharan Africa. Efforts to improve cassava through breeding have expanded over the past decade. Crop growth models (CGM) are becoming common place in breeding efforts to expand the inference of evaluations of breeding germplasm to environments that have not been tested and to prepare for breeding for adaptation to future climates. We parameterized a CGM, the CROPGRO-MANIHOT-Cassava model in the DSSAT family of models, using data on 67 clones from the International Institute of Tropical Agriculture cassava breeding program evaluated from 2017 to 2020 and over eight locations in Nigeria using trial and error parameter adjustments and the General Likelihood Uncertainty Estimation method. Our objectives were to assess the feasibility of this large-scale calibration in the context of a cassava breeding program and to identify systematic biases of the model. For each cultivar we calculated the Pearson correlation between model prediction and observation across the environments, as well as root mean squared error and d statistics. As a result of calibration, the correlation coefficient increased from –0.03 to +0.08, the RMSE dropped from 21 t ha-1 to 5 t ha-1 while d increased from 0.23 to 0.44. We found that the model underestimated root yield in dry environments (low precipitation and high temperature) and overestimated root yield in wet environments (high precipitation and low temperature). Our experience suggests both that CGM calibration could become a routine component of the cassava breeding data analysis cycle and that there are opportunities for model improvement.
Introduction
Crop Growth Models (CGMs) now have a long history in the study of agronomic management practices (1) (Jones et al., 2003; Holzworth et al., 2018; Jones et al., 2017; Tsuji et al., 1998). CGMs simulate plant phenology and growth by integrating ecophysiological functions with weather data, usually on a daily time step, tracking photosynthesis and partitioning as well as resource extraction from the environment. Once calibrated, a CGM can be used to study such questions as planting dates or fertilizer application (Banterng et al., 2009), conservation tillage (Corbeels et al., 2016), crop rotations (Mohanty et al., 2012), or even intercropping (Berghuijs et al., 2021). An advantage of studying such issues with a model is that the system response can be predicted over a long timeframe or for many different location conditions representing a breadth of environments that would be impossible to evaluate empirically. Even for well-calibrated models, results cannot be taken at face value but should suggest the important factors to explore with field experiments.
Expanding the use of CGMs from agronomic questions to plant breeding has been more recent (Baenziger et al., 2004; Technow et al., 2015; Messina et al., 2018; White and Hoogenboom, 1996). Plant breeders base selection decisions on predictions of future performance of breeding lines, whether those predictions come from prior observations of the lines themselves or of close relatives whose genetic relatedness can be inferred from DNA variants. In either case, the predictions cannot be extrapolated to environments or management practices that have not been observed. Thus, a motivation to parameterize CGMs for breeding lines would be to enable predictions in environments where they have not been tested. Applications of such predictions could be estimating the stability of a line’s performance by modeling outcomes over many years or locations (for example, aiding such efforts as Lozada and Carter 2020; Olivoto et al., 2021), identifying specific environments that may be most promising for specific breeding lines (as Chai et al., 2022 have suggested occurs in modern breeding, for example, Phoncharoen, et al., 2021a) or for identifying optimal locations for a multi-location trial evaluation network (Putto et al., 2008). In the context of climate change, the promise of CGMs is to enable performance prediction under future weather scenarios to recommend “climate-smart” varieties. These models may therefore contribute to the development of efficient breeding strategies through decision support in breeding programs. A number of these applications are reviewed in the context of breeding to adapt to climate change in (Ramirez-Villegas et al., 2020).
As for many developments in breeding, attempts to integrate CGMs into breeding have been spurred on by the plummeting cost of DNA markers that has made it possible to associate traits with alleles or haplotypes at specific genomic segments. A challenge to the use of CGMs in breeding is the effort and cost to estimate many CGM parameters for individual breeding lines, as exemplified by early efforts (e.g., White and Hoogenboom, 1996; Hoogenboom and White, 2003). In particular, CGMs require many parameters that are not routinely measured in breeding programs so that they come at an extra cost to the program, which is prohibitive. A work around to this problem is to seek to estimate parameter values from data that are routinely collected in breeding programs. Estimation procedures essentially fit parameter values so that CGM predictions match as closely as possible to observed breeding data. One such method is General Likelihood Uncertainty Estimation (GLUE) that has been incorporated in the Decision Support System for Agrotechnology Transfer (DSSAT) (He et al., 2010; Hoogenboom et al., 2021; Berton Ferreira et al., 2024). To extend this methodology to cassava (Manihot esculenta Crantz), the initial step involves the development of a robust growth model.
The worldwide importance of cassava is undeniable, especially in sub-Saharan Africa where cassava roots are a valuable source of calories (Adebayo, 2023; Chiaka et al., 2022). Numerous efforts to improve cassava breeding to deliver greater genetic gain are underway. Such efforts span a wide range of technologies from on farm testing (Nanyonjo et al., 2024), to genomic selection (Wolfe et al., 2016; Ceballos et al., 2021), to developing cassava hybrids (Zhang et al., 2024). Integrating crop growth models into these efforts could prove beneficial for optimizing field evaluations, matching variety candidates to target environments, as well as for experimenting with new management options. A number of cassava CGMs have been reviewed (Phoncharoen et al., 2021b; Moreno-Cadena et al., 2021). An advantage of the CSM-MANIHOT-Cassava model described there is that is has been integrated within the DSSAT Crop Modeling Ecosystem in DSSAT v. 4.8 (Hoogenboom et al., 2019) for which there is also an R interface (Alderman, 2020). The DSSAT system in turn incorporates model calibration functionality, including GLUE.
The MANIHOT-Cassava model was developed from the CSM-CROPSIM model, itself a development of the GUMCAS model (Moreno-Cadena et al., 2020). While (Moreno-Cadena et al., 2020) only verbally describes the model updates, comprehensive details including equations are available in (Moreno-Cadena, 2021). Plant development is a function of accumulated thermal time with different cardinal temperatures for the processes of forking and leaf growth. Daily photosynthetic assimilates are obtained by multiplying the solar radiation intercepted by the photosynthetically active radiation use efficiency (PARUE). The model uses a spill-over strategy for biomass allocation: assimilates are allocated by prioritizing the growth of aboveground biomass and fibrous roots according to their demand and the remainder is assigned to the storage roots. Drought stress dynamics are influenced by soil moisture content, impacting germination, branching, leaf appearance, and photosynthesis. Nitrogen limitations are also factored in, though they necessitate further refinement. The current model version requires an array of parameters at the species, ecotype, and cultivar levels, which define stress thresholds, senescence processes, Vapor Pressure Deficit (VPD) effects, root water uptake, and other physiological traits like leaf morphology and growth characteristics. As a DSSAT crop growth model, MANIHOT-Cassava requires minimum data sets including daily weather data, soil surface and profile characteristics, crop management, initial conditions, and crop measurements (Hoogenboom et al., 2012, 2019).
The model performed well in simulating cassava development and growth, including forking dates, leaves and stem biomass, and the yield and yield components (Moreno-Cadena et al., 2020; Phoncharoen, et al., 2021b). A previous study assessed the MANIHOT-Cassava model as a breeding tool for identifying the most suitable production environments for potential varieties in Thailand (Phoncharoen, et al., 2021b). Nevertheless, any model is a work in progress with the potential to better track responses of many growth processes to external conditions and attempting to use more parsimonious parameterizations. Our objectives were to 1) assess the feasibility of calibrating the model based on observations made by the International Institute of Tropical Agriculture (IITA) breeding program using GLUE and across a range of elite Nigerian cassava germplasm and growing environments and 2) to identify any systematic biases in model predictions after that calibration, to suggest further improvements to the model. The experimental data included 67 clones evaluated across 11 locations in three years, thus representing a large dataset for model assessment. This report presents the first calibration of the CSM-MANIHOT-Cassava model in Nigeria, a country with a wide range of environmental conditions for cassava cultivation.
Materials and methods
Model calibration
Experimental design and field management
Phenotypic data were collected for 67 clones from the Uniform Yield Trials of IITA, which are the most advanced evaluation stage of the cassava breeding program prior to variety release decisions. The clones were derived from crosses within IITA’s elite population as part of the genomic recurrent breeding program. They underwent four stages of screening for diseases, vigor, agronomic, and quality traits in earlier trials of the breeding program, showing resistance to diseases and potential for high yield and dry matter content. To standardize the model terminology, we call clone a “cultivar”.
The trials were targeted across 11 IITA testing stations in Nigeria (Figure 1) over three growing seasons (2017-2018, 2018-2019, and 2019-2020). We considered each year by location combination to represent one environment. The fields selected for the trials had been fallow, with no crops grown in the previous three years. Before planting, the selected field were mowed, ploughed, harrowed, and ridged. The planting date varied from April to August depending on the rainy season in each location. Cultivars were planted in a randomized complete block design with 3 replications. Each plot consisted of 6 rows of 5.6 m (seven plants) with an inter-row spacing of 1 m and intra-row spacing of 0.8 m (plot size = 28 m2). The trials were managed without pesticides, irrigation, or fertilizer.

Figure 1. Map of Nigeria showing the locations of multi-environment trials in the IITA cassava breeding program. The agrometeorological zones and the number of tests carried out specified by the colors and the size of the point respectively.
Preliminary analysis of each trial was conducted to determine the relative strength of the genetic signal its data. The analysis was done as described by (Bakare et al., 2022) who worked on the same data. The model formula was:
Where is the phenotype, either fresh storage root weight or fresh above ground, is the trial mean, is the proportion of plants harvested per plot, fitted as a continuous fixed effect, and is a regression coefficient for that factor, is the replication effect, fitted as random, is the cultivar effect, fitted as random, and is the residual. The model was fitted using the lmer function from the lme4 package in R 4.1.0 (R Core Team 2023) using the statement:
The reliability of the model was extracted using the r.squaredGLMM function from MuMIn package in R. The broad-sense heritability and the reliability for both traits (fresh storage root weight or fresh above ground biomass) were calculated by trial. Trials were filtered to retain those with H2 over 0.25 and reliability over 40% for at least one trait (Supplementary Figure S1). A total of 28 trials out of 38 trials were retained after filtering, including eight out of the original 11 locations and all three years. These trials spanned 16 environments.
Soil and weather
For each location, specific soil and weather data were used. Soil profile characteristics were obtained from the Harvard Dataverse soils data Repository for crops (Han et al., 2019) according to the GPS coordinates for each location. The soil ID code and properties are given in Supplementary Table S1. The considered parameters included: Soil Lower limit of plant extractable soil water (SLLL), Soil Drained upper limit (SDUL), Saturated upper limit (SSAT), Soil root growth factor (SRGF), Saturated hydraulic conductivity (SSKS), Soil bulk density (SBDM g/cm3), Soil organic carbon (SLOC), Soil clay (SLCL), Soil silt (SLSI), Soil total nitrogen (SLNI), Soil pH in water (SLHW) and Soil cation exchange capacity (SCEC).
Daily weather data were collected from the National Aeronautics and Space Administration Prediction of Worldwide Energy Resource project (https://power.larc.nasa.gov/data-access-viewer/). For the eight location retained after filtering we summarized minimum temperature (TMIN), maximum temperature (TMAX), precipitation (RAIN), relative humidity (RH2M), solar radiation (SRAD) and wind speed (WIND) (Table 1).
The DSSAT Weatherman tool was used to create input weather data files in the model format (Hoogenboom et al., 2019). The initial soil water conditions for each layer were defined by setting available soil water at 100%, the incorporation depth for surface residue at 20 cm and the simulation was done under non-nitrogen-limited and non-phosphorus-limited production conditions.
Plant phenology
Plant phenological and leaf area index (LAI) data were collected only Ibadan in 2021. Five plants per plot, with two plots per cultivar, were assessed at 5 and 6 months after planting as follows. From the first branching level to the top of the plant, one branch was selected to be monitored at each branching point. Plant height at first branching, full plant height (from the bottom to the last leaf), length, and the number of internodes of each section between 2 forking points were measured. Three weeks later, the height and the number of internodes of new growth were measured. From these measurements, the phyllochron or time for node development was calculated. The forking time was estimated as a function of the section height and internode number. In the event of a genotype not showing forks during data collection, the default values provided by the model were used (Moreno-Cadena et al., 2020). The details of these default values can be found at: https://github.com/DSSAT/dssat-csm-os/tree/develop/Data/Genotype.
Leaf area index
We established a predictor model to estimate leaf area. We collected fully expanded, photosynthetically active, and undamaged leaves from the top, middle, and bottom of a plant not used for phenology. Thirty leaves were sampled per cultivar. The number of lobes, length, and width of the central lobe were measured for each leaf. The leaf was scanned to calculate its area with ImageJ software version 1.54k bundled with Java 8 on 2/24/2025 4:06:00 PM (Schneider et al., 2012). With these data, we established a model (Equation 1) that used the length and width of the central lobe and the number of lobes to estimate the leaf area of cassava (Figure 2).

Figure 2. Linear regression between the product of leaf lobe length, leaf lobe width, leaf lobes number width and leaf area for 67 cassava cultivars.
From the targeted plant (5 plants per plot), three leaves per plant (from top to bottom) were evaluated by the number of lobes, the length, and the width of the central lobe. The leaf area was estimated with the established equation (Equation 1). Then LAI was calculated as the total number of green leaves per plant multiplied by the mean per plant of leaf area and divided by the area occupied of each plant (Equation 2).
Yield and yield components
Yield and yield components were collected from 2017 to 2020 in 11 locations (Table 1). At harvest, fresh storage root yield and aboveground biomass (stem + leaves) were weighed separately. A net plot of 20 plants was harvested, excluding all plants on the border of a plot to avoid border effects. The storage root dry weight was calculated as a function of the fresh storage root yield and the dry matter content (Equation 3).
Dry matter content (DMC) in Ibadan was determined from 100 g of fresh roots of each plot grated and oven-dried at 65°C for 72 hours to constant weight. After the samples were cooled, their dry weights were recorded, and their respective DMC values were determined. For the other locations, the specific gravity method (Wholey and Booth, 1979) was used to estimate the DMC per plot. The dry aboveground biomass was calculated as 1/3 of the fresh aboveground biomass one (Streck et al., 2014).
Estimating genotype specific parameters
The DSSAT crop model contains genetic parameters at the levels of species, ecotype, and cultivar (Hoogenboom et al., 2019). For model evaluation, coefficients at the level of cultivars, referred as Genotype-Specific Parameters (GSPs), need to be calibrated. We also adjusted two ecotype coefficients directly involved in yield (Moreno-Cadena et al., 2020; Rankine et al., 2021).
The GSPs define differences among cultivars within a crop species (Hoogenboom et al., 2019). The 15 cultivar and 2 ecotype parameters incorporated within CSM-MANIHOT-Cassava that we calibrated for each cultivar are defined in Table 2. The prior distribution of each GSP was provided by Moreno-Cadena et al. (2020). Crop growth models simulate the processes of plant growth. They typically have phenology stages driven by thermal time. In CSM-MANIHOT-Cassava, stages determine branching. Growth is determined by the processes of sunlight being intercepted and converted to photosynthates which are partitioned to different plant organs: leaves, stems, and roots. In the CSM-MANIHOT-Cassava case, photosynthate demand is determined by node, leaf, and fibrous root growth, after which excess photosynthate goes to storage root growth. A full description of the model is given in Chapter 4 of (Moreno-Cadena, 2021) where a visualization of the model is also given (Figure 4-1 of Moreno-Cadena, 2021).
Two methods were used to estimate GSPs. The first group of GSPs, including the phenological parameters and some growth parameters, were hand-calibrated following the trial and error procedure recommended by Hoogenboom et al. (2019) and used by Phoncharoen et al. (2021b). Simulations were repeated until we could not increase the agreement evaluated by the correlation (r) and the Root Mean Square Error values (RMSE) between the simulated and observed values. That method was adopted because the target variables were measured only in Ibadan 2021. The calibration procedure was as follows:
- Parameters defining plant phenology were calibrated in the respective order of their progress during growth. They included thermal time to first forking (B01ND), second forking (B12ND), third forking (B23ND), and fourth forking (B34ND). The target variables were the time of these events from planting (days after planting).
- The growth parameters measuring the number of branches per fork (BR1FX, BR2FX, BR3FX, and BR4FX) were directly measured. In case of a lack of data, the default value from the model GSPs file was applied.
- Leaf parameters, including the maximum leaf area (LAXS) and leaf life (LLIFA) were calibrated. The target variable was leaf area index (LAI) as computed in Equation 2.
The second step of the calibration was done by the Generalized Likelihood Uncertainty Estimation (GLUE) method. The calibrated parameters were the specific leaf lamina area (SLAS), leaf petiole fraction (LPEFR), a slope for leaf production (LNSLP), node weight (NODWT) and internode length (NODLT). The GLUE method is one of the first methods to represent prediction uncertainty within the context of Monte Carlo analysis coupled with Bayesian estimation and propagation of uncertainty. The technique uses a likelihood function to measure the closeness-of-fit of modeled and observed data (He et al., 2010). The goal was to minimize the RMSE between simulated and observed values. The target variables were the aboveground biomass dry weight. The GLUE method was used because of its flexibility, ease of implementation as an R program, and its suitability for parallel implementation on distributed computer systems. It has previously been used to estimate GSPs for DSSAT crop models (Hoogenboom et al., 2019) such as wheat (Ibrahim et al., 2016; Li et al., 2018), maize (Mereu et al., 2019), and rice (Buddhaboon et al., 2018) as examples. The main steps of the GLUE program as described by Beven and Binley (1992); He et al. (2010), and updated by He et al. (2021), are summarized in Supplementary Methods.
After these two steps, if the storage root dry weight was systematically underestimated or overestimated across observations of a cultivar, the photosynthetically active radiation conversion factor (PARUE) and PAR extinction coefficient (KCAN) that are ecotype parameters were adjusted manually to remove the bias. Then, the GLUE method was run again as defined above to estimate the GSPs using new values of PARUE and KCAN.
Statistical analysis
Model performance analysis
The performance of the calibrated CSM-MANIHOT-Cassava model was analyzed using the correlation (r), the Root Mean Square Error (RMSE, Wallach and Goffinet, 1987), the normalized root mean square error (nRMSE, Loague and Green, 1991) and the index of agreement (d, Willmott, 1982). These four statistical parameters were analyzed separately for storage root dry weight. The parameters r (Equation 4), RMSE (Equation 5) and nRMSE (Equation 6) measure the deviation of the simulated from the observed values. The nRMSE is the RMSE value divided by the observed mean. The simulation is considered excellent with nRMSE< 10%, good in the 10-20% range, fair in the 20-30% range, and poor if nRMSE is > 30% (Harb et al., 2016; Rankine et al., 2021). The index of agreement, d, (Equation 7) was added because it incorporates both bias and variability. The d value varies between 0 and 1. A value of 1 indicates a perfect match, and 0 indicates no agreement at all.
M = number of observations
Si = simulated value
= mean of the simulated values
Oi = observed value
= mean of the observed values
Model sensitivity analysis
The main objective was to identify variables explaining deviations between model predictions and experimental data. The hypothesis was that the differences between the simulated and the observed values could be due either to a bad calibration of the GSPs (leading to a genotype effect on model deviation) or to the fact that some parameters of the environment were not well accounted for in the model (leading to an environment effect on model deviation) or to an interaction between these two causes (leading to a genotype by environment interaction effect on model deviation).
We analyzed deviations using the Additive Main effect and Multiplicative interaction (AMMI) model implemented in the statgenGxE R package, version 4.0.1 (van Rossum et al., 2022). The AMMI model separates additive main effects and multiplicative interaction effects. The model analyses the additive effects with classical analysis of variance (ANOVA) then applies principal component analysis (PCA) to the residual interaction portion of variation (Hongyu et al., 2014; Nowosad et al., 2016; Yan and Rajcan, 2002).
The AMMI model was:
where:
- M was the number of principal components included,
- yij was the deviation between observed versus simulated dry storage root yield for genotype i in environment j. That is, yij = Observed storage root dry weight – simulated storage root dry weight,
- μ was the general mean,
- Gi was the genotypic effect,
- Ej was the environmental effect,
- γmi was the genotypic score for principal component m and genotype i,
- δmj was the environment loading for principal component m and environment j,
- ϵij was the residual.
For example, if after calibration, observed yields of a genotype across environments are consistently higher than those simulated by the CSM-MANIHOT-Cassava model, Gi will be positive. Likewise, if observed yields across genotypes in an environment are consistently higher than those simulated by CSM-MANIHOT-Cassava, Ej will be positive. Conditions for γmi and δmj to be non-zero would require that certain aspects of the genotype interact with certain aspects of the environment to cause a bias in CSM-MANIHOT-Cassava simulations. Explanatory variables considered for these analyses included weather data, soil data, and additional phenotypic data, encompassing diseases such as Cassava Bacterial Blight incidence and severity (CB_), Cassava Green Mite severity (CGM), Cassava Mosaic Disease incidence and severity (CMO), as well as leaf area (LeafArea) and plant morphology, such as Plant forking number (branchP), Stem number (stem_nu), Stem diameter (stem_D), and Plant height (plant_H). The methods used for obtaining additional data were consistent with those described in Rabbi et al. (2014); Rabbi et al. (2017), and Wolfe et al. (2016).
Results
Variation among optimized GSPs
The cultivars presented variation in the optimized parameters controlling phenological events and growth (Table 3). For the phenology, the first forking (B01ND) time presented the highest coefficient of variation (CV = 62%). The second, the third, and the fourth forking time showed substantially lower CVs. The number of branches for the first forking was between 2 and 4, and some cultivars had 4 branches in the second branching. Parameter variation was observed at both the leaf and internode. The leaf production and functioning parameters, such as LAXS and SLAS presented CV values of 19% and 13%, respectively. Similarly, the internode weight (NODWT) and length (NODLT) showed CVs of 32% and 27%, respectively, which were the highest CVs among the GLUE-optimized parameters. Finally, among the parameters affecting photosynthetic activity, PARUE and KCAN had 21% and 15% CV values, respectively.

Table 3. Maximum (Max), minimum (Min), mean and coefficient of variation (CV) values of genotype-specific parameters calibrated for the 67 cassava cultivars in the MANIHOT-Cassava model.
Model performance
Performance by cultivar
The analysis included the 67 cultivars evaluated across 16 environments. For each cultivar we calculated the Pearson correlation r of model prediction and observation across the environments, as well as the RMSE and d statistic. The GLUE estimated GSPs improved the accuracy of the dry storage root yield simulated by the model (Figure 3). The correlation coefficient increased from –0.03 to +0.08, the RMSE dropped from 21 t ha-1 to 5 t ha-1 while d increased from 0.23 to 0.44. All these changes were highly significant using a paired t-test.

Figure 3. Comparison of the Pearson correlation coefficient (r), the root mean squared error (RMSE), and the index of agreement (d) values for the MANIHOT-Cassava predicted storage root dry weight before and after using General Likelihood Uncertainty Estimation (GLUE) method for Genotype-Specific Parameters (GSPs) estimation. Each point represents one cultivar.
Performance by environment
The r, RMSE, and d statistics could also be calculated across cultivars within environments. In this per-environment analysis, we also observed reasonable agreement between the simulated and the observed storage root dry weight. A total of 69% of the environments had an nRMSE ≤ 20% (considered excellent to good). Some environments such as Ikenne17 and Mokwa18 were very well simulated with nRMSE values of 9% and 10%, RMSE of 2.2 and 2.7 tha-1 and d values of 0.74 and 0.63 respectively (Table 4). Equally, substantial deviation of the model was observed in some environments such as Ikenne18 and Onne18 (nRMSE ≥ 30%). Given this observation, statistical analyses of simulation results were performed to determine the causes of the model deviations.

Table 4. Agreement between simulated (Sim) and observed (Obs) mean storage root dry weight of the cultivars across environments for MANIHOT-Cassava calibration.
Genotype and environment effects on model prediction error
The objective of the AMMI analysis was to identify the possible causes that could be ascribed to cultivars or environments of poor model performance in predicting root yield. The analysis was applied to the difference between observed and simulated storage root dry weight (observed – simulated = “O-S”). The AMMI model evaluated 66 genotypes across 16 environments. One genotype out of the 67 was not included because it had more than 10% missing data. The ANOVA analysis revealed a significant effect only for the environment explaining model error (Table 5). The effect of the genotype and environment interaction was also highly significant (p< 0.001). The first three interaction principal components (PCs) explained 50% of the GEI variation.

Table 5. Analysis of variance for the model storage root dry weight (t ha-1) deviation (observed – simulated) using the AMMI model.
The biplot of model prediction errors indicated that the environment effects were strongest at Ago18 and Mokwa17 whose effects were opposite on the component of GEI explained by PC1 (Figure 4).

Figure 4. Biplot of genotypes and environment interaction with PC1 and PC2. Genotypes are black points, and environments are red arrows.
Effects of environment characteristics on model prediction error
To identify characteristics of environments that affected model accuracy, we correlated weather and soil profile data with the AMMI analysis output including the mean value of model deviations (i.e., “O-S”) by environment (envMean) and the environment loading values on the three PCs.
Many input weather parameters affected the model performance (Figure 5). Indeed, the solar radiation, maximum temperature, wind, and daily photosynthetic radiation were positively correlated with the deviations while rainfall and relative humidity were negatively correlated (p-values< 0.01). Root yields in dry (low rain) environments marked by high temperatures and high solar radiation were underestimated by the model, while yields in wet (high rain) environments with low temperatures and low solar radiation were overestimated (Figure 6). These weather parameters affected estimation more in the first six months of plant growth: parameters at 6 months after planting (parameter abbreviation with “_6”) had the strongest correlations.

Figure 5. Correlation between the mean value of the model deviation (“Observed – Simulated”) by environment (envMean), the environment loading values on PC1, PC2 and PC3, and weather parameters The suffix number (“_x”) defines the months after planting, e.g., TMAX_6 is the maximum temperature at 6 MAP. ELEV, location altitude (m); LAT, location latitude; LONG, location longitude; RAIN, rainfall from planning to harvest; SRAD, solar radiation; TMIN, minimum daily temperature; TMAX, daily maximum temperature; RH2M, relative humidity; WIND, average wind speed at 2m; PAR, Photosynthetically active radiation.

Figure 6. Scatterplot of the mean model deviation (“Observed – Simulated”) against the annual total rain by environment (RAIN) and the annual mean maximum temperature by environment (TMAX).
Subsoil characteristics at 15 and 100 cm were strongly correlated, and we only present results at 15cm. Only the soil organic carbon (SLOC), significatively affected MANIHOT-Cassava performance showing a negative correlation with the model deviations (Figure 7).

Figure 7. Correlation between the mean value of the model deviation (“Observed – Simulated”) by environment (envMean), the environment loading values on PC1, PC2 and PC3, and soil profile parameters. Soil parameters abbreviations are SLLL, Lower limit of plant extractable soil water (cm3); SDUL, Drained upper limit (cm3); SSAT, Upper limit; saturated (cm3); SSKS, Saturated hydraulic conductivity (cm/h); SBDM, Bulk density (g/cm3); SLOC, Organic carbon (%), SLCL, Clay (%); SLSI, Silt (%); SLHW, pH in water; SCEC, Cation exchange capacity (cmol/kg).
Discussion
Variation in cultivar GSPs
Crop Growth Models are usually calibrated in non-stress environments (Hoogenboom et al., 2012). These environments enable the expression of the cultivar’s inherent growth and potential without confounding them with stress factors (Anothai et al., 2008). Our objective here, however, was to calibrate with breeding trial data to avoid requiring the breeding program to allocate resources to model calibration experiments. We assume that the lack of ideal environments can be compensated for by having data from many environments.
Across the 15 GSPs, we used three calibration approaches: Trial & Error, GLUE, and Yield Matching (Table 3). For the Trial & Error calibration we only used data from the Ibadan 2021 trial for which we had estimates of leaf area and of branch phenology and number of forks. These parameters were fixed in part because we found that the GLUE approach could not handle more than the five GSPs that were calibrated with it. Because GLUE samples from the prior distribution of parameters it is not super-efficient at narrowing in on values with maximum posterior density (He et al., 2010). Recent efforts have been made to parallelize GLUE (Berton Ferreira et al., 2024) to make it more computationally feasible. Finally, we set PARUE and KCAN to come close to matching the average yield of the cultivars. This approach was chosen to reduce the impact of genotype main effects on model calibration and instead capture sources of genotype by environment interaction. The approach is not unlike post hoc adjustments for harvest index to match harvested yield to model biomass simulation (as might have been done, for example, in Wellens et al., 2022). One consequence of that choice is that model deviations were barely affected by genotype (Table 5) so that we were not able to explore effectively genotype characteristics that might affect model performance. Adjusting to match genotype main effect, however, arithmetically could not affect the correlation for a genotype between simulation and observation across environments. Thus, the model improvement in that correlation from a median across cultivars of -0.03 for the default parameters to 0.08 was due to calibration of the other parameters. While the optimized correlation is not high, the improvement was statistically significant (Figure 3).
Across calibration approaches, we found evidence of variability between the IITA cultivars. Concerning plant phenology, the CV among cultivars was 62% for thermal time to first branching. This parameter establishes the difference between early- and late-branching genotypes (Moreno-Cadena et al., 2020). Similar results were obtained on the elite cassava cultivars from the National Crops Resource Research Institute breeding program of Uganda (Ibrahim et al., 2020) using standard statistical analysis. For Thai genotypes, Phoncharoen et al. (2021b) obtained a very high variation 97%, unlike the Jamaica genotypes where Rankine et al. (2021) found a CV of only 10%. Branching is a significant event in the cassava’s life cycle. Phenology not only determines the rate of production of leaves and the architecture of the plant but also accounts for the plant’s ability to flower, because an inflorescence is always coupled with branching (Halsey et al., 2008; Jennings and Iglesias, 2001). Thus, the level and timing of branching influence flower availability and fertility (Ibrahim et al., 2020). The B12ND, B23ND and B34ND parameters and their corresponding number of branches (BRnFX) could be interpreted because not all the cultivars showed a fork when the data was collected between 5 and 6 months after planting. For those cultivars, the model default values were used (Moreno-Cadena et al., 2020).
There was also variation among the cultivars for the leaf parameters. These results indicate that the germplasm of IITA was characterized by high variability of leaf production, growth, as well as leaf functioning as shown in the PARUE and KCAN parameters. The importance of getting leaf parameters well calibrated is that the MANIHOT-Cassava model uses a surplus of assimilate approach to determine when to allocate to storage roots. We were only able to measure LAI in one environment. More data on cassava LAI across studies might improve model performance. At least two different data collection times during the growing season are needed to calibrate the genetic parameters (Banterng et al., 2009; Phoncharoen al. 2021b; Rankine et al., 2021; Suriharn et al., 2007).
The individual node weight (NODWT) showed a high variation across the target germplasm followed by node length (NODLT) and leaf production slope (LNSLP). According to Moreno-Cadena et al. (2020), NODWT and LNSLP are both essential GSPs affecting plant total biomass with the latter being particularly important in warm environments. The NODWT parameter is important because it affects aboveground photosynthate demand which must be satisfied before the remaining assimilates are allocated to the storage roots. A high value for NODWT can lead to a decrease in storage root dry weight (Moreno-Cadena et al., 2021; Phoncharoen et al., 2021b).
Comparison to other model calibrations
Model calibration across 67 cultivars with final fine calibration using the GLUE method improved the prediction of the storage root dry weight. Strong model improvements for the RMSE and d statistics were no doubt partially due to the yield matching approach we used to calibrate PARUE and KCAN. The improvement in r that we observed was due to other GSP adjustments, however. Similar methodology was successfully used to calibrate the CSM-CERES-Wheat and CSM-CERES-Maize models for winter wheat and maize grain yield simulation, achieving RMSE of 0.27 and 1.57 t ha-1, respectively, and d values of 0.98 and 0.85, respectively (Ibrahim et al., 2016; Mereu et al., 2019). However, RMSE values of 2.19 and 3.72 t ha-1 were obtained for durum wheat and common wheat grain yield simulation, respectively, which are similar to the results we obtained in some environments (Mereu et al., 2019). The cassava storage root yield for six locations in Nigeria was simulated with an RMSE of 4.9 t dry matter ha−1 using the LINTUL-Cassava model (Adiele et al., 2021). The nRMSE values obtained by environment were similar to those reported by Phoncharoen et al. in two diferrent studies (Phoncharoen et al., 2021a; b). Finally, Wellens et al. (2022) achieved similar normalized RMSE in the range of 11% to 33%.
Finally, our model fit statistics could have been affected by the fact that we did not have explicitly measured local soil and weather data. Lack of accurate local soil profile data and specific topsoil chemical analysis data could lead to the model prediction errors. To assess this possibility further, we calibrated the model under both nitrogen-limited and non-nitrogen-limited conditions; phosphorus not being an important limiting factor in the CSM-MANIHOT-Cassava model (Moreno-Cadena et al., 2021). Model accuracy was better under non-nitrogen-limited conditions. The weather data we used came from NASA sources; a local weather station would have been preferred. In future research we will also seek to use Climate Hazards group Infrared Precipitation with Stations rainfall data (Funk et al., 2015).
Model variables to investigate further
To our knowledge no cassava study has separately calibrated a model for as many cultivars as we have here. We believe that the extensive set of cultivars we calibrated lends robustness to our conclusions on model performance. The analysis of model prediction errors showed that root yield prediction error was linked to the input weather parameters SRAD, TMAX, wind, rain, and RH2M (Figure 5). The fact that genotypes clustered together when plotted by environments also reflects the importance of environment on model prediction error (Figure 6). The correlation between weather parameters and model prediction error could be used to classify the environments and identify physiological relationships to be improved in the model.
The underestimated environments were characterized by low rain resulting in low relative humidity. The model integrates rainfall as plant extractable soil water by subtracting surface runoff, drainage, and soil evaporation (Hoogenboom et al., 2019). As root water uptake decreases, stomatal conductance decreases through stomatal closure, which reduces carbon assimilation and transpiration as well as nutrient absorption (Moreno-Cadena et al., 2020). In addition, prior studies with the model showed that under warm temperatures water availability could be a limiting factor for cassava root yield (Moreno-Cadena et al., 2021; Phoncharoen et al., 2021a; Rankine et al., 2021). Cassava stomata are sensitive to vapor pressure deficit (VPD) and reduce gas exchange more than other crops, which can limit photosynthesis while also saving water (Oguntunde and Alatise, 2007; Vongcharoen et al., 2018). The model currently incorporates a drought stress factor that integrates a VPD factor for photosynthesis. Our calibration study suggests that this stress factor might need to be adjusted. The improved cultivars from the IITA breeding program are selected for high performance including in water stress environments in the north of Nigeria. These cultivars could therefore be more tolerant than assumed by the model. A drought condition assumed by the model could still allow normal root growth for these improved genotypes. At the same time, increased rainfall led to higher predicted storage root dry weight but did not affect the observed values. The model requires defining thresholds for water and nitrogen stress, leaf and root senescence, and root water uptake (Moreno-Cadena, 2021). These are currently species coefficients that were not modified by our study. Modifying these settings could improve model performance in more extreme wet or dry environments.
Finally, the IITA improved cultivars might not follow the photosynthetic assimilate allocation strategy of CSM-MANIHOT-Cassava model which feeds the storage root growth with the remaining assimilate after having satisfied the aboveground sink. The spill over model is elegant and has scientific support (Cock and Connor, 2021) but is not a consensus (e.g., Gray, 2000). Several studies showed that improved cassava cultivars preferentially allocate photosynthetic assimilates to storage roots after their initiation (Lahai et al., 1999; Turyagyenda et al., 2013). The partitioning of the assimilate and the dynamics of the source-sink carbon allocation during the life cycle of the plant are important phenomena to understand to better simulate cassava’s growth. In a study contrasting two cassava varieties, one genotype favored stem growth while the other favored the storage roots once they were initiated (Chiewchankaset et al., 2022). Finally, a given cultivar need not be limited to a single allocation strategy but that, at a stage of its growth and according to its genotype and environmental factors, the cultivar adopts different mixtures of strategies.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.
Author contributions
PO: Conceptualization, Data curation, Formal Analysis, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. SK: Investigation, Methodology, Supervision, Writing – review & editing. IR: Funding acquisition, Investigation, Project administration, Supervision, Writing – review & editing. PM: Conceptualization, Data curation, Formal Analysis, Methodology, Software, Writing – original draft, Writing – review & editing. GH: Writing – review & editing. JJ: Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the Bill & Melinda Gates Foundation through the “Next Generation Cassava Breeding project” (https://www.nextgencassava.org; grant no. INV-007637) managed by Cornell University.
Acknowledgments
The authors thank the Bill & Melinda Gates Foundation for their financial support. Thanks to the staff of the International Institute of Tropical Agriculture (IITA) of the cassava breeding program that assisted for implementing the field experiments and collecting the data.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be constructed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2025.1535058/full#supplementary-material
References
Adebayo, W. G. (2023). Cassava production in africa: A panel analysis of the drivers and trends. Heliyon 9, e19939. doi: 10.1016/j.heliyon.2023.e19939
Adiele, J. G., Schut, A. G. T., Van Den Beuken, R. P. M., Ezui, K. S., Pypers, P., Ano, A. O., et al. (2021). A recalibrated and tested LINTUL-Cassava simulation model provides insight into the high yield potential of cassava under rainfed conditions. Eur. J. Agron. 124, 126242. doi: 10.1016/j.eja.2021.126242
Alderman, P. D.. (2020). A comprehensive R interface for the DSSAT Cropping Systems Model. Comput. Electron. Agric. 172, 105325. doi: 10.1016/j.compag.2020.105325
Anothai, J., Patanothai, A., Jogloy, S., Pannangpetch, K., Boote, K. J., and Hoogenboom, G. (2008). A sequential approach for determining the cultivar coefficients of peanut lines using end-of-season data of crop performance trials. Field Crops Res. 108, 169–178. doi: 10.1016/j.fcr.2008.04.012
Baenziger, P. S., McMaster, G. S., Wilhelm, W. W., Weiss, A., and Hays, C. J. (2004). Putting genes into genetic coefficients.Field Crops Res. 90 (1), 133–143. doi: 10.1016/j.fcr.2004.07.022
Bakare, M. A., Kayondo, S. I., Aghogho, C. I., Wolfe, M. D., Parkes, E. Y., Kulakow, P., et al. (2022). Exploring genotype by environment interaction on cassava yield and yield related traits using classical statistical methods. PloS One 17, e0268189. doi: 10.1371/journal.pone.0268189
Banterng, P., Hoogenboom, G., Patanothai, A., Singh, P., Wani, S. P., Pathak, P., et al. (2009). Application of the cropping system model (CSM)-CROPGRO-soybean for determining optimum management strategies for soybean in tropical environments: optimum management strategies for soybean cropping. J. Agron. Crop Sci. 196, 231–242. doi: 10.1111/j.1439-037X.2009.00408.x
Berghuijs, H. N. C., Weih, M., van der Werf, W., Karley, A. J., Adam, E., et al. (2021). Calibrating and testing APSIM for wheat-faba bean pure cultures and intercrops across Europe. Field Crops Res. 264, 108088. doi: 10.1016/j.fcr.2021.108088
Berton Ferreira, T., Shelia, V., Porter, C., Moreno-Cadena, P., Salmeron Cortasa, M., Sohail Kahn, M., et al. (2024). Enhancing crop model parameter estimation across computing environments: Utilizing the GLUE method and parallel computing for determining genetic coefficients. Comput. Electron. Agric. 227, 109513. doi: 10.1016/j.compag.2024.109513
Beven, K. and Binley, A. (1992). The future of distributed models: Model calibration and uncertainty prediction. Hydrol. Process. 6, 279–298. doi: 10.1002/hyp.3360060305
Buddhaboon, C., Jintrawet, A., and Hoogenboom, G. (2018). Methodology to estimate rice genetic coefficients for the CSM-CERES-Rice model using GENCALC and GLUE genetic coefficient estimators. J. Agric. Sci. (Cambridge) 156, 482–492. doi: 10.1017/S0021859618000527
Ceballos, H., Hershey, C., Iglesias, C., and Zhang, X. (2021). Fifty years of a public cassava breeding program: evolution of breeding objectives, methods, and decision-making processes. Theor. Appl. Genet. 134 (8), 2335–2353. doi: 10.1007/s00122-021-03852-9
Chai, Y., Pardey, P. G., and Silverstein, K. A. T. (2022). Scientific selection: A century of increasing crop varietal diversity in US wheat. Proc. Natl. Acad. Sci. 119, e2210773119. doi: 10.1073/pnas.2210773119
Chiaka, J. C., Zhen, L., Yunfeng, H., Xiao, Y., Muhirwa, F., and Lang, T. (2022). Smallholder farmers contribution to food production in Nigeria. Front. Nutr. 9. doi: 10.3389/fnut.2022.916678
Chiewchankaset, P., Thaiprasit, J., Kalapanulak, S., Wojciechowski, T., Boonjing, P., and Saithong, T. (2022). Effective metabolic carbon utilization and shoot-to-root partitioning modulate distinctive yield in high yielding cassava variety. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.832304
Cock, J. H. and Connor, D. J. (2021). “Cassava,” in Crop Physiology Case Histories for Major Crops. Eds. Sadras, V. O. and Calderini, D. F. (125 London Wall, London: Elsevier), 588–633.
Corbeels, M., Chirat, G., Messad, S., and Thierfelder, C. (2016). Performance and sensitivity of the DSSAT crop growth model in simulating maize yield under conservation agriculture. Europ. J. Agron. 76, 41–53. doi: 10.1016/j.eja.2016.02.001
Funk, C., Peterson, P., Landsfeld, M., Pedreros, D., Verdin, J., Shukla, S., et al. (2015). The climate hazards infrared precipitation with stations–a new environmental record for monitoring extremes. Sci. Data 2, 150066. doi: 10.1038/sdata.2015.66
Gray, V. (2000). A comparison of two approaches for modelling cassava (Manihot esculenta Crantz.) crop growth. Ann. Bot. 85, 77–90. doi: 10.1006/anbo.1999.0999
Halsey, M. E., Olsen, K. M., Taylor, N. J., and Chavarriaga-Aguirre, P. (2008). Reproductive biology of cassava (Manihot esculenta crantz) and isolation of experimental field trials. Crop Sci. 48, 49–58. doi: 10.2135/cropsci2007.05.0279
Han, E., Ines, A. V. M., and Koo, J. (2019). Development of a 10-km resolution global soil profile dataset for crop modeling applications. Environ. Model. Software 119, 70–83. doi: 10.1016/j.envsoft.2019.05.012
Harb, O. M., Abd El Hay, G. H., Hager, M. A., and Abou-El-Enin, M. M. (2016). Calibration and validation of DSSAT V.4.6.1, CERES and CROPGROModels for simulating no-tillage in central delta, Egypt. Agrotechnology 05. doi: 10.4172/2168-9881.1000143
He, J., Jones, J. W., Graham, W. D., and Dukes, M. D. (2010). Influence of likelihood function choice for estimating crop model parameters using the generalized likelihood uncertainty estimation method. Agric. Syst. 103, 256–264. doi: 10.1016/j.agsy.2010.01.006
He, J., Porter, C. H., Wilkens, P. W., and Jones, J. W. (2021). Guidelines for Installing and Running GLUE Program. Available online at: https://dssat.net/ (Accessed May 19, 2025).
Holzworth, D., Huth, N. I., Fainges, J., Brown, H., Zurcher, E., et al. (2018). APSIM Next Generation: Overcoming challenges in modernising a farming systems model. Environ. Model. Softw. 103, 43–51. doi: 10.1016/j.envsoft.2018.02.002
Hongyu, K., García-Peña, M., de Araújo, L. B., and dos Santos Dias, C. T. (2014). Statistical analysis of yield trials by AMMI analysis of genotype × environment interaction. Biometric. Lett. 51, 89–102. doi: 10.2478/bile-2014-0007
Hoogenboom, G., Jones, J. W., Traore, P. C. S., and Boote, K. J. (2012). “Experiments and data for model evaluation and application,” in Improving Soil Fertility Recommendations in Africa using the Decision Support Systems for Agrotechnology Transfers (DSSAT). Eds. Kihara, J., Fatondji, D., Jones, J. W., Hoogenboom, G., Tabo, R., and Bationo, A. (Springer, Dordrecht, the Netherlands), 9–18.
Hoogenboom, G., Porter, C. H., Boote, K. J., Shelia, V., Wilkens, P. W., Singh, U., et al. (2019). “The DSSAT crop modeling ecosystem,” in Advances in Crop Modeling for a Sustainable Agriculture. Ed. Boote, K. (Cambridge, United Kingdom, London: Burleigh Dodds Science Publishing), 173–216. doi: 10.19103/AS.2019.0061.10
Hoogenboom, G., Porter, C. H., Shelia, V., Boote, K. J., Singh, U., White, J. W., et al. (2021). Decision Support System for Agrotechnology Transfer (DSSAT) (4.8) [Computer software] (DSSAT Foundation). Available at: https://dssat.net/ (Accessed May 19, 2025).
Hoogenboom, G. and White, J. W. (2003). Improving physiological assumptions of simulation models by using gene‐based approaches. Agron. J. 95 (1), 82–89. doi: 10.2134/agronj2003.8200
Ibrahim, J., Baguma, Y., Abincha, W., Gibson, P., and Edema, R. (2020). Flowering problems and their possible solution in cassava breeding. J. Sci. Agric. 4, 83–89. doi: 10.25081/jsa.2020.v4.6220
Ibrahim, O. M., Gaafar, A. A., Wali, A. M., Tawfik, M. M., and El-Nahas, M. M. (2016). Estimating cultivar coefficients of a spring wheat using genCalc and GLUE in DSSAT. J. Agron. 15, 130–135. doi: 10.3923/ja.2016.130.135
Jennings, D. L. and Iglesias, C. (2001). “Breeding for crop improvement,” in Cassava: Biology, production and utilization, 1st ed. Eds. Hillocks, R. J. and Thresh, J. M. (Wallingford, UK: CABI Publishing), 149–166. doi: 10.1079/9780851995243.0149
Jones, J. W., Hoogenboom, G., Porter, C. H., Boote, K. J., Batchelor, W. D., et al. (2003). The DSSAT cropping system model. Eur. J. Agron. 18 (3), 235–265. doi: 10.1016/S1161-0301(02)00107-7
Jones, J. W., Antle, J. M., Basso, B., Boote, K. J., Conant, R. T., et al. (2017). Brief history of agricultural systems modeling. Agric. Syst. 155, 240–254. doi: 10.1016/j.agsy.2016.05.014
Lahai, M. T., George, J. B., and Ekanayake, I. J. (1999). Cassava (Manihot esculenta Crantz) Growth Indices, Root Yield and its Components in Upland and Inland Valley Ecologies of Sierra Leone. J. Agron. Crop Sci. 182, 239–248. doi: 10.1046/j.1439-037x.1999.00299.x
Li, Z., He, J., Xu, X., Jin, X., Huang, W., Clark, B., et al. (2018). Estimating genetic parameters of DSSAT-CERES model with the GLUE method for winter wheat (Triticum aestivum L.) production. Comput. Electron. Agric. 154, 213–221. doi: 10.1016/j.compag.2018.09.009
Loague, K. and Green, R. E. (1991). Statistical and graphical methods for evaluating solute transport models: Overview and application. J. Contam. Hydrol. 7, 51–73. doi: 10.1016/0169-7722(91)90038-3
Lozada, D. and Carter, A. (2020). Insights into the genetic architecture of phenotypic stability traits in winter wheat. Agronomy (Basel) 10 (3), 368. doi: 10.3390/agronomy10030368
Mereu, V., Gallo, A., and Spano, D. (2019). Optimizing genetic parameters of CSM-CERES wheat and CSM-CERES maize for durum wheat, common wheat, and maize in Italy. Agronomy 9, 665. doi: 10.3390/agronomy9100665
Messina, C. D., Technow, F., Tang, T., Totir, R., Gho, C., and Cooper, M. (2018). Leveraging biological insight and environmental variation to improve phenotypic prediction: Integrating crop growth models (CGM) with whole genome prediction (WGP). Eur. J. Agron. 100, 151–162. doi: 10.1101/100057
Mohanty, M., Probert, M. E., Reddy, K. S., Dalal, R. C., Mishra, A. K., et al. (2012). Simulating soybean–wheat cropping system: APSIM model parameterization and validation. Agric. Ecosyst. Environ. 152, 68–78. doi: 10.1016/j.agee.2012.02.013
Moreno-Cadena, P. (2021). Modeling the Dynamics of Starch Content in Cassava. Available online at: https://original-ufdc.uflib.ufl.edu/UFE0058245/00001 (Accessed 11 September 2024).
Moreno-Cadena, L. P., Hoogenboom, G., Cock, J. H., Ramirez-Villegas, J., Pypers, P., Kreye, C., et al. (2021). Modeling growth, development and yield of cassava: A review. Field Crops Res. 267, 108140. doi: 10.1016/j.fcr.2021.108140
Moreno-Cadena, L. P., Hoogenboom, G., Fisher, M. J., Ramirez-Villegas, J., Prager, S. D., Becerra-Lopez-Lavalle, L. A., et al. (2020). Importance of genetic parameters and uncertainty of MANIHOT, a new mechanistic cassava simulation model. Eur. J. Agron. 115, 126031. doi: 10.1016/j.eja.2020.126031
Nanyonjo, A. R., Angudubo, S., Iragaba, P., Brown, D., Nuwamanya, E., et al. (2024). On‐farm evaluation of cassava clones using the triadic comparison of technology options approach. Crop Sci. doi: 10.1002/csc2.21293
Nowosad, K., Liersch, A., Popławska, W., and Bocianowski, J. (2016). Genotype by environment interaction for seed yield in rapeseed (Brassica napus L.) using additive main effects and multiplicative interaction model. Euphytica 208, 187–194. doi: 10.1007/s10681-015-1620-z
Oguntunde, P. G. and Alatise, M. O. (2007). Environmental regulation and modelling of cassava canopy conductance under drying root-zone soil water. Meteorol. Appl. 14, 245–252. doi: 10.1002/met.24
Olivoto, T., Nardino, M., Meira, D., Meier, C., Follmann, D. N., et al. (2021). Multi‐trait selection for mean performance and stability in maize. Agron. J. 113 (5), 3968–3974. doi: 10.1002/agj2.20741
Phoncharoen, P., Banterng, P., Vorasoot, N., Jogloy, S., Theerakulpisut, P., and Hoogenboom, G. (2021a). Identifying suitable genotypes for different cassava production environments—A modeling approach. Agronomy 11, 1372. doi: 10.3390/agronomy11071372
Phoncharoen, P., Poramate, B., Moreno-Cadena, L. P., Vorasoot, N., Jogloy, S., Theerakulpisut, P., et al. (2021b). Performance of the CSM–MANIHOT–Cassava model for simulating planting date response of cassava genotypes. Field Crops Res. 264, 108073. doi: 10.1016/j.fcr.2021.108073
Putto, W., Patanothai, A., Jogloy, S., and Hoogenboom, G. (2008). Determination of mega−environments for peanut breeding using the CSM−CROPGRO−Peanut model. Crop Sci. 48, 973–982. doi: 10.2135/cropsci2007.10.0552
Rabbi, I. Y., Hamblin, M. T., Kumar, P. L., Gedil, M. A., Ikpan, A. S., Jannink, J.-L., et al. (2014). High-resolution mapping of resistance to cassava mosaic geminiviruses in cassava using genotyping-by-sequencing and its implications for breeding. Virus Res. 186, 87–96. doi: 10.1016/j.virusres.2013.12.028
Rabbi, I. Y., Udoh, L. I., Wolfe, M., Parkes, E. Y., Gedil, M. A., Dixon, A., et al. (2017). Genome-wide association mapping of correlated traits in cassava: dry matter and total carotenoid content. Plant Genome 10. doi: 10.3835/plantgenome2016.09.0094
Ramirez-Villegas, J., Molero Milan, A., Alexandrov, N., Asseng, S., Challinor, A. J., Crossa, J., et al. (2020). CGIAR modeling approaches for resource-constrained scenarios: I. Accelerating crop breeding for a changing climate. Crop Sci. 60, 547–567. doi: 10.1002/csc2.20048
Rankine, D., Cohen, J., Murray, F., Moreno-Cadena, L. P., Hoogenboom, G., Campbell, J., et al. (2021). Evaluation of DSSAT-MANIHOT-Cassava model to determine potential irrigation benefits for cassava in Jamaica. Agron. J. 113, 5317–5334. doi: 10.1002/agj2.20876
R Core Team (2023). R: A language and environment for statistical computing (Vienna, Austria; 4.3.0) [Computer software] (R Foundation for Statistical Computing). Available online at: https://www.R-project.org/ (Accessed May 19, 2025).
Schneider, C. A., Rasband, W. S., and Eliceiri, K. W. (2012). NIH Image to ImageJ: 25 years of image analysis. Nat. Methods 9, 671–675. doi: 10.1038/nmeth.2089
Streck, N. A., Pinheiro, D. G., Junior Zanon, A., Gabriel, L. F., Rocha, T. S. M., de Souza, A. T., et al. (2014). Efeito do espaçamento de plantio no crescimento, desenvolvimento e produtividade da mandioca em ambiente subtropical. Bragantia 73, 407–415. doi: 10.1590/1678-4499.0159
Suriharn, B., Patanothai, A., Pannangpetch, K., Jogloy, S., and Hoogenboom, G. (2007). Determination of cultivar coefficients of peanut lines for breeding applications of the CSM-CROPGRO-peanut model. Crop Sci. 47, 607–619. doi: 10.2135/cropsci2006.01.0050
Technow, F., Messina, C. D., Totir, L. R., and Cooper, M. (2015). Integrating Crop Growth Models with Whole Genome Prediction through Approximate Bayesian Computation. PLoS One 10 (6), e0130855. doi: 10.1371/journal.pone.0130855
Tsuji, G. Y., Hoogenboom, G., and Thornton, P. K. (Eds.) (1998). Understanding Options for Agricultural Production Vol. 7 (Dordrecht: Springer Netherlands). doi: 10.1007/978-94-017-3624-4
Turyagyenda, L. F., Kizito, E. B., Ferguson, M., Baguma, Y., Agaba, M., Harvey, J. J. W., et al. (2013). Physiological and molecular characterization of drought responses and identification of candidate tolerance genes in cassava. AoB Plants 5, plt007–plt007. doi: 10.1093/aobpla/plt007
van Rossum, B.-J., van Eeuwijk, F., Boer, M., Malosetti, M., Bustos-Korts, D., Millet, E. J., et al. (2022). Package ‘statgenGxE’ CRAN. R Package Version 4.0.1. Available online at: https://CRAN.R-project.org/package=statgenGxE (Accessed May 19, 2025).
Vongcharoen, K., Santanoo, S., Banterng, P., Jogloy, S., Vorasoot, N., and Theerakulpisut, P. (2018). Seasonal variation in photosynthesis performance of cassava at two different growth stages under irrigated and rain-fed conditions in a tropical savanna climate. Photosynthetica 56, 1398–1413. doi: 10.1007/s11099-018-0849-x
Wallach, D. and Goffinet, B. (1987). Mean squared error of prediction in models for studying ecological and agronomic systems. Biometrics 43, 561. doi: 10.2307/2531995
Wellens, J., Raes, D., Fereres, E., Diels, J., Coppye, C., Adiele, J. G., et al. (2022). Calibration and validation of the FAO AquaCrop water productivity model for cassava (Manihot esculenta Crantz). Agric. Water Manage. 263, 107491. doi: 10.1016/j.agwat.2022.107491
White, J. W. and Hoogenboom, G. (1996). Simulating effects of genes for physiological traits in a process-oriented crop model. Agron. J. 88, 416–422. doi: 10.2134/agronj1996.00021962008800030009x
Wholey, D. W. and Booth, R. H. (1979). A comparison of simple methods for estimating starch content of cassava roots. J. Sci. Food Agric. 30, 158–164. doi: 10.1002/jsfa.2740300210
Willmott, C. J. (1982). Some comments on the evaluation of model performance. Bull. Am. Meteorol. Soc. 63, 1309–1313. doi: 10.1175/1520-0477(1982)063<1309:SCOTEO>2.0.CO;2
Wolfe, M. D., Rabbi, I. Y., Egesi, C., Hamblin, M., Kawuki, R., Kulakow, P., et al. (2016). Genome-wide association and prediction reveals genetic architecture of cassava mosaic disease resistance and prospects for rapid genetic improvement. Plant Genome 9. doi: 10.3835/plantgenome2015.11.0118
Yan, W. and Rajcan, I. (2002). Biplot analysis of test sites and trait relations of soybean in Ontario. Crop Sci. 42, 11–20. doi: 10.2135/cropsci2002.1100
Keywords: cassava, crop growth model, Nigeria, calibration, general likelihood uncertainty estimation
Citation: Okoma PM, Kayondo SI, Rabbi IY, Moreno-Cadena PL, Hoogenboom G and Jannink J-L (2025) Evaluating a cassava crop growth model by optimizing genotype-specifc parameters using multienvironment trial breeding data. Front. Plant Sci. 16:1535058. doi: 10.3389/fpls.2025.1535058
Received: 26 November 2024; Accepted: 26 February 2025;
Published: 13 June 2025.
Edited by:
Leif Skot, Aberystwyth University, United KingdomReviewed by:
João Ricardo Bachega Feijó Rosa, RB Genetics & Statistics Consulting, BrazilHuabing Yan, Guangxi Academy of Agricultural Science, China
Copyright © 2025 Okoma, Kayondo, Rabbi, Moreno-Cadena, Hoogenboom and Jannink. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jean-Luc Jannink, amVhbmx1Yy5qYW5uaW5rQHVzZGEuZ292
†ORCID: Pamelas Michelle Okoma, 0009-0007-9446-3979
Siraj Ismail Kayondo, 0000-0002-3212-5727
Ismail Y. Rabbi, 0000-0001-9966-2941
Patricia L. Moreno Cadena, 0000-0002-8326-6124
Gerrit Hoogenboom, 0000-0002-1555-0537
Jean-Luc Jannink, 0000-0003-4849-628X