ORIGINAL RESEARCH article
Front. Plant Sci.
Sec. Plant Breeding
A bi-stage data-driven process-based model for sorghum breeding and yield prediction: coupling explainable artificial intelligence and crop modeling
Provisionally accepted- 1Oklahoma State University, Stillwater, United States
- 2Iowa State University, Ames, Iowa, United States
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
With the global population explosion, the increasing demand in food supply pushes the development of advanced breeding methods. This study presents a bi-stage data-driven and process-based crop model to provide breeding recommendations based on Genotype x Environment (GxE) effects for sorghum, a vital cereal crop with various plant types, such as Grain (G), Forage (F), Dual Purpose (DP), and Photoperiod-Sensitive (PS). The model combines traditional process-based crop modeling with explainable data-driven methods, which increases the general interpretability and flexibility of the model. The model considers extensive environmental data, including seven years of hourly weather records and soil factors from three research farms in Iowa, together with management practices and parental information from 651 males and 131 females. Additionally, the model predicts the hourly dry weight of sorghum's leaves, stems and grain, and predicts final yield based on management practices. The final combined Relative Root mean squared error reached 16% to 19% across several environmental conditions, which demonstrating the robust predictive capabilities. Besides, the model effectively identified elite hybrids in four distinct sorghum types, which also demonstrated its utility in reducing the need for extensive field trials. Additionally, our analysis of genotype by environment interactions revealed significant variability in performance, which indicates the precise breeding strategies customized for the environmental conditions are important and vital. This research highlights that our explainable hybrid model framework can greatly improve crop modeling and plant breeding, making agriculture more efficient and sustainable.
Keywords: Data-driven, Process-based, Crop modeling, Explainable AI, Neural Network, GxE
Received: 24 Apr 2025; Accepted: 24 Oct 2025.
Copyright: © 2025 Ni, Chang, Kemp, Salas Fernandez and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Maria Salas Fernandez, mgsalas@iastate.edu
Lizhi Wang, lizhi.wang@okstate.edu
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.