Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Plant Sci.

Sec. Plant Bioinformatics

MTMEGPS: an R package for multi-trait and multi-environment genomic and phenomic selection using Deep Learning

Provisionally accepted
Freddy  Mora-PobleteFreddy Mora-Poblete1Javiera  Valenzuela-HerreraJaviera Valenzuela-Herrera2Matías  BalachMatías Balach2Claudio  Martinez-ArayaClaudio Martinez-Araya2Carlos  Ernesto MaldonadoCarlos Ernesto Maldonado2*
  • 1Universidad de Talca Instituto de Ciencias Biologicas, Talca, Chile
  • 2Centro de Genómica y Bioinformática, Facultad de Ciencias, Universidad Mayor, Camino La Pirámide 5750, Huechuraba, Santiago 8580745, Chile, Santiago, Chile

The final, formatted version of the article will be published soon.

Genomic and phenomic selection have transformed modern breeding by enabling data-driven prediction of complex traits. Deep learning (DL) can further enhance predictive ability by capturing nonlinear patterns that classical and Bayesian approaches often fail to represent. However, despite its potential, the adoption of DL in breeding programs remains limited due to its computational demands and the lack of accessible tools for users without extensive programming experience. This study introduces the MTMEGPS (Multi-Trait and Multi-Environment Genomic and Phenomic Selection), an R package that provides a streamlined end-to-end workflow for Uni- and Multi-Trait (UT and MT, respectively) and Uni- and Multi-Environment (UE and ME, respectively) genomic and phenomic prediction. The package supports data preparation, hyperparameter optimization, model training, and DL-based evaluation. To assess its performance, MTMEGPS was applied to the two default datasets included in the package: Maize (genomic data) and Eucalyptus (near-infrared spectroscopy, NIR, data), as well as to an independent publicly available multi-environment validation dataset. Across most scenarios, MTMEGPS showed superior predictive ability compared with all benchmark models, particularly under UT for the internal datasets and MT for the independent multi-environment dataset. Mean squared error (MSE) values were similar across models, all falling within a moderate range. Overall, these results demonstrate the efficiency and practical utility of MTMEGPS for genomic and phenomic selection, even in scenarios where prediction errors remain moderate.

Keywords: deep learning, genomic selection, Spectral information, Molecular markers, Multi-Traits and Multi-Environments, Hyperparameters optimization

Received: 28 Jul 2025; Accepted: 26 Nov 2025.

Copyright: © 2025 Mora-Poblete, Valenzuela-Herrera, Balach, Martinez-Araya and Maldonado. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Carlos Ernesto Maldonado

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.