Your new experience awaits. Try the new design now and help us make it even better

EDITORIAL article

Front. Plant Sci.

Sec. Functional and Applied Plant Genomics

This article is part of the Research TopicMachine Learning for Mining Plant Functional GenesView all 9 articles

Editorial: Machine Learning for Mining Plant Functional Genes

Provisionally accepted
  • 1Northeast Forestry University, Harbin, China
  • 2University of Electronic Science and Technology of China, Chengdu, China
  • 3Cleveland Clinic, Cleveland, United States

The final, formatted version of the article will be published soon.

representations that capture latent regulatory and functional features. Compared with task-specific models, FMs offer enhanced generalization, cross-species transferability, and scalability, making them particularly attractive for plant systems characterized by genomic complexity and limited functional annotations. Providing a systematic overview of this paradigm, Xu et al. present a mini-review synthesizing recent advances in foundation models for plant molecular biology. The review traces the evolution from general DNA language models to plant-specific tools and highlights key challenges unique to plant systems, including polyploidy, repetitive genomes, and sparse experimental annotations. By outlining future directions such as multimodal integration and computational efficiency, this work establishes a conceptual framework for understanding how FMs are redefining computational plant biology and guiding next-generation model development.Building on this FM paradigm, several contributions demonstrate how representation learning can be applied to concrete biological problems. Zhang et al. applied a DNABERT-2-based framework combined with gradient boosting to identify DNA N6-methyladenine modifications in rice, illustrating how foundation models can enhance epigenetic marker detection while mitigating data sparsity. This work exemplifies a broader shift toward pretraining-based strategies in plant genomics, with implications for cross-species prediction and regulatory annotation. Taken together, the contributions in this Research Topic highlight the transformative role of machine learning and foundation models in plant functional genomics. By advancing representation learning, model architecture, interpretability, and multi-omics integration, these studies move the field beyond traditional sequence-based annotation toward predictive, mechanism-aware, and application-oriented frameworks. Continued synergy between computational innovation and experimental validation will be essential for translating these advances into resilient, high-yield crops capable of meeting future agricultural and environmental challenges.

Keywords: Large language models, machine learning, Multi-omics integration, Plant functional genomics, regulatory network inference

Received: 26 Jan 2026; Accepted: 16 Feb 2026.

Copyright: © 2026 Sun, Zou and Dou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Quan Zou
Lijun Dou

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.