
ORIGINAL RESEARCH article

Front. Plant Sci., 03 December 2025

Sec. Sustainable and Intelligent Phytoprotection

Volume 16 - 2025 | https://doi.org/10.3389/fpls.2025.1693095

This article is part of the Research Topic: Smart Sensing in Plant Science: Advancing Plant-Environment Interactions for Sustainable Phytoprotection.

An ensemble heterogeneous transformer model for an effective diagnosis of multiple plant diseases

  • 1Department of Computer Science and Engineering, AVN Institute of Engineering and Technology, Hyderabad, Telangana, India
  • 2Department of Computer Science and Engineering, Koneru Lakshmaiah University, Vijayawada, Andhra Pradesh, India
  • 3Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
  • 4Department of Computer Science and Artificial Intelligence, College of Computing and Information Technology, University of Bisha, Bisha, Saudi Arabia
  • 5Department of Computer Engineering, College of Computer Science, King Khalid University, Abha, Saudi Arabia
  • 6Department of Computer Science and Engineering, Soonchunhyang University, Asan, Republic of Korea

Plant diseases pose a significant challenge to sustainable farming, causing drastic losses in crop quality and quantity. Conventional diagnostic procedures, such as manual examination and single-model deep learning methods, often struggle with overlapping symptoms, fine leaf textures, and environmental variation, resulting in inconsistent performance. To address these issues, this paper presents an ensemble transformer framework that integrates segmentation, classification, and optimization to accurately identify multiple plant diseases. The framework has a two-phase design. In the first phase, U-Net and Swin Transformer V2 localize disease-affected leaf areas with high accuracy, ensuring that the important features are captured. In the second phase, classification is carried out using CoAtNet and its enhanced variant, which combine convolutional feature extraction with transformer-based global context learning. To further improve decision-making, a meta-heuristic fusion strategy based on the Levy Flight Honey Badger Algorithm dynamically weights classifier outputs, enhancing robustness and reducing misclassifications. Model interpretability is provided through GRAD-CAM visualizations, offering clear insights into the regions influencing disease classification. The framework was extensively evaluated on the PlantVillage dataset, which contains 54,305 images across 38 classes. Results demonstrate outstanding performance, with 99.31% accuracy, 99.32% precision, 99.31% recall, 99.32% specificity, and a 99.31% F1-score. The ensemble segmentation approach yields a statistically significant 7.34% improvement over single-method implementations. Moreover, the heterogeneous ensemble model achieves 8.43% and 14.59% gains over homogeneous ensembles and individual models, respectively. The integration of segmentation, hybrid transformer architectures, and meta-heuristic decision fusion delivers a powerful, interpretable, and highly reliable solution for early plant disease detection, offering strong potential for real-world agricultural deployment.

1 Introduction

Agriculture remains a cornerstone of global food security, yet plant diseases cause severe damage to crop yields and food supplies. These diseases, often caused by bacterial, fungal, or viral pathogens, can lead to substantial economic losses if not identified and treated promptly (Ferentinos, 2021; Mohanty et al., 2021; Picon et al., 2021; Too et al., 2021). Traditional methods for detecting plant diseases rely heavily on manual inspection by specialists, a process that is both time-consuming and prone to human error. The lack of advanced diagnostic tools in many agricultural regions further exacerbates this issue, leading to delayed intervention and increased crop losses (Arsenovic et al., 2021). Timely and accurate diagnosis of plant diseases is therefore essential for effective disease management and risk minimization. Nevertheless, traditional image-based models such as Convolutional Neural Networks (CNNs) tend to fail when confronted with complicated leaf patterns, changing lighting conditions, and overlapping disease symptoms (Duhan et al., 2024; Mishra et al., 2025). Moreover, single-model solutions lack robustness and may be unreliable when applied to the large volumes of data found in agriculture (Rangarajan et al., 2021). To address these issues, the integration of modern DL techniques, especially Transformer Neural Networks (TNNs), has been recognized as a promising way to improve disease classification accuracy.

A wide variety of machine learning and DL technologies have been used to diagnose plant diseases. Advanced image processing operations play an imperative role in preprocessing images to improve feature identification and separation. Convolutional Neural Networks (CNNs) have been widely applied for feature extraction because of their ability to capture spatial hierarchies in plant images (Sibiya and Sumbwanyambe, 2021). Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) networks have been used to analyze sequential data, especially for time-series-based disease pattern recognition. Transformer-based self-attention mechanisms, a newer paradigm in deep learning, have brought considerable improvements to plant disease classification. Hybrid CNN-transformer models learn features that exploit both local and global information in plant images (Chen et al., 2021).

Transfer learning approaches have made it possible to adapt pre-trained models to agricultural data, increasing classification accuracy while reducing the need for large training sets. Ensemble learning mechanisms, such as heterogeneous ensembles, have been exploited to improve robustness and generalization by combining various model architectures (Wang et al., 2025). Hyperparameter optimization algorithms, such as genetic algorithms and particle swarm optimization, have been used to tune models toward better performance. Metaheuristic-based decision fusion strategies have also been integrated to improve classification performance by combining predictions from a variety of DL methods.

The fine textures of plant leaves and their subtle disease symptoms make accurate classification even more difficult. Many plant diseases share visual properties and are misidentified when simple, surface-level models are used. Moreover, co-occurring infections create additional detection difficulties, requiring highly complex multi-label classification methods (Li et al., 2021; Zhou and Zhang, 2021). Another issue is the limited availability of high-quality, annotated datasets. The available datasets are often biased in sample counts, with some plant diseases represented by only a few samples compared to others. This imbalance skews model predictions and degrades classification performance. Data augmentation has been attempted in this regard, but with limited success on highly imbalanced datasets. Computational complexity is another major obstacle to using deep learning models in large-scale agricultural applications. The time and resources needed to train deep neural networks are enormous, which puts them out of reach of small-scale farmers, most of whom have minimal access to high-performance devices. Model compression approaches, including pruning and quantization, have been tested to reduce model size and speed up inference, but their effect on accuracy must be managed carefully.

A further deployment difficulty is integrating deep learning models into current agricultural infrastructure. Many farmers and other agricultural specialists are inexperienced with the technical aspects of AI-based systems, which is a barrier to implementing automated disease detection (Liu et al., 2021). Training and educational programs are needed to fill this gap and facilitate the practical adoption of AI-based diagnostic platforms in agriculture.

1.1 Problem formulation

A number of issues constrain the performance of conventional plant disease detection methods. Among the most pressing is the wide variation in plant species, environmental conditions, and disease symptoms, which makes it challenging to create a common disease detection model. Moreover, a significant proportion of available datasets are small, imbalanced, or unrepresentative, producing models that fail to generalize to diverse situations (Khan et al., 2021). The variety of textures and patterns in plant leaf images further complicates the distinction between healthy and diseased areas for deep learning models. These factors usually lead to high false positive and false negative rates and narrow the window for diagnosing diseases early. A second challenge is the computational complexity of training DL methods on large agricultural datasets (Zhang et al., 2021). Training can be lengthy, especially when resource-intensive architectures and extensive hyperparameter tuning are involved. Moreover, decision fusion and optimization become more complicated when an array of models must be integrated into an ensemble framework (Ferentinos, 2018). This calls for research into superior methods such as ensemble learning and meta-heuristic optimization to enhance model performance and prediction accuracy.

1.2 Motivation of the study

The main justification for the present study is the growing need for effective and accurate plant disease diagnostic systems, which play a crucial role in overcoming the challenges posed in agriculture by the abundance of different plant pathologies. Conventional approaches often fail to reach high accuracy in practice because of inconsistent data, complex disease symptomatology, and cumbersome procedures. There is also a need for a powerful tool that can handle large heterogeneous data and provide proper demarcation of diseased areas. A promising avenue for addressing these problems is the introduction of TNNs, with CoAtNet and Swin Transformers among the early examples shown to be effective at extracting both global and local features.

The rationale behind the research is to create an ensemble model that seamlessly combines state-of-the-art image segmentation methodologies and TNNs to improve plant disease detection. The combination of U-Net for segmentation and CoAtNet/Swin Transformers for classification is a strategic choice that draws on the synergistic advantages of both approaches. The objective of this research is to develop a strong, dependable, and scalable solution that can significantly reduce crop losses and improve global food security.

1.3 Contribution to the field

The recommended approach advances plant disease recognition through an ensemble of deep learning models that combines advanced image segmentation with Transformer architectures. The key contributions of the study can be summarized as follows:

1. Incorporating Segmentation and Transformer Architectures: The developed framework follows a two-step design in which U-Net is used together with Swin Transformer V2 for image segmentation, and CoAtNet and its Enhanced variant classify the segmented outputs. This integration makes it easier to accurately locate disease-affected spots before classifying each location into the predefined classes.

2. Heterogeneous Ensemble Learning: The study departs from conventional homogeneous ensemble methods by using different Transformer models, each contributing its own strengths. This heterogeneous ensemble design yields superior generalization and robustness, making it particularly suitable for large and heterogeneous agricultural data.

3. Meta-Heuristic Decision Fusion: The study introduces a new meta-heuristic decision fusion strategy that combines the outputs of the individual ensemble models. This maximizes the final classification accuracy by optimizing the decision process, outperforming single-model and homogeneous ensembling approaches.

4. Experimental Assessment: The suggested approach is tested on the PlantVillage dataset, a collection of 54,305 images spanning 38 classes. As the results demonstrate, the strategy delivers considerable gains in precision, recall, accuracy, and F1-score, reflecting the tool's effectiveness in early detection of plant diseases.

5. Applicability in Practice: The model has been demonstrated to work in large-scale testing and could therefore be implemented in real-world agricultural settings, providing farmers and residents of agrarian areas with a powerful means of early disease detection and prevention, helping to minimize economic losses and enhance crop health.

1.4 Paper organization

The rest of this paper is organized to present a structured overview of the research carried out. Section 2 reviews the available literature on plant disease detection and optimization, forming the knowledge base for the present research. Section 3 details the dataset used in the research and discusses the preprocessing steps, transformer neural network models, and optimization models. Section 4 describes the implementation details, experimental design, and results of the suggested process and compares it with various state-of-the-art methods. Finally, Section 5 summarizes the research findings and outlines directions for future studies and improvement.

2 Literature review

A number of studies have helped develop the field of plant disease recognition through the DL paradigm.

Hemalatha and Jayachandran (2024) proposed the PDLC-ViT model, in which localization and classification tasks are combined in a Multi-Task Learning (MTL) framework. The model, trained on the Plant Village dataset, introduces co-scale, co-attention, and cross-attention layers into the Vision Transformer design. Although it recorded impressive performance indicators, e.g., 99.97 percent accuracy and 99.18 percent Mean Average Precision, its generalizability may be limited by training on a single dataset, which may exclude other agricultural ecosystems and disease types.

Yang et al. (2024) proposed LDI-NET, a network that simultaneously recognizes the plant type, the disease, and the disease's severity level. The model, developed on the AI Challenger 2018 dataset, uses a three-phase mechanism: feature tokenization using CNNs and transformers, a token encoder to learn relationships related to disease severity, and a fused multi-label decoder classifier. The model showed high sensitivity in diagnosing plant type and diseases but struggled with severity ranking, probably because the symptoms were difficult to differentiate.

AgirLeafNet is a hybrid model proposed by Saleem et al. (2024) that uses NASNetMobile for feature extraction and Few-Shot Learning for classification. Although the model achieved outstanding accuracy in identifying diseases in potato, tomato, and mango leaves, the possibility of overfitting is a concern, especially with limited labeled data.

Shafik et al. (2024) presented two models: PDDNet-AE, which leverages an ensemble methodology for classifying plant diseases, and PDDNet-LVE, which applies an ensemble approach to one of the most prevalent plant disease categories: plant chemical diseases. Using the PlantVillage dataset, the work demonstrated high accuracies of 96.74 percent and 97.79 percent. Combining several pre-trained CNNs, however, increases computational complexity and may prevent real-time use.

De Silva and Brown (2023) concentrated on combining CNNs and Vision Transformers with a multi-spectral imaging-based procedure. Although the research performed well compared to its competitors, further tuning of the state-of-the-art approach is required to raise accuracy levels for disease detection.

DLMC-Net is a lightweight CNN for multi-crop plant disease recognition created by Sharma et al. (2023). Evaluation of this model on citrus, cucumber, grape, and tomato data showed varying accuracies, pointing to possible performance loss when the model is applied to complex plant species and different disease expression patterns.

Li et al. (2023) developed PMVT, a vision transformer adapted to mobile devices, which reached significant accuracy on wheat, coffee, and rice datasets. However, the model's performance varies across crops, implying that there is room for improvement in certain disease detection scenarios.

Lye and Ng (2023) suggested an enriched vision transformer to improve plant disease analysis performance by incorporating additional features and increasing stability. The key innovation of this method is that it generalizes well to leaves that are not centrally located in the imagery and remains consistent across changing leaf orientations.

To address the shortcomings of single-teacher knowledge distillation (KD) models, Amirkhani et al. (2021) presented a strong semantic segmentation framework with Multi-Teacher Knowledge Distillation. They trained five different lightweight CNNs on several datasets and used various augmentation approaches to boost the robustness and accuracy of the student model. The proposed process allowed the student model to effectively incorporate information acquired from different sources to increase performance in clean and noisy conditions. Their experiments indicated that the multi-teacher KD method improved results by margins of up to 32.18 percent compared to single-teacher arrangements. However, the approach still faces limitations in scalability and increased computational complexity during training due to the involvement of multiple teachers. Table 1 presents a summary of the existing methods covered by various authors.


Table 1. Summary of related works.

3 Proposed methodology

Figure 1 illustrates the overall architecture of the proposed model. To augment the data and increase the generalizability of the algorithm, various image augmentation methods are used: rotational transformations to handle variation in leaf orientation, horizontal and vertical flipping to reproduce natural variation in plant growth, scaling to reproduce variation in leaf size, and contrast adjustment to handle variation in lighting conditions. These augmentations broaden the data and enhance the model's capacity to identify disease patterns in diverse circumstances.


Figure 1. Entire architecture for the proposed approach.

A hybrid of U-Net and Swin Transformer V2 is used to segment disease-affected regions. U-Net offers the high spatial resolution needed for accurate local segmentation, while Swin Transformer V2 contributes the global contextual information that effectively outlines diseased regions. The segmentation stage separates areas of interest from the background, preparing them for feature extraction and classification.

CoAtNet and Enhanced CoAtNet are used for feature extraction and classification, combining convolutional feature extraction with transformer-based global context learning. This hybrid design captures the intricate hierarchical patterns in the segmented areas and improves the accuracy of disease classification.

To increase robustness and minimize potential biases, a heterogeneous ensemble decision fusion plan is utilized. The ensemble combines the predictions of several transformer-based models, with the Levy Flight Honey Badger Algorithm (LFHBA) dynamically adapting the weights used in the fusion process to maximize classification precision and the overall functionality of the ensemble.

GRAD-CAM visualizations provide model explainability by showing the regions of input images that have the greatest effect on the model's predictions. This improves transparency and lets users understand how classification decisions were made.

The suggested framework is assessed with the help of conventional metrics such as accuracy, precision, recall, and F1-score. These evaluations confirm the reliability and effectiveness of the approach, demonstrating its readiness for practical agricultural deployment. Ultimately, the system can be implemented in real-world conditions, providing farmers with a convenient tool for on-the-fly diagnosis and management of plant diseases, translating theoretical gains into tangible benefits for the agricultural community.

3.1 Materials and methods

This study uses the PlantVillage dataset, a well-known resource in the field of plant disease classification. The data consists of 54,305 images, as shown in Figure 2, in 38 different classes representing different plants and their condition (healthy or diseased). The dataset covers 14 plant species with varied disease patterns, as shown in Figure 3, such as Apple with Apple Scab and Apple Cedar Rust, and Tomato with Tomato Late Blight and Tomato Mosaic Virus. The distribution of images over these classes permits extensive analysis and training of the presented model.


Figure 2. Datasets images utilized for training and testing the proposed approach.


Figure 3. Different plant species distribution.

3.2 Image preprocessing

To ensure the model’s robustness to variations in input data, a preprocessing pipeline is established. This includes resizing images to a consistent 224 × 224 pixels, as shown in Figure 4, and applying a combination of normalization and augmentation procedures, like grayscale conversion, rotation, flipping, and brightness adjustment. These steps are essential in preparing the dataset for the subsequent stages of segmentation and classification.


Figure 4. Dataset details utilized for the recommended approach.
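To make the pipeline concrete, the following is a minimal sketch of the preprocessing and augmentation steps described above, written with PyTorch/torchvision. The specific magnitudes (rotation range, scale range, jitter strengths) and the ImageNet normalization statistics are illustrative assumptions, not the exact settings used in this study.

```python
# Hedged sketch of the resize / normalize / augment pipeline (assumed values).
import torchvision.transforms as T

train_transform = T.Compose([
    T.Resize((224, 224)),                        # consistent input size
    T.RandomRotation(degrees=30),                # leaf-orientation variation
    T.RandomHorizontalFlip(p=0.5),               # natural growth variation
    T.RandomVerticalFlip(p=0.5),
    T.RandomAffine(degrees=0, scale=(0.8, 1.2)), # leaf-size variation
    T.ColorJitter(brightness=0.2, contrast=0.2), # lighting variation
    T.RandomGrayscale(p=0.1),                    # occasional grayscale conversion
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406],      # ImageNet statistics (assumed)
                std=[0.229, 0.224, 0.225]),
])
```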

3.3 Image augmentation

The augmented and processed images are presented in Figure 5, demonstrating the diversity and complexity of the data that the approach must handle. This comprehensive approach to data preparation and model design underscores the research’s commitment to developing an effective and reliable tool for plant disease diagnosis.


Figure 5. Processed and augmented images.

3.4 Image segmentation methodology

The segmentation process plays an important role in isolating disease-affected areas within the images. This study uses a U-Net based architecture, which has been applied effectively in biomedical image processing, as shown in Figure 6. The architecture has an encoder-decoder design with skip connections that retain high-resolution attributes while extracting deep features. The encoder, or contracting path, consists of convolutional layers with ReLU activations and max-pooling operations that successively reduce the spatial dimensions of the input image.


Figure 6. Structure for the U-Nets.

3.4.1 U-Net framework

U-Net is a leading DL methodology for image segmentation tasks, especially in biomedical image analysis. It is an encoder-decoder design built from convolutional neural networks (CNNs) and is characterized by skip connections that maintain spatial resolution while performing deep feature extraction. Figure 6 presents a graphic illustration of this framework.

The contraction path of the U-Net is a sequence of convolutional blocks, followed by a rectified linear unit (ReLU) activation function, and a max-pooling operation as presented in Equation 1. Max-pooling operation (MaxPool) is used to reduce the spatial dimensions of the feature maps as explained by Equation 2.

$F_l = \mathrm{ReLU}(W_l * F_{l-1} + b_l)$    (1)
$F_{pool} = \mathrm{MaxPool}(F_l)$    (2)

The bottleneck layer sits at the bottom of the network and is characterized by a large number of feature maps despite its restricted spatial dimensions. Abstract, high-level features are learned in this layer before the information is sent to the decoder.

The decoder, or the expansive path, employs transposed convolutions to restore the original image resolution, as illustrated in Equation 3. The transposed convolution, often termed deconvolution, enlarges the spatial dimensions of the feature maps (Fl).

$F_l = \mathrm{ConvTranspose}(F_{l-1}, W_l)$    (3)

A notable feature of U-Net is the utilization of skip connections ($F_{skip}$), which concatenate the feature maps from the encoder with their corresponding maps in the decoder, as detailed in Equation 4. This approach equips the decoder with the ability to exploit high-level abstract features alongside low-level spatial information.

$F_{skip} = \mathrm{concat}(F_{encoder}, F_{decoder})$    (4)
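As a concrete illustration of Equations 1-4, the following is a compact PyTorch sketch of a U-Net of this form; the channel widths and network depth are illustrative assumptions rather than the exact configuration trained in this study.

```python
# Minimal U-Net sketch: conv blocks (Eq. 1), max-pooling (Eq. 2),
# transposed convolutions (Eq. 3), and skip concatenations (Eq. 4).
import torch
import torch.nn as nn

def conv_block(c_in, c_out):
    # Eq. 1: convolution followed by ReLU activation
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, 3, padding=1), nn.ReLU(inplace=True),
        nn.Conv2d(c_out, c_out, 3, padding=1), nn.ReLU(inplace=True))

class TinyUNet(nn.Module):
    def __init__(self, n_classes=2):
        super().__init__()
        self.enc1, self.enc2 = conv_block(3, 64), conv_block(64, 128)
        self.pool = nn.MaxPool2d(2)                    # Eq. 2
        self.bottleneck = conv_block(128, 256)
        self.up2 = nn.ConvTranspose2d(256, 128, 2, 2)  # Eq. 3
        self.dec2 = conv_block(256, 128)               # 256 = 128 (skip) + 128 (up)
        self.up1 = nn.ConvTranspose2d(128, 64, 2, 2)
        self.dec1 = conv_block(128, 64)
        self.head = nn.Conv2d(64, n_classes, 1)

    def forward(self, x):
        e1 = self.enc1(x)
        e2 = self.enc2(self.pool(e1))
        b = self.bottleneck(self.pool(e2))
        d2 = self.dec2(torch.cat([e2, self.up2(b)], dim=1))  # Eq. 4: skip concat
        d1 = self.dec1(torch.cat([e1, self.up1(d2)], dim=1))
        return self.head(d1)                                  # per-pixel class logits
```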

3.4.2 Swin Transformer

The Swin Transformer (ST) V2 is an advanced transformer model designed for image classification and segmentation tasks. Unlike traditional CNN-based architectures, Swin Transformer V2 leverages a hierarchical structure of local windows and shifts to better capture both local and global context in the image, as presented in Figure 7. The model improves on the original Swin Transformer in efficiency, scalability, and performance on large-scale vision tasks.


Figure 7. Network structure for SWIN-V2.

The initial step involves patch partitioning ($Z_0$) and linear embedding, where the input image $I$ of size $H \times W$ is divided into non-overlapping patches, as stated in Equation 5. These patches are then projected into a $D$-dimensional linear space via a linear transformation layer.

$Z_0 = \mathrm{Linear}(\mathrm{Flatten}(I))$    (5)

The hierarchical transformer blocks are the core of the model, integrating Window-based Multi-Head Self-Attention (W-MSA) and Shifted Window Multi-Head Self-Attention (SW-MSA) to efficiently capture local and global information. The multi-head self-attention mechanism is outlined in Equation 6, where Q, K, and V denote query, key, and value matrices.

$\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\!\left(\frac{QK^T}{\sqrt{d_k}}\right)V$    (6)

W-MSA, detailed in Equation 7, operates on non-overlapping windows, thereby enhancing computational efficiency by limiting the attention of each token to its windowed counterparts.

$Z_l^{W\text{-}MSA} = \mathrm{W\text{-}MSA}(X_l)$    (7)

SW-MSA, introduced to capture cross-window interactions, is presented in Equation 8. This mechanism involves shifting the window (Shift(Xl1)) positions between layers to facilitate attention across distant patches, thereby enhancing the model’s capacity to understand global context.

$Z_l^{SW\text{-}MSA} = \mathrm{SW\text{-}MSA}(\mathrm{Shift}(X_{l-1}))$    (8)

Each block within Swin Transformer V2 also includes a multi-layer perceptron (MLP) for nonlinear transformations and Layer Normalization (LN) to stabilize training, as expressed in Equations 9 and 10, where $Z_l^{Att}$ is the attention output and $W_1$, $W_2$ are weight matrices.

$Z_l^{MLP} = \mathrm{ReLU}(W_1 Z_l^{Att} + b_1)W_2 + b_2$    (9)
$Z_l^{Norm} = \mathrm{LN}(Z_l^{MLP})$    (10)

Finally, patch merging, a form of downsampling, is performed after the transformer blocks to create hierarchical feature maps. This operation, expressed in Equation 11, merges neighbouring patches to reduce the spatial dimensions and expand the number of channels.

$X_{l+1} = \mathrm{Merging}(X_l)$    (11)

The final output, a global feature representation, is processed through a classification head for image classification tasks.
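To illustrate the windowing machinery behind Equations 7, 8, and 11, the sketch below shows non-overlapping window partitioning, the cyclic shift used by SW-MSA, and patch merging. The window size and feature dimensions are assumptions for demonstration.

```python
# Sketch of Swin-style windowing on a (B, H, W, C) feature map.
import torch

def window_partition(x, ws):
    # Split the feature map into non-overlapping ws x ws windows (for W-MSA, Eq. 7)
    B, H, W, C = x.shape
    x = x.view(B, H // ws, ws, W // ws, ws, C)
    return x.permute(0, 1, 3, 2, 4, 5).reshape(-1, ws * ws, C)

def shift_windows(x, ws):
    # SW-MSA (Eq. 8): cyclic shift so attention can cross window boundaries
    return torch.roll(x, shifts=(-ws // 2, -ws // 2), dims=(1, 2))

def patch_merging(x):
    # Eq. 11: merge each 2x2 neighborhood, halving resolution, giving 4C channels
    x0, x1 = x[:, 0::2, 0::2, :], x[:, 1::2, 0::2, :]
    x2, x3 = x[:, 0::2, 1::2, :], x[:, 1::2, 1::2, :]
    return torch.cat([x0, x1, x2, x3], dim=-1)   # a linear layer would reduce to 2C

feat = torch.randn(1, 56, 56, 96)                # assumed stage-1 feature map
wins = window_partition(shift_windows(feat, ws=7), ws=7)
print(wins.shape)                                # (64, 49, 96): 64 windows, 49 tokens
```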

3.5 Classification and feature integration

3.5.1 CoAtNet: convolution and attention hybrid network

CoAtNet is a DL architecture that effectively integrates the strengths of Convolutional Neural Networks (CNNs) with Transformer-based attention. The hybrid system is arranged hierarchically: the lower stages perform CNN-based local feature extraction, while the higher stages apply self-attention to capture global dependencies, as indicated in Figure 8.


Figure 8. CoAtNet model architecture combining convolution and attention mechanisms.

3.5.1.1 Convolutional feature extraction

Convolution operations in the early layers of CoAtNet extract low-level features such as edges, textures, and color changes in plant leaves, as shown in Equation 12. This ensures that important spatial information is properly captured before the transformer-based layers.

$Y = \sigma(W * X + b)$    (12)

where $X$ is the input image, $W$ represents the convolutional kernel, $b$ is a bias parameter, the convolution operation is represented by the operator $*$, and the activation function is denoted by $\sigma$, which can be ReLU or GELU (Gaussian Error Linear Unit).

3.5.1.2 Self-attention mechanism

The self-attention unit plays a critical role in identifying and weighting the feature locations that are most relevant for differentiating classes. Attention weights are computed from the similarities between feature locations, as illustrated in Equation 13. Through this process, the network learns contextual relations and improves its feature representation.

$Z = \mathrm{Softmax}\!\left(\frac{QK^T}{\sqrt{d_k}}\right)V$    (13)

where $Q$, $K$, and $V$ represent the query, key, and value matrices extracted from the feature maps, and $d_k$ is the dimension of the key matrix. The Softmax function scales the attention scores into the range [0, 1], emphasizing the most relevant features.

With the addition of self-attention mechanisms, CoAtNet improves its ability to detect complex disease patterns distributed across different parts of the leaf surface. This enriched feature representation supports better understanding in plant pathology, aiding the development of precision agriculture and plant health diagnostic systems.

3.5.1.3 Feedforward layer within transformer blocks

After the self-attention phase of the Transformer blocks, a FeedForward neural network (FFN) further refines the derived feature representations. As shown in Equation 14, this step involves two linear transformations with a non-linear activation function in between:

$F(x) = \sigma(W_2(\mathrm{ReLU}(W_1 X + b_1)) + b_2)$    (14)

Here $W_1$ and $W_2$ represent weight matrices, $b_1$ and $b_2$ represent biases, and the activation function is denoted by $\sigma$. This operation boosts the model's ability to discriminate between healthy and diseased plant leaves.

This component plays a significant role in improving the model's discriminative ability for differentiating healthy and pathological plant foliage. Combining the CNN's skill in local feature extraction with the Transformer's capacity to learn global representations makes CoAtNet excel in classification settings, while retaining the computational efficiency that is essential for practical applicability and scalability.
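The conv-then-attention pattern can be sketched as follows: convolutional stages extract local features (Equation 12), after which transformer encoder blocks (self-attention plus FFN, Equations 13-14) model global context. This is a simplified stand-in for CoAtNet with assumed stage sizes, not the exact architecture used here.

```python
# Hedged sketch of a convolution-plus-attention hybrid in PyTorch.
import torch
import torch.nn as nn

class HybridConvAttnSketch(nn.Module):
    def __init__(self, n_classes=38, dim=128):
        super().__init__()
        self.conv_stages = nn.Sequential(            # local features (Eq. 12)
            nn.Conv2d(3, 64, 3, stride=2, padding=1), nn.GELU(),
            nn.Conv2d(64, dim, 3, stride=2, padding=1), nn.GELU())
        layer = nn.TransformerEncoderLayer(          # attention + FFN (Eqs. 13-14)
            d_model=dim, nhead=4, dim_feedforward=4 * dim, batch_first=True)
        self.attn_stages = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(dim, n_classes)

    def forward(self, x):
        f = self.conv_stages(x)                  # (B, dim, H/4, W/4)
        tokens = f.flatten(2).transpose(1, 2)    # (B, N, dim) token sequence
        tokens = self.attn_stages(tokens)
        return self.head(tokens.mean(dim=1))     # global average pool + classifier
```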

3.5.2 Enhanced CoAtNet: multiscale feature aggregation for robust disease identification

To address the multiscale nature of plant disease manifestations, the Enhanced CoAtNet introduces multiscale feature aggregation and an adaptive attention module on top of the original CoAtNet model. Figure 9 details the Multiscale Feature Aggregation Module, offering a pictorial explanation of how features are synthesized and refined across multiple spatial scales, providing a deeper and more detailed approach to representation learning.

Figure 9. Multiscale feature aggregation module architecture: three parallel branches (1×1 convolutions, one followed by successive 3×3 convolutions and another by 5×5 convolutions) whose outputs are merged by addition before further convolutional layers.

3.5.2.1 Multiscale feature fusion

Because plant leaves exhibit lesions of varying sizes and severity, the model must integrate features at multiple scales. Enhanced CoAtNet therefore uses a multiscale feature aggregation approach, characterized by:

$F_{multi} = \sum_{i=1}^{N} \alpha_i F_i$    (15)

where $F_i$ denotes feature maps at differing scales, $\alpha_i$ are learnable scaling coefficients that determine the contribution of each scale, and $N$ is the number of feature scales considered. This process enhances the approach's ability to capture both small-scale lesion patterns and large-scale disease spread, improving the accuracy of lesion classification and segmentation.
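A minimal sketch of Equation 15 is shown below, with the parallel branches of Figure 9 approximated by 1×1, 3×3, and 5×5 convolutions; softmax-normalizing the learnable coefficients is a design assumption to keep the contributions bounded.

```python
# Sketch of multiscale feature aggregation (Eq. 15) with learnable alphas.
import torch
import torch.nn as nn

class MultiscaleFusion(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Conv2d(channels, channels, k, padding=k // 2)
            for k in (1, 3, 5)])                      # F_i at three scales
        self.alpha = nn.Parameter(torch.ones(3))      # learnable coefficients

    def forward(self, x):
        w = torch.softmax(self.alpha, dim=0)          # normalized contributions
        return sum(w[i] * b(x) for i, b in enumerate(self.branches))
```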

3.5.2.2 Adaptive attention module

Enhanced CoAtNet maximizes feature extraction through an adaptive attention mechanism that dynamically balances convolution and self-attention operations:

$A(x) = \lambda_1 \cdot \mathrm{Conv}(x) + \lambda_2 \cdot \mathrm{SelfAttention}(x)$    (16)

Here, $\lambda_1$ and $\lambda_2$ are learnable weight factors that adjust the relative prominence of convolution-based feature extraction ($\mathrm{Conv}(x)$) and self-attention ($\mathrm{SelfAttention}(x)$). This allows the approach to flexibly attend to relevant features, making it more robust against adverse effects such as occlusions, lighting changes, and background noise.
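Equation 16 reduces to two learnable scalars blending a convolutional path and an attention path, as in this minimal sketch; both sub-modules are assumed to be defined elsewhere.

```python
# Sketch of the adaptive convolution/attention mix of Eq. 16.
import torch
import torch.nn as nn

class AdaptiveMix(nn.Module):
    def __init__(self, conv_path: nn.Module, attn_path: nn.Module):
        super().__init__()
        self.conv_path, self.attn_path = conv_path, attn_path
        self.lam = nn.Parameter(torch.tensor([0.5, 0.5]))  # lambda1, lambda2

    def forward(self, x):
        # A(x) = lambda1 * Conv(x) + lambda2 * SelfAttention(x)
        return self.lam[0] * self.conv_path(x) + self.lam[1] * self.attn_path(x)
```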

3.5.2.3 Regularized feature representation to mitigate overfitting

Enhanced CoAtNet incorporates dropout within the transformer blocks to avoid the overfitting that can arise when dealing with large datasets such as Plant Village:

$\hat{Z} = \mathrm{Dropout}(Z)$    (17)

where $Z$ is the attention-weighted feature representation. Batch normalization is also applied:

$F_{norm} = \gamma\left(\frac{F - \mu}{\sigma}\right) + \beta$    (18)

Here, $\mu$ and $\sigma$ represent the mean and standard deviation of the feature activations, while $\gamma$ and $\beta$ are learnable scale and shift parameters. These regularization techniques improve the generalizability of the model, allowing it to identify plant diseases in varying environmental settings.
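In framework terms, Equations 17 and 18 correspond to standard dropout and batch normalization layers; the dropout rate and channel count below are assumptions.

```python
# Sketch of the regularization of Eqs. 17-18 applied to a feature map.
import torch.nn as nn

regularize = nn.Sequential(
    nn.Dropout(p=0.1),     # Eq. 17: randomly zero attention-weighted features
    nn.BatchNorm2d(128),   # Eq. 18: normalize with learnable gamma and beta
)
```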

Our two-stage ensemble architecture using CoAtNet and Enhanced CoAtNet is key to the overall high accuracy and robustness of plant disease diagnosis. The model exploits the complementary power of CNNs to extract local features and of Transformers to capture long-range correlations. The multiscale aggregation in Enhanced CoAtNet is necessary to identify diseases at multiple levels, while the adaptive attention mechanism shifts focus toward the most significant features. Moreover, dropout regularization and batch normalization counteract overfitting and improve performance on new data.

The approach uses U-Net and Swin Transformer V2 to segment the image and an Enhanced CoAtNet as its classifier, achieving remarkable results compared to current deep learning-based algorithms when tested on the PlantVillage dataset. This synergistic integration isolates diseased areas before classification, making the proposed system precise and reliable for automated plant disease detection.

3.6 Meta-heuristic decision fusion using the levy flight-based honey badger algorithm for enhanced diagnostic accuracy

To overcome the drawbacks of existing decision fusion methods, such as majority voting and probabilistic weighted averaging, we incorporate the Levy Flight-Based Honey Badger Algorithm (LFHBA) into our model. This meta-heuristic technique improves feature selection and classifier fusion, allowing the model to adapt dynamically to different plant disease scenarios.

The Honey Badger Algorithm (HBA) is a bio-inspired optimization process based on the hunting and searching patterns of honey badgers (Hashim et al., 2022). It has two broad phases, exploration and exploitation, for finding new solutions and refining old ones. The HBA position update is given by:

$X_{t+1} = X_t + r \cdot (P - X_t) \cdot e^{-t/T}$    (19)

where $X_{t+1}$ and $X_t$ denote the updated and current positions of the candidate solution, $P$ is the best solution found, $r$ is a random movement factor, $t$ is the current iteration, and $T$ is the total number of iterations.

To improve HBA, we integrate Levy Flight exploration, which makes it easier to escape local optima through occasional long random jumps:

$S = \frac{\mu \cdot \sigma}{|v|^{1/\beta}}, \qquad \sigma = \left[\frac{\Gamma(1+\beta)\,\sin(\pi\beta/2)}{\Gamma\!\left(\frac{1+\beta}{2}\right)\beta\, 2^{(\beta-1)/2}}\right]^{1/\beta}$    (20)

Here, $S$ is the step size, $\mu$ and $v$ are normally distributed variables, $\beta$ is the Levy index, and $\Gamma$ is the gamma function. With LFHBA in our ensemble, we heuristically calibrate the weight values of the networks (CoAtNet, Enhanced CoAtNet, and Swin Transformer V2) according to the confidence of each classifier:

$F = \sum_{i=1}^{N} w_i C_i(X)$    (21)

where $N$ is the number of classifiers, $C_i(X)$ is the output of classifier $i$ for input $X$, and $w_i$ is the adaptive weight optimized by LFHBA. The adaptive weighting scheme lets the model give importance to the best predictions while reducing the effect of less accurate ones. LFHBA's capacity to balance exploration and exploitation is important for achieving high classification accuracy across a vast number of disease classes and leaf textures, making the model more practical in real-life situations.
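The sketch below is a simplified re-implementation of this fusion step (Equations 19-21), not the authors' exact algorithm: Mantegna-style Levy steps perturb position updates that pull candidate weight vectors toward the best solution, searching for ensemble weights that maximize validation accuracy.

```python
# Hedged sketch of LFHBA-style optimization of ensemble fusion weights.
import math
import numpy as np

def levy_step(beta=1.5, size=1):
    # Eq. 20 (Mantegna's algorithm): heavy-tailed step S = mu * sigma / |v|^(1/beta)
    sigma = (math.gamma(1 + beta) * math.sin(math.pi * beta / 2) /
             (math.gamma((1 + beta) / 2) * beta * 2 ** ((beta - 1) / 2))) ** (1 / beta)
    mu = np.random.standard_normal(size)
    v = np.random.standard_normal(size)
    return mu * sigma / np.abs(v) ** (1 / beta)

def fuse(weights, probs):
    # Eq. 21: F = sum_i w_i C_i(X); probs has shape (n_classifiers, samples, classes)
    w = np.abs(weights) / np.abs(weights).sum()
    return np.tensordot(w, probs, axes=1)

def lfhba_optimize(probs, labels, n_agents=20, iters=100):
    n = probs.shape[0]                        # number of classifiers
    X = np.random.rand(n_agents, n)           # candidate weight vectors
    best, best_fit = X[0].copy(), -np.inf
    for t in range(iters):
        for i in range(n_agents):             # fitness = fused validation accuracy
            fit = (fuse(X[i], probs).argmax(axis=1) == labels).mean()
            if fit > best_fit:
                best, best_fit = X[i].copy(), fit
        r = np.random.rand(n_agents, n)
        decay = math.exp(-t / iters)          # Eq. 19: decaying pull toward best P
        X = X + r * (best - X) * decay + 0.01 * levy_step(size=(n_agents, n))
        X = np.clip(X, 1e-6, None)            # keep weights positive
    return best / best.sum(), best_fit
```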

The integration of LFHBA improves the overall robustness and accuracy of our decision fusion procedure beyond conventional ensemble models. This optimization algorithm is essential for obtaining state-of-the-art performance in the automatic diagnosis of plant diseases and is well poised to handle the complexity of such systems in real-life conditions.

4 Results and considerations

4.1 Implementation aspects

Our model was implemented in Python 3.9 with the essential packages, including Matplotlib for data visualization, Pandas for data manipulation, and NumPy and Seaborn for numerical computation and statistical visualization, respectively. The experimental setup ran on a high-performance PC workstation with an Intel i7 processor, an NVIDIA Tesla GPU to accelerate extensive computation, and 16 GB of RAM operating at 3.2 GHz. Table 2 presents the training configurations and parameter settings used to develop and optimize the proposed approach.


Table 2. Training configuration and parameter settings for the proposed approach.

4.2 Assessment criteria

The effectiveness of the proposed method was examined using a set of critical performance indicators: precision, accuracy, specificity, F1-score, and recall. These metrics were used to compare against state-of-the-art ensemble feature selection procedures and highlight the strengths of our approach. Moreover, a comprehensive analysis of computational cost was undertaken to demonstrate its resource-efficient character. The formulas used to compute these measures appear in Table 3.


Table 3. Performance measures utilized in the evaluation.

The confusion matrix breaks predictions down into True Positives (TP), False Positives (FP), False Negatives (FN), and True Negatives (TN). A TP is a case in which the model correctly detects the presence of a positive condition, matching the instance's actual positive status. An FP is a false positive identification, i.e., classifying a negative case as positive. An FN occurs when the model incorrectly labels a positive case as negative, missing the true condition. Finally, a TN represents the successful prediction of a negative case, matching the actual absence of the examined condition. These components are fundamental to quantifying the effectiveness and accuracy of predictive models.
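These counts map directly onto the metrics of Table 3. As a worked check, the sketch below computes them from the counts later reported in Figure 15; the result reproduces the roughly 99.31% accuracy stated in the abstract.

```python
# Metrics derived from confusion-matrix counts (TP, TN, FP, FN).
def metrics(tp, tn, fp, fn):
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)                    # sensitivity
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, specificity, accuracy, f1

# Counts from Figure 15: accuracy evaluates to about 0.9931.
print(metrics(tp=8033, tn=8147, fp=48, fn=64))
```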

4.3 Experimental findings

Figure 10 shows the validation performance of the suggested approach, demonstrating its ability to maintain stable performance across various settings. This consistency evidences the reliability and efficacy of the algorithm, which delivers consistent results despite fluctuations in operating conditions, and points to its versatility across a wide range of applications.


Figure 10. Validation performance of the proposed algorithm.

4.4 Statistical analyses

We compared the suggested model with other existing plant disease diagnosis systems to determine its effectiveness; the results are given in Table 4. The outcomes confirm the model's accuracy and efficiency, which are especially notable given that it does not depend on specialized hardware. This characteristic makes our solution affordable and broadly accessible to different user groups.


Table 4. Performance metrics of the different optimization approach using PlantVillage dataset.

Figure 11 displays the receiver operating characteristic (ROC) curve for the plant disease diagnosis classification task. The curve plots the True Positive Rate (TPR), the proportion of correctly identified diseased samples, against the False Positive Rate (FPR), the proportion of healthy samples incorrectly labeled as diseased, over a range of classification thresholds. The visualization assesses the approach's ability to discriminate between healthy and diseased botanical specimens.

Figure 11. ROC curve for the proposed algorithm (AUC = 0.99).

Figure 12 shows line plots of how the training and validation loss values change over many epochs. These plots illuminate the temporal dynamics of the approach's performance: the decrease in training loss reflects the optimization process, while the validation loss indicates how well the model generalizes beyond the training dataset. A large gap between the two loss values may indicate overfitting, i.e., memorization of the training data that results in poor generalization to new data.

Figure 12. Line plots of the training and validation loss values over 200 training epochs (training loss decreases from roughly 0.5 to 0.1; validation loss from roughly 0.6 to 0.2).

Figure 13 compares the computational complexity of various approaches, considering processing time, memory consumption, and total resource consumption. This analysis plays a significant role in selecting a model capable of functioning within the limits of the available computing infrastructure while balancing accuracy and feasibility of implementation. Moreover, Figure 14 presents a statistical examination of the different optimization techniques, which demonstrates that the suggested algorithm is the most accurate and reliable in all individual cases, giving strong performance in numerous situations and thus ensuring more solid and reliable results.


Figure 13. Comparison of computational complexity.


Figure 14. Statistical comparison for the recommended approach among varied optimization approaches.

Table 4 gives an in-depth summary of the performance scores of different optimization algorithms, denoted as Algorithms 1 through 5, compared in terms of their diagnostic performance on different plant diseases using the PlantVillage data. Metrics such as precision, accuracy, recall, F1-score, and specificity are provided to give a refined picture of each method's effectiveness in distinguishing diseased from healthy images. Figure 15 visualizes the confusion matrix of the proposed approach on the PlantVillage dataset, a quantitative account of the model's correct and incorrect predictions across classes that allows the classification accuracy to be studied in detail.

Figure 15. Confusion matrix of the recommended approach in diagnosing plant diseases (TN = 8147, FP = 48, FN = 64, TP = 8033).

Table 5 shows the performance measures of different learning methods tested on real-time data supplied by webcams in natural field settings. The images include real-life factors such as lighting differences and background noise, covering 10 different classes from 5 different plants. The validation was conducted on almost 200 collected pictures. As Table 5 shows, no model outperforms the suggested model, demonstrating its accuracy and suitability for real-time scenarios in the agricultural sector.


Table 5. Performance metrics of the different optimization approaches using real time dataset.

Lastly, Table 6 compares our suggested framework with existing systems in the field of plant disease diagnosis. The results show the high diagnostic accuracy of our method and demonstrate its effectiveness. A notable benefit of our approach is that it does not require special hardware, making it both economical and broadly applicable across user scenarios.


Table 6. Comparison of recommended approach with varied existing systems.

4.5 Statistical outcomes and comparison of optimizers

In the course of our investigation, we prepared a statistical account of the different optimizers, depicted in Table 7. This comparative analysis is important for assessing the behavior of each optimization algorithm within the proposed plant disease diagnosis system. Table 7 summarizes key measurements for each optimizer: the best, worst, mean, and median results, the standard deviation (SD), and the variance. Table 8 shows a comparative assessment of the MBT model against other baseline models to gauge its diagnostic efficiency across different plant diseases.


Table 7. Indicator performance of different optimizers.


Table 8. Comparative MBT analysis.

Table 9 compares the proposed ensemble heterogeneous transformer model with other commercial models on the diagnosis of multiple plant diseases in order to evaluate their efficiency. The results show that the proposed solution outperforms the existing ones in accuracy and in its applicability to real-time deployment settings. Meanwhile, Table 10 presents the ablation study, showing the impact of each module on the ensemble model's performance. Removing U-Net reduced accuracy by 0.9%, and excluding Swin Transformer V2 caused a 1.1% drop, confirming their importance for local and global feature extraction. Excluding CoAtNet reduced accuracy by 2.2%, while removing Enhanced CoAtNet led to the largest drop of 3.1%, showing the value of multiscale feature aggregation and adaptive attention. Eliminating LFHBA fusion reduced accuracy by 1.8%, proving its role in optimizing ensemble predictions. The full framework achieved 0.9932 accuracy, confirming that all modules together enhance detection performance. Table 11 compares several deep learning models (VGG16, DenseNet-77, CDCNN, Mobile ViT, CNN, and the Recommended Model) on GPU and CPU in terms of inference time, memory required, FLOPS, computational time, and the number of trainable parameters. These results indicate that the recommended method has the lowest inference time and memory consumption on both platforms and the minimum computational requirements, confirming its efficiency and suitability for real-time application, particularly in resource-constrained and high-performance computing systems.


Table 9. Comparative analysis among commercial and recommended model.


Table 10. Ablation study for the recommended approach.


Table 11. Computational parameters for the different models in both CPU and GPU.

To make these results more graphical, Figure 16 presents the statistical analysis of the different optimizers used in the system, revealing their specific characteristics and potential constraints. By examining the accuracy, convergence rate, and stability of each algorithm, we gain insight into its capacity to train the model effectively.


Figure 16. Statistical analysis for different optimizers.

The performance analysis of the optimizers is summarized below:

- The VGG16 optimizer achieves a best result of 0.749 and a mean of 0.72502. However, its variance of 0.043532 is comparatively high, indicating less steady performance.

- DenseNet-77, on the other hand, has a lower mean accuracy of 0.69244, but its lower variance of 0.00065×10⁻⁶ suggests more stable performance across datasets.

- CDCNN, which focuses on spatial information, achieves a best result of 0.75730 and a mean accuracy of 0.65562. Despite its variance of 0.00655×10⁻⁵, it is comparatively slow and unstable.

- Mobile ViT, designed for lightweight applications, demonstrates a mean accuracy of 0.71549, with a noteworthy best value of 0.78955. Its variance, however, is higher at 0.003135 ×10−4, indicating that it may not be as consistent as the other models in certain scenarios.

- The CNN, a classic approach, achieves a mean accuracy of 0.72502 with a variance of 0.002922 ×10−4, reflecting its well-established status in the field of computer vision.

- Our proposed model, incorporating a sophisticated blend of U-Net, Swin Transformer V2, and TNNs such as CoAtNet and Enhanced CoAtNet, outperforms the others with a mean accuracy of 0.85741, a best performance of 0.9932, and the lowest variance of 0.0012895 ×10−4. This underscores the efficacy of our approach in balancing high accuracy with reliable performance.

The statistical analysis presented here provides a quantitative foundation for understanding the optimizer landscape and guiding the selection of the most appropriate one for the specific needs of our plant disease diagnosis system. The convergence speed and stability are particularly significant, as they directly impact the approach’s capability to generalize and adapt to new, unseen data, which is critical in real-world agricultural settings.
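For reference, best/worst/mean/median/standard-deviation figures of this kind can be reproduced from repeated training runs; a minimal sketch with hypothetical per-run accuracies (the values below are placeholders, not the paper’s measurements):

```python
import numpy as np

# Hypothetical per-run validation accuracies for one model (e.g., ten random seeds).
runs = np.array([0.72, 0.74, 0.71, 0.749, 0.73, 0.72, 0.70, 0.73, 0.72, 0.71])

stats = {
    "best": runs.max(),
    "worst": runs.min(),
    "mean": runs.mean(),
    "median": np.median(runs),
    "std": runs.std(ddof=1),   # sample standard deviation across runs
}
for name, value in stats.items():
    print(f"{name}: {value:.5f}")
```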

4.6 Interpretability analysis

In this section, the GRAD-CAM method is adopted to localize diseased regions in plant leaves across the PlantVillage dataset. The GRAD-CAM visualizations in Figures 17a–c pair the original leaf images with their corresponding heatmaps, highlighting the leaf areas that most strongly influence disease classification. These visualizations show that the model accurately recognizes disease symptoms and reveal how it attends to spatial features, improving the transparency and interpretability of the detection process.
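A minimal Grad-CAM sketch, using forward and backward hooks on a stand-in torchvision backbone rather than the paper’s ensemble members, shows how such heatmaps are produced:

```python
import torch
import torch.nn.functional as F
import torchvision.models as models

def grad_cam(model, layer, x, class_idx=None):
    """Grad-CAM heatmap for a single image tensor x of shape (1, 3, H, W)."""
    acts, grads = {}, {}
    h1 = layer.register_forward_hook(lambda m, i, o: acts.update(a=o))
    h2 = layer.register_full_backward_hook(lambda m, gi, go: grads.update(g=go[0]))
    try:
        logits = model(x)
        idx = int(logits.argmax()) if class_idx is None else class_idx
        model.zero_grad()
        logits[0, idx].backward()                      # gradients of the target class
        w = grads["g"].mean(dim=(2, 3), keepdim=True)  # global-average-pool the grads
        cam = F.relu((w * acts["a"]).sum(dim=1, keepdim=True))
        cam = F.interpolate(cam, size=x.shape[2:], mode="bilinear", align_corners=False)
        return (cam / cam.max().clamp(min=1e-8)).squeeze()  # normalized to [0, 1]
    finally:
        h1.remove()
        h2.remove()

# Stand-in backbone and random input; a real leaf image tensor would replace x.
model = models.resnet18(weights=None).eval()
x = torch.randn(1, 3, 224, 224)
heatmap = grad_cam(model, model.layer4, x)  # overlay on the input image for display
```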

Figure 17. (a–c) Interpretability analysis using GRAD-CAM for plant disease localization: (a) a leaf with dark, dry patches beside a heatmap highlighting the affected area in red and yellow; (b) a slightly yellowing leaf with minor blemishes and moderate activation; (c) a heavily damaged leaf with multiple dark spots and strong activation in red tones.

Overall, the obtained results confirm the efficiency, scalability, and interpretability of the ensemble heterogeneous transformer model. The subsequent section discusses the broader implications and possible enhancements of this framework in future agricultural systems.

5 Conclusion and future directions

This paper has proposed a novel and highly effective heterogeneous ensemble deep learning framework for fast and accurate plant disease detection. The framework combines advanced image segmentation methods, U-Net and Swin Transformer V2, with Transformer Neural Networks, CoAtNet and Enhanced CoAtNet. This two-stage design improves both segmentation quality and disease classification accuracy. The framework is further strengthened by a meta-heuristic decision fusion strategy that combines the outputs of the individual models. For transparency and explainability, the GRAD-CAM technique generates heat maps of the leaf regions that contribute most to the model outputs. The empirical results are convincing: 7.34% better than single-method segmentation, 8.43% better than homogeneous ensemble models, and 14.59% better than individual deep learning models. In addition, the framework performs strongly on early-stage disease detection. Overall, the combination of heterogeneous architectures with complementary strengths represents a significant advance in plant pathology and a platform for sustainable agriculture and global food security. Although security issues are not considered in the current work, future extensions will include secure data handling and on-device processing to mitigate privacy risks. Data protection regulations such as GDPR will also be addressed to ensure ethical data deployment in IoT- and mobile-based agricultural systems. Moreover, SHAP or LIME can be combined with GRAD-CAM in future research to further improve interpretability.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.

Author contributions

PK: Software, Writing – original draft, Formal analysis, Data curation. PG: Conceptualization, Formal analysis, Writing – original draft, Investigation. EA: Formal analysis, Writing – review & editing, Supervision. MQ: Methodology, Conceptualization, Writing – original draft. JB: Writing – review & editing, Supervision, Validation. YC: Writing – review & editing, Formal analysis, Visualization. YN: Writing – review & editing, Validation, Formal analysis.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. This research is supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2025R161), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. RS-2023-00218176), and the Soonchunhyang University Research Fund. We extend our appreciation to the Deanship of Research and Graduate Studies at King Khalid University for funding this work through the Large Research Project under grant number RGP2/109/46. The authors are thankful to the Deanship of Graduate Studies and Scientific Research at the University of Bisha for supporting this work through the Fast-Track Research Support Program.

Acknowledgments

This research is supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2025R161), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. RS-2023-00218176), and the Soonchunhyang University Research Fund. We extend our appreciation to the Deanship of Research and Graduate Studies at King Khalid University for funding this work through the Large Research Project under grant number RGP2/109/46. The authors are thankful to the Deanship of Graduate Studies and Scientific Research at the University of Bisha for supporting this work through the Fast-Track Research Support Program.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Albahli, S. and Nawaz, M. (2022). DCNet: DenseNet-77-based CornerNet model for the tomato plant leaf disease detection and classification. Front. Plant Sci. 13. doi: 10.3389/fpls.2022.957961

Amirkhani, A., Khosravian, A., Masih-Tehrani, M., and Kashiani, H. (2021). Robust semantic segmentation with multi-teacher knowledge distillation. IEEE Access, 1–1. doi: 10.1109/ACCESS.2021.3107841

Arsenovic, M., Karanovic, M., Sladojevic, S., Anderla, A., and Stefanovic, D. (2021). Solving current limitations of deep learning based approaches for plant disease detection. Symmetry 11, 939. doi: 10.3390/sym11070939

Barman, U., Sarma, P., Rahman, M., Deka, V., Lahkar, S., Sharma, V., et al. (2024). ViT-smartAgri: vision transformer and smartphone-based plant disease detection for smart agriculture. Agronomy 14, 327. doi: 10.3390/agronomy14020327

Chen, J., Chen, J., Zhang, D., Sun, Y., and Nanehkaran, Y. A. (2021). Using deep transfer learning for image-based plant disease identification. Comput. Electron. Agric. 173, 105393. doi: 10.1016/j.compag.2020.105393

De Silva, M. and Brown, D. (2023). Multispectral plant disease detection with vision transformer–convolutional neural network hybrid approaches. Sensors 23, 8531. doi: 10.3390/s23208531

Duhan, S., Gulia, P., Gill, N. S., Shukla, P. K., Khan, S. B., Almusharraf, A., et al. (2024). Investigating attention mechanisms for plant disease identification in challenging environments. Heliyon 10, e29802. doi: 10.1016/j.heliyon.2024.e29802

Ferentinos, K. P. (2018). Deep learning models for plant disease detection and diagnosis. Comput. Electron. Agric. 145, 311–318. doi: 10.1016/j.compag.2018.01.009

Ferentinos, K. P. (2021). Deep learning models for plant disease detection and diagnosis. Comput. Electron. Agric. 170, 105247. doi: 10.1016/j.compag.2020.105247

Hashim, F. A., Houssein, E. H., Hussain, K., Mabrouk, M. S., and Al-Atabany, W. (2022). Honey Badger Algorithm: New metaheuristic algorithm for solving optimization problems. Math. Comput. Simul. 192, 84–110. doi: 10.1016/j.matcom.2021.08.013

Hemalatha, S. and Jayachandran, J. J. B. A. (2024). Multitask learning-based vision transformer for plant disease localization and classification. Int. J. Comput. Intell. Syst. 17, 188. doi: 10.1007/s44196-024-00597-3

Kabir, M., Akinci, T. C., Martinez-Morales, A., and Ekici, S. (2024). HybridPlantNet23: A Scientific Insight into the Power of Ensemble Modelling using VGG16 and Darknet53 for Plant Disease Classification. doi: 10.20944/preprints202407.0556.v1

Kabir, M. M., Ohi, A. Q., and Mridha, M. F. (2021). A multi-plant disease diagnosis method using convolutional neural network. doi: 10.1007/978-981-33

Khan, M. A., Khan, S., and Ali, A. (2021). A novel deep learning framework for grape leaf disease detection. Comput. Electron. Agric. 180, 105889. doi: 10.1016/j.compag.2020.105889

Li, G., Wang, Y., Zhao, Q., Yuan, P., and Chang, B. (2023). PMVT: A lightweight vision transformer for plant disease identification on mobile devices. Front. Plant Sci. 14. doi: 10.3389/fpls.2023.1256773

Li, Y., Nie, J., Chao, X., and Zhang, Y. (2021). “A novel transfer learning method for plant disease recognition,” in 2021 IEEE International Conference on Image Processing (ICIP). 1389–1393 (United States: IEEE). doi: 10.1109/ICIP42928.2021.9506540

Liu, B., Zhang, Y., He, D., and Li, Y. (2021). Identification of apple leaf diseases based on deep convolutional neural networks. Symmetry 10, 11. doi: 10.3390/sym10010011

Lye, H. Z. M. and Ng, K.-W. (2023). Processing plant diseases using transformer model. JOIV.: Int. J. Inf. Visual. 7, 2550. doi: 10.30630/joiv.7.4.02291

Mishra, S., Biswas, R., Sharma, V., Khan, S. B., Alkhaldi, N., and Saraee, M. (2025). AeroGlan: A smart and sustainable plant species estimator for organic and localized air filtering. Recent Adv. Electr. Electron. Eng. doi: 10.2174/0123520965324611241023180352

Mohanty, S. P., Hughes, D. P., and Salathé, M. (2021). Using deep learning for image-based plant disease detection. Front. Plant Sci. 7. doi: 10.3389/fpls.2016.01419

Picon, A., Alvarez-Gila, A., Seitz, M., Ortiz-Barredo, A., and Echazarra, J. (2021). Deep convolutional neural networks for mobile capture device-based crop disease classification in the wild. Comput. Electron. Agric. 161, 280–290. doi: 10.1016/j.compag.2018.04.002

Rangarajan, A. K., Purushothaman, R., and Ramesh, A. (2021). Tomato crop disease classification using pre-trained deep learning algorithm. Proc. Comput. Sci. 133, 1040–1047. doi: 10.1016/j.procs.2018.07.070

Saleem, S., Sharif, M. I., Sharif, M. I., Sajid, M. Z., and Marinello, F. (2024). Comparison of deep learning models for multi-crop leaf disease detection with enhanced vegetative feature isolation and definition of a new hybrid architecture. Agronomy 14, 2230. doi: 10.3390/agronomy14102230

Saxena, S. and Rathor, S. (2023). An ensemble-based model of detecting plant disease using CNN and random forest. 2023 6th International Conference on Information Systems and Computer Networks (ISCON) 1–6. doi: 10.1109/ISCON57294.2023.10112023

Shafik, W., Tufail, A., De Silva Liyanage, C., et al. (2024). Using transfer learning-based plant disease classification and detection for sustainable agriculture. BMC Plant Biol. 24, 136. doi: 10.1186/s12870-024-04825-y

Sharma, V., Tripathi, A. K., and Mittal, H. (2023). DLMC-Net: Deeper lightweight multi-class classification model for plant leaf disease detection. Ecol. Inf. 75, 102025. doi: 10.1016/j.ecoinf.2023.102025

Sibiya, M. and Sumbwanyambe, M. (2021). A computational procedure for the recognition and classification of maize leaf diseases out of healthy leaves using convolutional neural networks. AgriEngineering 1, 119–131. doi: 10.3390/agriengineering1010009

Too, E. C., Yujian, L., Njuki, S., and Yingchun, L. (2021). A comparative study of fine-tuning deep learning models for plant disease identification. Comput. Electron. Agric. 161, 272–279. doi: 10.1016/j.compag.2018.03.032

Wang, B., Yang, M., Cao, P., and Liu, Y. (2025). A novel embedded cross framework for high-resolution salient object detection. Appl. Intell. 55, 277. doi: 10.1007/s10489-024-06073-x

Yang, B., Li, M., Li, F., et al. (2024). A novel plant type, leaf disease and severity identification framework using CNN and transformer with multi-label method. Sci. Rep. 14, 11664. doi: 10.1038/s41598-024-62452-x

Zhang, S., Zhang, S., Zhang, C., Wang, X., and Shi, Y. (2021). Cucumber leaf disease identification with global pooling dilated convolutional neural network. Comput. Electron. Agric. 162, 422–430. doi: 10.1016/j.compag.2019.04.018

Zhou, Y. and Zhang, Y. (2021). “Plant disease detection and classification based on deep learning,” in 2021 13th International Conference on Advanced Computational Intelligence (ICACI). 790–795 (IEEE). doi: 10.1109/ICACI52617.2021.9435913

Keywords: CoATNets, ensemble deep learning architectures, plant diseases, plant village datasets, transformer neural networks

Citation: Kalpana P, Gera P, Alabdulkreem E, Quasim MT, Baili J, Cho Y and Nam Y (2025) An ensemble heterogeneous transformer model for an effective diagnosis of multiple plant diseases. Front. Plant Sci. 16:1693095. doi: 10.3389/fpls.2025.1693095

Received: 26 August 2025; Accepted: 03 November 2025;
Published: 03 December 2025.

Edited by:

Zhentao Wang, Shihezi University, China

Reviewed by:

Muhammad Imran Sharif, Kansas State University, United States
Dr. Usama Khalid, Hanyang University, Republic of Korea
Gaurav Gupta, Shoolini University, India

Copyright © 2025 Kalpana, Gera, Alabdulkreem, Quasim, Baili, Cho and Nam. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ponugoti Kalpana, drkalpanacse@gmail.com; Yongwon Cho, dragon1won@sch.ac.kr; Yunyoung Nam, ynam@sch.ac.kr
