Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Plant Sci.

Sec. Sustainable and Intelligent Phytoprotection

This article is part of the Research TopicAdvanced Imaging and Phenotyping for Sustainable Plant Science and Precision Agriculture 4.0View all articles

A Hybrid Vision Transformer and ResNet18 based Model for Biotic Rice Leaf Disease Detection

Provisionally accepted
Sankar  SennanSankar Sennan1Ramasubbareddy  SomulaRamasubbareddy Somula2Yongyun  ChoYongyun Cho1*Selvaganapathi  SennanSelvaganapathi Sennan3
  • 1Department of Information and Communication Engineering, Sunchon National University, Suncheon, Republic of Korea
  • 2Associate Professor, Department of Computer Science and Engineering, Symbiosis Institute of Technology, Hyderabad Campus,, Symbiosis International (Deemed University), Pune, India
  • 3Technical Architect, Hexaware Technologies arizona, Arizona, United States

The final, formatted version of the article will be published soon.

Agriculture is crucial to human survival. The growing of biotic rice plants is very helpful for feeding a lot of people around the world, especially in places where rice is a main food. The detection of rice leaf disease is critical to increasing crop productivity. To improve the accuracy of rice leaf disease prediction, this paper proposes a hybrid Vision Transformer (ViT) with pre-trained ResNet18 models (ViT-ResNet18). In general, the input images apply to the pre-trained ViT and ResNet18 models independently. The output features of these two models are combined and fed into the final Fully Connected (FC) layer, followed by a Softmax layer for final classification. The output of rice leaf diseases from the FC layer. The proposed hybrid ViT with ResNet18 model achieved 94.4% accuracy, a precision of 0.948, a recall of 0.944, an F1-Score of 0.942, and an Area Under Curve (AUC) of 0.985. The proposed hybrid model ViT-ResNet18 shows a 5%, 1%, and 1% improvement in accuracy compared to VGG16 with Neural Network, Inception V3 with Neural Network, and SqueezeNet with Neural Network classifier, respectively.

Keywords: Rice leaf disease, vision Transformer, Resnet18, Agriculture, ImageClassification, deep learning

Received: 23 Sep 2025; Accepted: 24 Oct 2025.

Copyright: © 2025 Sennan, Somula, Cho and Sennan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Yongyun Cho, yycho@scnu.ac.kr

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.