Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Plant Sci.

Sec. Technical Advances in Plant Science

HDA-YOLO: A Hierarchical Attention-Driven Dense Fusion Network for Rice Pest Detection in Complex Agricultural Environments

Provisionally accepted
Shuo  YuanShuo YuanYing  DuanYing DuanHongting  SuHongting SuXinhui  ZhouXinhui Zhou*Yinfeng  HaoYinfeng Hao*
  • Henan University, Kaifeng, China

The final, formatted version of the article will be published soon.

Rapid and intelligent identification of rice pests serves as the core sensing technology for precision plant protection and smart rice farming systems, providing critical support for intelligent cultivation decisions. To address the challenges of insufficient robustness and low precision of existing lightweight detection models in complex agricultural environments, this study proposes an improved lightweight YOLOv8 model based on a Hierarchical Attention-Driven Dense Fusion network (HDA-YOLO) for fast and high-precision pest detection. To enhance feature fidelity, the model incorporates asymmetric dynamic downsampling (ADDS) and a multi-scale cascade pre-fusion (MCPF) module into the backbone network. To achieve dynamic, content-aware feature fusion, a hierarchical attention-driven dense fusion network (HADF-Net) is constructed, integrating an intra-scale self-attention module (ISAM) and an inter-scale cross-attention module (ICAM). Furthermore, the C2f module is upgraded to a multi-scale context (MSC) module to improve adaptability to variations in target scale. Experimental results on the self-built RicePest_12 dataset demonstrate that HDA-YOLO, while maintaining a lightweight architecture (3.93M parameters, 12.02 GFLOPs), achieves significant improvements over the baseline YOLOv8n model, with mAP@50, F1-score, and Recall increasing by 2.4%, 3.8%, and 4.8%, respectively. In comparison with the Transformer-based RT-DETR-R18 model, HDA-YOLO achieves a 4.8 percentage points higher mAP@50, while its computational cost is only 22% and its parameter count is only 20% of RT-DETR-R18. Moreover, the proposed model has been successfully deployed on a mobile application, achieving real-time and accurate identification of field pests and demonstrating significant potential in the field of smart rice agriculture.

Keywords: attention mechanism, deep learning, Mobile application, Rice pest detection, Smart rice agriculture

Received: 09 Dec 2025; Accepted: 30 Jan 2026.

Copyright: © 2026 Yuan, Duan, Su, Zhou and Hao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Xinhui Zhou
Yinfeng Hao

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.