- College of Mechanical and Electrical Engineering, Heilongjiang Institute of Technology, Harbin, China
Introduction: This study addresses the problem of detecting multiple and occluded kidney bean targets in complex agricultural scenarios under open-field trellis cultivation.
Methods: A target detection model, KidneyB-YOLO, is proposed. The model integrated a dynamic convolution module, the Deformable Attention Transformer (DAT) attention mechanism, an improved Adaptively Spatial Feature Fusion (ASFF) detection head, and the Focaler-SIoU loss function. The dynamic convolution module adaptively adjusted its parameters based on the input. The DAT attention mechanism introduced deformable attention, focusing only on a small set of key regions in the image. The ASFF detection head, an adaptive spatial feature fusion method, filtered out conflicting target information. The new loss function module, Focaler-SIoU, was formed by combining the SIoU loss function with the Focaler-IoU strategy. An open-canopy bean pod dataset was constructed for model training and validation.
Results: The results showed that the KidneyB-YOLO model exhibited significant improvements compared to the original YOLOv8n, especially in occluded scenarios. The model achieved a detection performance of 85.90% mAP with a computational complexity of 12.4 GFLOPs, a model size of 12 MB, and operated at 32.5 FPS.
Discussion: The enhancements improved the accuracy and robustness of bean pod detection. The model demonstrated strong robustness and generalization capability in the fruit detection task for kidney bean harvesting under open-air trellises.
1 Introduction
Kidney bean is an annual herbaceous plant in the legume family. It is relatively sensitive to light and grows best in deep, well-drained loamy soils. As a tender, flavorful, and nutritious vegetable, kidney beans are favored by a wide range of consumers. With the growing demand for higher vegetable yield and quality, the current kidney bean cultivation models struggle to produce high-quality products. The traditional open-field cultivation model faces challenges in achieving fine field management, yield monitoring, quality control, and automated harvesting, which makes the implementation of intelligent management difficult (Ilić and Fallik, 2017; Chen et al., 2024). As a climbing crop, kidney beans require support structures during their growing period. In modern agricultural production, the detection and segmentation of kidney bean fruits are essential for the intelligent management of their growth. Developing accurate and reliable detection models based on open-field trellis cultivation holds significant research value and practical application. In an open-field trellis environment, the fruits and stems of the kidney bean plant are often heavily obstructed; moreover, the fruits, stems, and leaves have similar coloration, making them difficult to distinguish. The complex background of open-field environments further complicates image analysis (Xiao et al., 2023).
Early methods of fruit identification relied upon traditional image processing techniques and machine learning algorithms (Weng et al., 2025; Xie et al., 2025); they achieved relatively good results under simple conditions but failed to identify fruits reliably in complex scenes. With the development of computer vision technology, deep learning models have achieved significant advancements in image segmentation (Gao et al., 2025; Tariq et al., 2025). State-of-the-art YOLO series models have become the most widely applied models in the field of object detection. As a single-stage detection method, YOLO directly realizes object classification and bounding box localization, making it particularly suitable for real-time applications due to its speed and efficiency (Li et al., 2025; Lv et al., 2025). YOLOv8, as a single-stage object detection algorithm in the YOLO series, converts object detection into a regression problem, thereby directly predicting the category and location of targets and realizing precise object detection (Yu et al., 2025).
The detection of occluded and small objects represents one of the most challenging frontier issues in the field of computer vision and object detection. When "small objects" and "severe occlusion" occur simultaneously, the complexity of the problem increases sharply, posing an extreme challenge to existing detection models. In recent years, certain achievements have been made through improvements and optimizations based on YOLO for fruit detection in complex environments. Numerous researchers (Chen et al., 2023b; Zhu et al., 2024; Jin et al., 2025) constructed various Camellia oleifera fruit datasets and enhanced YOLO models from different perspectives to achieve accurate identification of camellia fruits. Additionally, researchers proposed solutions for the recognition of tomatoes (Li et al., 2022; Liang et al., 2025), apples (Jiuxin et al., 2023; Kaukab et al., 2024), lychees (Li et al., 2024a), goji berries (Wang et al., 2024), and other fruits characterized by small sizes and frequent occlusion. Dong et al. (2024) addressed the challenge of accurately detecting cherries entwined with foliage occlusions and fruit overlaps by developing the SFF-YOLOv5s model, which was based on the YOLOv5s framework. By incorporating the Coordinate Attention (CA) mechanism into the backbone network, they enhanced the model's ability to extract critical fruit information. Their study substituted the SCYLLA-IoU (SIoU) loss function for the Complete IoU (CIoU) loss function, thereby improving the detection accuracy of the model. Ge et al. (2025) improved the YOLOv8n model to address the challenges of tomato object detection in complex agricultural environments. They utilized a spatial pyramid pooling structure enhanced with ELAN (SPPELAN) to replace the original spatial pyramid pooling fast (SPPF) structure, minimizing information loss during the convolution processes. Additionally, they introduced the partial self-attention (PSA) mechanism into the feature extraction stage without adding any computational burden. These modifications improved the model's robustness and generalization capabilities. Ren et al. (2025) implemented a lightweight detection approach for apple roses under different cultivation patterns by improving the YOLOv8n model. To address the issues of poor accuracy, missed detections, and false positives in fruit detection and localization, they proposed a lightweight detection method, denoted as YOLO-iBPD. This method replaced the backbone module, C2f, in the YOLOv8n framework to enhance the model's feature extraction and expression capabilities. Additionally, they modified the bounding box loss function to PIoUv2 to better focus on extracting fruit-related information. The model demonstrated improved stability and robustness under the constraint of lightweight design. Wang et al. (2025a) improved the YOLOv8n model for quality detection of apple surface appearance under natural scenes. To enhance the model's ability to handle complex environments with multi-occlusion problems, they combined partial convolution (PConv) with the backbone C2f module of the YOLOv8n framework, while adding an efficient multiscale attention (EMA) mechanism to improve the model's attention capabilities at various scales. However, the improvements in terms of accuracy, recall, and average precision compared to YOLOv8n were limited to 3.4%, 1.1%, and 1.3%, respectively, indicating room for further enhancement.
Additionally, despite extensive research on fruit target detection techniques, most studies did not consider complex open-air trellis scenarios, in which fruits and stems occlude one another and it is challenging to accurately identify fruit boundaries due to their similarity to the background colors (Tang et al., 2023).
It is noteworthy that the development of lightweight models balancing accuracy and efficiency has become a significant trend in the field of agricultural object detection. In recent years, researchers have proposed various efficient architectural designs to adapt to resource-constrained deployment environments. For instance, Sun (2024) proposed the S-YOLO model for greenhouse tomato detection, which achieved a balance between high accuracy and fast inference through lightweight designs such as depthwise separable convolutions. Wang et al. (2025b) introduced LSOD-YOLO specifically for lightweight small object detection, optimizing the feature fusion pathway to enhance sensitivity to small targets. For detecting occluded walnuts, Wang et al. (2025c) developed Ow-YOLO by pruning and restructuring modules of YOLOv8s, significantly reducing model size and computational cost while maintaining accuracy. Zhang and Jiang (2025) designed AHN-YOLO to address the challenge of detecting dense, small-sized tomatoes, improving detection performance for tiny fruits by incorporating an efficient attention mechanism and a lightweight neck network. Zhong et al. (2024) proposed the Light YOLO model specifically for mango detection, which employed a lightweight backbone and a simplified feature pyramid structure to greatly increase detection speed while maintaining high accuracy. Collectively, these works have advanced the application depth of lightweight models in agricultural vision tasks.
However, although previous studies conducted extensive research on fruit detection technology, most experiments did not involve complex scenarios such as open-air trellis environments. In such settings, the severe mutual occlusion between fruits and stems/leaves, coupled with the high color resemblance between fruits and the background, posed significant challenges for accurate fruit boundary identification. Existing general models often misinterpreted a single fruit as multiple fragments due to occlusion during detection. For elongated fruit like kidney beans, partial occlusion—while not affecting human perception of their continuous position and appearance—seriously impaired visual models’ ability to uniformly recognize and precisely localize the complete target.
Therefore, we propose a method capable of adapting to varying degrees of occlusion and complex backgrounds in open-field trellis cultivation scenarios for kidney bean target recognition. The method combines a dynamic convolution (DynamicConv) mechanism with the original C2f module to form a dynamic convolution module, incorporates the Deformable Attention Transformer (DAT) attention mechanism, and adds an improved adaptive spatial feature fusion (ASFF) detection head. Additionally, we design the Focaler-SIoU loss function to enhance computational efficiency, thereby refining feature extraction. This study provides a technical approach for target detection of fresh-vegetable fruits such as kidney beans that require open-air trellis cultivation, demonstrating excellent robustness and generalization capability in the fruit detection task for kidney bean harvesting under open-air trellises.
2 Materials and methods
2.1 Construction of the dataset
2.1.1 Data collection
Mature kidney bean fruits cluster densely on a single vine, making the construction of diverse sample datasets under various conditions critical for improving a model’s robustness and generalization capabilities. To ensure the model achieves good generalization ability and robustness, the dataset must include samples with varying sizes, resolutions, overlapping fruits, complex backgrounds, illumination variations, and distance disturbances, all of which aim to enhance the model’s generalization ability in real-world agricultural scenarios and improve accuracy and stability in practical applications (Chen et al., 2023a; Jia et al., 2025). Based on the aforementioned requirements and the data collection approach of subsequent automated harvesting robots, this study constructed a sample dataset using screenshots from videos.
Using a handheld camera, we recorded four segments of video footage from three vertical heights and three horizontal distances and extracted frames at a rate of one image per second. The dataset consists of 836 original kidney bean images in JPG format, featuring varying resolutions and aspect ratios to meet the non-standard task requirements of target recognition in real-world agricultural scenarios. The dataset encompassed various lighting conditions, including morning, noon, evening, and cloudy weather, as well as multiple occlusion scenarios such as no occlusion, partial occlusion, and heavy occlusion. It also incorporated complex backgrounds featuring open-air metal trellises and included images with varying degrees of fruit overlap. Although the significant variation in image resolutions increased the difficulty of model training, it contributed to enhancing the model's adaptability and robustness in diverse real-world scenarios. During the annotation process, only clearly visible kidney bean instances were labeled. Targets with occlusion exceeding approximately 50% of the fruit body were excluded from annotation. Bounding boxes were strictly drawn to cover only the visible parts, without inferring occluded regions, following common annotation standards for handling occlusion (Mavridou et al., 2019). The construction of the dataset thus took into account the key factors affecting kidney bean detection in real agricultural environments, ensuring close alignment with practical scenarios.
2.1.2 Dataset expansion
To further enrich the feature content of the self-built kidney bean dataset and expand its scale, data augmentation techniques were employed. After data augmentation, the dataset enabled the model to learn more effectively from complex natural environments, thereby enhancing its generalization capability in challenging background scenarios (Ma et al., 2024). The augmented dataset contains more than 3000 sample images. These images are randomly divided into training, validation, and test sets in a 7:2:1 ratio. Figure 1 shows examples of the augmentation techniques applied to the training and validation sets, including brightness and contrast adjustment, Gaussian blur, safe cropping, occlusion masking, and rotation, while Figure 1A depicts the real-world open-field trellis cultivation scenario from which the kidney bean images were collected.
Figure 1. Open-field trellis cultivation environment and examples of the various approaches to dataset augmentation. (A) Open-field trellis cultivation environment, (B) Original image, (C) Brightness and contrast adjustment, (D) Gaussian blur and safe cropping, (E) Rotation and safe cropping, (F) Occlusion masking and safe cropping.
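For reference, the following is a minimal sketch of an augmentation pipeline matching the operations described above, written with the Albumentations library. The library choice and all parameter values are illustrative assumptions rather than the exact settings used to build the dataset.

```python
# Hedged sketch of the augmentation pipeline described above (brightness/contrast,
# Gaussian blur, safe cropping, rotation); parameters are placeholder assumptions.
import albumentations as A

augment = A.Compose(
    [
        A.RandomBrightnessContrast(brightness_limit=0.3, contrast_limit=0.3, p=0.7),
        A.GaussianBlur(blur_limit=(3, 7), p=0.3),
        A.RandomSizedBBoxSafeCrop(height=640, width=640, p=0.3),  # "safe cropping": boxes stay inside
        A.Rotate(limit=15, p=0.5),
    ],
    # Bounding boxes in YOLO format are transformed consistently with the image.
    bbox_params=A.BboxParams(format="yolo", label_fields=["class_labels"]),
)

# Usage: augmented = augment(image=img, bboxes=boxes, class_labels=labels)
```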
2.2 Construction of the kidney bean detection model
To balance the performance of object detection and computational resource constraints, the YOLOv8 series includes n/s/m/l/x specifications (Varghese and Sambath, 2024). As the model scales up from n to x, both the model complexity and the computational cost of detection increase sharply. This significantly improves the accuracy of object detection but leads to an increase in training time, reduced efficiency in storage, and a decline in detection speed. These factors make it difficult to meet the deployment requirements and the frame rate demands for real-time detection of kidney beans in agricultural settings. To ensure that the model parameters and dimensions remain within deployment thresholds while maintaining detection requirements, a balance must be struck. YOLOv8n, being the smallest model in the series, offers extremely fast inference speeds and high detection accuracy, making it suitable as a foundational framework for real-time kidney bean detection. However, in complex agricultural environments, factors such as leaf occlusion, fruit overlaps, and complex backgrounds significantly degrade detection precision, resulting in high rates of false negatives and false positives. Consequently, YOLOv8n demonstrates limited robustness and detection accuracy in real-world scenarios and fails to meet the demands of field operations. YOLOv8n-p2, which is specifically designed for detecting small targets with multiple occlusions, addresses some of these challenges (Jia et al., 2025). In order to improve the detection accuracy and meet the requirements for the real-time detection of kidney beans in field operations, we selected YOLOv8n-p2 as the foundation framework for optimizing the detection of fruit targets in the open-field trellis cultivation environment of kidney bean growing. By gradually increasing the structural complexity while ensuring higher detection accuracy, we introduced dynamic convolution modules, attention mechanisms, and feature fusion strategies to improve the model's recognition of interference targets. Additionally, we optimized the loss function to enhance the similarity between predicted boxes and ground-truth boxes, thereby improving the model's performance in complex scenarios. The improved YOLOv8n-p2 model is named KidneyB-YOLO, reflecting its tailored design for kidney bean fruit detection.
2.2.1 Kidney bean detection model based on the improved YOLOv8n-p2
Based on YOLOv8n-p2, we combined the specific characteristics of kidney bean fruit detection in complex agricultural scenes to restructure the network architecture. Four key optimizations were implemented to enhance detection accuracy and performance: (1) DynamicConv was integrated by combining the original C2f module to form a dynamic convolution module, thereby minimizing information loss during convolution processes. (2) An attention mechanism, DAT, was added to increase the model's focus on regions of interest while simultaneously maintaining performance without a significant increase in computational complexity. (3) An ASFF detection head was incorporated, along with the addition of a fourth detection head specifically designed for small target detection, to improve the precision of small target recognition. (4) A novel loss function, Focaler-SIoU, was developed to optimize the similarity between predicted boxes and ground-truth boxes, thereby enhancing the model's performance in complex scenarios. The overall framework of the improved model, KidneyB-YOLO, is illustrated in Figure 2.
2.2.2 Construction of the dynamic convolution module
The C2f module exhibits issues such as computational redundancy and inadequate feature representation during feature extraction (Li et al., 2024b). To address these challenges and enhance the model's ability to handle complex scenes and small targets, we improved the C2f module in the original YOLOv8n-p2 model by introducing DynamicConv. DynamicConv was originally proposed to increase the parameter count of large-scale visual pretrained models while keeping the growth in floating-point operations minimal; the network therefore gains capacity at low computational cost and can better exploit features learned by large-scale visual pretrained models (Han et al., 2024).
DynamicConv operates by dynamically selecting or combining different convolution kernels (referred to as "experts") for each input sample, allowing the network to adaptively adjust its parameters based on the input characteristics. This approach extends traditional convolution operations, enabling the network to self-adaptively adjust its architecture. Instead of using fixed convolution kernels for all inputs, dynamic convolution employs multiple kernels (or parameter sets) and dynamically selects the appropriate kernel based on the input's properties. The selection process is governed by a learned function, such as a multilayer perceptron (MLP) combined with a softmax function, which dynamically generates the weights controlling the contribution of each convolutional kernel. Given an input feature and a set of convolutional kernels, each kernel corresponds to an expert. The contribution of each expert is controlled by a dynamic coefficient, which is independently generated for each input sample. The output is the weighted sum of all dynamically selected convolutional operations, as shown in Equation 1. This dynamic approach allows the network to process inputs more efficiently while maintaining high accuracy, leveraging the flexibility of dynamic kernels to enhance feature extraction and model performance.
where * denotes the convolution operation, and the dynamic coefficients α_i are generated through a lightweight subnetwork whose input features are derived from global average pooling.
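To make the mechanism concrete, the following is a minimal PyTorch sketch of this idea: per-sample coefficients are produced from global average pooling through a lightweight router with a softmax, and they weight several expert kernels before a single convolution is applied. Class names and hyperparameters (e.g., four experts) are illustrative assumptions, not the exact implementation used in this study.

```python
# Minimal sketch of dynamic convolution: a weighted sum of K expert kernels,
# with per-sample weights from global average pooling (an assumption-based illustration).
import torch
import torch.nn as nn
import torch.nn.functional as F

class DynamicConv2d(nn.Module):
    def __init__(self, in_ch, out_ch, k=3, num_experts=4):
        super().__init__()
        # K expert kernels with identical shape.
        self.weight = nn.Parameter(torch.randn(num_experts, out_ch, in_ch, k, k) * 0.01)
        # Lightweight router: global average pooling -> linear layer -> softmax coefficients.
        self.router = nn.Sequential(nn.Linear(in_ch, num_experts), nn.Softmax(dim=-1))
        self.k = k

    def forward(self, x):
        b, c, _, _ = x.shape
        alpha = self.router(x.mean(dim=(2, 3)))                    # (B, K) per-sample coefficients
        # Weighted sum of expert kernels (equivalent, by linearity, to summing expert outputs).
        w = torch.einsum("bk,koihw->boihw", alpha, self.weight)    # (B, O, I, k, k)
        w = w.reshape(-1, c, self.k, self.k)                       # (B*O, I, k, k)
        x = x.reshape(1, b * c, *x.shape[2:])                      # grouped-conv trick: one kernel per sample
        y = F.conv2d(x, w, padding=self.k // 2, groups=b)
        return y.reshape(b, -1, *y.shape[2:])                      # (B, O, H, W)
```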
DynamicConv offers several advantages. First, by employing shared and dynamically combined convolution kernels, this mechanism allows for a significant increase in model parameters while incurring minimal additional computational costs. Second, it enables the model to better adapt to diverse input features, thereby enhancing its generalization capability. Third, it permits the deployment of more complex network structures in resource-constrained environments without significantly increasing computational demands. The structural principle of DynamicConv is illustrated in Figure 3.
We integrated DynamicConv into the C2f module of the original YOLOv8n-p2 model. The C2f module is essentially a convolution block that extracts fundamental features from the input image through an initial convolution block and further refines and enhances these features using multiple Bottleneck blocks. These Bottleneck blocks are capable of capturing more complex patterns and details. A concatenation block is then used to fuse the directly transmitted feature maps and the processed feature maps, enabling the model to integrate multiscale and multilevel information comprehensively. Finally, a convolution block generates the final feature map, providing rich feature representations for subsequent detection and classification tasks. However, the original C2f module, when used for kidney bean object detection in complex agricultural scenes, requires more floating-point operations, which increases the computational burden without guaranteeing computational precision. To address this, we incorporated DynamicConv into the down-sampling module, integrating it with the C2f module to form the C2f_DynamicConv module. This new module captures more detailed kidney bean features, such as spatial positions and sizes, with minimal computational resources, enhancing the model's ability to perceive kidney bean characteristics and improving recognition accuracy.
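Under the same assumptions, a simplified sketch of how such a dynamic convolution could be slotted into a C2f-style block (reusing the DynamicConv2d sketch above) is given below; the split-bottleneck-concatenate layout follows the general C2f structure just described, and all class names are illustrative rather than the authors' code.

```python
# Hedged sketch of a C2f-style block whose bottleneck convolutions are dynamic.
import torch
import torch.nn as nn

class BottleneckDynamic(nn.Module):
    def __init__(self, ch):
        super().__init__()
        self.cv1 = DynamicConv2d(ch, ch, k=3)     # dynamic kernel instead of a fixed 3x3 conv
        self.cv2 = DynamicConv2d(ch, ch, k=3)

    def forward(self, x):
        return x + self.cv2(self.cv1(x))          # residual connection, as in the original bottleneck

class C2f_DynamicConv(nn.Module):
    def __init__(self, in_ch, out_ch, n=2):
        super().__init__()
        self.hidden = out_ch // 2
        self.cv1 = nn.Conv2d(in_ch, out_ch, 1)
        self.blocks = nn.ModuleList(BottleneckDynamic(self.hidden) for _ in range(n))
        self.cv2 = nn.Conv2d(out_ch + n * self.hidden, out_ch, 1)

    def forward(self, x):
        a, b = self.cv1(x).chunk(2, dim=1)        # split, as in the standard C2f module
        ys = [a, b]
        for block in self.blocks:
            ys.append(block(ys[-1]))              # each bottleneck refines the previous output
        return self.cv2(torch.cat(ys, dim=1))     # concatenate and fuse multi-level features
```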
2.2.3 Self-attention mechanism
We added the DAT attention mechanism to the original model (Xia et al., 2022). DAT is a vision transformer that introduces a deformable attention mechanism. Traditional transformers employ standard self-attention mechanisms that process all pixels in an image, leading to high computational costs. DAT introduces the deformable attention mechanism, which focuses only on a small portion of key regions in the image. This approach significantly reduces computational costs while maintaining good performance. Within the DAT, sampling points are dynamically selected rather than the entire image being processed. This dynamic selection mechanism enables the model to concentrate on the regions that are most important for the current task. This can enhance global feature interactions, reduce information diffusion, and emphasize the representation of important information. The structure principle of the DAT mechanism is illustrated in Figure 4.
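The deformable sampling step at the heart of this mechanism can be sketched as follows. This is a strongly simplified, assumption-based illustration of the idea in Xia et al. (2022): a small offset network shifts a coarse reference grid, and keys/values are sampled only at the shifted points before standard attention is computed; it is not the full DAT block.

```python
# Hedged sketch of deformable key/value sampling (illustrative simplification of DAT).
import torch
import torch.nn as nn
import torch.nn.functional as F

class DeformableSampling(nn.Module):
    def __init__(self, dim, n_points=7):
        super().__init__()
        self.n = n_points                                          # n_points x n_points sampled keys
        self.offset_net = nn.Conv2d(dim, 2, kernel_size=5, padding=2)  # predicts (dx, dy) per location

    def forward(self, x):                                          # x: (B, C, H, W)
        b = x.shape[0]
        # Uniform reference grid in normalized coordinates [-1, 1].
        ys = torch.linspace(-1, 1, self.n, device=x.device)
        xs = torch.linspace(-1, 1, self.n, device=x.device)
        ref = torch.stack(torch.meshgrid(ys, xs, indexing="ij"), dim=-1)   # (n, n, 2) as (y, x)
        ref = ref.flip(-1).unsqueeze(0).expand(b, -1, -1, -1)              # (B, n, n, 2) as (x, y)
        # Predict offsets over the feature map, then pool them onto the coarse reference grid.
        offsets = torch.tanh(self.offset_net(x))                           # (B, 2, H, W)
        offsets = F.adaptive_avg_pool2d(offsets, self.n).permute(0, 2, 3, 1)  # (B, n, n, 2)
        # Sample keys/values only at the deformed points.
        sampled = F.grid_sample(x, ref + offsets, align_corners=True)      # (B, C, n, n)
        return sampled   # attention is then computed between all queries and these n*n sampled keys
```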
2.2.4 Feature fusion detection head
Feature pyramid network (FPN) methods are commonly used to address scale variations in object detection tasks (Ghiasi et al., 2019). However, in agricultural scenes—particularly in open-field trellis cultivation environments—a single image contains diverse features, which limits the functionality of single-scale detectors based on FPN due to restrictions involving feature information. This study replaces the original feature fusion approach based on FPN with an ASFF method, enhancing detection performance. ASFF introduces an adaptive spatial feature fusion mechanism that effectively filters out irrelevant information, thereby improving scale invariance. Additionally, an improved ASFF specifically tailored for kidney bean object detection was implemented. This incorporated a fourth detection head for small target detection, forming the ASFF detection head. ASFF extracted feature maps from different stages of the backbone, denoted as P3, P4, and P5, which were responsible for detecting small, medium, and large objects, respectively. By incorporating learnable weights, it enabled each detection head to adaptively and selectively integrate feature information from other scales while mitigating inconsistencies such as semantic gaps, scale mismatches, and spatial misalignments across scales, thereby enhancing the robustness of multi-scale object detection. On this basis, an earlier and higher-resolution feature map was extracted from the backbone to construct a new detection head, P2, specifically designed for detecting extremely small objects. This P2 head retained richer details and higher spatial resolution. For the P2 detection head, its input features were derived from the same-level P2, the upper-level P3, as well as higher-level P4 and P5 (Huang et al., 2025). The structure principle of the improved ASFF mechanism is illustrated in Figure 5.
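The adaptive fusion performed for one detection level can be sketched as follows: features from the other scales are resized to that level's resolution and mixed with per-pixel softmax weights. This is a simplified illustration under the assumption that the inputs are already channel-aligned; layer names and the omission of alignment convolutions are simplifications, not the exact ASFF implementation.

```python
# Hedged sketch of adaptive spatial feature fusion (ASFF) for a single detection level.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ASFFLevel(nn.Module):
    def __init__(self, channels, num_levels=4):
        super().__init__()
        # One 1x1 conv per input level, producing a single-channel weight map.
        self.weight_convs = nn.ModuleList(
            nn.Conv2d(channels, 1, kernel_size=1) for _ in range(num_levels)
        )

    def forward(self, feats, target_size):
        # feats: list of (B, C, Hi, Wi) maps from P2/P3/P4/P5 (assumed channel-aligned).
        resized = [F.interpolate(f, size=target_size, mode="nearest") for f in feats]
        logits = torch.cat([conv(f) for conv, f in zip(self.weight_convs, resized)], dim=1)
        weights = torch.softmax(logits, dim=1)            # (B, L, H, W); weights sum to 1 per pixel
        fused = sum(weights[:, i : i + 1] * resized[i] for i in range(len(resized)))
        return fused                                      # fed to this level's detection head
```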
2.2.5 Loss function Focaler-SIoU
In the natural environment, kidney bean fruit detection faces various challenges due to branch and fruit overlaps and different levels of occlusion. To address these challenges, we adopted the Focaler-IoU strategy and implemented a novel loss function module, Focaler-SIoU, to improve detection performance (Song et al., 2025). The Focaler-IoU algorithm focuses on different regression samples to enhance detector performance across various detection tasks. Through linear interval mapping, the Intersection over Union (IoU) loss is reconstructed to pay attention to different samples. This approach considers the distribution effects of difficult samples and simple samples on bounding box regression, a factor that is often overlooked in traditional IoU loss functions. The Focaler-IoU algorithm compensates for the limitations of existing bounding box regression methods through its unique approach, further improving detection performance across different tasks (Zhang and Zhang, 2024). Equation 2 defines Focaler-IoU, which adjusted the loss based on the value of the IoU, with the fundamental principle of IoU illustrated in Figure 6. When the IoU was less than a lower threshold d, the reconstructed IoU was mapped to 0; when the IoU exceeded an upper threshold u, it was mapped to 1; and when the IoU fell between d and u, it increased linearly with the IoU value. This design allowed the loss function to remain sensitive to IoU values within a specific interval, thereby enabling a greater focus on samples where the predicted bounding box exhibited moderate overlap with the ground truth bounding box, i.e., samples that were neither excessively difficult nor overly easy. This facilitated improved learning of feature representations from these moderate samples rather than concentrating solely on the easiest or most challenging cases.
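The linear interval mapping described above can be written as a one-line function, shown below; the threshold values d and u are placeholder assumptions, since the exact values used in this study are not given here.

```python
# Hedged sketch of the Focaler-IoU interval mapping: IoU below d maps to 0, above u
# maps to 1, and values in [d, u] scale linearly (d and u are placeholder thresholds).
import torch

def focaler_iou(iou: torch.Tensor, d: float = 0.0, u: float = 0.95) -> torch.Tensor:
    return ((iou - d) / (u - d)).clamp(min=0.0, max=1.0)

# The associated regression loss is 1 - focaler_iou(iou); combining this mapping with
# the SIoU loss yields the Focaler-SIoU loss described in the following paragraphs.
```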
The SIoU loss function incorporates considerations of angle and scale sensitivity, providing a more refined bounding box regression method. This approach addresses the inadequacy of traditional loss functions in terms of adaptability in complex agricultural scene detection tasks, particularly in scenarios requiring multi-target small-object recognition. The SIoU loss function includes four components: angle loss, distance loss, shape loss, and IoU loss (Gevorgyan, 2022). By integrating these aspects, the method achieves improved training efficiency and prediction accuracy. The SIoU loss function is suitable for scenarios requiring precise bounding box alignment and is particularly applicable to the detection of kidney beans, which are relatively elongated objects. Equation 3 presents the detection principle based on SIoU.
The Focaler-SIoU loss function was developed by balancing attention across diverse sample features and is particularly suitable for images captured from video. The loss function effectively addresses target detection for kidney beans, which are relatively thin objects, as well as the resolution variations inherent in video frames.
2.3 Test platform setup
During the experimental process, we used servers for model training and optimization. The setup used Ubuntu 22.04 as the operating system, an NVIDIA RTX 4090D (24 GB) GPU, a 16-vCPU Intel(R) Xeon(R) Platinum 8481C CPU, and 80 GB of system memory. Python 3.12 was used as the programming language and PyTorch 2.3.0 as the deep learning framework.
2.4 Evaluation index
We adopted the following evaluation metrics: Precision (P), Recall (R), F1 score (F1), and average precision (AP). The AP value was calculated based on the IoU threshold of 0.5. The calculation methods for these metrics are detailed in Equations 4–7.
where P represents the percentage of correctly identified kidney beans out of the total identified beans; R reflects the model’s detection ability for positive samples, representing the percentage of correctly identified kidney beans out of all kidney beans in the dataset; the F1 score reflects the balance between precision and recall, as well as the model’s ability to accurately identify most true samples while maintaining a low false positive rate; and AP represents the average precision across all detection results.
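For reference, the standard forms of these metrics, consistent with the definitions above, can be written as follows; this is a reconstruction rather than a reproduction of the original Equations 4-7, with TP, FP, and FN denoting true positives, false positives, and false negatives.

```latex
P  = \frac{TP}{TP + FP}, \qquad
R  = \frac{TP}{TP + FN}, \qquad
F1 = \frac{2PR}{P + R}, \qquad
AP = \int_{0}^{1} P(R)\,\mathrm{d}R
```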
2.5 Experimental methods
To ensure sufficient training for the model, the experimental setup strictly followed a 7:2:1 ratio to partition the kidney bean dataset into training (2,818 images), validation (804 images), and test sets (404 images). This ensures the model can efficiently learn complex agricultural scenarios, particularly those involving open-field trellis cultivation environments, to capture the core features of kidney beans effectively.
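A minimal sketch of such a 7:2:1 partition is shown below; the directory layout, file names, and random seed are assumptions for illustration only.

```python
# Hedged sketch of a reproducible 7:2:1 train/val/test split of the image list.
import random
from pathlib import Path

random.seed(0)
images = sorted(Path("kidney_bean_dataset/images").glob("*.jpg"))  # hypothetical path
random.shuffle(images)

n = len(images)
n_train, n_val = int(0.7 * n), int(0.2 * n)
splits = {
    "train": images[:n_train],
    "val": images[n_train : n_train + n_val],
    "test": images[n_train + n_val :],
}
for name, files in splits.items():
    Path(f"{name}.txt").write_text("\n".join(str(f) for f in files))  # one image path per line
```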
The YOLO series includes different models developed for various application scenarios, each emphasizing different aspects such as model size, accuracy, and robustness, and each with corresponding trade-offs among these properties. In agricultural applications, there is a growing emphasis on deploying lightweight detection models that achieve real-time processing while maintaining high detection accuracy. Common lightweight models in the YOLO series, such as YOLOv5n, YOLOv6n, YOLOv8n, and YOLOv10n, each exhibit distinct advantages in terms of resource consumption and computational efficiency, and they perform well in multi-target detection tasks, albeit with certain limitations in accuracy and real-world performance (Terven et al., 2023). Therefore, this study conducted experiments using YOLOv5n, YOLOv5n-p2, YOLOv5s, YOLOv6n, YOLOv10n, YOLOv10n-p2, and YOLOv9t on the self-built kidney bean dataset, comparing their performance with that of KidneyB-YOLO in two stages to validate the superior performance of KidneyB-YOLO in object detection tasks in agricultural environments.
To validate the performance differences of KidneyB-YOLO in actual scenarios, this study conducted experiments in which all models were trained from their initial states without loading pretrained weights from official releases. Although this approach required more training time and computational resources, it ensured the fairness, accuracy, and reliability of the experimental results, and the training process realistically and objectively reflected how different models performed in kidney bean detection under open-field trellis cultivation scenarios. During training, input images were uniformly scaled to 640×640 pixels, the batch size was set to 20, and the number of training epochs was set to 300 to ensure adequate model convergence. The final evaluation of model performance primarily utilized the AP metric with an IoU threshold of 0.50. The experimental results obtained were the basis for evaluating the performance of each model in terms of kidney bean detection in open-field trellis cultivation scenarios.
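For illustration, a training run with this configuration could be launched as sketched below using the Ultralytics API; the model and dataset YAML file names are assumptions standing in for the actual configuration files used in this study.

```python
# Hedged sketch of the training configuration described above (Ultralytics API).
from ultralytics import YOLO

model = YOLO("kidneyb-yolo.yaml")      # hypothetical YAML for the modified YOLOv8n-p2 architecture
results = model.train(
    data="kidney_bean.yaml",           # hypothetical dataset YAML pointing at the 7:2:1 split
    imgsz=640,                         # inputs uniformly scaled to 640x640
    batch=20,
    epochs=300,
    pretrained=False,                  # all models trained from scratch for a fair comparison
)
```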
3 Results and discussion
3.1 Experiments with different detection methods
To evaluate the performance of the KidneyB-YOLO model in detecting kidney beans, experiments were conducted using a variety of YOLO models, including YOLOv5n, YOLOv5n-p2, YOLOv5s, YOLOv6n, YOLOv8n-p2, YOLOv9s, YOLOv9t, YOLOv10n and YOLOv10n-p2, as demonstrated in Figure 7. Under scenarios of direct sunlight, inverse sunlight, and Gaussian blur, the KidneyB-YOLO model demonstrated high detection confidence without false negatives or false positives, thereby validating the improved model’s reliability and practicality in real-world scenarios.
Figure 7. Performance comparison of different detection models on the kidney bean test set (Evaluation metric: mAP@0.5, Input size: 640×640) (A) KidneyB-YOLO, (B) YOLOv5n, (C) YOLOv5n-p2, (D) YOLOv5s, (E) YOLOv6n, (F) YOLOv9t, (G) YOLOv9s, (H) YOLOv10n, (I) YOLOv10n-p2, (J) YOLOv8n-p2, (K) Comparison of different models, (L) Original model, (M) Improved model.
The KidneyB-YOLO detection model possesses a file size of 12 MB and demonstrates superior AP metrics compared to the other models. Table 1 demonstrates that, in terms of average precision, KidneyB-YOLO outperforms the YOLOv5n, YOLOv5n-p2, YOLOv6n, YOLOv9t, YOLOv10n and YOLOv10n-p2 detection models by approximately 6.30%, 6.80%, 7.80%, 4.40%, and 2.70%, respectively. Specifically, in terms of detection accuracy for kidney bean fruits, we selected three models—YOLOv5s, YOLOv9s, and RT-DETR-X—as baseline models for comparison. The file sizes of these models were 17.70 MB, 14.50 MB, and 129.10 MB, respectively, all larger than that of the proposed KidneyB-YOLO. The size of RT-DETR-X even exceeded that of KidneyB-YOLO by more than tenfold. Among them, RT-DETR-X is an end-to-end object detector based on the Transformer architecture.
Although the YOLOv5s model achieved a notable AP of 85.80%, its parameter count increased by 211.93% compared to the baseline model YOLOv8n-P2, and its model size expanded by 316.07%. This indicates that YOLOv5s sacrifices lightweight structural efficiency for enhanced detection accuracy, yet its AP remains lower than that of KidneyB-YOLO. Similarly, the YOLOv9s model achieved an AP of 85.20%, with a size increase of 2.50 MB and a parameter count that expanded to 118.31% of the original, while its AP was 0.70% lower than that of KidneyB-YOLO. The RT-DETR-X model achieved the highest detection accuracy, 87.3% AP, exceeding our model by approximately 1.4%. However, this came at the cost of substantial computational overhead: its more-than-tenfold increase in model size yielded only a 1.4% improvement in accuracy, its parameter count was nearly 10 times that of our model, its computational load (GFLOPs) was approximately 20 times greater, and its inference speed (FPS) was only 12.5% of ours, which we considered an unacceptable trade-off.
Compared to the similarly P2-layer-enhanced YOLOv10n-P2 model, our model achieved a 2.7% higher mAP while reducing computational load by 31% and more than doubling the frames per second (FPS). This validated the effectiveness and efficiency of our improvement strategy.
Our model required 12.4 GFLOPs, which was higher than the most basic YOLOv5n (7.1 GFLOPs) but substantially lower than the performance-comparable YOLOv5s (23.8 GFLOPs). This indicated that the accuracy improvement achieved through introducing modules such as dynamic convolution and DAT came at a well-controlled and efficient computational cost.
The inference speed of our model reached 32.5 FPS, fully meeting real-time processing requirements (typically considered achievable above 30 FPS). Although this speed was lower than certain extremely lightweight models like YOLOv5n (58.3 FPS), the 6.3% improvement in accuracy meant this trade-off was fully justified. With a compact parameter size of only 12.0 MB, the model was particularly suitable for deployment on embedded devices and mobile robotic platforms with constrained computational and storage resources.
While the proposed model exhibits a slight increase in file size and parameter count relative to YOLOv8n-p2, the 12 MB size and approximately 6 million parameters fall within acceptable limits. By slightly increasing the model size and computational resource use, KidneyB-YOLO achieves superior detection precision, making it a viable and practical choice for agricultural applications. For application scenarios such as agricultural harvesting robots that require real-time operation on embedded platforms, a 12 MB model running at 32.5 FPS offers far more practical value than computationally intensive and slow models. This trade-off underscores the effectiveness of the proposed model in balancing lightweight design with improved performance compared to conventional models.
3.2 The C2f_DynamicConv module’s effect
To validate the impact of the C2f_DynamicConv module on the performance of the YOLOv8n-p2 model, the DynamicConv module was integrated into the main backbone and neck modules of the YOLOv8n-p2 model by replacing the corresponding C2f modules. Three substitution patterns were implemented: (1) directly modifying the convolution blocks with the DynamicConv module, (2) integrating the DynamicConv module with the C2f module to form the C2f_DynamicConv module and replacing the main backbone with this module, and (3) replacing all the C2f modules in both the main backbone and neck with the C2f_DynamicConv module. The models were designated as YOLOv8n-p2-d1, YOLOv8n-p2-d2, and YOLOv8n-p2-d3, representing the first, second, and third substitution approaches, respectively. The detection performance of these models (which have the C2f_DynamicConv module incorporated at various positions) is illustrated in Table 2.
Table 2. Performance impact of integrating the C2f_DynamicConv module at different network locations.
For the YOLOv8n-p2-d3 model, the integration resulted in a 1.8% improvement in F1 score and a 2.7% improvement in average precision compared to the original YOLOv8n-p2 model. Additionally, the model remained no larger than 12 MB, less than double the size of the original model. These findings indicate that the C2f_DynamicConv module effectively enhances feature extraction, thereby improving detection accuracy and efficiency. However, the YOLOv8n-p2-d2 model, which achieved an average precision of 85.20% with a recall rate of 79.90%, showed limited improvement over the original model, with only a 2.60% enhancement in average precision and a 30.00% decline in F1 score compared to YOLOv8n-p2-d3. This suggests that replacing both the main backbone and neck networks with the C2f_DynamicConv module did not yield optimal results in terms of feature integration and detail extraction. On the other hand, the optimized YOLOv8n-p2-d1 model, which incorporated the C2f module with a focus on minimizing modifications, demonstrated enhanced performance with minimal changes to the model. Although this dynamic convolution model improved detection speed, it resulted in a significant increase in model size without a substantial improvement in average precision. The performance differences between the various models are illustrated in Figure 7, which provides a comparative analysis of the AP values across different models (K), with particular emphasis on the performance gap between the KidneyB-YOLO model (M) and the YOLOv8n-p2 model (L). Considering the balance between precision and compactness, the integration of the C2f_DynamicConv module into the main backbone of the YOLOv8n-p2 model not only enhances accuracy but also improves inference speed and efficiency.
3.3 Ablation experiment
Ablation experiments focus on the evaluation of different strategies to enhance target detection accuracy and speed, with the goal of optimizing the model and improving its performance in target detection (Karim et al., 2024). In this study, beginning with the YOLOv8n-p2 framework, the DynamicConv module was introduced to construct the C2f_DynamicConv module, replacing the C2f module in the main backbone. Additionally, an attention mechanism, referred to as the DAT module, was incorporated, and modifications were made in the neck network. Simultaneously, the loss function was enhanced by replacing CIoU with Focaler-SIoU to validate the effectiveness of the proposed modifications. An improved feature fusion detection head, ASFF, was employed to address complex agricultural background scenarios for small target detection. To ensure the statistical reliability of the experimental results, each model configuration underwent three independent repeated training runs with different random seeds while maintaining identical training-validation dataset splits. The final performance metrics were calculated as the arithmetic mean of the three experimental results, supplemented by standard deviation calculations to evaluate model stability. To further quantify the statistical significance of performance differences, we performed a paired t-test comparing the final KidneyB-YOLO model with the baseline model (YOLOv8n-p2). The results demonstrated that the performance improvement was highly statistically significant (p < 0.001), confirming that the observed enhancements were not caused by random fluctuations. Through this series of systematic ablation studies, we conducted an in-depth analysis of the performance contributions from each improvement module and performed comprehensive comparisons with the baseline network. Table 3 demonstrates that all key performance indicators exhibited standard deviations within 0.15%. The results demonstrated the effectiveness of the improved modules and their functional contributions. A comprehensive analysis of the ablation results is presented in Table 3.
In the context of improving target detection accuracy in kidney bean detection, this research implements multiple improvement strategies aimed at enhancing the precision of small target detection for kidney beans. By employing the DAT attention mechanism, the model focuses on small regions of interest containing pertinent information while ignoring complex background interference. With a memory increase of 0.6 MB, the AP improves by 0.1%. Although there is a slight decline in P, R, and F1, the overall performance shows improvement when combined with other modules. The incorporation of the DAT module resulted in an expansion of the model size, with only marginal improvements in accuracy. However, the integration of the DAT module enhanced the model's autonomous ability to prioritize the most critical information, particularly in the context of kidney bean target recognition. This approach effectively addressed challenges such as significant scale variations between extremely large and small targets within the same image, clusters of multiple fruits, complex backgrounds, and the lack of prominent distinguishing features in kidney bean fruits. The introduced DAT module implemented a feature filtering mechanism within our model, which acquired a broader effective receptive field by predicting offset points. Replacing the C2f module in the main backbone with the C2f_DynamicConv module resulted in a 0.9% increase in AP, but with slight decreases in P, R, and F1. This study further validates the combined effect of the C2f_DynamicConv module and the DAT attention mechanism, demonstrating a significant enhancement in AP, R, and F1, which increase by 2.0, 1.6, and 0.31 percentage points, respectively. Additionally, the incorporation of the improved ASFF head for feature fusion significantly improves detection precision, with the AP and R increasing by over 2% each. Finally, our research integrates the Focaler-IoU strategy with the SIoU loss function to form Focaler-SIoU, resulting in a slight improvement in AP without increasing model size. We also verified the performance of the KidneyB-YOLO model when the Focaler-SIoU loss function was removed and replaced with the original loss function, which demonstrated a notable improvement over the original model. The final performance of the KidneyB-YOLO model was an AP of 85.90%, a P of 85.20%, an R of 79.90%, and an F1 of 82.46%.
3.4 Validation of detection performance in complex agricultural backgrounds
To comprehensively evaluate the adaptability and generalization capabilities of the KidneyB-YOLO model in complex agricultural environments, this study captured images of kidney beans under various challenging conditions, including variations in light, background interference, leaf occlusion, individual occlusion of kidney beans, and size differences among fruits. These images were used for model detection comparisons. The results indicate that the improved model exhibits better performance when occlusion is severe; the other models display poor performance in this scenario. When multiple target detection is conducted, the improved model can detect small targets that the other models fail to identify; additionally, the improved model achieves high confidence in terms of average precision. Figure 8 demonstrates that under conditions such as fruit occlusion (A), multiple fruit overlaps (B), and various complex background interferences (C) – (E), our model maintains a high average precision, with the detection accuracy reaching 91%. This demonstrates the detection improvement. The KidneyB-YOLO model exhibits strong robustness and generalization capabilities.
Figure 8. Detection results for the KidneyB-YOLO model in complex agricultural scenarios. (A) Multiple occlusions, (B) Fruit overlap, (C-E) Complex background.
4 Discussion
This study developed the KidneyB-YOLO model, which could make a significant contribution to the identification and quantification of kidney beans. The model employs improvement approaches that can adaptively handle images with varying resolutions, complex backgrounds, and different formats, including the DynamicConv module, a DAT, and the Focaler-SIoU loss function, which is tailored for different sample features. Although these modifications slightly increase the model’s parameters and dimensions, they significantly improve the precision of kidney bean detection in open-field trellis cultivation scenarios. Kidney beans cultivated under open-air trellises exhibited substantial variations in size, orientation, and occlusion levels. Convolutional kernels with fixed parameters demonstrated limited generalization capability. DynamicConv module enabled the model to adaptively adjust its parameters based on input images, allowing it to generate more discriminative features for diverse kidney bean targets. This approach enhanced the model’s robustness to scene variations. To address the challenges posed by significant scale and pose variations of targets as well as complex backgrounds, dynamic convolution was introduced. Under open-air cultivation conditions, kidney beans and leaves exhibited similar colors, and the beans themselves had slender, elongated shapes. Traditional attention mechanisms were often distracted by large areas of homogeneous background. The deformable attention mechanism in DAT adaptively concentrated computational resources on key regions in the image. For slender bean pods, it learned to “project” attention onto the target contours, effectively extracting features from complex backgrounds and reducing missed detections. We introduced the DAT mechanism to address the issues of color similarity between targets and background as well as the elongated structure of the targets. Kidney beans represent typical small objects, whose characteristic features tend to be lost in deeper network layers. Direct fusion of multi-level features often resulted in semantic conflicts. We adopted ASFF to adaptively filter out conflicting information within the feature pyramid and incorporated an additional detection layer specifically designed for small targets. This design ensured that fine-grained features of kidney beans extracted from feature maps could be harmoniously integrated and utilized for final prediction, directly addressing the morphological characteristics of kidney beans. An improved ASFF module along with a dedicated detection layer was designed to tackle the challenges of small object detection and feature conflict. The dataset contained a substantial number of samples with varying degrees of occlusion and scale differences caused by varying shooting distances, which accurately reflected real-world acquisition conditions. Focaler-SIoU integrated the direction-aware property of SIoU with the hard sample focusing mechanism of Focaler-IoU, guiding the model to prioritize the optimization of hard-to-recognize kidney bean samples during training, thereby improving overall generalization performance. The Focaler-SIoU loss function was introduced to address the issues of non-uniform sample resolutions and the imbalanced distribution of easy and hard samples. Our objective was to elucidate the synergistic mechanisms of these modules in addressing specific agricultural vision tasks. Dynamic convolution provided a rich set of adaptive features. 
DAT subsequently performed a "refinement" process on these features, accurately identifying and selecting the key characteristics of kidney beans. ASFF, combined with the dedicated detection layer, ensured that these refined features were fused without degradation and utilized for small target prediction. Focaler-SIoU guided the entire system to focus on hard examples. Unlike much of the existing literature, our approach aimed to achieve a significant improvement in precision without substantially increasing computational load, highlighting the model's robustness and adaptability (Shafiee et al., 2017; Shao et al., 2022; Zhai et al., 2025; Zhao et al., 2025). Previous studies emphasize the importance of lightweight model structures, but reducing computational complexity and parameter count often yields limited improvement in detection accuracy. The integration of the DAT mechanism markedly enhanced the model's ability to suppress complex background interference and improved the discriminative power of feature extraction. It is worth emphasizing that adding the DAT module alone provided limited direct gains in accuracy and introduced a certain increase in parameters, underscoring that the core of deep learning model architecture lies in the synergy among modules and systemic balance.
The essential function of the DAT module is that of a feature filter. When the raw features extracted by the backbone network were of low quality, the DAT module was unable to accurately identify salient image information, potentially misallocating attention to background noise or less relevant regions. However, the role of the incorporated DAT module extended far beyond the current detection task: it significantly strengthened the model’s capacity for multi-scale feature and spatial contextual modeling. This enhancement was intended to facilitate the future evolution of the model from bounding-box-level to pixel-level prediction, thereby improving contextual understanding and multi-scale feature integration capabilities to capture richer spatial information.
Acting as a smooth transitional bridge, the DAT module adaptively focused on detail-rich key points and integrated global contextual information to refine local features. By dynamically generating sampling points with varying distributions and densities for objects of different scales, the module enabled the backbone network—already equipped with multi-scale perception—to seamlessly adapt to dense prediction tasks such as semantic segmentation without major structural modifications. This capability allowed the model to effectively handle object regions of vastly different sizes in segmentation tasks, thereby eliminating the need to redesign or retrain a dedicated multi-scale segmentation module. This design reflects the foresight of our approach in terms of structural scalability.
Our work did not aim to invent fundamentally new algorithms, but rather constituted a “problem-driven algorithmic adaptation and integration innovation.” In the applied research context, we addressed a highly complex and underexplored challenge—detecting kidney beans under open-air trellises—by intelligently selecting, refining, and synergistically integrating state-of-the-art components to construct an efficient and effective system that surpassed baseline models.
Regarding the use of Dynamic Convolution, unlike its conventional application, our ablation studies revealed that placing it in the neck network yielded the most significant accuracy improvement. This was because the neck handles multi-scale feature fusion, and introducing dynamic adaptability at this stage enabled the model to more effectively integrate kidney bean features across different scales—particularly small-scale characteristics—representing a design decision tailored to the multi-scale nature of kidney beans.
The DAT mechanism served as a precise feature extractor for elongated small targets. We innovatively leveraged its deformable attention to address the distinctive elongated morphology of kidney beans. By learning offset values, the attention windows could distribute along the pods’ orientation, providing a more targeted approach than conventional rectangular attention mechanisms.
In constructing the synergistic architecture of "ASFF plus a dedicated small-object detection layer," we utilized ASFF to resolve feature conflicts while adding a specialized detection layer for small targets. This design directly addressed the small size of kidney beans, complementing ASFF to ensure the preservation and effective utilization of subtle features—an innovation oriented toward the specific problem.
We built a coordinated pipeline design encompassing DynamicConv, DAT, ASFF, and Focaler-SIoU. This constituted the key distinction between our model and YOLO variants that merely combine multiple modules. We thoroughly validated that incorporating this series of adaptive modules proved effective for detecting crops with specific morphological traits like kidney beans. This work offers a valuable improvement paradigm and technical reference for other analogous agricultural object detection tasks—such as detecting cucumbers and peppers—that face challenges including severe occlusion, color similarity, small target size, and variable sample resolutions.
In this study, the developed KidneyB-YOLO model demonstrated significant performance in kidney bean recognition and quantification; however, like any technical solution, it possesses certain limitations. These limitations highlight valuable directions for future research. The current work primarily focused on bounding-box-level object detection. In complex agricultural environments, occlusion—both by leaves and between fruits—is prevalent. The model's capability to detect severely occluded targets remains suboptimal, occasionally leading to missed detections or reduced localization accuracy for partially occluded objects. Future investigations could explore two avenues to mitigate occlusion-related issues. The first is introducing a multi-view image acquisition system: multi-view fusion strategies could construct a more comprehensive feature representation of objects, compensating for information loss in two-dimensional imagery through spatial information integration. The second is further optimizing the attention mechanism, building upon the existing DAT module, so that it focuses more precisely on the visible parts of targets and enhances the model's capacity for dynamic feature extraction of occluded objects. Although KidneyB-YOLO excelled in object detection tasks, the current model does not support instance segmentation. Despite attempts to construct an object recognition model with a DAT module, it did not achieve the requirement for providing pixel-level precise contour information necessary for instance segmentation. For future extension to instance segmentation tasks, we considered replacing the detection head with a lightweight segmentation head on the existing DAT-equipped backbone and introducing loss functions specifically optimized for segmentation tasks to form a new segmentation model. Since the backbone has been well-trained on detection tasks and possesses the robust capabilities of the DAT module, the model is expected to converge rapidly and start with a high-performance baseline.
In summary, based on the YOLOv8-P2 framework, the KidneyB-YOLO model demonstrates outstanding performance in identifying kidney bean fruit targets. Our experimental results validated the model's precision, providing a reliable solution for improving the accuracy and efficiency of agricultural operations. The findings of this study not only confirm the model's applicability in real-world settings but also provide a foundation for the development of similar models in the future. As the demand for high-performance intelligent agricultural equipment grows increasingly urgent (Hu et al., 2020; Liu et al., 2021), the need for innovative solutions such as KidneyB-YOLO will increase, pushing fruit target recognition toward more efficient and intelligent approaches.
5 Conclusions
This study successfully constructed a kidney bean object detection model, KidneyB-YOLO, for complex open-air trellis environments. By incorporating the C2f_DynamicConv module, the DAT attention mechanism, an improved ASFF method, and the Focaler-SIoU loss function into the YOLOv8n-P2 baseline, the model increased the mAP by 3.40 percentage points to 85.90% while maintaining a lightweight size of 12 MB and real-time performance of 32.5 FPS. KidneyB-YOLO demonstrated robust performance in complex agricultural environments, achieving efficient and accurate detection of kidney bean crops and good adaptability to unseen scenarios.
Ablation studies revealed a key finding: individual improvement modules (such as DynamicConv or DAT) provided limited performance gains, or even caused fluctuations in other metrics, when introduced separately; when they worked collaboratively, however, a substantial performance leap was achieved. The underlying mechanism is that the modules formed a problem-oriented, complementary enhancement loop. Specifically, DynamicConv enhanced the model's ability to extract basic features from multi-scale, multi-form kidney beans through its parameter adaptation capability, providing richer feature maps for subsequent processing. Building upon this, the DAT attention mechanism used its deformable attention field to focus precisely on key feature regions of slender kidney beans, effectively suppressing interference from complex backgrounds. The improved ASFF structure and the dedicated detection layer optimized the feature flow by resolving conflicts among multi-scale features through an adaptive fusion strategy and by ensuring the full utilization of small-object features, thereby consolidating the gains from the preceding modules. Finally, the Focaler-SIoU loss function performed global optimization by focusing on hard examples and refining bounding-box regression, guiding the entire model to concentrate its learning capacity on challenging cases such as occlusions and deformations. The performance improvement therefore did not result from a simple additive effect; it stemmed from the synergistic interaction of these modules across the complete pipeline of feature enhancement, feature selection, feature fusion, and loss optimization, which together constructed a fully adaptive detection system with strong robustness for complex agricultural scenarios.
Future research will focus on instance segmentation of kidney beans in complex agricultural environments. Furthermore, we will analyze the growth status of individual kidney bean plants in agricultural scenarios, including whether they are affected by diseases or pests, the type of disease or pest, and whether fertilization meets the needs of kidney beans at different growth stages, so as to enable precise adjustment of their growing environments. In addition, future work will aim to predict the growth trends of kidney beans, such as their shape and yield, to further support precise regulation of their growing conditions.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.
Author contributions
JQ: Investigation, Software, Validation, Visualization, Writing – review & editing. CL: Conceptualization, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. HC: Validation, Visualization, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the Basic Research Project of Higher Education Institutions under the Jurisdiction of Heilongjiang Province (2023GJ10).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Chen, J., Ma, A., Huang, L., Su, Y., Li, W., Zhang, H., et al. (2023a). GA-YOLO: A lightweight YOLO model for dense and occluded grape target detection. Horticulturae 9, 443. doi: 10.3390/horticulturae9040443
Chen, S., Zou, X., Zhou, X., Xiang, Y., and Wu, M. (2023b). Study on fusion clustering and improved YOLOv5 algorithm based on multiple occlusion of Camellia oleifera fruit. Comput. Electron. Agric. 206, 107706. doi: 10.1016/j.compag.2023.107706
Chen, X., Dong, G., Fan, X., Xu, Y., Zou, X., Zhou, J., et al. (2024). New plum detection in complex environments based on improved YOLOv8n. Agronomy 14, 2931. doi: 10.3390/agronomy14122931
Dong, G., Chen, J., Fan, X., Zhou, J., and Cui, C. (2024). Detecting Xinmei fruit under complex environments using improved YOLOv5s. Trans. Chin. Soc. Agric. Eng. 40, 118–125. doi: 10.11975/j.issn.1002-6819.202403060
Gao, C., He, B., Guo, W., Qu, Y., Wang, Q., and Dong, W. (2025). SCS-YOLO: A real-time detection model for agricultural diseases—A case study of wheat fusarium head blight. Comput. Electron. Agric. 238, 110794. doi: 10.1016/j.compag.2025.110794
Ge, G., Yang, J., Liu, Y., Hu, Y., and Liu, H. (2025). Detecting tomatoes in complex agricultural environments using improved YOLOv8n. Trans. Chin. Soc. Agric. Eng. 41, 143–153. doi: 10.11975/j.issn.1002-6819.202411147
Gevorgyan, Z. (2022). SIoU loss: More powerful learning for bounding box regression. arXiv preprint arXiv:2205.12740. doi: 10.48550/arXiv.2205.12740
Ghiasi, G., Lin, T.-Y., and Le, Q. V. (2019). “NAS-FPN: Learning scalable feature pyramid architecture for object detection,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 7036–7045.
Han, K., Wang, Y., Guo, J., and Wu, E. (2024). “ParameterNet: Parameters are all you need for large-scale visual pretraining of mobile networks,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 15751–15761.
Hu, X., Sun, L., Zhou, Y., and Ruan, J. (2020). Review of operational management in intelligent agriculture based on the Internet of Things. Front. Eng. Manage. 7, 309–322. doi: 10.1007/s42524-020-0107-3
Huang, B., Xie, Z., Homann, H., Fei, Z., Xiang, X., Zheng, Y., et al. (2025). Research on blueberry maturity detection based on receptive field attention convolution and adaptive spatial feature fusion. Appl. Sci. 15, 6356. doi: 10.3390/app15116356
Ilić, Z. S. and Fallik, E. (2017). Light quality manipulation improves vegetable quality at harvest and postharvest: A review. Environ. Exp. Bot. 139, 79–90. doi: 10.1016/j.envexpbot.2017.04.006
Jia, X., Hua, Z., Shi, H., Zhu, D., Han, Z., Wu, G., et al. (2025). A soybean pod accuracy detection and counting model based on improved YOLOv8. Agriculture 15, 617. doi: 10.3390/agriculture15060617
Jin, S., Zhou, L., and Zhou, H. (2025). CO-YOLO: A lightweight and efficient model for Camellia oleifera fruit object detection and posture determination. Comput. Electron. Agric. 235, 110394. doi: 10.1016/j.compag.2025.110394
Jiuxin, W., Yaoheng, S., Jiahui, Y., Man, L., Yurong, D., Xin, W., et al. (2023). Apple rapid recognition and processing method based on an improved version of YOLOv5. Ecol. Inform 77, 102196. doi: 10.1016/j.ecoinf.2023.102196
Karim, M. J., Nahiduzzaman, M., Ahsan, M., and Haider, J. (2024). Development of an early detection and automatic targeting system for cotton weeds using an improved lightweight YOLOv8 architecture on an edge device. Knowledge-Based Syst. 300, 112204. doi: 10.1016/j.knosys.2024.112204
Kaukab, S., Ghodki, B. M., Ray, H., Kalnar, Y. B., Narsaiah, K., and Brar, J. S. (2024). Improving real-time apple fruit detection: Multi-modal data and depth fusion with non-targeted background removal. Ecol. Inf. 82, 102691. doi: 10.1016/j.ecoinf.2024.102691
Li, M., Ding, X., and Wang, J. (2025). CMD-YOLO: A lightweight model for cherry maturity detection targeting small object. Smart Agric. Technol., 101513. doi: 10.1016/j.atech.2025.101513
Li, C., Lin, J., Li, Z., Mai, C., Jiang, R., and Li, J. (2024a). An efficient detection method for litchi fruits in a natural environment based on improved YOLOv7-Litchi. Comput. Electron. Agric. 217, 108605. doi: 10.1016/j.compag.2023.108605
Li, H., Wu, A., Jiang, Z., Liu, F., and Luo, M. (2024b). “Improving object detection in YOLOv8n with the C2f-f module and multi-scale fusion reconstruction,” in 2024 IEEE 6th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC). 6, 374–379 (IEEE).
Li, S., Zhang, S., Xue, J., and Sun, H. (2022). Lightweight target detection for the field flat jujube based on improved YOLOv5. Comput. Electron. Agric. 202, 107391. doi: 10.1016/j.compag.2022.107391
Liang, X., Wei, Z., and Chen, K. (2025). A method for segmentation and localization of tomato lateral pruning points in complex environments based on improved YOLOV5. Comput. Electron. Agric. 229, 109731. doi: 10.1016/j.compag.2024.109731
Liu, Q., Yan, Q., Tian, J., and Yuan, K. (2021). “Key technologies and applications in intelligent agriculture,” in Journal of Physics: Conference Series (IOP Publishing) 1757, 012059.
Lv, L., Li, J., and Zhao, Y. (2025). DMP-YOLO: dense multi-scale perception for complex scenes YOLO algorithm prunus humilis small target detection. Smart Agric. Technol., 101461. doi: 10.1016/j.atech.2025.101461
Ma, N., Wu, Y., Bo, Y., and Yan, H. (2024). Chili pepper object detection method based on improved YOLOv8n. Plants 13, 2402. doi: 10.3390/plants13172402
Mavridou, E., Vrochidou, E., Papakostas, G. A., Pachidis, T., and Kaburlasos, V. G. (2019). Machine vision systems in precision agriculture for crop farming. J. Imaging 5, 89. doi: 10.3390/jimaging5120089
Ren, R., Sun, H., Yang, S., and Jing, J. (2025). Lightweight detection method for ‘Yuluxiang’ pear under different cultivation modes based on improved YOLOv8n. Trans. Chin. Soc. Agric. Eng. 41, 145–155. doi: 10.11975/j.issn.1002-6819.202410103
Shafiee, M. J., Chywl, B., Li, F., and Wong, A. (2017). Fast YOLO: A fast you only look once system for real-time embedded object detection in video. arXiv preprint arXiv:1709.05943. doi: 10.48550/arXiv.1709.05943
Shao, Y., Zhang, D., Chu, H., Zhang, X., and Rao, Y. (2022). A review of YOLO object detection based on deep learning. J. Electron. Inf. Technol. 44, 3697–3708. doi: 10.11999/JEIT210790
Song, G., Huang, W., Cui, S., Gao, X., Xue, P., and Wang, K. (2025). Identifying the small target disease of tomato leaves using lightweight improved YOLOv8n. Trans. Chin. Soc. Agric. Eng. 41, 232–242. doi: 10.11975/j.issn.1002-6819.202410098
Sun, X. (2024). Enhanced tomato detection in greenhouse environments: A lightweight model based on S-YOLO with high accuracy. Front. Plant Sci. 15, 1451018. doi: 10.3389/fpls.2024.1451018
Tang, Y., Qiu, J., Zhang, Y., Wu, D., Cao, Y., Zhao, K., et al. (2023). Optimization strategies of fruit detection to overcome the challenge of unstructured background in field orchard environment: A review. Precis. Agric. 24, 1183–1219. doi: 10.1007/s11119-023-10009-9
Tariq, M. U., Saqib, S. M., Mazhar, T., Khan, M. A., Shahzad, T., and Hamam, H. (2025). Edge-enabled Smart Agriculture Framework: Integrating IOT, Lightweight Deep Learning, and agentic AI for context-aware farming. Results Eng. 28, 107342. doi: 10.1016/j.rineng.2025.107342
Terven, J., Córdova-Esparza, D.-M., and Romero-González, J.-A. (2023). A comprehensive review of yolo architectures in computer vision: From yolov1 to yolov8 and yolo-nas. Mach. Learn. knowledge extraction 5, 1680–1716. doi: 10.3390/make5040083
Varghese, R. and Sambath, M. (2024). “Yolov8: A novel object detection algorithm with enhanced performance and robustness,” in 2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems (ADICS). 1–6 (IEEE).
Wang, H., Li, X., Ping, B., Liu, H., Liu, J., Jiao, L., et al. (2025a). Detecting apple appearance quality under natural scenes using improved YOLOv8n. Trans. Chin. Soc. Agric. Eng. 41, 173–182. doi: 10.11975/j.issn.1002-6819.202412195
Wang, H., Liu, J., Zhao, J., Zhang, J., and Zhao, D. (2025b). Precision and speed: LSOD-YOLO for lightweight small object detection. Expert Syst. Appl. 269, 126440. doi: 10.1016/j.eswa.2025.126440
Wang, J., Tan, D., Sui, L., Guo, J., and Wang, R. (2024). Wolfberry recognition and picking-point localization technology in natural environments based on improved Yolov8n-Pose-LBD. Comput. Electron. Agric. 227, 109551. doi: 10.1016/j.compag.2024.109551
Wang, H., Yun, L., Yang, C., Wu, M., Wang, Y., and Chen, Z. (2025c). Ow-yolo: An improved yolov8s lightweight detection method for obstructed walnuts. Agriculture 15, 159. doi: 10.3390/agriculture15020159
Weng, W., Lai, Z., Cui, Z., Chen, Z., Chen, H., Lin, T., et al. (2025). GCD-YOLO: A deep learning network for accurate tomato fruit stalks identification in unstructured environments. Smart Agric. Technol., 101465. doi: 10.1016/j.atech.2025.101465
Xia, Z., Pan, X., Song, S., Li, L. E., and Huang, G. (2022). “Vision transformer with deformable attention,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 4794–4803.
Xiao, F., Wang, H., Li, Y., Cao, Y., Lv, X., and Xu, G. (2023). Object detection and recognition techniques based on digital image processing and traditional machine learning for fruit and vegetable harvesting robots: An overview and review. Agronomy 13, 639. doi: 10.3390/agronomy13061625
Xie, X., Wang, Q., Jia, Z., Sun, Y., Qiu, C., Ning, Q., et al. (2025). Application of convolutional neural networks in the production and processing of flower and fruit tea. J. Future Foods, 2772–5669. doi: 10.1016/j.jfutfo.2025.06.003
Yu, C., Xue, Y., Zhang, L., An, X., Liu, C., and Chen, L. (2025). YOLO-MEST: a re-parameterized multi-scale fusion model with enhanced detection head for high-accuracy tea bud detection. Inf. Process. Agric. 2214–3173. doi: 10.1016/j.inpa.2025.10.001
Zhai, Y., Tian, J., Chen, P., Wang, J., Hu, D., and Xu, D. (2025). Algorithm for the target detection of phalaenopsis seedlings using lightweight YOLOv8n-Aerolite. Trans. Chin. Soc. Agric. Eng. 41, 220–229. doi: 10.11975/j.issn.1002-6819.202408192
Zhang, W. and Jiang, F. (2025). AHN-YOLO: A lightweight tomato detection method for dense small-sized features based on YOLO architecture. Horticulturae 11, 639. doi: 10.3390/horticulturae11060639
Zhang, H. and Zhang, S. (2024). Focaler-iou: More focused intersection over union loss. arXiv preprint arXiv:2401.10525. doi: 10.48550/arXiv.2401.10525
Zhao, L., Peng, Q., Lan, Y., Kang, J., Dai, J., and Chen, Y. (2025). CottonBud-YOLOv5s lightweight cotton bud detection algorithm. Trans. Chin. Soc. Agric. Eng. 41, 175–184. doi: 10.11975/j.issn.1002-6819.202407140
Zhong, Z., Yun, L., Cheng, F., Chen, Z., and Zhang, C. (2024). Light-YOLO: A lightweight and efficient YOLO-based deep learning model for mango detection. Agriculture 14, 140. doi: 10.3390/agriculture14010140
Keywords: precision agriculture, object detection, model comparison, agricultural scenarios, deep learning
Citation: Qi J, Liu C and Chu H (2025) Kidney bean detection in complex agricultural scenarios based on the kidneyB-YOLO model. Front. Plant Sci. 16:1672179. doi: 10.3389/fpls.2025.1672179
Received: 24 July 2025; Revised: 09 November 2025; Accepted: 21 November 2025;
Published: 12 December 2025.
Edited by:
Yang Li, Shihezi University, China
Reviewed by:
Weijun Xie, Nanjing Forestry University, China
Hao Chen, Xi’an University of Technology, China
Copyright © 2025 Qi, Liu and Chu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chunxiang Liu, liuchunxiang@hljit.edu.cn
Jianjia Qi