
ORIGINAL RESEARCH article

Front. Plant Sci., 27 November 2025

Sec. Technical Advances in Plant Science

Volume 16 - 2025 | https://doi.org/10.3389/fpls.2025.1711545

MDE-DETR: multi-domain enhanced feature fusion algorithm for bayberry detection and counting in complex orchards

  • 1School of Information Engineering, Huzhou University, Huzhou, China
  • 2Mechanical and Electrical Engineering College, Hainan University, Haikou, China

Introduction: Bayberry detection plays a crucial role in yield prediction. However, bayberry targets in complex orchard environments present significant detection challenges, including small volume, severe occlusion, and dense distribution, making traditional methods inadequate for practical applications.

Methods: This study proposes a Multi-Domain Enhanced DETR (MDE-DETR) detection algorithm based on multi-domain enhanced feature fusion. First, an Enhanced Feature Extraction Network (EFENet) backbone is constructed, which incorporates Multi-Path Feature Enhancement Module (MFEM) and reparameterized convolution techniques to enhance feature perception capabilities while reducing model parameters. Second, a Multi-Domain Feature Fusion Network (MDFFN) architecture is designed, integrating SPDConv spatial pixel rearrangement, Cross-Stage Multi-Kernel Block (CMKBlock), and dual-domain attention mechanisms to achieve multi-scale feature fusion and improve small target detection performance. Third, an Adaptive Deformable Sampling (ADSample) downsampling module is constructed, which dynamically adjusts sampling positions through learnable spatial offset prediction to enhance model robustness for occluded and dense targets.

Results and discussion: Experimental results demonstrate that on a self-constructed bayberry dataset, MDE-DETR achieves improvements of 3.8% and 5.1% in mAP50 and mAP50:95 respectively compared to the RT-DETR baseline model, reaching detection accuracies of 92.9% and 67.9%, while reducing parameters and memory usage by 25.76% and 25.14% respectively. Generalization experiments on VisDrone2019 (a small-target dataset) and TomatoPlantfactoryDataset (a dense-occlusion dataset) further validate the algorithm's effectiveness, providing an efficient and lightweight solution for small-target bayberry detection in complex environments.

1 Introduction

The agricultural sector faces unprecedented challenges in meeting the growing global food demand while maintaining sustainable production practices. Bayberry, a high-nutritional-value fruit rich in glucose, cellulose, mineral elements, and amino acids, holds significant economic potential in the global fruit market (Liu et al., 2020). However, manual counting of bayberry fruits in complex orchard environments remains a labor-intensive and time-consuming process, limiting the efficiency of yield estimation and automated harvesting systems. Developing intelligent detection algorithms capable of accurately identifying and counting small-target bayberries under challenging conditions such as varying illumination, leaf and branch occlusion, and dense fruit clustering has become increasingly important for advancing precision agriculture and supporting the digital transformation of orchard management practices (Koirala et al., 2019).

Traditional fruit detection methods in orchard environments primarily rely on handcrafted feature extraction methods and classical machine learning techniques. Early approaches employed color-based segmentation techniques, using clustering algorithms such as k-means, statistical feature extraction, and morphological operations to identify fruits in complex backgrounds (Bargoti and Underwood, 2017). However, traditional methods exhibit significant limitations, including high sensitivity to illumination changes, complex background interference, and the need for extensive manual feature engineering, making them insufficient to handle the inherent complexity and variability in real orchard environments (Bresilla et al., 2019).

The emergence of deep learning has revolutionized fruit detection applications, with Convolutional Neural Networks (CNNs) demonstrating superior performance compared to traditional methods. Modern deep learning frameworks have been successfully applied to various fruit detection tasks, achieving remarkable detection accuracy in complex agricultural environments. Mohanty et al. demonstrated the technical feasibility of smartphone-assisted crop disease diagnosis using deep convolutional neural networks, achieving 99.35% accuracy on a dataset of 54,306 images of 14 crop species and 26 diseases (Mohanty et al., 2016). Lawal et al. proposed an improved YOLOv5s model using feature concatenation with attention mechanism for real-time fruit detection, achieving 93.4% mAP with reduced computational costs and 74 fps detection speed (Lawal et al., 2023). Kirk et al. developed the Lab-Fruits detection system, combining bio-inspired color processing methods with single-stage deep learning networks to achieve fast and robust detection of soft fruits such as strawberries in real industrial environments (Kirk et al., 2020).

Given the maturity and effectiveness of modern object detection algorithms in handling complex visual recognition tasks, adopting object detection frameworks has become the preferred approach for fruit detection applications in recent years. Recently, numerous researchers have focused on improving YOLO algorithms (Redmon et al., 2016) to adapt to agricultural detection requirements. Lin et al. developed DPD-YOLO for dense pineapple fruit detection in complex environments, achieving 62.0% mAP@0.5 with 6.6% improvement over original YOLOv8 (Lin et al., 2025). Zhao et al. developed YOLO-Granada, a lightweight attention model specifically for pomegranate fruit detection, achieving a 17.3% speed improvement and significant model compression (Zhao et al., 2024). Wang et al. proposed the NVW-YOLOv8s network, which effectively addressed category imbalance and small target occlusion problems in tomato fruit detection at different maturity stages through foreground category balancing methods and improved detection segmentation algorithms (Wang et al., 2024).

Recent developments in Transformer-based detection models (Vaswani et al., 2017), particularly the introduction of RT-DETR (Lv et al., 2023), have demonstrated tremendous potential for end-to-end object detection without requiring post-processing steps such as Non-Maximum Suppression (NMS). Wang et al. proposed the HESP-DETR model, implementing pomegranate growth stage detection based on improved RT-DETR, significantly improving detection accuracy while maintaining real-time performance through multi-scale multi-head self-attention (MMSA) mechanisms (Wang et al., 2025). Wang et al. developed PDSI-RTDETR, a lightweight tomato maturity detection algorithm that achieved 3.9% mAP50 and 38.7% FPS improvements compared to the original RT-DETR model by integrating PConv partial convolution, deformable attention mechanisms, and SlimNeck-SSFF feature fusion structures (Wang et al., 2024). Wu et al. proposed an improved RT-DETR crop maturity detection method that enhanced detection accuracy while reducing model size and computational complexity (Wu et al., 2025). Jrondi et al. compared the performance of DETR and YOLOv8 in citrus fruit detection, validating the effectiveness of Transformer-based end-to-end detection methods in complex agricultural environments (Jrondi et al., 2024). These studies fully demonstrate the tremendous potential and broad application prospects of RT-DETR in agricultural object detection, leading us to choose RT-DETR as the base architecture for the MDE-DETR model.

Despite significant advances in object detection algorithms, and although RT-DETR performs excellently in general object detection tasks, several key challenges remain when applying these methods to small-target bayberry detection in complex orchard environments. The small size, irregular distribution, and severe occlusion of bayberry fruits pose difficulties for existing detection models, which often fall short in multi-scale feature representation. Complex orchard backgrounds with varying lighting conditions, dense leaf occlusion, and overlapping fruit clusters generate significant interference, reducing detection accuracy.

To address these challenges, we propose MDE-DETR, a novel small-target bayberry detection algorithm tailored for complex orchard environments. The main contributions of this work are as follows:

1. We design an enhanced feature extraction network that integrates Multi-Path Feature Enhancement Module (MFEM) and reparameterized convolution techniques, significantly improving the network’s perception capability for small target features. The EFENet architecture employs cross-stage partial connections and enhanced local attention mechanisms to optimize feature representation and gradient propagation efficiency.

2. We propose a multi-domain feature fusion network that integrates SPDConv spatial pixel reorganization, Cross-Stage Multi-Kernel Block (CMKBlock), and Dual-Domain Attention Module (DDAM) to achieve efficient multi-scale feature fusion. This architecture leverages spatial and frequency domain information to enhance feature discrimination capability while maintaining computational efficiency.

3. We design an adaptive deformable sampling module that utilizes learnable spatial offset prediction mechanisms to dynamically adjust sampling positions during downsampling operations. This method effectively preserves key feature information of small targets while adapting to irregular fruit distributions in complex orchard environments.

2 Related work

2.1 Dataset construction

The bayberry data collection for this study was conducted at the Zhushan Bayberry Industry Park in Changxing County, Huzhou City, Zhejiang Province, spanning from early May to late June, covering the main growing season of bayberries. During the collection process, we used smartphones including iPhone 12, iPhone 14, and Redmi K40S for photography. To increase data diversity, we captured images under different lighting conditions. We also employed diverse shooting angles, including close-up, medium, and distant views, as well as different elevation and depression angles, to ensure the model could adapt to various shooting conditions. Special attention was paid to areas with dense fruit distribution on bayberry trees, while also including fruits obscured by leaves to improve model robustness. Finally, we collected 2,512 original images, providing sufficient material for subsequent data processing and model training. Some sample images are shown in Figure 1.


Figure 1. Sample images from bayberry dataset (a) backlight; (b) frontlight; (c) upward angle; (d) downward angle; (e) close-up; (f) medium view; (g) distant view; (h) green fruit; (i) red fruit; (j) severe occlusion.

To address the relatively limited sample size of the bayberry dataset and improve model recognition performance in complex environments while effectively alleviating overfitting, this study adopted offline data augmentation strategies to preprocess original images. From the originally collected 2,512 images, we rigorously screened and removed duplicate, blurred, and overexposed low-quality images, ultimately obtaining 1,954 high-quality images to form the base dataset. The screening process particularly emphasized retaining images with different lighting conditions, shooting angles, and background environments to ensure good quality and diversity of the dataset. The dataset was divided into training (1,367 images), validation (391 images), and test sets (196 images) in a 7:2:1 ratio, and all images were precisely annotated using the image annotation tool LabelImg, drawing completely surrounding rectangular bounding boxes for each bayberry target. To further expand the data scale, data augmentation (Shorten and Khoshgoftaar, 2019) was applied to the training set, expanding the sample size from 1,367 to 5,468 images: in terms of geometric transformations, random rotation, horizontal/vertical flipping, and random translation operations were implemented to effectively simulate multi-angle shooting scenarios; in terms of image quality enhancement, dynamic adjustment of image brightness and addition of Gaussian noise enhanced the model’s adaptability to different lighting conditions and imaging quality. The above diversified data augmentation strategies not only effectively expanded the dataset scale but more importantly increased data diversity, providing strong support for stable model performance in complex orchard environments. The augmentation effects are shown in Figure 2.


Figure 2. Data augmentation effects (a) random rotation; (b) horizontal flip; (c) vertical flip; (d) random translation; (e) Gaussian blur; (f) brightness adjustment; (g) random crop; (h) local occlusion enhancement.
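To make the augmentation pipeline concrete, the sketch below shows how the listed transformations could be composed with PyTorch/torchvision. The parameter values (rotation range, translation fraction, brightness factor, noise level) are illustrative assumptions rather than the exact settings used to build the 5,468-image training set, and bounding-box coordinates would need to be transformed consistently with the geometric operations, which is omitted here.

```python
# Illustrative offline-augmentation pipeline; parameter values are assumptions,
# not the exact settings used to build the 5,468-image training set.
import torch
from torchvision import transforms

def add_gaussian_noise(img, std=0.05):
    """Add zero-mean Gaussian noise to a tensor image in [0, 1]."""
    return torch.clamp(img + torch.randn_like(img) * std, 0.0, 1.0)

augment = transforms.Compose([
    transforms.RandomRotation(degrees=15),                      # random rotation
    transforms.RandomHorizontalFlip(p=0.5),                     # horizontal flip
    transforms.RandomVerticalFlip(p=0.5),                       # vertical flip
    transforms.RandomAffine(degrees=0, translate=(0.1, 0.1)),   # random translation
    transforms.ColorJitter(brightness=0.4),                     # brightness adjustment
    transforms.ToTensor(),
    transforms.Lambda(add_gaussian_noise),                      # Gaussian noise
])
```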

2.2 RT-DETR baseline framework

RT-DETR represents a major breakthrough in real-time object detection, successfully addressing the computational limitations of traditional DETR models while maintaining their end-to-end detection advantages. The model’s outstanding performance stems from its innovative architecture, eliminating the need for Non-Maximum Suppression (NMS) post-processing while achieving competitive speed and accuracy metrics compared to mature YOLO variants. RT-DETR’s strength lies in its ability to simultaneously process entire images through Transformer self-attention mechanisms, enabling better understanding of global context compared to traditional CNN-based methods that process images through local receptive fields.

The RT-DETR architecture consists of three main components that work synergistically to achieve optimal detection performance. The backbone network, typically based on ResNet (He et al., 2016) variants, serves as the foundation for feature extraction, processing input images through multiple stages to generate hierarchical feature representations at different scales. The neck network implements an efficient hybrid encoder that processes multi-scale features through intra-scale interaction and cross-scale fusion mechanisms, effectively combining features from different resolution levels while maintaining computational efficiency. The detection head employs a Transformer decoder with auxiliary prediction heads, utilizing IoU-aware query selection to initialize object queries and iteratively optimizing detection results through multiple decoder layers. This architecture enables flexible inference speed adjustment by changing the number of decoder layers without retraining the model, making it particularly suitable for deployment in resource-constrained agricultural environments.

2.3 Related research on feature enhancement and adaptive fusion techniques in small target detection

Enhancing feature extraction capabilities for small target detection has always been a core challenge in computer vision applications. Zhang et al. proposed a Multi-Resolution Attention Extractor, effectively improving small target detection performance in remote sensing images through multi-scale feature extraction and attention mechanism fusion (Zhang et al., 2020). Qiu et al. proposed YOLO-SDL, a lightweight wheat grain detection technology based on improved YOLOv8n, demonstrating effective small target detection capabilities (Qiu et al., 2024). Wang et al. developed FE-YOLOv5, a feature enhancement network that designed Feature Enhancement Module (FEM) to capture more discriminative small target features, significantly improving small target detection through global attention and high-level global contextual information guiding shallow high-resolution features (Wang et al., 2023). Zhang et al. constructed a multi-granularity deformable convolution-based Feature Enhancement Network (FENet) that effectively addressed the problem of small targets being susceptible to scale variations by learning and capturing variations in target shape and scale to obtain offset feature representations at different granularities (Zhang et al., 2023). These studies demonstrate that multi-path feature enhancement mechanisms and cross-stage partial connection techniques show significant advantages in improving gradient flow and feature representation quality.

Multi-scale feature fusion methods have evolved from simple concatenation approaches to complex attention-based mechanisms capable of selectively emphasizing relevant features at different scales. Ren et al. proposed an interactive multi-scale feature representation enhancement strategy that effectively bridged the gap between small target and medium-large target detection performance through multi-scale auxiliary enhancement networks achieving feature interaction under different inputs (Ren et al., 2017). Wei et al. developed SED-YOLO, a multi-scale attention model that maintained key information while reducing computational costs by replacing standard convolution with Switchable Atrous Convolution (SAC) and introducing Efficient Multi-scale Attention (EMA) mechanisms (Wei et al., 2025). Huang et al. designed a multi-scale feature fusion convolutional neural network that significantly improved regional feature expression capabilities for indoor small target detection through fusion of candidate region features at different levels and normalization processing (Huang et al., 2022). These works demonstrate that the integration of spatial and frequency domain information processing has become a powerful technique for enhancing feature discrimination capabilities.

3 Methodology

The Multi-Domain Enhanced DETR (MDE-DETR) algorithm proposed in this paper addresses the technical challenges of small-target bayberry detection in complex orchard environments by constructing an end-to-end detection framework based on multi-domain enhanced feature fusion. The framework contains three core innovations: First, the Enhanced Feature Extraction Network (EFENet) is designed as the backbone architecture, significantly improving the network’s perception capability for small target features through Multi-Path Feature Enhancement Module (MFEM) and reparameterized convolution techniques; Second, the Multi-Domain Feature Fusion Network (MDFFN) architecture is proposed, integrating SPDConv spatial pixel rearrangement, Cross-Stage Multi-Kernel Block (CMKBlock), and Dual-Domain Attention Module (DDAM) to achieve efficient multi-scale feature fusion; Finally, the Adaptive Deformable Sampling (ADSample) downsampling module is constructed, effectively preserving key feature information during downsampling through learnable spatial offset prediction mechanisms. The overall MDE-DETR framework structure is shown in Figure 3, with the three innovative modules working synergistically to provide an efficient and reliable technical solution for small-target bayberry detection in complex environments.


Figure 3. MDE-DETR model architecture.

3.1 Enhanced feature extraction network

Traditional ResNet backbone networks expose numerous limitations in small-target bayberry detection tasks in complex orchard environments. The single convolutional path of ResNet blocks limits the diversity of feature representation, making it difficult to simultaneously capture target features at different scales and orientations, and it performs particularly poorly under complex branch and leaf texture interference. Additionally, the ResNet architecture lacks explicit modeling of inter-channel dependencies and cannot effectively distinguish foreground targets from background noise, leading to decreased detection accuracy under illumination changes and occlusion conditions. To address these challenges, we propose the EFENet backbone network based on MFEM modules, which significantly enhances the network's perception of small-target features and its adaptability to complex environments through the integration of reparameterized convolution, cross-stage partial connections, and enhanced local attention mechanisms, providing efficient and reliable technical support for precision agriculture detection.

The EFENet backbone network is divided into four main stages, corresponding to feature scales of P2/4, P3/8, P4/16, and P5/32, as shown in Figure 4. Compared to traditional ResNet’s unidirectional residual propagation, EFENet achieves multi-path feature flow and cross-stage information interaction through MFEM modules, effectively enhancing gradient propagation efficiency and feature reuse capability. Meanwhile, the network maintains deep learning expression capabilities while achieving multi-branch structure during training and single-branch efficient execution during inference through reparameterization techniques, laying a solid foundation for mobile deployment and real-time detection applications.


Figure 4. EFENet backbone network structure.

The MFEM module is based on the design concept of cross-stage partial connections, achieving efficient gradient-aware feature learning through three stages: feature splitting, parallel processing, and adaptive fusion. As shown in Figure 5, the module first uses 1×1 convolution to expand input features along the channel dimension, which are then equally divided into two independent processing branches F1 and F2. The F1 branch maintains the integrity of original feature information, while the F2 branch undergoes deep feature extraction through RepConv reparameterized convolution, generating F_rep. Subsequently, F_rep is processed through n-1 consecutive 3×3 convolution layers in cascade, with each convolution layer learning different levels of abstract feature representations. Finally, the module adjusts the feature dimensions of the last layer through 1×1 convolution and performs adaptive fusion of all branch outputs. The output of the entire MFEM module can be expressed as shown in Equation 1:


Figure 5. MFEM module structure.

$Y = W_{\mathrm{cv2}} * \mathrm{Concat}\left[F_1, F_{\mathrm{rep}}, F_{m_1}, \ldots, F_{m_{n-1}}, F_{\mathrm{cv4}}\right]$  (1)

where $F_{m_i}$ represents the output feature of the $i$-th cascaded convolution, $F_{\mathrm{cv4}} = W_{\mathrm{cv4}} * F_{m_{n-1}} + b_{\mathrm{cv4}}$ is the adjustment result of the final 1×1 convolution, and $\mathrm{Concat}[\cdot]$ represents feature concatenation along the channel dimension. This multi-branch parallel processing and hierarchical feature fusion mechanism enables MFEM to significantly enhance the richness and discriminative capability of feature expression while maintaining computational efficiency.
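The following is a minimal PyTorch sketch of the MFEM structure implied by Equation 1, assuming a plain 3×3 convolution stands in for RepConv and that the expanded features are split evenly into the two branches; channel widths and the cascade depth n are illustrative.

```python
# Minimal sketch of the MFEM structure of Equation 1. A plain 3x3 convolution
# stands in for RepConv; channel widths and cascade depth n are illustrative.
import torch
import torch.nn as nn

class MFEMSketch(nn.Module):
    def __init__(self, c_in, c_out, n=3):
        super().__init__()
        c_half = c_out // 2
        self.cv1 = nn.Conv2d(c_in, 2 * c_half, 1)            # expand, then split into F1 / F2
        self.rep = nn.Conv2d(c_half, c_half, 3, padding=1)   # stand-in for RepConv
        self.cascade = nn.ModuleList(
            nn.Conv2d(c_half, c_half, 3, padding=1) for _ in range(n - 1)
        )
        self.cv4 = nn.Conv2d(c_half, c_half, 1)              # adjusts the last cascade output
        self.cv2 = nn.Conv2d(c_half * (n + 2), c_out, 1)     # fusion conv W_cv2 in Eq. 1
        self.act = nn.SiLU()

    def forward(self, x):
        f1, f2 = self.cv1(x).chunk(2, dim=1)
        feats = [f1, self.act(self.rep(f2))]                 # F1 and F_rep
        for conv in self.cascade:                            # F_m1 ... F_m(n-1)
            feats.append(self.act(conv(feats[-1])))
        feats.append(self.cv4(feats[-1]))                    # F_cv4
        return self.cv2(torch.cat(feats, dim=1))             # Equation 1
```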

RepConv (Reparameterized Convolution) (Ding et al., 2021) modules employ structural reparameterization techniques to achieve perfect unity between multi-branch learning during training and single-branch execution during inference. During the training phase, RepConv performs collaborative feature learning through three parallel branches: the main branch uses 3×3 convolution kernels to capture local spatial feature relationships, the auxiliary branch uses 1×1 convolution kernels for cross-channel information interaction, and the identity mapping branch (implemented through BatchNorm) maintains the transmission of original feature information. The forward propagation process during training can be mathematically expressed as shown in Equation 2:

$F_{\mathrm{train}} = \sigma\left(W_{3\times3} * X + b_{3\times3} + W_{1\times1} * X + b_{1\times1} + I_{\mathrm{bn}}\right)$  (2)

where $W_{3\times3}$ and $W_{1\times1}$ represent the 3×3 and 1×1 convolution kernel parameters respectively, $b_{3\times3}$ and $b_{1\times1}$ are the corresponding bias terms, $I_{\mathrm{bn}}$ represents the output of the BatchNorm branch, $\sigma$ is the SiLU activation function, and $*$ represents the convolution operation.

During inference, RepConv reparameterizes the multi-branch structure into a single 3×3 convolution operation through mathematically equivalent transformations, involving complex parameter fusion calculations. The core of reparameterization lies in unifying convolution kernels of different sizes to the same spatial dimensions and integrating BatchNorm parameters into convolution weights and biases. Specifically, 1×1 convolution kernels are extended to 3×3 size through zero padding, and identity mapping is represented as equivalent 3×3 convolution through diagonal unit matrix construction. The final equivalent convolution parameters are calculated through Equations 3 and 4:

$W_{\mathrm{equiv}} = W_{3\times3}^{\mathrm{fused}} + \mathrm{Pad}_{3\times3}\left(W_{1\times1}^{\mathrm{fused}}\right) + W_{\mathrm{identity}}^{\mathrm{fused}}$  (3)

$b_{\mathrm{equiv}} = b_{3\times3}^{\mathrm{fused}} + b_{1\times1}^{\mathrm{fused}} + b_{\mathrm{identity}}^{\mathrm{fused}}$  (4)

where the superscript fused denotes convolution weights and biases into which the BatchNorm parameters have been integrated, $\mathrm{Pad}_{3\times3}(\cdot)$ represents the operation of extending 1×1 convolution kernels to 3×3 size through zero padding, and $W_{\mathrm{identity}}^{\mathrm{fused}}$, $b_{\mathrm{identity}}^{\mathrm{fused}}$ are the equivalent convolution parameters of the identity mapping branch. Through this reparameterization mechanism, RepConv maintains rich feature learning capabilities during training while achieving efficient single-branch execution during inference, providing ideal performance guarantees for practical deployment applications.
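The reparameterization of Equations 3 and 4 can be sketched as follows, assuming each training-time branch is a bias-free convolution followed by BatchNorm and that input and output channel counts are equal (a precondition for the identity branch); `fuse_conv_bn` and `reparameterize` are illustrative helper names, not functions from an existing library.

```python
# Sketch of the inference-time fusion in Equations 3-4. Each training branch is
# assumed to be a bias-free convolution followed by BatchNorm2d, with equal
# input/output channels (required for the identity branch).
import torch
import torch.nn.functional as F

def fuse_conv_bn(weight, bn):
    """Fold BatchNorm statistics into a convolution weight and bias."""
    std = (bn.running_var + bn.eps).sqrt()
    w = weight * (bn.weight / std).reshape(-1, 1, 1, 1)
    b = bn.bias - bn.running_mean * bn.weight / std
    return w, b

def reparameterize(w3, bn3, w1, bn1, bn_id, channels):
    w3f, b3f = fuse_conv_bn(w3, bn3)                  # 3x3 branch
    w1f, b1f = fuse_conv_bn(w1, bn1)                  # 1x1 branch
    w1f = F.pad(w1f, [1, 1, 1, 1])                    # zero-pad 1x1 kernel to 3x3
    wid = torch.zeros(channels, channels, 3, 3)       # identity branch as a 3x3 kernel
    for c in range(channels):
        wid[c, c, 1, 1] = 1.0
    widf, bidf = fuse_conv_bn(wid, bn_id)
    return w3f + w1f + widf, b3f + b1f + bidf         # Equations 3 and 4
```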

The EFENet backbone network achieves significant performance breakthroughs in small-target bayberry detection tasks in complex orchard environments through the synergistic effects of MFEM modules and RepConv techniques. This network architecture effectively overcomes the limitations of traditional ResNet in feature expression diversity and computational efficiency, significantly enhancing the network’s perception capability for fine-grained target features and robustness against complex environmental interference through deep integration of cross-stage partial connections and reparameterized convolution.

3.2 Multi-domain feature fusion network

Traditional Concatenation-Convolution Feature Fusion (CCFM) architecture adopts linear fusion methods, lacking effective utilization of frequency domain information and unable to fully exploit nonlinear correlations between features. It inevitably causes information loss during downsampling processes, particularly causing irreversible damage to detail features of small targets. To address these problems, we propose the MDFFN feature fusion architecture (as shown in Figure 6), which significantly improves the effectiveness of multi-scale feature fusion and the accuracy of small target detection while maintaining optimal computational efficiency balance through integration of spatial-channel lossless transformation, omnidirectional kernel feature enhancement, and dual-domain feature propagation mechanisms.


Figure 6. CCFM and MDFFN structure comparison. (a) CCFM; (b) MDFFN.

The MDFFN feature fusion architecture first employs SPDConv (Sunkara and Luo, 2023) spatial pixel rearrangement modules to achieve lossless conversion between spatial resolution and channel dimensions through regular pixel reorganization strategies while reducing computational complexity and completely preserving detail information. Subsequently, CMKBlock omnidirectional kernel feature enhancement modules are introduced, achieving adaptive fusion of features with different receptive fields and significant improvement in expression capabilities through the synergistic effects of cross-stage partial connection strategies and multi-scale parallel convolution kernels. Finally, DDAM and Frequency Modulation Block (FMB) are integrated to perform feature optimization simultaneously in spatial and frequency domains, fully utilizing the global receptive field characteristics of Fourier transform and rich information representation capabilities of complex domains to achieve more precise feature modulation and stronger environmental adaptability.

The Space-to-Depth Convolution (SPDConv) module employs a spatial pixel rearrangement strategy to achieve efficient, lossless conversion between the spatial resolution and channel dimensions of feature maps. The core idea of this module is to re-encode the spatial information of input feature maps into the channel dimension, avoiding the information loss caused by traditional pooling or strided convolution. The specific workflow, shown in Figure 7, is as follows: for an input feature map $X \in \mathbb{R}^{C \times H \times W}$, pixel rearrangement is performed according to a 2×2 grid pattern, decomposing the feature map into four independent sub-feature maps corresponding to pixel sampling results at the top-left (0,0), top-right (0,1), bottom-left (1,0), and bottom-right (1,1) positions. These four sub-feature maps are concatenated along the channel dimension, expanding the output feature channels to four times the original while halving the spatial resolution. Subsequently, the concatenated high-dimensional features undergo deep processing and channel adjustment through a 3×3 convolution, finally producing feature representations of the target dimensions. The mathematical transformation of SPDConv is given in Equation 5:


Figure 7. SPDConv schematic diagram when scale=2. (a) Input feature map; (b-d) Space-to-depth conversion; (e) Output feature map.

$F_{\mathrm{SPD}} = W_{3\times3} * \mathrm{Concat}\left[X[::2,::2],\, X[1::2,::2],\, X[::2,1::2],\, X[1::2,1::2]\right] + b$  (5)

where $X[i::2,\, j::2]$ represents the sub-feature map sampled every 2 pixels starting from position $(i, j)$, and $W_{3\times3} \in \mathbb{R}^{C_{\mathrm{out}} \times 4C \times 3 \times 3}$ and $b$ are learnable convolution weights and bias parameters respectively.
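A minimal sketch of the space-to-depth rearrangement and 3×3 convolution in Equation 5 (scale = 2) is given below; the output channel count is left as a free parameter.

```python
# Sketch of Equation 5 (scale = 2): four pixel-offset sub-maps are concatenated
# on the channel axis, then a 3x3 convolution adjusts the channel count.
import torch
import torch.nn as nn

class SPDConvSketch(nn.Module):
    def __init__(self, c_in, c_out):
        super().__init__()
        self.conv = nn.Conv2d(4 * c_in, c_out, 3, padding=1)

    def forward(self, x):
        tl = x[..., ::2, ::2]      # X[::2, ::2]
        bl = x[..., 1::2, ::2]     # X[1::2, ::2]
        tr = x[..., ::2, 1::2]     # X[::2, 1::2]
        br = x[..., 1::2, 1::2]    # X[1::2, 1::2]
        return self.conv(torch.cat([tl, bl, tr, br], dim=1))
```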

The CMKBlock module is based on the design concept of cross-stage partial connections, achieving efficient feature learning through feature branch processing and omnidirectional kernel enhancement, as shown in Figure 8. The module first preprocesses input features through 1×1 convolution, then splits the processed features into two functionally different branches. The smaller branch (25% channels) serves as the processing branch, undergoing deep feature enhancement through the embedded Multi-Directional Convolution Module (MDCM); the larger branch (75% channels) serves as the identity branch, directly maintaining the transmission of original feature information, thereby reducing computational complexity and enhancing gradient flow effects. Finally, adaptive fusion of the two branches is achieved through feature concatenation and 1×1 convolution.


Figure 8. CMKBlock module structure. (a) CMKBlock; (b) MDCM; (c) DDAM; (d) FMB.
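Based on the description above, a structural sketch of the CMKBlock split could look as follows, assuming the 25%/75% channel partition stated in the text and treating the embedded MDCM as an interchangeable sub-module (a matching MDCM sketch follows Equation 6).

```python
# Structural sketch of the CMKBlock split: 1x1 pre-projection, a 25% processing
# branch passed through MDCM, a 75% identity branch, then concatenation and a
# 1x1 fusion. MDCM is treated as an interchangeable sub-module here.
import torch
import torch.nn as nn

class CMKBlockSketch(nn.Module):
    def __init__(self, channels, mdcm: nn.Module):
        super().__init__()
        self.pre = nn.Conv2d(channels, channels, 1)
        self.c_small = channels // 4                   # 25% processing branch
        self.mdcm = mdcm                               # deep enhancement on the small branch
        self.post = nn.Conv2d(channels, channels, 1)   # fuse after concatenation

    def forward(self, x):
        x = self.pre(x)
        small, large = x[:, :self.c_small], x[:, self.c_small:]
        return self.post(torch.cat([self.mdcm(small), large], dim=1))
```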

The MDCM module achieves omnidirectional feature enhancement through multi-scale parallel convolution and dual-domain attention mechanisms. This module integrates four different geometric shapes of depth convolution kernels (1×1 point convolution, 1×31 horizontal strip convolution, 31×1 vertical strip convolution, 31×31 standard convolution) to capture feature patterns in different directions and scales. Dual-domain attention mechanisms and frequency domain gating modules are simultaneously introduced for feature optimization. The output features of MDCM are obtained through multi-path residual connections and nonlinear activation, as shown in Equation 6:

$F_{\mathrm{MDCM}} = \sigma\left(X + \sum_{k \in \{1\times1,\, 1\times31,\, 31\times1,\, 31\times31\}} W_k * F_{\mathrm{in}} + F_{\mathrm{DDAM}}\right)$  (6)

where $F_{\mathrm{in}} = \mathrm{GELU}(W_{\mathrm{in}} * X)$ is the input preprocessing result, $W_k$ represents the depthwise convolution weights of the corresponding geometric shape, $F_{\mathrm{DDAM}}$ is the output of the dual-domain attention mechanism, and $\sigma$ is the ReLU activation function.
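A sketch of the multi-directional convolution structure in Equation 6 is shown below; the four depthwise kernel shapes follow the text, while feeding $F_{\mathrm{in}}$ (rather than $X$) into the DDAM term is an assumption.

```python
# Sketch of Equation 6: four parallel depthwise kernels (1x1, 1x31, 31x1, 31x31)
# applied to the GELU-projected input, plus the DDAM term and a residual.
import torch
import torch.nn as nn

class MDCMSketch(nn.Module):
    def __init__(self, channels, ddam: nn.Module):
        super().__init__()
        self.w_in = nn.Conv2d(channels, channels, 1)
        self.gelu = nn.GELU()
        self.branches = nn.ModuleList([
            nn.Conv2d(channels, channels, 1, groups=channels),                        # 1x1
            nn.Conv2d(channels, channels, (1, 31), padding=(0, 15), groups=channels), # 1x31
            nn.Conv2d(channels, channels, (31, 1), padding=(15, 0), groups=channels), # 31x1
            nn.Conv2d(channels, channels, 31, padding=15, groups=channels),           # 31x31
        ])
        self.ddam = ddam
        self.relu = nn.ReLU()

    def forward(self, x):
        f_in = self.gelu(self.w_in(x))                          # F_in = GELU(W_in * X)
        multi = sum(branch(f_in) for branch in self.branches)   # sum over kernel shapes
        return self.relu(x + multi + self.ddam(f_in))           # Equation 6
```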

DDAM achieves precise feature modulation through cascaded processing of Frequency Channel Attention (FCA) and Spatial Channel Attention (SCA). FCA first generates frequency domain attention weights using global average pooling and convolution, then performs feature weighting in the frequency domain; SCA performs further spatial domain attention modulation on FCA outputs. The entire DDAM processing flow embodies synergistic optimization of frequency and spatial domains, as described in Equation 7:

$F_{\mathrm{DDAM}} = W_{\mathrm{sca}} * \mathrm{GAP}(F_{\mathrm{fca}}) \odot F_{\mathrm{fca}}$  (7)

where $F_{\mathrm{fca}} = \left|\mathcal{F}^{-1}\left(W_{\mathrm{fca}} * \mathrm{GAP}(F_{\mathrm{in}}) \odot \mathcal{F}(F_{\mathrm{in}})\right)\right|$, $\mathcal{F}(\cdot)$ and $\mathcal{F}^{-1}(\cdot)$ represent the two-dimensional fast Fourier transform and its inverse respectively, $\mathrm{GAP}(\cdot)$ represents the global average pooling operation, and $\odot$ represents element-wise multiplication.
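The dual-domain attention flow of Equation 7 could be sketched as follows, with 1×1 convolutions on globally pooled features standing in for $W_{\mathrm{fca}}$ and $W_{\mathrm{sca}}$; this is a simplified reading of the cascaded FCA/SCA processing, not the exact implementation.

```python
# Sketch of Equation 7: frequency channel attention (FCA) reweights the 2-D FFT
# of the input, then spatial channel attention (SCA) reweights the recovered
# spatial feature. 1x1 convolutions on pooled features stand in for W_fca, W_sca.
import torch
import torch.nn as nn

class DDAMSketch(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.w_fca = nn.Conv2d(channels, channels, 1)
        self.w_sca = nn.Conv2d(channels, channels, 1)
        self.gap = nn.AdaptiveAvgPool2d(1)

    def forward(self, x):
        freq = torch.fft.fft2(x)                          # F(F_in)
        fca_w = self.w_fca(self.gap(x))                   # W_fca * GAP(F_in)
        f_fca = torch.abs(torch.fft.ifft2(fca_w * freq))  # |F^-1( . ⊙ F(F_in))|
        sca_w = self.w_sca(self.gap(f_fca))               # W_sca * GAP(F_fca)
        return sca_w * f_fca                              # Equation 7
```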

FMB achieves adaptive feature enhancement through nonlinear modulation in the complex domain, containing two parallel 1×1 convolution branches and two learnable channel-level modulation parameters $\alpha$ and $\beta$. The specific workflow is as follows: the input features pass through the two convolution branches to obtain $F_1$ and $F_2$ respectively; $F_2$ undergoes a Fourier transform and is multiplied element-wise with $F_1$ in the complex domain; the result is then recovered to the spatial domain through an inverse Fourier transform and combined with a residual connection. The mathematical expression of the entire FMB is shown in Equation 8:

$F_{\mathrm{FMB}} = \alpha \left|\mathcal{F}^{-1}\left(W_1 * X \odot \mathcal{F}(W_2 * X)\right)\right| + \beta X$  (8)

where $\odot$ represents element-wise multiplication, $\alpha, \beta \in \mathbb{R}^{C \times 1 \times 1}$ are learnable channel-level modulation parameters, and $W_1$, $W_2$ are the weight parameters of the two 1×1 convolutions.
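Equation 8 maps directly onto a small module; the sketch below assumes $\alpha$ and $\beta$ are initialized to ones and that the FFT is taken over the two spatial dimensions.

```python
# Sketch of Equation 8: two 1x1 branches, complex-domain modulation of one
# branch by the FFT of the other, inverse FFT, and channel-wise scales alpha, beta.
import torch
import torch.nn as nn

class FMBSketch(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.w1 = nn.Conv2d(channels, channels, 1)
        self.w2 = nn.Conv2d(channels, channels, 1)
        self.alpha = nn.Parameter(torch.ones(channels, 1, 1))
        self.beta = nn.Parameter(torch.ones(channels, 1, 1))

    def forward(self, x):
        f1 = self.w1(x)                                   # W1 * X
        f2 = torch.fft.fft2(self.w2(x))                   # F(W2 * X)
        mod = torch.abs(torch.fft.ifft2(f1 * f2))         # |F^-1(F1 ⊙ F(W2 * X))|
        return self.alpha * mod + self.beta * x           # Equation 8
```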

The MDFFN feature fusion architecture achieves significant performance breakthroughs in small-target bayberry detection tasks in complex orchard environments through deep integration of SPDConv, CMKBlock, and dual-domain attention mechanisms. This architecture successfully overcomes the inherent limitations of traditional CCFM feature fusion methods in information preservation, multi-scale processing, and computational efficiency, significantly improving the network’s capability to preserve detail information and perceive targets of different geometric shapes through lossless spatial-channel dimension conversion and omnidirectional kernel feature enhancement. The SPDConv module’s pixel rearrangement strategy completely eliminates information loss problems in traditional downsampling, while the CMKBlock module’s multi-kernel parallel processing and dual-domain attention mechanisms achieve more precise feature representation and stronger background interference suppression capabilities.

3.3 Adaptive deformable sampling

Traditional 3×3 stride convolution downsampling methods have significant limitations in small-target bayberry detection in complex orchard environments. First, fixed convolution kernel sampling patterns cannot adaptively adjust sampling positions, leading to loss of important target feature information during downsampling processes. Second, traditional downsampling methods lack perception capabilities for target geometric shapes, unable to flexibly adjust receptive fields to adapt to different spatial layout patterns when facing complex fruit distributions and occlusion situations. To address these problems, we propose the ADSample learnable deformable downsampling module (as shown in Figure 9), which enables downsampling processes to dynamically adjust sampling positions according to spatial distributions of input features through introduction of adaptive offset learning mechanisms and bilinear interpolation resampling strategies, effectively preserving key feature information of small targets and significantly improving the network’s perception capability for irregular targets in complex orchard environments and precision of feature expression.


Figure 9. ADSample module structure.

The ADSample module achieves intelligent downsampling operations through learnable spatial offset prediction and adaptive feature resampling, with the core design concept of enabling networks to autonomously learn optimal sampling strategies to adapt to different target feature distributions. The module first uses an offset prediction network composed of a 3×3 convolution layer to process input features X∈ℝ^(C×H×W), generating spatial offsets with 2N channels, where N represents the number of sampling points corresponding to each output position. The offset prediction process predicts optimal sampling position offsets by learning local spatial patterns of input features, which can be mathematically expressed as shown in Equation 9:

$\Delta = W_{\mathrm{offset}} * X + b_{\mathrm{offset}}$  (9)

where $W_{\mathrm{offset}} \in \mathbb{R}^{2N \times C \times 3 \times 3}$ are the convolution weight parameters of the offset prediction network, $b_{\mathrm{offset}}$ are the corresponding bias parameters, and $\Delta \in \mathbb{R}^{2N \times H \times W}$ is the predicted two-dimensional spatial offset tensor.

Next, the module constructs final sampling coordinates by combining regular sampling grids, predefined sampling patterns, and learned offsets. The baseline sampling grid P0 is uniformly distributed on input feature maps according to downsampling strides, and the predefined sampling pattern Pn defines initial sampling point layouts relative to baseline positions. By adding these components, final coordinates for each sampling point are obtained, reflecting the transition from regular sampling to adaptive sampling, as shown in Equation 10:

$P = P_0 + P_n + \Delta$  (10)

where $P_0 \in \mathbb{R}^{2N \times H \times W}$ represents the regular sampling baseline grid corresponding to the output stride, $P_n \in \mathbb{R}^{2N \times 1 \times 1}$ is the relative sampling position template generated from an $N \times N$ grid pattern, and $P \in \mathbb{R}^{2N \times H \times W}$ are the final adaptive sampling coordinates.

Since predicted sampling positions are typically non-integer coordinates, the module employs bilinear interpolation for feature sampling in continuous space. For any sampling point $p = (p_x, p_y)$, the algorithm first determines the four integer-coordinate neighbors around it, then calculates the corresponding interpolation weights. Taking the top-left neighbor as an example, its interpolation weight considers the distance between the sampling point and the neighbor point in both spatial dimensions, as shown in Equation 11:

$w_{lt} = \max\left(0,\, 1 - \left|p_x - \lfloor p_x \rfloor\right|\right) \times \max\left(0,\, 1 - \left|p_y - \lfloor p_y \rfloor\right|\right)$  (11)

Similarly, the interpolation weights for the bottom-right $w_{rb}$, bottom-left $w_{lb}$, and top-right $w_{rt}$ neighbors can be calculated, ensuring continuity and differentiability of the sampling process.

The feature resampling process is achieved through bilinear interpolation, where feature values at each sampling position are obtained by weighted summation of feature values from four integer neighbors. This process not only ensures sampling smoothness but also maintains effective gradient propagation, enabling end-to-end training of the entire module, as described in Equation 12:

$F_{\mathrm{resampled}}(p) = \sum_{(i,j) \in \{lt,\, rb,\, lb,\, rt\}} w_{ij} \cdot X(q_{ij})$  (12)

where $q_{ij}$ represents the four integer-coordinate neighbors around sampling point $p$, and $w_{ij}$ are the corresponding bilinear interpolation weights.

Finally, the resampled features $F_{\mathrm{resampled}} \in \mathbb{R}^{C \times H \times W \times N}$ undergo a special spatial rearrangement operation, stacking the information from the $N$ sampling points along the height dimension to form a tensor format suitable for column convolution processing. Subsequently, feature integration and dimensional transformation are achieved through a convolution layer with kernel size $(N, 1)$, realizing the final downsampled output, as shown in Equation 13:

$Y = \sigma\left(W_{\mathrm{col}} * \mathrm{Rearrange}(F_{\mathrm{resampled}}) + b_{\mathrm{col}}\right)$  (13)

where $W_{\mathrm{col}} \in \mathbb{R}^{C_{\mathrm{out}} \times C \times N \times 1}$ are the weight parameters of the column convolution, $\mathrm{Rearrange}(\cdot)$ represents the operation that rearranges the resampled features from $(C, H, W, N)$ to $(C, H \times N, W)$ format, and $\sigma$ is the SiLU activation function.
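Putting Equations 9–13 together, the sketch below outlines the ADSample flow for a fixed 2×2 sampling pattern (N = 4) and stride 2, using `torch.nn.functional.grid_sample` to perform the bilinear interpolation of Equations 11 and 12; tensor layouts, the pattern template, and the coordinate normalization are illustrative assumptions rather than the exact implementation.

```python
# Condensed sketch of Equations 9-13 with a fixed 2x2 pattern (N = 4), stride 2,
# and even input sizes. grid_sample performs the bilinear interpolation of
# Equations 11-12; layouts and normalization details are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ADSampleSketch(nn.Module):
    def __init__(self, c_in, c_out, n=4, stride=2):
        super().__init__()
        self.n, self.stride = n, stride
        self.offset = nn.Conv2d(c_in, 2 * n, 3, padding=1, stride=stride)  # Eq. 9
        self.col = nn.Conv2d(c_in, c_out, (n, 1), stride=(n, 1))           # Eq. 13
        self.bn, self.act = nn.BatchNorm2d(c_out), nn.SiLU()

    def forward(self, x):
        b, c, h, w = x.shape
        ho, wo = h // self.stride, w // self.stride
        delta = self.offset(x).view(b, self.n, 2, ho, wo)                  # learned offsets
        ys, xs = torch.meshgrid(torch.arange(ho, device=x.device),
                                torch.arange(wo, device=x.device), indexing="ij")
        p0 = torch.stack((xs, ys), 0).float() * self.stride                # baseline grid P0
        pn = torch.tensor([[0., 0.], [1., 0.], [0., 1.], [1., 1.]],
                          device=x.device).view(1, self.n, 2, 1, 1)        # fixed pattern Pn
        p = p0.view(1, 1, 2, ho, wo) + pn + delta                          # Eq. 10
        gx = p[:, :, 0] / (w - 1) * 2 - 1                                  # normalize to [-1, 1]
        gy = p[:, :, 1] / (h - 1) * 2 - 1
        grid = torch.stack((gx, gy), -1).reshape(b * self.n, ho, wo, 2)
        x_rep = x.unsqueeze(1).expand(-1, self.n, -1, -1, -1).reshape(b * self.n, c, h, w)
        sampled = F.grid_sample(x_rep, grid, mode="bilinear",
                                align_corners=True)                        # Eqs. 11-12
        sampled = sampled.view(b, self.n, c, ho, wo).permute(0, 2, 3, 1, 4)
        sampled = sampled.reshape(b, c, ho * self.n, wo)                   # rearrange for column conv
        return self.act(self.bn(self.col(sampled)))                        # Eq. 13
```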

The ADSample learnable deformable downsampling module achieves significant performance improvements in small-target bayberry detection tasks in complex orchard environments through synergistic effects of adaptive offset learning and bilinear interpolation resampling. This module successfully overcomes limitations of traditional fixed convolution downsampling in feature preservation and geometric adaptability, enabling downsampling processes to better adapt to geometric features and spatial distributions of targets through dynamic adjustment of sampling positions.

4 Experimental results and analysis

4.1 Experimental setup

All experiments in this study were conducted on a high-performance computing platform to ensure reliability and reproducibility of experimental results. The detailed experimental configuration is shown in Table 1. The hardware environment includes an Intel Core i9-10900K processor, 32GB memory, and an NVIDIA GeForce RTX 3090 graphics card, providing sufficient computational resources for deep learning model training and inference. The software environment is based on Python 3.9 and the PyTorch deep learning framework. During training, the batch size was set to 8, training epochs to 300, the AdamW optimizer was used, and the initial learning rate was set to 0.0001.


Table 1. Experimental configuration.

4.2 Evaluation metrics

To comprehensively evaluate model performance and efficiency in bayberry detection tasks, Precision (P), Recall (R), and mean Average Precision (mAP) are used to measure model performance. Precision reflects the accuracy of model prediction results, i.e., the proportion of true positive samples among all results predicted as positive samples.

Recall reflects the completeness of model prediction results, i.e., the proportion of correctly predicted positive samples among all actual positive samples. mAP is the area under the precision-recall curve at different Intersection over Union thresholds, serving as a comprehensive performance indicator considering both precision and recall. The calculation methods for the evaluation metrics are shown in Equations 14–17.

$P = \frac{TP}{TP + FP} \times 100\%$  (14)

$R = \frac{TP}{TP + FN} \times 100\%$  (15)

$AP = \int_{0}^{1} P(R)\, dR$  (16)

$mAP = \frac{1}{N}\sum_{i=1}^{N} AP_i \times 100\%$  (17)

where TP is the number of positive samples predicted as positive; FP is the number of negative samples predicted as positive; FN is the number of positive samples predicted as negative.
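As a small numeric illustration of Equations 14–17, the snippet below computes precision and recall from hypothetical detection counts and approximates AP by trapezoidal integration of an invented precision-recall curve; a full mAP evaluation would sweep confidence and IoU thresholds over the whole test set, which is omitted here.

```python
# Numeric illustration of Equations 14-17 with invented counts and an invented
# precision-recall curve; not a full COCO-style mAP evaluation.
import numpy as np

tp, fp, fn = 180, 20, 30                          # hypothetical detection counts
precision = tp / (tp + fp) * 100                  # Eq. 14 -> 90.0%
recall = tp / (tp + fn) * 100                     # Eq. 15 -> 85.7%

r = np.array([0.0, 0.2, 0.4, 0.6, 0.8, 0.857])    # illustrative recall samples
p = np.array([1.0, 0.98, 0.95, 0.93, 0.91, 0.90]) # corresponding precision values
ap = np.trapz(p, r)                               # Eq. 16, area under P(R)
map_value = np.mean([ap])                         # Eq. 17 (single class here)
print(f"P={precision:.1f}%  R={recall:.1f}%  AP={ap:.3f}  mAP={map_value:.3f}")
```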

In terms of model complexity, model parameters, Floating Point Operations (FLOPs), and memory usage are used as measurement indicators. Model parameters reflect the scale and complexity of the model. FLOPs reflect the computational load of the model during inference. Memory usage reflects the model’s demand for memory resources during operation.

4.3 Experimental results analysis

4.3.1 Comparison experiments of different backbones

To verify the effectiveness of the proposed EFENet backbone network, this study compared multiple mainstream backbone networks, with comparison results shown in Table 2.


Table 2. Comparison results of different backbones.

Based on RT-DETR-r18, we respectively improved the backbone network with current mainstream modules while keeping other settings unchanged. From the table data, although the introduction of ApNet and FcaNet led to decreases in mAP50, StripNet, ElgcaNet, DtabNet, and our proposed EFENet all achieved improvements in mAP50 values. Among them, our proposed EFENet backbone network achieved the highest mAP50 value, fully demonstrating the effectiveness of the proposed network structure.

4.3.2 Ablation experiments

To verify the optimization effects of each module, we conducted ablation experiments on the three improvement strategies proposed in this paper using our self-constructed bayberry dataset. Results are shown in Table 3.


Table 3. Ablation experiment results.

Experimental results show that replacing the backbone network with EFENet on the original RT-DETR model improved mAP50 and mAP50:95 by 1.4% and 2.4% respectively compared to the original model, while reducing parameters, computational load, and memory usage by 30.3%, 22%, and 29.5% respectively. Using EFENet to replace traditional ResNet18 backbone not only enhanced model feature extraction capabilities but also reduced model parameters and computational complexity. After introducing the improved MDFFN module into the RT-DETR network structure, with only increases of 0.6M parameters, 4.3G computational load, and 1.2MB memory usage, the model’s mAP50 and mAP50:95 improved by 1.4% and 2.3% respectively compared to the original model. This is because processing of P2 layer and multi-scale feature fusion were added, enhancing recognition capability for small-target bayberries, though the high resolution of P2 layer caused slight increases in parameters and computational load. After constructing ADSample alone, the model’s mAP50 and mAP50:95 improved by 2.3% and 3.8% respectively compared to the original model, with parameters, computational load, and memory usage remaining basically consistent with the original model, due to its flexible sampling position adjustment mechanism improving recognition performance in dense occlusion scenarios. In the combined MDFFN and ADSample model, mAP50 and mAP50:95 improved by 2.8 and 3.7 percentage points respectively compared to the original model, with computational load increasing by 2.8G while parameters and memory usage remaining basically unchanged. Compared to the baseline network, after improvements, mAP50 and mAP50:95 improved by 3.8% and 5.1% respectively, while parameters and memory usage decreased by 25.7% and 25.1% respectively, and computational load decreased by 1.4G. The above data fully demonstrate the effectiveness of our innovations in object detection tasks, further validating their potential in practical applications.

4.3.3 Comparison experiments with different models

To comprehensively evaluate the improved MDE-DETR model, this study systematically compared it with current mainstream object detection models on the same dataset. Experimental results are presented in Table 4, where the MDE-DETR model demonstrates significant advantages across multiple key indicators including detection accuracy, model complexity, and practical runtime efficiency. In terms of detection accuracy, the MDE-DETR model achieved the best performance in precision, recall, and mean average precision, reaching 90.5%, 85.8%, and 92.9% mAP50 respectively. Regarding model complexity, MDE-DETR has the smallest parameter count at only 14.7M and the lowest computational cost at 55.5 GFLOPs, demonstrating excellent balance between accuracy and model lightweighting. In terms of practical runtime efficiency, MDE-DETR achieves 80.1 FPS on NVIDIA RTX 3090 GPU with 640×640 input resolution, ensuring real-time performance for agricultural deployment scenarios while maintaining the highest detection accuracy. Compared to the latest YOLO12m and DEIM-D-Fine-M, MDE-DETR achieved significant improvements of 4.8% and 4.5% in mAP50 respectively, while reducing parameters by 25.01% and 23.44% respectively and maintaining competitive inference speed (80.1 FPS vs. 129.8 FPS and 68.8 FPS). Notably, while YOLO12m achieves higher FPS (129.8), MDE-DETR demonstrates superior accuracy-efficiency trade-off with 4.8% higher mAP50 at 61.7% of its speed. Compared to the RT-DETR-r18 baseline, MDE-DETR improves mAP50 by 3.8% while achieving 25.76% fewer parameters. These results, as detailed in Table 4, fully validate the effectiveness of MDE-DETR’s structural optimization and design choices in achieving superior accuracy-efficiency-speed trade-offs.


Table 4. Comparison results of different models on bayberry dataset.

4.3.4 Model detection effect visualization

To intuitively demonstrate improvement effects, this study selected representative images from three typical complex scenarios: dense distribution, leaf occlusion, and small targets, for visual comparison analysis between the RT-DETR baseline model and the improved MDE-DETR model. As shown in Figure 10, differentiated color marking is used to highlight key performance comparisons: blue regions identify false detection phenomena in the RT-DETR model (non-bayberry targets incorrectly identified), and purple regions identify missed detection or duplicate detection phenomena (target omission or multiple identifications of the same target). Comparison results show that MDE-DETR effectively improved detection performance through synergistic effects of three core innovation modules, demonstrating significant advantages in target localization accuracy, detection confidence, and completeness.


Figure 10. Partial image detection results. (a) dense distribution; (b) leaf occlusion; (c) small targets.

Specific scenario analysis shows: In dense distribution scenarios (Figure 10a), RT-DETR tends to miss detections in target overlap areas due to limitations in feature extraction and fusion capabilities. MDE-DETR effectively improved dense target detection accuracy through multi-path feature enhancement mechanisms in the EFENet backbone’s MFEM module, combined with SPDConv spatial pixel rearrangement and CMKBlock omnidirectional kernel feature enhancement strategies in MDFFN. In occlusion scenarios (Figure 10b), RT-DETR’s fixed sampling mechanism struggles to adapt to complex morphologies of occluded targets. MDE-DETR employs the ADSample adaptive deformable downsampling module, significantly enhancing detection robustness in occluded environments through learnable spatial offset prediction mechanisms that dynamically adjust sampling positions. In small target detection scenarios (Figure 10c), RT-DETR has insufficient utilization of multi-scale feature information, with limited accuracy when processing distant small-size targets. MDE-DETR significantly improved small target feature expression and recognition accuracy through efficient feature extraction capabilities of reparameterized convolution technology in EFENet, combined with spatial-frequency domain synergistic optimization of DDAM dual-domain attention mechanisms in MDFFN.

4.3.5 Model feature visualization

To more intuitively observe the improvement in MDE-DETR model’s bayberry recognition capability, this study employed Grad-CAM (Gradient-weighted Class Activation Mapping) technology to generate heatmaps, visually demonstrating the model’s learning of bayberry features. Grad-CAM utilizes training weights for backpropagation, performs global average pooling on gradient matrices in spatial dimensions, and conducts weighted activation processing on various channels of feature layers to generate visualization heatmaps. In these heatmaps, regional brightness directly reveals feature areas with significant impact on model prediction results. The heatmap visualization comparison shown in Figure 11 clearly demonstrates performance differences among different models. Compared to current mainstream models RT-DETR-r18, RT-DETR-r34, RT-DETR-r50, and YOLO12m, the MDE-DETR model shows brighter colors in bayberry target regions, indicating significantly enhanced response to bayberry features. Particularly in the MDE-DETR model heatmap (f) in the bottom right of the image, bayberry regions show distinct yellow-red highlighted areas, indicating that the model can more accurately focus on key features in bayberry regions.


Figure 11. Heatmap visualization results of different models. (a) Original; (b) RT-DETR-r18 model; (c) RT-DETR-r34 model; (d) RT-DETR-r50 model; (e) YOLO12m model; (f) MDE-DETR model.

4.3.6 Generalization experiments

To verify the generalization performance of the proposed method, this study conducted comparative experiments between MDE-DETR and RT-DETR-r18 on the VisDrone2019 (Du et al., 2019) drone aerial small-target dataset and the TomatoPlantfactoryDataset (Wu et al., 2023) tomato dataset, as shown in Table 5. The rationale for selecting these two datasets is that VisDrone2019 primarily features small targets, while TomatoPlantfactoryDataset has characteristics such as dense occlusion, diverse fruit morphologies, and complex lighting conditions, which are highly similar to our bayberry dataset. The experimental environment and parameter configurations remained consistent with Section 4.1. Experimental results show that on the VisDrone2019 dataset, compared to RT-DETR-r18, MDE-DETR improved precision and recall by 1.8% and 4.0% respectively, significantly reducing missed and false detections of small targets in drone aerial scenarios; meanwhile, mAP50 and mAP50:95 improved by 3.0% and 1.6% respectively, indicating superior detection accuracy of MDE-DETR. On the TomatoPlantfactoryDataset, MDE-DETR's precision and recall improved by 1.9% and 1.3% respectively compared to RT-DETR-r18, effectively improving tomato detection performance in complex environments; mAP50 and mAP50:95 improvements reached 1.8 and 1.5 percentage points respectively, further confirming MDE-DETR's superior performance. These results show that the proposed MDE-DETR model demonstrates superior detection performance on two datasets with different characteristics, fully validating the method's good generalization capability.


Table 5. Generalization experiments on different datasets.

4.3.7 Bayberry detection and counting correlation study

To verify the effectiveness of the proposed bayberry detection and counting method, this study randomly selected 50 images from the test set for validation. To ensure accuracy of standard data, bayberries in each image were manually counted 5 times, with average values taken as ground truth annotations. Linear regression analysis was used to quantitatively evaluate MDE-DETR model’s detection performance. As shown in Figure 12, the fitting curve between machine counting and manual counting results shows good linearity, with R² = 0.9788 indicating high linear correlation. The Mean Absolute Error (MAE = 6.78) shows that on average, the model’s predictions deviate from manual counts by approximately 6.78 bayberries, demonstrating good practical accuracy. The Root Mean Square Error (RMSE = 7.75) confirms the model’s consistency across the test set. These comprehensive metrics validate the feasibility and accuracy of MDE-DETR model for precise bayberry counting.
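The evaluation step can be reproduced with a few lines of NumPy, as sketched below; the count arrays are placeholders rather than the actual per-image counts from the 50 test images.

```python
# Computing R^2, MAE, and RMSE between manual and predicted counts; the arrays
# are placeholders, not the actual counts for the 50 test images.
import numpy as np

manual = np.array([42.0, 97.0, 63.0, 151.0, 158.0])      # ground-truth counts (placeholder)
predicted = np.array([39.0, 103.0, 60.0, 144.0, 150.0])  # model counts (placeholder)

residual = manual - predicted
r2 = 1 - np.sum(residual ** 2) / np.sum((manual - manual.mean()) ** 2)
mae = np.mean(np.abs(residual))
rmse = np.sqrt(np.mean(residual ** 2))
print(f"R^2={r2:.4f}  MAE={mae:.2f}  RMSE={rmse:.2f}")
```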

Figure 12. Linear regression analysis comparison between manual counting and machine counting.

Figure 13 shows the bayberry detection and counting results of the MDE-DETR model in various complex scenarios. The bayberry quantities computed by the model closely match the manual counts, further validating the reliability of the method. Visual comparison on the test set shows that, across complex scenarios including occlusion, dense distribution, and close-up, medium, and distant views, the detection accuracy of the baseline RT-DETR was clearly lower than that of the improved MDE-DETR, with the gap being particularly pronounced for medium-to-distant small targets, severe occlusion, and high-density distributions.

Figure 13. Partial image detection and counting results, with detected counts (RT-DETR / MDE-DETR): (a) Back light, 31/35; (b) Direct light, 94/104; (c) Close shot, 60/67; (d) Medium shot, 134/162; (e) Long shot, 127/160; (f) Weak light, 61/59; (g) Mixed bayberry, 49/53; (h) Red fruit, 47/51; (i) Dense, 79/90; (j) Severe occlusion, 141/162.

5 Discussion

The MDE-DETR algorithm proposed in this study achieved significant success in bayberry detection in complex orchard environments, demonstrating clear advantages over existing methods. First, for small bayberry targets characterized by dense distribution and frequent occlusion, where traditional YOLO-series algorithms perform poorly, the RT-DETR architecture adopted in this study enables end-to-end object detection, eliminating post-processing steps such as NMS, simplifying the detection pipeline, and improving efficiency. Compared with the improved YOLOv7 bayberry detection model of Li et al., which achieved a high accuracy of 90.21% but required as many as 124.5M parameters (Li et al., 2023), this study achieved 92.9% detection accuracy with only 14.7M parameters, making it better suited to deployment on mobile and embedded devices.

In terms of technical innovation, the EFENet backbone addresses the insufficient feature-expression capability of the traditional ResNet in complex environments through the multi-path feature enhancement of the MFEM modules. The MDFFN feature fusion architecture markedly improves small-target detection accuracy through the lossless dimensional transformation of the SPDConv modules and the omnidirectional-kernel feature enhancement of the CMKBlock. The adaptive offset-learning mechanism of the ADSample module allows the downsampling process to adapt dynamically to target geometry, performing particularly well in occluded and dense scenes.
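
To make the “lossless dimensional transformation” of SPDConv concrete, the sketch below shows the core space-to-depth rearrangement followed by a stride-1 convolution, in the spirit of Sunkara and Luo (2023). The channel sizes and layer choices are illustrative assumptions, not the exact MDE-DETR configuration.

```python
# Space-to-depth downsampling sketch (SPD-style): every 2x2 spatial block is folded into
# the channel dimension, so resolution halves without discarding pixels, and a stride-1
# convolution then mixes the rearranged channels. Sizes here are illustrative only.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SPDConv(nn.Module):
    def __init__(self, in_channels, out_channels, scale=2):
        super().__init__()
        self.scale = scale
        self.conv = nn.Conv2d(in_channels * scale * scale, out_channels,
                              kernel_size=3, stride=1, padding=1, bias=False)
        self.bn = nn.BatchNorm2d(out_channels)
        self.act = nn.SiLU()

    def forward(self, x):
        # (B, C, H, W) -> (B, C*scale^2, H/scale, W/scale), no information loss
        x = F.pixel_unshuffle(x, self.scale)
        return self.act(self.bn(self.conv(x)))


if __name__ == "__main__":
    feat = torch.randn(1, 64, 80, 80)       # an example high-resolution feature map
    print(SPDConv(64, 128)(feat).shape)     # torch.Size([1, 128, 40, 40])
```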

Generalization experiments on the VisDrone2019 drone aerial small-target dataset and the TomatoPlantfactoryDataset tomato dataset demonstrated that the MDE-DETR model has good cross-domain adaptation capability.

While our dataset of 1,954 high-quality images was collected under diverse conditions (different lighting, angles, and growth stages), it originates from a single geographical location. This may limit the model’s generalization to bayberry varieties, cultivation practices, or environmental conditions found in other regions. The dataset primarily captures the Dongkui bayberry variety common to Zhejiang Province, and performance may vary with other varieties.

To partially mitigate these concerns, we conducted generalization experiments on two additional datasets: VisDrone2019 (a drone aerial small target dataset) and TomatoPlantfactoryDataset (a dense occlusion dataset). The strong performance on these diverse datasets suggests reasonable generalization capability to similar small-target detection tasks. However, we acknowledge that validation on additional bayberry datasets from different geographical regions would further strengthen confidence in the model’s robustness.

However, this study still has certain limitations: although the model is optimized in parameter count, processing the P2 layer in the MDFFN module still introduces additional computational overhead; moreover, the dataset scale is relatively limited, and larger-scale, more diverse datasets might further improve model performance.

Future work will focus on: (1) collecting multi-regional bayberry datasets covering different varieties and cultivation environments, (2) investigating domain adaptation techniques to improve cross-regional performance, and (3) validating the model on commercial orchards in different geographical locations.

6 Conclusion

This paper proposes the MDE-DETR algorithm for small-target bayberry detection and counting in complex orchard environments. By constructing the EFENet enhanced feature extraction network, the MDFFN multi-domain feature fusion architecture, and the ADSample adaptive deformable downsampling module, the technical challenges of small-target detection, dense distribution, and frequent occlusion were successfully addressed. Experimental results show that, compared with the RT-DETR baseline model, MDE-DETR improved mAP50 and mAP50:95 by 3.8% and 5.1% respectively, reaching detection accuracies of 92.9% and 67.9%, while reducing parameters and memory usage by 25.76% and 25.14% respectively, achieving a good balance between detection accuracy and computational efficiency. Generalization experiments further validated the robustness and applicability of the method. The algorithm provides an efficient and feasible technical solution for fruit detection and counting in precision agriculture, with both theoretical value and practical significance. Future work will focus on further optimizing the model structure, expanding the dataset scale, and exploring application potential in broader agricultural scenarios.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author contributions

CZ: Funding acquisition, Resources, Validation, Writing – review & editing. YZ: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft. WF: Funding acquisition, Resources, Validation, Writing – review & editing. LY: Validation, Writing – review & editing. CY: Conceptualization, Project administration, Supervision, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. This research was funded by the Hainan Provincial Key Research and Development Program, grant number ZDYF2022XDNY231 and by the Graduate Research Innovation Project of the School of Information Engineering, Huzhou University, grant number 2025XGKC03.

Acknowledgments

The authors thank all those who helped in the course of this research.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.


Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Bargoti, S. and Underwood, J. (2017). “Deep fruit detection in orchards,” in 2017 IEEE International Conference on Robotics and Automation (ICRA) (Singapore: IEEE), 3626–3633.

Bresilla, K., Perulli, G. D., Boini, A., Morandi, B., Corelli Grappadelli, L., and Manfrini, L. (2019). Single-shot convolution neural networks for real-time fruit detection within the tree. Front. Plant Sci. 10. doi: 10.3389/fpls.2019.00611

Dai, T., Wang, J., Guo, H., Li, J., Wang, J., and Zhu, Z. (2024). “FreqFormer: frequency-aware transformer for lightweight image super-resolution,” in Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (Jeju: IJCAI), 488–496.

Ding, X., Zhang, X., Ma, N., Han, J., Ding, G., and Sun, J. (2021). “RepVGG: making VGG-style ConvNets great again,” in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (Nashville, TN: IEEE), 13728–13737.

Du, D., Zhu, P., Wen, L., Bian, X., Lin, H., Hu, Q., et al. (2019). “VisDrone-DET2019: the vision meets drone object detection in image challenge results,” in 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW) (Seoul: IEEE), 213–226.

He, K., Zhang, X., Ren, S., and Sun, J. (2016). “Deep residual learning for image recognition,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (Las Vegas, NV: IEEE Computer Society), 770–778.

Huang, L., Chen, C., Yun, J., Sun, Y., Tian, J., Hao, Z., et al. (2022). Multi-scale feature fusion convolutional neural network for indoor small target detection. Front. Neurorobotics 16. doi: 10.3389/fnbot.2022.881021

Jrondi, Z., Moussaid, A., and Hadi, M. Y. (2024). Exploring End-to-End object detection with transformers versus YOLOv8 for enhanced citrus fruit detection within trees. Syst. Soft Computing 6, 200103. doi: 10.1016/j.sasc.2024.200103

Kirk, R., Cielniak, G., and Mangan, M. (2020). L*a*b*Fruits: A rapid and robust outdoor fruit detection system combining bio-inspired features with one-stage deep learning networks. Sensors (Basel) 20, 275. doi: 10.3390/s20010275

Koirala, A., Walsh, K. B., Wang, Z., and McCarthy, C. (2019). Deep learning for real-time fruit detection and orchard fruit load estimation: benchmarking of ‘MangoYOLO’. Precis. Agric. 20, 1107–1135. doi: 10.1007/s11119-019-09642-0

Lawal, O. M., Zhu, S., and Cheng, K. (2023). An improved YOLOv5s model using feature concatenation with attention mechanism for real-time fruit detection and counting. Front. Plant Sci. 14. doi: 10.3389/fpls.2023.1153505

Li, S., Tao, T., Zhang, Y., Li, M., and Qu, H. (2023). YOLO v7-CS: A YOLO v7-Based Model for Lightweight Bayberry Target Detection Count. Agronomy 13, 2952. doi: 10.3390/agronomy13122952

Li, J., Zhang, Z., and Zuo, W. (2025). “Rethinking transformer-based blind-spot network for self-supervised image denoising,” in Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence and Thirty-Seventh Conference on Innovative Applications of Artificial Intelligence and Fifteenth Symposium on Educational Advances in Artificial Intelligence (Philadelphia, PA: AAAI Press), 3301–3309. doi: 10.1609/aaai.v39i5.32506

Lin, C., Jiang, W., Zhao, W., Zou, L., and Xue, Z. (2025). DPD-YOLO: dense pineapple fruit target detection algorithm in complex environments based on YOLOv8 combined with attention mechanism. Front. Plant Sci. 16. doi: 10.3389/fpls.2025.1523552

Liu, G., Nouaze, J. C., Touko Mbouembe, P. L., and Kim, J. H. (2020). YOLO-tomato: A robust algorithm for tomato detection based on YOLOv3. Sensors (Basel) 20, 2145. doi: 10.3390/s20072145

Lv, W., Xu, S., Zhao, Y., Wang, G., Wei, J., Cui, C., et al. (2023). “DETRs beat YOLOs on real-time object detection,” in 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (Seattle, WA: IEEE), 16965–16974.

Mohanty, S. P., Hughes, D. P., and Salathé, M. (2016). Using deep learning for image-based plant disease detection. Front. Plant Sci. 7. doi: 10.3389/fpls.2016.01419

Noman, M., Fiaz, M., Cholakkal, H., Khan, S., and Khan, F. S. (2024). ELGC-net: efficient local–global context aggregation for remote sensing change detection. IEEE Trans. Geosci. Remote Sens. 62, 1–11. doi: 10.1109/TGRS.2024.3362914

Qiu, Z., Wang, F., Wang, W., Li, T., Jin, X., Qing, S., et al. (2024). YOLO-SDL: a lightweight wheat grain detection technology based on an improved YOLOv8n model. Front. Plant Sci. 15. doi: 10.3389/fpls.2024.1495222

Redmon, J., Divvala, S. K., Girshick, R. B., and Farhadi, A. (2016). “You only look once: unified, real-time object detection,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (Las Vegas, NV: IEEE Computer Society), 779–788.

Ren, S., He, K., Girshick, R., and Sun, J. (2017). Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 39, 1137–1149. doi: 10.1109/TPAMI.2016.2577031

Shorten, C. and Khoshgoftaar, T. M. (2019). A survey on image data augmentation for deep learning. J. Big Data 6, 1–48. doi: 10.1186/s40537-019-0197-0

Sunkara, R. and Luo, T. (2023). “No more strided convolutions or pooling: A new CNN building block for low-resolution images and small objects,” in Machine Learning and Knowledge Discovery in Databases. Eds. Amini, M.-R., Canu, S., Fischer, A., Guns, T., Kralj Novak, P., and Tsoumakas, G. (Cham: Springer Nature Switzerland), 443–459.

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., et al. (2017). “Attention is all you need,” in Proceedings of the 31st International Conference on Neural Information Processing Systems (Long Beach, CA: Curran Associates Inc.), 5998–6008.

Wang, S., Dong, S., Chen, S., and Liu, M. (2025). An efficient scale-aware model based on the improved RT-DETR for pomegranate growth stage detection. Neurocomputing 647, 130462. doi: 10.1016/j.neucom.2025.130462

Wang, S., Jiang, H., Yang, J., Ma, X., Chen, J., Li, Z., et al. (2024). Lightweight tomato ripeness detection algorithm based on the improved RT-DETR. Front. Plant Sci. 15. doi: 10.3389/fpls.2024.1415297

Wang, A., Qian, W., Li, A., Xu, Y., Hu, J., Xie, Y., et al. (2024). NVW-YOLOv8s: An improved YOLOv8s network for real-time detection and segmentation of tomato fruits at different ripeness stages. Comput. Electron. Agric. 219, 108833. doi: 10.1016/j.compag.2024.108833

Wang, M., Yang, W., Wang, L., Chen, D., Wei, F., KeZiErBieKe, H., et al. (2023). FE-YOLOv5: Feature enhancement network based on YOLOv5 for small object detection. J. Visual Communication Image Representation 90, 103752. doi: 10.1016/j.jvcir.2023.103752

Wei, X., Li, Z., and Wang, Y. (2025). SED-YOLO based multi-scale attention for small object detection in remote sensing. Sci. Rep. 15, 3125. doi: 10.1038/s41598-025-87199-x

Wu, Z.-w., Liu, M.-h., Sun, C.-x., and Wang, X.-f. (2023). A dataset of tomato fruits images for object detection in the complex lighting environment of plant factories. Data Brief 48, 109291. doi: 10.1016/j.dib.2023.109291

Wu, M., Qiu, Y., Wang, W., Su, X., Cao, Y., and Bai, Y. (2025). Improved RT-DETR and its application to fruit ripeness detection. Front. Plant Sci. 16. doi: 10.3389/fpls.2025.1423682

Yang, J., Liu, S., Wu, J., Su, X., Hai, N., and Huang, X. (2025). “Pinwheel-shaped convolution and scale-based dynamic loss for infrared small target detection,” in Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence and Thirty-Seventh Conference on Innovative Applications of Artificial Intelligence and Fifteenth Symposium on Educational Advances in Artificial Intelligence (Philadelphia, PA: AAAI Press), 6896–6904. doi: 10.1609/aaai.v39i9.32996

Yuan, X., Zheng, Z., Li, Y., Liu, X., Liu, L., Li, X., et al. (2025). Strip R-CNN: large strip convolution for remote sensing object detection. arXiv preprint arXiv:2501.03775. doi: 10.48550/arXiv.2501.03775

Zhang, F., Jiao, L., Li, L., Liu, F., and Liu, X. (2020). MultiResolution attention extractor for small object detection. arXiv preprint arXiv:2006.05941.

Zhang, H., Li, M., Miao, D., Pedrycz, W., Wang, Z., and Jiang, M. (2023). Construction of a feature enhancement network for small object detection. Pattern Recognition 143, 109801. doi: 10.1016/j.patcog.2023.109801

Zhao, J., Du, C., Li, Y., Mudhsh, M., Guo, D., Fan, Y., et al. (2024). YOLO-Granada: a lightweight attentioned Yolo for pomegranates fruit detection. Sci. Rep. 14, 16848. doi: 10.1038/s41598-024-67526-4

Keywords: bayberry, object detection, complex environment, multi-scale feature fusion, deep learning

Citation: Zhou C, Zhang Y, Fu W, Yao L and Yin C (2025) MDE-DETR: multi-domain enhanced feature fusion algorithm for bayberry detection and counting in complex orchards. Front. Plant Sci. 16:1711545. doi: 10.3389/fpls.2025.1711545

Received: 23 September 2025; Revised: 09 November 2025; Accepted: 12 November 2025;
Published: 27 November 2025.

Edited by:

Mario Cunha, University of Porto, Portugal

Reviewed by:

Zhenguo Zhang, Xinjiang Agricultural University, China
Yeison Alberto Garcés-Gómez, Catholic University of Manizales, Colombia

Copyright © 2025 Zhou, Zhang, Fu, Yao and Yin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chengliang Yin, 254771778@qq.com
