Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Plant Sci.

Sec. Sustainable and Intelligent Phytoprotection

Volume 16 - 2025 | doi: 10.3389/fpls.2025.1526142

MOSSNet : Multiscale and Oriented Sorghum Spike Detection and Counting in UAV images

Provisionally accepted
Jianqing  ZhaoJianqing Zhao1Zhiyin  JiaoZhiyin Jiao2Jinping  WangJinping Wang2Zhifang  WangZhifang Wang2Yongchao  GuoYongchao Guo2Ying  ZhouYing Zhou1Shiyi  ChenShiyi Chen2Wenjie  WuWenjie Wu1Yannan  ShiYannan Shi2*Peng  LVPeng LV2*
  • 1Jiangsu Second Normal University, Nanjing, China
  • 2Institute of Millet Crops, Hebei Academy of Agriculture and Forestry Sciences, Shijiazhuang, Hebei Province, China

The final, formatted version of the article will be published soon.

Accurate sorghum spike detection is critical for monitoring growth conditions, accurately predicting yield, and ensuring food security. Deep learning models have improved the accuracy of spike detection thanks to advances in artificial intelligence. However, the dense distribution of sorghum spikes, variable sizes and complex background information in UAV images make detection and counting difficult. Existing horizontal detection methods often results in overlapping detection boxes during detection and counting, which includes large amounts of background information. This study proposes a multiscale and oriented sorghum spike detection and counting model in UAV images. The model creates a Deformable Convolution Spatial Attention (DCSA) module to improve the network's ability to capture small sorghum spike features. It also integrates circular smooth labels to enable the network to extract directional information and effectively represent morphological features. The model also employs a Wise IoU-based localization loss function to improve network loss for bounding box alignment under complex field conditions. To further refine detection results, we remove redundant oriented detection boxes, and provide detailed output on spike size, position, angle, and class information. The results show that MOSSNet achieves a mean average precision (mAP) of 90.3% for sorghum spike detection, with a RMSE and MAE of 9.3 and 8.1, respectively. These results demonstrate the effectiveness of MOSSNet in complex field environments with significant variability and severe occlusion, providing a valuable technical reference for sorghum phenotyping.

Keywords: sorghum spike, oriented detection boxes, angle feature, deep learning, UAV

Received: 11 Nov 2024; Accepted: 29 Jul 2025.

Copyright: © 2025 Zhao, Jiao, Wang, Wang, Guo, Zhou, Chen, Wu, Shi and LV. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Yannan Shi, Institute of Millet Crops, Hebei Academy of Agriculture and Forestry Sciences, Shijiazhuang, Hebei Province, China
Peng LV, Institute of Millet Crops, Hebei Academy of Agriculture and Forestry Sciences, Shijiazhuang, Hebei Province, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.