CMNet: an asymmetric dual-branch network for accurate cotton segmentation

Zhang, Gengrong; Abudukelimu, Halidanmu; Musideke, Mayilamu; Wu, Shuqin; Abulizi, Abudukelimu; Guo, Cuiqin; Zhang, Yajun

doi:10.3389/fpls.2025.1692647

ORIGINAL RESEARCH article

Front. Plant Sci.

Sec. Sustainable and Intelligent Phytoprotection

This article is part of the Research TopicIntegrating Visual Sensing and Machine Learning for Advancements in Plant Phenotyping and Precision AgricultureView all 9 articles

CMNet: an asymmetric dual-branch network for accurate cotton segmentation

Provisionally accepted

Gengrong Zhang¹

Halidanmu Abudukelimu^1*

Mayilamu Musideke¹

Shuqin Wu¹

Abudukelimu Abulizi¹

Cuiqin Guo¹

Yajun Zhang²

¹Xinjiang University of Finance and Economics, Ürümqi, China
²Xinjiang University, Urumqi, China

The final, formatted version of the article will be published soon.

In agricultural automation, precise cotton segmentation is a key step for tasks such as intelligent harvesting and yield estimation. However, in complex field environments, factors such as background interference and irregular target shapes severely affect segmentation accuracy. Existing deep learning methods offer certain advantages but still generally suffer from limitations including insufficient accuracy, over-segmentation, and misidentification. To address these challenges, this study proposes a novel dual-branch cotton segmentation network, Cotton-aware Mamba-enhanced UNet (CMNet), which optimizes the ParaTransCNN architecture by incorporating the 2D Selective Scan (SS2D) module to replace the original Transformer branch, effectively balancing the extraction of local details and global semantic information while reducing computational burden. To enhance the model's perception of irregularly shaped cotton, a Deformable Convolutional Networks v1 (DCNv1) module is integrated into the Vision Mamba (VMamba) branch, further improving the delineation of target boundaries. Additionally, an Atrous Spatial Pyramid Pooling (ASPP) module is introduced at the end of the Convolutional Neural Network (CNN) branch to strengthen multi-scale feature representation. To optimize the fusion of channel and spatial information, the Spatial and Channel Squeeze-and-Excitation (scSE) attention mechanism replaces the original module, enhancing feature modeling capability. Experimental results on an in-field cotton image dataset demonstrate that CMNet outperforms existing mainstream methods, achieving Dice, mIoU, and Accuracy of 91.06%, 84.18%, and 98.10%, respectively, while reducing parameter count and computational complexity, thus exhibiting excellent performance. Furthermore, generalization experiments on multiple other plant datasets also achieved outstanding results, validating the model's adaptability and potential for broader applications in multi-crop segmentation tasks, providing valuable insights for smart agriculture segmentation research. The source code and dataset of this work are publicly available at https://github.com/halidanmu/CMNet.git.

Keywords: Dual branch, cotton segmentation, SS2D, deformable convolution, SCSE

Received: 26 Aug 2025; Accepted: 28 Nov 2025.

Copyright: © 2025 Zhang, Abudukelimu, Musideke, Wu, Abulizi, Guo and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Halidanmu Abudukelimu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.