Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Neurorobot.

Volume 19 - 2025 | doi: 10.3389/fnbot.2025.1633697

RSA-TransUNet:A Robust Structure-Adaptive TransUNet for Enhanced Road Crack Segmentation

Provisionally accepted
Liling  HouLiling Hou1Fei  YuFei Yu2*Yaowen  HuYaowen Hu3Yang  HuYang Hu4Ruoli  YangRuoli Yang3
  • 1Zhangzhou Institute of Technology, Zhangzhou, China
  • 2Minnan Normal University, Zhangzhou, China
  • 3National University of Defense Technology, Changsha, China
  • 4Central South University of Forestry and Technology, Changsha, China

The final, formatted version of the article will be published soon.

With the advancement of deep learning, road crack segmentation has become increasingly crucial for intelligent transportation safety. Despite notable progress, existing methods still face challenges in capturing fine-grained textures in small crack regions, handling blurred edges and significant width variations, and performing multi-class segmentation. Moreover, the high computational cost of training such models hinders their practical deployment.To tackle these limitations, we propose RSA-TransUNet, a novel model for road crack segmentation. At its core is the Axial-shift MLP Attention (ASMA) mechanism, which integrates axial perception with sparse contextual modeling. Through multi-path axial perturbations and an attention-guided structure, ASMA effectively captures longrange dependencies within row-column patterns, enabling detailed modeling of multi-scale crack features.To improve the model's adaptability to structural irregularities, we introduce the Adaptive Spline Linear Unit (ASLU), which enhances the model's capacity to represent nonlinear transformations. ASLU improves responsiveness to microstructural variations, morphological distortions, and local discontinuities, thereby boosting robustness across different domains.We further develop a Structure-aware Multi-stage Evolutionary Optimization (SMEO) strategy, which guides the training process through three phases: structural perception exploration, feature stability enhancement, and global perturbation. This strategy combines breadth sampling, convergence compression, and local escape mechanisms to improve convergence speed, global search efficiency, and generalization performance.Extensive evaluations on the Crack500, CFD, and DeepCrack datasets-including ablation studies and comparative experiments-demonstrate that RSA-TransUNet achieves superior segmentation accuracy and robustness in complex road environments, highlighting its potential for real-world applications.

Keywords: Road Crack segmentation, Axial-shift MLP Attention, Adaptive Spline Linear Unit, Structure-aware Multi-stage, Evolutionary optimization

Received: 23 May 2025; Accepted: 25 Aug 2025.

Copyright: © 2025 Hou, Yu, Hu, Hu and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Fei Yu, Minnan Normal University, Zhangzhou, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.