Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Pattern Recognition

Volume 8 - 2025 | doi: 10.3389/frai.2025.1675154

Optimizing Surface Defect Detection with YOLOv9: The Role of Advanced Backbone Models

Provisionally accepted
  • 1D'Amore-McKim School of Business, Northeastern University, Boston, United States
  • 2School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China
  • 3Shenzhen Institute for Advanced Study, University of Electronic Science and Technology of China, Shenzhen, China

The final, formatted version of the article will be published soon.

YOLO algorithmic models are widely utilised for detecting surface defects, offering a robust and efficient approach to identifying various flaws and imperfections on material surfaces. In this study, we explore the integration of six distinct backbone networks within the YOLOv9 framework, specifically to optimise surface defect detection in steel strips. Specifically, we improve the YOLOv9 framework by integrating six representative backbone networks—ResNet50, GhostNet, MobileNetV4, FasterNet, StarNet, and RepViT—and conduct a systematic evaluation on the NEU-DET dataset and the GC10-DET dataset. Using YOLOv9-C as the baseline, we compare these backbones in terms of detection accuracy, computational complexity, and model efficiency. Results show that RepViT achieves the best overall performance with an mAP50 of 68.8%, F1-score of 0.65, and a balanced precision-recall profile, while GhostNet offers superior computational efficiency with only 41.2M parameters and 190.2 GFLOPs. Further validation on YOLOv5-m confirms the consistency of the results. The study offers practical guidance for backbone selection in surface defect detection tasks, highlighting the advantages of lightweight architectures for real-time industrial applications.

Keywords: Defect detection, machine learning, YOLOv9, Industrial quality control, neural networks

Received: 29 Jul 2025; Accepted: 24 Sep 2025.

Copyright: © 2025 Zeng, Wang, Yao, Dong and Cai. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Hongyang Wang, defwhy@foxmail.com
Zile Dong, ziledong@uestc.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.