Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Public Health

Sec. Digital Public Health

This article is part of the Research TopicAdvancing Healthcare AI: Evaluating Accuracy and Future DirectionsView all 16 articles

Development and Validation of an Artificial Intelligence-Assisted System for Automatic Boston Scoring of Bowel Cleanliness in Colonoscopy (with Video)

Provisionally accepted
Jian  ChenJian Chen1*Jingzhi  XuJingzhi Xu2Kaijian  XiaKaijian Xia3Qiuwen  HuaQiuwen Hua2Xiaodan  XuXiaodan Xu1*Ganhong  WangGanhong Wang2*
  • 1Changshu No.1 People's Hospital, Suzhou, China
  • 2Changshu Hospital of Traditional Chinese Medicine, Changshu, China
  • 3First People's Hospital of Changshu City, Changshu, China

The final, formatted version of the article will be published soon.

Background Bowel cleanliness is a critical factor affecting the detection of adenomatous polyps and early tumors. The Boston Bowel Preparation Scale (BBPS), a widely used evaluation tool, has limitations, including interobserver variability and insufficient standardized training. This study aims to develop an artificial intelligence-driven automatic BBPS scoring and teaching system. Methods Colonoscopy image and video data were collected from three centers between June 2019 and August 2024, categorized into different BBPS scores (0, 1, 2, 3), ileocecal part, and instrument operation frames. Transfer learning and fine-tuning were performed on four pre-trained YOLOv11 models. Performance metrics included accuracy, precision, sensitivity, and AUC. Grad-CAM was used to provide visual explanations of the best-performing model, which was further developed into a system capable of real-time and cumulative BBPS assessment for every video frame. Results Among the four models, YOLOv11m performed the best, achieving an accuracy of 99.86%, precision of 99.74%, sensitivity of 99.74%, and an F1 score of 99.75% on the validation set. On the test set, the model attained a weighted average precision of 95.37%, specificity of 98.25%, and an AUC of 0.996. Based on this model, the AutoBBPS system was developed, which automatically initiates real-time cumulative BBPS scoring once the cecum is reached. In image-level human-machine comparison experiments, the system outperformed junior endoscopists in recognition accuracy and was comparable to senior endoscopists. Video-level human-machine comparison experiments further evaluated the accuracy of the AutoBBPS system against endoscopists under varying confidence thresholds. Conclusions The AutoBBPS system, developed using YOLOv11, provides real-time and cumulative BBPS scoring for every video frame, effectively assisting endoscopists in improving scoring efficiency and accuracy. Additionally, the intelligent BBPS teaching assistant is particularly beneficial for junior endoscopists, promoting standardized training and enhancing overall scoring quality.

Keywords: artificial intelligence, YOLO, Colonoscopy, Boston Bowel Preparation Scale, Bowel cleanliness

Received: 18 Sep 2025; Accepted: 31 Oct 2025.

Copyright: © 2025 Chen, Xu, Xia, Hua, Xu and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Jian Chen, szcsdocter@gmail.com
Xiaodan Xu, xxddocter@gmail.com
Ganhong Wang, 651943259@qq.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.