Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Endocrinol.

Sec. Clinical Diabetes

An Interpretable Machine Learning Model for Detecting Vision-Threatening Diabetic Retinopathy Among Patients with Diabetic Retinopathy: A Web-Based Cross-Sectional Study

Provisionally accepted
Mingyang  SongMingyang SongMeng  ShiMeng Shi*
  • The First Affiliated Hospital of China Medical University, Shenyang, China

The final, formatted version of the article will be published soon.

Abstract Background: Vision-threatening diabetic retinopathy (VTDR) is a severe complication of type 2 diabetes mellitus (T2DM), particularly prevalent in patients with prolonged disease duration, poor glycemic control, and systemic comorbidities. This condition frequently progresses asymptomatically toward irreversible blindness without timely intervention. The early identification of VTDR is challenging due to the lack of validated biomarkers and a reliance on subjective clinical assessments. This study aimed to develop and validate an interpretable machine learning (ML) model to detect VTDR among patients with diabetic retinopathy (DR). Methods: Retrospective clinical data from T2DM patients with DR were extracted from the electronic medical records at our hospital and categorized into VTDR and non-VTDR (defined as mild-to-moderate non-proliferative diabetic retinopathy) groups. The dataset was partitioned into training and testing sets (7:3 ratio). Eight ML models were trained and evaluated using metrics such as Area Under the Curve (AUC), accuracy, and recall. Model performance was evaluated using a comprehensive scoring system (total score = 64). Shapley Additive Explanations (SHAP) were used to interpret the best-performing model. A web-based application was developed to demonstrate potential clinical utility. Results: Among 1,124 enrolled patients, the prevalence of VTDR was 36.9%. Key associated factors included diabetic treatment, T2DM duration, glycated hemoglobin levels, albuminuria, and anemia. The Support Vector Machine (SVM) model demonstrated superior performance, with an AUC of 0.879, accuracy of 0.837, precision of 0.833, Brier score of 0.129, and an F1 score of 0.756, outperforming the other ML models. The SVM model achieved the highest total score (57/64) in the testing cohort. Furthermore, decision curve analysis and calibration curves confirmed the robustness and reliability of the models. A simplified calculator derived from the SHAP feature importance rankings maintained strong diagnostic capacity. Conclusion: The interpretable SVM model effectively detected VTDR among patients with DR using routine clinical data. While requiring external validation, this study serves as a proof-of-concept for a cost-effective screening tool that could assist clinicians in prioritizing high-risk patients and facilitating early intervention to prevent irreversible vision impairment.

Keywords: Detection model, machine learning, Non-vision-threatening retinopathy, Shap, type 2 diabetes, Vision-threatening diabetic retinopathy

Received: 27 Dec 2025; Accepted: 16 Feb 2026.

Copyright: © 2026 Song and Shi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Meng Shi

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.