ORIGINAL RESEARCH article
Front. Cardiovasc. Med.
Sec. Precision Cardiology
This article is part of the Research TopicHarnessing Machine Learning for Enhanced Biomedical Diagnosis and Early Disease Detection: Bridging Data Science and HealthcareView all 9 articles
Explainable machine learning model of Coronary Artery Disease Combined with Diabetes: Development and validation study
Provisionally accepted- 1Department of Emergency ICU, The Affiliated Hospital of Qinghai University, Xining City, China
- 2The First Hospital of Jilin University Department of Cardiology, Changchun, China
- 3Lanzhou University Second Hospital Department of Neurology, Lanzhou, China
- 4Department of Biobank,Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University,, Guangzhou, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Background: Coronary artery disease (CAD) demonstrates a strong bidirectional association with diabetes mellitus, which not only elevates cardiovascular disease risk but also correlates with poorer clinical prognosis. Prognostication in patients with comorbid CAD and diabetes remains a critical clinical challenge, significantly influencing therapeutic decision-making. Leveraging readily available clinical parameters for predicting adverse outcomes in this population offers substantial clinical value. This investigation employs machine learning algorithms to develop predictive models for prognostic assessment in CAD patients with diabetes comorbidity. Method: We conducted a retrospective cohort study of 389 patients with comorbid coronary artery disease (CAD) and diabetes mellitus. The cohort was randomly allocated into a training set (n = 273) and an internal validation set (n = 116). Feature selection utilized LASSO regression followed by backward stepwise Cox regression analysis. A nomogram incorporating independent predictors was developed for clinical application. Model performance was assessed through discrimination metrics, calibration plots, and decision curve analysis (DCA). Random survival forest analysis validated the clinical significance of selected variables. Result: Our modeling approach employed a systematic methodology: LASSO regression for initial feature selection followed by backward stepwise Cox regression analysis, yielding eight independent predictors.The final model incorporated hemoglobin, INR, albumin, NT-proBNP, age, fibrinogen, diuretic use, and digitalis therapy. The integrated model demonstrated strong discriminative performance for mortality prediction across both training (AUC = 0.846, 0.838, 0.82) and validation cohorts (AUC = 0.824, 0.813, 0.798) at 3-, 5-, and 8-year intervals. Calibration plots and decision curve analysis confirmed model reliability and clinical utility over time. A nomogram was developed to facilitate individualized risk stratification. Kaplan-Meier analysis showed significant survival stratification by individual predictors, and restricted cubic spline analysis identified non-linear associations between continuous variables and mortality. Random survival forest analysis prioritized five key predictors (hemoglobin, INR, albumin, NT-proBNP, age). Comparative evaluation against the 9-variable model confirmed superior performance of the comprehensive model across all timepoints. Conclusion: Our multimodal prognostic model demonstrated robust performance in predicting all-cause mortality among patients with coronary artery disease and diabetes comorbidity. The nomogram's capacity for personalized risk estimation offers potential utility in clinical decision-making and patient stratification.
Keywords: Coronary Artery Disease, Diabetes Mellitus, machine learning, Mortality, Prognostic model
Received: 27 Jul 2025; Accepted: 04 Dec 2025.
Copyright: © 2025 Wang, Wang, Liu, Xie, Liang, Wang and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Biyun Wang
Yuke Zhang
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
