ORIGINAL RESEARCH article
Front. Endocrinol.
Sec. Clinical Diabetes
Volume 16 - 2025 | doi: 10.3389/fendo.2025.1684558
Explainable Machine Learning Model for Classifying Atherosclerotic Cardiovascular Disease in Patients with Metabolic Dysfunction-Associated Steatotic Liver Disease
Provisionally accepted- 1Department of Cardiology, The Affiliated Hospital of Qingdao University, Shandong, China
- 2Department of Gastroenterology, The Affiliated Hospital of Qingdao University, Shandong, China
- 3Department of Emergency, The Affiliated Hospital of Qingdao University, Shandong, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Background: Cardiovascular disease (CVD) is the leading cause of mortality in patients with metabolic dysfunction-associated steatotic liver disease (MASLD), yet traditional risk predictors remain limited in clinical practice. Objective: To develop machine learning (ML) models for classifying prevalent atherosclerotic cardiovascular disease (ASCVD) risk in MASLD patients, and to enhance model interpretability using SHapley Additive exPlanations (SHAP). Methods: This retrospective study included 590 MASLD patients diagnosed at the Affiliated Hospital of Qingdao University between December 2019 and December 2024. Patients were randomly divided into a training set (n=413) and a validation set (n=177), and further stratified based on ASCVD status. Least absolute shrinkage and selection operator (LASSO) regression was used for feature selection. Six ML models were developed and evaluated using sensitivity, specificity, accuracy, area under the receiver operating characteristic curve (AUC), and F1 score. SHAP analysis was performed to interpret feature contributions. Results: ASCVD was present in 434 of 590 patients (73.6%). The Gradient Boosting (GB) model achieved the best performance, with AUCs of 0.918 (95% CI: 0.890– 0.944) in the training set and 0.817 (95% CI: 0.739 – 0.883) in the validation set. SHAP analysis identified the top predictors as the Cholesterol–HDL–Glucose (CHG) index, Castelli Risk Index II (CRI-II), lipoprotein(a) [Lp(a)], serum creatinine (Scr), and uric acid (UA). Conclusion: The GB model demonstrated strong high accuracy in identifying existing ASCVD in MASLD patients and may serve as a useful tool for early risk stratification in clinical settings.
Keywords: Metabolic dysfunction-associated steatotic liver disease, Atherosclerotic cardiovascular disease, Composite Metabolic Index, machine learning, SHAP interpretability model
Received: 12 Aug 2025; Accepted: 13 Oct 2025.
Copyright: © 2025 Li, Chen, Ren, Wang, Ruan, Wang and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Wenzhong Zhang, xxmczwz@qdu.edu.cn
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.