Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med.

Sec. Translational Medicine

Volume 12 - 2025 | doi: 10.3389/fmed.2025.1593662

Prediction Model for Carotid Intima-Media Thickness Progression via Machine Learning and SHAP Analysis: A Three-Year Prospective Cohort Study

Provisionally accepted
An  ZhouAn Zhou1Kui  ChenKui Chen2yonghui  weiyonghui wei1qun  yequn ye1Yuanming  XiaoYuanming Xiao2Rong  ShiRong Shi1Jiangang  WangJiangang Wang2*Wei-Dong  LiWei-Dong Li1*
  • 1Department of Genetics, College of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
  • 2Health Management Center, Xiangya Hospital, Central South University, Hunan, Hunan Province, China

The final, formatted version of the article will be published soon.

Background: Carotid intima-media thickness (CIMT) serves as a simple,noninvasive examination method for cardiovascular disease risk assessment and clinical treatment decision guidance. By predicting CIMT progression, we can advance the prediction window for atherosclerotic cardiovascular disease (ASCVD), enabling earlier detection and treatment to reduce morbidity risk. Objective: To develop and compare machine learning models for predicting CIMT thickening (defined as an absolute threshold ≥1.0 mm or significant progression ≥0.1 mm increase) on the basis of routine clinical biochemical parameters. Methods: From 128,938 records (31,158 individuals) collected since 2015 from the "XiangYa Third Hospital of Central South University Health Examination Cohort," 904 participants who completed three consecutive annual CIMT measurements were included. CIMT thickening was defined as a final examination CIMT ≥1.0 mm or an increase ≥0.1 mm across consecutive measurements. After Boruta algorithm preprocessing and feature selection, seven machine learning models (logistic regression, random forest, XGBoost, support vector machine, elastic net, decision tree, and neural network) were developed and evaluated in the test cohort. Platt scaling was applied to optimize calibration, and decision curve analysis (DCA) was used to assess clinical utility.Results: Among 904 participants, 227 (25.1%) developed CIMT thickening during follow-up. The elastic net model demonstrated superior performance (AUC=0.763, sensitivity=0.706, specificity=0.700). Standard calibration analysis revealed significant calibration bias in all the models, with expected calibration errors (ECEs) ranging from 0.1368--0.2889. After Platt scaling, the calibration improved significantly (ECE was reduced to 0.0131--0.0617), representing calibration error reductions of 71.7%-95.5%. DCA revealed that across a broad threshold probability range (0.2--0.6), the elastic net and support vector machine models provided greater net benefits than the "treat all" or "treat none" strategies did. Model interpretability was examined via SHapley Additive exPlanations (SHAP) analysis. Conclusion: Our machine learning approach effectively predicts CIMT progression within three years.The Platt-scaled elastic net model demonstrated optimal performance in both discrimination and calibration. SHAP analysis provides clinical interpretability by identifying key risk factors. DCA confirmed the models' clinical utility across reasonable threshold probability ranges. This predictive tool could facilitate targeted early intervention strategies by identifying individuals at highest risk for CIMT progression, thereby preventing cardiovascular disease.

Keywords: CIMT, machine learning, SHAP analysis, Cardiovascular disease prevention, cardiovascular prevention

Received: 14 Mar 2025; Accepted: 28 May 2025.

Copyright: © 2025 Zhou, Chen, wei, ye, Xiao, Shi, Wang and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Jiangang Wang, Health Management Center, Xiangya Hospital, Central South University, Hunan, 410008, Hunan Province, China
Wei-Dong Li, Department of Genetics, College of Basic Medical Sciences, Tianjin Medical University, Tianjin, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.