ORIGINAL RESEARCH article
Front. Neurol.
Sec. Artificial Intelligence in Neurology
This article is part of the Research Topic: Precision Medicine in Neurocritical Care
Development of an Interpretable Machine Learning Model for Predicting Venous Thromboembolism in Intensive Care Unit Patients with Intracerebral Hemorrhage
Provisionally accepted
- 1 Qinghai University, Xining, China
- 2 Qinghai Provincial People's Hospital, Xining, China
Background: Venous thromboembolism (VTE) is a frequent and potentially life-threatening complication in patients with intracerebral hemorrhage (ICH) in intensive care units (ICU). However, the necessity of prophylactic anticoagulation therapy for these patients remains controversial. This study aims to develop an interpretable machine learning (ML) model to accurately predict the risk of VTE in critically ill ICH patients, thereby enabling timely and individualized preventive measures.

Methods: A retrospective analysis was performed on clinical data from the MIMIC-IV database and from ICU patients diagnosed with ICH at Qinghai Provincial People's Hospital. After data preprocessing, 1545 cases from the MIMIC-IV database were randomly divided into a training set (1097 cases) and a test set (448 cases) in a 7:3 ratio. Data from 151 ICH patients treated in the ICU of Qinghai Provincial People's Hospital between January 2020 and December 2024 were used as an external validation set. The Least Absolute Shrinkage and Selection Operator (LASSO) algorithm was applied for feature selection. Model performance was assessed using the area under the curve (AUC), decision curve analysis (DCA), accuracy, positive predictive value (PPV), and negative predictive value (NPV). The optimal model was further explained using the SHapley Additive exPlanations (SHAP) method.

Results: The XGBoost model exhibited the best predictive performance, with AUC values of 0.936, 0.778, and 0.761 for the training set, test set, and external validation set, respectively. Feature importance analysis identified the top 10 influential features as ICU stay duration, age, prothrombin time, triglycerides, albumin, body mass index, partial thromboplastin time, blood glucose, white blood cell count, and systolic blood pressure.

Conclusion: The XGBoost model accurately predicts VTE occurrence in ICH patients in the ICU. The SHAP method makes it possible to assess the impact of individual pathophysiological parameters on each patient's prediction, thereby supporting personalized risk stratification and preventive treatment.
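The evaluation metrics named in the Methods (AUC, accuracy, PPV, NPV, and the net benefit that underlies decision curve analysis) can be illustrated with a minimal, dependency-free sketch. This is not the authors' code: the function names, the toy labels and scores, and the 0.5 threshold are all assumptions chosen for illustration, and the net benefit uses the standard definition TP/n - (FP/n) * pt/(1 - pt) at threshold probability pt.

```python
from typing import Dict, Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (TP, FP, TN, FN) for binary labels coded as 0/1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, fp, tn, fn


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Rank-based AUC: the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def threshold_metrics(y_true: Sequence[int], scores: Sequence[float], threshold: float) -> Dict[str, float]:
    """Accuracy, PPV, NPV, and net benefit at one probability threshold."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    y_pred = [1 if s >= threshold else 0 for s in scores]
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    n = len(y_true)
    return {
        "accuracy": (tp + tn) / n,
        "ppv": tp / (tp + fp) if (tp + fp) else float("nan"),
        "npv": tn / (tn + fn) if (tn + fn) else float("nan"),
        # Net benefit weighs false positives by the odds of the threshold.
        "net_benefit": tp / n - (fp / n) * threshold / (1.0 - threshold),
    }


if __name__ == "__main__":
    # Toy data (hypothetical): 1 = VTE, 0 = no VTE; scores are predicted risks.
    toy_labels = [1, 0, 1, 0, 0, 1, 0, 0]
    toy_scores = [0.9, 0.2, 0.6, 0.7, 0.1, 0.4, 0.3, 0.05]
    print("AUC:", round(auc(toy_labels, toy_scores), 3))
    print(threshold_metrics(toy_labels, toy_scores, threshold=0.5))
```

A decision curve is obtained by evaluating the net benefit over a range of thresholds and comparing it with the "treat all" and "treat none" strategies. In practice a library such as scikit-learn would normally be used for AUC; the pairwise version here is only meant to make the definition explicit.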
Keywords: intracerebral hemorrhage, machine learning, prediction model, SHAP, venous thromboembolism, XGBoost
Received: 27 Aug 2025; Accepted: 16 Dec 2025.
Copyright: © 2025 He, Liu, Lu, Lv, Zhang, Jin and Han. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Zhongsheng Lu
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
