ORIGINAL RESEARCH article
Front. Med.
Sec. Intensive Care Medicine and Anesthesiology
Volume 12 - 2025 | doi: 10.3389/fmed.2025.1681621
This article is part of the Research TopicAdvances in Critical Care Blood PurificationView all articles
Early Prediction of Sepsis-Induced Coagulopathy in the ICU Using Interpretable Machine Learning: A Multi-Center Retrospective Cohort Study
Provisionally accepted- Huadong Hospital, Fudan University, Shanghai, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Background: Sepsis-induced coagulopathy (SIC) is a fatal complication in ICU patients, yet early risk prediction remains challenging. This study aimed to develop an interpretable machine learning model for predicting SIC within seven days of ICU admission. Methods: Clinical data for model development were retrieved from the Medical Information Mart for Intensive Care-IV (MIMIC-IV) database. Feature selection was performed using three distinct algorithms: least absolute shrinkage and selection operator (LASSO) regression, random forest recursive feature elimination (RF-RFE), and the Boruta method. Ten machine learning models underwent training employing 5-fold cross-validation on the training subset, with subsequent evaluation on the validation subset encompassing discrimination, calibration, and clinical utility metrics. The optimal model underwent further interpretability analysis through SHapley Additive exPlanations (SHAP) to elucidate variable contributions and their directional effects. External validation was then conducted using the electronic Intensive Care Unit Collaborative Research Database (eICU-CRD). Finally, the best-performing model was implemented as a web-based Shiny application featuring an interactive interface. Results: Among 10,740 patients in MIMIC-IV, 2,232 (20.78%) developed SIC within 7 days post-ICU admission. A LightGBM model with thirteen variables demonstrated optimal performance, achieving an area under the receiver operating characteristic curve (AUROC) of 0.885 (95% confidence interval (CI): 0.874–0.897) in the internal validation set and 0.831 (95% CI: 0.819–0.843) in the external eICU-CRD cohort. Key predictive variables included Prothrombin Time-International Normalization Ratio (INR), platelet count, Sequential Organ Failure Assessment (SOFA), lactate, systolic blood pressure (SBP), red cell distribution width (RDW), bicarbonate, phosphate, hemoglobin, age, the presence of heart failure (HF), ischemic heart disease (IHD) and the use of continuous renal replacement therapy (CRRT). The model was deployed as a clinician-oriented web application providing an accessible interface (https://shatao.shinyapps.io/Sepsis_Induced_Coagulopathy/). Conclusions: This model demonstrated strong predictive ability and clinical interpretability, enabling early SIC identification and targeted intervention.
Keywords: Sepsis-induced coagulopathy, machine learning, predictive models, Early prediction, MIMIC-IV database
Received: 14 Aug 2025; Accepted: 22 Oct 2025.
Copyright: © 2025 Sha, Jiang and Feng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Lei Feng, hdfenglei@fudan.edu.cn
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.