Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med.

Sec. Intensive Care Medicine and Anesthesiology

Volume 12 - 2025 | doi: 10.3389/fmed.2025.1681621

This article is part of the Research TopicAdvances in Critical Care Blood PurificationView all articles

Early Prediction of Sepsis-Induced Coagulopathy in the ICU Using Interpretable Machine Learning: A Multi-Center Retrospective Cohort Study

Provisionally accepted
Tao  ShaTao ShaHao  JiangHao JiangLei  FengLei Feng*
  • Huadong Hospital, Fudan University, Shanghai, China

The final, formatted version of the article will be published soon.

Background: Sepsis-induced coagulopathy (SIC) is a fatal complication in ICU patients, yet early risk prediction remains challenging. This study aimed to develop an interpretable machine learning model for predicting SIC within seven days of ICU admission. Methods: Clinical data for model development were retrieved from the Medical Information Mart for Intensive Care-IV (MIMIC-IV) database. Feature selection was performed using three distinct algorithms: least absolute shrinkage and selection operator (LASSO) regression, random forest recursive feature elimination (RF-RFE), and the Boruta method. Ten machine learning models underwent training employing 5-fold cross-validation on the training subset, with subsequent evaluation on the validation subset encompassing discrimination, calibration, and clinical utility metrics. The optimal model underwent further interpretability analysis through SHapley Additive exPlanations (SHAP) to elucidate variable contributions and their directional effects. External validation was then conducted using the electronic Intensive Care Unit Collaborative Research Database (eICU-CRD). Finally, the best-performing model was implemented as a web-based Shiny application featuring an interactive interface. Results: Among 10,740 patients in MIMIC-IV, 2,232 (20.78%) developed SIC within 7 days post-ICU admission. A LightGBM model with thirteen variables demonstrated optimal performance, achieving an area under the receiver operating characteristic curve (AUROC) of 0.885 (95% confidence interval (CI): 0.874–0.897) in the internal validation set and 0.831 (95% CI: 0.819–0.843) in the external eICU-CRD cohort. Key predictive variables included Prothrombin Time-International Normalization Ratio (INR), platelet count, Sequential Organ Failure Assessment (SOFA), lactate, systolic blood pressure (SBP), red cell distribution width (RDW), bicarbonate, phosphate, hemoglobin, age, the presence of heart failure (HF), ischemic heart disease (IHD) and the use of continuous renal replacement therapy (CRRT). The model was deployed as a clinician-oriented web application providing an accessible interface (https://shatao.shinyapps.io/Sepsis_Induced_Coagulopathy/). Conclusions: This model demonstrated strong predictive ability and clinical interpretability, enabling early SIC identification and targeted intervention.

Keywords: Sepsis-induced coagulopathy, machine learning, predictive models, Early prediction, MIMIC-IV database

Received: 14 Aug 2025; Accepted: 22 Oct 2025.

Copyright: © 2025 Sha, Jiang and Feng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Lei Feng, hdfenglei@fudan.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.