AUTHOR=Zhang Zhenze , Zhao Caiqing , Zhou Yijun , Yao Ling , Liu Peng , Fang Ziling , Fang Nian TITLE=Random forest-driven mortality prediction in critical IBD care: a dual-database model integrating comorbidity patterns and real-time physiometrics JOURNAL=Frontiers in Medicine VOLUME=Volume 12 - 2025 YEAR=2025 URL=https://www.frontiersin.org/journals/medicine/articles/10.3389/fmed.2025.1624899 DOI=10.3389/fmed.2025.1624899 ISSN=2296-858X ABSTRACT=BackgroundInflammatory bowel disease (IBD) poses significant mortality risks for critically ill patients requiring intensive care unit (ICU) admission, driven by complications such as malnutrition, thromboembolism, and multi-organ dysfunction. Current prognostic tools for mortality prediction in this population remain limited. Machine learning (ML) offers advantages in handling complex clinical data but has not been systematically applied to this high-risk cohort. This multicenter study aimed to develop and validate ML-based models for mortality risk stratification in critically ill IBD patients using large-scale ICU databases.MethodsData from 551 IBD patients in the MIMIC-IV database (2008–2019) were analyzed, with external validation using the eICU dataset. Nine ML algorithms (XGBoost, logistic regression, LightGBM, random forest, decision tree, elastic net, MLP, KNN, RSVM) were trained to predict 1-year mortality. Predictors included demographics, comorbidities, laboratory parameters, vital signs, and disease severity scores. Missing data (<30%) were imputed using random forest. The cohort was split into training (75%) and internal testing (25%) sets, with hyperparameter optimization via 5-fold cross-validation. Model performance was evaluated using AUC, sensitivity, specificity, and calibration curves. The SHAP framework was integrated with predictive analytics to systematically evaluate key determinants of mortality risk through quantitative feature importance analysis. A nomogram was constructed based on key predictors identified through logistic regression.ResultsThe random forest model achieved superior discrimination in internal validation (AUC > 0.8). Nine predictors were identified: malignancy history, Charlson Comorbidity Index (CCI), Red Cell Distribution Width (Rdw), Glasgow Coma Scale (GCS), Sequential Organ Failure Assessment (Sofa), age, heart rate, weight and gender. The nomogram demonstrated robust external validation performance in the eICU cohort (AUC > 0.8).ConclusionWe developed and validated a machine learning-based nomogram to predict mortality in critically ill IBD patients, integrating interpretable predictors from multicenter ICU data.