ORIGINAL RESEARCH article
Front. Med.
Sec. Intensive Care Medicine and Anesthesiology
Comparative Machine Learning to Predict Acute Kidney Injury in Traumatic Brain Injury: A MIMIC-IV Cohort with SHAP Interpretation
Zhen xing Gu
Kang Qian
Xigang Wang
Ming Li
Bo Zhang
The Second Hospital of Nanjing, Nanjing, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Abstract
Background:AKI is a frequent and severe complication among TBI patients. Accurate early prediction is critical but remains challenging in ICU practice. Methods:We retrospectively analyzed the MIMIC-IV database. After screening 85,242 first ICU admissions and applying exclusions, 2,986 TBI patients were included. AKI was defined by KDIGO criteria. Demographic, physiological, laboratory, and intervention variables were extracted, preprocessed, and imputed. Predictors were selected using LASSO, Boruta, and logistic regression with bootstrap validation. Seven ML models (LR, DT, RF, XGBoost, LightGBM, SVM, ANN) were trained on 70% of the cohort and validated on 30%, with hyperparameters optimized by grid search and fivefold CV. Performance was assessed by AUC, calibration, DCA, accuracy, sensitivity, specificity, PPV, NPV, and F1-score. SHAP was applied to the best-performing model (XGBoost) for global and individual interpretability. Results:Of the 2,986 TBI patients, 2,045 (68.5%) developed AKI. AKI patients were older, heavier, and had higher glucose, sodium, SBP, and temperature, with lower urine output and more frequent ventilation. Feature selection consistently retained urine output, ventilation, weight, age, glucose, sodium, SBP, and temperature as core predictors. In validation, XGBoost showed the best performance (AUC 0.775, 95% CI 0.747–0.802; accuracy 74.4%; sensitivity 88.3%; F1-score 0.83), followed by RF (AUC 0.768; sensitivity 85.9%; F1-score 0.82). LR had moderate discrimination (AUC 0.763) but poor specificity (36.5%), while LightGBM achieved the highest specificity (50.4%) but lower AUC (0.741). DT performed worst (AUC 0.728; accuracy 69.3%). Calibration and DCA supported XGBoost as having the greatest clinical benefit. SHAP analysis of XGBoost identified urine output and ventilation as dominant predictors and provided patient-level explanations consistent with observed clinical patterns. Conclusions:Ensemble ML models, particularly XGBoost, demonstrated robust predictive power, outperforming LR and DT. The XGBoost model combined high discrimination, calibration, and interpretability, offering a clinically applicable tool for early AKI risk stratification in TBI.
Summary
Keywords
Acute Kidney Injury, machine learning, MIMIC-IV (3.1) database, SHAP (Shapley Additive explanation), Traumatic Brain Injury
Received
24 September 2025
Accepted
13 January 2026
Copyright
© 2026 Gu, Qian, Wang, Li and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ming Li; Bo Zhang
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.