Comparative Machine Learning to Predict Acute Kidney Injury in Traumatic Brain Injury: A MIMIC-IV Cohort with SHAP Interpretation

Gu, Zhen  xing; Qian, Kang; Wang, Xigang; Li, Ming; Zhang, Bo

doi:10.3389/fmed.2026.1712221

ORIGINAL RESEARCH article

Front. Med.

Sec. Intensive Care Medicine and Anesthesiology

Comparative Machine Learning to Predict Acute Kidney Injury in Traumatic Brain Injury: A MIMIC-IV Cohort with SHAP Interpretation

Zhen xing Gu

Kang Qian

Xigang Wang

Ming Li

Bo Zhang

The Second Hospital of Nanjing, Nanjing, China

Article metrics

View details

136

Views

The final, formatted version of the article will be published soon.

Abstract

Background:AKI is a frequent and severe complication among TBI patients. Accurate early prediction is critical but remains challenging in ICU practice. Methods:We retrospectively analyzed the MIMIC-IV database. After screening 85,242 first ICU admissions and applying exclusions, 2,986 TBI patients were included. AKI was defined by KDIGO criteria. Demographic, physiological, laboratory, and intervention variables were extracted, preprocessed, and imputed. Predictors were selected using LASSO, Boruta, and logistic regression with bootstrap validation. Seven ML models (LR, DT, RF, XGBoost, LightGBM, SVM, ANN) were trained on 70% of the cohort and validated on 30%, with hyperparameters optimized by grid search and fivefold CV. Performance was assessed by AUC, calibration, DCA, accuracy, sensitivity, specificity, PPV, NPV, and F1-score. SHAP was applied to the best-performing model (XGBoost) for global and individual interpretability. Results:Of the 2,986 TBI patients, 2,045 (68.5%) developed AKI. AKI patients were older, heavier, and had higher glucose, sodium, SBP, and temperature, with lower urine output and more frequent ventilation. Feature selection consistently retained urine output, ventilation, weight, age, glucose, sodium, SBP, and temperature as core predictors. In validation, XGBoost showed the best performance (AUC 0.775, 95% CI 0.747–0.802; accuracy 74.4%; sensitivity 88.3%; F1-score 0.83), followed by RF (AUC 0.768; sensitivity 85.9%; F1-score 0.82). LR had moderate discrimination (AUC 0.763) but poor specificity (36.5%), while LightGBM achieved the highest specificity (50.4%) but lower AUC (0.741). DT performed worst (AUC 0.728; accuracy 69.3%). Calibration and DCA supported XGBoost as having the greatest clinical benefit. SHAP analysis of XGBoost identified urine output and ventilation as dominant predictors and provided patient-level explanations consistent with observed clinical patterns. Conclusions:Ensemble ML models, particularly XGBoost, demonstrated robust predictive power, outperforming LR and DT. The XGBoost model combined high discrimination, calibration, and interpretability, offering a clinically applicable tool for early AKI risk stratification in TBI.

Summary

Keywords

Acute Kidney Injury, machine learning, MIMIC-IV (3.1) database, SHAP (Shapley Additive explanation), Traumatic Brain Injury

Received

24 September 2025

Accepted

13 January 2026

© 2026 Gu, Qian, Wang, Li and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ming Li; Bo Zhang

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

ORIGINAL RESEARCH article

Comparative Machine Learning to Predict Acute Kidney Injury in Traumatic Brain Injury: A MIMIC-IV Cohort with SHAP Interpretation

Abstract

Summary

Outline

Share article

Article metrics