Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Neurol.

Sec. Neurotrauma

This article is part of the Research TopicEquity in Global NeurosurgeryView all articles

Multimodal Machine Learning for Predicting Postoperative Functional Outcomes in Surgically Treated Supratentorial Deep Intracerebral Hemorrhage: A Prospective Multicenter Study

Provisionally accepted
Min  CuiMin Cui1Yanyi  Yanyi LiuYanyi Yanyi Liu2Qi  HeQi He1Weiming  XiongWeiming Xiong1Yang  LiuYang Liu1Lei  XuLei Xu1Yongbing  DengYongbing Deng1Xingwei  TanXingwei Tan1,3*
  • 1Chongqing Emergency Medical Center, Chongqing, China
  • 2The Affiliated Yongchuan Hospital of Chongqing Medical University, Chongqing, China
  • 3The Third Affiliated Hospital of Chongqing Medical University, Chongqing, China

The final, formatted version of the article will be published soon.

Background: Early prediction of functional outcomes after surgery for spontaneous supratentorial deep intracerebral hemorrhage (sICH) remains difficult. This study developed and validated multimodal machine-learning models incorporating clinical, imaging, physiological, and biomarker data, including temperature management strategies, and explored interpretability using SHAP. Methods: This prospective multicenter cohort enrolled 285 surgically treated sICH patients. Outcome was defined as favorable (mRS 0–3) versus unfavorable (mRS 4–6). Data were split by stratified random sampling into a training set (n=199) and a test set (n=86). LASSO with 10-fold cross-validation (1-SE rule) selected key predictors. Five classifiers (Random Forest, neural network, decision tree, k-nearest neighbors, naïve Bayes) were trained with 10-fold cross-validation and evaluated on the test set. Performance was assessed using AUC (95% CI) and standard classification metrics; AUCs were compared by DeLong's test. SHAP was applied to the best model. Results: LASSO identified eight predictors: admission GCS, hematoma volume, TNF-α, GFAP, IL-1β, admission NIHSS, mean body temperature, and peak ICP. On the test set, Random Forest achieved the highest performance (AUC 0.883, 95% CI 0.829–0.937; accuracy 0.824; F1-score 0.836), with no significant AUC difference versus the neural network (AUC 0.867; P=0.312). SHAP ranked admission GCS and hematoma volume as the most important features, followed by TNF-α and GFAP. Conclusions: A multimodal Random Forest model provided good discrimination for predicting postoperative functional outcomes in surgically treated sICH, and SHAP improved interpretability by quantifying feature contributions.

Keywords: biomarkers, intracerebral hemorrhage, machine learning, Modified Rankin scale, random forest, Shap, supratentorial deep hemorrhage

Received: 24 Dec 2025; Accepted: 11 Feb 2026.

Copyright: © 2026 Cui, Yanyi Liu, He, Xiong, Liu, Xu, Deng and Tan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Xingwei Tan

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.