Machine learning-based prediction of 6-month functional recovery in hypertensive cerebral hemorrhage: insights from XGBoost and SHAP analysis

He, Menghui; Lu, Zhongsheng; Lv, Yiwei; Cheng, Zihai; Zhang, Qiang; Jin, Xiaoqing; Han, Pei

doi:10.3389/fneur.2025.1608341

ORIGINAL RESEARCH article

Front. Neurol., 04 June 2025

Sec. Artificial Intelligence in Neurology

Volume 16 - 2025 | https://doi.org/10.3389/fneur.2025.1608341

This article is part of the Research Topic: Artificial Intelligence in Neurosurgical Practices: Current Trends and Future Opportunities

Menghui He¹

Zhongsheng Lu²*

Yiwei Lv¹

Zihai Cheng¹

Qiang Zhang²

Xiaoqing Jin²

Pei Han²

¹Department of Graduate School, Qinghai University, Xining, China
²Department of Neurosurgery, Qinghai Provincial People's Hospital, Xining, China

Background: The rate of poor prognosis in hypertensive cerebral hemorrhage (HICH) remains high. The period of 3–6 months after onset is the most rapid phase of neurological recovery in hemorrhagic stroke patients. Accurate early prediction of 6-month functional outcomes is critical for optimizing therapeutic strategies. This study compared the predictive efficacy of multiple machine learning models to identify the optimal model for forecasting long-term prognosis in HICH patients.

Methods: We conducted a retrospective analysis of clinical data from 807 HICH patients admitted to Qinghai Provincial People's Hospital's Neurosurgery Department between June 2020 and June 2024. After data preprocessing, data from June 2020 to December 2023 (n = 716) were randomly split into training (n = 497) and test sets (n = 219) at a 7:3 ratio. Data from January to June 2024 (n = 91) served as an external validation set. Recursive Feature Elimination (RFE) was performed to identify optimal features, and repeated five-fold cross-validation minimized the risk of overfitting. Model performance was evaluated using Area Under the Curve (AUC) and Decision Curve Analysis (DCA) across XGBoost, Random Forest (RF), Logistic Regression (LR), Support Vector Machine (SVM), and K-Nearest Neighbors (KNN). The optimal model was interpreted via SHapley Additive exPlanations (SHAP).

Results: The 6-month poor prognosis rate among 807 HICH patients was 27.51%. The XGBoost model exhibited optimal performance in the training set (AUC = 0.921, 95% CI: 0.896–0.944) and demonstrated stability in the external validation set (AUC = 0.813, 95% CI: 0.728–0.899). DCA showed that the XGBoost model provided higher net benefit than other models across threshold probabilities of 0%–20% and 56%–100%. SHAP analysis identified hematoma volume as the most critical predictor, with secondary contributions from Glasgow Coma Scale score, white blood cell count, age, serum albumin, and systolic blood pressure, among others.

Conclusion: The XGBoost model demonstrated high accuracy in predicting the long-term prognosis of HICH patients. The SHAP framework quantifies the specific contributions of key pathophysiological indicators to individual patient model predictions, enabling individualized risk stratification and strategic allocation of medical resources.

1 Introduction

Hypertensive cerebral hemorrhage (HICH), caused by the rupture of small blood vessels due to chronic hypertension, affects approximately 4 million individuals globally each year (1). HICH is associated with high mortality and disability rates, posing a significant threat to patient health and survival (2). As a critical neurosurgical condition, HICH is clinically characterized by acute onset, rapid progression, and associated complications (3, 4). Post-hemorrhagic motor recovery predominantly occurs within the first 3–6 months post-onset (5, 6). Consequently, early prediction of neurological recovery beyond 6 months and development of an effective prognostic system hold substantial clinical value, as they are essential for optimizing medical resource allocation, guiding individualized treatment strategies, and improving functional outcomes in affected patients (7).

Machine learning's powerful data-processing capabilities, adaptability, and proficiency in capturing non-linear patterns render it highly suitable for analyzing multifaceted clinical datasets, with applications in clinical research growing annually (8). Its capacity to analyze large-scale, intricate data makes it valuable for clinical diagnosis and outcome assessment. SHapley Additive exPlanations (SHAP) is a widely used method for machine learning interpretability. It helps demystify the “black-box” nature of complex models, thereby enhancing the transparency and credibility of model outcomes (9).

This study aimed to compare multiple machine learning models using diverse clinical features to predict long-term prognosis in HICH patients. After model comparison, SHAP analysis was performed on the best-performing model. By identifying key clinical metrics that influence prognosis prediction, we aimed to make the decision-making process of the model transparent.

2 Methods

2.1 Data sources

We retrospectively collected clinical data from 807 patients diagnosed with HICH admitted to the Department of Neurosurgery at Qinghai Provincial People's Hospital between June 2020 and June 2024. These patients were included in the study cohort. Specifically, data from June 2020 to December 2023 were randomly divided into a training set (70%) and a test set (30%). Data from January to June 2024 were reserved as an external validation set. The study protocol was approved by the Research Ethics Committee of Qinghai Provincial People's Hospital (reference number: 2025-022-02), and informed consent was obtained from all participants or their legal guardians.
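The cohort partition described above can be sketched as follows. This is a minimal illustration, not the authors' code: the record structure, the `split_cohort` helper, the `admitted` field, and the random seed are all assumptions. A plain shuffle will not necessarily reproduce the reported set sizes (n = 497 and n = 219), which may reflect a stratified split.

```python
import random
from datetime import date

def split_cohort(records, cutoff=date(2024, 1, 1), train_frac=0.7, seed=42):
    """Temporal hold-out plus random train/test split.

    records: list of dicts with an 'admitted' date (hypothetical field name).
    Patients admitted on or after `cutoff` form the external validation set;
    earlier patients are shuffled and split train_frac : 1 - train_frac.
    """
    development = [r for r in records if r["admitted"] < cutoff]
    external = [r for r in records if r["admitted"] >= cutoff]
    rng = random.Random(seed)
    rng.shuffle(development)
    n_train = round(train_frac * len(development))
    return development[:n_train], development[n_train:], external

# Toy cohort: 10 admissions before the cutoff and 3 after it.
toy = [{"id": i, "admitted": date(2023, 1 + i, 1)} for i in range(10)]
toy += [{"id": 10 + i, "admitted": date(2024, 1 + i, 15)} for i in range(3)]
train, test, external = split_cohort(toy)
print(len(train), len(test), len(external))
```

Separating the validation set by admission date rather than at random keeps later patients entirely out of model development.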

Inclusion criteria were as follows: (1) age ≥18 years; (2) documented history of hypertension; (3) cerebral hemorrhage confirmed by head CT and/or MRI. Exclusion criteria included: (1) traumatic cerebral hemorrhage, cerebral amyloid angiopathy, or secondary hemorrhage (e.g., aneurysms, vascular malformations, vasculitis, coagulopathies, tumor-related strokes, cerebral venous thrombosis, and so on); (2) incomplete clinical data insufficient for analysis; (3) loss to follow-up; (4) comorbidities that could confound study outcomes, such as life-threatening systemic diseases.

2.2 Predictor variables

This study defined poor prognosis as the failure of HICH patients to achieve expected clinical recovery goals 6 months post-onset. Detailed admission clinical data were retrospectively collected, including age, gender, hypertension history, diabetes history, smoking and alcohol consumption status, admission blood pressure, admission CT hematoma volume, admission blood glucose, surgical intervention, and admission Glasgow Coma Scale (GCS) score (15: conscious; 12–14: mildly impaired consciousness; 9–11: moderately impaired consciousness; 3–8: coma). Additionally, modified Rankin Scale (mRS) scores were recorded 6 months post-onset (0–2: good prognosis; 3–6: poor prognosis).
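The two categorical encodings described above can be written as simple functions. The sketch below is illustrative only; the function names and the input validation are assumptions, while the cut-offs are those stated in the text.

```python
def gcs_category(gcs):
    """Map an admission GCS total (3-15) to the consciousness level used in the text."""
    if not 3 <= gcs <= 15:
        raise ValueError("GCS total must be between 3 and 15")
    if gcs == 15:
        return "conscious"
    if gcs >= 12:
        return "mildly impaired"
    if gcs >= 9:
        return "moderately impaired"
    return "coma"

def poor_prognosis(mrs_6m):
    """Dichotomize the 6-month mRS: 0-2 is good (False), 3-6 is poor (True)."""
    if not 0 <= mrs_6m <= 6:
        raise ValueError("mRS must be between 0 and 6")
    return mrs_6m >= 3

print(gcs_category(8), poor_prognosis(4))
```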

Imaging data included hemorrhage location (basal ganglia, thalamus, cerebellum, or lobar), initial hematoma volume (measured within 24 h of onset using open-source software 3DSlicer for layer-by-layer delineation), and ventricular rupture status. Laboratory data encompassed red blood cell count, hemoglobin, white blood cell count, platelet count, prothrombin time, international normalized ratio, activated partial thromboplastin time, fibrinogen, serum potassium, serum calcium, serum sodium, serum albumin, alanine aminotransferase, and aspartate aminotransferase.

2.3 Data pre-processing

Missing values frequently occur in medical datasets, which can impair model performance. To address this issue, multiple imputation was employed to handle missing data (10). Specifically, the Multivariate Imputation by Chained Equations (MICE) algorithm was utilized for this purpose. Additionally, continuous variables underwent standardization, and categorical variables were factorized. To mitigate class imbalance, the Random Over-Sampling Examples (ROSE) method was applied. In this study, we utilized the ROSE package in R to implement the algorithm. We followed the default settings of the package, which automatically determine the appropriate sampling ratio based on the imbalance of the dataset. This approach helps to improve the model's ability to generalize and make accurate predictions for both majority and minority classes (11).
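The idea behind ROSE, drawing synthetic examples from a smoothed bootstrap around observed cases, can be illustrated in a few lines. This is a simplified sketch, not the R package's implementation: the function name, the fixed noise scale `h`, and the toy data are assumptions, and the package derives its kernel bandwidth from the data rather than using a fixed value.

```python
import random

def rose_like_resample(X, y, n_out=None, p_minority=0.5, h=0.1, seed=0):
    """Smoothed-bootstrap resampling in the spirit of ROSE (illustrative only).

    Each synthetic row is drawn by (1) choosing a class, the minority class
    with probability p_minority, (2) picking a random observed row of that
    class, and (3) adding Gaussian noise with standard deviation h to each
    continuous feature. Features are assumed to be standardized already.
    """
    rng = random.Random(seed)
    n_out = n_out or len(X)
    by_class = {0: [], 1: []}
    for row, label in zip(X, y):
        by_class[label].append(row)
    minority = min(by_class, key=lambda c: len(by_class[c]))
    majority = 1 - minority
    X_new, y_new = [], []
    for _ in range(n_out):
        label = minority if rng.random() < p_minority else majority
        seed_row = rng.choice(by_class[label])
        X_new.append([v + rng.gauss(0.0, h) for v in seed_row])
        y_new.append(label)
    return X_new, y_new

# Toy imbalanced data: 8 good-outcome rows (0) and 2 poor-outcome rows (1).
X = [[-1.0, 0.2], [-0.8, 0.1], [-0.5, -0.3], [-0.9, 0.4], [-0.2, 0.0],
     [-0.7, -0.1], [-0.4, 0.3], [-0.6, 0.2], [1.2, -0.5], [1.5, -0.8]]
y = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
X_bal, y_bal = rose_like_resample(X, y, n_out=200)
print(sum(y_bal) / len(y_bal))  # share of the minority class after resampling
```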

2.4 Selection of candidate variables and predictors

For feature selection, Recursive Feature Elimination (RFE) was employed to identify the optimal subset of predictors. RFE, a widely utilized feature selection method in machine learning, enhances model accuracy and generalizability by eliminating redundant or irrelevant features. This process also reduces computational complexity and improves model interpretability. RFE was conducted strictly on the training dataset to avoid information leakage, and five-fold cross-validation was used during the RFE process to ensure robustness and limit overfitting. RFE analysis ranked 25 potential predictors, from which the top 10 were selected for model development. The optimal feature subset included hematoma volume, GCS score, white blood cell (WBC) count, age, serum albumin, systolic blood pressure (SBP), blood glucose, platelet count, mean corpuscular volume (MCV), and serum potassium.
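The elimination loop at the heart of RFE can be sketched as follows. This is a minimal illustration under stated assumptions: the text does not say which base learner ranked the features, so a small logistic regression fitted by gradient descent stands in here, with importance taken as the absolute weight on standardized inputs. The cross-validation step is omitted, and the data and feature names are synthetic.

```python
import math
import random

def fit_logistic(X, y, epochs=200, lr=0.1):
    """Fit logistic regression by batch gradient descent; return the weights."""
    n, k = len(X), len(X[0])
    w, b = [0.0] * k, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * k, 0.0
        for row, label in zip(X, y):
            z = b + sum(wj * xj for wj, xj in zip(w, row))
            err = 1.0 / (1.0 + math.exp(-z)) - label
            grad_b += err
            for j in range(k):
                grad_w[j] += err * row[j]
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w

def recursive_feature_elimination(X, y, names, n_keep):
    """Repeatedly refit and drop the feature with the smallest |weight|."""
    kept = list(range(len(names)))
    while len(kept) > n_keep:
        X_sub = [[row[j] for j in kept] for row in X]
        weights = fit_logistic(X_sub, y)
        weakest = min(range(len(kept)), key=lambda i: abs(weights[i]))
        kept.pop(weakest)
    return [names[j] for j in kept]

# Synthetic standardized data: only "hematoma_volume" drives the outcome.
rng = random.Random(1)
names = ["hematoma_volume", "noise_a", "noise_b"]
X = [[rng.gauss(0, 1) for _ in names] for _ in range(200)]
y = [1 if row[0] + 0.3 * rng.gauss(0, 1) > 0 else 0 for row in X]
selected = recursive_feature_elimination(X, y, names, n_keep=1)
print(selected)
```

Refitting after every removal is what makes the procedure recursive: a feature's importance is re-estimated in the context of the features that remain.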

2.5 Machine learning models

We employed five machine learning models for training and validation:

• SVM: A supervised learning algorithm widely used for classification and regression tasks. SVM constructs hyperplanes to maximize the margin between classes, enabling effective data separation.

• LR: A generalized linear model commonly applied to classification problems. Its simplicity and interpretability make it a foundational tool in predictive modeling.

• RF: An ensemble learning method that constructs multiple decision trees to improve prediction accuracy and stability. RF's inherent resistance to overfitting and ability to handle high-dimensional data make it suitable for complex datasets.

• KNN: A straightforward yet effective algorithm used for classification and regression. KNN predicts outcomes by measuring distances between data points, with performance enhanced through feature selection and optimal K-value tuning.

• XGBoost: A state-of-the-art gradient-boosting framework that combines weak learners (typically decision trees) into a strong predictive model. XGBoost is renowned for its high performance, scalability, and support for diverse loss functions and regularization techniques.

Each model was selected based on its unique strengths in addressing classification tasks and handling complex clinical datasets.

2.6 Machine learning explainable tool

The model interpretation was performed using the SHAP method, which quantifies the contribution of each feature to the final prediction. By isolating independent feature contributions and analyzing feature interactions, SHAP provides a comprehensive and interpretable framework. Each observation in the dataset is associated with a unique set of SHAP values, enabling granular insights into individual predictions.
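The additive decomposition behind SHAP can be shown with an exact Shapley computation on a toy model. This is a sketch under stated assumptions, not the procedure applied to the XGBoost model: the `toy_risk` function and its coefficients are hypothetical, a single reference patient stands in for the background data, and all feature orderings are enumerated, which is feasible only for a handful of features.

```python
from itertools import permutations
from math import factorial

def shapley_values(model, x, baseline):
    """Exact Shapley values of model(x) relative to model(baseline).

    Features outside the coalition are set to their baseline value, and each
    feature's marginal contribution is averaged over all feature orderings.
    """
    names = list(x)
    phi = {name: 0.0 for name in names}
    for order in permutations(names):
        current = dict(baseline)
        previous = model(current)
        for name in order:
            current[name] = x[name]
            value = model(current)
            phi[name] += value - previous
            previous = value
    n_orders = factorial(len(names))
    return {name: total / n_orders for name, total in phi.items()}

def toy_risk(f):
    """Hypothetical log-odds score with a volume x GCS interaction term."""
    return (0.05 * f["volume"] - 0.2 * f["gcs"] + 0.02 * f["age"]
            + 0.002 * f["volume"] * (15 - f["gcs"]))

baseline = {"volume": 20.0, "gcs": 12, "age": 60}   # reference patient (assumed)
patient = {"volume": 46.4, "gcs": 8, "age": 70}     # illustrative values
phi = shapley_values(toy_risk, patient, baseline)
# Additivity: baseline prediction + sum of contributions = patient prediction.
print(phi)
print(toy_risk(baseline) + sum(phi.values()), toy_risk(patient))
```

The additivity check in the last line is the property that lets a force plot be read as a sum of per-feature pushes away from the base value.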

2.7 Statistical analysis

All statistical modeling and visualization analyses were performed using R software (version 4.4.2). Categorical variables were reported as frequencies and percentages and compared using chi-square tests or Fisher's exact test. Continuous variables following a normal distribution were described using mean ± standard deviation, with group comparisons conducted via t-tests. Non-normally distributed data were expressed as quartiles and compared using the Wilcoxon rank-sum test. Statistical significance was set at P < 0.05.

The model's discriminative ability was quantified by the area under the receiver operating characteristic curve (AUC), complemented by assessments of sensitivity and accuracy. To evaluate clinical applicability, DCA was employed to calculate net benefit values across different risk thresholds, thereby assessing the decision-making utility of the predictive model.
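Both evaluation measures have compact definitions, sketched below. The AUC is computed as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, and net benefit follows the standard decision-curve formula, NB = TP/n − (FP/n) × pt/(1 − pt), where pt is the threshold probability. The function names and the toy data are assumptions; the R implementation used in the study is not described in the text.

```python
def auc(y_true, scores):
    """AUC as the probability that a random positive outranks a random negative."""
    pos = [s for s, t in zip(scores, y_true) if t == 1]
    neg = [s for s, t in zip(scores, y_true) if t == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

def net_benefit(y_true, scores, threshold):
    """Net benefit = TP/n - FP/n * pt / (1 - pt) at threshold probability pt."""
    n = len(y_true)
    tp = sum(1 for s, t in zip(scores, y_true) if s >= threshold and t == 1)
    fp = sum(1 for s, t in zip(scores, y_true) if s >= threshold and t == 0)
    return tp / n - fp / n * threshold / (1 - threshold)

def net_benefit_treat_all(y_true, threshold):
    """Reference strategy that labels every patient as high risk."""
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1 - prevalence) * threshold / (1 - threshold)

y_true = [0, 0, 1, 1]
scores = [0.10, 0.40, 0.35, 0.80]   # toy predicted probabilities
print(auc(y_true, scores))                 # 0.75
print(net_benefit(y_true, scores, 0.3))
print(net_benefit_treat_all(y_true, 0.3))
```

The "treat no patients" strategy has a net benefit of zero at every threshold, so a model is useful at a given threshold only if its curve lies above both zero and the treat-all line.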

3 Results

3.1 Patient characteristics

From June 2020 to June 2024, the Department of Neurosurgery at Qinghai Provincial People's Hospital admitted a total of 1,407 HICH cases. After applying the inclusion and exclusion criteria, 807 patients were included in the final cohort of this study. The data was divided into a training set (n = 497), a test set (n = 219), and an external validation set (n = 91). In the above-mentioned dataset, the proportion of missing values was 8.7%, and multiple imputation was performed. Baseline characteristic comparisons (Table 1) revealed statistically significant differences in Platelet Count (188.11 ± 64.70 vs. 172.01 ± 60.45, P = 0.001), APTT (26.37 ± 3.19 vs. 25.73 ± 3.25, P = 0.002), and WBC (10.67 ± 3.77 vs. 10.12 ± 3.83, P = 0.036), which were higher in the test set, while ALT levels were higher in the training set (28.63 ± 21.91 vs. 26.81 ± 21.41, P = 0.017).

Table 1. Demographic and clinical characteristics of the training and test sets.

Within the training set, comparisons between prognosis groups (Table 2) showed that the poor prognosis group exhibited significantly higher Hematoma Volume (30.64 ± 9.06 vs. 21.35 ± 11.84, P < 0.001) and WBC (11.20 ± 4.44 vs. 9.67 ± 3.46, P < 0.001), but lower GCS (9.85 ± 2.16 vs. 11.63 ± 2.53, P < 0.001). These findings highlight key clinical indicators associated with prognosis.

Table 2. Characteristics of HICH patients in the training set.

3.2 Model construction and evaluation

Using the training set data, we constructed five predictive models: XGBoost, RF, LR, SVM, and KNN. The training set was employed for training the models and performing hyperparameter tuning. We used a five-fold cross-validation approach on the training set to optimize model parameters during the development phase. The test set and the external validation set remained completely independent and were used only once after model selection and training were completed. This ensured an unbiased assessment of the model's generalization performance. The AUC values of the XGBoost, RF, LR, SVM, and KNN models on the training set were 0.921, 0.881, 0.789, 0.849, and 0.879, respectively (Figure 1). The XGBoost model demonstrated superior predictive accuracy, achieving an AUC of 0.921 (95% CI: 0.896–0.944), while the LR model showed relatively weaker performance (AUC = 0.789, 95% CI: 0.748–0.829).
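The fold structure used for tuning can be sketched as an index generator. This is an illustration only: the function name, the seed, and the number of repeats are assumptions, since the text specifies five folds but not how many times the procedure was repeated.

```python
import random

def repeated_kfold_indices(n_samples, k=5, repeats=3, seed=0):
    """Yield (train_idx, val_idx) pairs for repeated k-fold cross-validation."""
    rng = random.Random(seed)
    for _ in range(repeats):
        order = list(range(n_samples))
        rng.shuffle(order)
        folds = [order[i::k] for i in range(k)]  # k near-equal, disjoint folds
        for i in range(k):
            val_idx = folds[i]
            train_idx = [j for m, fold in enumerate(folds) if m != i for j in fold]
            yield train_idx, val_idx

splits = list(repeated_kfold_indices(n_samples=497))
print(len(splits), len(splits[0][0]), len(splits[0][1]))
```

Each candidate hyperparameter setting would be scored on the validation indices of every split and the scores averaged, so that the test and external validation sets are never touched during tuning.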

Figure 1. ROC curve analysis of five machine learning algorithms in the training dataset for predicting the long-term prognosis of HICH patients.

To further assess the generalization ability of these models, we evaluated their performance on an independent external validation set (Figure 2). The XGBoost model again achieved the highest AUC, 0.813 (95% CI: 0.728–0.899). Although this is lower than its training set AUC (0.921), XGBoost remained the best-performing model, indicating reasonable generalization. The other models performed as follows: the RF model achieved an AUC of 0.794 (95% CI: 0.704–0.884), the KNN model an AUC of 0.779 (95% CI: 0.685–0.874), the SVM model an AUC of 0.730 (95% CI: 0.603–0.856), and the LR model an AUC of 0.788 (95% CI: 0.689–0.887).

Figure 2. ROC curve analysis of five machine learning algorithms in the external validation set for predicting the long-term prognosis of HICH patients.

To evaluate clinical utility, decision curve analysis (DCA) quantified the net clinical benefit of each model across threshold probabilities (Figure 3). All models outperformed the “treat all patients” (orange reference line) and “treat no patients” (yellow reference line) strategies. Notably, XGBoost provided the highest net benefit across a broad range of thresholds. Further performance assessment using metrics such as accuracy, sensitivity, positive predictive value (PPV), negative predictive value (NPV), and F1 score (Table 3) confirmed XGBoost's superiority. Consequently, XGBoost was selected as the core model for long-term HICH prognosis prediction.

Figure 3. Decision curve analysis of five models plotting net benefits with different threshold probabilities.

Table 3. Predictive performance of the models.

3.3 Interpretation of XGBoost model by SHAP method

As depicted in Figure 4, Hematoma Volume emerged as the most influential predictor of prognosis, followed by GCS, WBC, Age, Albumin, and SBP. Figure 5 further elucidates the directional impact of each variable. Points are colored by feature value (orange: high; purple: low). Positive SHAP values (right side) indicate an increased probability of poor prognosis, while negative values (left side) indicate a reduced risk. Hematoma Volume showed a strong positive association with poor prognosis: high values (orange) lay on the right side, corresponding to increased risk. Conversely, high GCS scores (orange) lay on the left side with negative SHAP values, predicting better outcomes, whereas low GCS scores (purple) lay on the right side.

Figure 4. Variable importance weights.

Figure 5. The SHapley Additive exPlanation (SHAP) values.

3.4 SHAP individual force plots

Figure 6 presents individual SHAP force plots for two patients: one with a poor prognosis (A) and one with a good prognosis (B). The model's base value (E[f(x)] = 1.29) represents the average model output, that is, the prediction before any patient-specific feature contributions are added. The individual predictive value (f(x)) is expressed on the log-odds scale, and its deviation from the base value reflects the cumulative effect of clinical characteristics on prognosis. In the diagram, red arrows denote risk-enhancing features, while blue arrows denote risk-suppressing features. The arrow length corresponds to the magnitude of the feature contribution. For Patient A (poor prognosis), high-risk features such as large hematoma volume (46.4 mL), low GCS score (8 points), and metabolic abnormalities (e.g., blood glucose 15 mmol/L) collectively elevated the predictive value (f(x) = 2.0) above the baseline, strongly indicating adverse outcomes. Conversely, for Patient B (good prognosis), protective features dominated, driving the predictive value (f(x) = 0.999) below the baseline.
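If the force-plot values are on the log-odds scale, as the description above indicates, they can be converted to probabilities with the inverse logit. The short sketch below applies that conversion to the three values quoted for Figure 6. The helper name is an assumption, and which outcome class the resulting probability refers to depends on how the outcome was coded, which the text does not state.

```python
import math

def log_odds_to_probability(log_odds):
    """Inverse logit: p = 1 / (1 + exp(-log_odds))."""
    return 1.0 / (1.0 + math.exp(-log_odds))

# Values quoted in the text for Figure 6.
for label, value in [("base value", 1.29), ("Patient A", 2.0), ("Patient B", 0.999)]:
    print(label, round(log_odds_to_probability(value), 3))
```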

Figure 6. SHapley Additive exPlanation (SHAP) force plot for two selected patients. (A) Person with a poor prognosis. (B) Person with a good prognosis.

4 Discussion

Current research on HICH prognosis primarily focuses on identifying key prognostic factors and elucidating their mechanisms. While established early predictors include age, gender, smoking and alcohol history, neurological deficit severity, hematoma volume, intraventricular hemorrhage, and subarachnoid hemorrhage, their prognostic utility remains debated (4, 12, 13). Traditional approaches, such as univariate and multivariate logistic regression, have demonstrated limited accuracy in predicting outcomes. Machine learning algorithms, increasingly utilized in medical research (14), often outperform conventional statistical models. Recent applications in HICH include predicting hematoma expansion using techniques like XGBoost, which showed superior performance in early prognosis (15, 16). However, the use of machine learning for long-term functional recovery assessment remains underexplored.

In this study, we compared multiple machine learning algorithms and demonstrated, for the first time, the significant advantage of the XGBoost model in predicting 6-month HICH prognosis (AUC = 0.921 in the training set and AUC = 0.813 in the external validation set). The performance retained in the external validation set suggests that the XGBoost model has genuine predictive capability rather than merely overfitting the training data. Sonobe et al. (17) constructed an RF model for predicting poor prognosis in ICH patients after rehabilitation therapy, and the model also demonstrated excellent performance. Previous studies have primarily employed machine learning algorithms to predict the short-term prognosis of HICH patients (18, 19). However, HICH patients possess the potential for continuous neurological recovery, and their neurological function may progressively improve over time. A longitudinal study conducted by Sreekrishnan et al. (20) on 173 HICH patients demonstrated that the mRS scores of most patients showed significant improvement at 3 and 6 months post-discharge. This study uniquely predicts the long-term prognosis of HICH patients through machine learning models. It can provide critical evidence for the development of personalized treatment and rehabilitation plans in clinical practice, thereby enhancing patient prognosis and quality of life. Using the SHAP method, we systematically evaluated the clinical weights of predictor variables, ranking them by importance. The SHAP individual force plot reveals the specific contributions of key pathophysiological indicators to the model predictions for individual patients. This holds significant value in enhancing the transparency of the model. Top-ranked variables, including hematoma volume and GCS score, were analyzed in conjunction with clinical insights, providing a foundation for individualized risk assessment and clinical decision-making.

Hematoma volume was identified as the most critical predictive variable in this study, aligning with previous findings (21). Larger volumes increase brain tissue compression, exacerbate blood-brain barrier disruption, and induce cerebral edema and intracranial pressure elevation, ultimately worsening neurological deficits (22). A study by Delcourt et al. (23) demonstrated that each 1 mL increase in hematoma volume raised the risk of death or dependence by 5%. Different brain regions exhibit significant threshold differences in their tolerance to hematoma volume due to variations in anatomical structure, functional importance, and compensatory capacity (24). Future research needs to further integrate location-specific volume thresholds to optimize prognostic scoring systems and intervention protocols.

Since its introduction in 1974, the GCS has become the international standard for assessing consciousness impairment in patients with traumatic brain injury and spontaneous cerebral hemorrhage (25). It indirectly reflects the extent of brain tissue damage (26). While GCS is widely used for acute-phase severity assessment, therapeutic decision-making, and long-term prognosis prediction, its predictive accuracy can be enhanced by integrating it with multidimensional indicators such as hematoma volume and age (27). Age emerged as another critical predictor (28), with advancing age significantly increasing the risk of adverse outcomes due to reduced physiological reserve and recovery capacity. A study by Huang et al. (29) highlighted aging as a key risk factor for poor prognosis in HICH.

Elevated peripheral blood WBC counts are a critical prognostic factor in HICH. A study indicates that an early increase in WBC levels after hemorrhage is closely associated with a higher mortality rate (30). This link may stem from inflammation triggered by brain tissue damage, which releases mediators attracting WBCs, primarily neutrophils and monocytes. Neutrophils, the first responders, phagocytose debris but also release reactive oxygen species (ROS) and matrix metalloproteinases (MMPs) like MMP-9. While these degrade damaged tissue, excessive ROS and MMPs can disrupt the blood-brain barrier, worsen cerebral edema, and cause secondary injury (31). High WBC levels often indicate increased risks of complications such as cerebral edema and rebleeding (32), which may lead to poorer long-term functional outcomes (33). However, a study by Morotti et al. (34) highlighted differing roles of neutrophils and monocytes: reduced neutrophil counts were tied to a higher risk of hematoma expansion, while elevated monocyte counts correlated with increased expansion risk. These findings underscore the complex relationship between leukocyte subsets and HICH prognosis, suggesting a need for further research into the specific mechanisms of WBC action post-hemorrhage.

Serum albumin levels exhibit a significant negative correlation with poor prognosis in HICH. Research consistently indicates that low serum albumin is strongly associated with increased mortality and adverse outcomes in HICH patients. Specifically, diminished albumin levels may reflect severe comorbidities (acute inflammation, infection, liver disease, or vascular endothelial injury) or malnutrition, both of which contribute to poor prognoses. Studies have highlighted that reduced albumin levels directly correlate with higher mortality risk (35). Furthermore, admission albumin levels serve as an independent prognostic indicator. One study observed that patients with low admission albumin had prolonged hospital stays and elevated short-term and long-term mortality rates (36). In clinical practice, monitoring albumin levels provides critical insights into patient prognosis. Physicians should closely track these levels and consider interventions to enhance nutritional status or address underlying conditions.

Systolic blood pressure (SBP) is a key risk factor in HICH, affecting both severity and prognosis. Each 10 mmHg SBP increase raises hemorrhage risk by 60% (37). In the acute phase, high SBP exacerbates hemorrhage and may cause further brain damage by increasing cerebral blood flow and intracranial pressure (38). Therefore, stabilizing SBP and minimizing fluctuations are essential for improving long-term prognosis. Clinical guidelines recommend controlling acute-phase SBP to approximately 140 mmHg, as this level is associated with reduced poor prognosis risk (39, 40). However, overly rapid SBP reduction may adversely affect short-term and long-term outcomes (41). Rational SBP management can significantly lower the risk of poor prognosis and enhance patients' quality of life.

5 Conclusion

We developed an interpretable XGBoost prediction model that demonstrated superior performance in assessing the risk of poor prognosis in patients with HICH. Furthermore, by quantifying the specific contributions of key pathophysiological indicators to individual patient model predictions through the SHAP framework, individualized risk stratification and optimization of medical resource allocation can be achieved.

6 Strengths

This study's strength lies in constructing a long-term prognostic prediction model for the high-risk HICH subtype (comprising 50%–70% of spontaneous cerebral hemorrhage cases), addressing the etiological heterogeneity limitations of broader spontaneous intracerebral hemorrhage (sICH) models. Beyond traditional indicators like hematoma volume and GCS score, the study confirmed the independent predictive value of serum albumin, white blood cell count, and systolic blood pressure fluctuations for HICH's long-term prognosis. The model's real-world generalizability is supported by external validation and decision curve analysis across independent time periods. Clinicians can utilize this model to identify at-risk patients and optimize rehabilitation resource allocation. Additionally, the SHAP framework's application enhances model transparency, offering an interpretable basis for personalized interventions.

7 Limitations

Our study has several limitations. First, the GCS is influenced by patient cooperation and rater experience, which may introduce data bias. Second, as a retrospective analysis, selection bias may affect the generalizability of the results. Third, the limited number of externally validated cases may impact the reliability of the findings. Fourth, since the data is sourced from a single institution, there are limitations in terms of coverage and diversity, which may result in the analysis lacking comprehensiveness and broad representativeness. Finally, future research should not only focus on developing high-performance predictive models but also aim to create accessible application platforms.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The study protocol was approved by the Research Ethics Committee of Qinghai Provincial People's Hospital (reference number: 2025-022-02). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

MH: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. ZL: Funding acquisition, Project administration, Supervision, Writing – review & editing. YL: Data curation, Investigation, Writing – review & editing. ZC: Data curation, Investigation, Writing – review & editing. QZ: Funding acquisition, Project administration, Writing – review & editing. XJ: Data curation, Investigation, Writing – review & editing. PH: Project administration, Supervision, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. The authors gratefully acknowledge the financial support of the General Program of the Major Science and Technology Project of Qinghai Provincial Science and Technology Department (no. 2024-SF-A2) as well as the “Kunlun Talents • Leading Talents in Science and Technology” project of Qinghai Province.

Acknowledgments

We sincerely thank all the authors for their joint efforts, as well as the patients and their families for their participation and support.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Gen AI was used in the creation of this manuscript.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Li D, Wei M, Wu S, Zhang L, Zhang Z. Prognostic factors in acute hypertensive intracerebral hemorrhage: impact of minimally invasive puncture and drainage. Am J Transl Res. (2024) 16:5371–84. doi: 10.62347/PQPP5715

2. Kase CS, Hanley DF. Intracerebral hemorrhage: advances in emergency care. Neurol Clin. (2021) 39:405–18. doi: 10.1016/j.ncl.2021.02.002

3. Gross BA, Jankowitz BT, Friedlander RM. Cerebral intraparenchymal hemorrhage: a review. JAMA. (2019) 321:1295–303. doi: 10.1001/jama.2019.2413

4. Zhang S, Zhang X, Ling Y, Li A. Predicting recurrent hypertensive intracerebral hemorrhage: derivation and validation of a risk-scoring model based on clinical characteristics. World Neurosurg. (2019) 127:e162–71. doi: 10.1016/j.wneu.2019.03.024

5. Kwakkel G, Kollen B, Lindeman E. Understanding the pattern of functional recovery after stroke: facts and theories. Restor Neurol Neurosci. (2004) 22:281–99. doi: 10.3233/RNN-2004-00282

6. Jørgensen HS, Nakayama H, Raaschou HO, Olsen TS. Recovery of walking function in stroke patients: the Copenhagen Stroke Study. Arch Phys Med Rehabil. (1995) 76:27–32. doi: 10.1016/S0003-9993(95)80038-7

7. Campagnini S, Arienti C, Patrini M, Liuzzi P, Mannini A, Carrozza MC. Machine learning methods for functional recovery prediction and prognosis in post-stroke rehabilitation: a systematic review. J Neuroeng Rehabil. (2022) 19:54. doi: 10.1186/s12984-022-01032-4

8. Handelman GS, Kok HK, Chandra RV, Razavi AH, Lee MJ, Asadi H. eDoctor: machine learning and the future of medicine. J Intern Med. (2018) 284:603–19. doi: 10.1111/joim.12822

9. Chowdhury MZI, Leung AA, Walker RL, Sikdar KC, O'Beirne M, Quan H, et al. A comparison of machine learning algorithms and traditional regression-based statistical modeling for predicting hypertension incidence in a Canadian population. Sci Rep. (2023) 13:13. doi: 10.1038/s41598-022-27264-x

10. Gravesteijn BY, Steyerberg EW, Lingsma HF. Modern learning from big data in critical care: primum non nocere. Neurocrit Care. (2022) 37:174–84. doi: 10.1007/s12028-022-01510-6

11. Budhathoki N, Bhandari R, Bashyal S, Lee C. Predicting asthma using imbalanced data modeling techniques: evidence from 2019 Michigan BRFSS data. PLoS ONE. (2023) 18:e0295427. doi: 10.1371/journal.pone.0295427

12. Hallevi H, Dar NS, Barreto AD, Morales MM, Martin-Schild S, Abraham AT, et al. The IVH score: a novel tool for estimating intraventricular hemorrhage volume: clinical and research implications. Crit Care Med. (2009) 37:969–74.e1. doi: 10.1097/CCM.0b013e318198683a

13. Chuang YC, Chen YM, Peng SK, Peng SY. Risk stratification for predicting 30-day mortality of intracerebral hemorrhage. Int J Qual Health Care. (2009) 21:441–7. doi: 10.1093/intqhc/mzp041

14. Minardi M, Bianconi A, Mesin L, Salvati LF, Griva F, Narducci A. Proposal of a machine learning based prognostic score for ruptured microsurgically treated anterior communicating artery aneurysms. J Clin Med. (2025) 14:578. doi: 10.3390/jcm14020578

15. Yu F, Yang M, He C, Yang Y, Peng Y, Yang H, et al. CT radiomics combined with clinical and radiological factors predict hematoma expansion in hypertensive intracerebral hemorrhage. Eur Radiol. (2025) 35:6–19. doi: 10.1007/s00330-024-10921-2

16. Ye H, Jiang Y, Wu Z, Ruan Y, Shen C, Xu J, et al. A comparative study of a nomogram and machine learning models in predicting early hematoma expansion in hypertensive intracerebral hemorrhage. Acad Radiol. (2024) 31:5130–40. doi: 10.1016/j.acra.2024.05.035

17. Sonobe S, Ishikawa T, Niizuma K, Kawakami E, Ueda T, Takaya E, et al. Development and validation of machine learning prediction model for post-rehabilitation functional outcome after intracerebral hemorrhage. Interdiscip Neurosurg. (2022) 29:101560. doi: 10.1016/j.inat.2022.101560

18. Qi X, Hu G, Sun H, Chen Z, Yang C. Machine learning-based perihematomal tissue features to predict clinical outcome after spontaneous intracerebral hemorrhage. J Stroke Cerebrovasc Dis. (2022) 31:106475. doi: 10.1016/j.jstrokecerebrovasdis.2022.106475

19. Dierksen F, Sommer JK, Tran AT, Lin H, Haider SP, Maier IL, et al. Machine learning models for 3-month outcome prediction using radiomics of intracerebral hemorrhage and perihematomal edema from admission head computed tomography (CT). Diagnostics. (2024) 14:2827. doi: 10.3390/diagnostics14242827

20. Sreekrishnan A, Leasure AC, Shi FD, Hwang DY, Schindler JL, Petersen NH, et al. Functional improvement among intracerebral hemorrhage (ICH) survivors up to 12 months post-injury. Neurocrit Care. (2017) 27:326–33. doi: 10.1007/s12028-017-0425-4

21. Chen Y, Qin C, Chang J, Liu Y, Zhang Q, Ye Z, et al. Defining delayed perihematomal edema expansion in intracerebral hemorrhage: segmentation, time course, risk factors and clinical outcome. Front Immunol. (2022) 13:911207. doi: 10.3389/fimmu.2022.911207

22. Guo W, Liu H, Tan Z, Zhang X, Gao J, Zhang L, et al. Comparison of endoscopic evacuation, stereotactic aspiration, and craniotomy for treatment of basal ganglia hemorrhage. J Neurointerv Surg. (2020) 12:55–61. doi: 10.1136/neurintsurg-2019-014962

23. Delcourt C, Huang Y, Arima H, Chalmers J, Davis SM, Heeley EL, et al. Hematoma growth and outcomes in intracerebral hemorrhage: the INTERACT1 study. Neurology. (2012) 79:314–9. doi: 10.1212/WNL.0b013e318260cbba

24. Morotti A, Li Q, Nawabi J, Mazzacane F, Schlunk F, Shoamanesh A, et al. Volume tolerance and prognostic impact of hematoma expansion in deep and lobar intracerebral hemorrhage. Stroke. (2025) 56:1224–31. doi: 10.1161/STROKEAHA.124.049008

25. Teasdale G, Jennett B. Assessment of coma and impaired consciousness. A practical scale. Lancet. (1974) 2:81–4. doi: 10.1016/S0140-6736(74)91639-0

26. Mehta R, Chinthapalli K. Glasgow coma scale explained. BMJ. (2019) 365:l1296. doi: 10.1136/bmj.l1296

27. Zhang J, Zhang N, Li X, Bao L, Liang F, Wang P. Retrospective analysis of prognostic factors in HICH patients after neuroendoscopic hematoma evacuation. Sci Rep. (2024) 14:29505. doi: 10.1038/s41598-024-81106-6

28. Pinho J, Costa AS, Araújo JM, Amorim JM, Ferreira C. Intracerebral hemorrhage outcome: a comprehensive update. J Neurol Sci. (2019) 398:54–66. doi: 10.1016/j.jns.2019.01.013

29. Huang X, Wang D, Zhang Q, Ma Y, Li S, Zhao H, et al. Development and validation of a clinical-based signature to predict the 90-day functional outcome for spontaneous intracerebral hemorrhage. Front Aging Neurosci. (2022) 14:904085. doi: 10.3389/fnagi.2022.904085

30. He J, Zhang Y, Cheng X, Li T, Xiao Y, Peng L, et al. White blood cell count predicts mortality in patients with spontaneous intracerebral hemorrhage. Neurocrit Care. (2023) 39:445–54. doi: 10.1007/s12028-023-01716-2

31. Lyden P, Anderson A, Rajput P. Therapeutic hypothermia and Type II errors: do not throw out the baby with the ice water. Brain Circ. (2019) 5:203–10. doi: 10.4103/bc.bc_53_19

32. Zhu Y, Xie Z, Shen J, Zhou L, Liu Z, Ye D, et al. Association between systemic inflammatory response syndrome and hematoma expansion in intracerebral hemorrhage. Adv Clin Exp Med. (2022) 31:489–98. doi: 10.17219/acem/145852

33. Guo P, Zou W. Neutrophil-to-lymphocyte ratio, white blood cell, and C-reactive protein predicts poor outcome and increased mortality in intracerebral hemorrhage patients: a meta-analysis. Front Neurol. (2023) 14:1288377. doi: 10.3389/fneur.2023.1288377

34. Morotti A, Phuah CL, Anderson CD, Jessel MJ, Schwab K, Ayres AM, et al. Leukocyte count and intracerebral hemorrhage expansion. Stroke. (2016) 47:1473–8. doi: 10.1161/STROKEAHA.116.013176

35. Wu D, Shen S, Luo D. Association of lactate-to-albumin ratio with in-hospital and intensive care unit mortality in patients with intracerebral hemorrhage. Front Neurol. (2023) 14:1198741. doi: 10.3389/fneur.2023.1198741

36. Akirov A, Masri-Iraqi H, Atamna A, Shimon I. Low albumin levels are associated with mortality risk in hospitalized patients. Am J Med. (2017) 130:1465.e11–e19. doi: 10.1016/j.amjmed.2017.07.020

37. Mullen MT, Anderson CS. Review of long-term blood pressure control after intracerebral hemorrhage: challenges and opportunities. Stroke. (2022) 53:2142–51. doi: 10.1161/STROKEAHA.121.036885

38. Dandapani BK, Suzuki S, Kelley RE, Reyes-Iglesias Y, Duncan RC. Relation between blood pressure and outcome in intracerebral hemorrhage. Stroke. (1995) 26:21–4. doi: 10.1161/01.STR.26.1.21

39. Mutimer CA, Yassi N, Wu TY. Blood pressure management in intracerebral haemorrhage: when, how much, and for how long? Curr Neurol Neurosci Rep. (2024) 24:181–9. doi: 10.1007/s11910-024-01341-2

40. Francoeur CL, Mayer SA. Acute blood pressure and outcome after intracerebral hemorrhage: the VISTA-ICH cohort. J Stroke Cerebrovasc Dis. (2021) 30:105456. doi: 10.1016/j.jstrokecerebrovasdis.2020.105456

41. Xu J, Xie Z, Chen K, Lan S, Liao G, Xu S, et al. The L-shaped correlation between systolic blood pressure and short-term and long-term mortality in patients with cerebral hemorrhage. BMC Neurol. (2023) 23:230. doi: 10.1186/s12883-023-03271-x

Keywords: hypertensive cerebral hemorrhage, predictive model, XGBoost, SHAP, machine learning

Citation: He M, Lu Z, Lv Y, Cheng Z, Zhang Q, Jin X and Han P (2025) Machine learning-based prediction of 6-month functional recovery in hypertensive cerebral hemorrhage: insights from XGBoost and SHAP analysis. Front. Neurol. 16:1608341. doi: 10.3389/fneur.2025.1608341

Received: 08 April 2025; Accepted: 15 May 2025;
Published: 04 June 2025.

Edited by:

Andrea Bianconi, University of Genoa, Italy

Reviewed by:

Eichi Takaya, Tohoku University, Japan
Shinya Sonobe, Tohoku University, Japan
Luca Francesco Salvati, ASST Sette Laghi, Italy

Copyright © 2025 He, Lu, Lv, Cheng, Zhang, Jin and Han. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zhongsheng Lu
