Machine learning models for mortality prediction in patients with spontaneous subarachnoid hemorrhage following ICU treatment

Hu, Wenwen; Yu, Danfeng; Zhang, Liwen; Zhang, Jing

doi:10.3389/fneur.2025.1648353

ORIGINAL RESEARCH article

Front. Neurol., 17 September 2025

Sec. Artificial Intelligence in Neurology

Volume 16 - 2025 | https://doi.org/10.3389/fneur.2025.1648353

Machine learning models for mortality prediction in patients with spontaneous subarachnoid hemorrhage following ICU treatment

Wenwen Hu¹

Danfeng Yu¹

Liwen Zhang²

Jing Zhang¹^*

¹Department of Neurological Intensive Care Unit, Taihe Hospital, Hubei University of Medicine, Shiyan, China
²Graduate School, Hubei University of Medicine, Shiyan, China

Background: Spontaneous subarachnoid hemorrhage (SAH) is a severe and potentially life-threatening acute cerebrovascular disease. Early identification of the risk of death in patients with spontaneous SAH is of vital importance for improving prognosis, reducing mortality, and guiding clinical treatment.

Methods: A retrospective cohort study was conducted using the public database, Medical Information Mart for Intensive Care IV (MIMIC)-IV. The primary outcome was in-hospital mortality following intensive care unit (ICU) treatment. All features were extracted from first-day ICU admission data. Data analysis was performed by using R and Python, with feature selection conducted via least absolute shrinkage and selection operator (LASSO) regression. We constructed 8 models based on the 12 selected features in the training set and evaluated them in the test set by various metrics, including area under the curve (AUC), accuracy, precision (positive prediction value), recall (sensitivity), Brier score, Jordan index, and calibration slope. The most effective model was rendered explainable through the SHapley Additive exPlanations (SHAP) approach.

Results: The study included 1,121 records, with 870 surviving and 251 deceased patients. We selected 43 features for the preliminary baseline analysis. Based on LASSO regression analysis and clinical practical significance, 12 features were finally included in the construction of the machine learning models. We constructed eight machine learning models, among which the logistic regression (LR) model performed the best.

Conclusions: In our study, the LR model exhibited superior discrimination in predicting risk of mortality among patients with spontaneous SAH compared to other models. This research contributes to facilitating the early identification of mortality risk in patients with spontaneous SAH. External validation and further prospective studies are warranted to confirm and refine these predictive insights for clinical utilization.

1 Introduction

Subarachnoid Hemorrhage (SAH) is a critical public health concern, which remains a serious disease associated with considerable disability and mortality (1). The incidence of SAH is approximately 9 cases per 100,000 individuals, and it is the third most prevalent subtype of stroke (2). This disorder is primarily categorized into two types: traumatic and spontaneous (non-traumatic), with the spontaneous type accounting for approximately 85%−95% of cases, thus constituting the majority (3). Spontaneous SAH is relatively common, and its causes are diverse. The rupture of intracranial aneurysms is one of the main causes, accounting for approximately 85% (4). Cerebral vascular malformations, such as arteriovenous malformations, are also important contributing factors, occurring more frequently in adolescents (5). In addition, vascular inflammation, abnormal vascular networks at the base of the brain, brain tumors, and moyamoya disease may also lead to spontaneous SAH (5).

One-third of spontaneous SAH patients die within the initial days to weeks after the hemorrhage, and most survivors have long-term disability or cognitive impairment (6). Spontaneous SAH carries an exceptionally high disease-specific burden. Since traditional risk prediction is limited to a single feature selection method or a single algorithm, it has relative lag and limitations (7). There is an urgent need for a reliable method to predict the risk of death in spontaneous SAH patients in the ICU at an early stage. Clinical prediction models that utilize electronic health record data through advanced data mining techniques have emerged as a promising approach to addressing these challenges. Machine learning, with its high efficiency and accuracy in data processing, has become increasingly prevalent in various disease predictions. In our study, we aimed to integrate machine learning algorithms with traditional statistical analysis to comprehensively evaluate the risk factors for death in patients with spontaneous SAH following intensive care unit (ICU) treatment. These new analytic approaches may enhance risk prediction beyond only traditional statistical approaches used in the past (8). An assessment of the risk of death after spontaneous SAH is valuable for guiding early clinical management of patients and evaluating clinical efficacy.

2 Methods

2.1 Data source and study population

This study is a retrospective cohort study based on the Medical Information Mart for Intensive Care IV (MIMIC-IV, Version 3.1, released on 11 October 2024) database (9). In order to enhance usability of medical data and to improve patient care through knowledge discovery and algorithm development, a large deidentified dataset - MIMIC-IV database, developed and maintained by the Computational Physiology Laboratory at the Massachusetts Institute of Technology (MIT). MIMIC-IV contains data for over 65,000 patients admitted to an ICU and over 200,000 patients admitted to the emergency department at the Beth Israel Deaconess Medical Center in Boston, MA. All data was captured automatically through the three systems during clinical care: Hospital-wide Electronic Health Record (EHR) System, ICU Clinical Information (MetaVision) System, and Emergency Department (ED) System.

For data retrieval from the MIMIC-IV database, Structured Query Language (SQL) was applied. In order to comply with the regulations, the author, Wenwen Hu, obtained both a Cooperative Institutional Training Initiative (CITI) license and the necessary permissions to use the MIMIC-IV database (ID: 67812003). We developed detailed data extraction steps and conducted trial extractions before the official data extraction phase to test and refine the clarity and operability of these steps.

(1) Inclusion criteria

a. Patients who were diagnosed with spontaneous SAH confirmed by both the International Classification of Diseases (ICD)-9 or ICD-10.

b. For patients with ICU admissions more than once, only data of the first ICU admission of the first hospitalization was collected for the study.

(2) Exclusion criteria

a. Patients under the age of 18 were excluded from the study.

b. Patients with concurrent malignant tumors were excluded from the study.

c. Patients with over 20% missing features (after feature extraction) were also excluded from the study.

2.2 Feature selection and outcome

In this study, 43 features referring to published articles (10–13) and clinical experience were extracted from the MIMIC-IV database, including age, gender, basic vital signs, coexisting disorders, blood cell analysis, coagulation function, serum ions, biochemical parameters, ventilation status, Glasgow Coma Scale (GCS) score, sepsis-related organ failure assessment (SOFA) score, acute physiology score iii (APS III), simplified acute physiology score ii (SAPS II) and the primary outcome.

The vital signs and serum ions were selected based on the maximum and minimum values recorded on the first day of admission to the ICU, while blood cell analysis, coagulation function, liver and kidney function, and serum ions were selected based on the first test values recorded on the first day of admission to the ICU. In cases where multiple test results were available for a specific feature, the first measurement was used in the analysis.

The variance inflation factor (VIF) is an effective tool to detect multicollinearity (14). A VIF = 1 indicates no multicollinearity; a VIF between 1 and 5 indicates moderate collinearity; a VIF > 5 indicates high collinearity; a VIF > 10 indicates severe multicollinearity (15). To mitigate the interference caused by strong multicollinearity, we removed 5 features with severe multicollinearity. Re-calculated the VIF of the retained features, all of them were < 5.

We applied the least absolute shrinkage and selection operator (LASSO) regression in feature selection. LASSO achieves variable selection through L1 regularization. As the value of λ increases, more and more coefficients are shrunk to zero, resulting in twelve features with nonzero coefficients (16). The primary outcome was in-hospital mortality of spontaneous SAH patients following their treatment in the ICU.

2.3 Missing data processing

Missing data is inevitable because clinical needs and resources limit what data is collected, patient differences lead to inconsistent or variable measurements, and merging data from various sources may introduces omissions and discrepancies. Most features had missing rates < 10%, with the exception of PT (12.2%) and APTT (13.7%) (Supplementary Table S1).

For data with a missing rate of less than 10%, a filling method (median, mean, or mode) that represents the central tendency of the variables was selected based on the characteristics of the data distribution (17). For data with a missing rate ranging from 10% to 20%, the multiple imputation method was employed to replace missing values, thereby minimizing their impact on classification performance (18).

When the missing rate is low (< 10%), the bias introduced by simple imputation is typically negligible compared to the additional complexity associated with multiple imputation. Multiple imputation is designed for higher missing rates. In our study, simple imputation and multiple imputation didn't make significantly different results. Variability and the relationships between variables were preserved as they would be when using multiple imputation.

2.4 Statistical analysis

Categorical variables were presented as numbers and percentages (%). We compared proportions for unordered categorical variables using the χ²-test or Fisher's exact test and compared proportions for ordered categorical variables using the Wilcoxon rank-sum test. Normally and non-normally distributed continuous variables were expressed as mean ± SD and median (interquartile range, IQR), respectively. Normally distributed continuous variables were analyzed by an independent t-test, while non-normally distributed continuous variables were analyzed by the Mann–Whitney U test. P values less than 0.05 (two-sided test) were considered statistically significant.

2.5 Model construction and evaluation

Model construction was performed using 8 machine learning algorithms, including Random Forest (RF), Logistic Regression (LR), Light Gradient Boosting Machine (LGBM), Naive Bayes (NB), Decision Tree (DT), Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), and Artificial Neural Network (ANN). The performance of each model was evaluated depending on area under the curve (AUC), accuracy, precision, recall, Brier score, Jordan index, and calibration slope. Receiver operating characteristic (ROC) curves and precision–recall (P-R) curves for the eight models were depicted in one plot, respectively, for comparison. The metrics and plots were used to determine the optimal model. Additionally, the SHapley Additive exPlanations (SHAP) approach was adopted to make the final optimized model more interpretable. The SHAP values indicate the contribution of each feature to the final classification, enabling us to interpret the model from a clinical perspective.

2.6 Software

We used Navicat (version 17.0.8) to access the MIMIC-IV database. Data preprocessing, feature selection, and statistical analysis were performed using R (version 4.4.3). Python (version 3.13.1) was employed for the construction and evaluation of machine learning models.

3 Results

3.1 Baseline characteristics of ICU patients with SAH

Initially, 1,329 ICU admission records diagnosed with spontaneous SAH and 43 features were extracted from the MIMIC-IV database. By combining exclusion criteria and discarding features with excessive missing values, 1,121 records were ultimately retained for subsequent analysis. The flowchart of the study is shown in Figure 1. Based on ICU outcomes, the entire study population was divided into two groups: a survival group (n = 870) and a non-survival group (n = 251). Baseline characteristics of the included patients are presented in Table 1.

Figure 1

Flowchart detailing the process of analyzing first ICU admission records of SAH patients. Starting with 1329 patients, exclusions reduce the number to 1121.Baseline characteristic and collinearity analyses occur, followed by LASSO regression narrowing to 38 features. Machine learning reduces it to 12 features, splitting into a training set of 897 and a test set of 224. Model construction and evaluation are performed using methods such as random forest and neural networks, concluding with SHAP analysis.

Figure 1. Patients and features selection flowchart of the MIMIC-IV database.

Table 1

Table 1. Characteristics of spontaneous SAH from the MIMIC-IV database.

Forty-three characteristics were compared between the two groups, among which 24 characteristics showed significant statistical difference. According to the baseline data presented in the Table 1, patients in the non-survival group were generally older than those in the survival group. No significant difference was observed in gender composition between the two groups, with females slightly outnumbering males in both the survival and non-survival groups. The data revealed that the non-survival group was more likely to have electrolyte disorders, coagulation abnormalities, hyperglycemia, and thrombocytopenia. Additionally, vital signs such as heart rate, body temperature, and oxygen saturation in these patients fluctuated more widely and were more likely to be accompanied by comorbidities.

3.2 Features selection

Given that the red blood cell (RBC) count influences hemoglobin and hematocrit levels, RBC was retained. The calculation of non-invasive mean arterial pressure (NMAP) is dependent on non-invasive systolic blood pressure (NBPS); hence, NBPS was retained. Given the close interrelationship among INR, PT, and APTT, APTT was retained. Consequently, five features - hemoglobin, hematocrit, INR, PT, and NMAP - with excessive multicollinearity were removed, and 38 features were retained for further analysis (Supplementary Figure S1).

Subsequently, the retained features were selected by the LASSO regression algorithm. Twelve of 38 features were selected as the best predictive to construct the machine learning models. These were identified at a shrinkage parameter (lambda.1se) of 0.02618559 (Figure 2). The following features raise the risk of mortality in our study: serum sodium, SAPSII score, admission age, BUN, glucose, SOFA score, heart rate, APSIII score, liver disease, and creatinine. Conversely, when SpO2 and platelet count rise, the risk of death falls. The importance ranking of the 12 features is shown in Figure 3. Then, these features were used in the subsequent analyses for all models in both training and test sets.

Figure 2

Chart A shows a line graph of coefficients versus Log Lambda, with multiple colored lines converging towards zero as Log Lambda increases. Chart B displays a plot of binomial deviance against Log Lambda, featuring a red line with error bars, indicating a U-shaped pattern with two dashed vertical lines at critical points.

Figure 2. Features selection using a LASSO regression model. (A) LASSO coefficient path graph: Lasso achieves variable selection through L1 regularization. As the value of λ increases, more and more coefficients are shrunk to zero. To determine the optimal predictors of the model, ten-fold cross-validation with minimum criteria was used, resulting in twelve features with nonzero coefficients. (B) The minimum criteria (lambda.min) and 1 SE of the minimum criteria (lambda. 1se) were used to depict the optimal values with dotted vertical lines. We chosed lambda.1se instead of lambda.min because lambda.1se (the maximum λ value within the minimum error range of one standard error) usually provides a more robust and concise model, which helps to avoid overfitting.

Figure 3

Graph of test set model metrics comparing different algorithms: ANN, DT, LGBM, LR, NB, RF, SVM, and XGBoost. Metrics include AUC, Accuracy, Precision, Recall, Brier Score, Jordan Index, and Calibration Slope, with varying performance across metrics.

Figure 3. LASSO coefficient profile of 12 features. Features after selection. There are 38 features in total before the selection process, and 12 features remain after using LASSO regression. The plot presents the top 12 features that had the greatest impact on survival or death in SAH patients after receiving treatment in the ICU. Green bars indicate protective factors and red bars indicate risk factors. The length of the bar for each feature indicates the importance (weight) of that feature in making the prediction. A longer bar indicates a feature that contributes more to survival or death.

3.3 Model performance and explanation

1,121 records were randomly divided into a training set (n = 897) and a test set (n = 224) at a ratio of 8:2. The number of non-survival patients was 205 (22.9%) and 46 (20.5%) in the training and test sets respectively. We developed 8 machine learning models to predict the risk factors for death after receiving treatment in ICU. The 8 models were trained employing the training set. Their performance was subsequently evaluated employing the test set.

Table 2 and Figure 4 showed the metrics for each model in predicting mortality. The LR model outperformed others with the highest accuracy of 0.8545 and a higher recall of 0.7826. The Jordan index of 0.7291, calibration slope of 0.7623, and Brier score of 0.1650 all indicated better model performances. On the training set, both LGBM (AUC = 0.9907) and XGBoost (AUC = 0.9780) achieved near-perfect AUC scores approaching 1.0, whereas their performance declined markedly on the test set (AUC = 0.8396 and 0.8510), indicating potential overfitting. In contrast, the LR model demonstrated relatively excellent and stable performance across both the training and test sets, achieving the highest prediction performance on the test set (AUC = 0.8646), as is shown in Figure 5. The precision-recall curve is a visualization tool used to evaluate the trade-off relationship between the precision and recall of a classification model at different thresholds. Figure 6 showed that this LR model algorithm has high classification precision and recall rate. Therefore, the LR model was finally selected to predict the mortality rate of spontaneous SAH patients in the ICU.

Table 2

Table 2. Evaluation of different machine learning models.

Figure 4

Bar chart titled “LASSO Feature Coefficient Analysis” showing feature coefficients. Sodium max has the highest positive coefficient at 0.496, while spo2 min has a negative coefficient of -0.125. Other features include sapsii, admission age, bun max, and glucose max with positive coefficients.

Figure 4. Line graph of the test set model metrics. LR, logistic regression; RF, random forest; LGBM, light gradient boosting machine; NB, naive bayes; DT, decision tree; XGBoost, extreme gradient boosting; SVM, support vector Machine; ANN, artificial neural network.

Figure 5

ROC curve comparison chart for multiple models, displaying true positive rate versus false positive rate. Models include DT (AUC=0.79), LGBM (0.84), LR (0.86), RF (0.84), SVM (0.83), XGB (0.85), NB (0.86), and ANN (0.64). A diagonal line represents random guessing.

Figure 5. ROC curves of eight machine learning models. LR, logistic regression; RF: random forest; LGBM, light gradient boosting machine; NB, naive bayes; DT, decision tree; XGBoost, extreme gradient boosting; SVM, support vector Machine; ANN, artificial neural network.

Figure 6

Precision-recall curves comparison for various models on a test set. Models include Decision Tree, LGBM, Logistic Regression, Random Forest, SVM, XGBoost, Naive Bayes, and ANN. Average precision scores range from 0.36 (ANN) to 0.66 (LGBM). The x-axis represents recall, and the y-axis represents precision.

Figure 6. Precision–recall curves of eight machine learning models. LR, logistic regression; RF, random forest; LGBM, light gradient boosting machine; NB, naive bayes; DT, decision tree; XGBoost, extreme gradient boosting; SVM, support vector Machine; ANN, artificial neural network.

To identify the most influential features, we employed the SHAP value analysis, an interpretable method widely used in medical research. By visualizing the SHAP values of all features across the entire dataset, we were able to discern overarching patterns and relationships within the data. As illustrated in the SHAP summary plots (Figures 7A, B), we evaluated the contributions of 12 selected features to the model performance. We further transformed the SHAP value matrix into a heatmap (Figure 7C) to visualize the feature contributions at the individual sample level. This visualization provided a granular view of how each feature contributed to the prediction for every sample. In Figure 7D, the decision plot depicted the decision-making process for each participant. Every line converges to a single point at −0.323.

Figure 7

Panel A shows a SHAP summary plot illustrating the impact of different features on the model output, with red and blue colors indicating high and low feature values. Panel B depicts a bar chart of feature importance based on mean absolute SHAP values, with glucose_max having the highest importance. Panel C presents a heatmap of SHAP values clustered by samples, with varying colors representing different impact levels. Panel D illustrates a decision path analysis plot, displaying pathways of features affecting the model output, with color-coded pathways indicating impact direction.

Figure 7. (A) SHAP summary dot plot; (B) SHAP summary bar plot; (C) SHAP heatmap plot; (D) SHAP decision plot. (A) Color denotes the feature value–red denotes a value that is greater, and blue denotes a value that is lower. The more dispersed the points of the graph represent, the greater the impact of the feature on the model. (B) The average SHAP values were calculated and ranked in descending order, with each row representing a distinct feature and the horizontal axis indicating the magnitude of its SHAP value. (C) Rows represent features, and columns represent samples. The color intensity reflects the magnitude of feature values on the model output (SHAP value). Red has a greater influence, while blue has a lesser influence. (D) The features were listed in order of decreasing importance, based on their cumulative SHAP values across the plotted observations. Every line converges to a single point at −0.323.

4 Discussion

Spontaneous SAH is an acute cerebrovascular disease with a life-threatening neurological condition, which imposes a heavy burden on individuals, families, and even society (19). Once the disease occurs, most patients will be sent to the ICU for rescue and treatment. However, many patients still have poor prognosis or even death (19). Currently, the majority of studies are limited to the impact of a single conventional indicator or factor on mortality, and they lack an analysis of multiple causes of death. There are some studies that have reported that gender, WFNS class, APACHE II score, IL-6, Hunt and Hess grade, troponin I, white blood cell count, and electrocardiographic abnormalities are associated with spontaneous SAH (20).

We extracted relevant indicators from the MIMIC-IV database as comprehensively as possible for machine learning. To refine the feature selection process, we applied collinearity analysis and LASSO regression, thereby effectively reducing the influence of multicollinearity on the selected features. Subsequently, multiple models were developed using various machine learning algorithms. These models were evaluated based on multiple parameters to ensure that the final model had better and more stable performance in predicting the outcome of death.

We found, quite interestingly, patients with hyperglycemia on admission had increased mortality most significantly. A retrospective analysis showed that admission hyperglycemia was associated with significantly increased mortality in critically ill patients with SAH (21). The causes of admission hyperglycemia, which could either be pre-existing diabetes mellitus or stress-induced hyperglycemia. Stress-induced hyperglycemia was an independent risk factor for pulmonary infection and death after intracranial hemorrhage (22). In clinical practice, blood glucose is an indicator that can be quickly obtained. In subsequent studies, we will focus on blood glucose to explore the mechanism of hyperglycemia on the death of spontaneous SAH patients. Rapidly detecting blood glucose and reducing hyperglycemia that may occur in the early stage of the disease are likely to become a potent measure to reduce the mortality rate of patients.

In our study, low SpO₂ indicates increased mortality, which may be related to the lack of oxygen in patients. However, a study utilizing data from large ICU databases revealed a U-shaped relationship between SpO₂ levels and mortality among patients (23). For patients with TBI and SAH, maintaining SpO₂ at 94–96% will minimize the in-hospital mortality of patients. In the other study, the optimal range of SpO₂ was 94% to 98% (24). The patients who were within the optimal range of SpO₂ were associated with decreased hospital mortality (23). This finding is different from the results of our study. In the subsequent research, we can conduct a separate analysis of the relationship between SpO₂ and mortality.

Regardless of whether baseline analysis or machine learning models were used, older age was associated with a higher risk of mortality. It is well-established that as patients grow older, their organ functions become increasingly susceptible to failure (25). Therefore, when elderly patients experience an acute disease, like spontaneous SAH, they will probably be at relatively high risk of death.

In terms of vital signs, patients with higher heart rates on ICU admission had a higher risk of death. Studies have shown that increased sympathetic nerve stimulation in patients with subarachnoid hemorrhage leads to increased heart rate and even arrhythmia, and severe sympathetic nerve stimulation can greatly increase the risk of cardiac arrest, thereby increasing mortality (26). Therefore, reducing the heart rate of patients within the normal range and correcting arrhythmia immediately on admission can reduce the mortality of patients.

The on-admission platelet count was found to be significant and predictive of patient outcome on discharge (27). In our study, spontaneous SAH patients with low platelet counts have an increased risk of death. According to the literature, thrombocytopenia has been identified as an independent risk factor for symptomatic vasospasm following aneurysmal subarachnoid hemorrhage (28). Additionally, thrombocytopenia is associated with an increased risk of bleeding, which in turn elevates the mortality risk among patients.

We analyzed serum sodium, potassium, and calcium levels and found that serum sodium may be associated with the risk of death, with higher sodium levels at ICU admission indicating a higher risk of death. A study found that high serum sodium levels are related to higher ICU and hospital mortality in patients with non-traumatic SAH (29). Increased intracranial pressure (ICP) impairs hypothalamic function, thereby resulting in electrolyte disturbances in the body (30). Therefore, timely identification and correction of electrolyte disturbances is critical to prevent permanent central nervous system damage. Elevated serum sodium levels are indicative of severe intracranial hemorrhage and significant neurological dysfunction, potentially guiding clinicians to promptly initiate pharmacological interventions or surgical procedures aimed at reducing ICP.

Impaired kidney or liver function was also associated with high mortality (31). Several studies have shown that the underlying mechanism may involve renal failure leading to severe electrolyte disturbances and acid-base imbalances (32), as well as liver failure resulting in coagulopathy, which increases the risk of bleeding (33). This suggests that for patients with spontaneous SAH, early initiation of liver and renal protection treatment may be conducive to reducing the mortality of patients.

Several grading systems have been used to predict the outcome of critically ill patients (34). SAPS II score, APS III score, and SOFA score are related to the health status and organ function of patients. The higher the score, the worse the health status of the patients and the higher the risk of death (35).

Furthermore, the best performance of a model under the current methodology reflects the effectiveness of the selection process rather than an inherent advantage of the model itself. While LR remains remarkably competitive in the wave of Artificial Intelligence (AI), no single model is universally superior—optimal performance is fundamentally contingent on careful matching of algorithmic strengths to the specific data characteristics and specific research constraints at hand.

The study possesses several notable strengths: (1) The data were sourced from a large, publicly accessible database on the Internet, ensuring reliability and representativeness. (2) Collinearity analysis was used for feature selection, enhancing the robustness of the model. (3) Multiple machine learning algorithms were used to construct models capable of ranking feature importance. (4) The final 12 selected features are readily available clinically. There is no analysis of the combined effects of these features on SAH.

Additionally, this study had several limitations: (1) The study lacks external validation. We have initiated the collection of relevant data from our hospital and plan to conduct further research upon reaching the target sample size. (2) This study captured laboratory indicators only on the first day of ICU admission, lacking dynamic monitoring of these indicators over time. (3) In future follow-up studies, targeting aneurysmal SAH exclusively could help exclude the influence of etiology on the results, thereby enhancing the specificity of the findings.

5 Conclusion

Our study develops an interpretable machine learning model to predict the risk factors and mortality in patients with spontaneous SAH. We selected the best-performing model among the eight models, namely the LR model. This model incorporates 12 features. Finally, SHAP was used to interpret the model to improve the interpretability of the model. This study may facilitate the early identification of mortality risk in patients with spontaneous SAH, thereby enabling timely intervention. Moreover, it can assist clinicians in optimizing patient management under resource constraints, thus reducing mortality risk and improving clinical outcomes.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article.

Ethics statement

The studies involving humans were approved by the Institutional Review Boards of both Beth Israel Deaconess Medical Center (BIDMC) and the Massachusetts Institute of Technology (MIT). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants' legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

WH: Data curation, Formal analysis, Writing – review & editing, Writing – original draft. DY: Conceptualization, Methodology, Writing – review & editing. LZ: Software, Writing – review & editing. JZ: Writing – review & editing, Project administration, Supervision.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Acknowledgments

We would like to thank the Massachusetts Institute of Technology and the Beth Israel Deaconess Medical Center for the Medical Information Mart for Intensive Care project. We would also like to thank all healthcare workers and analysis workers involved in this study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2025.1648353/full#supplementary-material

Supplementary Figure S1 | Collinearity analysis. VIF, variance inflation factor. VIF = 1: no multicollinearity, VIF between 1 and 10: moderate multicollinearity, VIF > 10: serious multicollinearity. Blue bars: the retained features. Green bars: the removed features.

Supplementary Table S1 | The missing data of this study.

References

1. Kagiyama N, Sugahara M, Crago EA, Qi Z, Lagattuta TF, Yousef KM, et al. Neurocardiac injury assessed by strain imaging is associated with in-hospital mortality in patients with subarachnoid hemorrhage. JACC Cardiovasc Imaging. (2020) 13:535–46. doi: 10.1016/j.jcmg.2019.02.023

PubMed Abstract | Crossref Full Text | Google Scholar

2. de Rooij NK, Linn FH, van der Plas JA, Algra A, Rinkel GJ. Incidence of subarachnoid haemorrhage: a systematic review with emphasis on region, age, gender, and time trends. J Neurol Neurosurg Psychiatry. (2007) 78:1365–72. doi: 10.1136/jnnp.2007.117655

PubMed Abstract | Crossref Full Text | Google Scholar

3. Macdonald RL, Schweizer TA. Spontaneous subarachnoid haemorrhage. Lancet. (2017) (10069):655–66. doi: 10.1016/S0140-6736(16)30668-7

PubMed Abstract | Crossref Full Text | Google Scholar

4. Maher M, Schweizer TA, Macdonald RL. Treatment of spontaneous subarachnoid hemorrhage. Stroke. (2020) 51:1326–32. doi: 10.1161/STROKEAHA.119.025997

PubMed Abstract | Crossref Full Text | Google Scholar

5. Song JP, Ni W, Gu YX, Zhu W, Chen L, Xu B, et al. Epidemiological features of nontraumatic spontaneous subarachnoid hemorrhage in China. Chin Med J. (2017) 130:776–81. doi: 10.4103/0366-6999.202729

PubMed Abstract | Crossref Full Text | Google Scholar

6. Etminan N, Chang HS, Hackenberg K, de Rooij NK, Vergouwen MDI, Rinkel GJE, et al. Worldwide incidence of aneurysmal subarachnoid hemorrhage according to region, time period, blood pressure, and smoking prevalence in the population: a systematic review and meta-analysis. JAMA Neurol. (2019) 6:588–97. doi: 10.1001/jamaneurol.2019.0006

PubMed Abstract | Crossref Full Text | Google Scholar

7. Smith EE, Shobha N, Dai D, Olson DM, Reeves MJ, Saver JL, et al. A risk score for in-hospital death in patients admitted with ischemic or hemorrhagic stroke. J Am Heart Assoc. (2013) 2:e005207. doi: 10.1161/JAHA.112.005207

PubMed Abstract | Crossref Full Text | Google Scholar

8. Zhang T, Rabhi F, Chen X, Paik HY, MacIntyre CR. A machine learning-based universal outbreak risk prediction tool. Comput Biol Med. (2024) 169:107876. doi: 10.1016/j.compbiomed.2023.107876

PubMed Abstract | Crossref Full Text | Google Scholar

9. Johnson AEW, Bulgarelli L, Shen L, Gayles A, Shammout A, Horng S, et al. MIMIC-IV, a freely accessible electronic health record dataset. Sci Data. (2023) 10:1. doi: 10.1038/s41597-023-02136-9

PubMed Abstract | Crossref Full Text | Google Scholar

10. Khera R, Haimovich J, Hurley NC, McNamara R, Spertus JA, Desai N, et al. Use of machine learning models to predict death after acute myocardial infarction. JAMA Cardiol. (2021) 6:633–41. doi: 10.1001/jamacardio.2021.0122

PubMed Abstract | Crossref Full Text | Google Scholar

11. D'Ascenzo F, De Filippo O, Gallone G, Mittone G, Deriu MA, Iannaccone M, et al. Machine learning-based prediction of adverse events following an acute coronary syndrome (PRAISE): a modelling study of pooled datasets. Lancet. (2021) 397:199–207. doi: 10.1016/S0140-6736(20)32519-8

PubMed Abstract | Crossref Full Text | Google Scholar

12. Savarraj JPJ, Hergenroeder GW, Zhu L, Chang T, Park S, Megjhani M, et al. Machine learning to predict delayed cerebral ischemia and outcomes in subarachnoid hemorrhage. Neurology. (2021) 96:e553–62. doi: 10.1212/WNL.0000000000011211

PubMed Abstract | Crossref Full Text | Google Scholar

13. Zarrin DA, Suri A, McCarthy K, Gaonkar B, Wilson BR, Colby GP, et al. Machine learning predicts cerebral vasospasm in patients with subarachnoid haemorrhage. EBioMedicine. (2024) 105:105206. doi: 10.1016/j.ebiom.2024.105206

PubMed Abstract | Crossref Full Text | Google Scholar

14. Towards Data Science. (2025). When predictors collide: mastering VIF in multicollinear regression. Available online at: https://towardsdatascience.com/when-predictors-collide-mastering-vif-in-multicollinear-regression/ [Accessed May 15, 2025].

Google Scholar

15. Yoo W, Mayberry R, Bae S, Singh K, Peter He Q, Lillard JW Jr. A study of effects of multi collinearity in the multivariable analysis. Int J Appl Sci Technol. (2014) 4:9–19.

Google Scholar

16. Xi LJ, Guo ZY, Yang XK, Ping ZG. Application of LASSO and its extended method in variable selection of regression analysis. Chin J Prevent Med. (2023) 57:107–11. doi: 10.3760/cma.j.cn112150-20220117-00063

PubMed Abstract | Crossref Full Text | Google Scholar

17. Heymans MW, Twisk JWR. Handling missing data in clinical research. J Clin Epidemiol. (2022) 151:185–88. doi: 10.1016/j.jclinepi.2022.08.016

PubMed Abstract | Crossref Full Text | Google Scholar

18. Austin PC, White IR, Lee DS, van Buuren S. Missing data in clinical research: a tutorial on multiple imputation. Can J Cardiol. (2021) 37:1322–31. doi: 10.1016/j.cjca.2020.11.010

PubMed Abstract | Crossref Full Text | Google Scholar

19. Robba C, Busl KM, Claassen J, Diringer MN, Helbok R, Park S, et al. Contemporary management of aneurysmal subarachnoid haemorrhage. An update for the intensivist. Intensive Care Med. (2024) 50:646–64. doi: 10.1007/s00134-024-07387-7

PubMed Abstract | Crossref Full Text | Google Scholar

20. Wang M, Pan W, Xu Y, Zhang J, Wan J, Jiang H. Prevalence, in-hospital mortality, and factors related to neurogenic pulmonary edema after spontaneous subarachnoid hemorrhage: a systematic review and meta-analysis. Neurosurg Rev. (2023) 46:169. doi: 10.1007/s10143-023-02081-6

PubMed Abstract | Crossref Full Text | Google Scholar

21. Liu D, Tang Y, Zhang Q. Admission hyperglycemia predicts long-term mortality in critically Ill patients with subarachnoid hemorrhage: a retrospective analysis of the MIMIC-III database. Front Neurol. (2021) 12:678998. doi: 10.3389/fneur.2021.678998

PubMed Abstract | Crossref Full Text | Google Scholar

22. Chen S, Wan Y, Guo H, Shen J, Li M, Xia Y, et al. Diabetic andstress-induced hyperglycemia in spontaneous intracerebral hemorrhage: A multicenter prospective cohort (CHEERY) study. CNS Neurosci Ther. (2023) 29:979–87. doi: 10.1111/cns.14033

PubMed Abstract | Crossref Full Text | Google Scholar

23. van den Boom W, Hoy M, Sankaran J, Liu M, Chahed H, Feng M, et al. The search for optimal oxygen saturation targets in critically Ill patients: observational data from large ICU databases. Chest. (2020) 157:566–73. doi: 10.1016/j.chest.2019.09.015

PubMed Abstract | Crossref Full Text | Google Scholar

24. Yin H, Yang R, Xin Y, Jiang T, Zhong D. In-hospital mortality and SpO₂ incritical care patients with cerebral injury: data from the MIMIC-IV Database. BMC Anesthesiol. (2022) 22:386. doi: 10.1186/s12871-022-01933-w

PubMed Abstract | Crossref Full Text | Google Scholar

25. Sasaki T, Naraoka M, Shimamura N, Takemura A, Hasegawa S, Akasaka K, et al. Factors affecting outcomes of poor-grade subarachnoid hemorrhage. World Neurosurg. (2024) 185:e516–522. doi: 10.1016/j.wneu.2024.02.064

PubMed Abstract | Crossref Full Text | Google Scholar

26. Borutta MC, Gerner ST, Moeser P, Hoelter P, Engelhorn T, Doerfler A, et al. Correlation between clinical severity and extent of autonomic cardiovascular impairment in the acute phase of subarachnoid hemorrhage. J Neurol. (2022) 269:5541–52. doi: 10.1007/s00415-022-11220-w

PubMed Abstract | Crossref Full Text | Google Scholar

27. Fischer I, Lala R, Donaldson DM, Schieferdecker S, Hofmann BB, Cornelius JF, et al. Prognostic value of platelet levels in patients with aneurysmal Subarachnoid Hemorrhage. Sci Rep. (2024) 14:16743. doi: 10.1038/s41598-024-67322-0

PubMed Abstract | Crossref Full Text | Google Scholar

28. Hirashima Y, Hamada H, Kurimoto M, Origasa H, Endo S. Decrease in platelet count as an independent risk factor for symptomatic vasospasm following aneurysmal subarachnoid hemorrhage. J Neurosurg. (2005) 102:882. doi: 10.3171/jns.2005.102.5.0882

PubMed Abstract | Crossref Full Text | Google Scholar

29. Liu J, Li J, Zhang Q, Wang L, Wang Y, Zhang J, et al. Association between serum sodium levels within 24 h of admission and all-cause mortality in critically ill patients with non-traumatic subarachnoid hemorrhage: a retrospective analysis of the MIMIC-IV database. Front Neurol. (2023) 14:1234080. doi: 10.3389/fneur.2023.1234080

PubMed Abstract | Crossref Full Text | Google Scholar

30. Espay AJ. Neurologic complications of electrolyte disturbances and acid-base balance. Handb Clin Neurol. (2014) 119:365–82. doi: 10.1016/B978-0-7020-4086-3.00023-0

PubMed Abstract | Crossref Full Text | Google Scholar

31. Vanent KN, Leasure AC, Acosta JN, Kuohn LR, Woo D, Murthy SB, et al. Association of chronic kidney disease with risk of intracerebral hemorrhage. JAMA Neurol. (2022) 79:911. doi: 10.1001/jamaneurol.2022.2299

PubMed Abstract | Crossref Full Text | Google Scholar

32. Prough DS. Physiologic acid-base and electrolyte changes in acute and chronic renal failure patients. Anesthesiol Clin North Am. (2000) 18:809–33. doi: 10.1016/s0889-8537(05)70196-6

PubMed Abstract | Crossref Full Text | Google Scholar

33. Lagman C, Nagasawa DT, Azzam D, Sheppard JP, Chen CHJ, Ong V, et al. Survival outcomes after intracranial hemorrhage in liver disease. Oper Neurosurg. (2019) 16:138–46. doi: 10.1093/ons/opy096

PubMed Abstract | Crossref Full Text | Google Scholar

34. Basile-Filho A, Lago AF, Menegueti MG, Nicolini EA, Nunes RS, Lima SL, et al. The use of SAPS 3, SOFA, and Glasgow Coma Scale to predict mortality in patients with subar.chnoid hemorrhage: a retrospective cohort study. Medicine. (2018) 7:e12769. doi: 10.1097/MD.0000000000012769

PubMed Abstract | Crossref Full Text | Google Scholar

35. Kurtz P, Taccone FS, Bozza FA, Bastos LSL, Righy C, Gonçalves B, et al. Systemic severity and organ dysfunction in subarachnoid hemorrhage: a large retrospective multicenter cohort study. Neurocrit Care. (2021) 35:56–61. doi: 10.1007/s12028-020-01139-3

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: subarachnoid hemorrhage, intensive care unit, machine learning, predictive model, MIMIC-IV database

Citation: Hu W, Yu D, Zhang L and Zhang J (2025) Machine learning models for mortality prediction in patients with spontaneous subarachnoid hemorrhage following ICU treatment. Front. Neurol. 16:1648353. doi: 10.3389/fneur.2025.1648353

Received: 17 June 2025; Accepted: 27 August 2025;
Published: 17 September 2025.

Edited by:

Elisa Gouvêa Bogossian, Université Libre de Bruxelles, Belgium

Reviewed by:

Michael Veldeman, RWTH Aachen University, Germany
Elda Diletta Sterchele, Université Libre de Bruxelles, Belgium

Copyright © 2025 Hu, Yu, Zhang and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jing Zhang, empzanp6QGhibXUuZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.