Skip to main content

ORIGINAL RESEARCH article

Front. Neurol., 08 August 2023
Sec. Stroke

Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation

  • 1Emergency Department, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
  • 2The Graduate School of Guangxi University of Traditional Chinese Medicine, Nanning, China
  • 3Department of Cardiology, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
  • 4Department of Neurosurgery, Xiu Shan People's Hospital, Chongqing, China
  • 5Department of Critical Care Medicine, Beijing Shijitan Hospital, Capital Medical University, Beijing, China
  • 6Department of Cardiology, Daping Hospital, The Third Military Medical University (Army Medical University), Chongqing, China
  • 7Faculty of Medicine, Macau University of Science and Technology, Taipa, Macao SAR, China
  • 8College of Medical Informatics, Chongqing Medical University, Chongqing, China
  • 9Key Laboratory of Novel Materials for Sensor of Zhejiang Province, College of Materials and Environmental Engineering, Hangzhou Dianzi University, Hangzhou, China
  • 10Department of Neurology, The Second Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China

Background: Timely and accurate outcome prediction plays a critical role in guiding clinical decisions for hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. However, interpreting and translating the predictive models into clinical applications are as important as the prediction itself. This study aimed to develop an interpretable machine learning (IML) model that accurately predicts 28-day all-cause mortality in hypertensive ischemic or hemorrhagic stroke patients.

Methods: A total of 4,274 hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU in the USA from multicenter cohorts were included in this study to develop and validate the IML model. Five machine learning (ML) models were developed, including artificial neural network (ANN), gradient boosting machine (GBM), eXtreme Gradient Boosting (XGBoost), logistic regression (LR), and support vector machine (SVM), to predict mortality using the MIMIC-IV and eICU-CRD database in the USA. Feature selection was performed using the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm. Model performance was evaluated based on the area under the curve (AUC), accuracy, positive predictive value (PPV), and negative predictive value (NPV). The ML model with the best predictive performance was selected for interpretability analysis. Finally, the SHapley Additive exPlanations (SHAP) method was employed to evaluate the risk of all-cause in-hospital mortality among hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU.

Results: The XGBoost model demonstrated the best predictive performance, with the AUC values of 0.822, 0.739, and 0.700 in the training, test, and external cohorts, respectively. The analysis of feature importance revealed that age, ethnicity, white blood cell (WBC), hyperlipidemia, mean corpuscular volume (MCV), glucose, pulse oximeter oxygen saturation (SpO2), serum calcium, red blood cell distribution width (RDW), blood urea nitrogen (BUN), and bicarbonate were the 11 most important features. The SHAP plots were employed to interpret the XGBoost model.

Conclusions: The XGBoost model accurately predicted 28-day all-cause in-hospital mortality among hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. The SHAP method can provide explicit explanations of personalized risk prediction, which can aid physicians in understanding the model.

1. Introduction

Stroke is the second most common cause of death and the third leading cause of disability worldwide, imposing a substantial economic burden in terms of healthcare costs and reduced productivity (1, 2). Low- and lower-middle-income countries bear the majority of the global stroke burden, accounting for 86% of stroke-related fatalities (2). The incidence of cerebral ischemic stroke is significantly higher than that of hemorrhagic stroke, with ischemic stroke being the more prevalent type. Ischemic stroke accounts for ~87% of all stroke cases, while intracerebral hemorrhage and subarachnoid hemorrhage contribute to 10 and 3% of strokes, respectively (3). Hypertension is the most prevalent modifiable risk factor for stroke in both industrialized and developing nations (4). It is an indicator of poor prognosis in 70% or more of individuals with acute ischemic or hemorrhagic stroke (5). Moreover, the intricate interaction between hypertension and other modifiable risk factors, including smoking, high body mass index, diabetes mellitus, and high cholesterol, substantially increases the overall risk of cardiovascular and cerebrovascular diseases in individuals (4).

The prevalence of patients with hypertension among stroke patients is high, and there is no prediction model for predicting 28-day in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU. Precise and adaptable assessment tools play a crucial role in the early identification of high-risk patients in the ICU. Conventional approaches, such as the Cox proportional hazard model, are inefficient in examining the intricate non-linear relationships within the data (6, 7). Machine learning (ML) is increasingly utilized in medicine to quantify risk, identify predictors, and develop highly accurate prediction models for diagnosis and prognosis (8, 9).

In the present study, five ML models, including artificial neural network (ANN), gradient boosting machine (GBM), eXtreme Gradient Boosting (XGBoost), logistic regression model (LR), and support vector machine (SVM), were constructed to explore the risk factors of hypertensive ischemic or hemorrhagic stroke patients in the ICU and to support clinical decision-making based on clinical characteristics. Additionally, an interpretable machine learning (IML) approach was employed to predict the 28-day in-hospital mortality of hypertensive patients with ischemic or hemorrhagic stroke who were admitted to the ICU, using SHapley Additive exPlanations (SHAP) values and feature significance.

2. Materials and methods

2.1. Data source

Data for this study were collected from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database (https://mimic.physionet.org/, certification ID: 42039823) and the Collaborative Research Database (eICU-CRD, https://eicu-crd.mit.edu/). The MIMIC-IV database is a publicly accessible intensive care database that comprises de-identified clinical data from over 70,000 ICU hospitalizations in the USA from 2008 to 2019 (10). The eICU-CRD database is a multi-center intensive care database that is made available to the public by Philips Healthcare in collaboration with the MIT Laboratory for Computational Physiology. It contains de-identified clinical data for over 200,000 patients who were admitted to the ICU from 2014 to 2015 (11). The de-identified health information of patients was collected, and informed consent was not required for this study.

2.2. Study population and outcome

Hypertensive ischemic or hemorrhagic stroke patients in the last ICU stay were enrolled in the study cohort. Ischemic or hemorrhagic stroke and hypertensive patients were found to use diagnosis codes from the International Classification of Diseases, Ninth Revision (ICD-9). The screening process of the patients included in this study is shown in Figure 1. The exclusion criteria of this study include the following: (1) <24 h of ICU stay; (2) more than 28 days of ICU stay; (3) patients <18 years of age; and (4) patients with missing data (death). The primary outcome of this study is 28-day in-hospital mortality in the ICU.

FIGURE 1
www.frontiersin.org

Figure 1. The flow chart of patient selection. The study included 2,526 patients with hypertension and hypertension ischemic stroke.

2.3. Data extraction and preprocessing

Clinical data of all participants were collected from the eICU-CRD and MIMIC-IV database based on previously published literature with relevant topics (12, 13). A total of 41 predictor features consisting of demographics, laboratory tests, and co-morbidities were analyzed. Features with missing values of more than 30% were excluded to guarantee a higher accuracy of the outcome, and the k-Nearest Neighbors (kNN) imputation was applied to impute the missing values. The R package “DMwR2” was used for kNN imputation.

2.4. Construction of the machine learning model

In our study, models were developed to predict the 28-day in-hospital mortality of hypertensive ischemic or hemorrhagic stroke patients using five widely used algorithms, including ANN, GBM, LR, XGBoost, and SVM. All continuous variables were rescaled to have a distribution with a mean of 0 and a standard deviation of 1 using scale transformation to increase the stability of the prediction models. To choose the optimal prediction model for each algorithm with various tuning parameters, 5-fold cross-validation was applied to the ML models that needed tuning. The accuracy or receiver operating characteristic (ROC) was chosen as the metric during the search procedure. The testing set was solely utilized for model evaluation after concluding the complete model selection and training procedure. It was not employed during model tuning.

2.5. Model assessment

The confusion matrix metrics of accuracy and area under the receiver operating characteristic curve (AUROC) were used to assess the final models. Based on the prediction probabilities, the ROC curves were developed. Then, the model with the best predictive performance was identified by comparing the AUC values of the models in the testing data set.

2.6. Interpretation analysis

2.6.1. Feature importance

Feature ranking evaluation refers to a method of measuring the significance of each feature in the feature set based on its impact on the final classification outcome. Feature importance was measured using the “shapviz” package, which describes any classifier's predictions understandably and faithfully by learning an understandable model locally around the prediction. Relative variable importance was computed and presented to seek out the effect of features on the predictive models.

2.6.2. Shapley additive explanation (SHAP) value

The SHAP value of features was evaluated using the “shapviz” package. We selected SHAP summary, SHAP force plot, and SHAP waterfall to evaluate the SHAP value of features, which would increase the clinical utility of the predictive models.

2.7. Statistical analysis

The original dataset was randomly divided into the training set (n = 2,031) for developing the models and the testing set (n = 495) for evaluating the models' performance, based on a ratio of 8:2 in the eICU-CRD database. External validation was performed using the MIMIC-IV database. In the training set, testing set, and MIMIC-IV database, continuous data with normal distribution were demonstrated as the mean with standard errors, continuous data with non-normal distribution were demonstrated as the median with interquartile range (IQR), and categorical data were demonstrated as the frequency (percentage). A chi-squared test was performed to compare the qualitative features. To regularize the results of the statistical analysis for potential confounding factors, LASSO regression analysis was performed to predict 28-day in-hospital mortality in hypertensive patients with ischemic and hemorrhagic stroke. This enhances the prediction accuracy and the interpretation ability of a statistical model and is appropriate for high-dimensional data reduction. To guarantee minimized autocorrelation, features having non-zero coefficients were chosen for the additional analysis in the LASSO regression model. R 4.1.3 and Rstudio 1.1.463 were used for all statistical analyses. The R package “caret” was used to pre-process the data, tune the parameters, and train the model. The R package “shapviz” was used to evaluate the SHAP value and feature importance. The R package “rcs” was used to evaluate the cutoff value of features. A forest plot was performed using the package “forestplot.” The LASSO and logistic regression analyses were performed using the R package “glmnet.” To evaluate the effectiveness of each model, the ROC curve analysis and the AUC were computed using the “pROC” and ggplot2 packages. All P-values were two-sided, and features with a P-value of <0.05 were deemed statistically significant.

3. Results

3.1. Patient characteristics

This study comprised 2,526 hypertensive ischemic or hemorrhagic stroke patients who were admitted to the ICU. Patient characteristics are shown in Table 1. In total, the median age was 71 years, and 1,256 patients (50%) were women. Furthermore, 269 (11%) patients had myocardial ischemia (MI), 255 (10%) had chronic heart failure (CHF), 128 (5%) had dementia, 409 (16%) had chronic obstructive pulmonary disease (COPD), 599 (24%) had diabetes, 738 (29%) had atrial fibrillation (AF), and 1,330 (51%) had hyperlipidemia. The baseline characteristics of the training and testing sets did not differ significantly. In this study, 490 (19%) patients experienced in-hospital death. Baseline data for external validation sets are shown in Supplementary Table 1.

TABLE 1
www.frontiersin.org

Table 1. Baseline characteristics, laboratory parameters, vital signs, and statistical results of patients with hypertension and ischemic stroke in the training and testing sets.

3.2. Selection of predictors

In the LASSO method, the penalty on the β-coefficients was controlled by the tuning parameter λ (λ = 0.02574252; lambda.1se; Figure 2). Eleven features with non-zero coefficients were selected, including ethnicity, age, peripheral oxygen saturation (SpO2), white blood cell (WBC), mean corpuscular volume (MCV), red blood cell distribution width (RDW), bicarbonate, blood urea nitrogen (BUN), calcium, glucose, and hyperlipidemia.

FIGURE 2
www.frontiersin.org

Figure 2. The result of the Least Absolute Shrinkage and Selection Operator (LASSO) method for filtering variables. (A) Coefficients of all predictors gradually returning to zeros by used 10-fold cross-validation. (B) 11 predictors with non-zero coefficients at the rightmost dashed line.

3.3. Model performance

Five models including ANN, GBM, LR, XGBoost, and SVM were developed. Each model was evaluated by the AUROC and accuracy. All models had accuracy values of 0.80 and above in the testing set. The accuracy of the ANN, GBM, LR, XGBoost, and SVM in the testing (training) set was 0.802 (0.833), 0.808 (0.874), 0.804 (0.814), 0.818 (0.822), and 0.816 (0.871), respectively (Table 2). The AUROCs of the ANN, GBM, LR, XGBoost, and SVM in the testing set were 0.720, 0.738, 0.723, 0.739, and 0.719, respectively. The ROC curves and AUROC of different models in the testing and training sets are shown in Figures 3A, B. Meanwhile, the AUROCs of the Acute Physiology Score III (APS III) and Glasgow Coma Scale (GCS) scoring were 0.766 and 0.695 in the testing set, and the AUROC of the XGBoost model was 0.700 in the MIMIC-IV database (Figure 3C). Figure 4 shows that the prediction of the XGBoost model in the training and testing sets is in good agreement with the actual outcome, and the model calibration performance is good.

TABLE 2
www.frontiersin.org

Table 2. Assessment of each model by accuracy (95%CI), PPV, and NPV in the training and testing sets.

FIGURE 3
www.frontiersin.org

Figure 3. The receiver operator characteristic (ROC) curves for the ML models predict all-cause mortality in patients with hypertension and ischemic stroke (the training, testing, and external validation sets). The ROC curves of the nine ML models predicting all-cause mortality in the testing (A), training (B), and external validation sets (C), respectively.

FIGURE 4
www.frontiersin.org

Figure 4. Calibration plots of the XGBoost model in the training (A) and testing sets (B).

3.4. Feature importance

The 11 most important features of the best ML model were calculated, as shown in Figure 5A. The characteristics of the laboratory test, including glucose, WBC, calcium, BUN, MCV, RDW, and bicarbonate, vital signs, such as SpO2, and comorbidity, such as hyperlipidemia, significantly affected most predictive models. Demographic characteristics including age and ethnicity also significantly affected most predictive models.

FIGURE 5
www.frontiersin.org

Figure 5. The model's interpretation: (A) the importance ranking of the top 11 variables according to the mean (SHAP value) in the XGBoost model and (B) the importance ranking of the top 11 risk factors in the LR model. The higher the SHAP value of a feature, the higher the risk of death for the patient.

Meanwhile, the 11 most important features of the LR model in the ascending order were age, ethnicity, WBC, hyperlipidemia, MCV, glucose, SpO2, calcium, RDW, BUN, and bicarbonate, as shown in Figure 5B. There were similarities in the most important features between the XGBoost and the LR model (Figure 5).

3.5. SHAP values of features

The SHAP values of features are summarized in Figure 6A. The SHAP values of patients' features, including RDW of 14%, calcium of 8.5 mg/dL, BUN of 27 mg/dL, ethnicity of 0, glucose of 162 mg/dL, age of 82.4 years, SpO2 of 100%, MCV of 101 fl, WBC of 11.7*109/L, bicarbonate of 20 mmol/L, and hyperlipidemia of 1, are shown in Figure 6B. Furthermore, the features based on their contribution to the model are glucose, age, SpO2, WBC, ethnicity, calcium, BUN, MCV, RDW, bicarbonate, and hyperlipidemia, in descending order. Figure 7 shows that glucose, age, WBC, BUN, MCV, and RDW are positively correlated with SHAP values, suggesting that the higher their levels, the greater the SHAP values, which is conducive to the occurrence of outcome events.

FIGURE 6
www.frontiersin.org

Figure 6. (A) Scatter plot of feature values and SHAP values. (B) Consent waterfall plot showing an example of interpretability analysis for a patient. The yellow part of the feature value represents a higher value. The purple part of the feature value represents a lower value.

FIGURE 7
www.frontiersin.org

Figure 7. A scatter plot of variable values vs. SHAP values. Glucose (A); Age (B); SpO2 (C); WBC (D); Ca (E); BUN (F); MCV (G); RDW (H); Bicarbonate (I).

3.6. Cutoff values of features

The RCS showed the cutoff values of features, including SpO2 of 98%, age of 71.27 years, MCV of 91 fl, RDW of 13.6%, bicarbonate of 24 mmol/L, BUN of 17 mg/dL, calcium of 8.8 mg/dL, and glucose of 130 mg/dL after adjustment for covariates (Figure 8). These cutoff values were consistent with the tendency of SHAP values (Figure 7). The calibration plot showed a high degree of predictability between the actual and predicted probabilities (Figure 4). Univariate adjusted RCS analysis of the relationship between SpO2 and the outcome showed that the cutoff value was 92.6%, which indicated a low level of oxygen saturation.

FIGURE 8
www.frontiersin.org

Figure 8. RCS showing the cutoff values of features, including SpO2 of 98%, age of 71.27 years, MCV of 91 fl, RDW of 13.6%, bicarbonate of 24 mmol/L, BUN of 17 mg/dL, calcium of 8.8 mg/dL, and glucose of 130 mg/dL.

3.7. Subgroup analysis of ischemic stroke and hemorrhagic stroke

Important features such as ethnicity, SpO2, age, MCV, RDW, BUN, calcium, glucose, hyperlipidemia, and WBC were independent risk factors in the ischemic stroke subgroup. Furthermore, important features such as ethnicity, age, MCV, RDW, calcium, hyperlipidemia, and WBC were independent risk factors in the intracerebral hemorrhage group (Table 3).

TABLE 3
www.frontiersin.org

Table 3. A subgroup analysis of ischemic stroke and hemorrhagic stroke.

4. Discussion

This is the first study to develop and validate an explicable ML-based prediction model to identify risk factors for 28-day in-hospital mortality of hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU by using data from the eICU-CRD and MIMIC-IV databases. The XGBoost model showed excellent performance (AUC > 0.7) in this study and had good consistency with the LR model in terms of feature importance. Furthermore, the features were segmented into value ranges, making them more suitable for predicting 28-day in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU.

In our study, the 28-day in-hospital mortality was 490 (19%) for hypertensive ischemic or hemorrhagic stroke patients in the eICU-CRD database, which was similar to another cross-sectional study conducted in the USA from 2007 to 2016 (mortality: 21.6%) (14). In our prediction model, the glucose level was the most crucial indicator of in-hospital death, and higher blood glucose levels were associated with increased 28-day in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU. Diabetes is a well-known risk factor for stroke (15, 16). The direct effects of hyperglycemia on brain tissues are possible, but it can also cause microvascular alterations due to an increase in glucose flux, the disruption of intracellular second messenger pathways, an imbalance in the production and scavenging of reactive oxygen species, and advanced glycation of crucial functional and structural proteins (17). In addition to studying the relationship between diabetes and stroke, more and more researchers are paying attention to the relationship between prediabetes and stroke. In the study by Wang et al. (18), prediabetes (plasma glucose concentration between 100 and 125 mg/dL) was significantly associated with the risks of total stroke [hazard ratio (HR) 1.33, 95% confidence interval (CI) 1.18–1.52, P = 0.0147] and ischemic stroke [HR 1.33, (95% CI 1.16–1.54), P = 0.0413]. In our model, the cutoff value of glucose was 130 mg/dL, which was slightly higher than the normal reference value. Sometimes, clinicians may ignore this slightly elevated blood glucose level. Therefore, our predictive model emphasized this point, which can alert physicians of the severity of the disease for better glycemic management.

Excessive intake of high-cholesterol diet results in elevated blood lipid levels, which causes hyperlipidemia. Numerous studies have shown that hyperlipidemia is a major risk factor for stroke, myocardial infarction, sudden cardiac death, cerebrovascular accidents, and other conditions (19, 20). Studies during the last decade suggest that hyperlipidemia is associated not only with the occurrence of stroke but also with the prognosis of patients after stroke (2123). This is probably because patients with hyperlipidemia tend to have lower white matter hyperintensity volumes, which have been shown to forecast the progression of infarcts after stroke and result in less favorable clinical outcomes (21, 24). In our prediction model, the risk of death was higher in hypertensive stroke patients with hyperlipidemia, which is consistent with previous studies (23).

According to the model's feature importance, age significantly influenced predictive models. Stroke primarily affects older adults, particularly those over 65, and age significantly affects their prognosis (25). Multiple studies have shown that older adult patients have a higher mortality rate and a lower quality of life following stroke than younger patients (26). In our study, when age was higher than 71. 27 years, it indicated an adverse outcome. In addition, another demographic indicator, ethnicity, also played an important role. Our model, as well as other pieces of evidence (27, 28), suggested that genetic studies could help differentiate stroke subtypes and even assist in patient management.

In this study, several laboratory tests, such as WBC, calcium, MCV, RDW, BUN, and bicarbonate, played important roles in our prediction model. Specifically, WBC is often associated with inflammation. Neuroendocrine hormones that are discharged during an immediate stressful situation can cause an immunological response in stroke patients with an elevated WBC count (29). Zheng et al. (30) found that elevated WBC on admission was associated with death and major disability at 3 months after acute ischemic stroke, and the association was linear (P for linear trend = 0.001). However, in another study, the association between WBC count and death at 3 months was not significant (P = 0.426) in patients with intracerebral hemorrhage after adjusting confounding factors such as age, sex, and glucose. In our subgroup analysis, an elevated WBC count was associated with increased in-hospital mortality both in ischemic and hemorrhagic stroke. Emerging data about calcium indicate that abnormalities in blood calcium are associated with the risk of stroke (31) and mortality in patients with coronary heart disease (32, 33). In our model, when the serum calcium was <8.8 mg/dL, it indicated an adverse outcome. Furthermore, MCV, RDW, BUN, and bicarbonate substantially contributed to our model. Elevated MCV, RDW, and BUN levels have been associated with increased in-hospital mortality. Additionally, lower bicarbonate, which may suggest metabolic acidosis, indicated a higher risk of mortality.

The characteristics of vital indicators such as SpO2 were observed to also affect most predictive models. Specifically, SpO2 is a crucial physiological metric for determining how much oxygen is supplied to the human body. It quantifies the proportion of oxygenated hemoglobin to the total hemoglobin. In our prediction model, the cutoff value was 98%, which is generally considered clinically normal. Moreover, in the subgroup analysis, SpO2 was not significant in the intracerebral hemorrhage group. Therefore, further research is required in this area in the future.

In this study, to precisely identify 28-day in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU, supervised ML models, such as the ANN, GBM, LR, SVM, and XGBoost, were employed. However, the ML model's operation is in a black box state. In this study, the model with the best average prediction performance on the testing set was considered the best model. This study developed an IML model based on the XGBoost model. Therefore, by developing an interpretable ML model using the shapviz and caret packages, we established the model's ability to depict key features and constructed a high-accuracy mortality prediction model for hypertensive ischemic or hemorrhagic stroke patients admitted to the ICU. The interpretation of feature importance was illustrated by plotting for feature importance and SHAP value. The 11 most important features in descending order were glucose, age, SpO2, WBC, ethnicity, calcium, BUN, MCV, RDW, bicarbonate, and hyperlipidemia in the XGBoost model, which mainly resembled the important features of the LR model. Meanwhile, the multi-dimensional correlations between the characteristics of the patients and their outcomes were addressed through regularization and normalization before developing the ML model. These approaches helped generate the ML model that could significantly improve the accuracy of determining mortality risk in hypertensive ischemic or hemorrhagic stroke patients.

This study has several limitations. First, this was a retrospective study using publicly available data; therefore, prospective studies are still needed to further verify our findings. Second, the cutoff values of the most important features were found in this study, but further research is needed to segment the feature values according to the degree of risk. Third, patients with liver disease, renal failure, or respiratory failure were not included in this study; therefore, the prediction model may not be applicable to ischemic and hemorrhagic stroke patients complicated with these conditions.

5. Conclusion

This study developed an IML model for predicting 28-day in-hospital mortality in hypertensive ischemic or hemorrhagic stroke patients in the eICU-CRD and MIMIC-IV databases. The 11 most important features in the ICU, including glucose, age, SpO2, WBC, ethnicity, calcium, BUN, MCV, RDW, bicarbonate, and hyperlipidemia, were applied in the XGBoost model. The value of these features in predicting the mortality of hypertensive ischemic or hemorrhagic stroke patients was deemed worthy of clinicians' attention. The ML model developed in this study has potential in clinical practice, in that it can help personalize the prevention and strengthen therapeutic strategies.

Data availability statement

The data analyzed in this study was obtained from the Medical Information Mart for Intensive Care IV (MIMIC-IV) Database, the following licenses/restrictions apply: To access the files, users must be credentialed users, complete the required training (CITI Data or Specimens Only Research), and sign the data use agreement for the project. Requests to access these datasets should be directed to PhysioNet, https://physionet.org/, doi: 10.13026/6mm1-ek67. Publicly available datasets were analyzed in this study. This data can be found at: Electronic Intensive Care Unit (eICU) Collaborative Research database (eICU-CRD), https://eicu-crd.mit.edu/.

Ethics statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent from the patients/participants or patients/participants' legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.

Author contributions

JH and XL were responsible for conceiving the study. JD, TS, CY, MD, LF, and KW collected the data. HC, SZ, and JH were responsible for writing and revising the manuscript. SZ was responsible for designing the study and APC charge. All authors contributed to the article and approved the submitted version.

Funding

We acknowledge the funding from the Scientific Research Project of Jiangsu Health Committee (No. H2019054), the Xuzhou Science and Technology Planning Project (No. KC21220), and the Development Fund of the Affiliated Hospital of Xuzhou Medical University (No. XYFY2020013).

Acknowledgments

We are highly grateful to the participating staff. We would like to thank all the creators who contributed to the development of R packages including caret, shapviz, rcs, forestplot, glmnet, and pROC.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2023.1185447/full#supplementary-material

References

1. Organization WH. World Health Statistics 2022: Monitoring Health for the SDGs, Sustainable Development Goals. Geneva: World Health Statistics 2022. (2022).

Google Scholar

2. Collaborators GBDS. Global, regional, and national burden of stroke and its risk factors, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol. (2021) 20:795–820. doi: 10.1016/S1474-4422(21)00252-0

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Tsao CW, Aday AW, Almarzooq ZI, Alonso A, Beaton AZ, Bittencourt MS, et al. Heart disease and stroke statistics-2022 update: a report from the American Heart Association. Circulation. (2022) 145:e153–639. doi: 10.1161/CIR.0000000000001052

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Pistoia F, Sacco S, Degan D, Tiseo C, Ornello R, Carolei A. Hypertension and stroke: epidemiological aspects and clinical evaluation. High Blood Press Cardiovasc Prev. (2016) 23:9–18. doi: 10.1007/s40292-015-0115-2

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Bath PMW. Efficacy of nitric oxide, with or without continuing antihypertensive treatment, for management of high blood pressure in acute stroke (ENOS): a partial-factorial randomised controlled trial. Lancet. (2015) 385:617–28. doi: 10.1016/S0140-6736(14)61121-1

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Xing X, Yang X, Liu F, Li J, Chen J, Liu X, et al. Predicting 10-year and lifetime stroke risk in Chinese population. Stroke. (2019) 50:2371–8. doi: 10.1161/STROKEAHA.119.025553

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Wang Y, Liu J, Wang W, Wang M, Qi Y, Xie W, et al. Lifetime risk of stroke in young-aged and middle-aged Chinese population: the Chinese Multi-Provincial Cohort Study. J Hypertens. (2016) 34:2434–40. doi: 10.1097/HJH.0000000000001084

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Deo RC. Machine learning in medicine. Circulation. (2015) 132:1920–30. doi: 10.1161/CIRCULATIONAHA.115.001593

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Doupe P, Faghmous J, Basu S. Machine learning for health services researchers. Value Health. (2019) 22:808–15. doi: 10.1016/j.jval.2019.02.012

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Liu W, Tao G, Zhang Y, Xiao W, Zhang J, Liu Y, et al. A simple weaning model based on interpretable machine learning algorithm for patients with sepsis: a research of MIMIC-IV and eICU databases. Front Med. (2021) 8:814566. doi: 10.3389/fmed.2021.814566

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Ye W, Li R, Liang H, Huang Y, Xu Y, Li Y, et al. Development and validation of a clinical risk model to predict the hospital mortality in ventilated patients with acute respiratory distress syndrome: a population-based study. BMC Pulm Med. (2022) 22:268. doi: 10.1186/s12890-022-02057-0

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Zhao QY, Wang H, Luo JC, Luo MH, Liu LP, Yu SJ, et al. Development and Validation of a machine-learning model for prediction of extubation failure in intensive care units. Front Med. (2021) 8:676343. doi: 10.3389/fmed.2021.676343

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Zhao Y, Zhang R, Zhong Y, Wang J, Weng Z, Luo H, et al. Statistical analysis and machine learning prediction of disease outcomes for COVID-19 and pneumonia patients. Front Cell Infect Microbiol. (2022) 12:838749. doi: 10.3389/fcimb.2022.838749

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Wu H, Le Couteur DG, Hilmer SN. Mortality trends of stroke and dementia: changing landscapes and new challenges. J Am Geriatr Soc. (2021) 69:2888–98. doi: 10.1111/jgs.17322

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Almdal T, Scharling H, Jensen JS, Vestergaard H. The independent effect of type 2 diabetes mellitus on ischemic heart disease, stroke, and death: a population-based study of 13,000 men and women with 20 years of follow-up. Arch Intern Med. (2004) 164:1422–6. doi: 10.1001/archinte.164.13.1422

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Mankovsky BN, Ziegler D. Stroke in patients with diabetes mellitus. Diabetes Metab Res Rev. (2004) 20:268–87. doi: 10.1002/dmrr.490

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Biessels GJ, Staekenborg S, Brunner E, Brayne C, Scheltens P. Risk of dementia in diabetes mellitus: a systematic review. Lancet Neurol. (2006) 5:64–74. doi: 10.1016/S1474-4422(05)70284-2

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Wang A, Zhang J, Zuo Y, Tian X, Chen S, Wu S, et al. Prediabetes and risk of stroke and its subtypes by hypertension status. Diabetes Metab Res Rev. (2022) 38:e3521. doi: 10.1002/dmrr.3521

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Sun L, Yu F, Yi F, Xu L, Jiang B, Le L, et al. Acteoside from Ligustrum robustum (Roxb) blume ameliorates lipid metabolism and synthesis in a HepG2 cell model of lipid accumulation. Front Pharmacol. (2019) 10:602. doi: 10.3389/fphar.2019.00602

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Guzik A, Bushnell C. Stroke epidemiology and risk factor management. Continuum. (2017) 23:15–39. doi: 10.1212/CON.0000000000000416

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Jimenez-Conde J, Biffi A, Rahman R, Kanakis A, Butler C, Sonni S, et al. Hyperlipidemia and reduced white matter hyperintensity volume in patients with ischemic stroke. Stroke. (2010) 41:437–42. doi: 10.1161/STROKEAHA.109.563502

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Shigematsu K, Watanabe Y, Nakano H, Kyoto Stroke Registry C. Influences of hyperlipidemia history on stroke outcome; a retrospective cohort study based on the Kyoto Stroke Registry. BMC Neurol. (2015) 15:44. doi: 10.1186/s12883-015-0297-1

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Xing Y, An Z, Yu N, Zhao W, Ning X, Wang J. Low density lipoprotein cholesterol and the outcome of acute ischemic stroke: results of a large hospital-based study. Eur Neurol. (2016) 76:195–201. doi: 10.1159/000450604

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Arsava EM, Rahman R, Rosand J, Lu J, Smith EE, Rost NS, et al. Severity of leukoaraiosis correlates with clinical outcome after ischemic stroke. Neurology. (2009) 72:1403–10. doi: 10.1212/WNL.0b013e3181a18823

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Manwani B, Liu F, Scranton V, Hammond MD, Sansing LH, McCullough LD. Differential effects of aging and sex on stroke induced inflammation across the lifespan. Exp Neurol. (2013) 249:120–31. doi: 10.1016/j.expneurol.2013.08.011

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Roy-O'Reilly M, McCullough LD. Age and sex are critical factors in ischemic stroke pathology. Endocrinology. (2018) 159:3120–31. doi: 10.1210/en.2018-00465

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Boehme AK, Esenwa C, Elkind MS. Stroke risk factors, genetics, and prevention. Circ Res. (2017) 120:472–95. doi: 10.1161/CIRCRESAHA.116.308398

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Gretarsdottir S, Thorleifsson G, Manolescu A, Styrkarsdottir U, Helgadottir A, Gschwendtner A, et al. Risk variants for atrial fibrillation on chromosome 4q25 associate with ischemic stroke. Ann Neurol. (2008) 64:402–9. doi: 10.1002/ana.21480

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Yu S, Arima H, Heeley E, Delcourt C, Krause M, Peng B, et al. White blood cell count and clinical outcomes after intracerebral hemorrhage: the INTERACT2 trial. J Neurol Sci. (2016) 361:112–6. doi: 10.1016/j.jns.2015.12.033

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Zheng X, Zeng N, Wang A, Zhu Z, Zhong C, Xu T, et al. Prognostic value of white blood cell in acute ischemic stroke patients. Curr Neurovasc Res. (2018) 15:151–7. doi: 10.2174/1567202615666180626154857

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Dibaba DT, Xun P, Fly AD, Bidulescu A, Tsinovoi CL, Judd SE, et al. Calcium intake and serum calcium level in relation to the risk of ischemic stroke: findings from the REGARDS study. J Stroke. (2019) 21:312–23. doi: 10.5853/jos.2019.00542

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Chen Q, Zhang Y, Ding D, Li D, Yang Y, Li Q, et al. Associations between serum calcium, phosphorus and mortality among patients with coronary heart disease. Eur J Nutr. (2018) 57:2457–67. doi: 10.1007/s00394-017-1518-8

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Grandi NC, Brenner H, Hahmann H, Wusten B, Marz W, Rothenbacher D, et al. Calcium, phosphate and the risk of cardiovascular events and all-cause mortality in a population with stable coronary heart disease. Heart. (2012) 98:926–33. doi: 10.1136/heartjnl-2011-300806

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: machine learning, hypertensive, stroke, all-cause mortality, interpretable prediction model, SHAP

Citation: Huang J, Chen H, Deng J, Liu X, Shu T, Yin C, Duan M, Fu L, Wang K and Zeng S (2023) Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation. Front. Neurol. 14:1185447. doi: 10.3389/fneur.2023.1185447

Received: 21 April 2023; Accepted: 18 July 2023;
Published: 08 August 2023.

Edited by:

Jean-Claude Baron, University of Cambridge, United Kingdom

Reviewed by:

Songqiao Liu, Southeast University, China
Jing Qi, Central South University, China

Copyright © 2023 Huang, Chen, Deng, Liu, Shu, Yin, Duan, Fu, Wang and Zeng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Song Zeng, zengsong@hospital.cqmu.edu.cn

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.