Enhancing Robustness of Machine Learning Integration With Routine Laboratory Blood Tests to Predict Inpatient Mortality After Intracerebral Hemorrhage

Chen, Wei; Li, Xiangkui; Ma, Lu; Li, Dong

doi:10.3389/fneur.2021.790682

ORIGINAL RESEARCH article

Front. Neurol., 03 January 2022

Sec. Stroke

Volume 12 - 2021 | https://doi.org/10.3389/fneur.2021.790682

Enhancing Robustness of Machine Learning Integration With Routine Laboratory Blood Tests to Predict Inpatient Mortality After Intracerebral Hemorrhage

Wei Chen^1,2,3

Xiangkui Li^2,3

Lu Ma¹

Dong Li^2,3,4^*

¹Department of Neurosurgery, West China Hospital of Sichuan University, Chengdu, China
²West China Biomedical Big Data Center, West China Hospital of Sichuan University, Chengdu, China
³Med-X Center for Informatics, Sichuan University, Chengdu, China
⁴Division of Hospital Medicine, Emory School of Medicine, Atlanta, GA, United States

Objective: The accurate evaluation of outcomes at a personalized level in patients with intracerebral hemorrhage (ICH) is critical clinical implications. This study aims to evaluate how machine learning integrates with routine laboratory tests and electronic health records (EHRs) data to predict inpatient mortality after ICH.

Methods: In this machine learning-based prognostic study, we included 1,835 consecutive patients with acute ICH between October 2010 and December 2018. The model building process incorporated five pre-implant ICH score variables (clinical features) and 13 out of 59 available routine laboratory parameters. We assessed model performance according to a range of learning metrics, such as the mean area under the receiver operating characteristic curve [AUROC]. We also used the Shapley additive explanation algorithm to explain the prediction model.

Results: Machine learning models using laboratory data achieved AUROCs of 0.71–0.82 in a split-by-year development/testing scheme. The non-linear eXtreme Gradient Boosting model yielded the highest prediction accuracy. In the held-out validation set of development cohort, the predictive model using comprehensive clinical and laboratory parameters outperformed those using clinical alone in predicting in-hospital mortality (AUROC [95% bootstrap confidence interval], 0.899 [0.897–0.901] vs. 0.875 [0.872–0.877]; P <0.001), with over 81% accuracy, sensitivity, and specificity. We observed similar performance in the testing set.

Conclusions: Machine learning integrated with routine laboratory tests and EHRs could significantly promote the accuracy of inpatient ICH mortality prediction. This multidimensional composite prediction strategy might become an intelligent assistive prediction for ICH risk reclassification and offer an example for precision medicine.

Introduction

To date, spontaneous intracerebral hemorrhage (ICH), a leading cause of stroke and a life-threatening and disabling illness, remains a severe condition worldwide (1–4). Early aggressive care draws more advocates for spontaneous ICH outlined by the current practice guideline, requiring early and accurate identification of individuals who are at risk for unfavorable outcomes (5). In response to such urgent needs, robust risk estimators are highly recommended. The ICH score (6) is a classic prediction tool widely used in ICH management currently. However, this score assessment system comprises five risk factors: the Glasgow Coma Scale (GCS) score, ICH volume, intraventricular hemorrhage (IVH), the infratentorial origin of ICH, and age simple to use in clinical practice. However, such a clinical grading scale is used as a stand-alone risk assessment system and less integrated with additional dimension information, such as routine clinical laboratory profiles, for more comprehensive and precise individual risk assessment.

Integrating routine laboratory blood tests for risk predictions in patients with ICH may have several important implications. Firstly, linking laboratory biomarkers to augment traditional ICH risk prediction could benefit individualized disease management. Those laboratory indicators provide additional multidimensional information, which is hardly captured by static electronic health records (EHRs) and even images from the PACS (picture archiving and communication systems) (7). Secondly, the multiple laboratory blood tests for inpatients provided dynamic clues for the progression and pathological changes of the disease over time (8, 9). In addition, using those objectively measured and readily available data would allow cost-effective assessment and intervention of ICH without posing an additional burden in an urgent clinical scenario. However, most conventional tests are limers, making clinical replication difficult. Besides, traditional analytical methods are difficult to adapt to large quantities of measurements.

Machine learning (ML) is a promising strategy for learning complex rules and objectively synthesizing and interpreting the patterns from multidimensional datasets (10, 11). ML could incorporate an extensive array of predictors in a non-linear pattern and use multiple interactions to enhance prediction accuracy (10–13). In recent years, ML has been used for prediction and decision-making in a multitude of ICH (14–19). Unfortunately, to our knowledge, there has been no effort to use ML to take advantage of blood laboratory data to help physicians predict outcomes at a personalized level in patients with ICH who undergo assessments during routine clinical care. Moreover, the black box-like feature of these algorithms that usually limits their usefulness in clinical practice (20, 21). Clinicians are more likely to trust and use ML methods when they are explainable. In doing so, we introduced the SHapley Additive exPlanation (SHAP) algorithm to help explain the prediction model (22, 23). With the aforementioned considerations in mind, we initially introduced the SHapley Additive exPlanation (SHAP) algorithm to enhance the robustness of ML integrated with routine laboratory blood tests to predict inpatient mortality after ICH.

Materials and Methods

Study Population

The patients from the West China Hospital of Sichuan University, tertiary care, academic and non-profit hospital with 4,300 beds, had ~279,000 patients discharged in 2019 (24). Consecutive patients with acute spontaneous ICH admitted to the hospital between October 1, 2010, and December 31, 2018, were screened. The inclusion criteria include (1) ≥18 years old, (2) with the first-ever diagnosis of spontaneous ICH within 24 h that was confirmed by the head CT scan, (3) laboratory blood tests are available at admission, and (4) complete discharge diagnosis and outcomes records are available. We excluded patients with primary intraventricular hemorrhage (IVH) and secondary ICH, such as trauma, tumors, or vascular structural abnormalities (e.g., aneurysms and arteriovenous malformation). The West China Hospital of Sichuan University Biomedical Research Ethics Committee reviewed and waived informed consent (No. 20-1209) due to a retrospective data analysis characterized by desensitized data. This study adheres to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) reporting guideline for diagnostic and prognostic studies.

Data Source

Clinical data were derived from the EHRs system. Demographic information and clinical characteristics were retrieved, including age, sex, time from the symptom onset to admission (hours), length of stay (days), admission GCS score, lifestyle risk factors (smoking and alcohol use), recorded comorbidities (hypertension and diabetes mellitus), and neurosurgical operation (hematoma evacuation). Hypertension and diabetes mellitus was defined as diagnosed or documented—all blood laboratory biomarkers obtained from standard laboratory tests in routine clinical practice at the time of admission. Of note, only measures that were routinely available for most patients were included. As a result, a total of 59 laboratory blood parameters were included in the present analysis (see details in the Supplementary Table 1). Neuroimaging data on hematoma volume [mL; calculated using the formula ABC/2 (25)], hematoma locations (infratentorial origin or not), and the presence of an IVH were also collected manually. The primary outcome of interest was all-cause in-hospital mortality retrieved from paramedic EHRs. The outcome label of in-hospital mortality was defined as a discharge disposition of “expired.”

If data elements were not structured, they were manually processed by two experienced investigators (X.L. and a non-author) who were blinded to the study's aims. Missing values for each variable were assigned a default value of “NA” (not available), which served as an indicator for missingness. This allowed the algorithm to include observations with missing features and to gain signals from missingness itself. Prior studies have also shown that the eXtreme Gradient Boosting (XGBoost) model can gain signals from missingness (or non-missingness) without resorting to imputation techniques (26).

ML Algorithms and Model Selection

The dataset was separated in a split-by-year training/test scheme. We trained a ML model on data from patients who were admitted to the hospital from October 1, 2010 to December 31, 2016. We randomly selected 90% of patients for model training in the training dataset and a held-out 10% for validation. We used the annotated 59 laboratory biomarkers to train and evaluate several supervised learning algorithms and compare their performances. We experimented with four classifiers: logistic regression, classification and regression trees, random forest, and the XGBoost model. Initial tests demonstrated the superior performance of the XGBoost model compared with the other three models, so we selected this model using the Python programming language (Python Software Foundation) with the XGBoost package as our preferred model, with the 13 most influential circulating parameters for the exciting outcome chosen using five cross-validations. We created a base model using the 5 ICH risk score variables. We then provided additional performance characteristics for the selected approach to focus on the differences between ML with only clinical data (ML-clinical) and ML with clinical and laboratory data (ML-combined). In the test dataset, the model was further validated in an additional cohort spanning from January 1, 2017 to December 31, 2018. The SHAP value was used to illustrate the positive or negative effects of the 18 features attributed to the XGBoost model. We also used the SHAP dependence plot to explain how a single feature of these laboratory biomarkers affects the output of the XGBoost prediction model.

Statistical Analysis

Continuous variables are presented as mean ± standard deviation (SD) or medians with interquartile ranges (IQR). Categorical variables are presented as numbers with percentages. As appropriate, comparisons of intergroup differences were analyzed using the Student t-test, Mann-Whitney U-test, or Chi-square test. The performances of the ML models were evaluated by receiver operating characteristic (ROC) curve, Kaplan–Meier curve, confusion matrix metrics, and evaluation metrics, including area under the ROC curve (AUROC), precision, sensitivity, specificity, and accuracy. The formulas for computing the metrics were described elsewhere. We calculated 95% confidence intervals (CIs) for comparisons of AUROC using 1,000 bootstrap replications. Probabilities of more than 2/3 quantile were assigned to high-risk and otherwise to low risk. All statistical analyses were performed using R, version 3.3 (R Foundation for Statistical Computing) and Python, version 2.7 (Python Software Foundation) software. A 2- tailed P <0.05 was considered statistical significance.

Results

Study Cohort and Population

Figure 1 illustrates the study design and study flow chart. We included a total of 1,835 patients with ICH in the present analysis; 1,405 of these patients were included in the development cohort (training and held-out validation set), and 430 were used as the testing cohort. The mean age was 59 ± 15 years, and 67.9% were men. The overall percentages of all-cause in-hospital mortality for the full population, development, and testing cohorts were 20.3, 19.4, and 23.0%, respectively.

FIGURE 1

Figure 1. Study design and study flowchart. CART, classification and regression trees; ICH, intracerebral hemorrhage; IVH, intraventricular hemorrhage; LR, logistic regression; RF, random forest; XGBoost, eXtreme Gradient Boosting.

Table 1 summarizes the baseline characteristics of the study population. Patient's characteristics were similar between the development and testing cohort except for GCS score (82 of 430 patients with GCS score 3–4 in the development cohort [19.1%] vs. 188 of 1,405 patients in the testing cohort [13.4%]; P = 0.013), the lifestyle risk factor of current smoking (P = 0.043), and undergoing hematoma evacuation therapy (P = 0.045) (Table 1). Patients who died were older, had a shorter time from the symptom onset to admission, and more temporary hospital stay, to have a comorbidity of diabetes mellitus. In addition, they were more likely to have lower GCS scores and larger ICH volume, had hematoma located in infratentorial, accompanied by a higher presence of IVH. At the same time, they were less likely to have undergone hematoma evacuation therapy (all P < 0.05), as presented in Table 1.

TABLE 1

Table 1. Demographic and baseline characteristics of study patients.

Model Performance and Comparisons

Figure 2A demonstrates the model diagnostic performance in terms of AUROC in the validation and testing sets. Overall, ML models using laboratory data achieved AUROCs of 0.71–0.82 in a split-by-year development/testing scheme, with the non-linear XGBoost model yielded the highest prediction accuracy. In the held-out validation set, the predictive model using comprehensive clinical and laboratory parameters outperformed those using clinical alone in predicting in-hospital mortality (AUROC [95% bootstrap CI], 0.899 [0.897–0.901] vs. 0.875, [0.872–0.877]; P < 0.001). Similar performance was observed in the testing set. Table 2 presents the classification model performance when applied to the validation set and testing set. Interestingly, the performance in the held-out validation cohort was worse than the testing cohort in terms of precision (77.5 vs. 81.3%), sensibility (77.7 vs. 81.2%), specificity (77.7 vs. 81.2%), and accuracy (85.8 vs. 86.7%). Additional statistical measures of the performance of models are reported in the Supplementary Table 2.

FIGURE 2

Figure 2. Predictive performance of machine learning-based models. (A) Areas under the curve (AUCs) to assess the performance of in-hospital mortality risk prediction of models in the development cohort (held-out for validation) and testing cohort using clinical features only or combined with selected laboratory variables. The AUC for ML combined with clinical features and laboratory variables was significantly higher than that used clinical features only. (B) Kaplan–Meier curves indicate the overall survival of patients with high and low mortality risk in the validation cohort. The tick marks refer to censored patients. The dark green or orange line indicates the survival probability, and the light green or orange areas represent the 95% confidence interval of survival probability (log-rank P < 0.001).

TABLE 2

Table 2. Statistical measures of the performance of models^*.

We further investigated precise individual-level mortality risk. With the time from admission to death or discharge as the endpoint, Kaplan-Meier analysis further confirmed that the ML model could robustly stratify patients by mortality risk. High-risk patients labeled by our model were significantly less likely to survive than low-risk patients in the testing cohort with a hazard ratio of 11.79 (95% CI: 7.64–18.20), highlighting the capability of the model to accurately predict the prognosis of ICH patients, as shown in Figure 2B.

Most Influential Predictors

Figure 3 presents the statistical analysis of the essential predictors selected by the best classifier (non-linear XGBoost model) from 59 laboratory blood parameters in the training set. A total of 13 features were eventually chosen for modeling, with each variable included in the model has been varying importance over in-hospital mortality. Spearman's correlation coefficient analysis using raw data of the 13 features exhibited various degrees of correlation (Figure 3A). Among the top 3 of those selected variables, blood glucose was selected as the top-most influential predictor in the model, followed by creatinine, and white blood cell count, respectively (Figure 3B). In the model that includes both laboratory blood parameters and clinical variables, the relative importance of those included parameters on the model's predictions was observed to have altered mildly or moderately. Standard box plots presented the included parameters' distributions between discharged and deceased patients (Figure 3C). According to datasets and outcomes, additional distributions of these laboratory values can be found in Supplementary Tables 3, 4.

FIGURE 3

Figure 3. Statistical analysis of features included in models. (A) Heatmap represents the correlation between all laboratory features included in the model using Spearman's correlation coefficient. The colors in the plot represent the correlation coefficients. The numbers in the triangle represent the value of the correlation coefficient. (B) Scaled importance rank of all laboratory features included in the models. The size of bars represents the value of relative importance. The bars with orange represent the feature included in the model that did not contain clinical features, while the gray bar represents the feature included in the model that did not contain clinical features. (C) Box and jitter plots show the distribution of all laboratory features included in the present study between deceased patients (n = 373) and discharged patients (n = 1,466). The centerline represents the median of the feature. Box limits represent upper and lower quartiles. Whiskers represent 1.5 times the interquartile range. Color points represent outliers. Mann-Whitney U-test was used in the univariate comparison between groups, and a two-tailed P < 0.05 was considered statistically significant. AA, the ratio of alanine aminotransferase to AST; AST, aspartate aminotransferase; HR, hazard ratio; CI, confidence interval; CREA, creatinine; GLU, blood glucose; LDH, low-density lipoprotein; LYMPH, percentage of lymphocytes; PT, prothrombin time; RBCDW, red blood cell distribution width (CV); UA, uric acid; WBC, white blood cell count.

Model Explanation

Figure 4 depicts the SHAP algorithm for the interpretations of the ML model. We illustrated two representative cases from the holdout validation set (Figure 4A). In this example, the patient's GCS scores 3–4 (=2 points) is the most crucial variable to increase the risk of in-hospital mortality in case 1 who died, while GCS scores 13–15 (=0) is the most critical risk-decreasing variable in case 2 who discharged. Additionally, the contribution of each of the 13 features in the model was visualized by applying the SHAP summary plot (Figure 4B). This allowed understanding how a single variable influences the output of the XGBoost prediction model. According to the prediction model, the higher the SHAP value of a feature, the more likely death becomes. Furthermore, we used the SHAP algorithm to rank the importance of all the included variables for the prediction model (Figure 4C). The most crucial variable has the highest mean of absolute SHAP values.

FIGURE 4

Figure 4. The Shapley additive explanations algorithm for the machine learning model. (A) Illustration of the impact of features for two patient-specific predictions using the Shapley additive explanations (SHAP) algorithm. Case 1 was alive, while case 2 was deceased during the hospitalization. The factors positively (red) and negatively (blue) impacting the construction of the model, with the size of the bars depicting importance. For example, Glasgow Coma Scale score 3–4 (=2 points) is an essential variable increase the risk of in-hospital mortality in case 2. (B) The attributes of the features in the black-box model. Each line represents a feature, and the abscissa is the SHAP value, which represents the degree of influence on the outcome. Each dot represents a sample. The redder the color, the greater the value of the feature, and the bluer the color, the lower the value. (C) Ranking of feature importance indicated by SHAP. The most critical variable has the highest mean of absolute SHAP values. AA, the ratio of alanine aminotransferase to AST; AST, aspartate aminotransferase; CREA, creatinine; GLU, blood glucose; LDH, low-density lipoprotein; LYMPH, percentage of lymphocytes; PT, prothrombin time; RBCDW, red blood cell distribution width (CV); UA, uric acid; WBC, white blood cell count.

Discussion

To our knowledge, this study represents the first large-scale report leveraging explainable ML algorithms to generate accurate outcome predictions for the ICH population by combining clinical and routine laboratory blood test data. The non-linear XGBoost model could automatically derive critical variables from comprehensive laboratory results such as glucose, creatinine, and white blood cell count. This pipeline analysis model is suitable for integrating clinical data with readily available laboratory profile results for dynamic disease progression prediction. Our results reveal that an ML-based algorithm offers a great potential to enhance accuracy in predicting personalized in-hospital all-cause mortality in patients with ICH. These findings add insights into ML algorithms designed to improve optimal evaluation and clinical decisions in digital health care.

Currently, the ability to optimally assess risk in individual patients remains a significant challenge in ICH. Such a challenge is mainly because the evaluation of such emergent conditions is often time pressing. It is crucial but difficult for clinicians to make rapid and accurate clinical decisions in a short period when facing a broad array of available information. Traditional clinical practice has dealt with this situation, in part, by using various risk prediction scoring systems, some of which are summarized by Gregório et al. (27). However, these scoring systems are limited by the amount of clinically available measurements, particularly laboratory measures. The presented ML strategy provides insight into integrating algorithm (software) with routine laboratory tests (facilities and other resources) and interpretation beyond what is provided by conventional statistics. More importantly, our study introduced a novel unsupervised learning algorithm strategy to derive labs without a priori assumptions about the influence or weighting of individual factors or how they may interact. This approach allowed us to filter through the massive laboratory parameters to identify potential risk factors for death after ICH.

As a result, certain variables linked to patient survival were ultimately included in the prediction model. For example, in our model, blood glucose derived from labs is the most important feature for predicting ICH mortality during hospitalization, as considerable previous studies have supported this finding (28–30). Some of which, including creatinine (31), white blood cell count (32), prothrombin time (33), chlorine (34), potassium (35), percentage of lymphocytes (36), red blood cell distribution width (37), uric acid (38), and phosphorus (39) also correspond with previously investigated risk factors from previous clinical studies. These features were presented in nearly two-thirds of the overall selecting variables (8 out of 13), suggesting our model is reliable. Notably, our model identified several biomarkers that other ICH studies have not reported, such as the aspartate aminotransferase and the ratio of alanine aminotransferase to aspartate aminotransferase.

When integrating those selecting features, the performance of the ML-combined score was superior to the clinical risk metric that is traditionally used to study prognostic outcomes after ICH. Several studies have been published using ML to predict outcomes (primarily survival and function) in patients with ICH (15–19). The reported performances (AUROC) vary from 0.63 to 0.92, mostly around 0.75–0.85. Therefore, our model performance is consistent with these studies but with enhancing clinical utility and generalizability. We developed a pipeline automatic screening model that is especially suitable for large-scale clinical data analysis with comprehensive laboratory parameters in urgent situations to determine disease progression and conditions. In addition, our model was able to provide a relatively high AUROC and predicted a probability of death for each patient. At the point of care, the estimated probabilities can be used to construct risk strata to quickly separate low-risk patients from those with a high risk of mortality. This technique could be precious for determining who is most likely to benefit when limited resources and assessment time is pressing. Those results further highlight ML technology can assist physicians with digesting a large amount of information and be critical to fully utilizing these growing datasets to help transform and optimize medical practice.

Several limitations of this study need to point out. Firstly, this study relied on retrospective data, and deriving clinical information from EHRs is an inherent limitation (40). For example, some critical variables related to the mortality or functional outcomes of ICH, such as baseline NIHSS and hematoma extension, were not included in the model. Secondly, the laboratory measurements at the time of admission were used in this study. Dynamic laboratory parameters may provide more information to the ML model. Bedsides, to assess the added predictive value of laboratory information concisely, straightforward, and comparably, we only input the variables from classic ICH risk score variables rather than selecting all clinical variables using the ML model. Third, as with any other ML-based study, the model is subject to the constraints of the specific population that it trains on (41). Further validation in a multicenter setting with different patient populations is warranted because our model was performed in a single-center study and not externally validated.

Conclusion

In conclusion, our study initially provided translational data to support that ML integrated with laboratory blood tests can effectively predict inpatient mortality after intracerebral hemorrhage. The above strategy might become a novel, intelligent clinical companion diagnostic tool for risk assessment of ICH, integrating with the electronic health record and laboratory blood test results.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by West China Hospital of Sichuan University Biomedical Research Ethics Committee. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author Contributions

The study design was conceived by WC and DL. All authors drafted the article or revised it critically for important intellectual content, approved the final draft to be published, and agree to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.

Funding

This study was funded by the Sichuan Science and Technology Program (Grant No. 2021YFS0203), the 1·3·5 project for disciplines of excellence–Clinical Research Incubation Project and Post-Doctor Research Project, West China Hospital, Sichuan University (Grant Nos. ZYJC18010 and 2020HXBH156), the Chengdu Science and Technology Bureau (Grant Nos. 2019-YF09-00211-SN and 2021-YF05-01006-SN) and the National key R&D Program of China (Grant Nos. 2018YFA 0108604 and 2018YFA0108603). The funding agency was not involved in the study.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2021.790682/full#supplementary-material

References

1. Qureshi AI, Mendelow AD, Hanley DF. Intracerebral haemorrhage. Lancet. (2009) 373:1632–44. doi: 10.1016/S0140-6736(09)60371-8

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Krishnamurthi RV, Feigin VL, Forouzanfar MH, Mensah GA, Connor M, Bennett DA, et al. Global and regional burden of first-ever ischaemic and haemorrhagic stroke during 1990-2010: findings from the Global Burden of Disease Study 2010. Lancet Glob Health. (2013) 1:e259–81. doi: 10.1016/S2214-109X(13)70089-5

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Chen Y, Wright N, Guo Y, Turnbull I, Kartsonaki C, Yang L, et al. Mortality and recurrent vascular events after first incident stroke: a 9-year community-based study of 0·5 million Chinese adults. Lancet Glob Health. (2020) 8:e580–90. doi: 10.1016/S2214-109X(20)30069-3

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Zhao W, Wu C, Stone C, Ding Y, Ji X. Treatment of intracerebral hemorrhage: current approaches and future directions. J Neurol Sci. (2020) 416:117020. doi: 10.1016/j.jns.2020.117020

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Hemphill JC III, Greenberg SM, Anderson CS, Becker K, Bendok BR, Cushman M, et al. Guidelines for the management of spontaneous intracerebral hemorrhage: a guideline for healthcare professionals from the American Heart Association/American Stroke Association. Stroke. (2015) 46:2032–60. doi: 10.1161/STR.0000000000000069

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Hemphill JC 3rd, Bonovich DC, Besmertis L, Manley GT, Johnston SC. The ICH score: a simple, reliable grading scale for intracerebral hemorrhage. Stroke. (2001) 32:891–7. doi: 10.1161/01.STR.32.4.891

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Wang TJ. Assessing the role of circulating, genetic, and imaging biomarkers in cardiovascular risk prediction. Circulation. (2011) 123:551–65. doi: 10.1161/CIRCULATIONAHA.109.912568

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Xue M, Yong VW. Neuroinflammation in intracerebral haemorrhage: immunotherapies with potential for translation. Lancet Neurol. (2020) 19:1023–32. doi: 10.1016/S1474-4422(20)30364-1

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Zhou Y, Wang Y, Wang J, Anne Stetler R, Yang QW. Inflammation in intracerebral hemorrhage: from mechanisms to clinical translation. Prog Neurobiol. (2014) 115:25–44. doi: 10.1016/j.pneurobio.2013.11.003

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Obermeyer Z, Emanuel EJ. Predicting the future - big data, machine learning, and clinical medicine. N Engl J Med. (2016) 375:1216–9. doi: 10.1056/NEJMp1606181

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Deo RC. Machine learning in medicine: will this time be different? Circulation. (2020) 142:1521–3. doi: 10.1161/CIRCULATIONAHA.120.050583

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Beam AL, Kohane IS. Big data and machine learning in health care. JAMA. (2018) 319:1317–8. doi: 10.1001/jama.2017.18391

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Schwalbe N, Wahl B. Artificial intelligence and the future of global health. Lancet. (2020) 395:1579–86. doi: 10.1016/S0140-6736(20)30226-9

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Panesar SS, Kliot M, Parrish R, Fernandez-Miranda J, Cagle Y, Britz GW. Promises and perils of artificial intelligence in neurosurgery. Neurosurgery. (2020) 87:33–44. doi: 10.1093/neuros/nyz471

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Xu X, Zhang J, Yang K, Wang Q, Chen X, Xu B. Prognostic prediction of hypertensive intracerebral hemorrhage using CT radiomics and machine learning. Brain Behav. (2021) 11:e02085. doi: 10.1002/brb3.2085

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Hall AN, Weaver B, Liotta E, Maas MB, Faigle R, Mroczek DK, et al. Identifying modifiable predictors of patient outcomes after intracerebral hemorrhage with machine learning. Neurocrit Care. (2021) 34:73–84. doi: 10.1007/s12028-020-00982-8

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Zhu F, Pan Z, Tang Y, Fu P, Cheng S, Hou W, et al. Machine learning models predict coagulopathy in spontaneous intracerebral hemorrhage patients in ER. CNS Neurosci Ther. (2021) 27:92–100. doi: 10.1111/cns.13509

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Wang HL, Hsu WY, Lee MH, Weng HH, Chang SW, Yang JT, et al. Automatic machine-learning-based outcome prediction in patients with primary intracerebral hemorrhage. Front Neurol. (2019) 10:910. doi: 10.3389/fneur.2019.00910

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Gupta VP, Garton ALA, Sisti JA, Christophe BR, Lord AS, Lewis AK, et al. Prognosticating functional outcome after intracerebral hemorrhage: the ICHOP score. World Neurosurg. (2017) 101:577–83. doi: 10.1016/j.wneu.2017.02.082

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Watson DS, Krutzinna J, Bruce IN, Griffiths CE, McInnes IB, Barnes MR, et al. Clinical applications of machine learning algorithms: beyond the black box. BMJ. (2019) 364:l886. doi: 10.1136/bmj.l886

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Thorsen-Meyer HC, Nielsen AB, Nielsen AP, Kaas-Hansen BS, Toft P, Schierbeck J, et al. Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records. Lancet Digit Health. (2020) 2:e179–91. doi: 10.1016/S2589-7500(20)30018-2

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Lundberg S, Lee SI. A unified approach to interpreting model predictions. Adv Neur In. (2017) 1:4765–74.

23. Lundberg SM, Erion GG, Lee SI. Consistent individualized feature attribution for tree ensembles. arXiv[Preprint].arXiv:180203888.(2018). https://arxiv.org/abs/1802.03888#:~:text=Consistent%20Individualized%20Feature%20Attribution%20for%20Tree%20Ensembles%20Scott,often%20heuristic%20and%20not%20individualized%20for%20each%20prediction

Google Scholar

24. West China Hospital of Sichuan University (2019). Available online at: http://www.wchscu.cn/details/50464.html (accessed August 5, 2021).

25. Kothari RU, Brott T, Broderick JP, Barsan WG, Sauerbeck LR, Zuccarello M, et al. The ABCs of measuring intracerebral hemorrhage volumes. Stroke. (1996) 27:1304–5. doi: 10.1161/01.STR.27.8.1304

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Chen T, Guestrin C, Assoc Comp M. XGBoost: A Scalable Tree Boosting System. New York: Assoc Computing Machinery (2016). p. 785–94.

PubMed Abstract | Google Scholar

27. Gregório T, Pipa S, Cavaleiro P, Atanásio G, Albuquerque I, Chaves PC, et al. Assessment and comparison of the four most extensively validated prognostic scales for intracerebral hemorrhage: systematic review with meta-analysis. Neurocrit Care. (2019) 30:449–66. doi: 10.1007/s12028-018-0633-6

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Fogelholm R, Murros K, Rissanen A, Avikainen S. Admission blood glucose and short term survival in primary intracerebral haemorrhage: a population based study. J Neurol Neurosurg Psychiatry. (2005) 76:349–53. doi: 10.1136/jnnp.2003.034819

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Passero S, Ciacci G, Ulivelli M. The influence of diabetes and hyperglycemia on clinical course after intracerebral hemorrhage. Neurology. (2003) 61:1351–6. doi: 10.1212/01.WNL.0000094326.30791.2D

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Saxena A, Anderson CS, Wang X, Sato S, Arima H, Chan E, et al. Prognostic significance of hyperglycemia in acute intracerebral hemorrhage: the INTERACT2 study. Stroke. (2016) 47:682–8. doi: 10.1161/STROKEAHA.115.011627

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Qureshi AI, Huang W, Lobanova I, Hanley DF, Hsu CY, Malhotra K, et al. Systolic blood pressure reduction and acute kidney injury in intracerebral hemorrhage. Stroke. (2020) 51:3030–8. doi: 10.1161/STROKEAHA.120.030272

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Suzuki S, Kelley RE, Dandapani BK, Reyes-Iglesias Y, Dietrich WD, Duncan RC. Acute leukocyte and temperature response in hypertensive intracerebral hemorrhage. Stroke. (1995) 26:1020–3. doi: 10.1161/01.STR.26.6.1020

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Zhang Q, Li X, Wei Z, Ye X, Zhu L, Xie M, et al. Risk factors and clinical characteristics of non-promyelocytic acute myeloid leukemia of intracerebral hemorrhage: a single center study in China. J Clin Neurosci. (2017) 44:203–6. doi: 10.1016/j.jocn.2017.06.058

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Williamson MR, Dietrich K, Hackett MJ, Caine S, Nadeau CA, Aziz JR, et al. Rehabilitation augments hematoma clearance and attenuates oxidative injury and ion dyshomeostasis after brain hemorrhage. Stroke. (2017) 48:195–203. doi: 10.1161/STROKEAHA.116.015404

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Wu XY, Zhuang YK, Cai Y, Dong XQ, Wang KY, Du Q, et al. Serum glucose and potassium ratio as a predictive factor for prognosis of acute intracerebral hemorrhage. J Int Med Res. (2021) 49:3000605211009689. doi: 10.1177/03000605211009689

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Zhang F, Ren Y, Shi Y, Fu W, Tao C, Li X, et al. Predictive ability of admission neutrophil to lymphocyte ratio on short-term outcome in patients with spontaneous cerebellar hemorrhage. Medicine. (2019) 98:e16120. doi: 10.1097/MD.0000000000016120

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Lorente L, Martín MM, González-Rivero AF, Pérez-Cejas A, Sabatel R, Ramos L, et al. Red blood cell distribution width and mortality of spontaneous intracerebral hemorrhage patients. Clin Neurol Neurosurg. (2020) 195:106066. doi: 10.1016/j.clineuro.2020.106066

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Huang H, Huang G, Gu J, Chen K, Huang Y, Xu H. Relationship of serum uric acid to hematoma volume and prognosis in patients with acute supratentorial intracerebral hemorrhage. World Neurosurg. (2020) 143:e604–12. doi: 10.1016/j.wneu.2020.08.018

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Loggini A, El Ammar F, Mansour A, Kramer CL, Goldenberg FD, Lazaridis C. Association between electrolyte levels at presentation and hematoma expansion and outcome in spontaneous intracerebral hemorrhage: a systematic review. J Crit Care. (2021) 61:177–85. doi: 10.1016/j.jcrc.2020.10.029

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Gianfrancesco MA, Tamang S, Yazdany J, Schmajuk G. Potential biases in machine learning algorithms using electronic health record data. JAMA Internal Med. (2018) 178:1544–7. doi: 10.1001/jamainternmed.2018.3763

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Futoma J, Simons M, Panch T, Doshi-Velez F, Celi LA. The myth of generalisability in clinical research and machine learning in health care. Lancet Digit Health. (2020) 2:e489–92. doi: 10.1016/S2589-7500(20)30186-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: machine learning, intracerebral hemorrhage, laboratory profiles, prediction, prognostication

Citation: Chen W, Li X, Ma L and Li D (2022) Enhancing Robustness of Machine Learning Integration With Routine Laboratory Blood Tests to Predict Inpatient Mortality After Intracerebral Hemorrhage. Front. Neurol. 12:790682. doi: 10.3389/fneur.2021.790682

Received: 07 October 2021; Accepted: 07 December 2021;
Published: 03 January 2022.

Edited by:

David S. Liebeskind, University of California, Los Angeles, United States

Reviewed by:

Craig S. Anderson, University of New South Wales, Australia
Xunming Ji, Capital Medical University, China

Copyright © 2022 Chen, Li, Ma and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Dong Li, ZHJfZGxpMkAxMjYuY29t; ZGxpMkBlbW9yeS5lZHU=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.