Machine learning-based in-hospital mortality risk prediction tool for intensive care unit patients with heart failure

Chen, Zijun; Li, Tingming; Guo, Sheng; Zeng, Deli; Wang, Kai

doi:10.3389/fcvm.2023.1119699

ORIGINAL RESEARCH article

Front. Cardiovasc. Med., 03 April 2023

Sec. Intensive Care Cardiovascular Medicine

Volume 10 - 2023 | https://doi.org/10.3389/fcvm.2023.1119699

This article is part of the Research TopicModern Approaches to Risk Stratification in Critical Care CardiologyView all 8 articles

Machine learning-based in-hospital mortality risk prediction tool for intensive care unit patients with heart failure

Zijun Chen^1,2,†

Tingming Li^1,†

Sheng Guo³

Deli Zeng¹

Kai Wang^1*

¹Department of Cardiology, The Yongchuan Hospital of Chongqing Medical University, Chongqing, China
²Department of Cardiology, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
³Department of Cardiology, The People’s Hospital of Rongchang District, Chongqing, China

Objective: Risk stratification of patients with congestive heart failure (HF) is vital in clinical practice. The aim of this study was to construct a machine learning model to predict the in-hospital all-cause mortality for intensive care unit (ICU) patients with HF.

Methods: eXtreme Gradient Boosting algorithm (XGBoost) was used to construct a new prediction model (XGBoost model) from the Medical Information Mart for Intensive Care IV database (MIMIC-IV) (training set). The eICU Collaborative Research Database dataset (eICU-CRD) was used for the external validation (test set). The XGBoost model performance was compared with a logistic regression model and an existing model (Get with the guideline-Heart Failure model) for mortality in the test set. Area under the receiver operating characteristic cure and Brier score were employed to evaluate the discrimination and the calibration of the three models. The SHapley Additive exPlanations (SHAP) value was applied to explain XGBoost model and calculate the importance of its features.

Results: The total of 11,156 and 9,837 patients with congestive HF from the training set and test set, respectively, were included in the study. In-hospital all-cause mortality occurred in 13.3% (1,484/11,156) and 13.4% (1,319/9,837) of patients, respectively. In the training set, of 17 features with the highest predictive value were selected into the models with LASSO regression. Acute Physiology Score III (APS III), age and Sequential Organ Failure Assessment (SOFA) were strongest predictors in SHAP. In the external validation, the XGBoost model performance was superior to that of conventional risk predictive methods, with an area under the curve of 0.771 (95% confidence interval, 0.757–0.784) and a Brier score of 0.100. In the evaluation of clinical effectiveness, the machine learning model brought a positive net benefit in the threshold probability of 0%–90%, prompting evident competitiveness compare to the other two models. This model has been translated into an online calculator which is accessible freely to the public (https://nkuwangkai-app-for-mortality-prediction-app-a8mhkf.streamlit.app).

Conclusion: This study developed a valuable machine learning risk stratification tool to accurately assess and stratify the risk of in-hospital all-cause mortality in ICU patients with congestive HF. This model was translated into a web-based calculator which access freely.

Introduction

Congestive heart failure (HF) is a complex clinical syndrome caused by structural and/or functional disorders. These disorders lead to high mortality rate in HF patients. Despite significant advances in its treatment in recent years, HF mortality has not yet reached a decreasing inflection point (1). In the United States, approximately 10% of hospitalized patients with HF require admission to the intensive care unit (ICU) for a higher level of intensive care (2, 3), which imposes a heavy economic and social burden. The risk stratification of the ICU patients has become an important strategy in management of HF. Studies have also demonstrated that accurate event risk assessment and active intervention can help improve cardiac disease outcomes (4). Therefore, effective stratification tools are needed to achieve this goal. The Get with the guideline-Heart Failure (GWTG-HF model) risk score uses common clinical features (race and systolic blood pressure) to predict in-patient mortality. However, it was developed more than a decade ago and has unsatisfactory applicability in current clinical practice (5, 6). Additionally, with the increasing availability of clinical data, traditional prediction models based on logistic regression analysis may not be able to capture nonlinear relationships from high-dimensional data.

Machine learning algorithms, which provide researchers with powerful tools, have been applied in several medical fields, ranging from disease diagnosis, outcome prediction, and efficacy prediction to medical image interpretation (7–9). It has also been used to develop new HF risk prediction models. These models show a better prediction performance (7–9). However, most of these models were verified only in the same cohort, and independent external verification was seldom performed. Moreover, the degree of model calibration has not been reported (8, 10). Based on the above, in this study, a machine learning algorithm was introduced to construct an in-patient mortality risk prediction model for ICU HF patients, and an independently validation was performed in the test dataset.

Methods

Study population

The study population was enrolled from two different databases: The Medical Information Mart for Intensive Care IV database (MIMIC-IV) and the eICU Collaborative Research Database dataset (eICU-CRD). MIMIC-IV (version 2.2) is a single-center database that contains data from over 190,000 ICU patients between 2008 and 2019, including demographic records, hourly vital signs from bedside monitors, laboratory tests, international classification of diseases (ICD-9 and ICD-10) code diagnostics, and other clinical features (11). The eICU-CRD is a multi-center intensive care database that contains data of 335 ICU patients in the United States from 2014 to 2015 (12). Inclusion criteria: All patients with congestive heart failure. The data of patients with congestive heart failure were obtained from the two databases by ICD-9 and ICD-10 code (Supplementary Material). Exclusion criteria: (a) patients less than 18 years of age; (b) patients not admitted to ICU for the first time; (c) patients with ICU stay less than 24 h. Ultimately, as shown in the flow chart (Figure 1), 11,156 and 9,837 patients were enrolled in the study from the MIMIC-IV and eICU-CRD, respectively. This study was a secondary analysis based on the data described above, and all patient’ information was anonymized; therefore, the requirement of informed patient consent was waived.

FIGURE 1

Figure 1. Study flow chart. According to the inclusion and exclusion criteria of the study, the population of the training set and the test set were obtained.

Data acquisition and outcome definition

According to previous researches (13, 14), this study documented demographic characteristics (age, sex, weight and height), severity score [Sequential Organ Failure Assessment (SOFA) and Acute Physiology Score III (APS III)], comorbidities [diabetes, hypertension, severe liver disease, chronic obstructive pulmonary disease (COPD), and myocardial infarction], medication information [dopamine, norepinephrine, dobutamine, epinephrine, phenylephrine, vasopressin, milrinone, furosemide, beta blocker and angiotensin converting enzyme I (AECI)/Angiotensin II receptor blocker (ARB)], mechanical ventilation, continuous renal replacement therapy (CRRT), vital signs at admission (heart rate, respiratory rate, SpO₂, systolic blood pressure and mean blood pressure), and laboratory tests [white blood cell count (WBC), basophils, eosinophils, lymphocytes, monocytes, neutrophils, red blood cells count (RBC), hematocrit, hemoglobin, mean corpuscular volume (MCV), mean corpuscular hemoglobin content (MCHC), mean hemoglobin concentration (MHC), red blood cell distribution width (RDW), platelets, albumin, aspartate aminotransferase (AST) alanine aminotransferase (ALT), alkaline phosphatase (ALP), total bilirubin, urea nitrogen (BUN), creatinine, glucose, sodium, calcium, chloride, potassium, anion gap, bicarbonate, international normalized ratio (INR), thromboplastin time (PTT), prothrombin time (PT)] at admission. The same features in eICU-CRD were extracted to achieve external validation of those models. The primary endpoint was defined as all-caused in-hospital mortality based on survival at discharge. Data extraction process was performed using the PostgreSQL programming language.

Model construction and validation

The MIMIC-IV and eICU-CRD datasets were used as training set and test set, respectively. In the training set, the Least Absolute Shrinkage and Selection Operator (LASSO) regression was used to select the features from items (15). The eXtreme Gradient Boosting (XGBoost) is an optimal implementation of gradient boosting which is based on the ensemble of weak learners with high bias and low variance (15). Taking the primary endpoint and the screened features as the prediction outcome and the prediction features, respectively, XGBoost was applied to obtain the optimal model super parameters (such as the number of trees, the maximum depth of the trees, the learning rate, and the like) through 10-fold cross-validation and Bayesian optimization. This machine learning model designated as XGBoost model. Logistic regression was applied to obtain a generalized linear regression model (Logistic model). The GWTG-HF model was obtained according to the previous study (6).

Machine learning models are considered black boxes due to difficulty in explaining how an algorithm provides accurate predictions for a specific population. Therefore, we introduced the SHapley Additive exPlanations (SHAP) value to explain the XGBoost model and calculate the importance of its features. SHAP is a unified framework for explaining machine learning algorithm prediction proposed by Lundberg and Lee as a new method for explaining various black box models, with verified interpretable properties (16).

Subsequently, the XGBoost model, Logistic model and GWTG-HF model were independently validated and compared with each other in the test set. Model discrimination was assessed using the area under the receiver operating characteristic curve (AUC). The calibration evaluation of the model was assessed using the calibration map and the Brier score. Clinical effectiveness evaluation of models was assessed by using decision curve analysis.

Statistical analysis

Continuous variables are expressed as mean and standard deviation, and the t-test was used for comparisons between the groups. Categorical variables are expressed as absolute values and proportions, and the chi-squared test was used for comparison between the groups. In this study, it was assumed that the missing clinical features occurred randomly, and multiple interpolations were performed using the random forest method in the multivariate imputation by chained equation (MICE) function of the MICE package, assuming missing clinical features. P value <0.05 was considered as statistically significant. All statistical analyses were performed using R 4.1.3 software (The R Foundation for Statistical Computing, Vienna, Austria).

Results

Population baseline characteristics

A total of 11,156 and 9,837 HF patients from the MIMIC-IV (training set) and eICU-CRD (test set), were included in the study respectively (Table 1). Among them, the median age was 73 and 71 years, and 55.9% and 53.0% were men, respectively. The proportions of patients with diabetes, hypertension, severe liver disease, COPD and myocardial infarction were 40.3%, 77.2%, 2.9%, 20.4%, 32.9%, 40.8%, 66.4%, 1.3%, 20.4% and 17.3%, respectively. The proportions of patients requiring mechanical ventilation were 39.3% and 46.4%, respectively. Endpoint events occurred in 13.3% (1,484/11,156) and 13.4% (1,319/9,837) of patients, respectively.

TABLE 1

Table 1. Clinical data of the study population.

Feature selection

In the training set, the LASSO regression was used for the automatic features selection (Figure 2). LASSO regression minimizes the loss function (binomial deviance) by changing the regularization coefficient lambda (λ) to generate zero coefficients. Finally, 17 features with the highest predictive value were introduced into the models.

FIGURE 2

Figure 2. Features selection process. Automated feature selection for 57 clinical factors was performed using Least Absolute Shrinkage and Selection Operator, which minimized the loss function binomial deviance, shrank coefficients, and produced some coefficients that are zero, allowing efficient feature selection (A). The algorithm outputted 17 filtered features with non-zero coefficients that were included in model generation subsequently (B).

Model establishment construction

The selected 17 features were input into a machine learning algorithm to establish the XGBoost model in the training set. The same procedure was performed using the traditional linear regression method to establish the Logistic model. The super-parameters of the XGBoost model were obtained: number of trees (nrounds) = 95; maximum depth of the tree (max depth) = 3; learning rate (eta) = 0.1527683; sample resampling ratio (subsample) = 0.6293208; minimum loss split (gamma) = 0.1934144; minimum sample weight required on child nodes (min childweight) = 10; characteristic random sampling ratio (colsample bytree) = 0.6447264; and the single maximum increment allowed in the weight estimation of the tree (max delta step) = 10. The importance of each prediction feature of the XGBoost model was obtained using the SHAP algorithm, and the importance graph listed the features in descending order (Figure 3).

FIGURE 3

Figure 3. Chart of feature importance ranking in all ICU patients with congestive heart failure. The importance ranking of the 17 risk factors with stability and interpretation using the optimal model in the training set. Each point in the graph represents the SHAP value for each sample; a color month closer to purple indicates a larger value, while that closer to yellow indicates a smaller value. The more scattered the points in the graph, the greater the influence of the variable on the model. APS III, age and SOFA were strongest predictors. SHAP, SHapley additive exPlanations; SOFA, sequential organ failure assessment; APS III, acute physiology score III scores; MCHC, mean corpuscular hemoglobin content; RDW, red blood cell distribution width.

The SHAP values represented the contribution of each feature to the final prediction and are useful for elucidating and interpreting model predictions for a single patient. The combined effect of all factors provided the final SHAP value that corresponded to the predicted score. APS III, age and SOFA were the three strongest predictors (Figure 3).

Model validation

The XGBoost model, Logistic model, and GWTG-HF model were independently validated in the test set. The XGBoost model performed best in the discrimination evaluation (Figure 4A), with an AUC of 0.771 [95% confidence interval (CI): 0.757–0.784] compared to the Logistic model (AUC 0.725, 95% CI: 0.710–0.740, P < 0.001) and the GWTG-HF model (AUC 0.649, 95% CI: 0.633–0.665, P < 0.001). The calibration degree evaluation was performed to demonstrate the consistency between the predicted probability of the model and the actual probability. The results showed that the three models had a good degree of calibration (Figure 5). The XGBoost model achived better Brier scores than the Logistic model and the GWTG-HF model (Brier score: 0.100, 0.107 and 0.112, respectively), which indicated better model calibration. In addition, in the evaluation of clinical effectiveness, the analysis of the decision curve showed that the net benefit level using the XGBoost model was higher than “zero risk of mortality” and “all mortality”, and superior to the other two models in the threshold probability of 0%–90% (Figure 4B).

FIGURE 4

Figure 4. The receiver operating characteristic curves and the decision curve analysis of all models in test dataset. (A). The brown transverse line = net benefit when all patients are considered to not have the outcome (in-hospital all-cause mortality); red oblique line = net benefit when all patients were considered to have the outcome; the decision curve analysis of all models showed that the proportion of the benefit for the population was the highest when the risk assessment of the XGBoost model was used for treatment, while the treatment threshold probability was from 0% to 90% (B).

FIGURE 5

Figure 5. Calibration plots of all models in testing dataset. Calibration plots of predicted probabilities (X-axis) and actual proportions (Y-axis) for different prediction models. With the calibration slope closest to 1.0 (an ideal model), the eXtreme Gradient Boosting (XGBoost model) from the machine learning algorithm obtained a fairly satisfactory calibration, while the other models calibrated poorly.

Model deployment

An internet-based version of the XGBoost model has been accessible for all physicians (https://nkuwangkai-app-for-mortality-prediction-app-a8mhkf.streamlit.app). This web-based tool will automatically predict the outcome for ICU patients with HF when the values of the 17 features required for this model are entered as shown in Figure 6. Moreover, the online tool provides users with the explanation of the prediction of the model. It can predict the outcome of patients with missing values.

FIGURE 6

Figure 6. Schematic diagram of online website calculator for the mortality prediction in all ICU patients with congestive heart failure. Input the patient information and click the “predict” button to get the patient's in-hospital mortality risk assessment results.

Discussion

In this study, we established and validated an interpretable machine learning-based risk stratification tool for in-hospital all-cause mortality in ICU HF patients. Compared with traditional risk prediction methods, machine learning technology captures both the linear and the nonlinear relationships between risk prediction factors and mortality endpoints from high-dimensional datasets. Our model achieved the most satisfactory risk stratification and calibration.

Data from 341 hospitals in the United States showed the median ICU admission rate for hospitalized HF patients was 10% (interquartile range, 6%–16%) (3), and the in-hospital mortality rate of HF patients admitted to the ICU was significantly higher than that of patients in general wards. The all-cause in-hospital mortality rates were 17.3% and 6.5% for ICU-admitted HF patients and all HF patients, respectively, in the RO-AHFS study (17), and 17.8% and 4.5% in the ALARM-HF study (18). The in-hospital mortality rates in our study were 13.3% and 13.4%, respectively, similar to those in the above studies. Although the high in-hospital mortality in ICU HF patients was primarily due to the underlying disease severity, accurate prognostic evaluation is the basis of clinical decision-making in ICU patients with HF. Therefore, this study has potentially important clinical implications.

A major challenge in the management of HF is the identification of mortality risk. Those previously prediction models (GWTG-HF model and Logistic model) were mostly based on traditional generalized linear regression, such as logistic regression, which is highly explanatory. However, faced with a large data volume and high data latitude, they may be useless, as the clinical processes are often unpredictable and the clinical factors of HF are highly variable, which makes it difficult for physicians to obtain accurate predictions using single-factor analysis. The application of machine learning integrated provides more powerful support for clinical decision-making (19–21). Similar to these studies, our study obtained a novel XGBoost model by applying a more flexible integrated machine learning algorithm. This model was derived from a comprehensive consideration of clinical features, comorbidities, and medication. Distinct from other machine learning models (7–9, 13, 22, 23), our study established an independent external test set and reported the calibration degree of the models in detail. Independent validation confirmed that our XGBoost model significantly improved prediction performance compared with the Logistic model and the GWTG-HF model in all-cause mortality risk for ICU patients with HF. It is worth mentioning that population characteristics between the training set and the test set were significantly different, but our XGBoost model overcame this difference and achieved good risk stratification (Table 1, Figures 4, 5). This excellent risk stratification performance may indicate that our XGBoost model can be generalized to other HF patients, not just the test set. This improvement also confirmed the nonlinear correlation between the clinical features of patients with HF and the risk of in-hospital mortality from another perspective. The good prediction alerts doctors and patients to the disease’s the severity to prepare for higher-level life support, such as mechanical circulatory support and heart transplantation or hospice care. This finding requires confirmation in future studies.

Another important challenge is capacity building for the primary management of HF. With the formation of a hierarchical diagnosis and treatment system, most primary hospitals have set up ICU wards to admit critically ill patients. Previous studies attempted to incorporate more biochemical indicators or cardiac magnetic resonance imaging parameter into models to obtain superior prediction performance (24). However, imaging parameters limited their generalized application to a certain extent. In contrast, in our study, 17 variables with the most predictive value were screened into the model by LASSO, which were all easily accessible, making it possible for primary hospitals to accurately assess patients’ risk through feature acquisition.

The third obvious challenge is to correctly explain the machine learning prediction model and visually present the prediction results to clinicians. We applied the SHAP value to the XGBoost model for optimize prediction and interpretability. SHAP can perform both local and global interpretability, and has a solid theoretical basis compare to other methods (25). SHAP’s evaluation includes the importance of the output of all the combinations of elements, and provides a consistent and locally accurate attribute value for each element in the prediction model. This visual interpretation was applied to the black box tree integration model XGBoost. Furthermore, we developed a website calculator to help physicians intuitively understand both the key features and the decision-making process. These means may promote and strengthen individualized treatment strategies.

The fourth challenge is how to translate machine learning model into clinical practical solution. Few studies develop a website or application to facilitate the accession to the tool (26). We built a new website calculator to public as shown in Figure 6 (https://nkuwangkai-app-for-mortality-prediction-app-a8mhkf.streamlit.app). This calculator concisely displays the risk of in-hospital all-cause mortality in ICU patients with HF.

Inevitably, this study had several limitations. First, there were some missing values in the dataset, but multiple imputation methods were used to fill in the missing values, which may make the them closer to the true values. Second, the information collected was structured or tabular. Further studies are needed to mine and integrate the unstructured data such as medical records and imaging biomarkers to improve the prediction. Third, since out data are derived from ICU patients, this model is primarily applicable for ICU patients accompanied with HF. The clinical characteristics of ICU patients with HF might be different from those admitted to the cardiovascular care unit, so evidence is needed for the application of this model in cardiovascular care unit patients with HF. Fourth, it is difficult to tell whether or not the heart failure is the primary cause of ICU admission in MIMIC-IV. Although we validated our model in an independent external data, further research is needed to validate this model in patients with different heart failure etiologies.

Conclusion

In this study, machine learning techniques were used to build a new risk stratification model to stratified the risk of in-hospital all-cause mortality in ICU patients with HF. This model was translated into a web-based calculator which can accessed freely. We believe that this stratification model and calculator provide a clear explanation for individualized risk prediction and serve as a simple rapid estimation tool.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Ethics statement

Ethical approval was not provided for this study on human participants because This study was a secondary analysis based on the data described above, and all patient’ information was kept anonymous. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

ZC, TL, SG, DZ and KW contributed to the design of the work. ZC, TL, and KW contributed to the analysis of the work. TL, SG, DZ and KW contributed to the interpretation of data. ZC, TL, and KW wrote the original manuscript. ZC, TL, SG, DZ and KW wrote the revised manuscript. ZC, TL, SG, DZ and KW approved the revised version to be published. ZC, TL, SG, DZ and KW agreed to be accountable for all aspects of the work. All authors contributed to the article and approved the submitted version.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcvm.2023.1119699/full#supplementary-material.

References

1. Ponikowski P, Voors AA, Anker SD, Bueno H, Cleland JGF, Coats AJS, et al. 2016 Esc guidelines for the diagnosis and treatment of acute and chronic heart failure: the task force for the diagnosis and treatment of acute and chronic heart failure of the European society of cardiology (esc)Developed with the special contribution of the heart failure association (hfa) of the esc. Eur Heart J. (2016) 37(27):2129–200. doi: 10.1093/eurheartj/ehw128

PubMed Abstract | CrossRef Full Text | Google Scholar

2. van Diepen S, Bakal JA, Lin M, Kaul P, McAlister FA, Ezekowitz JA. Variation in critical care unit admission rates and outcomes for patients with acute coronary syndromes or heart failure among high- and low-volume cardiac hospitals. J Am Heart Assoc. (2015) 4(3):e001708. doi: 10.1161/JAHA.114.001708

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Safavi KC, Dharmarajan K, Kim N, Strait KM, Li SX, Chen SI, et al. Variation exists in rates of admission to intensive care units for heart failure patients across hospitals in the United States. Circulation. (2013) 127(8):923–9. doi: 10.1161/CIRCULATIONAHA.112.001088

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Lee D, Straus S, Farkouh M, Austin P, Taljaard M, Chong A, et al. Trial of an intervention to improve acute heart failure outcomes. N Engl J Med. (2023) 388(1):22–32. doi: 10.1056/NEJMoa2211680

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Suzuki S, Yoshihisa A, Sato Y, Kanno Y, Watanabe S, Abe S, et al. Clinical significance of get with the guidelines-heart failure risk score in patients with chronic heart failure after hospitalization. J Am Heart Assoc. (2018) 7(17):e008316. doi: 10.1161/JAHA.117.008316

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Peterson PN, Rumsfeld JS, Liang L, Albert NM, Hernandez AF, Peterson ED, et al. A validated risk score for in-hospital mortality in patients with heart failure from the American heart association get with the guidelines program. Circ Cardiovasc Qual Outcomes. (2010) 3(1):25–32. doi: 10.1161/CIRCOUTCOMES.109.854877

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Wang Q, Li B, Chen K, Yu F, Su H, Hu K, et al. Machine learning-based risk prediction of malignant arrhythmia in hospitalized patients with heart failure. ESC Heart Fail. (2021) 8(6):5363–71. doi: 10.1002/ehf2.13627

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Wang K, Tian J, Zheng C, Yang H, Ren J, Liu Y, et al. Interpretable prediction of 3-year all-cause mortality in patients with heart failure caused by coronary heart disease based on machine learning and shap. Comput Biol Med. (2021) 137:104813. doi: 10.1016/j.compbiomed.2021.104813

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Tohyama T, Ide T, Ikeda M, Kaku H, Enzan N, Matsushima S, et al. Machine learning-based model for predicting 1 year mortality of hospitalized patients with heart failure. ESC Heart Fail. (2021) 8(5):4077–85. doi: 10.1002/ehf2.13556

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Shin S, Austin PC, Ross HJ, Abdel-Qadir H, Freitas C, Tomlinson G, et al. Machine learning vs. conventional statistical models for predicting heart failure readmission and mortality. ESC Heart Fail. (2021) 8(1):106–15. doi: 10.1002/ehf2.13073

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Mimic-Iv (Version 2.2). Physionet [Internet]. (2022).

12. Pollard TJ, Johnson AEW, Raffa JD, Celi LA, Mark RG, Badawi O. The eicu collaborative research database, a freely available multi-center database for critical care research. Sci Data. (2018) 5:180178. doi: 10.1038/sdata.2018.178

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Wang X, Zhu T, Xia M, Liu Y, Wang Y, Wang X, et al. Predicting the prognosis of patients in the coronary care unit: a novel multi-category machine learning model using Xgboost. Front Cardiovasc Med. (2022) 9:764629. doi: 10.3389/fcvm.2022.764629

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Li J, Liu S, Hu Y, Zhu L, Mao Y, Liu J. Predicting mortality in intensive care unit patients with heart failure using an interpretable machine learning model: retrospective cohort study. J Med Internet Res. (2022) 24(8):e38082. doi: 10.2196/38082

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Nwanosike EM, Conway BR, Merchant HA, Hasan SS. Potential applications and performance of machine learning techniques and algorithms in clinical practice: a systematic review. Int J Med Inf. (2022) 159:104679. doi: 10.1016/j.ijmedinf.2021.104679

CrossRef Full Text | Google Scholar

16. Lundberg SM, Nair B, Vavilala MS, Horibe M, Eisses MJ, Adams T, et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat Biomed Eng. (2018) 2(10):749–60. doi: 10.1038/s41551-018-0304-0

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Chioncel O, Ambrosy AP, Filipescu D, Bubenek S, Vinereanu D, Petris A, et al. Patterns of intensive care unit admissions in patients hospitalized for heart failure: insights from the ro-ahfs registry. J Cardiovasc Med. (2015) 16(5):331–40. doi: 10.2459/JCM.0000000000000030

CrossRef Full Text | Google Scholar

18. Follath F, Yilmaz MB, Delgado JF, Parissis JT, Porcher R, Gayat E, et al. Clinical presentation, management and outcomes in the acute heart failure global survey of standard treatment (alarm-hf). Intensive Care Med. (2011) 37(4):619–26. doi: 10.1007/s00134-010-2113-0

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Segar MW, Hall JL, Jhund PS, Powell-Wiley TM, Morris AA, Kao D, et al. Machine learning-based models incorporating social determinants of health vs traditional models for predicting in-hospital mortality in patients with heart failure. JAMA Cardiol. (2022) 7(8):844–54. doi: 10.1001/jamacardio.2022.1900

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Segar MW, Jaeger BC, Patel KV, Nambi V, Ndumele CE, Correa A, et al. Development and validation of machine learning-based race-specific models to predict 10-year risk of heart failure: a multicohort analysis. Circulation. (2021) 143(24):2370–83. doi: 10.1161/CIRCULATIONAHA.120.053134

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Zhang Z, Chen L, Xu P, Hong Y. Predictive analytics with ensemble modeling in laparoscopic surgery: a technical note. Laparosc Endosc Robot Surg. (2022) 5(1):25–34. doi: 10.1016/j.lers.2021.12.003

CrossRef Full Text | Google Scholar

22. Zhao X, Liu C, Zhou P, Sheng Z, Li J, Zhou J, et al. Development and validation of a prediction rule for major adverse cardiac and cerebrovascular events in high-risk myocardial infarction patients after primary percutaneous coronary intervention. Clin Interv Aging. (2022) 17:1099–111. doi: 10.2147/CIA.S358761

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Xu Y, Han D, Huang T, Zhang X, Lu H, Shen S, et al. Predicting icu mortality in rheumatic heart disease: comparison of Xgboost and logistic regression. Front Cardiovasc Med. (2022) 9:847206. doi: 10.3389/fcvm.2022.847206

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Coriano’ M, Bray G, Micheli LD, Cecere AG, Marra MP, Lanera C, et al. Adverse events prediction in nonischemic dilated cardiomyopathy by two machine learning models integrating clinical and cardiovascular magnetic resonance data. Eur Heart J Cardiovasc Imaging. (2022) 23(Supplement_2):jeac141.010. doi: 10.1093/ehjci/jeac141.010

CrossRef Full Text | Google Scholar

25. Hu JY, Wang Y, Tong XM, Yang T. When to consider logistic lasso regression in multivariate analysis? Eur J Surg Oncol. (2021) 47(8):2206. doi: 10.1016/j.ejso.2021.04.011

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Yao Z, Mao C, Ke Z, Xu Y. An explainable machine learning model for predicting the outcome of ischemic stroke after mechanical thrombectomy. J Neurointerv Surg. (2022):019598. doi: 10.1136/jnis-2022-019598

CrossRef Full Text | Google Scholar

Keywords: heart failiure, mortality, risk stratication, machine laerning, calculator, intensive care unit

Citation: Chen Z, Li T, Guo S, Zeng D and Wang K (2023) Machine learning-based in-hospital mortality risk prediction tool for intensive care unit patients with heart failure. Front. Cardiovasc. Med. 10:1119699. doi: 10.3389/fcvm.2023.1119699

Received: 9 December 2022; Accepted: 21 March 2023;
Published: 3 April 2023.

Edited by:

Remo Furtado, Brazilian Clinical Research Institute, Brazil

Reviewed by:

Zhongheng Zhang, Sir Run Run Shaw Hospital, China
Dongkai Li, Peking Union Medical College Hospital (CAMS), China

© 2023 Chen, Li, Guo, Zeng and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Kai Wang bmt1d2FuZ2thaUAxNjMuY29t

^†These authors have contributed equally to this work and share first authorship

Specialty Section: This article was submitted to Intensive Care Cardiovascular Medicine, a section of the journal Frontiers in Cardiovascular Medicine

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.