Abstract
Background: Heart failure (HF) is the main cause of mortality in hemodialysis (HD) patients. However, it is still a challenge for the prediction of HF in HD patients. Therefore, we aimed to establish and validate a prediction model to predict HF events in HD patients.
Methods: A total of 355 maintenance HD patients from two hospitals were included in this retrospective study. A total of 21 variables, including traditional demographic characteristics, medical history, and blood biochemical indicators, were used. Two classification models were established based on the extreme gradient boosting (XGBoost) algorithm and traditional linear logistic regression. The performance of the two models was evaluated based on calibration curves and area under the receiver operating characteristic curves (AUCs). Feature importance and SHapley Additive exPlanation (SHAP) were used to recognize risk factors from the variables. The Kaplan–Meier curve of each risk factor was constructed and compared with the log-rank test.
Results: Compared with the traditional linear logistic regression, the XGBoost model had better performance in accuracy (78.5 vs. 74.8%), sensitivity (79.6 vs. 75.6%), specificity (78.1 vs. 74.4%), and AUC (0.814 vs. 0.722). The feature importance and SHAP value of XGBoost indicated that age, hypertension, platelet count (PLT), C-reactive protein (CRP), and white blood cell count (WBC) were risk factors of HF. These results were further confirmed by Kaplan–Meier curves.
Conclusions: The HF prediction model based on XGBoost had a satisfactory performance in predicting HF events, which could prove to be a useful tool for the early prediction of HF in HD.
Introduction
Heart failure (HF) as a clinical syndrome is one of the main causes of mortality (Ebong et al., 2014; Virani et al., 2021). More than 60Â million people are affected by HF worldwide, and the number of HF patients is increasing every year (Groenewegen et al., 2020). Compared with the general population, the prevalence of HF in patients with chronic kidney disease (CKD) is much higher, especially in patients with end-stage renal disease (ESRD) (Rangaswami and McCullough, 2018; Rapa et al., 2019). Currently, hemodialysis (HD) as a renal replacement therapy (RRT) is the main treatment of ESRD (Akash et al., 2014). However, more than 40% of HD patients suffer from HF, which increase the medical care, economic burden, and mortality (House et al., 2019; Sun et al., 2019). Therefore, identification, pre-estimation, and timely interventions can improve the prognosis and prolong the survival time for the population who show a high risk of HF (Abdo, 2017).
The causes of HF in HD patients are multifactorial (Wang and Sanderson, 2011). Similar to the general population, traditional risk factors such as aging, hypertension, diabetes mellitus (DM), and atherosclerosis are associated with HF in HD patients (Liu et al., 2014; Cozzolino et al., 2018). However, excess HF prevalence and mortality in HD population are not fully accounted for by the traditional risk factors (Schefold et al., 2016). Some researchers have found that dialysis can increase the risk of HF events in ESRD patients (Rangaswami and McCullough, 2018; Samanta et al., 2019). Evidence shows that the malnutrition–inflammation syndrome in HD patients is associated with HF (Almeida et al., 2008; Chang et al., 2020). In addition, changed hemodynamic and blood pressure by HD can increase the risk of HF (Dorairajan et al., 2010). Several traditional risk prediction models were proposed to predict the risk of HF in population without HD (Ouwerkerk et al., 2014; Voors et al., 2017). However, these linear models missed specific related factors for HD people and oversimplified the complicated relationships between factors, which may lead to a decrease in performance and miss important risk factors (Mpanya et al., 2021).
Recent advances in extreme gradient boosting (XGBoost), a new integrated machine learning algorithm, have provided a robust method to identify the complex non-linear relationship between multiple variables and outcomes (Kagiyama et al., 2019). This algorithm has strong model generalization ability, fast operating speed, and high model accuracy, which has been applied in orthopedic auxiliary classification, prediction of interaction, and analysis of hypertension-related symptoms to improve accuracy in complex clinical decision-making (Chang et al., 2019; Li and Zhang, 2020; Yu et al., 2020).
Therefore, we aimed to establish a prediction model that integrated patient-specific information and non-traditional factors of HF in HD patients based on XGBoost 1) to accurately predict HF events and 2) to assess the risk of HF in patients with HD treatment.
Materials and Methods
Study Population
This retrospective study analyzed the data of 410 ESRD patients who underwent HD treatment in the HD centers of the Third Affiliated Hospital of Southern Medical University and the Third Affiliated Hospital of Sun Yat-sen University between January 2015 and September 2019. All the patients were older than 18Â years and received HD treatment for at least 3Â months. Patients with the following conditions were excluded from this study: 1) history of renal transplantation; 2) malignancy, acute infection, hepatic, and pulmonary dysfunction; and 3) lack of data on demographic characteristics, laboratory examinations, or physical examination. This study was approved by the Ethics Committee of the Third Affiliated Hospital of Southern Medical University.
Data Collection
A total of 21 factors were collected at the start of HD therapy. The basic information consisted of the patient’s gender, age, and history of various diseases. After fasting overnight for at least 8 h, the patient’s venous blood samples were collected before dialysis therapy. The laboratory indicators included C-reactive protein (CRP), blood urea nitrogen (BUN), calcium (CA), hemoglobin (HB), phosphorus (P), cholesterol (CHOL), serum creatinine (CRE), lymphocyte (LYMPH), uric acid (UA), low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), intact parathyroid hormone (IPTH), white blood cell count (WBC), platelet count (PLT), and neutrophil count (NEUT). All the abovementioned examinations were conducted at the laboratory centers in the Third Affiliated Hospital of Southern Medical University and the Third Affiliated Hospital of Sun Yat-sen University.
Outcome and Follow-Up
The occurrence of HF events (fatal or not) after HD was recorded. The HF diagnosis was made based on Framingham criteria (Groenewegen et al., 2020). Follow-up visits were performed by the HD centers. Patients who developed HF after HD were censored at the earliest date of a HF event. The HF events were recorded up to May 1, 2020.
Statistical Analysis
All statistical analyses and model establishment were performed using SPSS 26.0 for Windows and Python 3.7.6. All continuous variables were described as mean ± standard deviation (SD), and these variables were compared between groups by performing t-tests. In addition, discrete variables were expressed in numbers (n) and percentages (%), and the chi-square test was applied. All results were considered to be statistically significant within a two-sided test with p <0.05.
Two classification models were established based on XGBoost and traditional linear logistic regression models. The multiple logistic regression was selected using a stepwise method. In the XGBoost model, all the characteristics of HD patients were included. The data set was randomly divided into a training set by 75% and a validation set by 25%. The performance of the models was evaluated using calibration curves and area under the receiver operating characteristic curves (AUCs). To evaluate the influence of different variables on the results, the SHAP value of important variables was calculated (Rodriguez-Perez and Bajorath, 2020). A risk curve was constructed using the Kaplan–Meier analysis, and differences between groups were compared by a log-rank test to further evaluate the performance of our model and verify the risk factors selected by the mode. To make our model be an easy-to-use tool, a nomogram was developed based on the risk factors selected by our model.
Results
Demographic and Clinical Characteristics of Study Population
According to the inclusion and exclusion criteria, a total of 353 patients were finally included in this study. The basic variables of study population are shown in Table 1. Among these patients, 96 patients (73 males and 23 females) developed HF during the follow-up duration and 257 patients (166 males and 91 females) did not. In general, the average age of these patients was 54.92 ± 15.29, and the proportion of males was 67.71%. Compared with patients without HF, those with HF were older and had a higher ratio of hypertension and diabetes, and there was a significant difference in gender. In terms of laboratory data, for patients in the HF group, their WBC, NEUT, CRP, and TIBC levels were significantly (p <0.05) higher than those of the other group. There was no significant difference in other characteristics.
TABLE 1
| Clinical characteristic and laboratory parameter | All patients | HF | Non-HF | p-value |
|---|---|---|---|---|
| Age (years) | 56.05 (43.13–66.32) | 63.1 (54.64–72.03) | 52.0 (41.0–61.1) | <0.001 |
| Gender (male) (n, %) | 239 (66.71) | 73 (76.04) | 166 (64.59) | 0.041 |
| HTN (n, %) | 281 (79.60) | 88 (91.67) | 193 (75.10) | <0.001 |
| DM (n, %) | 123 (34.84) | 43 (44.79) | 80 (31.13) | 0.017 |
| HB (g/L) | 96.44 ± 21.01 | 97.71 ± 21.52 | 95.98 ± 20.96 | 0.423 |
| NEUT (109/L) | 4.39 (3.47–5.60) | 4.86 (3.50–6.41) | 4.26 (3.44–5.17) | 0.005 |
| WBC (109/L) | 6.72 (5.36–8.03) | 7.68 (6.07–9.00) | 6.48 (5.20–7.69) | <0.001 |
| LYMPH (109/L) | 1.19 (0.89–1.50) | 1.18 (0.90–1.54) | 1.20 (0.89–1.49) | 0.854 |
| IPTH (pg/ml) | 343.40 (213.43–529.19) | 322.50 (192.51–465.64) | 356.45 (218.37–536.82) | 0.128 |
| UA(μmol/L) | 494.88 ± 146.20 | 513.74 ± 154.06 | 487.81 ± 142.83 | 0.861 |
| BUN (mmol/L) | 23.07 (16.39–29.09) | 23.27 (18.30–31.25) | 22.85 (15.96–28.69) | 0.299 |
| CRE (μmol/L) | 848.60 ± 391.62 | 812.26 ± 372.39 | 862.18 ± 398.40 | 0.329 |
| CA (mmol/L) | 2.17 (2.04–2.32) | 2.14 (1.99–2.30) | 2.19 (2.07–2.33) | 0.103 |
| P (mmol/L) | 1.81 (1.44–2.25) | 1.81 (1.51–2.20) | 1.82 (1.44–2.26) | 0.854 |
| CHOL (mmol/L) | 4.15 (3.62–4.89) | 4.10 (3.57–4.97) | 4.17 (3.65–4.87) | 0.655 |
| TG (mmol/L) | 1.51 (1.17–1.92) | 1.43 (1.11–1.90) | 1.52 (1.20–1.93) | 0.434 |
| HDL-C (mmol/L) | 0.97 (0.87–1.11) | 0.99 (0.88–1.15) | 0.96 (0.86–1.10) | 0.572 |
| LDL-C (mmol/L) | 2.45 (2.00–2.96) | 2.46 (2.00–3.15) | 2.44 (1.99–2.94) | 0.668 |
| CRP (mg/L) | 5.66 (2.65–13.65) | 8.40 (4.05–16.35) | 5.01 (2.12–13.00) | <0.001 |
| TIBC (mmol/L) | 36.16 (33.73–39.55) | 35.84 (33.48–38.09) | 36.46 (33.95–40.00) | 0.034 |
| PLT (109/L) | 201.00 (156.00–243.70) | 210.00 (155.50–273.00) | 198.72 (157.25–237.00) | 0.177 |
Clinical characteristics and laboratory parameters of patients in the HF group and the non-HF group.
CRP, C-reactive protein; BUN, blood urea nitrogen; CA, calcium; HB, hemoglobin; DM, diabetes mellitus; TG, triglycerides; P, phosphorus; CHOL, cholesterol; CRE, serum creatinine; LYMPH, lymphocyte; UA, uric acid; LDL-C, low-density lipoprotein cholesterol; HDL-C, high-density lipoprotein cholesterol; IPTH, intact parathyroid hormone; WBC, white blood cell count; NEUT, neutrophil count; PLT, platelet count; TIBC, total iron-binding capacity; HTN, hypertension.
Logistic Regression Analysis
As presented in Table 2, the following factors were included in a multivariate stepwise logistic regression model: HTN (OR: 3.786, 95% CI: 1.640–8.739, and p = 0.002), WBC (OR: 1.115, 95% CI: 1.026–1.212, and p = 0.011), CRP (OR: 1.020, 95% CI: 1.009–1.032, and p = 0.001), and age (OR: 1.026, 95% CI: 1.008–1.044, and p = 0.004). The receiver operating characteristic (ROC) curve of the logistic regression model is shown in Figure 1, and the performance of this model was evaluated using the calibration curve, as shown in Figure 2. The area under the curve (AUC) was 0.722 for the validation group and 0.735 for the training group. In addition, accuracy, sensitivity, and specificity of the model were 74.8, 75.6, and 74.4%, respectively, evaluated by the validation group.
TABLE 2
| Variable | B | SE | Wald | p-value | OR (95%CI) |
|---|---|---|---|---|---|
| HTN | 1.331 | 0.427 | 9.733 | 0.002 | 3.786 (1.640–8.739) |
| WBC | 0.109 | 0.043 | 6.532 | 0.011 | 1.115 (1.026–1.212) |
| CRP | 0.020 | 0.006 | 11.730 | 0.001 | 1.020 (1.009–1.032) |
| AGE | 0.025 | 0.009 | 8.113 | 0.004 | 1.026 (1.008–1.044) |
Multivariate logistic regression model.
HTN, hypertension; WBC, white blood cell count; CRP, C-reactive protein; OR, odds ratio; CI, confidence interval.
FIGURE 1
FIGURE 2
Extreme Gradient Boosting Model
The ROC curve and the calibration curve of the XGBoost model are presented in Figure 1 and Figure 2, respectively. It was found that the accuracy, sensitivity, specificity, and AUC value were 78.5, 79.6, 78.1%, and 0.814, respectively, evaluated by the validation cohort. The feature importance ranking which represents the contribution of the corresponding feature to the model prediction based on gain is shown in Figure 3. It was found that HTN, age, CRP, WBC, and PLT were more important for the prediction of HC than other features.
FIGURE 3
Interpretation of the Extreme Gradient Boosting Model
The SHAP value quantifies the contribution of data to the outcome in the XGBoost model; the size of the value reflects the influence of the corresponding eigenvalue on the outcome, and the positive and negative values reflect whether the results are promoted. Based on the established XGBoost model, the SHAP values of the five relevant factors showed how the feature affected the results, as shown in Figure 3.
Kaplan–Meier Analysis
The Kaplan–Meier analysis was used to confirm the effects of risk factors on the incidence of HF. The risk factors were stratified by the SHAP value when it was 0. As shown in Figure 4, the risk curve revealed that considering the influence of different outcome time, patients with high levels of WBC, CRP, PLT, old age, and hypertension had a higher incidence of HF, and the effect of UA was not obvious enough. The nomogram of the XGBoost model was constructed, as shown in Figure 5.
FIGURE 4
FIGURE 5
Discussion
In this retrospective study, we established and validated a clinical prediction model to predict HF in HD patients based on the XGBoost algorithm. The results indicated that our model could effectively identify the individuals who suffer from HF using routinely clinical parameters. Our result indicated that XGBoost as a non-linear model had better performance than the logistic model and showed stronger ability in identification of risk factors.
In recent years, machine learning has been widely used in risk prediction and disease screening, which has obtained excellent performance (Kwon et al., 2020; Meng et al., 2020; Cobb et al., 2021; Koteluk et al., 2021; Kourou et al., 2021). Therefore, in this research, XGBoost, an integrated machine learning algorithm, was applied to identify the complex non-linear relationship between HF and clinical variables, as well as to evaluate the importance of the variables to the HF. Although traditional multivariable analysis methods have been applied for the identification of HF in HD patients (Gedfew et al., 2020; Bramania et al., 2021), to our knowledge, this was the first machine learning model for the prediction of HF. Our results showed that the performance of the XGBoost model was better than the traditional logistic regression model in prediction of HF. To make our XGBoost model interpretable, the feature importance and SHAP value were applied to evaluate the contribution of variables to HF. Except four risk factors including age, hypertension, WBC, and CRP found in the logistic regression model, a new risk factor, PLT, was found in the XGBoost model. This showed the advantages of our XGBoost mode, which could improve the accuracy of the classification and the efficiency of identification.
In addition to HF response, the performance of our model was evaluated by the Kaplan–Meier analysis. The differences between the Kaplan–Meier curves of different risk levels indicated that the risk factors selected by our XGBoost model were important prognostic factors of HF. The results of Kaplan–Meier analysis also showed that age more than 60, WBC more than 6.5 × 109, CRP more than 15 mg/L, PLT more than 250 × 109, and hypertension were independent risk factors which could increase the probability of HF. Moreover, based on the five risk factors selected by the XGBoost model and the Kaplan–Meier analysis, the nomogram (Yao et al., 2019) was designed to give an easy-to-use tool for the prediction of HF.
In our study, the proportion of hypertension in HF patients was significantly greater than that in the non-HF group, which is consistent with a recent study (Bramania et al., 2021). The analysis of characteristics showed that hypertension was a risk factor for HF events, which is consistent with the results of a recent study (Cozzolino et al., 2017). Patients with hypertension already had a huge cardiovascular burden before dialysis (Van Buren, 2016). There are pieces of evidence that dialysis treatment can lead to abnormal fluctuations in blood pressure and affect patient’s hemodynamics (Chou et al., 2018; Douvris et al., 2019). The vascular intima is more damaged by the abnormal fluctuations in the blood pressure, and HF is more likely to occur (Buren and Inrig, 2012).
Age is an important risk factor of HF. In old age, the shape and function of the vascular wall are changed due to oxidative stress, cell aging, and inflammation (Ghebre et al., 2016). In addition, with increasing age, body functions gradually decline, physiological compensatory function decreases, chronic diseases such as high blood pressure and diabetes occur frequently, and symptoms such as anemia, malnutrition, and chronic inflammation are prone to occur; these factors increase the risk of HF events (McHugh and Gil, 2018; Groenewegen et al., 2020). It has been reported that approximately 80% of HF patients were more than 60Â years old, and the prevalence increased with age (Vigen et al., 2012). According to the results of our study, the risk of HF continued to increase from about 45Â years old, which was consistent with the report from the American Heart Association (Virani et al., 2021).
WBC and CRP are widely used to reflect the inflammatory state in patients (Rienstra et al., 2012; Adamo et al., 2020). Inflammation and HF are strongly interconnected, and higher levels of inflammation markers indicated an increased mortality and morbidity of HF in patients (Ekdahl et al., 2017). Some studies have confirmed that inflammation markers are implicated in the development of HF because the immune system leads to deterioration of the structure and function of the cardiovascular system by the inflammatory response (Van Linthout and Tschope, 2017; Castillo et al., 2020). A study has demonstrated that high levels of WBC are closely related to the incidence of HF (Bekwelem et al., 2011). CRP is one of the recognized risk factors for CVD events, and a meta-analysis has reported that CRP plays an important role in the development of HF (Emerging Risk Factors Collaboration et al., 2010). In this study, our result was consistent with these previous findings. In addition, our results showed that other serum biomarkers, such as an anomalous level of PLT, were related to HF. Excessive PLT can promote inflammation, negatively affect the left ventricular function, and increase the risk of atherosclerosis; these diseases would increase the probability of HF (Gary et al., 2013; Chen and Yang, 2020).
All these data required in this study can be easily obtained through routine clinical examination, not limited by many additional conditions, and can make full use of existing resources. The XGBoost model showed a better prediction effect by using higher indicators such as AUC, accuracy, sensitivity, and atherosclerosis specificity. It can be a useful tool for doctors to evaluate the HF risk of HD patients and carry out personalized intervention in advance. However, there are still some limitations to our study. We only focused on the event of HF in HD population of China; it needs to be verified by different populations and races. In addition, this study is a retrospective examination which lacks longitudinal observation. Therefore, in further research, a longitudinal data analysis can help evaluate the changes in these risk factors over time and further validate the results in our model.
In conclusion, we developed and validated a clinical prediction model based on the XGBoost algorithm for HF in end-stage renal disease patients. The model had an excellent predictive performance and provided a useful tool for the early prediction of HF in HD patients.
Statements
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving human participants were reviewed and approved by the Ethics Committee of the Third Affiliated Hospital of Southern Medical University. The patients/participants provided their written informed consent to participate in this study.
Author contributions
PL and XY contributed to the concept and design of the sutdy. YIW and XM organized the database. YAW, XM, and GX wrote the first draft of the manuscript. CH and JS wrote sections fo the manuscript. All authors contributed to manuscript revision, read, and approved the submitted version.
Funding
This work was supported by the Third Affiliated Hospital of Sun Yat-Sen University, Clinical Research Program (P000-277), the Science and Technology Planning Project of Guangdong Province, China (2019B020228001), the Meizhou Science and Technology Project (2019B0203003), the Open Fund of State Key Laboratory of EC Prevention and Treatment (K2020-0010 and K2020-0011), and the National Natural Science Foundation of China (U1804262, 61632002). Zhongyuan Thound Talents Program (204200510003).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1
AbdoA. S. (2017). Diagnosis and Management of Heart Failure in Long-Term Dialysis Patients. Curr. Heart Fail. Rep.14 (5), 404–409. 10.1007/s11897-017-0354-8
2
AdamoL.Rocha-ResendeC.PrabhuS. D.MannD. L. (2020). Reappraising the Role of Inflammation in Heart Failure. Nat. Rev. Cardiol.17 (5), 269–285. 10.1038/s41569-019-0315-x
3
AkashN. K.GiacomoM.EnricoR.ClaudioR. (2014). The Role of Economies of Scale in the Cost of Dialysis across the World: a Macroeconomic Perspective. Nephrol. Dial. Transpl.29 (4), 885–891. 10.1093/ndt/gft528
4
AlmeidaH. R. M.SantosE. M. C.DouradoK.MotaC.PeixotoR. (2008). Malnutrition Associated with Inflammation in the Chronic Renal Patient on Hemodialysis. Rev. Assoc. Med. Bras (1992)64 (9), 837–844. 10.1590/1806-9282.64.09.837
5
BekwelemW.LutseyP. L.LoehrL. R.AgarwalS. K.AstorB. C.GuildC.et al (2011). White Blood Cell Count, C-Reactive Protein, and Incident Heart Failure in the Atherosclerosis Risk in Communities (ARIC) Study. Ann. Epidemiol.21 (10), 739–748. 10.1016/j.annepidem.2011.06.005
6
BramaniaP. K.RuggajoP. J.FuriaF. F. (2021). Prevalence and Predictors of Heart Failure Among Patients on Maintenance Hemodialysis Therapy at Muhimbili National Hospital in Tanzania: a Cross-Sectional Study. Egypt. Heart J.73 (1), 97. 10.1186/s43044-021-00223-z
7
BurenP.InrigJ. K. (2012). Hypertension and Hemodialysis: Pathophysiology and Outcomes in Adult and Pediatric Populations. Pediatr. Nephrol.27 (3), 339–350. 10.1007/s00467-011-1775-3
8
CastilloE. C.Vazquez-GarzaE.Yee-TrejoD.Garcia-RivasG.Torre-AmioneG. (2020). What Is the Role of the Inflammation in the Pathogenesis of Heart Failure?Curr. Cardiol. Rep.22 (11), 139. 10.1007/s11886-020-01382-2
9
ChangJ. F.WuC. C.HsiehC. Y.LiY. Y.WangT. M.LiouJ. C. (2020). A Joint Evaluation of Impaired Cardiac Sympathetic Responses and Malnutrition-Inflammation Cachexia for Mortality Risks in Hemodialysis Patients. Front. Med. (Lausanne).7, 99. 10.3389/fmed.2020.00099
10
ChangW.LiuY.XiaoY.XuX.ZhouS.LuX.et al (2019). Probability Analysis of Hypertension-Related Symptoms Based on XGBoost and Clustering Algorithm. Appl. Sci.9 (6), 1215. 10.3390/app9061215
11
ChenT.YangM. (2020). Platelet-to-lymphocyte Ratio Is Associated with Cardiovascular Disease in Continuous Ambulatory Peritoneal Dialysis Patients. Int. Immunopharmacol78, 106063. 10.1016/j.intimp.2019.106063
12
ChouJ. A.StrejaE.NguyenD. V.RheeC. M.ObiY.InrigJ. K.et al (2018). Intradialytic Hypotension, Blood Pressure Changes and Mortality Risk in Incident Hemodialysis Patients. Nephrol. Dial. Transpl.33 (1), 149–159. 10.1093/ndt/gfx037
13
CobbJ. S.SealeM. A.JanorkarA. V. (2021). Evaluation of Machine Learning Algorithms to Predict the Hydrodynamic Radii and Transition Temperatures of Chemo-Biologically Synthesized Copolymers. Comput. Biol. Med.128, 104134. 10.1016/j.compbiomed.2020.104134
14
CozzolinoM.GalassiA.PivariF.CiceriP.ConteF. (2017). The Cardiovascular Burden in End-Stage Renal Disease. Contrib. Nephrol.191, 44–57. 10.1159/000479250
15
CozzolinoM.ManganoM.StucchiA.CiceriP.ConteF.GalassiA. (2018). Cardiovascular Disease in Dialysis Patients. Nephrol. Dial. Transplant.33 (Suppl. l_3), iii28–iii34. 10.1093/ndt/gfy174
16
DorairajanS.ChockalingamA.MisraM. (2010). Myocardial Stunning in Hemodialysis: what Is the Overall Message?Hemodial Int.14 (4), 447–450. 10.1111/j.1542-4758.2010.00495.x
17
DouvrisA.ZeidK.HiremathS.BagshawS. M.WaldR.Beaubien-SoulignyW.et al (2019). Mechanisms for Hemodynamic Instability Related to Renal Replacement Therapy: a Narrative Review. Intensive Care Med.45 (10), 1333–1346. 10.1007/s00134-019-05707-w
18
EbongI. A.GoffD. C.Jr.RodriguezC. J.ChenH.BertoniA. G. (2014). Mechanisms of Heart Failure in Obesity. Obes. Res. Clin. Pract.8 (6), e540–e548. 10.1016/j.orcp.2013.12.005
19
EkdahlK. N.SoveriI.HilbornJ.FellstromB.NilssonB. (2017). Cardiovascular Disease in Haemodialysis: Role of the Intravascular Innate Immune System. Nat. Rev. Nephrol.13 (5), 285–296. 10.1038/nrneph.2017.17
20
Emerging Risk Factors CollaborationKaptogeS.Di AngelantonioE.LoweG.PepysM. B.ThompsonS. G.CollinsR.et al (2010). C-reactive Protein Concentration and Risk of Coronary Heart Disease, Stroke, and Mortality: an Individual Participant Meta-Analysis. The Lancet375 (9709), 132–140. 10.1016/S0140-6736(09)61717-7
21
GaryT.PichlerM.BelajK.HafnerF.GergerA.FroehlichH.et al (2013). Platelet-to-lymphocyte Ratio: a Novel Marker for Critical Limb Ischemia in Peripheral Arterial Occlusive Disease Patients. PLoS One8 (7), e67688. 10.1371/journal.pone.0067688
22
GedfewM.AyenewT.MengstB.YirgaT.ZelalemM.WorkuY.et al (2020). Incidence and Predictors of Congestive Heart Failure Among Hemodialysis Patients at Felege Hiote Referral Hospital, Northwest Ethiopia, 2020: Retrospective Cohort Study. Res. Rep. Clin. Cardiol.11, 65–79. 10.2147/rrcc.s274942
23
GhebreY. T.YakubovE.WongW. T.KrishnamurthyP. (2016). Vascular Aging: Implications for Cardiovascular Disease and Therapy. Transl Med.6 (04). 10.4172/2161-1025.1000183
24
GroenewegenA.RuttenF. H.MosterdA.HoesA. W. (2020). Epidemiology of Heart Failure. Eur. J. Heart Fail.22 (8), 1342–1356. 10.1002/ejhf.1858
25
HouseA. A.WannerC.SarnakM. J.PinaI. L.McIntyreC. W.KomendaP.et al (2019). Heart Failure in Chronic Kidney Disease: Conclusions from a Kidney Disease: Improving Global Outcomes (KDIGO) Controversies Conference. Kidney Int.95 (6), 1304–1317. 10.1016/j.kint.2019.02.022
26
KagiyamaN.ShresthaS.FarjoP. D.SenguptaP. P. (2019). Artificial Intelligence: Practical Primer for Clinical Research in Cardiovascular Disease. J. Am. Heart Assoc.8 (17), e012788. 10.1161/JAHA.119.012788
27
KotelukO.WarteckiA.MazurekS.KolodziejczakI.MackiewiczA. (2021). How Do Machines Learn? Artificial Intelligence as a New Era in Medicine. J. Pers Med.11 (1), 32. 10.3390/jpm11010032
28
KourouK.ManikisG.Poikonen-SakselaP.MazzoccoK.Pat-HorenczykR.SousaB.et al (2021). A Machine Learning-Based Pipeline for Modeling Medical, Socio-Demographic, Lifestyle and Self-Reported Psychological Traits as Predictors of Mental Health Outcomes after Breast Cancer Diagnosis: An Initial Effort to Define Resilience Effects. Comput. Biol. Med.131, 104266. 10.1016/j.compbiomed.2021.104266
29
KwonE.ChoM.KimH.SonH. S. (2020). A Study on Host Tropism Determinants of Influenza Virus Using Machine Learning. Curr. Bioinform15 (2), 121–134. 10.2174/1574893614666191104160927
30
LiS.ZhangX. (2020). Research on Orthopedic Auxiliary Classification and Prediction Model Based on XGBoost Algorithm. Neural Comput. Appl.32 (59), 1971–1979. 10.1007/s00521-019-04378-4
31
LiuM.LiX. C.LuL.CaoY.SunR. R.ChenS.et al (2014). Cardiovascular Disease and its Relationship with Chronic Kidney Disease. Eur. Rev. Med. Pharmaco18 (19), 2918–2926.
32
McHughD.GilJ. (2018). Senescence and Aging: Causes, Consequences, and Therapeutic Avenues. J. Cel Biol217 (1), 65–77. 10.1083/jcb.201708092
33
MengC.ZhangJ.YeX.GuoF.ZouQ. (2020). Review and Comparative Analysis of Machine Learning-Based Phage Virion Protein Identification Methods. Biochim. Biophys. Acta Proteins Proteom1868 (6), 140406. 10.1016/j.bbapap.2020.140406
34
MpanyaD.CelikT.KlugE.NtsinjanaH. (2021). Machine Learning and Statistical Methods for Predicting Mortality in Heart Failure. Heart Fail. Rev.26 (3), 545–552. 10.1007/s10741-020-10052-y
35
OuwerkerkW.VoorsA. A.ZwindermanA. H. (2014). Factors Influencing the Predictive Power of Models for Predicting Mortality And/or Heart Failure Hospitalization in Patients with Heart Failure. JACC Heart Fail.2 (5), 429–436. 10.1016/j.jchf.2014.04.006
36
RangaswamiJ.McCulloughP. A. (2018). Heart Failure in End-Stage Kidney Disease: Pathophysiology, Diagnosis, and Therapeutic Strategies. Semin. Nephrol.38 (6), 600–617. 10.1016/j.semnephrol.2018.08.005
37
RapaS. F.Di IorioB. R.CampigliaP.HeidlandA.MarzoccoS. (2019). Inflammation and Oxidative Stress in Chronic Kidney Disease-Potential Therapeutic Role of Minerals, Vitamins and Plant-Derived Metabolites. Int. J. Mol. Sci.21 (1), 263. 10.3390/ijms21010263
38
RienstraM.SunJ. X.MagnaniJ. W.SinnerM. F.LubitzS. A.SullivanL. M.et al (2012). White Blood Cell Count and Risk of Incident Atrial Fibrillation (From the Framingham Heart Study). Am. J. Cardiol.109 (4), 533–537. 10.1016/j.amjcard.2011.09.049
39
Rodriguez-PerezR.BajorathJ. (2020). Interpretation of Machine Learning Models Using Shapley Values: Application to Compound Potency and Multi-Target Activity Predictions. J. Comput. Aided Mol. Des.34 (10), 1013–1026. 10.1007/s10822-020-00314-0
40
SamantaR.ChanC.ChauhanV. S. (2019). Arrhythmias and Sudden Cardiac Death in End Stage Renal Disease: Epidemiology, Risk Factors, and Management. Can. J. Cardiol.35 (9), 1228–1240. 10.1016/j.cjca.2019.05.005
41
SchefoldJ. C.FilippatosG.HasenfussG.AnkerS. D.von HaehlingS. (2016). Heart Failure and Kidney Dysfunction: Epidemiology, Mechanisms and Management. Nat. Rev. Nephrol.12 (10), 610–623. 10.1038/nrneph.2016.113
42
SunC. Y.SungJ. M.WangJ. D.LiC. Y.KuoY. T.LeeC. C.et al (2019). A Comparison of the Risk of Congestive Heart Failure-Related Hospitalizations in Patients Receiving Hemodialysis and Peritoneal Dialysis - A Retrospective Propensity Score-Matched Study. PLoS One14 (10). 10.1371/journal.pone.0223336
43
Van BurenP. N. (2016). Evaluation and Treatment of Hypertension in End-Stage Renal Disease Patients on Hemodialysis. Curr. Cardiol. Rep.18 (12), 125. 10.1007/s11886-016-0805-y
44
Van LinthoutS.TschopeC. (2017). Inflammation - Cause or Consequence of Heart Failure or Both?Curr. Heart Fail. Rep.14 (4), 251–265. 10.1007/s11897-017-0337-9
45
VigenR.MaddoxT. M.AllenL. A. (2012). Aging of the United States Population: Impact on Heart Failure. Curr. Heart Fail. Rep.9 (4), 369–374. 10.1007/s11897-012-0114-8
46
ViraniS. S.AlonsoA.AparicioH. J.BenjaminE. J.BittencourtM. S.CallawayC. W.et al (2021). Heart Disease and Stroke Statistics-2021 Update: A Report from the American Heart Association. Circulation143 (8), e254–e743. 10.1161/CIR.0000000000000950
47
VoorsA. A.OuwerkerkW.ZannadF.van VeldhuisenD. J.SamaniN. J.PonikowskiP.et al (2017). Development and Validation of Multivariable Models to Predict Mortality and Hospitalization in Patients with Heart Failure. Eur. J. Heart Fail.19 (5), 627–634. 10.1002/ejhf.785
48
WangA. Y.SandersonJ. E. (2011). Current Perspectives on Diagnosis of Heart Failure in Long-Term Dialysis Patients. Am. J. Kidney Dis.57 (2), 308–319. 10.1053/j.ajkd.2010.07.019
49
YaoZ.ZhengZ.KeW.WangR.MuX.SunF.et al (2019). Prognostic Nomogram for Bladder Cancer with Brain Metastases: a National Cancer Database Analysis. J. Transl Med.17 (1), 411. 10.1186/s12967-019-2109-7
50
YuX.ZhouJ.ZhaoM.YiC.DuanQ.ZhouW.et al (2020). Exploiting XG Boost for Predicting Enhancer-Promoter Interactions. Curr. Bioinform15 (9), 1036–1045. 10.2174/1574893615666200120103948
Summary
Keywords
machine learning, extreme gradient boosting, heart failure prediction, hemodialysis, risk factors
Citation
Wang Y, Miao X, Xiao G, Huang C, Sun J, Wang Y, Li P and You X (2022) Clinical Prediction of Heart Failure in Hemodialysis Patients: Based on the Extreme Gradient Boosting Method. Front. Genet. 13:889378. doi: 10.3389/fgene.2022.889378
Received
04 March 2022
Accepted
15 March 2022
Published
26 April 2022
Volume
13 - 2022
Edited by
Quan Zou, University of Electronic Science and Technology of China, China
Reviewed by
Leimin Wang, China University of Geosciences Wuhan, China
Yansen Su, Anhui University, China
Updates
Copyright
© 2022 Wang, Miao, Xiao, Huang, Sun, Wang, Li and You.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Panlong Li, 2019021@zzuli.edu.cn; Xu You, maja_1983@126.com
†These authors have contributed equally to this work and share first authorship
This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.