- 1Zunyi Fourth People’s Hospital, Zunyi, Guizhou, China
- 2Bojishan Hospital, Jinan, Shandong, China
Background: The risk-benefit balance of early metoprolol administration in critically ill patients with congestive heart failure (CHF) admitted to the intensive care unit (ICU) remains controversial. However, high-quality causality clinical evidence is still lacking. This study utilized real-world data and causal forest algorithm to investigate the average treatment effect (ATE) of early metoprolol use on 30-day mortality in ICU patients with CHF. We also explored the heterogeneity of individual treatment effects (ITE) to identify clinical characteristics across different treatment-effect populations.
Method: Study population and their clinical characteristics were obtained from the MIMIC-IV database (version 3.1). Propensity score matching (PSM) was performed to balance baseline confounders. Causal forest models estimated the ATE and ITE. Variable importance, conditional average treatment effect (CATE), and ITE-based quartile analysis were performed to identify heterogeneity and clinical characteristics of patients.
Results: A total of 5,758 patients were included in the initial cohort, with 3,934 patients entering the final analysis after PSM. The causal forest model indicated that early metoprolol use increased the risk of 30-day ICU mortality by 2% compared with no use group (ATE = 0.02, 95% CI: 0.0004–0.0396, P = 0.045). APSIII, age, and SOFA were the top three variables contributing to the ATE. The causal forest model estimated the conditional average treatment effects for every variable. Individual treatment effects exhibited a right-skewed distribution. The highest ITE percentile was associated with APSIII >64, age >78 years, SOFA >8, WBC >13.95 × 109/L, and BUN >43.06 mg/dL.
Conclusion: Early metoprolol use within 24 h of ICU admission in CHF patients is associated with increased 30-day mortality. Therapeutic effects were heterogeneous, with the adverse effects being particularly pronounced in patients with high APSIII, advanced age, elevated SOFA, high white blood cell counts, and elevated blood urea nitrogen levels. These findings suggest careful consideration should be given to the early use of metoprolol in high-risk patients.
1 Introduction
Congestive heart failure (CHF), characterized by impaired cardiac pumping function or association with elevated ventricular filling pressures, is one of the most prevalent cardiac diseases among the elderly (Benjamin et al., 2019; AHA/ACC/HFSAACC/AHA Joint Committee Members, 2022). CHF affects over 26 million individuals globally, with increasing prevalence observed annually. Prevention and management of these diseases have consequently emerged as a major focus within clinical medicine and public health (Singh et al., 2024; Chioncel et al., 2019).
β-blockers (such as metoprolol, bisoprolol, and carvedilol) exert biological functions that reduce ejection fraction and improve ventricular remodeling, serving as cornerstone therapies for heart failure (Paolillo et al., 2021; Beghini et al., 2025). Numerous evidence-based medical studies indicate that long-term administration of beta-blockers suppressed an overactive sympathetic nervous system, improve ventricular remodeling, and significantly reduce mortality and readmission rates in patients with chronic heart failure (Martínez-Milla et al., 2019). The 2022 AHA guidelines for heart failure management indicate that metoprolol exerts a protective effect in heart failure with reduced ejection fraction and in cases of prolonged tachycardia combined with congestive heart failure. It is recommended as a first-line treatment (Heidenreich et al., 2022). However, the risk-benefit balance of metoprolol in critically ill patients admitted to the ICU with CHF remains controversial. The pathophysiological perspective suggests that CHF patients in the ICU often remain hemodynamically unstable and reliant on sympathetic tone for compensatory output. At this time, the negative inotropic and negative chronotropic effects of metoprolol could theoretically disrupt this compensatory mechanism, potentially leading to hypoperfusion and worsened organ function, outweighing long-term benefits (Schurtz et al., 2023).
This clinical dilemma persists due to a lack of high-quality evidence. Randomized controlled trials (RCTs) in critically ill patients confront ethical and practical limitations. Existing evidence mostly derives from observational studies, while the conclusions of the few relevant RCTs are inconsistent, making it difficult to establish reliable guidance (Zhou et al., 2025; Ibanez et al., 2025; Munkhaugen et al., 2025). Thus, decisions regarding whether to initiate and when to commence beta-blocker therapy in the ICU setting remain unresolved clinical problems (Jondeau et al., 2009; Bhagat et al., 2019).
Causal machine learning, an emerging methodology, offers a new perspective for resolving the aforementioned challenges. It utilizes counterfactual inference methods combined with the powerful computational capabilities of machine learning to perform causal inference. This approach allows for better use of large amounts of real-world data. Unlike traditional machine learning, which focuses on predicting outcome probabilities, causal machine learning concentrates on uncovering causal effects where altering treatment conditions leads to outcome changes (Feuerriegel et al., 2024). The causal forest is a classic causal machine learning algorithm, offering robust stability and interpretability, and capable of detecting data heterogeneity (Liu M. et al., 2025). It is an ideal tool for realizing precision medicine and moving research from ‘average changes’ towards ‘individualized alterations’.
In summary, this study employed real-world data and causal forest algorithms to investigate the effect of early initiation of metoprolol administration on 30-day mortality among ICU patients with CHF and to identify subgroups benefiting or harmed by treatment. This study aims to address clinical questions and provide theoretical guidance for clinical practice.
2 Materials and methods
2.1 Study population
Data of study population were obtained from the MIMIC-IV database (version 3.1). The MIMIC database is a publicly available clinical database of intensive care unit (ICU) patients, jointly developed by the Massachusetts Institute of Technology and Beth Israel Deaconess Medical Center in Boston, USA, and that it contains single-center ICU patient data from this hospital (Johnson et al., 2023). The diagnostic information in the database was derived from de-identified diagnostic records in the electronic medical system of Beth Israel Deaconess Medical Center, and that all diagnoses were made by ICU physicians based on clinical conditions of patients. Author Yunzhu Chen completed the course required for database access and has been granted permission to download research data (certification number: 73859939). The MIMIC database has implemented patient privacy protection measures. Consequently, our research has been granted ethical exemption. This research was performed in compliance with the STROBE.
The inclusion criteria for this study were: (1) Congestive heart failure as the primary diagnosis for admission to the ICU (ICD-9: 428.0); (2) First-time ICU admission; (3) Age over 18 years; (4) ICU stay exceeding 24 h. Exclusion criteria: (1) Complete atrioventricular block (ICD-10: I44.2); (2) Sick sinus syndrome (ICD-10: I49.5); (3) Missing data rate of study variables exceeding 20%. Early metoprolol use was defined as metoprolol therapy administered within 24 h of ICU admission; No metoprolol use referred to the nonuse of metoprolol and other beta-blocker agents within 24 h of ICU admission. A total of 5,758 patients were enrolled in the initial study cohort by fitting the aforementioned criteria. After PSM, 3,934 patients were ultimately included in the final study cohort (Figure 1), comprising 1,967 patients in the early metoprolol use group and 1,967 in the no metoprolol use group.
2.2 Data extraction
Data of enrolled CHF patients were extracted from the MIMIC-IV database (version 3.1) using PostgreSQL (version 14.0). Research variables comprised seven sections: (1) Demographic information including age, gender, weight; (2) Comorbidities including hypertension, diabetes mellitus, myocardial infarction, liver disease, kidney disease, chronic respiratory disease, cerebrovascular disease; (3) Invasive operations including CPR, invasive vent; (4) Critical illness assessment scores: including APS III, SOFA; (5) Vital signs: including heart rate, systolic blood pressure, diastolic blood pressure, mean arterial pressure, oxygen saturation (SO2), arterial oxygen partial pressure (PO2), arterial carbon dioxide partial pressure (PCO2); (6) Pharmacological interventions: including norepinephrine, spironolactone, angiotensin converting enzyme inhibitors (ACEI), angiotensin receptor blockers (ARB); (7) Laboratory test results: including white blood cell count (WBC), blood urea nitrogen (BUN), partial thromboplastin time (PTT), blood glucose, troponin I, hemoglobin. The aforementioned indicators were first measured within 24 h of the patient’s admission to the ICU. To eliminate potential bias, subjects with variable missingness exceeding 20% would be excluded.
2.3 Clinical outcome
The clinical outcome in this study was all-cause mortality within 30 days of ICU admission.
2.4 Statistical analysis
The normality of continuous variables was assessed using the Shapiro-Wilk test. Variables that followed a normal distribution were presented as mean ± standard deviation and were compared between groups using the t-test. For non-normally distributed variables, data were expressed as median (interquartile range) and were compared using the Mann–Whitney U test. Categorical data were expressed as percentages and analyzed using chi-square tests. Missing data were imputed through the random forest algorithm. The PSM employed 1:1 optimal matching using all baseline covariates listed in Table 1. The Causal Forest algorithm was performed to estimate the ATE, CATE, variable importance to ATE, ITE and the interaction between important variables. The hyperparameters of the study model were (1) num.trees = 1,000; (2) mtry = 4; (3) sample.fraction = 0.7; (4) honesty = TRUE. To further investigate treatment heterogeneity, all patients were divided into four groups based on their ITE values. To further investigate treatment heterogeneity, all patients were divided into four groups based on their ITE values. Logistic regression analysis was employed to examine mortality risk factors in subgroups. Comparisons of key covariates between subgroups were conducted using t-tests or non-parametric tests. Statistical analysis was performed using the R software (version 4.5.1). P-value <0.05 was considered statistically significant.
3 Result
3.1 Baseline of the study population
Before PSM, the study data exhibited significant imbalance. The P-values of 20 variables were less than 0.05 in intergroup comparisons with ten variables of Standardized Mean Difference (SMD) greater than 0.1. After 1:1 optimal PSM matching, the imbalances between groups were effectively improved. There were 7 variables with intergroup comparison p-values below 0.05, with only norepinephrine and SOFA exhibiting SMDs greater than 0.1. There were seven variables with P-value less than 0.05 in intergroup comparisons. Only norepinephrine and SOFA had the SMD greater than 0.1. Moreover, the medians of the aforementioned variables were approximately similar between groups after matching. The causal forest model could effectively mitigate the impact of variable imbalance on outcomes (f), and both variables held high clinical significance. Consequently, all variables were involved into subsequent analyses after PSM (Table 1; Figure 2).
Figure 2. The line chart of absolute standardized mean difference of all variables before and after PSM.
3.2 ATE of the intervention
The causal forest model estimated the ATE of early metoprolol use on 30-day mortality. Result of ATE was 0.02 (95% CI = 0.0004–0.0396; P = 0.045), which suggested early metoprolol use resulted in an approximately 2% increase in the risk of 30-day mortality of whole final population. The number needed to harm for this intervention was 50 (NNH = 50).
3.3 Heterogeneity analysis of ATE
ITE histogram revealed a right-skewed asymmetric distribution of study population, which suggested treatment heterogeneity (Figure 3A). Variable importance analysis identified APSIII, age, SOFA, WBC, and BUN as the top five variables contributing to ATE (Figure 3B). We selected variables with ATE contribution values higher than 0.03 for conditional average treatment effect (CATE) analysis. The results were visually represented through partial dependence plots, which illustrated how ATE changed with variable values (Figure 3C). Body weight and WBC were the highest interaction effect variable pairs, with 2.1% contribution to ATE attributable to their interaction. The interaction effects of all other variable pairs were less than 2% (Figure 3D).
Figure 3. The results of ATE Heterogeneity analysis. (A) ITE distribution histogram. (B) Bar chart showing the ranking of contribution of variables to ATE. (C) Partial dependency plot of CATE for key variables. (D) Bar chart of variable interaction ranking.
3.4 Heterogeneity analysis of ITE
To further elucidate the heterogeneity of ITE, we quartile-stratified the early metoprolol use group and no metoprolol use group according to their ITE values, then merged the respective quartiles to form the final groups (Q1-Q4). In Q1 (mean ITE = −0.006), metoprolol use had an non-significant protective effect (OR = 0.85, 95% CI: 0.40–1.80, P = 0.67). In Q4 (mean ITE = 0.057), metoprolol use significantly increased mortality risk (OR = 1.45, 95% CI: 1.05–2.02, P = 0.03) (Figure 4A).
Figure 4. The results of ATE heterogeneity analysis. (A) Forest plot of logistic regression results of group Q1–Q4. (B) Forest plot of results of key variable comparisons between Q1 and Q4 groups.
We found that key variables of top 10 contribution to ATE differed significantly between Q4 and Q1 (Figure 4B). For the purpose of directly describing the characteristics of high-risk individuals, we extracted the mean values of Q4 group variables as cut-off points and employed CATE as the reference of variable range. Results indicated that patients with CHF on their first ICU admission who presented with an APS III > 64, age > 78 years, SOFA > 8, WBC > 13.95 × 109/L, and BUN > 43.06 mg/dL were at significantly increased risk of 30-day mortality when given metoprolol early.
3.5 Model sensitivity analysis
Calibration test revealed that the estimates of mean.forest.prediction and differential.forest.prediction yielded close to 1, with P-values below 0.05 (Table 2). This indicated the model was correctly established and possessed a robust capacity for identifying heterogeneity. Furthermore, we adjusted the hyperparameters of the causal forest model by setting num.trees = 2000. The ATE of refitting model was 0.02 (95% CI = 0.0004–0.0396), similar with the model presented model in our study, which suggests the model was robust.
To investigate the therapeutic window for metoprolol administration, we extended the dosing window to within 48 h and fitted a causal forest model using the same methodology. Results showed ATE = 0.005 (95% CI: −0.015–0.027; P = 0.62), with eight of the top ten variables contributing to ATE remaining identical (Supplementary Table S1). These findings indicate that the period within 24 h of ICU admission exhibits greater specificity for intervention risk regarding 30-day mortality, though high-risk patient characteristics remain broadly similar regardless of timing.
We further applied the same causal forest model to analyze the impact of early metoprolol use on in-hospital mortality, aiming to more comprehensively evaluate the short-term effects of early intervention. Results showed an ATE of 0.012 (95% CI: −0.006–0.003; P = 0.18), with the top 10 variables by ATE ranking consistent with the primary analysis (Supplementary Table S2). These sensitivity analyses indicated that the heterogeneity of treatment effects exhibits high robustness and consistency.
4 Discussion
This study utilized real-world data and causal machine learning algorithms to investigate whether early metoprolol use in ICU patients diagnosed with congestive heart failure influenced 30-day mortality. We found that in the overall study population, early metoprolol use was associated with a 2% increase in 30-day mortality (ATE = 0.02, P = 0.045, NNH = 50). This effect exhibited significant heterogeneity, with individual treatment effects showing a right-skewed distribution, which implied that for a substantial proportion of patients, the harm effect far exceeds the average 2% mortality. Our study successfully identified the variables contributing most to the ATE and high-risk patient subgroups. The marginal statistical effect was observed for the ATE (ATE = 0.02, P = 0.045). Specifically, we believe this effect could be due to several factors, including the potential influence of single-center data on statistical power, as well as the dilution of the average treatment effect by treatment heterogeneity. The top five key variables and their characteristics were: higher ICU scores (APS III > 64, SOFA > 8), older age (> 78 years), and increased markers of inflammation (WBC >13.95 × 109/L) and renal dysfunction (BUN > 43.06 mg/dL).
The study population comprised a heterogeneous cohort of patients hospitalized for acute decompensated heart failure, with baseline blood pressure resembling the intergroup mean, reflecting this diversity. However, deeper analysis revealed that patients’ hemodynamic status served as a key modifier of the early metoprolol treatment effect. Although routine vital signs may not have shown extreme abnormalities, the need for norepinephrine support and higher SOFA cardiovascular scores signaled underlying hemodynamic instability and poorer physiological reserve (Vincent et al., 1996). Our causal inference analysis indicates that it is precisely within this patient subgroup that early metoprolol therapy shows the most significant association with increased mortality risk. This suggests that metoprolol’s negative inotropic and negative chronotropic effects may expose or exacerbate their hemodynamic vulnerability. Therefore, the focus of clinical decision-making should shift to “whether the current hemodynamics and overall physiological state of this heart failure patient can tolerate such therapy.” The comprehensive high-risk features identified in this study—such as elevated SOFA scores and APS III—provide quantitative support for this assessment (Di Santo et al., 2021).
In the present stage, several clinical guidelines recommend the long-term use of beta-blockers for chronic heart failure (Ostrominski et al., 2024). Our findings indicate that early administration of metoprolol increases mortality among ICU inpatients diagnosed with congestive heart failure. This appears to contradict prevailing views, but we believe the numerous characteristics of this study may serve as a valuable supplement to the mainstream perspective.
Firstly, the study population has been expanded: classic RCT studies are constrained by ethical limitations, making it difficult to conduct research on critically ill patients in ICUs that may cause significant harm (Paddock et al., 2022). Real-world studies can fill this gap in such populations. Beyond investigating ATE, our research focuses more on the heterogeneity of characteristics in populations at risk from early metoprolol use. Patients with higher APS-III and SOFA presented with more severe conditions and poorer prognoses (Fan and Ma, 2024; Liu C. et al., 2025). Those with APS-III > 64 and SOFA > 8 might confront critical conditions such as hypotension, shock, and multiple organ injury (Ranzani et al., 2025; Özdemir et al., 2021). Initiating treatment with beta-blockers, exemplified by metoprolol, at this stage might compromise cardiac compensatory mechanisms, thereby contributing to adverse outcomes (Pollesello et al., 2025). Research indicated that elderly heart failure patients often experience worsening conditions and poor outcomes due to factors such as overlapping comorbidities, diminished organ reserve, and psychosocial vulnerability (Sen et al., 2025). The indiscriminate use of metoprolol might exacerbate these pathophysiological processes. Elevated white blood cell counts indicate systemic inflammatory responses potentially associated with worsening heart failure (Meier et al., 2020). Severe heart failure might reduce circulating blood volume, impair renal function, and increase blood urea nitrogen (BUN) levels (Tang et al., 2024). These mechanisms suggested partially explain high mortality outcomes in high-risk patient receiving metoprolol.
Secondly, statistical tools have become significantly more powerful. This study is focused on causal inference, we believe that the level of clinical evidence generated is higher than that of typical machine learning retrospective studies. For this reason, we specifically selected these validated key variables to assess their contribution to the ATE and to examine the distribution of these variables following the classification of study cases based on ITE. With advances in statistical theory and increased computational power, machine learning algorithms were applied more extensively across various types of medical research. They now possessed the capability to utilize counterfactual reasoning within given real-world data, enabling relatively accurate estimation of outcome changes resulting from treatment modifications (Richens et al., 2020). Causal machine learning algorithms could leverage their inherent capability to process complex, high-dimensional data, enabling more efficient and stable handling of diverse real-world datasets. They provided powerful tools for uncovering genuine causality (Weberpals et al., 2025). Currently, most real-world data studies examining the relationship between beta-blockers and heart failure employ retrospective analyses and outcome prediction machine learning methods (Wang et al., 2024; Wu et al., 2025). Our research provided valuable supplementary evidence at the causal level for this clinical problem.
Thirdly, the translation of research findings has become more convenient. Nowadays, the gold standard for clinical causal research is RCT, whose outcomes reflect differences in overall intervention effects (Chan et al., 2025). Interestingly, numerous cases have emerged within RCT studies where results from other studies of the same design prove inconsistent. The accumulation of heterogeneity within study populations has probably one key factor contributing to these divergent outcomes (Verstraete et al., 2023). Our research explored the heterogeneity of intervention effects concealed beneath the ATE framework. The heterogeneity of the experimental population was confirmed through a combined approach by grouped ITE values and logistic analysis. Building upon this foundation, we have also identified specific clinical characteristics of high-risk populations, reflecting the principles of precision medicine (Hartl et al., 2021). Concurrently, our study provides intuitive guidance for clinical practice.
This study employed causal machine learning models to not only estimate ATE but, more importantly, to reveal the underlying heterogeneity of effects. The robustness of this finding was strongly supported by sensitivity analyses (VanderWeele and Ding, 2017). Although the ATE of early metoprolol use on extended 48-h medication use and in-hospital mortality was consistent with the primary analysis, it did not reach statistical significance. The revealed pattern of individual treatment effect heterogeneity—i.e., the core clinical characteristics determining risk levels—highly aligned with findings from the 30-day mortality analysis. This consistency across outcomes at different time windows strongly suggested that patient subgroups defined by indicators such as APS III, age, SOFA, WBC, and BUN may universally exhibit greater vulnerability to early beta-blocker therapy within the ICU. Thus, the value of our study extended beyond assessing statistical significance for a single endpoint. It lied in providing a stable, transferable set of risk stratification characteristics that offer an evidence-based foundation for more refined treatment decisions within the highly heterogeneous ICU environment (Wu et al., 2020).
Although this study has obtained some findings, it still has certain limitations. Our data were derived from real-world database, the cases possibly exhibited biases, and some key variables (diagnosis and classification of heart failure, echocardiogram results, some types of therapeutic drugs) were incompletely recorded, meaning confounding factors could not be entirely ruled out. The findings remain of borderline statistical significance. Although the MIMIC database contains a large number of cases, it remains a single-center dataset, and our conclusions require validation in external cohorts. Furthermore, this study did not address the optimal timing for initiating beta-blocker therapy. These problems warrant further investigation in future research.
5 Conclusion
In summary, our study found that early metoprolol administration within 24 h of ICU admission for CHF diagnosed patients were associated with increased 30-day mortality, and this effect exhibited considerable heterogeneity. The adverse effect was particularly in patients with high illness severity scores, advanced age, higher WBC, and elevated BUN. These findings suggested that metoprolol should be used cautiously in critically ill heart failure patients following thorough assessment of their condition and mitigation of risk factors.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.
Ethics statement
The studies involving humans were approved by Ethics Committee of Bojishan Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because The MIMIC database has implemented patient privacy protection measures. Consequently, our research has been granted ethical exemption.
Author contributions
YC: Conceptualization, Project administration, Supervision, Writing – original draft. TL: Writing – original draft, Writing – review and editing.
Funding
The author(s) declared that financial support was not received for this work and/or its publication.
Acknowledgements
The authors would like to thank all the patients who recorded in the MIMIC database.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphar.2026.1771969/full#supplementary-material
Abbreviations
ACEI, Angiotensin Converting Enzyme Inhibitors; APSIII, Acute Physiology Score III; ARB, Angiotensin Receptor Blockers; ATE, Average Treatment Effect; BUN, Blood Urea Nitrogen; CATE, Conditional Average Treatment Effect; CHF, Congestive Heart Failure; CI, Confidence Interval; CPR, Cardiopulmonary Resuscitation; ICD, International Statistical Classification of Diseases and Related Health Problems; ICU, Intensive Care Unit; ITE, Individual Treatment Effects; MIMIC, Medical Information Mart for Intensive Care; NNH, Number Needed to Harm; PSM, Propensity Score Matching; PTT, Partial Thromboplastin Time; RCT, Randomized Controlled Trial; SMD, Standardized Mean Difference; SOFA, Sequential Organ Failure Assessment; WBC, White Blood Cell.
References
AHA/ACC/HFSAACC/AHA Joint Committee Members (2022). 2022 AHA/ACC/HFSA guideline for the management of heart failure. J. Card. Fail. 28 (5), e1–e167. doi:10.1016/j.cardfail.2022.02.010
Beghini, A., Sammartino, A. M., Papp, Z., von Haehling, S., Biegus, J., Ponikowski, P., et al. (2025). 2024 update in heart failure. ESC Heart Fail. 12 (1), 8–42. doi:10.1002/ehf2.14857
Benjamin, E. J., Muntner, P., Alonso, A., Bittencourt, M. S., Callaway, C. W., Carson, A. P., et al. (2019). Heart disease and stroke Statistics—2019 update: a report from the American heart association. Circulation 139 (10), e56–e528. doi:10.1161/CIR.0000000000000659
Bhagat, A. A., Greene, S. J., Vaduganathan, M., Fonarow, G. C., and Butler, J. (2019). Initiation, continuation, switching, and withdrawal of heart failure medical therapies during hospitalization. JACC Heart Fail. 7 (1), 1–12. doi:10.1016/j.jchf.2018.06.011
Chan, A.-W., Boutron, I., Hopewell, S., Moher, D., Schulz, K. F., Collins, G. S., et al. (2025). SPIRIT 2025 statement: updated guideline for protocols of randomised trials. BMJ 389, e081477. doi:10.1136/bmj-2024-081477
Chioncel, O., Mebazaa, A., Maggioni, A. P., Harjola, V. P., Rosano, G., Laroche, C., et al. (2019). Acute heart failure congestion and perfusion status – impact of the clinical classification on in-hospital and long-term outcomes; insights from the ESC-EORP-HFA heart failure long-term registry. Eur. J. Heart Fail. 21 (11), 1338–1352. doi:10.1002/ejhf.1492
Di Santo, P., Mathew, R., Jung, R. G., Simard, T., Skanes, S., Mao, B., et al. (2021). Impact of baseline beta-blocker use on inotrope response and clinical outcomes in cardiogenic shock: a subgroup analysis of the DOREMI trial. Crit. Care 25 (1), 289. doi:10.1186/s13054-021-03706-2
Fan, S., and Ma, J. (2024). The value of five scoring systems in predicting the prognosis of patients with sepsis-associated acute respiratory failure. Sci. Rep. 14 (1), 4760. doi:10.1038/s41598-024-55257-5
Feuerriegel, S., Frauen, D., Melnychuk, V., Schweisthal, J., Hess, K., Curth, A., et al. (2024). Causal machine learning for predicting treatment outcomes. Nat. Med. 30 (4), 958–968. doi:10.1038/s41591-024-02902-1
Hartl, D., de Luca, V., Kostikova, A., Laramie, J., Kennedy, S., Ferrero, E., et al. (2021). Translational precision medicine: an industry perspective. J. Transl. Med. 19 (1), 245. doi:10.1186/s12967-021-02910-6
Heidenreich, P. A., Bozkurt, B., Aguilar, D., Allen, L. A., Byun, J. J., Colvin, M. M., et al. (2022). 2022 AHA/ACC/HFSA guideline for the management of heart failure: a report of the American college of cardiology/american heart association joint committee on clinical practice guidelines. Circulation 145 (18), e895–e1032. doi:10.1161/CIR.0000000000001063
Ibanez, B., Latini, R., Rossello, X., Dominguez-Rodriguez, A., Fernández-Vazquez, F., Pelizzoni, V., et al. (2025). Beta-blockers after myocardial infarction without reduced ejection fraction. N. Engl. J. Med. 393 (19), 1889–1900. doi:10.1056/NEJMoa2504735
Johnson, A. E. W., Bulgarelli, L., Shen, L., Gayles, A., Shammout, A., Horng, S., et al. (2023). MIMIC-IV, a freely accessible electronic health record dataset. Sci. Data 10 (1), 1. doi:10.1038/s41597-022-01899-x
Jondeau, G., Neuder, Y., Eicher, J.-C., Jourdain, P., Fauveau, E., Galinier, M., et al. (2009). B-CONVINCED: beta-Blocker CONtinuation vs. INterruption in patients with congestive heart failure hospitalizED for a decompensation episode. Eur. Heart J. 30 (18), 2186–2192. doi:10.1093/eurheartj/ehp323
Liu, M., Xu, Z., He, L., Xu, X., Cai, X., Guo, Y., et al. (2025a). Heterogeneous effects of obesity on all-cause mortality: a causal forest analysis of hospitalized patients in China. Diabetes Obes. Metab. 27 (7), 3695–3704. doi:10.1111/dom.16391
Liu, C., Wang, H., Liu, C., and Cao, M. (2025b). The predictive value of SOFA and APSIII scores for 28-day mortality risk in SIMI: a cohort study based on the MIMIC-IV database. Front. Cell Infect. Microbiol. 15, 1574625. doi:10.3389/fcimb.2025.1574625
Martínez-Milla, J., Raposeiras-Roubín, S., Pascual-Figal, D. A., and Ibáñez, B. (2019). Role of beta-blockers in cardiovascular disease in 2019. Rev. Española Cardiol. 72 (10), 844–852. doi:10.1016/j.rec.2019.04.014
Meier, S., Henkens, M., Heymans, S., and Robinson, E. L. (2020). Unlocking the value of white blood cells for heart failure diagnosis. J. Cardiovasc. Transl. Res. 14 (1), 53–62. doi:10.1007/s12265-020-10007-6
Munkhaugen, J., Kristensen, A. M. D., Halvorsen, S., Holmager, T., Olsen, M. H., Bakken, A., et al. (2025). Beta-blockers after myocardial infarction in patients without heart failure. N. Engl. J. Med. 393 (19), 1901–1911. doi:10.1056/NEJMoa2505985
Ostrominski, J. W., DeFilippis, E. M., Bansal, K., Riello, R. J., 3rd, Bozkurt, B., Heidenreich, P. A., et al. (2024). Contemporary American and European guidelines for heart failure management: JACC: heart failure guideline comparison. JACC Heart Fail. 12 (5), 810–825. doi:10.1016/j.jchf.2024.02.020
Özdemir, S., Akça, H., Algın, A., Altunok, İ., and Eroğlu, S. E. (2021). Effectiveness of the rapid emergency medicine score and the rapid acute physiology score in prognosticating mortality in patients presenting to the emergency department with COVID-19 symptoms. Am. J. Emerg. Med. 49, 259–264. doi:10.1016/j.ajem.2021.06.020
Paddock, K., Woolfall, K., Kearney, A., Pattison, N., Frith, L., Gamble, C., et al. (2022). Learning from stakeholders to inform good practice guidance on consent to research in intensive care units: a mixed-methods study. BMJ Open 12 (11), e066149. doi:10.1136/bmjopen-2022-066149
Paolillo, S., Dell'Aversana, S., Esposito, I., Poccia, A., and Perrone Filardi, P. (2021). The use of β-blockers in patients with heart failure and comorbidities: doubts, certainties and unsolved issues. Eur. J. Intern. Med. 88, 9–14. doi:10.1016/j.ejim.2021.03.035
Pollesello, P., Levijoki, J., and Papp, Z. (2025). Chronotropic effects of milrinone in a Guinea pig Ex Vivo model: a pilot study to screen for new mechanisms of action. J. Cardiovasc Pharmacol. 85 (4), 278–286. doi:10.1097/FJC.0000000000001675
Ranzani, O. T., Singer, M., Salluh, J. I. F., Shankar-Hari, M., Pilcher, D., Berger-Estilita, J., et al. (2025). Development and validation of the sequential organ failure assessment (SOFA)-2 score. JAMA. doi:10.1001/jama.2025.20516
Richens, J. G., Lee, C. M., and Johri, S. (2020). Improving the accuracy of medical diagnosis with causal machine learning. Nat. Commun. 11 (1), 3923. doi:10.1038/s41467-020-17419-7
Schurtz, G., Mewton, N., Lemesle, G., Delmas, C., Levy, B., Puymirat, E., et al. (2023). Beta-blocker management in patients admitted for acute heart failure and reduced ejection fraction: a review and expert consensus opinion. Front. Cardiovasc. Med. 10, 1263482. doi:10.3389/fcvm.2023.1263482
Sen, I., Trzaskalski, N. A., Hsiao, Y. T., Liu, P. P., Shimizu, I., and Derumeaux, G. A. (2025). Aging at the crossroads of organ interactions: implications for the heart. Circ. Res. 136 (11), 1286–1305. doi:10.1161/CIRCRESAHA.125.325637
Singh, M. S., Thongam, K., Choudhary, P., and Bhagat, P. K. (2024). An integrated machine learning approach for congestive heart failure prediction. Diagnostics 14 (7), 736. doi:10.3390/diagnostics14070736
Tang, W. H. W., Bakitas, M. A., Cheng, X. S., Fang, J. C., Fedson, S. E., Fiedler, A. G., et al. (2024). Evaluation and management of kidney dysfunction in advanced heart failure: a scientific statement from the American heart association. Circulation 150 (16), e280–e295. doi:10.1161/CIR.0000000000001273
VanderWeele, T. J., and Ding, P. (2017). Sensitivity analysis in observational research: introducing the E-Value. Ann. Intern. Med. 167 (4), 268–274. doi:10.7326/M16-2607
Verstraete, K., Gyselinck, I., Huts, H., Das, N., Topalovic, M., De Vos, M., et al. (2023). Estimating individual treatment effects on COPD exacerbations by causal machine learning on randomised controlled trials. Thorax 78 (10), 983–989. doi:10.1136/thorax-2022-219382
Vincent, J. L., Moreno, R., Takala, J., Willatts, S., De Mendonça, A., Bruining, H., et al. (1996). The SOFA (Sepsis-related organ failure assessment) score to describe organ dysfunction/failure. On behalf of the working group on sepsis-related problems of the European society of intensive care medicine. Intensive Care Med. 22 (7), 707–710. doi:10.1007/BF01709751
Wang, X., Zhang, Y., Xia, J., Xu, H., Zhang, L., Feng, N., et al. (2024). Impact of β-blockers on in-hospital mortality in patients with heart failure: a retrospective propensity-score matched analysis based on MIMIC-IV database. Front. Pharmacol. 15, 1448015. doi:10.3389/fphar.2024.1448015
Weberpals, J., Feuerriegel, S., Schaar, M., and Kehl, K. L. (2025). Opportunities for causal machine learning in precision oncology. NEJM AI 2 (8), AIp2500277. doi:10.1056/AIp2500277
Wu, C. S., Luedtke, A. R., Sadikova, E., Tsai, H. J., Liao, S. C., Liu, C. C., et al. (2020). Development and validation of a machine learning individualized treatment rule in first-episode schizophrenia. JAMA Netw. Open 3 (2), e1921660. doi:10.1001/jamanetworkopen.2019.21660
Wu, Z.-F., Tao, S.-X., Su, W.-T., Chen, S., Xu, B.-D., Zong, G.-J., et al. (2025). Development and validation of a novel coronary artery disease risk prediction model. J. Transl. Med. 23 (1), 41. doi:10.1186/s12967-024-05789-1
Keywords: causal forest, causal machine learning, congestive heart failure, metoprolol, treatment heterogeneity
Citation: Chen Y and Li T (2026) Early metoprolol use in ICU patients with congestive heart failure is associated with increased 30-day mortality: a causal machine learning study. Front. Pharmacol. 17:1771969. doi: 10.3389/fphar.2026.1771969
Received: 20 December 2025; Accepted: 27 January 2026;
Published: 09 February 2026.
Edited by:
Mohamed Hassan Elnaem, Ulster University, United KingdomReviewed by:
Zeming Zhou, Capital Medical University, ChinaManuel Mallol, University of Chile, Chile
Copyright © 2026 Chen and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Tianyou Li, eW91dGhfbHR5QDEyNi5jb20=
Yunzhu Chen1