Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Pharmacol., 09 February 2026

Sec. Pharmacoepidemiology

Volume 17 - 2026 | https://doi.org/10.3389/fphar.2026.1771969

This article is part of the Research TopicAdvancing Medication Safety in the Real World: Multidisciplinary Strategies for Preventing Harm and Improving OutcomesView all 4 articles

Early metoprolol use in ICU patients with congestive heart failure is associated with increased 30-day mortality: a causal machine learning study

Yunzhu ChenYunzhu Chen1Tianyou Li
Tianyou Li2*
  • 1Zunyi Fourth People’s Hospital, Zunyi, Guizhou, China
  • 2Bojishan Hospital, Jinan, Shandong, China

Background: The risk-benefit balance of early metoprolol administration in critically ill patients with congestive heart failure (CHF) admitted to the intensive care unit (ICU) remains controversial. However, high-quality causality clinical evidence is still lacking. This study utilized real-world data and causal forest algorithm to investigate the average treatment effect (ATE) of early metoprolol use on 30-day mortality in ICU patients with CHF. We also explored the heterogeneity of individual treatment effects (ITE) to identify clinical characteristics across different treatment-effect populations.

Method: Study population and their clinical characteristics were obtained from the MIMIC-IV database (version 3.1). Propensity score matching (PSM) was performed to balance baseline confounders. Causal forest models estimated the ATE and ITE. Variable importance, conditional average treatment effect (CATE), and ITE-based quartile analysis were performed to identify heterogeneity and clinical characteristics of patients.

Results: A total of 5,758 patients were included in the initial cohort, with 3,934 patients entering the final analysis after PSM. The causal forest model indicated that early metoprolol use increased the risk of 30-day ICU mortality by 2% compared with no use group (ATE = 0.02, 95% CI: 0.0004–0.0396, P = 0.045). APSIII, age, and SOFA were the top three variables contributing to the ATE. The causal forest model estimated the conditional average treatment effects for every variable. Individual treatment effects exhibited a right-skewed distribution. The highest ITE percentile was associated with APSIII >64, age >78 years, SOFA >8, WBC >13.95 × 109/L, and BUN >43.06 mg/dL.

Conclusion: Early metoprolol use within 24 h of ICU admission in CHF patients is associated with increased 30-day mortality. Therapeutic effects were heterogeneous, with the adverse effects being particularly pronounced in patients with high APSIII, advanced age, elevated SOFA, high white blood cell counts, and elevated blood urea nitrogen levels. These findings suggest careful consideration should be given to the early use of metoprolol in high-risk patients.

1 Introduction

Congestive heart failure (CHF), characterized by impaired cardiac pumping function or association with elevated ventricular filling pressures, is one of the most prevalent cardiac diseases among the elderly (Benjamin et al., 2019; AHA/ACC/HFSAACC/AHA Joint Committee Members, 2022). CHF affects over 26 million individuals globally, with increasing prevalence observed annually. Prevention and management of these diseases have consequently emerged as a major focus within clinical medicine and public health (Singh et al., 2024; Chioncel et al., 2019).

β-blockers (such as metoprolol, bisoprolol, and carvedilol) exert biological functions that reduce ejection fraction and improve ventricular remodeling, serving as cornerstone therapies for heart failure (Paolillo et al., 2021; Beghini et al., 2025). Numerous evidence-based medical studies indicate that long-term administration of beta-blockers suppressed an overactive sympathetic nervous system, improve ventricular remodeling, and significantly reduce mortality and readmission rates in patients with chronic heart failure (Martínez-Milla et al., 2019). The 2022 AHA guidelines for heart failure management indicate that metoprolol exerts a protective effect in heart failure with reduced ejection fraction and in cases of prolonged tachycardia combined with congestive heart failure. It is recommended as a first-line treatment (Heidenreich et al., 2022). However, the risk-benefit balance of metoprolol in critically ill patients admitted to the ICU with CHF remains controversial. The pathophysiological perspective suggests that CHF patients in the ICU often remain hemodynamically unstable and reliant on sympathetic tone for compensatory output. At this time, the negative inotropic and negative chronotropic effects of metoprolol could theoretically disrupt this compensatory mechanism, potentially leading to hypoperfusion and worsened organ function, outweighing long-term benefits (Schurtz et al., 2023).

This clinical dilemma persists due to a lack of high-quality evidence. Randomized controlled trials (RCTs) in critically ill patients confront ethical and practical limitations. Existing evidence mostly derives from observational studies, while the conclusions of the few relevant RCTs are inconsistent, making it difficult to establish reliable guidance (Zhou et al., 2025; Ibanez et al., 2025; Munkhaugen et al., 2025). Thus, decisions regarding whether to initiate and when to commence beta-blocker therapy in the ICU setting remain unresolved clinical problems (Jondeau et al., 2009; Bhagat et al., 2019).

Causal machine learning, an emerging methodology, offers a new perspective for resolving the aforementioned challenges. It utilizes counterfactual inference methods combined with the powerful computational capabilities of machine learning to perform causal inference. This approach allows for better use of large amounts of real-world data. Unlike traditional machine learning, which focuses on predicting outcome probabilities, causal machine learning concentrates on uncovering causal effects where altering treatment conditions leads to outcome changes (Feuerriegel et al., 2024). The causal forest is a classic causal machine learning algorithm, offering robust stability and interpretability, and capable of detecting data heterogeneity (Liu M. et al., 2025). It is an ideal tool for realizing precision medicine and moving research from ‘average changes’ towards ‘individualized alterations’.

In summary, this study employed real-world data and causal forest algorithms to investigate the effect of early initiation of metoprolol administration on 30-day mortality among ICU patients with CHF and to identify subgroups benefiting or harmed by treatment. This study aims to address clinical questions and provide theoretical guidance for clinical practice.

2 Materials and methods

2.1 Study population

Data of study population were obtained from the MIMIC-IV database (version 3.1). The MIMIC database is a publicly available clinical database of intensive care unit (ICU) patients, jointly developed by the Massachusetts Institute of Technology and Beth Israel Deaconess Medical Center in Boston, USA, and that it contains single-center ICU patient data from this hospital (Johnson et al., 2023). The diagnostic information in the database was derived from de-identified diagnostic records in the electronic medical system of Beth Israel Deaconess Medical Center, and that all diagnoses were made by ICU physicians based on clinical conditions of patients. Author Yunzhu Chen completed the course required for database access and has been granted permission to download research data (certification number: 73859939). The MIMIC database has implemented patient privacy protection measures. Consequently, our research has been granted ethical exemption. This research was performed in compliance with the STROBE.

The inclusion criteria for this study were: (1) Congestive heart failure as the primary diagnosis for admission to the ICU (ICD-9: 428.0); (2) First-time ICU admission; (3) Age over 18 years; (4) ICU stay exceeding 24 h. Exclusion criteria: (1) Complete atrioventricular block (ICD-10: I44.2); (2) Sick sinus syndrome (ICD-10: I49.5); (3) Missing data rate of study variables exceeding 20%. Early metoprolol use was defined as metoprolol therapy administered within 24 h of ICU admission; No metoprolol use referred to the nonuse of metoprolol and other beta-blocker agents within 24 h of ICU admission. A total of 5,758 patients were enrolled in the initial study cohort by fitting the aforementioned criteria. After PSM, 3,934 patients were ultimately included in the final study cohort (Figure 1), comprising 1,967 patients in the early metoprolol use group and 1,967 in the no metoprolol use group.

Figure 1
Flowchart illustrating selection of patients from the MIMIC-IV database based on specific inclusion and exclusion criteria, with propensity score matching leading to final groups of early metoprolol use and no metoprolol use, each with one thousand nine hundred sixty-seven patients.

Figure 1. Flowchart for patient enrolment in the study.

2.2 Data extraction

Data of enrolled CHF patients were extracted from the MIMIC-IV database (version 3.1) using PostgreSQL (version 14.0). Research variables comprised seven sections: (1) Demographic information including age, gender, weight; (2) Comorbidities including hypertension, diabetes mellitus, myocardial infarction, liver disease, kidney disease, chronic respiratory disease, cerebrovascular disease; (3) Invasive operations including CPR, invasive vent; (4) Critical illness assessment scores: including APS III, SOFA; (5) Vital signs: including heart rate, systolic blood pressure, diastolic blood pressure, mean arterial pressure, oxygen saturation (SO2), arterial oxygen partial pressure (PO2), arterial carbon dioxide partial pressure (PCO2); (6) Pharmacological interventions: including norepinephrine, spironolactone, angiotensin converting enzyme inhibitors (ACEI), angiotensin receptor blockers (ARB); (7) Laboratory test results: including white blood cell count (WBC), blood urea nitrogen (BUN), partial thromboplastin time (PTT), blood glucose, troponin I, hemoglobin. The aforementioned indicators were first measured within 24 h of the patient’s admission to the ICU. To eliminate potential bias, subjects with variable missingness exceeding 20% would be excluded.

2.3 Clinical outcome

The clinical outcome in this study was all-cause mortality within 30 days of ICU admission.

2.4 Statistical analysis

The normality of continuous variables was assessed using the Shapiro-Wilk test. Variables that followed a normal distribution were presented as mean ± standard deviation and were compared between groups using the t-test. For non-normally distributed variables, data were expressed as median (interquartile range) and were compared using the Mann–Whitney U test. Categorical data were expressed as percentages and analyzed using chi-square tests. Missing data were imputed through the random forest algorithm. The PSM employed 1:1 optimal matching using all baseline covariates listed in Table 1. The Causal Forest algorithm was performed to estimate the ATE, CATE, variable importance to ATE, ITE and the interaction between important variables. The hyperparameters of the study model were (1) num.trees = 1,000; (2) mtry = 4; (3) sample.fraction = 0.7; (4) honesty = TRUE. To further investigate treatment heterogeneity, all patients were divided into four groups based on their ITE values. To further investigate treatment heterogeneity, all patients were divided into four groups based on their ITE values. Logistic regression analysis was employed to examine mortality risk factors in subgroups. Comparisons of key covariates between subgroups were conducted using t-tests or non-parametric tests. Statistical analysis was performed using the R software (version 4.5.1). P-value <0.05 was considered statistically significant.

Table 1
www.frontiersin.org

Table 1. Baseline characteristic of study population before and after PSM.

3 Result

3.1 Baseline of the study population

Before PSM, the study data exhibited significant imbalance. The P-values of 20 variables were less than 0.05 in intergroup comparisons with ten variables of Standardized Mean Difference (SMD) greater than 0.1. After 1:1 optimal PSM matching, the imbalances between groups were effectively improved. There were 7 variables with intergroup comparison p-values below 0.05, with only norepinephrine and SOFA exhibiting SMDs greater than 0.1. There were seven variables with P-value less than 0.05 in intergroup comparisons. Only norepinephrine and SOFA had the SMD greater than 0.1. Moreover, the medians of the aforementioned variables were approximately similar between groups after matching. The causal forest model could effectively mitigate the impact of variable imbalance on outcomes (f), and both variables held high clinical significance. Consequently, all variables were involved into subsequent analyses after PSM (Table 1; Figure 2).

Figure 2
Dot and line plot showing absolute mean differences for multiple covariates before (green circles, unadjusted) and after (blue triangles, adjusted) matching, with most adjusted values close to zero along the y-axis of variable names.

Figure 2. The line chart of absolute standardized mean difference of all variables before and after PSM.

3.2 ATE of the intervention

The causal forest model estimated the ATE of early metoprolol use on 30-day mortality. Result of ATE was 0.02 (95% CI = 0.0004–0.0396; P = 0.045), which suggested early metoprolol use resulted in an approximately 2% increase in the risk of 30-day mortality of whole final population. The number needed to harm for this intervention was 50 (NNH = 50).

3.3 Heterogeneity analysis of ATE

ITE histogram revealed a right-skewed asymmetric distribution of study population, which suggested treatment heterogeneity (Figure 3A). Variable importance analysis identified APSIII, age, SOFA, WBC, and BUN as the top five variables contributing to ATE (Figure 3B). We selected variables with ATE contribution values higher than 0.03 for conditional average treatment effect (CATE) analysis. The results were visually represented through partial dependence plots, which illustrated how ATE changed with variable values (Figure 3C). Body weight and WBC were the highest interaction effect variable pairs, with 2.1% contribution to ATE attributable to their interaction. The interaction effects of all other variable pairs were less than 2% (Figure 3D).

Figure 3
Figure composed of four panels labeled A, B, C, and D. Panel A shows a histogram with the x-axis labeled ITE and the y-axis labeled COUNT, displaying a normal distribution centered around zero. Panel B presents a horizontal bar chart ranking medical features by ATE value, with bars shaded from blue to red to represent increasing ATE, and APSIII and Age ranking highest. Panel C contains twelve line plots, each titled with a clinical variable such as APSIII, Age, SOFA, WBC, and BUN, displaying the relationship between the variable and treatment effect. Panel D shows a horizontal bar chart of variable pairs by ATE value, with bars color-coded from blue to red, and Weight-WBC having the highest value.

Figure 3. The results of ATE Heterogeneity analysis. (A) ITE distribution histogram. (B) Bar chart showing the ranking of contribution of variables to ATE. (C) Partial dependency plot of CATE for key variables. (D) Bar chart of variable interaction ranking.

3.4 Heterogeneity analysis of ITE

To further elucidate the heterogeneity of ITE, we quartile-stratified the early metoprolol use group and no metoprolol use group according to their ITE values, then merged the respective quartiles to form the final groups (Q1-Q4). In Q1 (mean ITE = −0.006), metoprolol use had an non-significant protective effect (OR = 0.85, 95% CI: 0.40–1.80, P = 0.67). In Q4 (mean ITE = 0.057), metoprolol use significantly increased mortality risk (OR = 1.45, 95% CI: 1.05–2.02, P = 0.03) (Figure 4A).

Figure 4
Panel A presents a forest plot comparing groups Q1 to Q4 by mean ITE, odds ratio with ninety-five percent confidence interval, and p-value. Q4 is statistically significant with p=0.03. Panel B shows two grouped dot plots with error bars, comparing clinical and laboratory characteristics between Q1 and Q4, with all variables showing significant p-values less than zero point zero zero one. Variables include SOFA, WBC, hemoglobin, troponin, APSIII, age, BUN, weight, diastolic and systolic blood pressure, and heart rate. Blue dots represent Q1 and red dots represent Q4.

Figure 4. The results of ATE heterogeneity analysis. (A) Forest plot of logistic regression results of group Q1–Q4. (B) Forest plot of results of key variable comparisons between Q1 and Q4 groups.

We found that key variables of top 10 contribution to ATE differed significantly between Q4 and Q1 (Figure 4B). For the purpose of directly describing the characteristics of high-risk individuals, we extracted the mean values of Q4 group variables as cut-off points and employed CATE as the reference of variable range. Results indicated that patients with CHF on their first ICU admission who presented with an APS III > 64, age > 78 years, SOFA > 8, WBC > 13.95 × 109/L, and BUN > 43.06 mg/dL were at significantly increased risk of 30-day mortality when given metoprolol early.

3.5 Model sensitivity analysis

Calibration test revealed that the estimates of mean.forest.prediction and differential.forest.prediction yielded close to 1, with P-values below 0.05 (Table 2). This indicated the model was correctly established and possessed a robust capacity for identifying heterogeneity. Furthermore, we adjusted the hyperparameters of the causal forest model by setting num.trees = 2000. The ATE of refitting model was 0.02 (95% CI = 0.0004–0.0396), similar with the model presented model in our study, which suggests the model was robust.

Table 2
www.frontiersin.org

Table 2. Results of the calibration test of causal forest model.

To investigate the therapeutic window for metoprolol administration, we extended the dosing window to within 48 h and fitted a causal forest model using the same methodology. Results showed ATE = 0.005 (95% CI: −0.015–0.027; P = 0.62), with eight of the top ten variables contributing to ATE remaining identical (Supplementary Table S1). These findings indicate that the period within 24 h of ICU admission exhibits greater specificity for intervention risk regarding 30-day mortality, though high-risk patient characteristics remain broadly similar regardless of timing.

We further applied the same causal forest model to analyze the impact of early metoprolol use on in-hospital mortality, aiming to more comprehensively evaluate the short-term effects of early intervention. Results showed an ATE of 0.012 (95% CI: −0.006–0.003; P = 0.18), with the top 10 variables by ATE ranking consistent with the primary analysis (Supplementary Table S2). These sensitivity analyses indicated that the heterogeneity of treatment effects exhibits high robustness and consistency.

4 Discussion

This study utilized real-world data and causal machine learning algorithms to investigate whether early metoprolol use in ICU patients diagnosed with congestive heart failure influenced 30-day mortality. We found that in the overall study population, early metoprolol use was associated with a 2% increase in 30-day mortality (ATE = 0.02, P = 0.045, NNH = 50). This effect exhibited significant heterogeneity, with individual treatment effects showing a right-skewed distribution, which implied that for a substantial proportion of patients, the harm effect far exceeds the average 2% mortality. Our study successfully identified the variables contributing most to the ATE and high-risk patient subgroups. The marginal statistical effect was observed for the ATE (ATE = 0.02, P = 0.045). Specifically, we believe this effect could be due to several factors, including the potential influence of single-center data on statistical power, as well as the dilution of the average treatment effect by treatment heterogeneity. The top five key variables and their characteristics were: higher ICU scores (APS III > 64, SOFA > 8), older age (> 78 years), and increased markers of inflammation (WBC >13.95 × 109/L) and renal dysfunction (BUN > 43.06 mg/dL).

The study population comprised a heterogeneous cohort of patients hospitalized for acute decompensated heart failure, with baseline blood pressure resembling the intergroup mean, reflecting this diversity. However, deeper analysis revealed that patients’ hemodynamic status served as a key modifier of the early metoprolol treatment effect. Although routine vital signs may not have shown extreme abnormalities, the need for norepinephrine support and higher SOFA cardiovascular scores signaled underlying hemodynamic instability and poorer physiological reserve (Vincent et al., 1996). Our causal inference analysis indicates that it is precisely within this patient subgroup that early metoprolol therapy shows the most significant association with increased mortality risk. This suggests that metoprolol’s negative inotropic and negative chronotropic effects may expose or exacerbate their hemodynamic vulnerability. Therefore, the focus of clinical decision-making should shift to “whether the current hemodynamics and overall physiological state of this heart failure patient can tolerate such therapy.” The comprehensive high-risk features identified in this study—such as elevated SOFA scores and APS III—provide quantitative support for this assessment (Di Santo et al., 2021).

In the present stage, several clinical guidelines recommend the long-term use of beta-blockers for chronic heart failure (Ostrominski et al., 2024). Our findings indicate that early administration of metoprolol increases mortality among ICU inpatients diagnosed with congestive heart failure. This appears to contradict prevailing views, but we believe the numerous characteristics of this study may serve as a valuable supplement to the mainstream perspective.

Firstly, the study population has been expanded: classic RCT studies are constrained by ethical limitations, making it difficult to conduct research on critically ill patients in ICUs that may cause significant harm (Paddock et al., 2022). Real-world studies can fill this gap in such populations. Beyond investigating ATE, our research focuses more on the heterogeneity of characteristics in populations at risk from early metoprolol use. Patients with higher APS-III and SOFA presented with more severe conditions and poorer prognoses (Fan and Ma, 2024; Liu C. et al., 2025). Those with APS-III > 64 and SOFA > 8 might confront critical conditions such as hypotension, shock, and multiple organ injury (Ranzani et al., 2025; Özdemir et al., 2021). Initiating treatment with beta-blockers, exemplified by metoprolol, at this stage might compromise cardiac compensatory mechanisms, thereby contributing to adverse outcomes (Pollesello et al., 2025). Research indicated that elderly heart failure patients often experience worsening conditions and poor outcomes due to factors such as overlapping comorbidities, diminished organ reserve, and psychosocial vulnerability (Sen et al., 2025). The indiscriminate use of metoprolol might exacerbate these pathophysiological processes. Elevated white blood cell counts indicate systemic inflammatory responses potentially associated with worsening heart failure (Meier et al., 2020). Severe heart failure might reduce circulating blood volume, impair renal function, and increase blood urea nitrogen (BUN) levels (Tang et al., 2024). These mechanisms suggested partially explain high mortality outcomes in high-risk patient receiving metoprolol.

Secondly, statistical tools have become significantly more powerful. This study is focused on causal inference, we believe that the level of clinical evidence generated is higher than that of typical machine learning retrospective studies. For this reason, we specifically selected these validated key variables to assess their contribution to the ATE and to examine the distribution of these variables following the classification of study cases based on ITE. With advances in statistical theory and increased computational power, machine learning algorithms were applied more extensively across various types of medical research. They now possessed the capability to utilize counterfactual reasoning within given real-world data, enabling relatively accurate estimation of outcome changes resulting from treatment modifications (Richens et al., 2020). Causal machine learning algorithms could leverage their inherent capability to process complex, high-dimensional data, enabling more efficient and stable handling of diverse real-world datasets. They provided powerful tools for uncovering genuine causality (Weberpals et al., 2025). Currently, most real-world data studies examining the relationship between beta-blockers and heart failure employ retrospective analyses and outcome prediction machine learning methods (Wang et al., 2024; Wu et al., 2025). Our research provided valuable supplementary evidence at the causal level for this clinical problem.

Thirdly, the translation of research findings has become more convenient. Nowadays, the gold standard for clinical causal research is RCT, whose outcomes reflect differences in overall intervention effects (Chan et al., 2025). Interestingly, numerous cases have emerged within RCT studies where results from other studies of the same design prove inconsistent. The accumulation of heterogeneity within study populations has probably one key factor contributing to these divergent outcomes (Verstraete et al., 2023). Our research explored the heterogeneity of intervention effects concealed beneath the ATE framework. The heterogeneity of the experimental population was confirmed through a combined approach by grouped ITE values and logistic analysis. Building upon this foundation, we have also identified specific clinical characteristics of high-risk populations, reflecting the principles of precision medicine (Hartl et al., 2021). Concurrently, our study provides intuitive guidance for clinical practice.

This study employed causal machine learning models to not only estimate ATE but, more importantly, to reveal the underlying heterogeneity of effects. The robustness of this finding was strongly supported by sensitivity analyses (VanderWeele and Ding, 2017). Although the ATE of early metoprolol use on extended 48-h medication use and in-hospital mortality was consistent with the primary analysis, it did not reach statistical significance. The revealed pattern of individual treatment effect heterogeneity—i.e., the core clinical characteristics determining risk levels—highly aligned with findings from the 30-day mortality analysis. This consistency across outcomes at different time windows strongly suggested that patient subgroups defined by indicators such as APS III, age, SOFA, WBC, and BUN may universally exhibit greater vulnerability to early beta-blocker therapy within the ICU. Thus, the value of our study extended beyond assessing statistical significance for a single endpoint. It lied in providing a stable, transferable set of risk stratification characteristics that offer an evidence-based foundation for more refined treatment decisions within the highly heterogeneous ICU environment (Wu et al., 2020).

Although this study has obtained some findings, it still has certain limitations. Our data were derived from real-world database, the cases possibly exhibited biases, and some key variables (diagnosis and classification of heart failure, echocardiogram results, some types of therapeutic drugs) were incompletely recorded, meaning confounding factors could not be entirely ruled out. The findings remain of borderline statistical significance. Although the MIMIC database contains a large number of cases, it remains a single-center dataset, and our conclusions require validation in external cohorts. Furthermore, this study did not address the optimal timing for initiating beta-blocker therapy. These problems warrant further investigation in future research.

5 Conclusion

In summary, our study found that early metoprolol administration within 24 h of ICU admission for CHF diagnosed patients were associated with increased 30-day mortality, and this effect exhibited considerable heterogeneity. The adverse effect was particularly in patients with high illness severity scores, advanced age, higher WBC, and elevated BUN. These findings suggested that metoprolol should be used cautiously in critically ill heart failure patients following thorough assessment of their condition and mitigation of risk factors.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving humans were approved by Ethics Committee of Bojishan Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because The MIMIC database has implemented patient privacy protection measures. Consequently, our research has been granted ethical exemption.

Author contributions

YC: Conceptualization, Project administration, Supervision, Writing – original draft. TL: Writing – original draft, Writing – review and editing.

Funding

The author(s) declared that financial support was not received for this work and/or its publication.

Acknowledgements

The authors would like to thank all the patients who recorded in the MIMIC database.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphar.2026.1771969/full#supplementary-material

Abbreviations

ACEI, Angiotensin Converting Enzyme Inhibitors; APSIII, Acute Physiology Score III; ARB, Angiotensin Receptor Blockers; ATE, Average Treatment Effect; BUN, Blood Urea Nitrogen; CATE, Conditional Average Treatment Effect; CHF, Congestive Heart Failure; CI, Confidence Interval; CPR, Cardiopulmonary Resuscitation; ICD, International Statistical Classification of Diseases and Related Health Problems; ICU, Intensive Care Unit; ITE, Individual Treatment Effects; MIMIC, Medical Information Mart for Intensive Care; NNH, Number Needed to Harm; PSM, Propensity Score Matching; PTT, Partial Thromboplastin Time; RCT, Randomized Controlled Trial; SMD, Standardized Mean Difference; SOFA, Sequential Organ Failure Assessment; WBC, White Blood Cell.

References

AHA/ACC/HFSAACC/AHA Joint Committee Members (2022). 2022 AHA/ACC/HFSA guideline for the management of heart failure. J. Card. Fail. 28 (5), e1–e167. doi:10.1016/j.cardfail.2022.02.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Beghini, A., Sammartino, A. M., Papp, Z., von Haehling, S., Biegus, J., Ponikowski, P., et al. (2025). 2024 update in heart failure. ESC Heart Fail. 12 (1), 8–42. doi:10.1002/ehf2.14857

PubMed Abstract | CrossRef Full Text | Google Scholar

Benjamin, E. J., Muntner, P., Alonso, A., Bittencourt, M. S., Callaway, C. W., Carson, A. P., et al. (2019). Heart disease and stroke Statistics—2019 update: a report from the American heart association. Circulation 139 (10), e56–e528. doi:10.1161/CIR.0000000000000659

PubMed Abstract | CrossRef Full Text | Google Scholar

Bhagat, A. A., Greene, S. J., Vaduganathan, M., Fonarow, G. C., and Butler, J. (2019). Initiation, continuation, switching, and withdrawal of heart failure medical therapies during hospitalization. JACC Heart Fail. 7 (1), 1–12. doi:10.1016/j.jchf.2018.06.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Chan, A.-W., Boutron, I., Hopewell, S., Moher, D., Schulz, K. F., Collins, G. S., et al. (2025). SPIRIT 2025 statement: updated guideline for protocols of randomised trials. BMJ 389, e081477. doi:10.1136/bmj-2024-081477

PubMed Abstract | CrossRef Full Text | Google Scholar

Chioncel, O., Mebazaa, A., Maggioni, A. P., Harjola, V. P., Rosano, G., Laroche, C., et al. (2019). Acute heart failure congestion and perfusion status – impact of the clinical classification on in-hospital and long-term outcomes; insights from the ESC-EORP-HFA heart failure long-term registry. Eur. J. Heart Fail. 21 (11), 1338–1352. doi:10.1002/ejhf.1492

PubMed Abstract | CrossRef Full Text | Google Scholar

Di Santo, P., Mathew, R., Jung, R. G., Simard, T., Skanes, S., Mao, B., et al. (2021). Impact of baseline beta-blocker use on inotrope response and clinical outcomes in cardiogenic shock: a subgroup analysis of the DOREMI trial. Crit. Care 25 (1), 289. doi:10.1186/s13054-021-03706-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Fan, S., and Ma, J. (2024). The value of five scoring systems in predicting the prognosis of patients with sepsis-associated acute respiratory failure. Sci. Rep. 14 (1), 4760. doi:10.1038/s41598-024-55257-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Feuerriegel, S., Frauen, D., Melnychuk, V., Schweisthal, J., Hess, K., Curth, A., et al. (2024). Causal machine learning for predicting treatment outcomes. Nat. Med. 30 (4), 958–968. doi:10.1038/s41591-024-02902-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Hartl, D., de Luca, V., Kostikova, A., Laramie, J., Kennedy, S., Ferrero, E., et al. (2021). Translational precision medicine: an industry perspective. J. Transl. Med. 19 (1), 245. doi:10.1186/s12967-021-02910-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Heidenreich, P. A., Bozkurt, B., Aguilar, D., Allen, L. A., Byun, J. J., Colvin, M. M., et al. (2022). 2022 AHA/ACC/HFSA guideline for the management of heart failure: a report of the American college of cardiology/american heart association joint committee on clinical practice guidelines. Circulation 145 (18), e895–e1032. doi:10.1161/CIR.0000000000001063

PubMed Abstract | CrossRef Full Text | Google Scholar

Ibanez, B., Latini, R., Rossello, X., Dominguez-Rodriguez, A., Fernández-Vazquez, F., Pelizzoni, V., et al. (2025). Beta-blockers after myocardial infarction without reduced ejection fraction. N. Engl. J. Med. 393 (19), 1889–1900. doi:10.1056/NEJMoa2504735

PubMed Abstract | CrossRef Full Text | Google Scholar

Johnson, A. E. W., Bulgarelli, L., Shen, L., Gayles, A., Shammout, A., Horng, S., et al. (2023). MIMIC-IV, a freely accessible electronic health record dataset. Sci. Data 10 (1), 1. doi:10.1038/s41597-022-01899-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Jondeau, G., Neuder, Y., Eicher, J.-C., Jourdain, P., Fauveau, E., Galinier, M., et al. (2009). B-CONVINCED: beta-Blocker CONtinuation vs. INterruption in patients with congestive heart failure hospitalizED for a decompensation episode. Eur. Heart J. 30 (18), 2186–2192. doi:10.1093/eurheartj/ehp323

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, M., Xu, Z., He, L., Xu, X., Cai, X., Guo, Y., et al. (2025a). Heterogeneous effects of obesity on all-cause mortality: a causal forest analysis of hospitalized patients in China. Diabetes Obes. Metab. 27 (7), 3695–3704. doi:10.1111/dom.16391

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, C., Wang, H., Liu, C., and Cao, M. (2025b). The predictive value of SOFA and APSIII scores for 28-day mortality risk in SIMI: a cohort study based on the MIMIC-IV database. Front. Cell Infect. Microbiol. 15, 1574625. doi:10.3389/fcimb.2025.1574625

PubMed Abstract | CrossRef Full Text | Google Scholar

Martínez-Milla, J., Raposeiras-Roubín, S., Pascual-Figal, D. A., and Ibáñez, B. (2019). Role of beta-blockers in cardiovascular disease in 2019. Rev. Española Cardiol. 72 (10), 844–852. doi:10.1016/j.rec.2019.04.014

PubMed Abstract | CrossRef Full Text | Google Scholar

Meier, S., Henkens, M., Heymans, S., and Robinson, E. L. (2020). Unlocking the value of white blood cells for heart failure diagnosis. J. Cardiovasc. Transl. Res. 14 (1), 53–62. doi:10.1007/s12265-020-10007-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Munkhaugen, J., Kristensen, A. M. D., Halvorsen, S., Holmager, T., Olsen, M. H., Bakken, A., et al. (2025). Beta-blockers after myocardial infarction in patients without heart failure. N. Engl. J. Med. 393 (19), 1901–1911. doi:10.1056/NEJMoa2505985

PubMed Abstract | CrossRef Full Text | Google Scholar

Ostrominski, J. W., DeFilippis, E. M., Bansal, K., Riello, R. J., 3rd, Bozkurt, B., Heidenreich, P. A., et al. (2024). Contemporary American and European guidelines for heart failure management: JACC: heart failure guideline comparison. JACC Heart Fail. 12 (5), 810–825. doi:10.1016/j.jchf.2024.02.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Özdemir, S., Akça, H., Algın, A., Altunok, İ., and Eroğlu, S. E. (2021). Effectiveness of the rapid emergency medicine score and the rapid acute physiology score in prognosticating mortality in patients presenting to the emergency department with COVID-19 symptoms. Am. J. Emerg. Med. 49, 259–264. doi:10.1016/j.ajem.2021.06.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Paddock, K., Woolfall, K., Kearney, A., Pattison, N., Frith, L., Gamble, C., et al. (2022). Learning from stakeholders to inform good practice guidance on consent to research in intensive care units: a mixed-methods study. BMJ Open 12 (11), e066149. doi:10.1136/bmjopen-2022-066149

PubMed Abstract | CrossRef Full Text | Google Scholar

Paolillo, S., Dell'Aversana, S., Esposito, I., Poccia, A., and Perrone Filardi, P. (2021). The use of β-blockers in patients with heart failure and comorbidities: doubts, certainties and unsolved issues. Eur. J. Intern. Med. 88, 9–14. doi:10.1016/j.ejim.2021.03.035

PubMed Abstract | CrossRef Full Text | Google Scholar

Pollesello, P., Levijoki, J., and Papp, Z. (2025). Chronotropic effects of milrinone in a Guinea pig Ex Vivo model: a pilot study to screen for new mechanisms of action. J. Cardiovasc Pharmacol. 85 (4), 278–286. doi:10.1097/FJC.0000000000001675

PubMed Abstract | CrossRef Full Text | Google Scholar

Ranzani, O. T., Singer, M., Salluh, J. I. F., Shankar-Hari, M., Pilcher, D., Berger-Estilita, J., et al. (2025). Development and validation of the sequential organ failure assessment (SOFA)-2 score. JAMA. doi:10.1001/jama.2025.20516

PubMed Abstract | CrossRef Full Text | Google Scholar

Richens, J. G., Lee, C. M., and Johri, S. (2020). Improving the accuracy of medical diagnosis with causal machine learning. Nat. Commun. 11 (1), 3923. doi:10.1038/s41467-020-17419-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Schurtz, G., Mewton, N., Lemesle, G., Delmas, C., Levy, B., Puymirat, E., et al. (2023). Beta-blocker management in patients admitted for acute heart failure and reduced ejection fraction: a review and expert consensus opinion. Front. Cardiovasc. Med. 10, 1263482. doi:10.3389/fcvm.2023.1263482

PubMed Abstract | CrossRef Full Text | Google Scholar

Sen, I., Trzaskalski, N. A., Hsiao, Y. T., Liu, P. P., Shimizu, I., and Derumeaux, G. A. (2025). Aging at the crossroads of organ interactions: implications for the heart. Circ. Res. 136 (11), 1286–1305. doi:10.1161/CIRCRESAHA.125.325637

PubMed Abstract | CrossRef Full Text | Google Scholar

Singh, M. S., Thongam, K., Choudhary, P., and Bhagat, P. K. (2024). An integrated machine learning approach for congestive heart failure prediction. Diagnostics 14 (7), 736. doi:10.3390/diagnostics14070736

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, W. H. W., Bakitas, M. A., Cheng, X. S., Fang, J. C., Fedson, S. E., Fiedler, A. G., et al. (2024). Evaluation and management of kidney dysfunction in advanced heart failure: a scientific statement from the American heart association. Circulation 150 (16), e280–e295. doi:10.1161/CIR.0000000000001273

PubMed Abstract | CrossRef Full Text | Google Scholar

VanderWeele, T. J., and Ding, P. (2017). Sensitivity analysis in observational research: introducing the E-Value. Ann. Intern. Med. 167 (4), 268–274. doi:10.7326/M16-2607

PubMed Abstract | CrossRef Full Text | Google Scholar

Verstraete, K., Gyselinck, I., Huts, H., Das, N., Topalovic, M., De Vos, M., et al. (2023). Estimating individual treatment effects on COPD exacerbations by causal machine learning on randomised controlled trials. Thorax 78 (10), 983–989. doi:10.1136/thorax-2022-219382

PubMed Abstract | CrossRef Full Text | Google Scholar

Vincent, J. L., Moreno, R., Takala, J., Willatts, S., De Mendonça, A., Bruining, H., et al. (1996). The SOFA (Sepsis-related organ failure assessment) score to describe organ dysfunction/failure. On behalf of the working group on sepsis-related problems of the European society of intensive care medicine. Intensive Care Med. 22 (7), 707–710. doi:10.1007/BF01709751

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., Zhang, Y., Xia, J., Xu, H., Zhang, L., Feng, N., et al. (2024). Impact of β-blockers on in-hospital mortality in patients with heart failure: a retrospective propensity-score matched analysis based on MIMIC-IV database. Front. Pharmacol. 15, 1448015. doi:10.3389/fphar.2024.1448015

PubMed Abstract | CrossRef Full Text | Google Scholar

Weberpals, J., Feuerriegel, S., Schaar, M., and Kehl, K. L. (2025). Opportunities for causal machine learning in precision oncology. NEJM AI 2 (8), AIp2500277. doi:10.1056/AIp2500277

CrossRef Full Text | Google Scholar

Wu, C. S., Luedtke, A. R., Sadikova, E., Tsai, H. J., Liao, S. C., Liu, C. C., et al. (2020). Development and validation of a machine learning individualized treatment rule in first-episode schizophrenia. JAMA Netw. Open 3 (2), e1921660. doi:10.1001/jamanetworkopen.2019.21660

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, Z.-F., Tao, S.-X., Su, W.-T., Chen, S., Xu, B.-D., Zong, G.-J., et al. (2025). Development and validation of a novel coronary artery disease risk prediction model. J. Transl. Med. 23 (1), 41. doi:10.1186/s12967-024-05789-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, Z., Wang, H., Wang, W., Li, J., Lei, L., Zhang, L., et al. (2025). In-hospital use of beta-blockers for critically ill patients with acute heart failure: whether and when to initiate. J. Clin. Anesth. 103, 111824. doi:10.1016/j.jclinane.2025.111824

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: causal forest, causal machine learning, congestive heart failure, metoprolol, treatment heterogeneity

Citation: Chen Y and Li T (2026) Early metoprolol use in ICU patients with congestive heart failure is associated with increased 30-day mortality: a causal machine learning study. Front. Pharmacol. 17:1771969. doi: 10.3389/fphar.2026.1771969

Received: 20 December 2025; Accepted: 27 January 2026;
Published: 09 February 2026.

Edited by:

Mohamed Hassan Elnaem, Ulster University, United Kingdom

Reviewed by:

Zeming Zhou, Capital Medical University, China
Manuel Mallol, University of Chile, Chile

Copyright © 2026 Chen and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tianyou Li, eW91dGhfbHR5QDEyNi5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.