You're viewing our updated article page. If you need more time to adjust, you can return to the old layout.

BRIEF RESEARCH REPORT article

Front. Cardiovasc. Med., 05 February 2026

Sec. Cardiac Rhythmology

Volume 12 - 2025 | https://doi.org/10.3389/fcvm.2025.1748673

AI-based prediction of heart failure progression in persistent atrial fibrillation using wearable electrocardiography: a brief research report

  • 1. Department of Technology Development, Seers Technology Company Ltd., Pyeongtaek, Republic of Korea

  • 2. Department of Brain and Cognitive Engineering, Korea University, Seoul, Republic of Korea

  • 3. Department of Cardiology, College of Medicine, St. Vincent’s Hospital, The Catholic University of Korea, Suwon, Republic of Korea

  • 4. College of Medicine, Catholic Research Institute for Intractable Cardiovascular Disease (CRID), The Catholic University of Korea, Seoul, Republic of Korea

Article metrics

View details

577

Views

37

Downloads

Abstract

Background:

Persistent atrial fibrillation (AF) frequently coexists with heart failure (HF), yet HF monitoring remains limited by the need for repeated blood-based biomarkers such as N-terminal pro-brain natriuretic peptide (NT-proBNP). Advances in wearable electrocardiography (ECG) and artificial intelligence (AI) now allow continuous extraction of digital physiologic signatures that may reflect hemodynamic stress.

Objective:

To evaluate the feasibility of predicting HF progression using wearable ECG–derived features in patients with persistent AF.

Methods:

Fifty patients with persistent AF underwent 3–7 days of single-lead ECG monitoring. Heart rate variability (HRV) and RR-interval features from 30 min windows were combined with baseline clinical metrics. A context-aware deep learning model using long short-term memory (LSTM) and attention mechanisms was trained to predict 6–12-month NT-proBNP changes. Model performance was assessed using root mean squared error (RMSE), mean absolute error (MAE), and the accuracy of directional NT-proBNP change.

Results:

The best performance was achieved when clinical metrics, RR features, and long-term HRV summaries were combined (RMSE 1,667.04; MAE 950.52). Directional classification of NT-proBNP trajectories achieved an accuracy of 0.82. ECG-only models performed comparably to multimodal models.

Conclusion:

Wearable ECG–based AI modeling is feasible for predicting trends in HF biomarkers in persistent AF. These results provide early evidence that ECG-derived digital biomarkers may offer a scalable, non-invasive approach for longitudinal HF monitoring.

Introduction

Atrial fibrillation (AF) and heart failure (HF) share complex bidirectional interactions that accelerate disease progression and worsen outcomes (1, 2). N-terminal pro-brain natriuretic peptide (NT-proBNP) reflects ventricular wall stress but requires in-hospital blood sampling, limiting its use for longitudinal monitoring (3, 4). In persistent AF, beat-to-beat irregularity and autonomic fluctuations further complicate the interpretation of conventional biomarkers.

Wearable ECG devices enable continuous rhythm monitoring and extraction of digital biomarkers such as heart rate variability (HRV) and RR-interval dynamics (57). Concurrent advances in artificial intelligence (AI) have shown that ECG-based algorithms can detect subtle physiologic signatures associated with ventricular dysfunction and HF events (810). In parallel, clinical biomarkers remain essential for assessing hemodynamic status. Longitudinal NT-proBNP trends may better reflect chronic hemodynamic stress than single measurements, especially in persistent AF.

This study aimed to evaluate the feasibility of predicting HF progression using an AI model trained on wearable ECG-derived features in patients with persistent AF. We hypothesized that ECG-based digital biomarkers could serve as surrogates for conventional laboratory indices such as NT-proBNP and facilitate personalized HF monitoring.

Methods

Study population

This prospective observational study enrolled 50 patients with persistent AF who underwent wearable ECG monitoring between 2023 and 2024 at St. Vincent's Hospital, The Catholic University of Korea. Adults aged 20 years and older were eligible. Exclusion criteria included acute decompensated HF, pacemaker rhythm, incomplete follow-up data, excessive motion artifact (>20% unusable signal), and <48 h of analyzable ECG. Follow-up NT-proBNP measurements were obtained during routine clinical visits, resulting in a sampling interval of 6–12 months. All participants provided written informed consent. The study was approved by the institutional review board (VC23OISI0082) and conducted in accordance with the principles outlined in the Declaration of Helsinki.

Wearable ECG monitoring and feature extraction

Participants wore a single-lead wearable ECG device (MobiCARE, Seers Technology Co., Korea) continuously for 3 to 7 days. The ECG was segmented into 30 min windows to derive sequential HRV features (HRV30 min). Long-term HRV summaries and RR-interval–based features (HRV72 h and RR) were calculated from the entire recording period. Baseline clinical metrics (CM) were extracted from electronic medical records. Feature distributions were assessed using Q–Q plots and the Shapiro–Wilk test. All features were standardized using z-score scaling and Yeo–Johnson transformation. A 30 min window length was used to balance temporal resolution and noise reduction in persistent AF. Feature groups are summarized in Table 1.

Table 1

Feature group Description Included features
CM Clinical variables collected from medical records Age, BMI, HBP, DM, CVA, HFpEF, AAD, NOAC, Antiplatelet, BB, CCB, Hb, platelet, creatinine, uric acid, e/e′, EF(TTE), TR(TTE), baseline BNP
RR RR interval-related features extracted from 72 h ECG summary statistics minRR, maxRR, avgRR
HRV72 h HRV features derived from 72 h ECG summary statics minHR, maxHR, avgHR
HRV30 min HRV features computed from 30 min ECG segments to capture sequential physiological information meanNN, SDNN, RMSSD, NN50, pNN50, LF, HF, LF/HF, TP

Definition of feature groups used in the analysis.

AAD, antiarrhythmic drug; ACEi, angiotensin-converting enzyme inhibitor; AF, atrial fibrillation; ARB, angiotensin II receptor blocker; BB, beta-blocker; BMI, body-mass index; BNP/NT-proBNP, B-type natriuretic peptide/N-terminal pro-B-type natriuretic peptide; CAD, coronary artery disease; CCB, calcium-channel blocker; Cr, serum creatinine; CT Vol, computed-tomography-derived cardiac volume; CVA, cerebrovascular accident; DM, diabetes mellitus; e/e′, early mitral inflow velocity to mitral annular early diastolic velocity ratio; EF, ejection fraction; Hb, hemoglobin; HBP, hypertension; HF, heart failure; HFpEF, heart failure with preserved ejection fraction; HR, heart rate; HRV, heart-rate variability; LF/HF, low-frequency/high-frequency spectral power of HRV; NOAC, non–vitamin K oral anticoagulant; RMSSD, root mean square of successive differences; SDNN, standard deviation of normal-to-normal RR intervals; TP, total power (HRV index); TR Vmax, tricuspid regurgitation maximal velocity.

ECG preprocessing was performed using device-generated R-peak annotations together with noise-labeled intervals. R-peaks occurring within noise-marked segments were removed, and no additional ectopic-beat filtering was applied, consistent with the physiology of persistent AF. For each 30 min window, RR intervals were computed from successive R-peak timestamps. Windows with fewer than two valid RR intervals were treated as invalid, and all HRV metrics were coded as −1. Time-domain HRV metrics were computed directly from the cleaned RR series. At the same time, frequency-domain features were derived by converting RR intervals to seconds, applying linear interpolation to obtain an evenly sampled signal, and estimating power spectra using Welch's method. Windows with insufficient RR data for spectral estimation likewise had frequency-domain HRV metrics assigned a value of −1.

Although conventional short-term HRV measures have limited physiological interpretability in persistent AF because beat-to-beat irregularity obscures sinus-node–mediated autonomic modulation, a prior study has shown that long-window HRV and RR-variability–derived metrics can still reflect global autonomic tone and hemodynamic burden in AF. Based on this evidence, we used 30 min aggregated windows as surrogate markers of autonomic dynamics, as longer windows reduce stochastic irregularity and capture lower-frequency physiologic patterns relevant to AF-related remodeling (11).

To ensure signal reliability, artifact-related noise was mitigated by excluding recordings with more than 20% unusable segments, removing physiologically implausible RR intervals, and applying window-level aggregation, which collectively improves robustness in AF-derived HRV estimation.

Model development

A hybrid deep learning model was developed to integrate time-varying ECG-derived features with static clinical information. For each patient, HRV features, RR-interval dynamics, and interaction terms were aggregated into a multivariate time series that reflects autonomic and beat-to-beat variability over the monitoring period. Clinical metrics, which are inherently non-temporal, were replicated across all 30 min windows and concatenated with the sequential features to create a unified model input. This allowed the network to learn temporal autonomic patterns while simultaneously leveraging stable patient-specific characteristics. Baseline NT-proBNP and meanRMSSD were also incorporated as context vectors within the attention mechanism to modulate temporal weighting in a patient-dependent manner.

The model architecture consisted of a three-layer long short-term memory (LSTM) network with 64, 32, and 16 hidden units, followed by a context-aware attention module and a two-layer regression head. This design enabled the model to capture both short-term fluctuations and longer-range patterns in autonomic variability. Training was performed using the Adam optimizer (learning rate 0.001), a batch size of 8, and mean squared error as the loss function, with early stopping applied based on training performance (patience = 10). To minimize overfitting in this small-sample setting, several regularization strategies were used, including L2 weight decay (0.001) and dropout (0.5).

Additional details regarding the model architecture, attention mechanisms, input construction, and statistical feature analysis are provided in the Supplementary Methods.

Evaluation

Model performance was assessed using leave-one-out cross-validation (LOOCV) at the patient level, ensuring that no information from the held-out patient was used in model training. For each iteration, the model generated a continuous prediction of the change in NT-proBNP, which was subsequently categorized as an increase or a decrease. The primary endpoint was the predicted change in NT-proBNP. Performance metrics included root mean squared error (RMSE), mean absolute error (MAE), and directional accuracy.

Results

Study population

Among the 50 patients included in the analysis, the median age was 70.0 [60.25–78.75] years, and 56% were male. Baseline clinical characteristics are shown in Table 2.

Table 2

Variable Value
Male, n (%) 28 (56.0%)
Age [median (IQR)] 70.0 [60.25–78.75]
BMI [median (IQR)] 24.76 [22.37–26.27]
HBP, n (%) 38 (76.0%)
DM, n (%) 15 (30.0%)
CVA, n (%) 3 (6.0%)
HFpEF, n (%) 50 (100.0%)
CKD, n (%) 2 (4.0%)
NYHA_class I: 6 (12.0%); II: 32 (64.0%); III: 12 (24.0%)
anriarrhythmic drug, n (%) 18 (36.0%; Amiodarone 17, Sotalol 1)
Warfarin 0 (0%)
NOAC, n (%) 48 (96.0%)
Antiplatelet, n (%) 3 (6.0%)
ARB/ACEi, n (%) 30 (60.0%)
Beta blocker, n (%) 36 (72.0%)
calcium channel blocker, n (%) 2 (4.0%)
Creatinine (mg/dL) [median (IQR)] 1.0 [0.70–1.10]
BNP (pg/mL) [median (IQR)] 1,344.0 [708.0–3,527.5]
LAvolume (mL) [median (IQR)] 168.8 [167.1–191.1]
e/e' [median (IQR)] 11.07 [8.2–15.6]
LVEF(%) [median (IQR)] 59.8 [54.8–62.0]

Baseline demographic, clinical, and echocardiographic characteristics of the study population (n = 50).

ACEi, angiotensin-converting enzyme inhibitor; ARB, angiotensin II receptor blocker; BMI, body-mass index; BNP, B-type natriuretic peptide; CVA, cerebrovascular accident; DM, diabetes mellitus; e/e′, early mitral inflow velocity to mitral annular early diastolic velocity ratio; EF, ejection fraction; HBP, hypertension; HFmrEF, heart failure with mildly reduced ejection fraction; HFpEF, heart failure with preserved ejection fraction; HFrEF, heart failure with reduced ejection fractionNOAC, non–vitamin K oral anticoagulant.

Model performance

The best regression performance was achieved when combining clinical metrics, RR-interval features, and long-term HRV summaries (HRV72h), yielding an RMSE of 1,667.04 and an MAE of 950.52. When regression outputs were converted into a trend label (increase vs. decrease/stable), the model achieved directional accuracy of 0.82 (Supplementary Table S1). Predicted and observed NT-proBNP trajectories for individual patients are shown in Figure 1 and Supplementary Table S2. Figure 2 illustrates the architecture of the context-aware LSTM model.

Figure 1

Line graph comparing predicted versus true BNP trajectories over follow-up intervals from six to twelve months. The y-axis shows BNP levels in picograms per milliliter. A solid blue line represents true BNP with a 95% confidence interval, while a dashed red line represents predicted BNP with a similar confidence interval. Both trajectories show an upward trend.

Predicted vs. observed NT-proBNP trajectories over the 6–12-month follow-up period (median 8.1 months, IQR 3.5 months), showing mean ± 95% CI and individual patient trajectories (spaghetti lines). Individual NT-proBNP trajectories (gray spaghetti lines) for all patients, overlaid with mean predicted and actual NT-proBNP values with 95% confidence intervals. The alignment between predicted and observed trajectories indicates that the model captures the overall direction of longitudinal changes in NT-proBNP. Mean trajectories are shown with solid (true) and dashed (predicted) lines using the study's context-aware LSTM model.

Figure 2

Flowchart illustrating a predictive model for BNP trajectory using wearable single-lead ECG data. Sequential input includes HRV and RR, while static input consists of baseline BNP and meanRMSSD. Data passes through LSTM layers and context-aware attention to produce the predicted BNP trajectory.

Architecture of the context-aware LSTM model integrating wearable ECG features. The diagram illustrates the full modeling pipeline, combining sequential ECG-derived features (HRV30 min, RR variability, and interaction terms) and static clinical variables (baseline NT-proBNP and meanRMSSD). Three LSTM layers extract temporal patterns, while a context-aware attention mechanism reweights time steps using a clinical context vector. The final dense regression head outputs predicted NT-proBNP trajectories. This architecture enables both dynamic physiologic pattern recognition and patient-specific weighting for predicting NT-proBNP trends.

Key observations

Models using ECG-only features performed comparably to those incorporating multimodal clinical variables.

As shown in Supplementary Table S3, patients with increasing NT-proBNP experienced more HF-related hospitalizations (9 vs. 5), and additional events such as cardioversion and stroke. However, these differences did not reach statistical significance due to the limited number of clinical events.

Discussion and conclusion

This study provides the first proof-of-concept demonstration that wearable ECG–derived digital biomarkers can predict longitudinal trajectories of NT-proBNP in patients with persistent AF. The comparable performance of ECG-only and multimodal models suggests that RR-interval and autonomic patterns derived from continuous wearable ECG may reflect underlying hemodynamic burden, even in the setting of rhythm irregularity.

Our findings extend prior work that relied on imaging or static clinical parameters for HF risk prediction by demonstrating that non-clinically collected ECG data can yield comparable predictive insights (12, 13). Incorporating such models into telemonitoring workflows may enable earlier identification of worsening physiologic trends and reduce dependence on repeated blood sampling, particularly for elderly AF patients or those with limited access to care.

Several limitations warrant consideration. The study involved a small, single-center cohort without external validation, which limits generalizability. NT-proBNP measurements were obtained at routine visits, resulting in heterogeneous sampling intervals that may introduce temporal noise. Additionally, the NT-proBNP trajectory was used as a surrogate endpoint rather than as a clinical event. Although multiple physiologic factors influence NT-proBNP, its longitudinal trend may more reliably reflect chronic hemodynamic stress than isolated values, especially in persistent AF, where beat-to-beat variability complicates traditional HRV interpretation. Although the fold-wise RMSE variance appeared wide (IQR 1,407, Q1 334.7—Q3 1,741.7), this pattern primarily reflected the large inter-patient heterogeneity in baseline NT-proBNP levels (IQR: 2,145 pg/mL, Q1 821 pg/mL—Q3 2,967 pg/mL), which naturally produces larger absolute errors in patients with high baseline values. This dispersion, therefore, reflects biological variability rather than model overfitting, and the stability of the median fold error indicates that model performance remained robust despite the small sample size. Taken together, these results should be interpreted as exploratory and hypothesis-generating.

Future work will incorporate direction-sensitive loss functions and rolling-slope analysis to enhance the detection of dynamic changes in NT-proBNP, necessitating validation on larger multicenter datasets.

Wearable ECG–based AI modeling shows promise as a scalable, non-invasive approach for monitoring HF progression in persistent AF. With further validation and integration into real-time telemonitoring systems, this strategy may support proactive, personalized management of HF.

Statements

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors upon reasonable request.

Ethics statement

The studies involving humans were approved by institutional review board (VC23OISI0082). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

C-BS: Formal analysis, Investigation, Methodology, Project administration, Software, Writing – original draft. YK: Formal analysis, Methodology, Project administration, Software, Visualization, Writing – review & editing. YH: Conceptualization, Data curation, Funding acquisition, Investigation, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. The author(s) declared that this work received funding from Daewoong Pharmaceutical Co., Ltd. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article, or the decision to submit it for publication. This study was also supported by institutional research funding from the College of Medicine, The Catholic University of Korea.

Conflict of interest

Authors C-BS, YK were employed by company by Seers Technology Co. Ltd.

The remaining author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcvm.2025.1748673/full#supplementary-material

References

  • 1.

    Wang TJ Larson MG Levy D Vasan RS Leip EP Wolf PA et al Temporal relations of atrial fibrillation and congestive heart failure and their joint influence on mortality: the framingham heart study. Circulation. (2003) 107(23):29205. 10.1161/01.CIR.0000072767.89944.6E

  • 2.

    Rienstra M Hobbelt AH Alings M Tijssen JGP Smit MD Brügemann J et al Targeted therapy of atrial fibrillation in heart failure with preserved ejection fraction: a randomized clinical trial. Eur Heart J. (2020) 41(9):93947. 10.1093/eurheartj/ehx739

  • 3.

    Maisel AS Krishnaswamy P Nowak RM McCord J Hollander JE Duc P et al Bedside B-type natriuretic peptide in the evaluation of acute dyspnea. N Engl J Med. (2002) 347(3):1617. 10.1056/NEJMoa020233

  • 4.

    McMurray JJV Packer M Desai AS Gong J Lefkowitz MP Rizkala AR et al Angiotensin–neprilysin inhibition versus enalapril in heart failure. N Engl J Med. (2014) 371(11):9931004. 10.1056/NEJMoa1409077

  • 5.

    Kwon JM Kim KH Eisen HJ Cho Y Jeon KH Lee SY et al Artificial intelligence assessment for early detection of heart failure with preserved ejection fraction based on electrocardiographic features. Eur Heart J Digit Health. (2020) 2(1):10616. 10.1093/ehjdh/ztaa015

  • 6.

    Attia ZI Noseworthy PA Lopez-Jimenez F Asirvatham SJ Deshmukh AJ Gersh BJ et al An artificial intelligence–enabled ECG algorithm for identification of patients with atrial fibrillation during sinus rhythm. Lancet. (2019) 394(10201):8617. 10.1016/S0140-6736(19)31721-0

  • 7.

    Topol EJ . High-performance medicine: the convergence of human and artificial intelligence. Nat Med. (2019) 25(1):4456. 10.1038/s41591-018-0300-7

  • 8.

    Dey D Slomka PJ Leeson P Comaniciu D Shrestha S Sengupta PP et al Artificial intelligence in cardiovascular imaging: JACC state-of-the-art review. J Am Coll Cardiol. (2019) 73(11):131735. 10.1016/j.jacc.2018.12.054

  • 9.

    Sengupta PP Shrestha S Berthon B Messas E Donal E Tison GH et al Proposed requirements for cardiovascular imaging-related machine learning evaluation (PRIME): a checklist: reviewed by the American College of Cardiology Healthcare Innovation Council. JACC Cardiovasc Imaging. (2020) 13(9):201735. 10.1016/j.jcmg.2020.07.015

  • 10.

    Nakano K Nochioka K Yasuda S Tamori D Shiroto T Sato Y et al Machine learning approach to stratify complex heterogeneity of chronic heart failure: a report from the CHART-2 study. ESC Heart Fail. (2023) 10(3):1597604. 10.1002/ehf2.14288

  • 11.

    Zhang L Li B Wu L . Heart rate variability in patients with atrial fibrillation of sinus rhythm or atrial fibrillation: chaos or merit?Ann Med. (2025) 57(1):2478474. 10.1080/07853890.2025.2478474

  • 12.

    Moreno-Sánchez PA . Improvement of a prediction model for heart failure survival through explainable artificial intelligence. Front Cardiovasc Med. (2023) 10:1219586. 10.3389/fcvm.2023.1219586

  • 13.

    Li H An Z Zuo S Zhu W Zhang Z Zhang S et al Artificial intelligence-enabled ECG algorithm based on improved residual network for warable ECG. Sensors (Basel). (2021) 21(18):6043. 10.3390/s21186043

Summary

Keywords

artificial intelligence, atrial fibrillation, digital health, electrocardiography, heart failure, machine learning, wearable device

Citation

Song C-B, Kim YK and Hwang Y (2026) AI-based prediction of heart failure progression in persistent atrial fibrillation using wearable electrocardiography: a brief research report. Front. Cardiovasc. Med. 12:1748673. doi: 10.3389/fcvm.2025.1748673

Received

18 November 2025

Revised

15 December 2025

Accepted

22 December 2025

Published

05 February 2026

Volume

12 - 2025

Edited by

Dimitrios Vrachatis, National and Kapodistrian University of Athens, Greece

Reviewed by

Taehyun Hwang, Severance Hospital, Republic of Korea

Denghong Zhang, Chengdu Fifth People’s Hospital, China

Updates

Copyright

* Correspondence: YouMi Hwang

ORCID YouMi Hwang orcid.org/0000-0001-8385-6485

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics