ORIGINAL RESEARCH article
Front. Cardiovasc. Med.
Sec. Cardiovascular Epidemiology and Prevention
Volume 12 - 2025 | doi: 10.3389/fcvm.2025.1579947
Prediction of cardiovascular risk using machine-learning methods. Sex-specific differences
Provisionally accepted- 1Department of Statistical Methods, University of Zaragoza, Zaragoza, Spain
- 2GRISSA Research Group, Zaragoza, Spain
- 3Aragonese Institute of Health Research, University of Zaragoza, Zaragoza, Aragon, Spain
- 4Research Network on Chronicity, Primary Care and Health Promotion (RICAPPS), Barcelona, Balearic Islands, Spain
- 5Department of Microbiology, Pediatrics, Radiology, and Public Health, University of Zaragoza, Zaragoza, Aragon, Spain
- 65Department of Microbiology, Pediatrics, Radiology, and Public Health, University of Zaragoza, Zaragoza, Aragon, Spain
- 7Aragonese Institute of Health Sciences, Zaragoza, Aragon, Spain
- 8Department of Economic Structure, University of Zaragoza, Zaragoza, Aragon, Spain
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Background: Machine learning (ML) algorithms offer some advantages over traditional scoring systems to assess the influence of cardiovascular risk factors (CVRFs) on the risk of major cardiovascular event (MACE), and could be useful in clinical practice. These algorithms can also be trained using a growing body of real world data (RWD). The aim of the study was to evaluate the MACE risk applying the XGBoost and Random Forest ML algorithms to RWD, stratifying the study population by sex, comparing the outcomes of these two algorithms. Methods: The follow-up period of the study was from 2018 to 2020. For each algorithm, 3 models were generated, including age and different combinations of three groups of variables: blood test and blood pressure measurements; CVRFs; and medication adherence. Results: In this study, 52,393 subjects were included, of whom 581 suffered a MACE. The incidence of MACE was 1% in women and 1.3% in men. The most prevalent CVRF was hypertension, followed by hypercholesterolaemia in both sexes. Adherence to treatment was highest for antihypertensives and lowest for antidiabetics. In all models age was the greatest relative contributor to the risk of MACE, followed by adherence to antidiabetics. Adherence to treatment proved to be an important variable in the risk of having a MACE. Moreover, similar performance was found for RF and XGBoost algorithms. Conclusion: These findings support the use of ML to assess cardiovascular risk and guide personalized prevention strategies in primary care settings.
Keywords: machine learning, cardiovascular disease, Adherence to treatment, random forest, XG Boost
Received: 19 Feb 2025; Accepted: 04 Jun 2025.
Copyright: © 2025 Castel-Feced, Aguilar, Malo, González-García, Maldonado and Rabanaque-Hernández. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Sara Castel-Feced, Department of Statistical Methods, University of Zaragoza, Zaragoza, Spain
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.