Machine Learning Techniques for Improved Prediction of Cardiovascular Diseases Using Integrated Healthcare Data

Kahraman, Abdulgani

doi:10.3389/frai.2025.1694450

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Medicine and Public Health

Machine Learning Techniques for Improved Prediction of Cardiovascular Diseases Using Integrated Healthcare Data

Provisionally accepted

Abdulgani Kahraman^*

Balikesir Universitesi - Cagis Kampusu, Balikesir, Türkiye

The final, formatted version of the article will be published soon.

Cardiovascular disease continues to cause an important global health challenge, highlighting the critical importance of early detection in mitigating cardiac-related issues. There is a significant demand for reliable diagnostic alternatives. Taking advantage of health data through diverse machine learning algorithms may offer a more precise diagnostic approach. Machine learning-based decision support systems that utilize patients' clinical parameters present a promising solution for diagnosing cardiovascular disease . In this research, we collected extensive publicly available healthcare records. We integrated medical datasets based on common features to implement several machine learning models aimed at exploring the potential for more robust predictions of cardiovascular disease (CVD). The merged dataset initially contained 323,680 samples sourced from multiple databases. Following data preprocessing steps including cleaning, alignment of features, and removal of missing values, the final dataset consisted of 311,710 samples used for model training and evaluation. In our experiments, the CatBoost model achieved the highest area under the curve (AUC) of up to 94.1%.

Keywords: cardiovascular disease, machine learning, diagnostic alternatives, Healthcare data, Integrate & analyze & visualize

Received: 28 Aug 2025; Accepted: 17 Nov 2025.

Copyright: © 2025 Kahraman. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Abdulgani Kahraman, abdulgani.kahraman@balikesir.edu.tr

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.