Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Netw. Physiol.

Sec. Networks in the Cardiovascular System

This article is part of the Research TopicArtificial Intelligence in Cardiovascular ResearchView all 9 articles

Coronary Artery Disease Prediction using Bayesian-Optimized Support Vector Machine with Feature Selection

Provisionally accepted
Abdul Zahir  BaratpurAbdul Zahir Baratpur1Hamed  Vahdat-NejadHamed Vahdat-Nejad1Emrah  ArslanEmrah Arslan2Javad  Hassannataj JoloudariJavad Hassannataj Joloudari1Silvia  GaftandzhievaSilvia Gaftandzhieva3*
  • 1University of Birjand, Birjand, Iran
  • 2KTO Karatay Universitesi, Konya, Türkiye
  • 3Plovdivski universitet Paisij Hilendarski, Plovdiv, Bulgaria

The final, formatted version of the article will be published soon.

Cardiovascular diseases, particularly Coronary Artery Disease (CAD), remain a leading cause of mortality worldwide. Invasive angiography, while accurate, is costly and risky. This study proposes a non-invasive, interpretable CAD prediction framework using the Z-Alizadeh Sani dataset. A hybrid decision tree–AdaBoost method selects 30 clinically relevant features. To prevent data leakage, SMOTE oversampling is applied exclusively within each training fold of a 10-fold cross-validation pipeline. The SVM is optimized using Bayesian hyperparameter tuning, compared against Sea Lion Optimization (SLOA) and grid search. Ablation studies and Wilcoxon signed-rank tests confirm the statistical superiority of the proposed SVM_Bayesian model. SHapley Additive exPlanations (SHAP) analysis reveals clinically meaningful feature contributions (e.g., Typical Chest Pain, Age, EF-TTE). 95% bootstrap confidence intervals and temporal generalization on an independent test set ensure robustness and no overfitting. The proposed model achieves 97.67% accuracy, 95.45% precision, 100.00% sensitivity, 97.67% F1-score, and 99.00% AUC, outperforming logistic regression (93.02% accuracy, 92.68% F1-score), random forest (95.45% accuracy, 93.33% F1-score), standard SVM (77.00% accuracy), and SLOA-optimized SVM (93.02% accuracy). Future work includes validation on external real-world datasets. This framework suggests a transparent, generalizable, and clinically actionable tool for CAD risk stratification, and is in keeping with the tenets of network physiology with its focus on interconnected cardiovascular features in predicting systemic disease.

Keywords: Coronary artery disease prediction, Support vector machine, Bayesian optimization, Sealion optimization, Feature Selection, Network physiology

Received: 02 Jul 2025; Accepted: 19 Nov 2025.

Copyright: © 2025 Baratpur, Vahdat-Nejad, Arslan, Hassannataj Joloudari and Gaftandzhieva. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Silvia Gaftandzhieva, sissiy88@uni-plovdiv.bg

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.