ORIGINAL RESEARCH article
Front. Netw. Physiol.
Sec. Networks in the Cardiovascular System
This article is part of the Research TopicArtificial Intelligence in Cardiovascular ResearchView all 9 articles
Coronary Artery Disease Prediction using Bayesian-Optimized Support Vector Machine with Feature Selection
Provisionally accepted- 1University of Birjand, Birjand, Iran
- 2KTO Karatay Universitesi, Konya, Türkiye
- 3Plovdivski universitet Paisij Hilendarski, Plovdiv, Bulgaria
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Cardiovascular diseases, particularly Coronary Artery Disease (CAD), remain a leading cause of mortality worldwide. Invasive angiography, while accurate, is costly and risky. This study proposes a non-invasive, interpretable CAD prediction framework using the Z-Alizadeh Sani dataset. A hybrid decision tree–AdaBoost method selects 30 clinically relevant features. To prevent data leakage, SMOTE oversampling is applied exclusively within each training fold of a 10-fold cross-validation pipeline. The SVM is optimized using Bayesian hyperparameter tuning, compared against Sea Lion Optimization (SLOA) and grid search. Ablation studies and Wilcoxon signed-rank tests confirm the statistical superiority of the proposed SVM_Bayesian model. SHapley Additive exPlanations (SHAP) analysis reveals clinically meaningful feature contributions (e.g., Typical Chest Pain, Age, EF-TTE). 95% bootstrap confidence intervals and temporal generalization on an independent test set ensure robustness and no overfitting. The proposed model achieves 97.67% accuracy, 95.45% precision, 100.00% sensitivity, 97.67% F1-score, and 99.00% AUC, outperforming logistic regression (93.02% accuracy, 92.68% F1-score), random forest (95.45% accuracy, 93.33% F1-score), standard SVM (77.00% accuracy), and SLOA-optimized SVM (93.02% accuracy). Future work includes validation on external real-world datasets. This framework suggests a transparent, generalizable, and clinically actionable tool for CAD risk stratification, and is in keeping with the tenets of network physiology with its focus on interconnected cardiovascular features in predicting systemic disease.
Keywords: Coronary artery disease prediction, Support vector machine, Bayesian optimization, Sealion optimization, Feature Selection, Network physiology
Received: 02 Jul 2025; Accepted: 19 Nov 2025.
Copyright: © 2025 Baratpur, Vahdat-Nejad, Arslan, Hassannataj Joloudari and Gaftandzhieva. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Silvia Gaftandzhieva, sissiy88@uni-plovdiv.bg
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
