ORIGINAL RESEARCH article

Front. Physiol.

Sec. Computational Physiology and Medicine

Volume 16 - 2025 | doi: 10.3389/fphys.2025.1594277

Machine Learning Prediction and Interpretability Analysis of High-risk Chest pain: A study from the MIMIC-IV database

Provisionally accepted
Hongyi  ChenHongyi Chen1Haiyang  SongHaiyang Song1Hongyu  HuangHongyu Huang2Xiaojun  FangXiaojun Fang3Chen  HuangChen Huang4Qingqing  YangQingqing Yang5Junyu  ZhangJunyu Zhang6Wenjun  DingWenjun Ding7Zheng  GongZheng Gong2*Jun  KeJun Ke4*
  • 1Department of Emergency, Fujian Provincial Hospital, Fuzhou, Fujian Province, China
  • 2Shengli Clinical Medical College, Fujian Medical University, Fuzhou, Fujian Province, China
  • 3Fujian Sanbo Funeng Brain Hospital, Fuzhou, Fujian Province, China
  • 4Fuzhou University Affiliated Provincial Hospital, Fuzhou, Fujian Province, China
  • 5Fujian Provincial Key Laboratory of Cardiovascular Diseases, Fujian Provincial Hospital, Fuzhou, Fujian Province, China
  • 6Guangxi Normal University, Guilin, China
  • 7Xiamen University, Xiamen, Fujian Province, China

The final, formatted version of the article will be published soon.

High-risk chest pain is a critical symptom in emergency departments, often associated with severe diseases like heart and lung conditions. Timely diagnosis is essential for improving patient survival. This study proposes a machine learning-based high-risk chest pain prediction model using the MIMIC-IV database. Feature engineering, SMOTE, and under-sampling were applied to address class imbalance, while Bayesian optimization fine-tuned the model. A total of 14,716 patients, including 1,302 with high-risk chest pain, were analyzed. Various machine learning algorithms, including Logistic Regression, Random Forest, SVM, XGBoost, LightGBM, TabTransformer, and TabNet, were compared. The LightGBM model achieved the highest accuracy (0.95), precision (0.95), recall (0.95), and F1 score (0.94). SHAP analysis identified maximum troponin and creatine kinase isoenzyme MB as the most important predictors. The LightGBM model outperformed others in predicting high-risk chest pain, offering a promising tool for clinicians to make accurate diagnoses and improve patient outcomes in emergency care.

Keywords: Bayesian optimization, Model interpretability, High-risk chest pain prediction, MIMIC-IV, machine learning (ML)

Received: 15 Mar 2025; Accepted: 12 Jun 2025.

Copyright: © 2025 Chen, Song, Huang, Fang, Huang, Yang, Zhang, Ding, Gong and Ke. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Zheng Gong, Shengli Clinical Medical College, Fujian Medical University, Fuzhou, Fujian Province, China
Jun Ke, Fuzhou University Affiliated Provincial Hospital, Fuzhou, Fujian Province, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.