Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Pharmacol.

Sec. Respiratory Pharmacology

Development and validation of a clinical prediction model for in-hospital mortality of severe pneumonia based on machine learning

Provisionally accepted
Kai  XieKai XieXiajin  HuangXiajin HuangZhen  LiZhen LiWenjing  YinWenjing YinXiaoxuan  HeXiaoxuan HeXinyu  MiaoXinyu MiaoHaifeng  WangHaifeng Wang*
  • The First Affiliated Hospital of Henan University of Traditional Chinese Medicine, Zhengzhou, China

The final, formatted version of the article will be published soon.

Objective: We aimed to develop an interpretable model to predict the mortality risk for severe pneumonia patients. Methods: The study retrospectively employed data from severe pneumonia patients at two hospitals as the training set for the model development. Patients with severe pneumonia admitted from the same two hospitals were prospectively included as the test set for the model evaluation. A total of 115 candidate features were extracted. The least absolute shrinkage and selection operator (LASSO) regression was applied to select features for the establishment of five models: logistic regression (LR), support vector machine (SVM), decision tree (DT), random forest (RF), and extreme gradient boosting (XGBoost). The performance of the models was assessed from discrimination, calibration and clinical practicability. The optimal model was screened out, and SHapley Additive exPlanation (SHAP) method was used to explain. Results: A total of 323 eligible patients with severe pneumonia were enrolled, including 226 patients in the training set and 97 in test set. The XGBoost model demonstrated the third highest area under the receiver operating characteristic (AUROC), along with optimal calibration and clinical practicability. The SHAP value of the XGBoost model indicated that the application of retention catheterization was identified as the most important influential predictor in the model, followed by oral Chinese herbal decoction, BUN level, age, application of tracheotomy, complication of septic shock, and TCM syndrome (pathogenic qi falling into and prostration syndrome). Conclusions: Older age, increased blood urea nitrogen (BUN) level, complication of septic shock, tracheotomy application, retention catheterization application, oral Chinese herbal decoction, and TCM syndrome (pathogenic qi falling into and prostration syndrome) may be potential risk factors that affect mortality in severe pneumonia, while application of tracheotomy and oral Chinese herbal decoction were associated with reduced mortality. The XGBoost model exhibits superior overall performance in predicting hospital mortality risk for severe pneumonia, greater than traditional scoring systems such as Pneumonia Severity Index (PSI), Sequential Organ Failure Assessment (SOFA), and Acute Physiology and Chronic Health Evaluation II (APACHE II), which assists clinicians in prognostic assessment, resulting in improved therapeutic strategies and optimal resource allocation for patients.

Keywords: Severe pneumonia, machine learning, Prediction model, Mortality, Traditional Chinese Medicine

Received: 07 Jul 2025; Accepted: 31 Oct 2025.

Copyright: © 2025 Xie, Huang, Li, Yin, He, Miao and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Haifeng Wang, wangh_f@126.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.