Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med.

Sec. Pulmonary Medicine

Volume 12 - 2025 | doi: 10.3389/fmed.2025.1616712

Development and Validation of the Machine Learning Model for Acute Exacerbation of Chronic Obstructive Pulmonary Disease Prediction Based on Inflammatory Biomarkers

Provisionally accepted
Ye  ZhuYe ZhuMeng  WangMeng WangXin-Nan  GuXin-Nan GuCen  WangCen WangSu-Min  DengSu-Min Deng*
  • Yixing People's Hospital, Yixing, China

The final, formatted version of the article will be published soon.

Objective: Acute exacerbation of chronic obstructive pulmonary disease (AECOPD) is a major cause of hospitalization and mortality in COPD patients. Current prediction methods rely primarily on clinical symptoms and physician experience, lacking objective and precise tools. This study aimed to integrate multiple inflammatory biomarkers to develop and compare machine learning models for predicting AECOPD, providing evidence for early intervention. Methods: This retrospective study included 763 COPD patients (443 AECOPD, 320 stable COPD), randomly divided into training (n=534) and validation (n=229) cohorts at a 7:3 ratio. Demographic characteristics, comorbidities, and inflammatory indices were collected, including neutrophil-to-lymphocyte ratio (NLR), platelet-tolymphocyte ratio, monocyte-to-lymphocyte ratio (MLR), eosinophil-to-lymphocyte ratio (ELR), and basophil-to-lymphocyte ratio. After variable selection using least absolute shrinkage and selection operator (LASSO) regression, traditional logistic regression (LR) and three machine learning models-random forest, gradient boosting machine (GBM), and support vector machine-were constructed. Model performance was evaluated using receiver operating characteristic curves, calibration curves, and decision curve analysis, with SHapley Additive exPlanations (SHAP) analysis for feature importance interpretation.The GBM model demonstrated superior performance with an area under the curve (AUC) of 0.900 (95%CI: 0.858-0.942), accuracy of 0.948, specificity of 0.952, and sensitivity of 0.944 in the validation cohort, significantly outperforming the traditional LR model (AUC=0.870). SHAP analysis identified MLR (mean SHAP value=0.5), NLR (0.35), and pulmonary heart disease (0.32) as the three most important predictive factors. AECOPD risk increased significantly with rising MLR and NLR values, while ELR showed a negative correlation with AECOPD risk. Decision curve analysis confirmed that the GBM model provided the highest net benefit within clinically relevant threshold ranges (0.2-0.8).The GBM model integrating multiple inflammatory indices effectively predicts AECOPD. Based on routine blood test indicators without requiring expensive additional tests, this model is particularly suitable for resource-limited primary healthcare settings, providing a precise tool for early identification and individualized treatment of AECOPD, potentially improving prognosis and quality of life for COPD patients.

Keywords: chronic obstructive pulmonary disease, Acute exacerbation, machine learning, Inflammatory biomarkers, Monocyte-to-lymphocyte ratio

Received: 23 Apr 2025; Accepted: 07 Jul 2025.

Copyright: © 2025 Zhu, Wang, Gu, Wang and Deng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Su-Min Deng, Yixing People's Hospital, Yixing, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.