- 1Department of National Regional TCM (Lung Disease) Diagnosis and Treatment Center, The First Affiliated Hospital of Henan University of Chinese Medicine, Zhengzhou, China
- 2Academy of Chinese Medical Sciences, Henan University of Chinese Medicine, Zhengzhou, China
- 3Co-construction Collaborative Innovation Center for Chinese Medicine and Respiratory Diseases by Henan & Education Ministry of P.R. China, Henan University of Chinese Medicine, Zhengzhou, China
Objective: We aimed to develop an interpretable model to predict the mortality risk for severe pneumonia patients.
Methods: The study retrospectively employed data from severe pneumonia patients at two hospitals as the training set for the model development. Patients with severe pneumonia admitted from the same two hospitals were prospectively included as the test set for the model evaluation. A total of 115 candidate features were extracted based on clinical relevance and existing literature. The least absolute shrinkage and selection operator (LASSO) regression was applied to select features for the establishment of five models: logistic regression (LR), support vector machine (SVM), decision tree (DT), random forest (RF) and extreme gradient boosting (XGBoost). The performance of the models was assessed from discrimination, calibration and clinical practicability. The optimal model was screened out, and SHapley Additive exPlanation (SHAP) method was used to explain.
Results: A total of 323 eligible patients with severe pneumonia were enrolled, including 226 patients in the training set and 97 in test set. In comparison to the other four models, the XGBoost model demonstrated the third highest area under the receiver operating characteristic (AUROC), along with optimal calibration and clinical practicability. The SHAP value of the XGBoost model indicated that the application of retention catheterization was identified as the most important influential predictor in the model, followed by oral Chinese herbal decoction, blood urea nitrogen (BUN) level, age, application of tracheotomy, complication of septic shock, and TCM syndrome (pathogenic qi falling into and prostration syndrome).
Conclusion: Older age, increased BUN level, complication of septic shock, tracheotomy application, retention catheterization application, oral Chinese herbal decoction, and TCM syndrome (pathogenic qi falling into and prostration syndrome) may be potential risk factors that affect mortality in severe pneumonia, while application of tracheotomy and oral Chinese herbal decoction were associated with reduced mortality. The XGBoost model exhibits superior overall performance in predicting hospital mortality risk for severe pneumonia, greater than traditional scoring systems such as Pneumonia Severity Index (PSI), Sequential Organ Failure Assessment (SOFA), and Acute Physiology and Chronic Health Evaluation II (APACHE II), which assists clinicians in prognostic assessment, resulting in improved therapeutic strategies and optimal resource allocation for patients.
1 Introduction
The 2019 Global Burden of Disease Study indicated that lower respiratory infections ranked as the fourth major cause of global mortality, resulting in over 2.49 million deaths, beaten only by newborn illnesses, ischemic heart disease, and stroke (GBD 2019 Diseases and Injuries Collaborators, 2020). Severe pneumonia is a frequently occurring life-threatening disease characterized by lower respiratory infection, with a high mortality, numerous complications, a poor prognosis, and a significant economic burden (Welte et al., 2012). Furthermore, it is a primary cause of ICU hospitalization and infection-related death around the world (Aliberti et al., 2016). In the United States, pneumonia causes 78% of infection-related deaths (Martin-Loeches et al., 2022). Despite continuous breakthroughs in therapy over the last few decades, severe pneumonia has always been linked with a significant death rate, ranging from 20% to more than 50% (Chen et al., 2020). Thus, the identification of early hospital mortality risk in patients with severe pneumonia is essential and may facilitate appropriate care and clinical decision support.
In recent years, identifying mortality risk factors in patients with severe pneumonia has emerged as the main study focus. Researchers have discovered several factors associated with mortality in patients with severe pneumonia, including high mean platelet volume levels (Chen et al., 2020), increased admission lactate (Huang et al., 2023), C-reactive protein (CRP)-to-albumin ratio (Zhang C. et al., 2023), admission interleukin (IL)-32 concentration (Espana et al., 2006), the Modified Nutrition Risk in Critically ill (mNUTRIC) score (Acehan et al., 2021), elevated stress hyperglycemia ratio (Miao et al., 2024), serum Krebs von den Lungen-6 (Lu et al., 2023), the ratio of total body water to fat-free mass (Tseng et al., 2024), thrombocytopenia (Dos Santos Medeiros et al., 2024), severe thinness (Body Mass Index <16 kg/m2) (Lee et al., 2015), and the presence of septic shock (Que et al., 2015). Nevertheless, these factors are comparatively singular and varied. Despite a systematic review that comprehensively analyze existing literature to identify mortality risk factors for severe pneumonia (Xie et al., 2024), there is an absence of precise prediction applicable to individual cases.
The clinical prediction model can estimate the probability of a specific individual currently suffering from a certain condition or experiencing a certain outcome in the future by assigning relative weights to each predictor variable and combining multiple predictor variables (Ranstam et al., 2016). There has been an increasing number of studies on prediction models worldwide. However, there is an absence of predictive models regarding the mortality risk associated with severe pneumonia that contain traditional Chinese medicine (TCM) characteristics, as well as inadequate comparisons among existing models; moreover, selection and consideration of predictive variables are insufficient. Hence, it is crucial to develop a comprehensive and systematic mortality risk prediction model for severe pneumonia containing TCM characteristics. Based on clinical needs, constructing prediction models can greatly promote the implementation of precision medicine, support thorough clinical diagnosis and evidence-based decision-making, and optimize public health resources allocation.
The advancement of electronic medical record systems has helped in the availability of substantial clinical data. Nonetheless, conventional logistic regression is incapable of managing complex clinical data (Li J. et al., 2021). Currently, artificial intelligence (AI) technology has achieved substantial breakthroughs, introducing novel techniques for data processing and extraction. Machine learning, a core component of AI, can autonomously develop data models, recognize complex data patterns, and predict results based on insights derived from computer algorithms (Bi et al., 2019). Due to the inherent capabilities of machine learning algorithms, an increasing number of researchers support the implementation of novel predictive models based on machine learning to facilitate suitable diagnosis and treatment, compared to conventional illness severity classification systems like the Sequential Organ Failure Assessment (SOFA) score or the Acute Physiology and Chronic Health Evaluation (APACHE) II score (Pirracchio et al., 2015).
Normal supervised machine learning classifiers possess distinct characteristics, and their performance is frequently dependent upon the attributes of the datasets being classified. Logistic regression (LR), support vector machine (SVM), decision tree (DT), random forest (RF), and extreme gradient boosting (XGBoost) are popular machine learning techniques; yet, their specific performance on severe pneumonia datasets remains ambiguous. Therefore, this study aimed to accurately, quickly, and comprehensively predict the individual mortality risk of patients with severe pneumonia and improve prognosis by establishing a mortality risk prediction model for severe pneumonia containing the characteristics of TCM using multiple machine learning algorithms.
2 Methods
2.1 Study design and population
This study was conducted to develop a model to predict hospital mortality in patients with severe pneumonia. A retrospective observational study in training set was designed to consecutively enroll patients in wards at the First Affiliated Hospital of Henan University of Chinese Medicine and Henan Provincial Hospital of Chinese Medicine from January 2008 to November 2021. The test set was consistent with patient source for the training set, but prospectively observational study from December 2021 to January 2024. The follow-up of all participants continued until discharge or death. This study was approved by the Ethics Committee of the First Affiliated Hospital of Henan University of Chinese Medicine (No. 2023HL-241-01). All patients or their legal guardians in the test set were asked to sign an informed consent form. However, due to the retrospective nature for the training set, the need to obtain the informed consent was waived by the Ethics Committee of the First Affiliated Hospital of Henan University of Chinese Medicine. This study complied with the principles defined in the Declaration of Helsinki and the International Conference on Harmonization-Good Clinical Practice guidelines.
2.2 Inclusion and exclusion criteria
The inclusion criteria for the training set were: (1) Participants must have a diagnosis of severe pneumonia in accordance with the guidelines established by the Respiratory Society of the Chinese Medical Association (Cao, 2016) or the Infectious Disease Society of America/American Thoracic Society (Metlay et al., 2019); (2) The diagnosis of TCM syndrome must adhere to the Traditional Chinese Medicine Diagnosis and Treatment Guidelines for Community-Acquired Pneumonia (2018 Revised Edition) published by the Chinese Medical Association (Yu et al., 2019); (3) There were no restrictions regarding the gender or comorbidities of the patients, except they had to be 18 years of age or older. The exclusion criteria were: (1) Numerous missing clinical data; (2) A hospital stay of fewer than 3 days.
The inclusion criteria for the test set were: (1) Participants must be diagnosed with severe pneumonia in accordance with the guidelines of the Respiratory Society of the Chinese Medical Association (Cao, 2016) or the Infectious Disease Society of America/American Thoracic Society (Metlay et al., 2019) and recruited within 3 days; (2) There were no restrictions regarding gender or comorbidities, provided participants were 18 years or older; (3) All patients, or their legal representatives in cases where they were unable to provide consent, were required to sign an informed consent form. Besides, individuals with dementia and other mental disorders were excluded.
We excluded patients with clearly diagnosed fungal and viral pneumonia, including severe Influenza A (H1N1), severe acute respiratory syndrome (SARS), or coronavirus disease 2019 (COVID-19) from both the training and test sets to improve homogeneity.
2.3 Outcome definition
The prediction outcome of this study was the probability of in-hospital mortality, defined as deaths during the current hospitalization period, including within 1 day after discharge.
2.4 Features extraction
A total of 115 candidate features were extracted based on clinical relevance and existing literature, encompassing the following categories: demographic characteristics (e.g., age, gender, solar term), clinical manifestations, admission risk factors (e.g., smoking history, recent hospitalization), comorbidities, complications, laboratory results, treatments during hospitalization (e.g., conventional medicine and operations), and TCM-specific variables (e.g., TCM syndromes, use of oral Chinese herbal decoction). This study incorporated the solar term as a predictive variable. This concept originates from the traditional Chinese lunisolar calendar, which partitions the year into 24 distinct periods, each reflecting a specific phase of climatic and environmental change. TCM theory posits that these rhythmic, seasonal transitions can modulate human physiological balance and susceptibility to disease. Therefore, including solar term allows the model to capture potential seasonality effects on the prognosis of severe pneumonia, which aligns with a holistic TCM approach to medicine. For laboratory results, the first value obtained within 24 h of admission was used. It should be noted that the variable oral Chinese herbal decoction was recorded as a binary feature indicating whether the patient was prescribed and administered any Chinese herbal decoction orally for at least five consecutive days during hospitalization. The oral Chinese herbal decoctions were not a fixed formula but were individualized prescriptions formulated by licensed TCM physicians based on real-time pattern differentiation according to the patient’s evolving clinical manifestations and TCM syndrome diagnosis. This reflects the standard, personalized approach of TCM clinical practice. As such, the specific herbal components and dosages varied across patient.
2.5 Missing data handling
Investigated and confirmed outliers and missing numbers in the original electronic medical records database. If verification or supplementation was not possible, consider the outlier as a missing value for processing. Variables with missing data over 25% were removed, while multiple imputation would be employed for those within 25%. Multiple imputation was performed using the mice package in R 4.3.2 software, with 5 imputations and predictive mean matching method. The outcome variable (in-hospital mortality) was included in the imputation model to ensure unbiased estimates.
2.6 Statistical analysis
Every statistical analysis and calculations were employed SPSS 26.0 or R 4.3.2 software. The categorical variables expressed as total numbers and percentages, and the χ2 test or Fisher exact test (expected frequency <10) was employed to compare group differences. The normality test was performed on all continuous variables to ascertain if the data adhered to a normal distribution, mostly using the Shapiro Wilk or Kolmogorov Smirnov tests in conjunction with histograms. If the data adhered to a normal distribution, represented as mean (
2.7 Features selection
Patients with severe pneumonia were categorized into non-survivor and survivor groups based on in-hospital mortality, and characteristics were presented and compared between the groups. The 115 features collected from the training set underwent statistical analysis to identify variables with significant differences between the non-survivor and survivor groups. Additionally, to prevent overfitting, the Least Absolute Shrinkage and Selection Operator (LASSO) using the glmnet package in R 4.3.2 software was employed with 10-fold cross-validation to identify and refine candidate predictors (Muthukrishnan and Rohini, 2016). The simplest subset of predictive factors was chosen to identify the independent features for inclusion in the in-hospital mortality risk prediction model for severe pneumonia.
2.8 Model development and performance evaluation
Five machine learning algorithms were employed to develop predictive models: LR, SVM, DT, RF, and XGBoost in R version 4.3.2. To ensure a fair and transparent baseline comparison of the algorithms’ out-of-the-box performance, all models were run using their respective package’s default hyperparameters. A detailed parameter table has been added as Supplementary Table S1. For the LR model, which assumes independence among predictors, we assessed multicollinearity using the variance inflation factor (VIF) implemented in the car package R version 4.3.2. A VIF value exceeding 10 was considered indicative of severe multicollinearity, following common statistical conventions (O’brien, 2007). The discrimination of each model was assessed using the area under the receiver operating characteristic curve (AUROC) and confusion matrix. We incorporated balanced accuracy and matthew correlation coefficient (MCC), as recommended for imbalanced datasets. Besides, DeLong test was used to compare AUC values and further evaluate the differences in predictive performance between models. The calibration curve assessed the calibration; furthermore, to test the clinical applicability for decision-making by estimating the net benefit at various threshold probabilities, decision curve analysis (DCA) was conducted (Van Calster et al., 2018). The performance of our optimal machine learning model was benchmarked against traditional severity-of-illness scores, namely the Pneumonia Severity Index (PSI), Sequential Organ Failure Assessment (SOFA), and Acute Physiology and Chronic Health Evaluation II (APACHE II). The traditional severity scores of PSI, SOFA, and APACHE II were calculated according to their standard, widely-used definitions: PSI as defined by Fine et al. (Fine et al., 1997), SOFA as per the original consensus conference definition (Vincent et al., 1996), and APACHE II following Knaus et al. (Knaus et al., 1985). It was important to note that these scores were not used as input features for training any of the machine learning models. They were calculated retrospectively after model development and served as independent, external comparators to evaluate the relative predictive gain of our new model.
3 Results
3.1 Overall flow
The schematic flow of this study was shown in Figure 1. Patients were screened from eight departments across two affiliated hospitals. A retrospective training set (January 2008 to November 2021) and a prospective test set (December 2021 to January 2024) were established. Detailed features were extracted for all included patients. Predictors were then selected from these features using LASSO regression analysis. Subsequently, five machine learning models (LR, SVM, DT, RF, and XGBoost) were developed and their performance was comprehensively evaluated and compared in terms of discrimination, calibration, and clinical practicability to identify the optimal model.
Figure 1. Flow chart of the study procedure. This chart illustrates the key stages of the study, including patient cohort formation, feature extraction, predictor selection, model development, and evaluation. LASSO, Least Absolute Shrinkage and Selection Operator; LR, Logistic Regression; SVM, Support Vector Machine; DT, Decision Tree; RF, Random Forest; XGBoost, Extreme Gradient Boosting.
3.2 Basic features
In the training set, 170 patients (75.2%) were from the First Affiliated Hospital of Henan University of Chinese Medicine, and 56 (24.8%) patients were from Henan Provincial Hospital of Chinese Medicine. The prospective test set comprised 57 patients (58.8%) from the First Affiliated Hospital of Henan University of Chinese Medicine and 40 (41.2%) from Henan Provincial Hospital of Chinese Medicine. A total of 226 adult patients diagnosed with severe pneumonia were included into the final training set for this study, while 97 in the test set. The in-hospital mortality for severe pneumonia was 23.82% in the training set and 29.89% in the test set. No significant difference in mortality was seen between the training and test sets (P > 0.05). The comparison of features between the training set and the test set was presented in Table 1. The training set mainly occurred during the Lesser Cold, Greater Cold, Grain in Beard, and Winter Solstice solar periods, whereas the test set was primarily linked with the Winter Solstice and Lesser Cold. The distribution between the two datasets was statistically significant (P = 0.034). In the training set, the history of alcohol consumption, the presence of pleural effusion comorbidity, the use of three or more antibiotics, the frequency of quinolone antibiotics, as well as hematocrit, ALT, and D-dimer levels, and the duration of mask oxygen therapy were all significantly raised compared to the test set (P < 0.05). The comorbidity of gastrointestinal bleeding and the levels of PCT, myohemoglobin, and PaO2 were considerably reduced compared to the test set (P < 0.05). No substantial difference was observed in other features (P > 0.05).
3.3 Features selection
A comparison of all 115 features was provided in Supplementary Table S2. Table 2 summarized a total of 38 features that demonstrated a statistically significant difference (P < 0.05) between non-survivors and survivors in the training set, which might be the potential risk factors for death in patients with severe pneumonia.
Table 2. Features comparison with significant differences between the non-survivors and the survivors for severe pneumonia patients in the training set.
The LASSO regression identified 7 predictors from above 38 possible risk factors for 226 patients in the training set according to the lambda.1se criterion for predictor selection, which selects the most regularized model whose performance is within one standard error of the optimal lambda (Figure 2). All of the 7 predictors including age, TCM syndrome (pathogenic qi falling into and prostration syndrome), complication of septic shock, BUN level, tracheotomy application, retention catheterization application, and oral Chinese herbal decoction entered the final LR, SVM, DT, RF and XGBoost models.
Figure 2. Selection of predictors by LASSO regression analysis with 10-fold cross-validation. (A) A coefficient profile plot was generated against the log (lambda) sequence. (B) The selection of the parameter (lambda) of deviance in LASSO regression was tuned according to the minimum and 1se criterion, indicated by the left and right dotted lines, respectively.
3.4 Model development, evaluation and comparison
3.4.1 Discrimination
The RF model demonstrated superior performance in the training set, attaining an accuracy of 0.982, a recall of 1.000, a precision of 0.941, an F1 score of 0.970, and an AUC of 0.999. The excessive high value of the indicators might be related to the overfitting of this model. The SVM indicators exhibited the lowest values among the five models, with an accuracy of 0.159, recall of 0.156, precision of 0.068, F1 score of 0.095, and AUC of 0.900. Moreover, the AUC of all five prediction models over 0.9, signifying a strong fitting performance in the training set. In the test set, the SVM model showed significantly inferior accuracy, recall, precision, and F1 score compared to other models, yet had the best AUC. The RF model revealed superior performance in accuracy, recall, precision, and F1 score metrics, with an AUC value ranking second only to the SVM model among the five models. The XGBoost model had the third best AUROC (0.853). The predicted value of the XGBoost model exceeded that of the PSI, SOFA, and APACHE II scoring systems, which showed AUC values of 0.808, 0.819, and 0.837, respectively. The RF model achieved the highest balanced accuracy and MCC both in the training and test sets, further confirming its robust and balanced performance in predicting both mortality and survival cases. The XGBoost model demonstrated a balanced accuracy of 0.757 and MCC of 0.472 on the test set. In contrast, the SVM model showed notably lower values in these metrics, aligning with its poor performance observed in other measures. The comprehensive results of the discrimination among the five models presented in Table 3, while the ROC curves are illustrated in Figures 3, 4. The DeLong test showed that there were significant differences in AUC between the RF model and others in the training set, also between the XGBoost and SVM models (P < 0.05); but there was no significant difference among the models in the test set (P > 0.05). The results of DeLong test illustrated in Figure 5.
Table 3. Comparing the discrimination of the five severe pneumonia hospital mortality prediction models.
Figure 3. Comparison of ROC curves between different models in the training set (A) and test set (B).
Figure 4. Comparison of ROC curves between different scoring systems of the PSI, SOFA, and APACHE Ⅱ in the test set.
Figure 5. Comparison of AUC values by DeLong test between different models in the training set (A) and test set (B).
3.4.2 Calibration
In the training set, the DT model exhibited the most optimal calibration, succeeded by XGBoost, LR, RF, and SVM models. In the test set, the calibration performance ranked from highest to lowest as follows: XGBoost, RF, LR, DT, and SVM model (Figure 6).
Figure 6. Comparison of calibration curves between different models in the training set (A) and test set (B). The calibration curve plots the predicted probability of in-hospital mortality against the observed frequency. The dashed diagonal line represents a perfectly calibrated model. Closer proximity of a model’s curve to the diagonal indicates better calibration, meaning its predictions are more reliable and trustworthy.
3.4.3 Clinical practicability
In the training set, the net benefit of the RF model exceeded that of the DT, LR, XGBoost, and SVM models as indicated by the DCA. In the test set, the XGBoost model exhibited the highest net benefit, while the SVM model performed the poorest, indicating that the XGBoost model was the most optimal. Moreover, with the exception of the SVM model, DCA curves showed that the other four models demonstrate clinical value (Figure 7).
Figure 7. Comparison of DCA between different models in the training set (A) and test set (B). X-axis indicates the threshold probability and Y-axis indicates the net benefit. The dashed gray line indicates that all severe pneumonia patients had hospital death, while the dashed yellow line indicates that no patient had hospital death.
3.4.4 The VIF analysis
The VIF values for seven predictors in our final LR model were all below 10 (analyzed using the car package, version 3.1-2, in R)), suggesting that multicollinearity was not a significant concern that would destabilize the model. The detailed results were shown in Table 4.
3.5 Optimal model analysis
The XGBoost model was selected as the optimal model based on its balanced and superior overall performance in discrimination (AUROC = 0.853), calibration, and clinical practicability (as evidenced by DCA) on the test set, compared to the other four models and traditional scoring systems.
To interpret the XGBoost model and assess the contribution of each predictor, we calculated SHapley Additive exPlanations (SHAP) values on the test set. Figure 8 presents the summary plot of the SHAP values for the seven predictors selected by the LASSO regression. The mean absolute SHAP value (shown in the bar plot on the left) represents the average impact of each feature on the model output. The application of retention catheterization was identified as the most important influential predictor in the model, followed by oral Chinese herbal decoction, BUN level, age, application of tracheotomy, complication of septic shock, and TCM syndrome (pathogenic qi falling into and prostration syndrome). For each predictor, the scatter plot shows how its value affects the prediction: in the context of this model, higher values of BUN and age, as well as the presence of septic shock, retention catheterization, and the TCM syndrome (yellow dots), were generally associated with positive SHAP values, suggesting their role as risk factors in the decision process of model. Conversely, the application of tracheotomy and oral Chinese herbal decoction (yellow dots) were associated with negative SHAP values, indicating that these factors contributed to a lower predicted risk in the model’s output.
Figure 8. Interpretation of the optimal XGBoost model using SHAP values. SHAP values indicate the direction of the contribution of a single feature to the model output. Left: The bar plot shows the mean absolute SHAP value for each feature, representing its overall importance in the model. Right: The beeswarm plot shows the distribution of SHAP values for each feature for every patient in the test set. The x-axis indicates the SHAP value (impact on the model output), where a positive value increases the predicted risk of death. The color indicates the feature value for an individual patient (yellow: high value or presence of the factor; purple: low value or absence).
4 Discussion
4.1 Principal findings
Our study retrospectively gathered clinical data from 226 patients with severe pneumonia for the training set, of whom 64 died in-hospital, resulting in a mortality of 28.32%, consistent with findings from prior studies (Kassaw et al., 2023). The Lasso regression analysis was conducted to identify risk factors associated with severe pneumonia mortality in relation to Chinese and conventional medicine, including age, complication of septic shock, BUN level, TCM syndrome (pathogenic qi falling into and prostration syndrome), application of tracheotomy, retention catheterization, and oral Chinese herbal decoction. The implementation of tracheotomy and the administration of oral Chinese herbal decoction were strongly associated with improved mortality outcomes of patients with severe pneumonia. We constructed and validated models capable of predicting mortality in patients with severe pneumonia using routinely available clinical data, and compared five machine learning algorithms. The XGBoost model is superior to the overall performance of LR, SVM, DT, RF, as well as the scoring systems of PSI, SOFA, and APACHE II. The SHAP method explains the XGBoost model, so enhancing both model performance and clinical interpretability. This model may possess potential utility in personalized surveillance prognosis, facilitating improved therapy schedules and appropriate resource allocation for patients.
Machine learning is characterized by its applicability to various types of datasets, resulting in its widespread utilization. Nonetheless, various algorithms possess distinct benefits, and their capacity and efficacy in problem-solving mostly depend on the characteristics of data aspects and the performance of algorithms. Consequently, evaluating the efficacy of various machine learning algorithms on a particular dataset to identify the ideal model, together with employing feature importance analysis to enhance comprehension of presented features, is highly significant (Domaratzki and Kidane, 2022). The most significant predictive factor in the optimal XGBoost model is the application of retention catheterization, succeeded by oral Chinese herbal decoction, BUN level, age, tracheotomy application, complication of septic shock, and TCM syndrome (pathogenic qi falling into and prostration syndrome).
Patients with severe pneumonia frequently present with multiple underlying diseases, and those in critical condition often suffer from consciousness disorders, hindering their ability to urinate autonomously; thus, the application of retention catheterization is required. However, this study identified that indwelling catheters are an important risk factor for mortality due to severe pneumonia, corroborating findings from previous studies (Zhu et al., 2023).
Recent studies have demonstrated that TCM, when combined with conventional treatment, offers improvements in the management of severe pneumonia (Xie et al., 2023). Our research showed that a duration of over 5 days of oral Chinese herbal decoction might decrease the mortality risk of severe pneumonia, hence revealing the efficacy of TCM for treating severe pneumonia based on syndrome differentiation.
BUN is a primary end product of protein metabolism in the human body and serves as a crucial indication for assessing kidney function. The lung and kidney exhibit complex connections, both playing crucial organs in regulating acid-base and fluid balance (Hepokoski et al., 2018). In addition, any impairment to the kidneys can substantially impact the lungs by disturbing normal the pH and fluid distribution balance. Furthermore, the kidneys may promote the progression and regulate of pulmonary illnesses by the production or elimination of mediators. The interaction between the lungs and kidneys highlights their mutual dependence and impact on overall physiological function (Wang et al., 2021). A study indicated that patients with acute kidney injury and pneumonia exhibited a greater mortality compared to those with either condition alone (Chawla et al., 2017). Furthermore, a diagnostic criterion for severe pneumonia includes BUN levels, signifying a strong association between BUN and disease severity. Our study demonstrates that BUN is a significant risk factor for increased mortality in patients with severe pneumonia, potentially attributable to the relationship between the lung microbiome in these patients and kidney damage (Du et al., 2022).
With advancing age, the human immune system experiences various alterations, resulting in diminished capacity to efficiently trigger cellular responses against pathogens. The chemotactic capacity of polymorphonuclear leukocytes in the elderly is weakened, and the microbial uptake and antigen processing capabilities of macrophages are correspondingly reduced (Mahendra et al., 2018). Moreover, age-related factors such as chronic comorbidities, alterations in immunological physiology, and malnutrition substantially increase the risk of infection in the elderly (Arvaniti et al., 2022). The results of this study indicate that the risk of mortality from severe pneumonia increases with age, which is consistent with previous research findings (Li Y. et al., 2021) and potentially linked to age-associated chronic disorders and/or diminished immune function (Wang et al., 2020).
Individuals with severe pneumonia display a substantial elevation in airway secretions. When accompanied with consciousness problems, severe cerebral infarction, traumatic brain injury, or additional problems, respiratory function becomes impaired, requiring ventilator support. Elderly patients, due to their numerous medical conditions, are susceptible to difficulties such as the accumulation of airway secretions, respiratory obstruction, and throat injury during prolonged laryngotracheal intubation, potentially resulting in complications such ventilator-associated pneumonia (Kaese et al., 2016). Therefore, for severe pneumonia patients on prolonged ventilator support with stable conditions, tracheotomy may be considered if extended ventilatory assistance is essential. Our study found that tracheotomy serves as a preventive factor against mortality associated with severe pneumonia, significantly reducing the risk of death. Nonetheless, owing to limitations in clinical data collection, the precise best present moment for incision requires additional investigation.
Pneumonia is the major cause of septic shock, responsible for 50% of cases (Guzzardella et al., 2023; Spencer et al., 2022; Güell et al., 2019). A retrospective clinical survey of 710 patients indicated that the mortality for individuals with severe pneumonia complicated with septic shock was greater than for those without septic shock (Güell et al., 2019). Our study identified concurrent septic shock as a significant risk factor for increased mortality in patients with severe pneumonia, corroborating findings from prior research (Espinoza et al., 2019) and mutually confirming that septic shock is one of the two primary diagnostic criteria for severe pneumonia (Ferrer et al., 2018).
TCM syndromes serve as significant indicators for disease progression, aiding in the prognostic assessment of patients according to their syndrome classifications or developments (Lu and Deng, 2017). Previous studies have shown that common symptoms of severe pneumonia include phlegm-heat obstructing lung syndrome, deficiency of both qi and yin syndrome, pathogenic qi falling into and prostration syndrome, and phlegm turbidity obstructing lung syndrome (Zhang et al., 2022). Our study identified that the pathogenic qi falling into and prostration syndrome were risk factors for mortality in severe pneumonia, with the presence of this syndrome frequently indicating a fatal outcome.
Furthermore, the distribution of cases across different solar terms showed a significant difference between the training and test sets. While the clinical relevance of this TCM-based temporal variable requires further investigation, it highlights the potential influence of seasonal climatic factors, as conceptualized in TCM, on the presentation or course of severe pneumonia.
4.2 Strengths compared to previous constructed models
Currently, multiple predictive models exist concerning the mortality risk associated with severe pneumonia. We did a thorough search and systematic comparison, revealing that the model we developed possesses particular characteristics and advantages. A recent study developed an ensemble machine learning model for in-hospital mortality in severe pneumonia, reporting a competitive AUC of 0.878 in their internal validation set (Zhao et al., 2025), which is comparable to our finding (AUC = 0.853). We note that their model, while performing excellently, was derived from a single-center retrospective cohort. Our study complements this by incorporating TCM characteristics and employing prospective validation for our test set, enhancing the clinical applicability and uniqueness of our model. A study established the LR, gradient-boosted decision tree (LightGBM), and multilayer perceptron (MLP) models to forecast ICU mortality in patients with severe pneumonia (Pan et al., 2023). The best MLP model achieved an AUC of 0.838, which was inferior than the AUC of 0.853 obtained by our XGBoost model. Three studies constructed multivariable LR models with an AUC of 0.836 (Shang et al., 2023), 0.728 (Zhu et al., 2021) and 0.76 (Niu et al., 2025) for predicting in-hospital mortality in elderly patients with severe community-acquired pneumonia (SCAP). Additionally, another LR model, which lacked validation, reported an AUC as high as 0.915 in the training set (Zhu et al., 2021). A study (Pan et al., 2023) exclusively employed the LR method rather than machine learning algorithms to develop an in-hospital mortality risk prediction model for patients with SCAP. An alternative LR model predicting 30-day mortality in ICU patients with SCAP exhibited a lower AUC of 0.756 (Zhang Y. et al., 2023). Nevertheless, our research additionally produced four models: SVM, DT, RF, and XGBoost, with the performance of our best model superior than that of models developed in prior studies.
In summary, the model we developed possesses the following advantages: First of all, we employed multiple algorithms for machine learning, including SVM, DT, RF, and XGBoost, rather than solely relying on LR, and identified the optimal XGBoost model. Secondly, the optimal model we have developed exhibits markedly superior discrimination compared to previously published models, with an AUC of 0.853. Thirdly, and most importantly, prior models failed to incorporate TCM features, whereas our study first gathered 115 clinical features. Among the seven risk factors linked to in-hospital mortality in severe pneumonia identified by LASSO regression, two were TCM factors: TCM syndrome (pathogenic qi dropping into and prostration syndrome) and oral Chinese herbal decoction. Consequently, our model could provide a more comprehensive review of the severe pneumonia patients state and yield reliable predictions.
4.3 Limitations
Our study has several limitations that should be acknowledged. Firstly, the retrospective design and relatively limited sample size may introduce risks of unmeasured confounding and missing data bias, despite comprehensive quality control measures. Furthermore, the observed overfitting of the RF model underscores the challenges of model complexity in the context of limited sample size and a substantial number of features. Secondly, our observational design is susceptible to indication bias and unmeasured confounding. Specifically, the variable “oral Chinese herbal decoction” was defined as a binary indicator of whether a patient received any professionally prescribed and hospital-dispensed decoction for a minimum duration during hospitalization. Consistent with standard TCM practice, these decoctions were not standardized but were individualized based on pattern differentiation by certified practitioners. Consequently, the specific herbal composition, dosage, and treatment duration varied between patients. While this reflects real-world clinical application, it means the variable captures a complex intervention, and its observed association with improved survival may reflect its preferential administration to less severely ill patients capable of oral intake, rather than a direct causal effect. Although we adjusted for multiple measures of disease severity, residual confounding cannot be fully excluded. Thirdly, the generalizability of the model may be constrained by its development within a TCM-oriented healthcare context. All data originated from two TCM-affiliated hospitals in China. The distribution of patients between these two centers was uneven in both the training and test sets, with one center (First Affiliated Hospital of Henan University of Chinese Medicine) contributing a larger share of patients. Although our model performed well on the test set which contained a higher proportion of patients from the smaller center (Henan Provincial Hospital of Chinese Medicine), suggesting some robustness to inter-hospital variation, the potential for center-specific bias cannot be entirely ruled out. Therefore, the performance of our model in other geographic regions, or more balanced multi-center settings remains to be determined. External validation in broader, more diverse populations is strongly recommended. Fourthly, our machine learning models were compared using their default hyperparameters. While this approach enhances clinical practicality and reduces the risk of overfitting on our dataset, it may not represent the fully optimized potential of each algorithm. Future studies with larger cohorts could incorporate advanced hyperparameter tuning techniques to further maximize predictive performance. Lastly, although we compared five commonly used machine learning algorithms, other methods were not included. Further investigations could explore a wider range of modeling strategies.
5 Conclusion
Older age, increased BUN level, complication of septic shock, tracheotomy application, retention catheterization application, oral Chinese herbal decoction, and TCM syndrome (pathogenic qi falling into and prostration syndrome) may be potential risk factors that affect mortality in severe pneumonia, while application of tracheotomy and oral Chinese herbal decoction were associated with reduced mortality. The XGBoost model exhibits superior overall performance in predicting hospital mortality risk for severe pneumonia, greater than traditional scoring systems such as PSI, SOFA, and APACHE II, which assists clinicians in prognostic assessment, resulting in improved therapeutic strategies and optimal resource allocation for patients.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.
Ethics statement
The studies involving humans were approved by the Ethics Committee of the First Affiliated Hospital of Henan University of Chinese Medicine. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.
Author contributions
KX: Methodology, Conceptualization, Project administration, Visualization, Software, Writing – original draft, Data curation, Funding acquisition. XH: Data curation, Writing – review and editing, Investigation. ZL: Data curation, Writing – review and editing, Investigation. WY: Investigation, Data curation, Writing – review and editing. XoH: Writing – review and editing, Data curation, Investigation. XM: Data curation, Investigation, Writing – review and editing. HW: Funding acquisition, Project administration, Writing – review and editing, Writing – original draft, Supervision, Methodology, Data curation, Conceptualization.
Funding
The authors declare that financial support was received for the research and/or publication of this article. This study was supported by the National Natural Science Foundation of China (No. 81774222, 82074411), Henan Provincial Key R&D and Promotion Program (No. 252102310477), Scientific Research Special Project of the Henan Provincial Health Commission and the National Center for Inheritance and Innovation of Chinese Medicine (No. 2024ZXZX1117), Henan Province Traditional Chinese Medicine Top Level to Creation of a special scientific research topic (No. HSRP-DFCTCM-2023-3-21, HSRP-DFCTCM-2023-8-06).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphar.2025.1660893/full#supplementary-material
References
Acehan, S., Gulen, M., Isikber, C., Unlu, N., Sumbul, H. E., Gulumsek, E., et al. (2021). mNUTRIC tool is capable to predict nutritional needs and mortality early in patients suffering from severe pneumonia. Clin. Nutr. ESPEN 45, 184–191. doi:10.1016/j.clnesp.2021.08.030
Aliberti, S., Reyes, L. F., Faverio, P., Sotgiu, G., Dore, S., Rodriguez, A. H., et al. (2016). Global initiative for meticillin-resistant Staphylococcus aureus pneumonia (GLIMP): an international, observational cohort study. Lancet Infect. Dis. 16 (12), 1364–1376. doi:10.1016/S1473-3099(16)30267-5
Arvaniti, K., Dimopoulos, G., Antonelli, M., Blot, K., Creagh-Brown, B., Deschepper, M., et al. (2022). Epidemiology and age-related mortality in critically ill patients with intra-abdominal infection or sepsis: an international cohort study. Int. J. Antimicrob. Agents. 60 (1), 106591. doi:10.1016/j.ijantimicag.2022.106591
Bi, Q., Goodman, K. E., Kaminsky, J., and Lessler, J. (2019). What is machine learning? A primer for the epidemiologist. Am. J. Epidemiol. 188 (12), 2222–2239. doi:10.1093/aje/kwz189
Cao, B. (2016). Diagnosis and treatment guidelines for adult community acquired pneumonia in China (2016 edition). Chin. J. Tuberc. Respir. Dis. 39 (4), 253–279. doi:10.3760/cma.j.issn.1001-0939.2016.04.005
Chawla, L. S., Amdur, R. L., Faselis, C., Li, P., Kimmel, P. L., and Palant, C. E. (2017). Impact of acute kidney injury in patients hospitalized with pneumonia. Crit. Care Med. 45 (4), 600–606. doi:10.1097/CCM.0000000000002245
Chen, J., Li, Y., Zeng, Y., Tian, Y., Wen, Y., and Wang, Z. (2020). High mean platelet volume associates with In-Hospital mortality in severe pneumonia patients. Mediat. Inflamm. 2020, 8720535. doi:10.1155/2020/8720535
Domaratzki, M., and Kidane, B. (2022). Deus ex machina? Demystifying rather than deifying machine learning. J. Thorac. Cardiovasc. Surg. 163 (3), 1131–1137.e4. doi:10.1016/j.jtcvs.2021.02.095
Dos Santos Medeiros, S. M. D. F., Sousa Lino, B. M. N., Perez, V. P., Sousa, E. S. S., Campana, E. H., Miyajima, F., et al. (2024). Predictive biomarkers of mortality in patients with severe COVID-19 hospitalized in intensive care unit. Front. Immunol. 15, 1416715. doi:10.3389/fimmu.2024.1416715
Du, S., Wu, X., Li, B., Wang, Y., Shang, L., Huang, X., et al. (2022). Clinical factors associated with composition of lung microbiota and important taxa predicting clinical prognosis in patients with severe community-acquired pneumonia. Front. Med. 16 (3), 389–402. doi:10.1007/s11684-021-0856-3
Espana, P. P., Capelastegui, A., Gorordo, I., Esteban, C., Oribe, M., Ortega, M., et al. (2006). Development and validation of a clinical prediction rule for severe community-acquired pneumonia. Am. J. Respir. Crit. Care Med. 174 (11), 1249–1256. doi:10.1164/rccm.200602-177OC
Espinoza, R., Silva, J., Bergmann, A., de Oliveira, M. U., Calil, F. E., Santos, R. C., et al. (2019). Factors associated with mortality in severe community-acquired pneumonia: a multicenter cohort study. J. Crit. Care. 50, 82–86. doi:10.1016/j.jcrc.2018.11.024
Ferrer, M., Travierso, C., Cilloniz, C., Gabarrus, A., Ranzani, O. T., Polverino, E., et al. (2018). Severe community-acquired pneumonia: characteristics and prognostic factors in ventilated and non-ventilated patients. PLoS One 13 (1), e0191721. doi:10.1371/journal.pone.0191721
Fine, M. J., Auble, T. E., Yealy, D. M., Hanusa, B. H., Weissfeld, L. A., Singer, D. E., et al. (1997). A prediction rule to identify low-risk patients with community-acquired pneumonia. N. Engl. J. Med. 336 (4), 243–250. doi:10.1056/NEJM199701233360402
GBD 2019 Diseases and Injuries Collaborators (2020). Global burden of 369 diseases and injuries in 204 countries and territories, 1990-2019: a systematic analysis for the global burden of disease study 2019. Lancet 396 (10258), 1204–1222. doi:10.1016/S0140-6736(20)30925-9
Guzzardella, A., Motos, A., Vallverdú, J., and Torres, A. (2023). Corticosteroids in sepsis and community-acquired pneumonia. Med. Klin. Intensivmed. Notfmed. 118 (Suppl 2), 86–92. doi:10.1007/s00063-023-01093-w
Güell, E., Martín-Fernandez, M., De la Torre, M. C., Palomera, E., Serra, M., Martinez, R., et al. (2019). Impact of lymphocyte and neutrophil counts on mortality risk in severe community-acquired pneumonia with or without septic shock. J. Clin. Med. 8 (5), 754. doi:10.3390/jcm8050754
Hepokoski, M. L., Bellinghausen, A. L., Bojanowski, C. M., and Malhotra, A. (2018). Can we DAMPen the cross-talk between the lung and kidney in the ICU? Am. J. Respir. Crit. Care Med. 198 (9), 1220–1222. doi:10.1164/rccm.201712-2573RR
Huang, D., He, D., Yao, R., Wang, W., He, Q., Wu, Z., et al. (2023). Association of admission lactate with mortality in adult patients with severe community-acquired pneumonia. Am. J. Emerg. Med. 65, 87–94. doi:10.1016/j.ajem.2022.12.036
Kaese, S., Zander, M. C., and Lebiedz, P. (2016). Successful use of early percutaneous dilatational tracheotomy and the No sedation concept in respiratory failure in critically ill obese subjects. Respir. Care. 61 (5), 615–620. doi:10.4187/respcare.04333
Kassaw, G., Mohammed, R., Tessema, G. M., Yesuf, T., Lakew, A. M., and Tarekegn, G. E. (2023). Outcomes and predictors of severe community-acquired pneumonia among adults admitted to the university of gondar comprehensive specialized hospital: a prospective Follow-up study. Infect. Drug Resist. 16, 619–635. doi:10.2147/IDR.S392844
Knaus, W. A., Draper, E. A., Wagner, D. P., and Zimmerman, J. E. (1985). APACHE II: a severity of disease classification system. Crit. Care. Med. 13 (10), 818–829. doi:10.1097/00003246-198510000-00009
Lee, J., Kim, K., Jo, Y. H., Lee, J. H., Kim, J., Chung, H., et al. (2015). Severe thinness is associated with mortality in patients with community-acquired pneumonia: a prospective observational study. Am. J. Emerg. Med. 33 (2), 209–213. doi:10.1016/j.ajem.2014.11.019
Li, J., Gong, M., Joshi, Y., Sun, L., Huang, L., Fan, R., et al. (2021). Machine learning prediction model for acute renal failure after acute aortic syndrome surgery. Front. Med. 8, 728521. doi:10.3389/fmed.2021.728521
Li, Y., Wang, C., and Peng, M. (2021). Aging immune system and its correlation with liability to severe lung complications. Front. Public Health 9, 735151. doi:10.3389/fpubh.2021.735151
Lu, Y., and Deng, M. (2017). Inspiration from golden chamber synopsis for critically ill disease prognosis. J. Tradit. Chin. Med. 58 (08), 661–663. doi:10.13288/j.11-2166/r.2017.08.009
Lu, R., Yang, H., Peng, W., Tang, H., Li, Y., Lin, F., et al. (2023). Serum Krebs von den Lungen-6 is associated with in-Hospital mortality of patients with severe Community-Acquired Pneumonia: a retrospective cohort study. Clin. Chim. Acta. 548, 117524. doi:10.1016/j.cca.2023.117524
Mahendra, M., Jayaraj, B. S., Limaye, S., Chaya, S. K., Dhar, R., and Mahesh, P. A. (2018). Factors influencing severity of community-acquired pneumonia. Lung India 35 (4), 284–289. doi:10.4103/lungindia.lungindia_334_17
Martin-Loeches, I., Garduno, A., Povoa, P., and Nseir, S. (2022). Choosing antibiotic therapy for severe community-acquired pneumonia. Curr. Opin. Infect. Dis. 35 (2), 133–139. doi:10.1097/QCO.0000000000000819
Metlay, J. P., Waterer, G. W., Long, A. C., Anzueto, A., Brozek, J., Crothers, K., et al. (2019). Diagnosis and treatment of adults with community-acquired pneumonia. An official Clinical Practice guideline of the American thoracic society and infectious diseases society of America. Am. J. Respir. Crit. Care Med. 200 (7), e45–e67. doi:10.1164/rccm.201908-1581ST
Miao, L., Shen, X., Du, Z., and Liao, J. (2024). Stress hyperglycemia ratio and its influence on mortality in elderly patients with severe community-acquired pneumonia: a retrospective study. Aging Clin. Exp. Res. 36 (1), 175. doi:10.1007/s40520-024-02831-6
Muthukrishnan, R., and Rohini, R. (2016). “LASSO: a feature selection technique in predictive modeling for machine learning,” in 2016 IEEE International Conference on Advances in Computer Applications (ICACA), Coimbatore, India, 24 October 2016, 18–20.
Niu, J., Lv, X., Gao, L., Jia, H., and Zhao, J. (2025). Development and validation of a machine learning-based prediction model for in-ICU mortality in severe pneumonia: a dual-center retrospective study. Int. J. Med. Inf. 204, 106075. doi:10.1016/j.ijmedinf.2025.106075
O’brien, R. M. (2007). A caution regarding rules of thumb for variance inflation factors. Qual. Quant. 41, 673–690. doi:10.1007/s11135-006-9018-6
Pan, J., Bu, W., Guo, T., Geng, Z., and Shao, M. (2023). Development and validation of an in-hospital mortality risk prediction model for patients with severe community-acquired pneumonia in the intensive care unit. BMC Pulm. Med. 23 (1), 303. doi:10.1186/s12890-023-02567-5
Pirracchio, R., Petersen, M. L., Carone, M., Rigon, M. R., Chevret, S., and van der Laan, M. J. (2015). Mortality prediction in intensive care units with the super ICU learner algorithm (SICULA): a population-based study. Lancet Resp. Med. 3 (1), 42–52. doi:10.1016/S2213-2600(14)70239-5
Que, Y. A., Virgini, V., Lozeron, E. D., Paratte, G., Prod'Hom, G., Revelly, J. P., et al. (2015). Low C-reactive protein values at admission predict mortality in patients with severe community-acquired pneumonia caused by Streptococcus pneumoniae that require intensive care management. Infection 43 (2), 193–199. doi:10.1007/s15010-015-0755-0
Ranstam, J., Cook, J. A., and Collins, G. S. (2016). Clinical prediction models. Br. J. Surg. 103 (13), 1886. doi:10.1002/bjs.10242
Shang, N., Li, Q., Liu, H., Li, J., and Guo, S. (2023). Erector spinae muscle-based nomogram for predicting in-hospital mortality among older patients with severe community-acquired pneumonia. BMC Pulm. Med. 23 (1), 346. doi:10.1186/s12890-023-02640-z
Spencer, E., Rosengrave, P., Williman, J., Shaw, G., and Carr, A. C. (2022). Circulating protein carbonyls are specifically elevated in critically ill patients with pneumonia relative to other sources of sepsis. Free. Radic. Biol. Med. 179, 208–212. doi:10.1016/j.freeradbiomed.2021.11.029
Tseng, C. C., Hung, K. Y., Chang, H. C., Huang, K. T., Wang, C. C., Chen, Y. M., et al. (2024). The importance of high total body water/fat free mass ratio and serial changes in body composition for predicting hospital mortality in patients with severe pneumonia: a prospective cohort study. BMC Pulm. Med. 24 (1), 470. doi:10.1186/s12890-024-03302-4
Van Calster, B., Wynants, L., Verbeek, J., Verbakel, J. Y., Christodoulou, E., Vickers, A. J., et al. (2018). Reporting and interpreting decision curve analysis: a guide for investigators. Eur. Urol. 74 (6), 796–804. doi:10.1016/j.eururo.2018.08.038
Vincent, J. L., Moreno, R., Takala, J., Willatts, S., Mendonça, A., Bruining, H., et al. (1996). The SOFA (Sepsis-related organ failure assessment) score to describe organ dysfunction/failure. On behalf of the working group on sepsis-related problems of the european society of intensive care medicine. Intensive Care Med. 22 (7), 707–710. doi:10.1007/BF01709751
Wang, K., Zuo, P., Liu, Y., Zhang, M., Zhao, X., Xie, S., et al. (2020). Clinical and laboratory predictors of In-hospital mortality in patients with coronavirus Disease-2019: a cohort study in wuhan, China. Clin. Infect. Dis. 71 (16), 2079–2088. doi:10.1093/cid/ciaa538
Wang, Z., Pu, Q., Huang, C., and Wu, M. (2021). Crosstalk between lung and extrapulmonary organs in infection and inflammation. Adv. Exp. Med. Biol. 1303, 333–350. doi:10.1007/978-3-030-63046-1_18
Welte, T., Torres, A., and Nathwani, D. (2012). Clinical and economic burden of community-acquired pneumonia among adults in Europe. Thorax 67 (1), 71–79. doi:10.1136/thx.2009.129502
Xie, K., Guan, S., Jing, H., Ji, W., Kong, X., Du, S., et al. (2023). Efficacy and safety of traditional Chinese medicine adjuvant therapy for severe pneumonia: evidence mapping of the randomized controlled trials, systematic reviews, and meta-analyses. Front. Pharmacol. 14, 1227436. doi:10.3389/fphar.2023.1227436
Xie, K., Guan, S., Kong, X., Ji, W., Du, C., Jia, M., et al. (2024). Predictors of mortality in severe pneumonia patients: a systematic review and meta-analysis. Syst. Rev. 13 (1), 210. doi:10.1186/s13643-024-02621-1
Yu, X., Xie, Y., and Li, J. (2019). Guidelines for the diagnosis and treatment of community-acquired pneumonia (2018 revision). J. Tradit. Chin. Med. 60 (04), 350–360. doi:10.13288/j.11-2166/r.2019.04.019
Zhang, C., Zheng, F., and Wu, X. (2023). Predictive value of C-reactive protein-to-albumin ratio for risk of 28-day mortality in patients with severe pneumonia. J. Lab. Med. 47 (3), 115–120. doi:10.1515/labmed-2022-0114
Zhang, C., Guan, S., Xie, K., Zhang, K., and Wang, H. (2022). Distribution of clinical traditional Chinese medicine syndromes in patients with severe community-acquired pneumonia. Chin. General Pract. 25 (21), 2640–2645. doi:10.12114/j.issn.1007-9572.2022.0179
Zhang, Y., Peng, Y., Zhang, W., and Deng, W. (2023). Development and validation of a predictive model for 30-day mortality in patients with severe community-acquired pneumonia in intensive care units. Front. Med. 10, 1295423. doi:10.3389/fmed.2023.1295423
Zhao, W., Li, X., Gao, L., Ai, Z., Lu, Y., Li, J., et al. (2025). Machine learning-based model for predicting all-cause mortality in severe pneumonia. BMJ Open Respir. Res. 12 (1), e001983. doi:10.1136/bmjresp-2023-001983
Zhu, Y., Zheng, X., Huang, K., Tan, C., Li, Y., Zhu, W., et al. (2021). Mortality prediction using clinical and laboratory features in elderly patients with severe community-acquired pneumonia. Ann. Palliat. Med. 10 (10), 10913–10921. doi:10.21037/apm-21-2537
Zhu, W., Bai, Y., Li, S., Zhang, M., Chen, J., Xie, P., et al. (2023). Delirium in hospitalized COVID-19 patients: a prospective, multicenter, cohort study. J. Neurol. 270 (10), 4608–4616. doi:10.1007/s00415-023-11882-0
Glossary
ICU intensive care unit
CRP C-reactive protein
IL interleukin
mNUTRIC Modified Nutrition Risk in Critically ill
TCM traditional Chinese medicine
AI artificial intelligence
SOFA Sequential Organ Failure Assessment
APACHE II Acute Physiology and Chronic Health Evaluation II
LR Logistic Regression
SVM Support Vector Machine
DT Decision Tree
RF Random Forest
XGBoost Extreme Gradient Boosting
H1N1 Influenza A
SARS severe acute respiratory syndrome
COVID-19 coronavirus disease 2019
COPD chronic obstructive pulmonary disease
WBC white blood cell
RBC red blood cell
NEUT% neutrophilic granulocyte percentage
LY% lymphocyte percentage
PCT procalcitonin
AST aminotransferase
ALT alanine aminotransferase
BUN blood urea nitrogen
Scr serum creatinine
PT prothrombin time
BNP brain natriuretic peptide
PaO2 arterial partial pressure of oxygen
PaCO2 arterial partial pressure of carbon dioxide
PaO2/FiO2 arterial oxygenation index
SD standard deviation
IQR interquartile range
LASSO Least Absolute Shrinkage and Selection Operator
AUROC area under the receiver operating characteristic
MCC matthew correlation coefficient
ROC receiver operation characteristic
AUC area under the curve
DCA decision curve analysis
PSI Pneumonia Severity Index
SHAP SHapley Additive exPlanations
LightGBM gradient-boosted decision tree
MLP multilayer perceptron
SCAP severe community acquired pneumonia.
Keywords: severe pneumonia, machine learning, prediction model, mortality, traditional Chinese medicine
Citation: Xie K, Huang X, Li Z, Yin W, He X, Miao X and Wang H (2025) Development and validation of a clinical prediction model for in-hospital mortality of severe pneumonia based on machine learning. Front. Pharmacol. 16:1660893. doi: 10.3389/fphar.2025.1660893
Received: 07 July 2025; Accepted: 31 October 2025;
Published: 26 November 2025.
Edited by:
Izolde Bouloukaki, University of Crete, GreeceReviewed by:
Marco Confalonieri, University of Trieste, ItalyChen Cui, Air Force Medical University, China
Massimo Giotta, University of Bari Aldo Moro, Italy
Copyright © 2025 Xie, Huang, Li, Yin, He, Miao and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Haifeng Wang, d2FuZ2hfZkAxMjYuY29t
Xiajin Huang1,2,3