Under the background of the new global definition of ARDS: an interpretable machine learning approach for predicting 28-day ICU mortality in patients with sepsis complicated by ARDS

Zhang, Peijie; Yuan, Shuo; Zhang, Shuzhan; Yuan, Zhiheng; Ye, Zi; Lv, Lanxin; Yang, Hongning; Peng, Hui; Li, Haiquan; Zhao, Ningjun

doi:10.3389/fphys.2025.1617196

ORIGINAL RESEARCH article

Front. Physiol., 19 September 2025

Sec. Respiratory Physiology and Pathophysiology

Volume 16 - 2025 | https://doi.org/10.3389/fphys.2025.1617196

This article is part of the Research Topic "Advanced Monitoring in ARDS: Enhancing Mechanical Ventilation through Innovative Techniques"

Peijie Zhang^1,2^†

Shuo Yuan^1,2^†

Shuzhan Zhang^1,2^†

Zhiheng Yuan^1,2

Zi Ye^1,2

Lanxin Lv^1,2

Hongning Yang^1,2

Hui Peng³*

Haiquan Li⁴*

Ningjun Zhao^1,2*

¹Department of Emergency Medicine, The Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China
²Xuzhou Key Laboratory of Emergency Medicine, The Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China
³Department of Emergency Medicine, Feng Xian People’s Hospital, Xuzhou, Jiangsu, China
⁴Department of Respiratory and Critical Care Medicine, Second Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China

Background: Acute respiratory distress syndrome (ARDS) is a prevalent clinical complication among patients with sepsis, characterized by high incidence and mortality rates. The definition of ARDS has evolved over time, with the new global definition introducing significant updates to its diagnosis and treatment. Our objective is to develop and validate an interpretable prediction model for the prognosis of sepsis patients complicated by ARDS, utilizing machine learning techniques in accordance with the new global definition.

Methods: This study extracted data from the MIMIC database (version MIMIC-IV 2.2) to create the training set for our model. For external validation, this study used data from sepsis patients complicated by ARDS who met the new global definition of ARDS, sourced from the Affiliated Hospital of Xuzhou Medical University. Lasso regression with cross-validation was used to identify key predictors of patient prognosis. Subsequently, this study established models to predict the 28-day prognosis following ICU admission using various machine learning algorithms, including logistic regression, random forest, decision tree, support vector machine classifier, LightGBM, XGBoost, AdaBoost, and multi-layer perceptron (MLP). Model performance was assessed using ROC curves, clinical decision curves (DCA), and calibration curves, while SHAP values were utilized to interpret the machine learning models.

Results: A total of 905 patients with sepsis complicated by ARDS were included in our analysis, leading to the selection of 15 key variables for model development. Based on the AUC of the ROC curve, as well as DCA and calibration curve results from the training set, the support vector classifier (SVC) model demonstrated strong performance, achieving an average AUC of 0.792 in the internal validation set and 0.816 in the external validation set.

Conclusion: The application of machine learning methodologies to construct prognostic prediction models for sepsis patients complicated by ARDS, informed by the new global definition, proves to be reliable. This approach can assist clinicians in developing personalized treatment strategies for affected patients.

1 Introduction

Sepsis is a systemic inflammatory response syndrome typically triggered by infection. The persistent systemic inflammatory response and the imbalance of immune regulatory mechanisms represent the core pathological and physiological processes underlying sepsis, often resulting in severe multi-organ dysfunction and posing a significant threat to life (Singer et al., 2016). A recent study examining patients with sepsis and septic shock from 2009 to 2019 indicated that conservatively, there are over 30 million new cases of sepsis globally each year, with approximately 6 million patients succumbing to sepsis or septic shock (Bauer et al., 2020). Additionally, a cross-sectional study conducted in China revealed that patients admitted to the ICU with sepsis had a 90-day mortality rate of around 35.5% (Xie et al., 2020).

The lungs are the first and most commonly affected organ in the progression of sepsis. Patients with sepsis may develop acute lung injury (ALI) or even acute respiratory distress syndrome (ARDS), which is characterized by refractory hypoxemia and respiratory distress. ARDS is a serious and potentially fatal form of respiratory failure marked by increased permeability of the alveolar capillary membrane due to various direct or indirect injurious factors, resulting in edema of the alveoli and interstitium, as well as alveolar hemorrhage and the formation of hyaline membranes. These changes ultimately lead to hypoxemia and respiratory distress.

The combination of sepsis and ARDS is thought to be linked to mechanisms such as systemic inflammatory cytokine storms triggered by infection (Zhu et al., 2022), monocyte-macrophage activation (Lv and Liang, 2025), oxidative stress (Liu Y. et al., 2021), and a reduction in pulmonary surfactant or alterations in its composition (Whitsett et al., 2015), all of which may result in irreversible lung damage.

The clinical definition of ARDS has undergone several revisions, with the Berlin definition published in 2012 playing a pivotal role in clinical diagnosis and management. This definition emphasizes mechanical ventilation, the oxygenation index (PaO2/FiO2 ratio), and pulmonary imaging as essential parameters for diagnosing ARDS and assessing its severity (Ranieri et al., 2012). However, over the past decade, numerous medical professionals have identified limitations within the Berlin definition during clinical practice. In response, 32 critical care experts from around the world jointly published a new global definition of ARDS in May 2023. This updated definition broadens the diagnostic criteria to cover patients receiving non-invasive ventilation and high-flow nasal oxygen (HFNO). It accepts non-invasive pulse oximetry, specifically the SpO2/FiO2 index, as an indicator for diagnosing ARDS in place of the traditional oxygenation index, which relies on arterial blood gas analysis. Furthermore, pulmonary ultrasound has been added as a supplementary tool for pulmonary imaging diagnosis (Matthay et al., 2024).

This new global definition significantly expands the application of ARDS clinical criteria, implementing important updates in diagnostic standards, scope, and imaging evaluation. The aim is to enhance the accuracy and universality of ARDS diagnosis, ultimately improving treatment and patient prognosis.

Sepsis complicated by ARDS is a leading cause of mortality in patients with sepsis in the intensive care unit (ICU). Reports indicate that annually, approximately 150,000 to 200,000 individuals worldwide succumb to sepsis complicated by ARDS. The mortality rate for patients experiencing this dual condition is estimated to be 30%–40% higher than that for patients with sepsis alone (Englert et al., 2019; Eworuke et al., 2018). Given the significant incidence and mortality associated with sepsis and ARDS (S-ARDS), establishing a reliable and effective clinical prognosis prediction model is essential. Such a model would provide intuitive, evidence-based information to assist medical professionals in identifying high-risk groups and enhancing the management of such patients.

Machine Learning (ML), a branch of artificial intelligence, enables computer systems to learn autonomously and make decisions through data analysis and pattern recognition. It is characterized by powerful data processing capabilities, automatic recognition functions, and continuous learning and optimization. In recent years, ML has become increasingly important in the development of clinical prognosis prediction models. For instance, Pappada SM et al. created a machine learning model for the early identification of ICU-acquired sepsis, achieving specificity and sensitivity rates of 83.8% and 73.3%, respectively (Pappada et al., 2024). Additionally, Fan Z et al. utilized machine learning techniques to develop a clinical prognosis model for patients with sepsis complicated by acute kidney injury, successfully validating it externally and achieving favorable clinical prediction outcomes (Fan et al., 2023).

In the realm of clinical prognostic model research for patients with sepsis and ARDS, although Mu S et al. have developed a prognostic model using data from the MIMIC-III database and the Berlin definition of ARDS, there remains a notable lack in research focusing on the clinical characteristics, prognosis, risk factor identification, and model development for sepsis patients with ARDS based on the latest global definition of ARDS.

The Medical Information Mart for Intensive Care (MIMIC) is a comprehensive and publicly accessible database that includes extensive information on over 190,000 patients treated at the Beth Israel Deaconess Medical Center from 2008 to 2019. This database encompasses a wide range of data, including demographic details, vital signs, laboratory test results, imaging reports, prescriptions, and clinical outcomes. It serves as a robust foundation for researching and developing clinical prognosis prediction models specifically for sepsis patients with ARDS based on the new global definition.

Therefore, this study aims to identify patients with sepsis complicated by ARDS under the new global definition in the MIMIC database, collect their clinical characteristics, identify risk factors that affect the clinical prognosis of this population, and develop a clinical prognosis prediction model. In summary, the main contributions of this study are as follows: (1) we constructed an interpretable machine learning model to predict 28-day ICU mortality among patients with sepsis-related ARDS based on the new 2023 global definition; (2) we validated the model on an external cohort from a different hospital to demonstrate generalizability; (3) we adopted a nested cross-validation framework and SHAP analysis to ensure model robustness and interpretability; (4) we included mild ARDS patients who received only supplemental oxygen to align with the inclusive spirit of the new definition, thus improving early recognition and clinical applicability.

2 Methods

2.1 Study design and data sources

We utilize the Medical Information Mart for Intensive Care (MIMIC) database as our primary data source, specifically version MIMIC-IV 2.2. Although MIMIC-IV version 3.1 was released after our initial data extraction, we found that the Note module, which includes critical radiology and clinical notes required for ARDS diagnosis under the new global definition, had not been updated. To ensure consistency and completeness of diagnostic data, we retained version 2.2 for our study. This open-access intensive care database comprises clinical data from over 190,000 patients and 450,000 hospitalizations documented at the Beth Israel Deaconess Medical Center between 2008 and 2019, which includes approximately 70,000 ICU admissions. The MIMIC-IV database contains a wealth of information, including patient demographic details, codes from both the 9th and 10th editions of the International Classification of Diseases (ICD-9 and ICD-10), vital signs, laboratory test results, imaging studies, real-time physiological monitoring data from the ICU, and records of clinical outcomes. Importantly, all personal identifying information of patients in the database is anonymized and kept strictly confidential. Accessing and extracting data from this database necessitates approval from the relevant review committee at MIT.

We extracted data on sepsis patients with ARDS who met either the Berlin definition or the updated global definition from the MIMIC-IV database. Sepsis is defined as a dysregulated host response to infection that leads to life-threatening organ dysfunction. Consequently, the primary criteria for identifying sepsis patients in the database were clinical evidence of infection or a high suspicion of infection, along with a Sequential Organ Failure Assessment (SOFA) score of ≥2 (Singer et al., 2016).

The diagnosis of ARDS according to the Berlin definition is based on the following criteria: 1. The onset of ARDS should occur within 1 week following the onset of known clinical abnormalities or new respiratory symptoms; 2. Chest X-rays or CT scans must reveal bilateral lung infiltrates or edema, while ruling out the effects of pleural effusion or acute heart failure; 3. Mechanical ventilation is required, with a positive end-expiratory pressure (PEEP) ≥5 cm H₂O and an oxygenation index (PaO2/FiO2) ≤300 mmHg (Ranieri et al., 2012).

The new global definition of ARDS (Matthay et al., 2024) builds upon the Berlin definition, incorporating the following diagnostic criteria: 1. The onset should occur within 1 week of identified risk factors or the emergence of new or worsening respiratory symptoms, characterized by acute exacerbation or deterioration of hypoxemic respiratory failure; 2. Chest imaging must indicate bilateral lung infiltrates or edema, excluding cardiogenic pulmonary edema; 3. ARDS is classified under different ventilation states as follows: (1) Non-intubation ARDS is defined by an oxygen flow ≥30 L/min using high-flow nasal cannula (HFNC), or PEEP ≥5 cm H₂O when using non-invasive ventilation (NIV) or continuous positive airway pressure (CPAP); (2) Intubation ARDS follows the criteria of the Berlin definition; (3) In resource-limited environments, ARDS can be diagnosed based solely on oxygen therapy, without the necessity of specific respiratory support devices such as PEEP or defined oxygen flow rates. Under these conditions, SpO2 ≤ 97% and SpO2/FiO2 ≤ 315 are considered necessary for diagnosing ARDS (Matthay et al., 2024). Although the new global definition of ARDS introduced diagnostic criteria for resource-limited settings—specifically allowing diagnosis based on supplemental oxygen therapy—this criterion was still applied in our study using the MIMIC-IV dataset from Beth Israel Deaconess Medical Center, a tertiary academic hospital. This is because, in clinical reality, even in such high-resource settings, some ICU patients may initially present with mild ARDS and receive only oxygen therapy due to adequate respiratory function. These cases, although not meeting criteria for mechanical ventilation or non-invasive support, are still eligible for ARDS diagnosis under the new global definition. Including such patients allows earlier detection of ARDS and enhances the model’s generalizability and clinical utility.
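The respiratory-support and oxygenation thresholds listed above can be sketched as a small rule check. This is an illustrative sketch only, not the study's extraction code: the `OxygenationSnapshot` record and its field names are hypothetical, the timing and imaging criteria are not modeled, and applying the SpO2/FiO2 thresholds to the HFNC and NIV/CPAP categories is an assumption made here for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class OxygenationSnapshot:
    """Hypothetical record of one patient's worst values in the first 24 h."""
    support: str                        # "invasive", "niv", "hfnc", or "oxygen"
    peep_cmh2o: Optional[float] = None  # PEEP in cm H2O
    flow_l_min: Optional[float] = None  # HFNC flow in L/min
    pao2_fio2: Optional[float] = None   # PaO2/FiO2 in mmHg
    spo2: Optional[float] = None        # SpO2 in percent
    spo2_fio2: Optional[float] = None   # SpO2/FiO2 ratio


def meets_support_and_oxygenation(s: OxygenationSnapshot) -> bool:
    """Check only the support and oxygenation criteria (no imaging, no timing)."""
    pf_ok = s.pao2_fio2 is not None and s.pao2_fio2 <= 300
    sf_ok = (
        s.spo2 is not None
        and s.spo2_fio2 is not None
        and s.spo2 <= 97
        and s.spo2_fio2 <= 315
    )
    if s.support == "invasive":
        # Intubated patients follow the Berlin criteria: PEEP >= 5 and PaO2/FiO2 <= 300.
        return s.peep_cmh2o is not None and s.peep_cmh2o >= 5 and pf_ok
    if s.support == "niv":
        # NIV/CPAP requires PEEP >= 5; accepting either ratio is an assumption.
        return s.peep_cmh2o is not None and s.peep_cmh2o >= 5 and (pf_ok or sf_ok)
    if s.support == "hfnc":
        # HFNC requires flow >= 30 L/min; accepting either ratio is an assumption.
        return s.flow_l_min is not None and s.flow_l_min >= 30 and (pf_ok or sf_ok)
    if s.support == "oxygen":
        # Oxygen-only category: SpO2 <= 97% and SpO2/FiO2 <= 315.
        return sf_ok
    return False
```

A patient on supplemental oxygen with SpO2 of 95% and SpO2/FiO2 of 250 would pass this check, while one with SpO2 of 99% would not, because the SpO2/FiO2 ratio is only considered informative when SpO2 is at or below 97%.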

From the MIMIC database, we extracted the clinical data of patients diagnosed with sepsis complicated by ARDS under each of the two diagnostic criteria described above. We then analyzed the differences in clinical characteristics, disease severity assessments, and mortality rates between the patient groups defined by the two definitions. Furthermore, we employed machine learning techniques to predict 28-day ICU mortality for sepsis patients with ARDS under the latest definition, and analyzed possible risk factors that may affect clinical prognosis.

2.2 Data extraction

We initially employed Structured Query Language (SQL) to retrieve and extract raw data from the MIMIC-IV database using Navicat Premium software (version 16.3.8). This data included essential clinical information about patients, laboratory test results, imaging examinations, clinical comorbidities, critical care records, advanced life support therapy details, and clinical prognosis information.

For this study, we included patients who met the following criteria: 1. They were experiencing their first admission to the ICU; 2. Their ICU stay exceeded 24 h; 3. They were over 18 years old at the time of admission; 4. They were diagnosed with sepsis within 24 h of admission, in accordance with the Sepsis-3.0 diagnostic criteria. To identify sepsis patients in the MIMIC-IV database, we utilized ICD-9 codes (78552, 99591, and 99592), ICD-10 codes (R65.20 and R65.21), and the SOFA score recorded within the first 24 h of ICU admission; 5. They were also diagnosed with acute respiratory distress syndrome (ARDS) within 24 h of admission, based on the Berlin definition or the new global definition. Detailed diagnostic criteria can be found in the definitions above, and the data extraction process is illustrated in Figure 1. To ensure that the ARDS cases included in our study were induced by sepsis, we required that the diagnoses of both sepsis and ARDS occurred within the first 24 h of ICU admission. Sepsis was identified using ICD-9/10 codes (e.g., 78552, R65.20) and a SOFA score ≥2, indicating organ dysfunction due to infection. Non-infectious causes of ARDS, such as trauma, aspiration, or pancreatitis, were excluded by design through this definition. ARDS was diagnosed using SpO₂/FiO₂ or PaO₂/FiO₂ indices and chest imaging findings recorded in the same 24-h window.

Figure 1

Flowchart depicting ICU records analysis in the MIMIC-IV v2.2 database. Starts with 74,181 ICU stays; 23,261 excluded for repeats. Covers 50,920 first ICU admissions. Divided into groups: invasive ventilation (15,555), noninvasive ventilation (447), HFNC (516), supplemental oxygen (21,548). Each group assesses patients with bilateral infiltrates and ICD codes indicating no acute heart failure. Specific criteria include PEEP levels and oxygen flow. Outcomes categorizing patients with sepsis and ICU stays over 24 hours are shown, resulting in totals for the Berlin and new global definitions.

Figure 1. Flowchart of screening.

To extract ARDS patients who met the Berlin definition or the new global definition, we adapted the open-source code by Qian F et al., which extracts: 1. The initial ventilation treatment status of patients upon ICU admission; 2. Results from pulmonary imaging (chest X-ray or chest CT), specifically textual information indicating bilateral pulmonary edema, such as “bilateral infiltration” and “edema”; 3. PaO2/FiO2 and SpO2/FiO2 (Qian et al., 2024). We modified parts of this code; for instance, we defined PaO2/FiO2 and SpO2/FiO2 as the worst values recorded within the first 24 h of ICU admission for patients under initial ventilation treatment. If the duration of initial ventilation treatment was less than 24 h, the worst value during that treatment period was used for the diagnostic criteria. Additionally, we used ICD codes “428” and “I50”, along with their lower-level codes, to identify and exclude cases of acute cardiogenic pulmonary edema.
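The two modifications described above (taking the worst ratio inside the first 24 h or inside the shorter initial support period, and excluding heart-failure codes by prefix) can be sketched as follows. The function names and the input layout are hypothetical and do not come from the study's code or from the code of Qian et al.; "worst" is taken to mean the lowest ratio.

```python
from datetime import datetime, timedelta
from typing import List, Optional, Tuple


def worst_ratio_first_24h(
    icu_intime: datetime,
    measurements: List[Tuple[datetime, float]],
    support_end: Optional[datetime] = None,
) -> Optional[float]:
    """Return the lowest ratio recorded in the first 24 h of the ICU stay.

    If the initial support period ends before 24 h, the window is
    shortened to that period.
    """
    window_end = icu_intime + timedelta(hours=24)
    if support_end is not None and support_end < window_end:
        window_end = support_end
    in_window = [v for t, v in measurements if icu_intime <= t <= window_end]
    return min(in_window) if in_window else None


def is_heart_failure_code(icd_code: str) -> bool:
    """True for ICD-9 428.x and ICD-10 I50.x codes, with or without the dot."""
    code = icd_code.replace(".", "").upper()
    return code.startswith("428") or code.startswith("I50")
```

Matching on the code prefix is what captures the "lower-level codes": `I50.9` and `4280` both match, while a sepsis code such as `R65.20` does not.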

The data we extracted encompass the following key elements:

1. Basic Clinical Information: This includes age, gender, weight, and height at the time of admission.

2. Intensive Care Records: This section details the duration of ICU hospitalization and vital signs recorded within the first 24 h of admission. Key measurements include blood pressure, heart rate, respiratory rate, body temperature, blood oxygen saturation, urine output, and blood glucose levels.

3. Laboratory Test Results: Within the first 24 h of ICU admission, we collected laboratory test results, including complete blood counts, liver and kidney function tests, coagulation profiles, and arterial blood gas analyses.

4. Advanced Life Support Therapy: This includes information on renal replacement therapy, mechanical ventilation, and the administration of vasoactive drugs.

5. Imaging Examinations: We focused on the textual descriptions of pulmonary imaging results, such as chest X-rays and chest CT scans.

6. Patient Death Records: The database contains records of patient mortality, with a positive outcome defined as death occurring within 28 days of ICU admission.

In the study, vital signs and laboratory test results from the intensive care records were analyzed as independent features by utilizing their maximum, minimum, and/or mean values.
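Turning repeated first-day measurements into minimum, maximum, and mean features can be sketched as below. The helper names, the feature-naming scheme (`<variable>_<statistic>`), and the example readings are hypothetical, chosen only to make the idea concrete.

```python
from statistics import mean
from typing import Dict, List, Optional


def summarize_first_24h(readings: List[Optional[float]]) -> Optional[Dict[str, float]]:
    """Return min, max, and mean of the non-missing readings, or None if there are none."""
    values = [r for r in readings if r is not None]
    if not values:
        return None
    return {"min": min(values), "max": max(values), "mean": mean(values)}


def build_features(vitals: Dict[str, List[Optional[float]]]) -> Dict[str, Optional[float]]:
    """Flatten per-variable summaries into one feature row for a patient."""
    features: Dict[str, Optional[float]] = {}
    for name, readings in vitals.items():
        summary = summarize_first_24h(readings)
        for stat in ("min", "max", "mean"):
            # A variable with no readings stays missing and is left for imputation.
            features[f"{name}_{stat}"] = None if summary is None else summary[stat]
    return features


# Made-up readings for one patient.
row = build_features({"heart_rate": [80, 100, None, 90], "lactate": []})
```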

We included patients with sepsis complicated by ARDS who met the criteria of the new global definition and were admitted to Xuzhou Medical University Affiliated Hospital from March 2022 to October 2024. The exclusion criteria were consistent with those used in the training cohort. Clinical data for patients in the external validation cohort were collected based on 15 features selected from the training cohort after model training. These features included admission age, average SpO2, average body temperature, average respiratory rate, average heart rate, red blood cell distribution width (RDW), presence of metastatic solid tumors, lactate levels, urine output, international normalized ratio (INR), alkaline phosphatase levels, average red blood cell volume, logistic organ dysfunction score (LODS score), presence of rheumatic diseases, and platelet count. The inclusion and exclusion criteria for the external validation cohort were identical to those applied to the MIMIC-IV cohort. Therefore, a separate flowchart was not presented to avoid redundancy.

2.3 Statistical analysis

In the baseline analysis, we employed the Shapiro-Wilk test to assess the normality of the data distribution. Continuous variables with a normal distribution are presented as mean and standard deviation and were compared between groups using an independent-samples t-test. Continuous variables that did not follow a normal distribution are presented as median and interquartile range and were compared using the Wilcoxon rank sum test. Categorical data are presented as counts and percentages, with comparisons made using the chi-square test. A p-value of less than 0.05 was considered statistically significant.
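For the categorical comparisons, the chi-square test on a 2×2 table can be written out by hand. The sketch below uses the Pearson statistic without a continuity correction, which is an assumption because the text does not say whether one was applied, and the counts in the example are invented purely to show the arithmetic.

```python
from math import erfc, sqrt
from typing import List, Tuple


def chi_square_2x2(table: List[List[int]]) -> Tuple[float, float]:
    """Pearson chi-square statistic and p-value (1 degree of freedom) for a 2x2 table."""
    row_totals = [sum(row) for row in table]
    col_totals = [table[0][j] + table[1][j] for j in range(2)]
    n = sum(row_totals)
    statistic = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (table[i][j] - expected) ** 2 / expected
    # For 1 degree of freedom the upper-tail probability is erfc(sqrt(x / 2)).
    p_value = erfc(sqrt(statistic / 2))
    return statistic, p_value


# Invented counts: rows are two groups, columns are died / survived.
stat, p = chi_square_2x2([[10, 20], [20, 10]])
```

With these invented counts every expected cell is 15, so the statistic is 4 × 25/15 ≈ 6.67, which falls below the 0.05 threshold stated above.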

Based on the survival status of patients 28 days after their admission to the ICU, we categorized them into a survival group and a death group. Additionally, patients were classified into two groups: the “Berlin definition group” and the “new global definition group,” according to their alignment with the Berlin definition or the new global definition of ARDS.

In building our machine learning models, we used Python version 3.11.7 with Jupyter Notebook as our coding environment. The key packages and versions were: scikit-learn 1.4.0, miceforest 5.6.4, scikit-optimize 0.9.0, imbalanced-learn 0.12.0, SHAP 0.44.1, numpy 1.26.3, matplotlib 3.8.3. During the data preprocessing phase, illustrated in Figure 2, we used the missingno module to visualize missing data. Each column in the visualization represents a clinical variable, with white spaces indicating missing values. The denser the black lines in a column, the fewer missing values that clinical variable has.

Figure 2

Heatmap illustrating data completeness across various medical variables for 905 entries. Rows represent individual data entries, and columns denote specific medical attributes like age, gender, and various lab measurements. White gaps indicate missing data. A side plot shows the frequency of missing data entries, peaking at 31.

Figure 2. Missing value visualization: Each column represents a clinical variable, and the white lines represent missing values.

To enhance the accuracy and performance of our model predictions, we decided to exclude clinical variables with more than 30% missing values, such as bicarbonate and albumin. For the remaining missing values, we applied miceforest multiple imputation, which effectively captures complex relationships among variables by utilizing a random forest model. Through multiple iterations, the missing values are predicted in a manner that aligns with the distribution characteristics of the original dataset, thereby minimizing bias as much as possible. For continuous variables, we implement MinMaxScaler normalization to scale them appropriately, which helps eliminate dimensional effects and improves model efficiency. Additionally, we use OneHotEncoder to encode categorical variables effectively.
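The 30% missingness filter and the min-max scaling step can be illustrated in plain Python. This sketch deliberately leaves out the miceforest imputation and the one-hot encoding, and it re-implements the scaling formula by hand rather than calling `MinMaxScaler`; the table contents are invented.

```python
from typing import Dict, List, Optional


def missing_fraction(column: List[Optional[float]]) -> float:
    """Share of entries in a column that are missing (None)."""
    return sum(v is None for v in column) / len(column)


def drop_sparse_columns(
    table: Dict[str, List[Optional[float]]], threshold: float = 0.30
) -> Dict[str, List[Optional[float]]]:
    """Keep only variables whose missing fraction does not exceed the threshold."""
    return {name: col for name, col in table.items() if missing_fraction(col) <= threshold}


def min_max_scale(column: List[float]) -> List[float]:
    """Rescale values to [0, 1] using (x - min) / (max - min)."""
    lo, hi = min(column), max(column)
    if hi == lo:
        # A constant column carries no spread; map it to zeros.
        return [0.0 for _ in column]
    return [(v - lo) / (hi - lo) for v in column]


# Invented example: albumin is 75% missing and is dropped, age is 25% missing and kept.
table = {"albumin": [None, None, 3.1, None], "age": [60.0, 70.0, 80.0, None]}
kept = drop_sparse_columns(table)
scaled_age = min_max_scale([60.0, 70.0, 80.0])
```

Scaling would normally be fitted on the training folds only and then applied to the held-out fold, so that the minimum and maximum of the test data do not leak into training.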

During the training and validation phases of our machine learning models, we evaluated several widely recognized and highly effective algorithms based on the results of feature selection using Lasso CV. These algorithms included logistic regression (LR), random forest (RF), decision tree (DT), support vector machine (SVM), lightweight gradient boosting machine (LightGBM), extreme gradient boosting machine (XGBoost), adaptive boosting machine (AdaBoost), and multilayer perceptron (MLP).

To improve the stability and generalizability of the models, we employed a repeated nested cross-validation strategy. In this approach, the outer loop involved a 5-fold cross-validation, where the dataset was randomly split into five subsets. One fold was used as the outer test set, while the remaining four served as the outer training set. Within the training set, a 10-fold inner cross-validation was conducted to perform hyperparameter tuning. This entire nested cross-validation process was repeated five times, with the dataset reshuffled before each repetition, resulting in 25 independent models per machine learning algorithm. The final performance for each algorithm was calculated as the average performance across the 25 models, which helps reduce variance due to data partitioning and ensures a more reliable model selection.
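The repeated nested cross-validation layout can be sketched with scikit-learn as follows. Several choices here are assumptions made to keep the sketch short and self-contained: the data are synthetic, the folds are stratified, a small grid search over the SVC `gamma` parameter stands in for the Bayesian optimization used in the study, and resampling is omitted.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import GridSearchCV, RepeatedStratifiedKFold, StratifiedKFold
from sklearn.svm import SVC

# Synthetic stand-in for the 15-feature training cohort.
X, y = make_classification(n_samples=200, n_features=15, random_state=0)

# Outer loop: 5 folds, repeated 5 times with reshuffling, giving 25 train/test splits.
outer = RepeatedStratifiedKFold(n_splits=5, n_repeats=5, random_state=0)
aucs = []
for train_idx, test_idx in outer.split(X, y):
    # Inner loop: 10-fold cross-validation on the outer training set for tuning.
    inner = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
    search = GridSearchCV(
        SVC(),
        param_grid={"gamma": [1e-4, 1e-2, 1.0]},
        scoring="roc_auc",
        cv=inner,
    )
    search.fit(X[train_idx], y[train_idx])
    # The refitted best model is scored once on the untouched outer test fold.
    scores = search.decision_function(X[test_idx])
    aucs.append(roc_auc_score(y[test_idx], scores))

mean_auc = sum(aucs) / len(aucs)
```

Because tuning only ever sees the outer training folds, each of the 25 AUC values is an estimate on data that played no part in choosing the hyperparameters.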

To mitigate the effects of imbalanced positive and negative outcomes on the model, we implemented the Synthetic Minority Over-sampling Technique (SMOTE) and the Tomek Link technique. These methods balance the data, reduce the risk of overfitting, and enhance the model’s generalization capability. For hyperparameter optimization, we employed Bayesian Optimization to determine the optimal hyperparameter combinations. The tuned hyperparameters and search spaces were as follows:

• LR: c (10⁻⁴ to 10⁻²)

• RF: max_depth (3–30), n_estimators (100–1,000), min_samples_split (2–10)

• DT: max_depth (3–30), min_samples_split (1–10)

• SVM: gamma (10⁻⁴ to 1)

• LightGBM: max_depth (3–30), num_leaves (20–200), learning_rate (0.001–0.2), n_estimators (100–1,000)

• XGBoost: n_estimators (100–1,000), colsample_bytree (0.5–1), max_depth (3–30), subsample (0.5–1)

• AdaBoost: n_estimators (50–500), learning_rate (0.01–1)

• MLP: hidden_layer_size (tuple: (50–300, 1–3 layers)).
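The resampling step described before the hyperparameter list combines two ideas: SMOTE creates synthetic minority samples by interpolating between a minority point and one of its nearest minority neighbours, and Tomek links flag pairs of opposite-class points that are each other's nearest neighbour. The sketch below is a from-scratch illustration of both ideas with invented points; it is not the imbalanced-learn implementation used in the study, and the function names are hypothetical.

```python
import math
import random
from typing import List, Tuple

Point = List[float]


def smote(minority: List[Point], n_new: int, k: int = 5, seed: int = 0) -> List[Point]:
    """Create n_new synthetic points on segments between minority neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        x = rng.choice(minority)
        # The k nearest minority neighbours of x, excluding x itself.
        neighbours = sorted(
            (p for p in minority if p is not x), key=lambda p: math.dist(x, p)
        )[:k]
        nb = rng.choice(neighbours)
        u = rng.random()  # position along the segment from x to nb
        synthetic.append([a + u * (b - a) for a, b in zip(x, nb)])
    return synthetic


def tomek_links(points: List[Point], labels: List[int]) -> List[Tuple[int, int]]:
    """Index pairs that are mutual nearest neighbours with different labels."""

    def nearest(i: int) -> int:
        return min(
            (j for j in range(len(points)) if j != i),
            key=lambda j: math.dist(points[i], points[j]),
        )

    links = []
    for i in range(len(points)):
        j = nearest(i)
        if i < j and nearest(j) == i and labels[i] != labels[j]:
            links.append((i, j))
    return links


new_points = smote([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]], n_new=4, k=2)
links = tomek_links([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]], [0, 1, 1, 1])
```

Resampling of this kind is normally applied to the training folds only, so that the held-out fold keeps the original class balance.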

The performance of the predictive models was assessed using various metrics, including the ROC curve, area under the curve (AUC), accuracy, sensitivity, specificity, recall, and F1 score.
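The threshold-based metrics listed above all derive from the four confusion-matrix counts, and the AUC can be read as the probability that a randomly chosen non-survivor receives a higher score than a randomly chosen survivor. A minimal sketch, with invented labels and scores:

```python
from typing import Dict, List


def classification_metrics(y_true: List[int], y_pred: List[int]) -> Dict[str, float]:
    """Accuracy, sensitivity (recall), specificity, precision, and F1 from binary labels."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    denom = precision + sensitivity
    return {
        "accuracy": (tp + tn) / len(y_true),
        "sensitivity": sensitivity,  # identical to recall
        "specificity": specificity,
        "precision": precision,
        "f1": 2 * precision * sensitivity / denom if denom else 0.0,
    }


def auc(y_true: List[int], scores: List[float]) -> float:
    """AUC as the share of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Invented labels: 1 = died within 28 days, 0 = survived.
m = classification_metrics([1, 1, 1, 0, 0, 0, 0, 1], [1, 1, 0, 0, 0, 1, 0, 1])
a = auc([1, 0, 1, 0], [0.9, 0.8, 0.3, 0.1])
```

Sensitivity and recall are the same quantity, so a table reporting both should show identical values in those two columns.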

In the realm of predictive model interpretation, SHAP serves as a robust tool for elucidating machine learning algorithms (Lv et al., 2023; Zhuo et al., 2023). Grounded in the Shapley value from game theory, SHAP seeks to clarify the contribution of each feature to the prediction outcomes. This approach mitigates the black box nature of machine learning models and improves their interpretability. In our study, we calculated and visualized the SHAP values for the SVC model, which demonstrated the highest predictive capability, as indicated by its AUC score.
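The Shapley value underlying SHAP can be computed exactly for a very small model by averaging each feature's marginal contribution over all subsets of the other features. The sketch below is a from-scratch illustration with a toy linear model and a single all-zero baseline, both of which are assumptions for illustration; the SHAP library uses a background dataset and approximations, and this is not the computation performed in the study.

```python
from itertools import combinations
from math import factorial
from typing import Callable, List, Sequence


def shapley_values(
    predict: Callable[[List[float]], float],
    x: Sequence[float],
    baseline: Sequence[float],
) -> List[float]:
    """Exact Shapley values for one instance; 'absent' features take the baseline value."""
    n = len(x)

    def value(subset: set) -> float:
        z = [x[i] if i in subset else baseline[i] for i in range(n)]
        return predict(z)

    phi = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                total += weight * (value(s | {i}) - value(s))
        phi.append(total)
    return phi


def toy_model(z: List[float]) -> float:
    """Hypothetical risk score: linear in three made-up features."""
    return 2.0 * z[0] + 3.0 * z[1] - 1.0 * z[2]


phi = shapley_values(toy_model, x=[1.0, 2.0, 3.0], baseline=[0.0, 0.0, 0.0])
```

For a linear model each value reduces to the coefficient times the feature's distance from the baseline, and the values sum to the difference between the prediction for the instance and the prediction for the baseline, which is the additivity property that SHAP plots rely on.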

3 Results

This study comprised 905 sepsis patients with ARDS who met the criteria of the new global definition (referred to as the new global definition group) and 589 sepsis patients with ARDS who met the Berlin definition (referred to as the Berlin definition group). Based on their 28-day survival status after ICU admission, the patients were categorized into two groups: a survival group and a non-survival group.

3.1 Baseline characteristics

Table 1 presents the distribution of patients according to varying degrees of disease severity in both the new global definition group and the Berlin definition group. In the new global definition group, there were 102 patients (11.27%) classified as mild, 278 patients (30.72%) as moderate, and 525 patients (58.01%) as severe, with a total of 336 ICU deaths (37.13%) occurring within 28 days. In contrast, the Berlin definition group consisted of 58 patients (9.85%) with mild symptoms, 208 patients (35.31%) with moderate symptoms, and 323 patients (54.84%) with severe symptoms, resulting in 228 ICU deaths (38.71%) at 28 days. Compared with the Berlin definition, the new global definition classified a slightly higher proportion of patients as severe (58.01% vs. 54.84%) and fewer as moderate (30.72% vs. 35.31%). This indicates a modest shift in severity stratification under the new definition.

Table 1

Table 1. Classification of ARDS severity and 28-day ICU mortality.

Additionally, by taking the intersection of the two cohorts, we identified 538 patients who required invasive mechanical ventilation and had both PaO2/FiO2 and SpO2/FiO2 indices available. We then compared the 28-day ICU mortality rates among patients with varying severity levels as determined by these indices (see Table 2). A chi-square test comparing the mortality rates of the subsets yielded a p-value of 0.597. This indicates that there was no statistically significant difference in mortality rates between the two diagnostic criteria for ARDS severity classification, which aligns with the findings reported by Qian et al. (2024).

Table 2

Table 2. Comparison of mortality rates for ARDS with invasive ventilation.

Finally, we compared the mortality rates between the invasive mechanical ventilation subgroup and the oxygen-only subgroup under the new global definition, as presented in Table 3. The chi-square test yielded a p-value of 0.439. In contrast to the findings of Qian et al. (2024), our analysis indicated that the new global definition criteria neither underestimated nor overestimated the mortality rate of sepsis patients with ARDS who received supplemental oxygen therapy.

Table 3

Table 3. Comparison of mortality rates for New Global Definition Group.

Table 4 presents the baseline characteristics of patients in the new global definition group, encompassing essential clinical data, vital signs, laboratory test results, clinical comorbidities, and records of advanced life support therapy. The overall mortality rate for this group was 37.13%. In the univariate analysis, significant differences were observed between the survival and non-survival groups in various factors, including age, weight, urine output, mean pulse oxygen saturation, mean arterial pressure, body temperature, pH, arterial oxygen partial pressure, lactate levels, oxygenation index, and the SpO2/FiO2 ratio (all P < 0.001).

Table 4

Table 4. Baseline characteristics of patients in New Global Definition Group.

3.2 Feature selection

In the feature selection and screening section, we employed Lasso regression with cross-validation (Lasso CV) to evaluate various features. Lasso regression is a linear regression technique utilized for both feature selection and regularization. Its effectiveness in feature filtering primarily relies on examining the coefficients assigned to each feature within the model. Features with coefficients of zero are deemed to make no contribution to the model’s predictive power. Furthermore, Lasso regression addresses the issue of feature collinearity to some extent, with the lambda value in the regression equation governing the strength of regularization.

Lasso CV integrates Lasso regression with cross-validation, automatically exploring different lambda values and using cross-validation to identify the optimal one (scikit-learn names this parameter alpha). This process balances model complexity against fit, as illustrated in Figure 3, while also ranking the features according to their importance. We initially selected 64 candidate features based on clinical relevance identified through literature review and consultation with two intensivists, data availability in both internal and external datasets, and their accessibility within the first 24 h of ICU admission. These features covered demographics, vital signs, laboratory indicators, ventilator parameters, severity scores (e.g., SAPS II, SOFA), and comorbidities. Subsequently, we identified 37 features with non-zero coefficients using the Lasso CV algorithm.

Figure 3. Feature selection using Lasso regression with cross-validation. (A) Determination of the optimal lambda value; (B) The variation of variable coefficients with the lambda value; the black dashed line indicates the coefficients of each variable at the optimal lambda value; (C) The ranking of variable coefficients.

To limit model complexity and reduce the risk of overfitting, we selected the top 15 features by absolute coefficient value for inclusion in the machine learning model. These features comprised admission age, average SpO2, average body temperature, red blood cell distribution width (RDW), comorbid metastatic solid tumors, lactate level, urine output, average respiratory rate, international normalized ratio (INR), alkaline phosphatase, average heart rate, mean corpuscular volume (MCV), the Logistic Organ Dysfunction Score (LODS), comorbid rheumatic diseases, and platelet count.
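The two-step selection described above (cross-validated Lasso, then ranking by absolute coefficient) can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes scikit-learn, uses synthetic data in place of the study cohort, and the names `feature_names` and `k` are hypothetical. Note that scikit-learn calls the penalty strength `alpha`, which corresponds to the lambda of the text.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LassoCV
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the candidate-feature matrix (NOT the study data).
X, y = make_classification(
    n_samples=400, n_features=20, n_informative=6, random_state=0
)
feature_names = [f"feature_{i}" for i in range(X.shape[1])]

# Standardize so that coefficient magnitudes are comparable across features.
X_std = StandardScaler().fit_transform(X)

# LassoCV fits the regularization path and keeps the penalty strength
# (lambda in the text, `alpha_` in scikit-learn) with the lowest mean CV error.
lasso = LassoCV(cv=5, random_state=0).fit(X_std, y)

# Features whose coefficient was shrunk exactly to zero are dropped.
nonzero = np.flatnonzero(lasso.coef_)

# Rank the surviving features by absolute coefficient and keep the top k.
k = 5  # illustrative only; the paper keeps 15 of 37 non-zero features
order = nonzero[np.argsort(-np.abs(lasso.coef_[nonzero]))]
top_k = [feature_names[i] for i in order[:k]]

print(f"chosen penalty strength: {lasso.alpha_:.4f}")
print(f"non-zero coefficients: {nonzero.size} of {X.shape[1]}")
print("top features:", top_k)
```

Standardizing before the fit matters here: without it, the ranking by absolute coefficient would mostly reflect the measurement units of each feature rather than its contribution.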

3.3 Model performance comparison

We used the 15 selected features to construct machine learning models with 8 different algorithms. Hyperparameters were tuned through five repetitions of 5-fold nested cross-validation, which yields 25 independent models per algorithm, with the aim of maximizing the models’ generalization ability. To comprehensively evaluate model performance, we calculated the average values and 95% confidence intervals for the area under the curve (AUC), F1 score, recall, precision, accuracy, sensitivity, and specificity of the 8 machine learning models, as detailed in Table 5.
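A minimal sketch of repeated nested cross-validation is given below, assuming scikit-learn and synthetic data. It is not the study pipeline: a small grid search stands in for the Bayesian optimization used in the paper, resampling with SMOTETomek is omitted, and the parameter grid is hypothetical. The outer loop (5 folds, 5 repetitions) produces 25 held-out AUC estimates, while the inner loop tunes hyperparameters using training data only.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import (
    GridSearchCV,
    RepeatedStratifiedKFold,
    StratifiedKFold,
    cross_val_score,
)
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for the 15 selected features (NOT the study data).
X, y = make_classification(
    n_samples=300, n_features=15, n_informative=6, random_state=0
)

# Scaling lives inside the pipeline so it is re-fitted on each training fold,
# which avoids leaking information from the held-out fold.
pipe = make_pipeline(StandardScaler(), SVC(kernel="rbf"))

# Hypothetical search space; grid search replaces Bayesian optimization here.
param_grid = {"svc__C": [0.1, 1.0, 10.0], "svc__gamma": ["scale", 0.01]}

inner_cv = StratifiedKFold(n_splits=3, shuffle=True, random_state=0)
outer_cv = RepeatedStratifiedKFold(n_splits=5, n_repeats=5, random_state=0)

# Inner loop: choose hyperparameters by cross-validated AUC.
search = GridSearchCV(pipe, param_grid, scoring="roc_auc", cv=inner_cv)

# Outer loop: each of the 25 splits refits the search and scores it on
# data that played no part in tuning.
outer_auc = cross_val_score(search, X, y, scoring="roc_auc", cv=outer_cv)

print(f"outer-fold models: {outer_auc.size}")
print(f"mean AUC: {outer_auc.mean():.3f}")
```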

Table 5. Prediction performance of the eight machine learning algorithms.

As shown in Table 5, the SVC model achieved the highest AUC among the eight machine learning models assessed, 0.792 (95% CI: 0.76–0.84), with the MLP model following closely behind. The other evaluation metrics, including the F1 score, recall, accuracy, precision, sensitivity, and specificity, also indicate that the SVC model generally outperforms the other models.

To further compare and visualize the performance and clinical applicability of each model, we plotted receiver operating characteristic (ROC) curves, decision curve analysis (DCA) curves, and calibration curves (Figure 4). The ROC curve assesses the discriminative ability of the model across classification thresholds. The DCA curve shows the net benefit of each model across threshold probabilities. The calibration curve evaluates how closely the predicted probabilities agree with the observed outcome frequencies, so that the model outputs can be reliably interpreted as actual probabilities.
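For readers less familiar with decision curve analysis, the quantity plotted on its vertical axis is the net benefit, which at a threshold probability p_t equals TP/N − (FP/N) × p_t/(1 − p_t). The sketch below computes it in plain Python under that standard definition; the function names and the four-patient example are hypothetical and are not taken from the study.

```python
def net_benefit(y_true, y_prob, threshold):
    """Net benefit of intervening on patients with predicted risk >= threshold."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    n = len(y_true)
    # True and false positives among patients flagged at this threshold.
    tp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 1)
    fp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 0)
    # Odds of the threshold: how many false positives one true positive is worth.
    weight = threshold / (1.0 - threshold)
    return tp / n - (fp / n) * weight


def net_benefit_treat_all(y_true, threshold):
    """Reference strategy: intervene on every patient regardless of the model."""
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)


# Hypothetical toy cohort: 1 = died within 28 days, 0 = survived.
outcomes = [1, 1, 0, 0]
predicted_risk = [0.9, 0.4, 0.6, 0.1]

for pt in (0.2, 0.5):
    model_nb = net_benefit(outcomes, predicted_risk, pt)
    all_nb = net_benefit_treat_all(outcomes, pt)
    print(f"threshold={pt:.1f} model={model_nb:.4f} treat-all={all_nb:.4f}")
```

A model is clinically useful at a given threshold only when its net benefit exceeds both the treat-all value and zero, the net benefit of treating no one.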

Figure 4. ROC (A), DCA (B) and calibration curves (C) comparison of eight models.

Among the eight machine learning models evaluated, the SVM model exhibited relatively strong and stable performance. One possible explanation lies in the characteristics of SVM: it relies on margin maximization and distance-based computation, which makes it particularly effective when data are well-normalized and high-dimensional. Because all features in this study were standardized prior to modeling, this may have helped the SVM find an optimal separating hyperplane. Furthermore, the SVM’s capacity to handle non-linear boundaries via kernel tricks may also have contributed to its competitiveness in mortality prediction.

3.4 External validation

We conducted an external validation of the SVC model on 100 sepsis patients with ARDS who met the criteria of the new global definition and were admitted to Xuzhou Medical University Affiliated Hospital between March 2022 and October 2024 (please refer to the Supplementary Materials for a baseline comparison of the external validation cohort). Importantly, the data from the external validation cohort and the training cohort do not overlap, which allows a fairer assessment of the model’s generalization and predictive capabilities in real-world scenarios. The performance of the SVC model in the external validation cohort is presented in Table 6 and Figure 5; the model continued to demonstrate strong overall performance. This indicates that the predictive model developed using machine learning methods can forecast 28-day ICU mortality for patients with sepsis complicated by ARDS, as defined by the new global definition, in clinical practice.

Table 6. Prediction performance of the SVC model in the external validation cohort.

Figure 5. ROC (A), DCA (B) and calibration curves (C) of the SVM model in the external validation set.

3.5 Interpretability analysis

Given the outstanding performance of the SVC model, we computed and visualized the SHAP values to elucidate the influence of each variable on the outcomes predicted by this model. First, we examined the overall interpretability of the model by calculating the average SHAP value for each feature and ranking their importance (see Figure 6A). This analysis illustrates the overall distribution of the impact that each feature has on the model’s output.
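The global ranking step can be illustrated with a short plain-Python sketch. It assumes the common convention of ranking features by the mean absolute SHAP value across samples; the text says only "average SHAP value", so the use of the absolute value is an assumption. The function name, feature names and SHAP matrix below are hypothetical and are not study results.

```python
def rank_by_mean_abs_shap(shap_rows, feature_names):
    """Rank features by mean |SHAP| across samples (one row per sample)."""
    n = len(shap_rows)
    importance = {
        name: sum(abs(row[j]) for row in shap_rows) / n
        for j, name in enumerate(feature_names)
    }
    # Largest average magnitude first, as in a feature importance bar chart.
    return sorted(importance.items(), key=lambda item: item[1], reverse=True)


# Hypothetical SHAP values for three patients and three features.
names = ["age", "urine_output", "lactate"]
shap_values = [
    [0.30, -0.10, 0.05],
    [-0.20, 0.15, 0.05],
    [0.40, -0.05, -0.02],
]

for name, score in rank_by_mean_abs_shap(shap_values, names):
    print(f"{name}: {score:.3f}")
```

Taking the absolute value before averaging prevents positive and negative contributions from cancelling, so a feature that pushes predictions strongly in both directions still ranks as important.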

Figure 6. From a global perspective, we calculated the average SHAP value for each feature and used a beeswarm plot to display the distribution of features and SHAP values. (A) Feature importance plot; (B) Beeswarm plot.

The beeswarm plot (see Figure 6B) further displays the distribution of the data by stacking data points that share the same horizontal position. In this plot, the X-axis represents the SHAP values of the features, while the colors indicate the magnitude of the feature values: red signifies larger feature values, and blue indicates smaller ones. Each point corresponds to one sample’s feature value and SHAP value; thus, the farther a point lies from zero along the X-axis, the greater its impact on the model output. Additionally, the density of points reveals the distribution of the data.

Moreover, the relationship between the color of the points (which represents the size of the feature values) and the SHAP values indicates the direction of the feature’s effect. For instance, larger values of age are associated with more positive SHAP values, that is, a greater contribution toward predicted mortality, while urine output exhibits the opposite effect.

From this analysis, we can conclude that age, red blood cell distribution width (RDW), presence of metastatic tumors, the Logistic Organ Dysfunction Score (LODS), blood lactate level, international normalized ratio (INR), mean corpuscular volume (MCV), average heart rate, alkaline phosphatase, and average respiratory rate are positively correlated with 28-day mortality. Conversely, other indicators, including urine output and average body temperature, show a negative correlation with 28-day mortality. From a clinical perspective, adequate urine output suggests preserved renal perfusion and responsiveness to fluid resuscitation, both of which are favorable prognostic indicators in critically ill septic patients. Similarly, fever is typically a manifestation of an active inflammatory response. Previous studies have shown that moderate hyperthermia may be protective in sepsis (Beverly et al., 2016), whereas hypothermia is often linked to immune suppression and increased mortality. Therefore, these findings are biologically plausible and consistent with current understanding of sepsis pathophysiology.

Secondly, we investigated the linear and nonlinear relationships between individual features and prognostic outcomes. To this end, we created scatter plots of SHAP values against feature values for the 13 quantitative features (that is, the 15 selected features excluding rheumatic diseases and metastatic solid tumors). We then fitted LOWESS (locally weighted scatterplot smoothing) curves to visualize the trend of the data. As depicted in Figure 7, the yellow curve is the fitted curve, and the intersection point, where the SHAP value equals zero, is marked with a blue dashed line alongside the corresponding feature value.

Figure 7. Linear or nonlinear effects of quantitative type data on prognostic outcomes.

Using age as an example, we observed a nonlinear relationship between age and 28-day ICU mortality. As age increases, its contribution to the model output transitions from negative to positive, with the intersection point occurring at 65.71 years. This implies that an age above 65.71 years acts as a risk factor for 28-day mortality.
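One way to locate such an intersection point is to take the fitted curve (for example, the points returned by a LOWESS smoother) and interpolate linearly between the two neighbouring points where the SHAP value changes sign. The plain-Python sketch below shows that step only; the function name and the curve values are hypothetical and do not reproduce the 65.71-year result.

```python
def zero_crossings(feature_values, fitted_shap):
    """Feature values at which a fitted SHAP curve crosses zero."""
    points = sorted(zip(feature_values, fitted_shap))
    crossings = []
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        if y0 == 0.0:
            # The curve touches zero exactly at a fitted point.
            crossings.append(x0)
        elif y0 * y1 < 0:
            # Sign change: interpolate linearly between the two points.
            crossings.append(x0 + (0.0 - y0) * (x1 - x0) / (y1 - y0))
    if points and points[-1][1] == 0.0:
        crossings.append(points[-1][0])
    return crossings


# Hypothetical smoothed curve of SHAP value against age (years).
ages = [40, 50, 60, 70, 80]
smoothed_shap = [-0.30, -0.20, -0.10, 0.10, 0.30]

print(zero_crossings(ages, smoothed_shap))
```

Because the result depends on the smoother's bandwidth and on the sampled cohort, a threshold obtained this way is best read as an approximate turning point rather than a precise clinical cut-off.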

4 Discussion

In this study, we employed machine learning techniques to develop and validate a predictive model for the prognosis of sepsis patients with ARDS who met the new global definition. Our model is built upon a comprehensive analysis of 64 patient features, which include basic clinical data, vital signs, laboratory test results collected within the first 24 h of ICU admission, records of advanced life support treatments, and clinical comorbidities.

To address missing data, we used the miceforest multiple imputation method. By integrating the Lasso cross-validation method with feature importance ranking, we ultimately selected 15 key features for constructing the machine learning model. During the model development phase, we employed the SMOTETomek resampling technique to balance the dataset and used Bayesian optimization to fine-tune the model’s hyperparameters. Additionally, nested cross-validation was applied to enhance the generalization ability of the various models.

Among the eight machine learning models assessed, the Support Vector Classifier (SVC) exhibited the best performance. We further elucidated the key features influencing the prognosis of sepsis patients with ARDS using SHAP values and visualization graphs. The final ROC curve and calibration curve indicate that the SVC model outperforms other models in terms of prediction accuracy. However, it is important to note that a high-performing machine learning model does not always translate into clinical usefulness.

To evaluate and compare the clinical utility of the predictive models, we also generated DCA curves. We tested the predictive performance of the SVC model in an external validation cohort and confirmed its strong performance in real-world settings. Overall, our prognosis prediction model for sepsis complicated by ARDS, based on the SVC model, demonstrates robust performance and significant clinical applicability.

Secondly, we used SHAP values to interpret the final machine learning prediction model. The feature importance plot illustrates the overall influence of each feature on the predicted outcome. Meanwhile, the beeswarm plot depicts the distribution of features along with the direction of their impact on the predicted results. Additionally, the combination of fitted curves and SHAP values highlights the intricate relationships between individual features and outcomes, thereby facilitating more informed clinical decision-making.

In this study, we investigated the factors influencing the 28-day mortality rate of ICU patients with sepsis and ARDS who meet the new global definition. From the feature importance plot, we identified the five most significant factors affecting patient prognosis: red blood cell distribution width (RDW), presence of metastatic solid tumours, age, Logistic Organ Dysfunction Score (LODS), and urine output. Our results demonstrate a positive correlation between elevated RDW levels and increased 28-day mortality. RDW serves as an indicator of the variation in red blood cell volume, and numerous studies have established that a high RDW is linked to adverse outcomes in various diseases, including ARDS, cardiovascular diseases, autoimmune disorders, and malignancies (Xanthopoulos et al., 2022; Wang et al., 2019; Arkew et al., 2022; Deng et al., 2021). A recent study also highlighted the strong association between high RDW and negative outcomes in sepsis, which aligns with our findings. This correlation may stem from the inflammatory response associated with sepsis, microcirculatory dysfunction leading to shortened red blood cell lifespan, and disruptions in iron metabolism (Lorente et al., 2021).

Moreover, age plays a critical role in patient prognosis, which is easily understandable. Factors such as diminished immune function, malnutrition, and organ dysfunction can contribute to the elevated mortality risk observed in older patients suffering from sepsis combined with ARDS. The hypoxia and microcirculatory dysfunction induced by sepsis in conjunction with ARDS can result in inadequate oxygen supply to organs, leading to necrosis of renal tubular epithelial cells and subsequent renal dysfunction (Lankadeva et al., 2019). Additionally, urine output is a key indicator of microcirculatory function; thus, oliguria is a significant risk factor for mortality in patients with sepsis and ARDS.

Blood lactate levels, which indicate microcirculatory dysfunction and tissue hypoxia, are also positively correlated with adverse outcomes. The LODS score is utilized to evaluate the severity of organ dysfunction in ICU patients. While we included the SAPSII score and SOFA score in our analysis, LODS appears to have a more substantial impact on outcome prediction compared to the other two measures. Furthermore, high Mean Corpuscular Volume (MCV) is positively associated with adverse outcomes in our model. Although there is currently no literature directly linking MCV to sepsis, studies indicate that the combination of MCV and RDW can enhance the predictive accuracy for sepsis prognosis (Zhang et al., 2023).

The International Normalized Ratio (INR), which reflects coagulation function, is also closely related to poor prognoses. Similar to the findings of Schupp et al., our results indicate a positive correlation between high INR and mortality outcomes in sepsis patients (Schupp et al., 2022). The INR holds significant value in the early screening, diagnosis, and prognosis of sepsis-related coagulation disorders (Lyons et al., 2018; Zhang et al., 2021). Additionally, basic vital sign indicators—such as body temperature, blood oxygen saturation, average heart rate, and respiratory rate—are closely associated with the prognosis of sepsis patients with ARDS. Higher blood oxygen saturation indicates better preservation of lung oxygenation function. In our model, blood oxygen saturation appears to be a more effective predictor of outcomes in sepsis and ARDS patients than the oxygenation index. Although ARDS diagnosis and severity primarily depend on the oxygenation index, the continuous, cost-effective, and non-invasive nature of blood oxygen saturation measurement, along with its derived SpO2/FiO2 index, plays a crucial role in assessing ARDS severity (Wick et al., 2022).

Additionally, hypothermia was identified as a risk factor for patient mortality in this study, with the onset of hypothermia within 24 h of ICU admission potentially linked to 28-day mortality, mirroring the findings of Han et al. (2024) and Beverly et al. (2016). Lastly, we observed that the presence of metastatic tumours may pose a significant risk for 28-day mortality. Research indicates that cancer patients are at a higher risk of developing sepsis, with increased mortality rates following sepsis onset (Liu M. A. et al., 2021). This heightened risk is believed to result from immune dysfunction due to the tumour itself and/or cancer treatments (Williams et al., 2023). Further research is necessary to ascertain whether the notable impact of metastatic tumours in our model correlates with more severe immune dysfunction, aggressive anti-tumour therapies, or poorer nutritional status in these patients.

Conversely, our findings suggest that rheumatic diseases may act as protective factors in sepsis combined with ARDS. However, the relationship between rheumatic diseases and sepsis remains unclear. For instance, Li H et al. found in observational studies that rheumatic diseases did not correlate with an increased 28-day mortality rate in sepsis patients, except for rheumatoid arthritis, which showed a strong association with sepsis onset (Li et al., 2023). It is possible that our findings are influenced by biases related to the small sample size included in our study.

However, our research does have certain limitations. Firstly, the patient data in the database we utilized for model training primarily originated from Western countries, which differs significantly from our external validation cohort. Secondly, we focused solely on commonly used clinical data for model construction and did not perform a more comprehensive analysis of the database, potentially leading to the omission of some critical details. Finally, our study is observational and retrospective, which may introduce potential errors or biases. Nevertheless, our model demonstrated strong predictive performance in the external validation cohort.

5 Conclusion

In summary, machine learning methods serve as reliable tools for predicting the prognosis of sepsis patients with ARDS. Working from the current global definition of ARDS, we developed a machine learning clinical prediction model specifically for this patient group. Additionally, we employed model explanation techniques to interpret the underlying information of the SVC model. This approach has the potential to enhance clinical practice, assisting clinicians in developing precise, personalized treatments aimed at maximizing the survival rates of sepsis patients with ARDS.

Data availability statement

The data analyzed in this study is subject to the following licenses/restrictions: Hospital policy restrictions. Requests to access these datasets should be directed to emhhb25qZG9jQDE2My5jb20=.

Ethics statement

The studies involving humans were approved by Medical Ethics Committee of the Affiliated Hospital of Xuzhou Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

PZ: Investigation, Visualization, Conceptualization, Methodology, Writing – original draft. SY: Investigation, Writing – original draft, Conceptualization, Visualization, Methodology. SZ: Methodology, Visualization, Investigation, Conceptualization, Writing – original draft. ZhY: Data curation, Writing – original draft. ZiY: Data curation, Writing – original draft. LL: Data curation, Writing – original draft. HY: Writing – original draft, Data curation. HP: Funding acquisition, Writing – review and editing, Project administration, Supervision. HL: Funding acquisition, Project administration, Supervision, Writing – review and editing. NZ: Supervision, Writing – review and editing, Project administration, Funding acquisition.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This research was funded by Construction Project of High Level Hospital of Jiangsu Province (GSPJS202419; GSPJS202425, NZ, Xianliang Yan), Xuzhou National Clinical Key Specialty Cultivation Project (2018ZK004, Xianliang Yan), The excellent young and middle-age talents project of the affiliated hospital of Xuzhou medical university (2019128009, NZ), Research project at the hospital level of the affiliated hospital of Xuzhou medical university (2023ZL28, SZ); Xuzhou Medical University Affiliated Hospital “Pairing Assistance” Scientific Research Project (FXJDBF2024215, NZ) Concept Validation Project of Xuzhou Medical University in 2024 (GNYZ2024010, NZ); Innovation and Entrepreneurship Education Practice Center Project of Xuzhou Medical University Science Park in 2024 (NZ); Xuzhou key research and development plan (social development) project—general medical and health project (KC22236, NZ).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Arkew M., Gemechu K., Haile K., Asmerom H. (2022). Red blood cell distribution width as novel biomarker in cardiovascular diseases: a literature review. J. Blood Med. 13, 413–424. doi:10.2147/jbm.S367660

Bauer M., Gerlach H., Vogelmann T., Preissing F., Stiefel J., Adam D. (2020). Mortality in sepsis and septic shock in Europe, North America and Australia between 2009 and 2019— results from a systematic review and meta-analysis. Crit. Care 24 (1), 239. doi:10.1186/s13054-020-02950-2

Beverly A., Walter E., Carraretto M. (2016). Management of hyperthermia and hypothermia in sepsis: a recent survey of current practice across UK intensive care units. J. Intensive Care Soc. 17 (1), 88–89. doi:10.1177/1751143715601124

Deng J., Xu S., Gao X., Xu S., Shuai Z., Pan F. (2021). Red cell distribution width and mean platelet volume in patients with ankylosing spondylitis: a systematic review and meta-analysis. J. Clin. Rheumatol. 27 (7), 292–297. doi:10.1097/rhu.0000000000001174

Englert J. A., Bobba C., Baron R. M. (2019). Integrating molecular pathogenesis and clinical translation in sepsis-induced acute respiratory distress syndrome. JCI Insight 4 (2), e124061. doi:10.1172/jci.insight.124061

Eworuke E., Major J. M., Gilbert McClain L. I. (2018). National incidence rates for acute respiratory distress syndrome (ARDS) and ARDS cause-specific factors in the United States (2006-2014). J. Crit. Care 47, 192–197. doi:10.1016/j.jcrc.2018.07.002

Fan Z., Jiang J., Xiao C., Chen Y., Xia Q., Wang J., et al. (2023). Construction and validation of prognostic models in critically ill patients with sepsis-associated acute kidney injury: interpretable machine learning approach. J. Transl. Med. 21 (1), 406. doi:10.1186/s12967-023-04205-4

Han D., Kang S. H., Um Y. W., Kim H. E., Hwang J. E., Lee J. H., et al. (2024). Temperature trajectories and mortality in hypothermic sepsis patients. Am. J. Emerg. Med. 84, 18–24. doi:10.1016/j.ajem.2024.07.030

Lankadeva Y. R., Okazaki N., Evans R. G., Bellomo R., May C. N. (2019). Renal medullary hypoxia: a new therapeutic target for septic acute kidney injury? Seminars Nephrol. 39 (6), 543–553. doi:10.1016/j.semnephrol.2019.10.004

Li H., Pan X., Zhang S., Shen X., Li W., Shang W., et al. (2023). Association of autoimmune diseases with the occurrence and 28-day mortality of sepsis: an observational and Mendelian randomization study. Crit. Care 27 (1), 476. doi:10.1186/s13054-023-04763-5

Liu Y., Zhou S., Xiang D., Ju L., Shen D., Wang X., et al. (2021). Friend or foe? The roles of antioxidants in acute lung injury. Antioxidants 10 (12), 1956. doi:10.3390/antiox10121956

Liu M. A., Bakow B. R., Hsu T. C., Chen J. Y., Su K. Y., Asiedu E. K., et al. (2021). Temporal trends in sepsis incidence and mortality in patients with cancer in the US population. Am. J. Crit. Care 30 (4), e71–e79. doi:10.4037/ajcc2021632

Lorente L., Martín M. M., Argueso M., Solé-Violán J., Perez A., Marcos Y Ramos J. A., et al. (2021). Association between red blood cell distribution width and mortality of COVID-19 patients. Anaesth. Crit. Care and Pain Med. 40 (1), 100777. doi:10.1016/j.accpm.2020.10.013

Lv K., Liang Q. (2025). Macrophages in sepsis-induced acute lung injury: exosomal modulation and therapeutic potential. Front. Immunol. 15, 1518008. doi:10.3389/fimmu.2024.1518008

Lv J., Zhang M., Fu Y., Chen M., Chen B., Xu Z., et al. (2023). An interpretable machine learning approach for predicting 30-day readmission after stroke. Int. J. Med. Inf. 174, 105050. doi:10.1016/j.ijmedinf.2023.105050

Lyons P. G., Micek S. T., Hampton N., Kollef M. H. (2018). Sepsis-associated coagulopathy severity predicts hospital mortality. Crit. Care Med. 46 (5), 736–742. doi:10.1097/ccm.0000000000002997

Matthay M. A., Arabi Y., Arroliga A. C., Bernard G., Bersten A. D., Brochard L. J., et al. (2024). A new global definition of acute respiratory distress syndrome. Am. J. Respir. Crit. Care Med. 209 (1), 37–47. doi:10.1164/rccm.202303-0558WS

Pappada S. M., Owais M. H., Feeney J. J., Salinas J., Chaney B., Duggan J., et al. (2024). Development and validation of a sepsis risk index supporting early identification of ICU-Acquired sepsis: an observational study. Anaesth. Crit. Care Pain Med. 43 (6), 101430. doi:10.1016/j.accpm.2024.101430

Qian F., van den Boom W., See K. C. (2024). The new global definition of acute respiratory distress syndrome: insights from the MIMIC-IV database. Intensive Care Med. 50 (4), 608–609. doi:10.1007/s00134-024-07383-x

Ranieri V. M., Rubenfeld G. D., Thompson B. T., Ferguson N. D., Caldwell E., Fan E., et al. (2012). Acute respiratory distress syndrome: the Berlin definition. JAMA 307 (23), 2526–2533. doi:10.1001/jama.2012.5669

Schupp T., Weidner K., Rusnak J., Jawhar S., Forner J., Dulatahu F., et al. (2022). Diagnostic and prognostic significance of the prothrombin time/international normalized ratio in sepsis and septic shock. Clin. Appl. Thromb. Hemost. 28, 10760296221137893. doi:10.1177/10760296221137893

Singer M., Deutschman C. S., Seymour C. W., Shankar-Hari M., Annane D., Bauer M., et al. (2016). The third international consensus definitions for sepsis and septic shock (Sepsis-3). JAMA 315 (8), 801–810. doi:10.1001/jama.2016.0287

Wang P. F., Song S. Y., Guo H., Wang T. J., Liu N., Yan C. X. (2019). Prognostic role of pretreatment red blood cell distribution width in patients with cancer: a meta-analysis of 49 studies. J. Cancer 10 (18), 4305–4317. doi:10.7150/jca.31598

Whitsett J. A., Wert S. E., Weaver T. E. (2015). Diseases of pulmonary surfactant homeostasis. Annu. Rev. Pathol. 10, 371–393. doi:10.1146/annurev-pathol-012513-104644

Wick K. D., Matthay M. A., Ware L. B. (2022). Pulse oximetry for the diagnosis and management of acute respiratory distress syndrome. Lancet Respir. Med. 10 (11), 1086–1098. doi:10.1016/s2213-2600(22)00058-3

Williams J. C., Ford M. L., Coopersmith C. M. (2023). Cancer and sepsis. Clin. Sci. (Lond) 137 (11), 881–893. doi:10.1042/cs20220713

Xanthopoulos A., Giamouzis G., Dimos A., Skoularigki E., Starling R. C., Skoularigis J., et al. (2022). Red blood cell distribution width in heart failure: pathophysiology, prognostic role, controversies and dilemmas. J. Clin. Med. 11 (7), 1951. doi:10.3390/jcm11071951

Xie J., Wang H., Kang Y., Zhou L., Liu Z., Qin B., et al. (2020). The epidemiology of sepsis in Chinese ICUs: a national cross-sectional survey. Crit. Care Med. 48 (3), e209–e218. doi:10.1097/ccm.0000000000004155

Zhang J., Du H. M., Cheng M. X., He F. M., Niu B. L. (2021). Role of international normalized ratio in nonpulmonary sepsis screening: an observational study. World J. Clin. Cases 9 (25), 7405–7416. doi:10.12998/wjcc.v9.i25.7405

Zhang J., Xie J., Chen H., Mo M., Qiu H., Yang Y., et al. (2023). Development and validation of a clinical score combining the sequential organ failure assessment score with inflammation-based markers to predict outcome of patients with sepsis. Am. J. Transl. Res. 15 (3), 1789–1797.

Zhu W., Zhang Y., Wang Y. (2022). Immunotherapy strategies and prospects for acute lung injury: focus on immune cells and cytokines. Front. Pharmacol. 13, 1103309. doi:10.3389/fphar.2022.1103309

Zhuo X., Lv J., Chen B., Liu J., Luo Y., Liu J., et al. (2023). Combining conventional ultrasound and ultrasound elastography to predict HER2 status in patients with breast cancer. Front. Physiol. 14, 1188502. doi:10.3389/fphys.2023.1188502

Keywords: sepsis, ARDS, machine learning, 28-day, ICU mortality

Citation: Zhang P, Yuan S, Zhang S, Yuan Z, Ye Z, Lv L, Yang H, Peng H, Li H and Zhao N (2025) Under the background of the new global definition of ARDS: an interpretable machine learning approach for predicting 28-day ICU mortality in patients with sepsis complicated by ARDS. Front. Physiol. 16:1617196. doi: 10.3389/fphys.2025.1617196

Received: 24 April 2025; Accepted: 08 September 2025;
Published: 19 September 2025.

Edited by:

Savino Spadaro, University of Ferrara, Italy

Reviewed by:

Shuhe Li, University of Exeter, United Kingdom
Yu Wang, Anhui Medical University, China

Copyright © 2025 Zhang, Yuan, Zhang, Yuan, Ye, Lv, Yang, Peng, Li and Zhao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Hui Peng, MTE0MDU4MDkwQHFxLmNvbQ==; Haiquan Li, ODcxMTg2MEBxcS5jb20=; Ningjun Zhao, emhhb25qZG9jQDE2My5jb20=

^†These authors share first authorship