- 1Department of Infectious Diseases and Clinical Microbiology, Medical Faculty, Istanbul Medipol University, Istanbul, Türkiye
- 2Department of Nuclear Medicine, University of Health Science, Gaziosmanpaşa Training ve Research Hospital, Istanbul, Türkiye
- 3Department of Internal Medicine, Medical Faculty, Istanbul Medipol University, Istanbul, Türkiye
- 4Department of Radiology, Medical Faculty, Istanbul Medipol University, Istanbul, Türkiye
- 5Department of Chest Diseases, Medical Faculty, Istanbul Medipol University, Istanbul, Türkiye
Introduction: Previous studies applying machine learning to predict severe respiratory failure in COVID-19 patients have shown inconsistent results due to variations in study populations and predictor variables. This study aimed to predict intensive care unit admission and identify key predictive factors.
Methods: This retrospective cohort study included patients with COVID-19 who presented with mild respiratory failure, most of whom received oxygen via a mask or nasal cannula. Eight machine learning algorithms—XGBoost, support vector machines, neural networks, k-nearest neighbors, random forest, decision trees, logistic regression, and naïve Bayes—were applied to predict intensive care unit admission.
Results: A total of 392 patients (63.5% male, mean age, 55.0 ± 15.3 years) were included in the study. During follow-up, 80 patients (20.4%) required intensive care unit admission. Among them, 320 (81.6%) received steroid therapy, 301 (76.8%) underwent pulse steroid therapy, and 76 (19%) had been vaccinated. The multilayer perceptron, XGBoost, and radial basis function support vector machine models achieved the best overall performance based on ROC-AUC and accuracy values (ROC-AUC: 0.75, 0.70, and 0.71; accuracy: 0.79, 0.79, and 0.79, respectively). The strongest predictors of intensive care unit admission were low lymphocyte count on the first day, as well as high age, ferritin, body mass index, Charlson Comorbidity Index, and computed tomography score.
Conclusion: Machine learning algorithms can reliably predict intensive care unit admission in COVID-19 patients with mild respiratory failure. These models identified key clinical and laboratory factors that may facilitate early risk stratification and guide treatment planning.
Introduction
COVID-19 exhibits a broad clinical spectrum, ranging from asymptomatic infection to severe respiratory failure and death (1–22). Numerous studies have applied machine learning (ML) techniques to predict adverse outcomes such as respiratory deterioration, ICU admission, and mortality in COVID-19 patients (1–22). Reported ROC-AUC values in these studies generally range from 0.669 to 0.99, with most exceeding 0.70. In addition, several well-established clinical scoring systems have been developed to predict COVID-19 prognosis (23–27), with ROC-AUC values typically ranging between 0.72 and 0.91, though most cluster between 0.72 and 0.80.
Although these models and scoring systems have demonstrated variable predictive performance, their generalizability remains limited due to substantial heterogeneity in patient populations, disease severity, predictor selection, and sample size. This variability hampers the comparability of findings and limits the ability to draw broadly applicable conclusions. Moreover, most existing studies have included all hospitalized COVID-19 patients regardless of respiratory status and have relied heavily on variables that directly reflect respiratory failure—such as oxygen saturation and respiratory rate. While these variables may enhance predictive accuracy, they may also obscure the contributions of other important clinical and laboratory parameters. Only a few studies have specifically focused on patients already receiving oxygen therapy and managed on general wards (28), leaving a significant gap in understanding disease progression in this clinically important subgroup.
Additional gaps in the literature include selection bias, which may lead retrospective studies to falsely suggest that steroid use is associated with worse outcomes—even though randomized controlled trials have clearly shown that dexamethasone reduces mortality in oxygen-dependent patients (29, 30). Furthermore, although vaccines are known to reduce both the incidence and severity of COVID-19, their impact on disease progression among patients presenting with mild respiratory failure remains poorly understood.
To address these gaps, the present study aimed to use machine learning algorithms to predict ICU admission among patients with COVID-19 presenting with mild respiratory failure and requiring oxygen therapy on admission. A secondary objective was to identify the most influential predictors of clinical deterioration in this patient population, including demographic characteristics, comorbidities, laboratory markers, radiologic findings, vaccination status, and steroid use while minimizing the impact of selection bias.
Materials and methods
This retrospective cohort study included COVID-19 patients with mild respiratory failure who were admitted to and monitored at Istanbul Medipol University Hospital between January 2020 and January 2022. The study protocol was reviewed and approved by the Institutional Review Board and the Local Ethics Committee of Istanbul Medipol University. The study was conducted in accordance with the ethical standards laid down in the 1964 Declaration of Helsinki and its later amendments, or with comparable ethical standards.
Mild respiratory failure was defined as a respiratory rate of ≥22 breaths per minute at rest, an oxygen saturation level <94% at rest, or a decrease in saturation with minimal exertion. The primary study endpoint was ICU admission. Machine learning algorithms were applied to predict the likelihood of ICU admission. Specific inclusion and exclusion criteria were applied to ensure a well-defined study population.
Inclusion criteria:
• A positive SARS-CoV-2 PCR result from a nasopharyngeal swab obtained within 3 weeks prior to hospital admission.
• Age ≥18 years.
• Hospitalization and treatment for COVID-19 between January 2020 and January 2022.
• Presence of mild respiratory failure.
Exclusion criteria:
• Requirement for high-flow oxygen therapy, continuous positive airway pressure, mechanical ventilation, or extracorporeal membrane oxygenation at initial presentation.
• Admission to the ICU at initial presentation.
• Concomitant conditions unrelated to COVID-19 that could contribute to respiratory failure (e.g., pulmonary edema, pulmonary embolism).
• Absence of respiratory failure at initial presentation.
After applying these exclusion and inclusion criteria, relevant demographic, clinical, and laboratory variables were extracted for analysis.
Variables evaluated in the study
The following data were collected and analyzed from patient records:
1. Demographic Information: Age, sex, and body mass index (BMI).
2. COVID-19 Vaccination Status: Only vaccinations administered at least 1 month prior to symptom onset were considered. Patients were categorized according to vaccination status as follows: one dose of inactivated vaccine, two doses of inactivated vaccine, one dose of mRNA vaccine, two doses of mRNA vaccine, or a combination of one dose of mRNA vaccine and one dose of inactivated vaccine.
3. Comorbidities: Underlying conditions included liver transplantation, cancer with or without metastasis, heart failure, leukemia, myasthenia gravis, lymphoma, rheumatoid arthritis, aplastic anemia, multiple myeloma, cerebrovascular disease, dementia, Parkinson’s disease, hypertension, ischemic heart disease, diabetes mellitus with or without complications, chronic obstructive pulmonary disease, bronchiectasis, asthma, liver cirrhosis, idiopathic pulmonary fibrosis, chronic renal failure (mild or severe), and renal transplantation. Additional variables included pregnancy, immunosuppression, and prior corticosteroid use. The Charlson Comorbidity Index (CCI) was calculated for each patient (31). Severe renal failure was defined as renal transplantation or a serum creatinine level >3 mg/dL. Patients with renal failure who had not undergone renal transplantation and whose creatinine level was ≤3 mg/dL were categorized as having mild renal failure. Patients with diabetes mellitus were considered to have end-organ damage if they had diabetic retinopathy, nephropathy, or neuropathy.
4. Laboratory Parameters: Total neutrophil and lymphocyte counts, as well as serum levels of C-reactive protein (CRP), D-dimer, ferritin, and procalcitonin, measured at the time of initial hospital admission.
5. Treatments and Interventions Related to COVID-19: ICU admission and corticosteroid use. Virus-specific antiviral agents were not available in our country during the study period and, therefore, were not administered.
6. Imaging Findings: Total and regional lung zone computed tomography (CT) scores were recorded (32). To mitigate the risk of multicollinearity among CT score variables, principal component analysis (PCA) was performed. The first principal component (PC1) accounted for 80% of the total variance and represented a structure in which CT scores from the upper, middle, and lower zones of both lungs contributed similarly in both direction and magnitude. Accordingly, PC1 was interpreted as the composite CT score (overall CT burden) and was used as the representative variable in the analyses.
These variables were subsequently incorporated into statistical analyses to evaluate their associations with ICU admission.
Statistical analysis
General group comparisons
Patients with and without ICU admission during follow-up were compared with respect to the variables described above. Statistical analyses were performed using SPSS software (version 16.0; IBM Corp., Armonk, NY, USA). Categorical variables were expressed as frequencies and percentages. Continuous variables with normal distributions were presented as means ± standard deviations, whereas non-normally distributed variables were summarized as medians (range). Group comparisons were performed using the Student’s t-test for normally distributed continuous variables, as these data met the assumptions of normality, and the Mann–Whitney U test for non-normally distributed continuous variables, which does not assume normality. The chi-square test was used to compare categorical variables. A two-tailed p-value <0.05 was considered statistically significant.
To account for potential treatment bias, as pulse steroids were predominantly administered to patients with more severe clinical, laboratory, and radiological findings, comparisons were performed between patients who received pulse steroid therapy and those who did not. Continuous variables with a normal distribution were compared using the Student’s t-test, while non-normally distributed continuous variables were analyzed using the Mann–Whitney U test. Categorical variables were compared using the chi-square test. Variables found to be statistically significant in these comparisons were subsequently included in a logistic regression analysis to determine whether an independent difference existed between patients who received pulse steroids and those who did not.
In addition to traditional statistical methods, machine learning approaches were employed to develop predictive models and explore complex relationships among variables.
Machine learning analysis
Several machine learning algorithms were applied to predict ICU admission during follow-up. The overall workflow of the machine learning approach used for predicting ICU admission in COVID-19 patients is illustrated in Figure 1.
Figure 1. Overview of the machine learning framework for predicting intensive care unit admission in COVID-19 patients.
Data preprocessing
Categorical variables with two categories were encoded using the mapping method, while variables with more than two categories were one-hot encoded. Continuous variables were standardized. Missing values were observed for the following variables: CT scores (37 cases, 9.4%), lymphocyte count (1 case, 0.25%), CRP levels (15 cases, 3.8%), procalcitonin levels (108 cases, 27.5%), ferritin levels (5 cases, 1.2%), symptom duration (45 cases, 11.4%), BMI (15 cases, 3.8%), and onset of dyspnea (4 cases, 1%).
Missing values were imputed using the Iterative Imputer method (sklearn.impute.IterativeImputer), a multivariate regression-based approach that predicts missing values for each variable based on the observed values of other variables through an iterative series of regressions. By default, the Random Forest Regressor was applied.
The dataset was then split into training (70%) and testing (30%) subsets for machine learning analyses.
Feature selection
To reduce the number of independent variables and eliminate redundancy, several steps were undertaken. First, constant, quasi-constant, and duplicate variables were removed using the feature-engine package, with thresholds set at 80%, 0.998, and default settings, respectively. Subsequently, multiple feature reduction techniques were evaluated, including:
• Sequential Feature Selection.
• Random Forest (RF).
• Recursive Random Forest.
• Recursive Feature Elimination.
• Recursive Feature Addition.
• Shuffling.
• Boruta.
• SHAP (SHapley Additive exPlanations).
• PowerSHAP.
The reduced feature sets generated by each method were subsequently used as independent variables in the machine learning models developed to predict ICU admission. Among these approaches, we prioritized those that achieved the highest predictive performance (accuracy, ROC-AUC, precision, recall, and F1-score) while using the fewest variables. Based on this assessment, the SHAP-based feature set demonstrated the best balance of accuracy and parsimony; therefore, the predictors identified by SHAP were selected for use in the study. Feature selection was specifically performed using SHAP values derived from a Random Forest model trained on the full set of predictors. Features were ranked according to their mean absolute SHAP values, reflecting their relative contribution to model predictions. Incremental subsets of top-ranked features were evaluated using 5-fold cross-validation, with ROC-AUC as the primary performance metric. The subset of features yielding the highest cross-validated ROC-AUC was selected as the optimal predictor set and used for the development of all subsequent machine learning models. This approach ensured that only the most informative features, as determined by the Random Forest model, were included in the predictive analyses.
Using the SHAP-selected features, machine learning models were constructed to predict ICU admission in COVID-19 patients with mild respiratory failure. Model performance was evaluated using accuracy, ROC-AUC, precision, recall, and F1-score as the primary metrics.
Model development, evaluation, and explainability
Machine learning algorithms were implemented in Python to predict ICU admission in COVID-19 patients with mild respiratory failure. Binary classification models were developed using several algorithms, including Extreme Gradient Boosting (XGBoost), linear Support Vector Machine (SVM), radial basis function Support Vector Machine (RBF-SVM), Multilayer Perceptron (MLP), k-Nearest Neighbors (KNN), RF, Decision Tree, Logistic Regression (LR), and Naïve Bayes.
Hyperparameter optimization was performed using GridSearchCV.
Model performance was evaluated using multiple metrics, including accuracy, ROC-AUC, precision, recall, and F1-score. These metrics provided a comprehensive assessment of model performance and guided the selection of the best-performing algorithm.
Given the relatively small sample size and the limited number of ICU admissions, we applied a bootstrapping approach to obtain optimism-corrected estimates of model performance. A total of 500 bootstrap resamples were drawn with replacement from the original dataset. For each resample, the machine learning models were trained and then evaluated on the original dataset to compute performance metrics, including ROC-AUC, accuracy, precision, recall, and F1-score. The optimism-corrected performance for each model was calculated by subtracting the average difference between the performance on the bootstrap samples and that on the original dataset. This procedure provides more robust estimates of model performance and reduces the risk of overfitting, particularly in datasets with a limited number of positive outcomes.
To assess the impact of missing data, particularly for variables with substantial missingness such as procalcitonin (27.5%), a sensitivity analysis was performed. Models were re-trained and evaluated after excluding cases with missing procalcitonin values. Performance metrics, including ROC-AUC, accuracy, precision, recall, and F1-score, were then compared with the results obtained using imputed data. This approach allowed us to evaluate the robustness of our models and the potential influence of imputation on the study findings.
Variable importance
The relative importance of variables for predicting ICU admission in COVID-19 patients with mild respiratory failure was assessed using SHAP (SHapley Additive exPlanations) values. This approach enabled a model-agnostic evaluation of each predictor’s contribution, facilitating interpretation and the identification of the most influential clinical, laboratory, and imaging variables. Having established the machine learning framework and selected the most informative features, we next present the results of our analyses in the study cohort.
Results
We first describe the demographic, clinical, and laboratory characteristics of the patients included in the study.
A total of 392 patients were included in the study, comprising 249 males (63.5%) and 143 females (36.5%), with a mean age of 55 ± 15.3 years. During follow-up, 80 patients (20.4%) were admitted to the ICU, 46 (11.7%) underwent endotracheal intubation, and 47 (12%) died.
Table 1 summarizes the baseline characteristics of patients with and without ICU admission. Among patients requiring ICU care, the CCI, serum CRP, procalcitonin, ferritin, and D-dimer levels, neutrophil counts, age, and the composite CT score (PCA-derived) were higher, whereas lymphocyte counts, the proportion of patients with high body temperature, and the time from symptom onset to dyspnea were lower.
Table 1. Comparison of demographic characteristics, clinical features, laboratory parameters, and computed tomography scores between COVID-19 patients with and without intensive care unit admission during hospitalization.
Of these patients, 320 (81.6%) received corticosteroid therapy for respiratory failure. The treatment regimen consisted of methylprednisolone administered at 80 mg/day for the first 3 days, 40 mg/day for the subsequent 6 days, and 20 mg/day for the final 3 days. Additionally, 301 patients (76.8%) received 250 mg of pulse steroid therapy for 3 days prior to this regimen.
Violin plots of the selected variables in patient groups admitted to and not admitted to the ICU are shown in Figures 2, 3.
Figure 2. Violin plots illustrating the distribution of selected variables among patients admitted to the intensive care unit (ICU) and those who were not.
Figure 3. Violin plots illustrating the distribution of selected variables among patients admitted to the intensive care unit (ICU) and those who were not.
When comparing patients who received pulse steroid therapy with those who did not, multivariable logistic regression analysis revealed that the pulse steroid group had lower lymphocyte counts, a higher proportion of former smokers compared with never-smokers, and a higher prevalence of fever (Supplementary Table 1). However, no significant differences were observed between the two groups in variables that could potentially influence the decision to administer pulse steroids, including the composite CT score (PCA-derived), serum CRP, procalcitonin, ferritin, D-dimer, neutrophil count, immunosuppression status, and CCI. These findings suggest that clinicians were unlikely to base the decision to initiate pulse steroid therapy solely on lymphocyte count, smoking status, or body temperature, indicating that treatment bias due to pulse steroid use is unlikely.
Building upon these baseline clinical findings, we next performed a comprehensive evaluation of feature selection methods to optimize model performance and assess potential variable interactions.
To determine the most informative feature set, we systematically compared multiple feature selection algorithms, assessing both linear and non-linear interactions among predictors. Sequential Feature Selection substantially reduced model performance, decreasing the ROC-AUC from 0.69 (all features) to 0.61 with the selected subset. LASSO LR did not retain any variables at the tested regularization level, indicating the absence of strong linear predictors under this method. RF provided only a ranking of variable importance without producing a stable subset.
Although RF-based recursive elimination identified five variables that achieved a ROC-AUC of 0.75 internally, performance markedly deteriorated when these features were incorporated into the original machine learning models. Tree-based Recursive Feature Elimination selected 15 variables and yielded a ROC-AUC of 0.57, compared with 0.58 using all features, whereas LR-based Recursive Feature Elimination selected 12 variables and produced identical discrimination (0.60 with selected variables vs. 0.60 with all variables). Permutation-based shuffling using tree models selected six features but showed a decline from ROC-AUC 0.65 (all features) to 0.50 (selected). Similarly, LR-based shuffling selected 11 variables but reduced the ROC-AUC from 0.60 to 0.58 compared with the full-variable model (0.60 with all variables vs. 0.58 with the reduced set).
Recursive feature addition showed limited utility: for tree-based models, the ROC-AUC remained unchanged at 0.58 (0.58 with all variables vs. 0.58 with the reduced set), whereas LR-based addition selected 10 variables and yielded improved performance compared with the full-variable model (0.60 with all variables vs. 0.65 with the reduced set). Boruta identified only two variables, and PowerSHAP selected a small subset as well, but both approaches resulted in poor performance when applied to the original models. All of the above methods produced low evaluation metrics when applied to the original models.
In contrast, the SHAP-selected features consistently maintained or improved model performance, demonstrating superior stability and generalizability. These results suggest that, among the approaches tested to capture potential interactions and non-linear patterns, SHAP-based selection was the most robust and methodologically appropriate strategy for this dataset.
Next, we assessed the predictive performance of the machine learning models in the cohort, considering multiple evaluation metrics.
Several machine learning algorithms were applied to predict ICU admission in COVID-19 patients with mild respiratory failure. Their performances are summarized in Table 2 and illustrated in Figure 4.
Table 2. Performance metrics (accuracy, area under the receiver operating characteristic curve [ROC-AUC], precision, recall, and F1-score) and tuning parameters of machine learning algorithms used to predict intensive care unit admission.
Figure 4. Performance of the machine learning models in predicting intensive care unit admission is shown in panel (A) using accuracy. Panel (B) shows the area under the ROC curve (ROC-AUC) for each model, and panel (C) presents the receiver operating characteristic (ROC) curves of the models.
Considering the ROC-AUC values, all models except KNN, RF, Decision Tree, and Naïve Bayes demonstrated good discriminatory ability, with the MLP and LR achieving the highest ROC-AUC values (0.75 and 0.73, respectively).
The Naïve Bayes, KNN, RBF-SVM, MLP, RF, and XGBoost models achieved favorable accuracies ranging from 0.76 to 0.79. In contrast, LR, Linear SVM, and Decision Tree exhibited relatively lower accuracies (0.68, 0.66, and 0.67, respectively).
When both accuracy and ROC-AUC values were considered together, the RBF-SVM, MLP, and XGBoost models demonstrated the best overall performance (ROC-AUC = 0.71, 0.75, and 0.70; accuracy = 0.79, 0.79, and 0.79, respectively).
Notably, the MLP and RBF-SVM models achieved perfect recall (1.0), indicating an excellent ability to identify all patients requiring intensive care, although with moderate precision. Other models showed comparatively lower precision, recall, and F1-scores.
The bootstrapped performance metrics of all machine learning models are summarized in Supplementary Table 2. In this study of 392 COVID-19 patients, including 80 (20.4%) ICU admissions, we evaluated 15 clinical and laboratory variables using multiple machine learning algorithms. Overall, the models demonstrated moderate predictive performance, with optimism-corrected ROC-AUCs ranging from 0.63 to 0.68 and accuracies ranging from 0.63 to 0.80. These results indicate that the selected variables could be useful for early ICU risk stratification, while acknowledging the limitations imposed by the small sample size and low event rate.
A sensitivity analysis was performed to evaluate the impact of excluding procalcitonin, a variable with substantial missing data (27.5%), on model performance (Supplementary Tables 3, 4). Overall, most models demonstrated largely consistent ROC-AUC and accuracy values, suggesting that imputation did not substantially bias the results. However, some models, particularly tree-based approaches (Decision Tree, RF, XGBoost) and Naïve Bayes, exhibited more pronounced changes in performance metrics.
To better understand the contributions of individual variables to model predictions, we conducted SHAP analysis on the trained models.
SHAP analysis indicated that the key contributors to the model’s predictions included lower lymphocyte counts, higher neutrophil counts, advanced age, and elevated levels of ferritin, procalcitonin, D-dimer, and CRP. Other important predictors were higher BMI, increased CCI, higher CT scores, delayed hospital admission after symptom onset, shorter time from symptom onset to dyspnea, greater total cigarette consumption, not being a current smoker, and absence of fever (Figure 5).
Figure 5. Ranked importance of variables for predicting intensive care unit admission in COVID-19 patients with mild respiratory failure. (A) SHAP summary plot showing the impact of each variable on the model output. Each dot represents a single patient. The horizontal position of a dot reflects the effect of that variable on the predicted risk of ICU admission (positive values increase risk, negative values decrease risk). The color of each dot represents the actual feature value for that patient (red = high value, blue = low value). Variables are ranked from top to bottom according to their overall importance in the model. (B) Bar plot of mean absolute SHAP values, showing the average contribution of each variable to the model’s predictions, allowing comparison of variable importance across the cohort.
Discussion
This study demonstrates that among the evaluated machine learning approaches, the MLP, XGB, and RBF-SVM achieved the highest overall predictive performance, with ROC-AUC values between 0.70 and 0.75 and an accuracy of 0.79. The findings further indicate that the most influential determinants of ICU admission were a low lymphocyte count on the first day, advanced age, elevated ferritin levels, higher BMI, increased CCI, and greater CT severity.
Machine learning-based models can outperform traditional statistical approaches in predicting adverse outcomes in COVID-19, supporting the potential value of ML-driven early risk stratification (33). This evidence also provides the rationale for our use of machine learning methods in the present study.
The highest ROC-AUC was achieved by the MLP model (0.75), likely due to its ability to capture non-linear relationships and interactions between variables (34). Most other models showed similar moderate performance (ROC-AUC 0.70–0.71), while KNN, RF, Naïve Bayes, and Decision Tree performed less well (ROC-AUC 0.68, 0.63, 0.68, and 0.41, respectively). The lower performance of these models may be attributed to limitations in handling complex patterns, feature correlations, or class imbalance, as previously reported (35–38).
The highest accuracy values were achieved by the Naïve Bayes, KNN, RBF-SVM, MLP, XGBoost, and RF models (76%–79%), whereas LR, Linear SVM, and Decision Tree models showed relatively lower values (66%–68%). LR and Linear SVM may have underfit due to their linear decision boundaries, while the limited structure of the Decision Tree likely reduced its accuracy (35, 39, 40).
Non-linear and/or ensemble-based methods, including MLP, XGBoost, and RBF-SVM, demonstrated consistently high and balanced performance in terms of both ROC-AUC and accuracy by effectively capturing complex patterns in the data (34, 41–43). These results suggest that, given the characteristics of the dataset and the sample size, non-linear and ensemble-based approaches may be more suitable for achieving accurate class separation.
Our ML models demonstrated reasonable discrimination for ICU admission, with ROC-AUC and accuracy indicating good apparent performance. Bootstrapped, optimism-corrected estimates were slightly lower, reflecting the limited dataset, but still suggest that selected clinical and laboratory variables can aid early ICU risk stratification, emphasizing the need for validation in larger or external cohorts.
Sensitivity analysis excluding procalcitonin (27.5% missing) showed that most models, including LR and MLP, maintained similar ROC-AUC and accuracy, suggesting minimal bias from imputation. Tree-based models (Decision Tree, RF, XGBoost) and Naïve Bayes were more affected, indicating that missing data impact varies by model type.
Our ML models showed moderate performance (max ROC-AUC 0.75, accuracy 0.79) in a relatively homogeneous cohort with mild respiratory failure, where clinical and laboratory differences between patients who deteriorate and those who do not are subtle, inherently limiting model discrimination. Despite this, the models captured meaningful signals for early risk stratification while complementing clinical judgment. Although broadly comparable to previous studies (ROC-AUC 0.67–0.99), direct comparisons are limited by differences in endpoints, predictors, sample sizes, and patient severity (1–20).
Most prior studies included patients with and without respiratory failure and used direct indicators like oxygen saturation, dyspnea, and respiratory rate, making severe cases easier to identify (1–20). In contrast, our study focused on ward-managed patients with mild respiratory failure, excluding both very severe cases and patients without respiratory failure, and did not include strong prognostic markers. Despite this, our models achieved ROC-AUC and accuracy comparable to previous studies, demonstrating robustness in a more homogeneous population (1–20).
Table 3 summarizes well-established COVID-19 risk scores alongside our model performance (23–27). Overall, our models showed accuracy and ROC-AUC comparable to most clinical scores, except COVID-GRAM and CALL, which performed slightly better. The narrower severity spectrum of our mild respiratory failure cohort likely explains this difference. Nevertheless, using routinely available clinical, laboratory, and radiological data, our models provide useful early risk stratification even in a relatively homogeneous population.
Table 3. Comparison of published COVID-19 severity scores and the machine learning models developed in this study for predicting ICU admission and clinical deterioration.
SHAP analysis identified the strongest predictors of poor prognosis in our cohort, with low lymphocyte count emerging as the most important. Lymphopenia in COVID-19 results from mechanisms such as TNF-α–mediated apoptosis, increased peripheral consumption, and ACE2- or CD147-related cytopathic effects (44–47). Elevated neutrophils may further contribute through cytotoxic activity (47, 48). Consistent with prior studies, our findings showed that lower lymphocyte counts at admission were strongly associated with progression to severe respiratory failure (48, 49).
Literature shows that COVID-19 patients with higher CT scores are more likely to develop severe disease and poor outcomes, including death, mechanical ventilation, and ICU admission (21, 22, 50–54). Our findings align with these reports.
Excessive activation of the innate immune system and cytokine release—particularly IL-6—play a central role in severe COVID-19 (55, 56). This hyperinflammatory state drives neutrophilia, promotes hypercoagulability reflected by elevated D-dimer, and induces macrophage activation that markedly increases ferritin levels (56–62). CRP likewise rises through IL-6–mediated acute-phase signaling (55, 63). Consistent with this shared pathophysiological pathway, our study found that higher neutrophil counts, D-dimer, ferritin, and CRP levels at admission were all associated with worse clinical outcomes, including respiratory deterioration and increased need for intensive care.
Consistent with the literature, elevated procalcitonin, a marker of bacterial infection and tissue hypoxia, was associated with a higher risk of respiratory deterioration at admission in our cohort (64, 65).
Previous studies have shown that higher Charlson Comorbidity Index (CCI) scores, advanced age, and increased body mass index (BMI) are associated with greater COVID-19 severity (66–68), and our findings were consistent with these observations.
Another important predictor identified was the short interval between symptom onset and the development of dyspnea, indicating that earlier onset of dyspnea substantially increases the risk of adverse outcomes. To our knowledge, this association has not been previously reported, representing a novel finding of our study.
Previous studies show that COVID-19 patients who smoke are more likely to require intensive care (69). Smoking induces structural lung changes and alters immune responses, increasing infection severity and mortality (70, 71). In our cohort, current smokers appeared less likely to be admitted to the ICU; however, this is likely due to the very small number of current smokers (n = 19). Notably, higher cumulative cigarette exposure among current and former smokers was associated with increased ICU admission, consistent with prior reports.
Previous studies have shown that corticosteroids reduce mortality in patients with COVID-19, although they do not appear to decrease the need for ICU admission (29, 72). In line with these findings, corticosteroid use in our study was not associated with ICU admission.
Previous studies have shown that mRNA vaccination reduces the risk of severe respiratory failure and COVID-19-related mortality (73–77). In our study, vaccination was not significantly associated with ICU admission, likely reflecting the characteristics of our cohort, which included only patients already presenting with respiratory failure—a stage beyond the primary protective effect of vaccination.
Our study has several limitations. It was retrospective and single-center, which may limit generalizability, though our Istanbul-based hospital serves a diverse population. The sample size was relatively small (392 patients, 80 ICU admissions), yielding an events-per-variable ratio of approximately 5, below the recommended threshold for robust modeling; bootstrapping was used to assess model stability. Despite these limitations, the use of machine learning provides an objective, data-driven approach to early ICU risk prediction. Larger, multicenter studies are needed to validate our findings and further clarify factors influencing severe respiratory failure in COVID-19.
Conclusion
Machine learning algorithms effectively predicted the risk of ICU admission in COVID-19 patients presenting with mild respiratory failure. The most influential predictors of adverse outcomes were low lymphocyte counts, advanced age, elevated ferritin levels, high CCI, and increased CT scores.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Ethics Committee for Non-Interventional Clinical Research, Istanbul Medipol University, Republic of Turkey. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin because this was a retrospective, non-interventional study using anonymized patient data collected from medical records. The Ethics Committee approved the study protocol and waived the need for individual consent in accordance with national regulations and institutional policies.
Author contributions
BC: Visualization, Resources, Writing – original draft, Formal analysis, Project administration, Methodology, Data curation, Supervision, Investigation, Validation, Conceptualization, Writing – review & editing. ŞC: Writing – review & editing, Methodology, Data curation. OO: Writing – review & editing, Methodology, Data curation. BK: Writing – review & editing, Resources, Data curation, Methodology. MA: Resources, Writing – review & editing, Methodology, Data curation. MŞ: Writing – review & editing, Visualization, Data curation. MM: Writing – review & editing, Data curation. SA: Data curation, Writing – review & editing. EY: Data curation, Writing – review & editing. RD: Writing – review & editing, Data curation. KM: Writing – review & editing, Data curation. AK: Writing – review & editing, Data curation. MD: Data curation, Writing – review & editing. US: Writing – review & editing, Data curation. ED: Writing – review & editing, Data curation. İB: Data curation, Writing – review & editing. AM: Writing – review & editing.
Funding
The author(s) declared that financial support was not received for this work and/or its publication.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that Generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2026.1724947/full#supplementary-material
References
1. Jimenez-Solem, E, Petersen, TS, Hansen, C, Hansen, C, Lioma, C, Igel, C, et al. Developing and validating COVID-19 adverse outcome risk prediction models from a bi-national European cohort of 5594 patients. Sci Rep. (2021) 11:3246. doi: 10.1038/s41598-021-81844-x,
2. Li, X, Ge, P, Zhu, J, Li, H, Graham, J, Singer, A, et al. Deep learning prediction of likelihood of ICU admission and mortality in COVID-19 patients using clinical variables. PeerJ. (2020) 8:e10337. doi: 10.7717/peerj.10337,
3. Kim, HJ, Han, D, Kim, JH, Kim, D, Ha, B, Seog, W, et al. An easy-to-use machine learning model to predict the prognosis of patients with COVID-19: retrospective cohort study. J Med Internet Res. (2020) 22:e24225. doi: 10.2196/24225,
4. Churpek, MM, Gupta, S, Spicer, AB, Hayek, SS, Srivastava, A, Chan, L, et al. Machine learning prediction of death in critically ill patients with coronavirus disease 2019. Crit Care Explor. (2021) 3:e0515. doi: 10.1097/CCE.0000000000000515,
5. Aznar-Gimeno, R, Esteban, LM, Labata-Lezaun, G, Del-Hoyo-Alonso, R, Abadia-Gallego, D, Paño-Pardo, JR, et al. A clinical decision web to predict ICU admission or death for patients hospitalised with COVID-19 using machine learning algorithms. Int J Environ Res Public Health. (2021) 18:8677. doi: 10.3390/ijerph18168677,
6. Cavallaro, M, Moiz, H, Keeling, MJ, and McCarthy, ND. Contrasting factors associated with COVID-19-related ICU admission and death outcomes in hospitalised patients by means of Shapley values. PLoS Comput Biol. (2021) 17:e1009121. doi: 10.1371/journal.pcbi.1009121,
7. Yu, L, Halalau, A, Dalal, B, Abbas, AE, Ivascu, F, Amin, M, et al. Machine learning methods to predict mechanical ventilation and mortality in patients with COVID-19. PLoS One. (2021) 16:e0249285. doi: 10.1371/journal.pone.0249285,
8. Hou, W, Zhao, Z, Chen, A, Li, H, and Duong, TQ. Machine learning predicts the need for escalated care and mortality in COVID-19 patients from clinical variables. Int J Med Sci. (2021) 18:1739–45. doi: 10.7150/ijms.51235,
9. Chi, S, Guo, A, Heard, K, Kim, S, Foraker, R, White, P, et al. Development and structure of an accurate machine learning algorithm to predict inpatient mortality and hospice outcomes in the coronavirus disease 2019 era. Med Care. (2022) 60:381–6. doi: 10.1097/MLR.0000000000001699,
10. Greco, M, Angelotti, G, Caruso, PF, Zanella, A, Stomeo, N, Costantini, E, et al. Outcome prediction during an ICU surge using a purely data-driven approach: a supervised machine learning case-study in critically ill patients from COVID-19 Lombardy outbreak. Int J Med Inform. (2022) 164:104807. doi: 10.1016/j.ijmedinf.2022.104807,
11. He, F, Page, JH, Weinberg, KR, and Mishra, A. The development and validation of simplified machine learning algorithms to predict prognosis of hospitalized patients with COVID-19: multicenter, retrospective study. J Med Internet Res. (2022) 24:e31549. doi: 10.2196/31549,
12. Kamran, F, Tang, S, Otles, E, McEvoy, DS, Saleh, SN, Gong, J, et al. Early identification of patients admitted to hospital for covid-19 at risk of clinical deterioration: model development and multisite external validation study. BMJ. (2022) 376:e068576. doi: 10.1136/bmj-2021-068576,
13. König, S, Pellissier, V, Leiner, J, Hohenstein, S, Ueberham, L, Meier-Hellmann, A, et al. Expected and observed in-hospital mortality in heart failure patients before and during the COVID-19 pandemic: introduction of the machine learning-based standardized mortality ratio at Helios hospitals. Clin Cardiol. (2022) 45:75–82. doi: 10.1002/clc.23762,
14. Kumar, N, Qi, SA, Kuan, LH, Sun, W, Zhang, J, and Greiner, R. Learning accurate personalized survival models for predicting hospital discharge and mortality of COVID-19 patients. Sci Rep. (2022) 12:4472. doi: 10.1038/s41598-022-08601-6,
15. Moulaei, K, Shanbehzadeh, M, Mohammadi-Taghiabad, Z, and Kazemi-Arpanahi, H. Comparing machine learning algorithms for predicting COVID-19 mortality. BMC Med Inform Decis Mak. (2022) 22:2. doi: 10.1186/s12911-021-01742-0,
16. Reina Reina, A, Barrera, JM, Valdivieso, B, Gas, ME, Maté, A, and Trujillo, JC. Machine learning model from a Spanish cohort for prediction of SARS-COV-2 mortality risk and critical patients. Sci Rep. (2022) 12:5723. doi: 10.1038/s41598-022-09613-y,
17. Sridhar, AR, Chen Amber, ZH, Mayfield, JJ, Fohner, AE, Arvanitis, P, Atkinson, S, et al. Identifying risk of adverse outcomes in COVID-19 patients via artificial intelligence-powered analysis of 12-lead intake electrocardiogram. Cardiovasc Digit Health J. (2022) 3:62–74. doi: 10.1016/j.cvdhj.2021.12.003,
18. Zhang, K, Karanth, S, Patel, B, Murphy, R, and Jiang, X. A multi-task Gaussian process self-attention neural network for real-time prediction of the need for mechanical ventilators in COVID-19 patients. J Biomed Inform. (2022) 130:104079. doi: 10.1016/j.jbi.2022.104079,
19. Dipaola, F, Gatti, M, Giaj Levra, A, Menè, R, Shiffer, D, Faccincani, R, et al. Multimodal deep learning for COVID-19 prognosis prediction in the emergency department: a bi-centric study. Sci Rep. (2023) 13:10868. doi: 10.1038/s41598-023-37512-3,
20. Chadaga, K, Prabhu, S, Sampathila, N, Chadaga, R, Umakanth, S, Bhat, D, et al. Explainable artificial intelligence approaches for COVID-19 prognosis prediction using clinical markers. Sci Rep. (2024) 14:1783. doi: 10.1038/s41598-024-52428-2,
21. Lou, X, Gao, C, Wu, L, Wu, T, He, L, Shen, J, et al. Prediction of short-term progression of COVID-19 pneumonia based on chest CT artificial intelligence: during the omicron epidemic. BMC Infect Dis. (2024) 24:595. doi: 10.1186/s12879-024-09504-9,
22. Milenkovic, A, Nikolic, S, Elek, Z, Aritonovic Pribakovic, J, Ilic, A, Bulatovic, K, et al. Significance of initial chest CT severity score (CTSS) and patient characteristics in predicting outcomes in hospitalized COVID-19 patients: a single center study. Viruses. (2024) 16:1683. doi: 10.3390/v16111683,
23. Knight, SR, Ho, A, Pius, R, Buchan, I, Carson, G, Drake, TM, et al. Risk stratification of patients admitted to hospital with covid-19 using the ISARIC WHO clinical characterisation protocol: development and validation of the 4C mortality score. BMJ. (2020) 370:m3339. doi: 10.1136/bmj.m3339, Erratum in: BMJ. 2020 Nov 13;371:m4334. doi: 10.1136/bmj.m4334. 32907855; PMCID: PMC7116472
24. Ji, D, Zhang, D, Xu, J, Chen, Z, Yang, T, Zhao, P, et al. Prediction for progression risk in patients with COVID-19 pneumonia: the CALL score. Clin Infect Dis. (2020) 71:1393–9. doi: 10.1093/cid/ciaa414,
25. Liang, W, Liang, H, Ou, L, Chen, B, Chen, A, Li, C, et al. Development and validation of a clinical risk score to predict the occurrence of critical illness in hospitalized patients with COVID-19. JAMA Intern Med. (2020) 180:1081–9. doi: 10.1001/jamainternmed.2020.2033,
26. Haimovich, AD, Ravindra, NG, Stoytchev, S, Young, HP, Wilson, FP, van Dijk, D, et al. Development and validation of the quick COVID-19 severity index: a prognostic tool for early clinical decompensation. Ann Emerg Med. (2020) 76:442–53. doi: 10.1016/j.annemergmed.2020.07.022,
27. Chua, F, Vancheeswaran, R, Draper, A, Vaghela, T, Knight, M, Mogal, R, et al. Early prognostication of COVID-19 to guide hospitalisation versus outpatient monitoring using a point-of-test risk prediction score. Thorax. (2021) 76:696–703. doi: 10.1136/thoraxjnl-2020-216425,
28. Ceylan, B, Olmuşçelik, O, Karaalioğlu, B, Ceylan, Ş, Şahin, M, Aydın, S, et al. Predicting severe respiratory failure in patients with COVID-19: a machine learning approach. J Clin Med. (2024) 13:7386. doi: 10.3390/jcm13237386,
29. RECOVERY Collaborative Group Horby, P, Lim, WS, Emberson, JR, Mafham, M, Bell, JL, et al. Dexamethasone in hospitalized patients with COVID-19. N Engl J Med. (2021) 384:693–704. doi: 10.1056/NEJMoa2021436,
30. Li, G, Chen, D, Gao, F, Huang, W, Wang, J, Li, Y, et al. Efficacy of corticosteroids in patients with acute respiratory distress syndrome: a meta-analysis. Ann Med. (2024) 56:2381086. doi: 10.1080/07853890.2024.2381086,
31. Charlson, ME, Pompei, P, Ales, KL, and MacKenzie, CR. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. (1987) 40:373–83. doi: 10.1016/0021-9681(87)90171-8,
32. Elmokadem, AH, Mounir, AM, Ramadan, ZA, Elsedeiq, M, and Saleh, GA. Comparison of chest CT severity scoring systems for COVID-19. Eur Radiol. (2022) 32:3501–12. doi: 10.1007/s00330-021-08432-5,
33. Ong, WJD, How, CH, Chong, WHK, Khan, FA, Ngiam, KY, and Kansal, A. Outcome prediction for adult mechanically ventilated patients using machine learning models and comparison with conventional statistical methods: a single-centre retrospective study. Intelligence-Based Med. (2024) 10:100165. doi: 10.1016/j.ibmed.2024.100165
34. LeCun, Y, Bengio, Y, and Hinton, G. Deep learning. Nature. (2015) 521:436–44. doi: 10.1038/nature14539
35. Zorman, M, Stiglic, MM, Kokol, P, and Malcić, I. The limitations of decision trees and automatic learning in real world medical decision making. J Med Syst. (1997) 21:403–15. doi: 10.1023/a:1022876330390,
36. Domingos, P, and Pazzani, M. On the optimality of the simple Bayesian classifier under zero-one loss. Mach Learn. (1997) 29:103–30. doi: 10.1023/A:1007413511361
37. Lin, DY, Gu, Y, Xu, Y, Wheeler, B, Young, H, Holloway, S, et al. Association of primary and booster vaccination and prior infection with SARS-CoV-2 infection and severe COVID-19 outcomes. JAMA. (2022) 328:1415. doi: 10.1001/jama.2022.17876
38. Pagan, M, Zarlis, M, and Candra, A. Investigating the impact of data scaling on the k-nearest neighbor algorithm. Int J Adv Intell Inform. (2021) 7:28–36. doi: 10.11591/csit.v4i2.p135-142
39. Cortes, C, and Vapnik, V. Support-vector networks. Mach Learn. (1995) 20:273–97. doi: 10.1007/BF00994018
40. Ma, J, Dhiman, P, Qi, C, Bullock, G, van Smeden, M, Riley, RD, et al. Poor handling of continuous predictors in clinical prediction models using logistic regression: a systematic review. J Clin Epidemiol. (2023) 161:140–51. doi: 10.1016/j.jclinepi.2023.07.017,
42. Natekin, A, and Knoll, A. Gradient boosting machines, a tutorial. Front Neurorobot. (2013) 7:21. doi: 10.3389/fnbot.2013.00021.,
43. Song, S, Zhan, Z, Long, Z, Zhang, J, and Yao, L. Comparative study of SVM methods combined with voxel selection for object category classification on fMRI data. PLoS One. (2011) 6:e17191. doi: 10.1371/journal.pone.0017191,
44. Lowery, SA, Sariol, A, and Perlman, S. Innate immune and inflammatory responses to SARS-CoV-2: implications for COVID-19. Cell Host Microbe. (2021) 29:1052–62. doi: 10.1016/j.chom.2021.05.004,
45. Liao, YC, Liang, WG, Chen, FW, Hsu, JH, Yang, JJ, and Chang, MS. IL-19 induces production of IL-6 and TNF-alpha and results in cell apoptosis through TNF-alpha. J Immunol. (2002) 169:4288–97. doi: 10.4049/jimmunol.169.8.4288,
46. Xu, H, Zhong, L, Deng, J, Peng, J, Dan, H, Zeng, X, et al. High expression of ACE2 receptor of 2019-nCoV on the epithelial cells of oral mucosa. Int J Oral Sci. (2020) 12:8. doi: 10.1038/s41368-020-0074-x,
47. Helal, MA, Shouman, S, Abdelwaly, A, Elmehrath, AO, Essawy, M, Sayed, SM, et al. Molecular basis of the potential interaction of SARS-CoV-2 spike protein to CD147 in COVID-19 associated-lymphopenia. J Biomol Struct Dyn. (2022) 40:1109–19. doi: 10.1080/07391102.2020.1822208,
48. Petrie, HT, Klassen, LW, and Kay, HD. Inhibition of human cytotoxic T lymphocyte activity in vitro by autologous peripheral blood granulocytes. J Immunol. (1985) 134:230–4.
49. el-Hag, A, and Clark, RA. Immunosuppression by activated human neutrophils: dependence on the myeloperoxidase system. J Immunol. (1987) 139:2406–13.
50. Ghafoori, M, Hamidi, M, Modegh, RG, Aziz-Ahari, A, Heydari, N, Tavafizadeh, Z, et al. Predicting survival of Iranian COVID-19 patients infected by various variants including omicron from CT scan images and clinical data using deep neural networks. Heliyon. (2023) 9:e21965. doi: 10.1016/j.heliyon.2023.e21965,
51. Aktas, E, Seber, T, Cagir, U, Uylar Seber, T, Kamalak Guzel, D, Savranlar, A, et al. The relationship between thorax computed tomography findings and prognosis in patients diagnosed with COVID-19. Eur Rev Med Pharmacol Sci. (2023) 27:10839–44. doi: 10.26355/eurrev_202311_34450,
52. Alipour Khabir, Y, Alipour Khabir, S, Anari, H, Mohammadzadeh, B, Hoseininia, S, and Aslani, MR. Chest computed tomography severity score is a reliable predictor of mortality in patients with chronic obstructive pulmonary disease co-infected with COVID-19. Eur J Med Res. (2023) 28:346. doi: 10.1186/s40001-023-01336-8,
53. Alipour Khabir, Y, Alipour Khabir, S, Anari, H, Mohammadzadeh, B, Hoseininia, S, Aslani, MR, et al. Prediction of prognosis in patients with severe COVID-19 pneumonia using CT score by emergency physicians: a single-center retrospective study. Sci Rep. (2023) 13:4045. doi: 10.1038/s41598-023-31312-5
54. Perincek, G, Önal, C, and Omar, T. Prognostic value of chest-computed tomography in patients with COVID-19. Adv Respir Med. (2022) 90:312–22. doi: 10.3390/arm90040041,
55. Han, H, Ma, Q, Li, C, Liu, R, Zhao, L, Wang, W, et al. Profiling serum cytokines in COVID-19 patients reveals IL-6 and IL-10 are disease severity predictors. Emerg Microbes Infect. (2020) 9:1123–30. doi: 10.1080/22221751.2020.1770129,
56. Del Valle, DM, Kim-Schulze, S, Huang, HH, Beckmann, ND, Nirenberg, S, Wang, B, et al. An inflammatory cytokine signature predicts COVID-19 severity and survival. Nat Med. (2020) 26:1636–43. doi: 10.1038/s41591-020-1051-9,
57. Zahorec, R. Neutrophil-to-lymphocyte ratio: past, present and future perspectives. Bratisl Lek Listy. (2021) 122:474–88. doi: 10.4149/BLL_2021_078,
58. Zhou, F, Yu, T, Du, R, Fan, G, Liu, Y, Liu, Z, et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet. (2020) 395:1054–62. doi: 10.1016/S0140-6736(20)30566-3,
59. Varikasuvu, SR, Varshney, S, Dutt, N, Munikumar, M, Asfahan, S, Kulkarni, PP, et al. D-dimer, disease severity, and deaths (3D-study) in patients with COVID-19: a systematic review and meta-analysis of 100 studies. Sci Rep. (2021) 11:21888. doi: 10.1038/s41598-021-01462-5,
60. Mehta, P, McAuley, DF, Brown, M, Sanchez, E, Tattersall, RS, Manson, JJ, et al. COVID-19: consider cytokine storm syndromes and immunosuppression. Lancet. (2020) 395:1033–4. doi: 10.1016/S0140-6736(20)30628-0,
61. Blagoeva, V, Hodzhev, V, Uchikov, P, Dobreva-Yatseva, B, Stoyanova, R, Shterev, M, et al. Clinical course and mortality predictors in adult hospitalized patients with COVID-19 infection—a retrospective cohort study. Medicina (Kaunas). (2025) 61:579. doi: 10.3390/medicina61040579,
62. Kernan, KF, and Carcillo, JA. Hyperferritinemia and inflammation. Int Immunol. (2017) 29:401–9. doi: 10.1093/intimm/dxx031,
63. Gustine, JN, and Jones, D. Immunopathology of hyperinflammation in COVID-19. Am J Pathol. (2021) 191:4–17. doi: 10.1016/j.ajpath.2020.08.009,
64. Assicot, M, Gendrel, D, Carsin, H, Raymond, J, Guilbaud, J, and Bohuon, C. High serum procalcitonin concentrations in patients with sepsis and infection. Lancet. (1993) 341:515–8. doi: 10.1016/0140-6736(93)90277-N,
65. Engel, H, Ben Hamouda, N, Portmann, K, Delodder, F, Suys, T, Feihl, F, et al. Serum procalcitonin as a marker of post-cardiac arrest syndrome and long-term neurological recovery, but not of early-onset infections, in comatose post-anoxic patients treated with therapeutic hypothermia. Resuscitation. (2013) 84:776–81. doi: 10.1016/j.resuscitation.2013.01.029,
66. Tuty Kuswardhani, RA, Henrina, J, Pranata, R, Anthonius Lim, M, Lawrensia, S, and Suastika, K. Charlson comorbidity index and a composite of poor outcomes in COVID-19 patients: a systematic review and meta-analysis. Diabetes Metab Syndr. (2020) 14:2103–9. doi: 10.1016/j.dsx.2020.10.022,
67. Zheng, Z, Peng, F, Xu, B, Zhao, J, Liu, H, Peng, J, et al. Risk factors of critical and mortal COVID-19 cases: a systematic literature review and meta-analysis. J Infect. (2020) 81:e16–25. doi: 10.1016/j.jinf.2020.04.021,
68. Wang, Y, Hou, H, Xu, J, Wang, Y, and Yang, H. The association between obesity and ICU admission among COVID-19 patients: a meta-analysis of adjusted risk estimates. Am J Emerg Med. (2022) 56:318–20. doi: 10.1016/j.ajem.2021.08.054,
69. Griffith, NB, Baker, TB, Heiden, BT, Smock, N, Pham, G, Chen, J, et al. Cannabis, tobacco use, and COVID-19 outcomes. JAMA Netw Open. (2024) 7:e2417977. doi: 10.1001/jamanetworkopen.2024.17977, Erratum in: JAMA Netw Open. 2024 Jul 1;7(7):e2427937. doi: 0.1001/jamanetworkopen.2024.27937. 38904961; PMCID: PMC1119312
70. Arcavi, L, and Benowitz, NL. Cigarette smoking and infection. Arch Intern Med. (2004) 164:2006–16. doi: 10.1001/archinte.164.20.2006,
71. Huttunen, R, Heikkinen, T, and Syrjänen, J. Smoking and the outcome of infection. J Intern Med. (2011) 269:258–69. doi: 10.1111/j.1365-2796.2010.02332.x,
72. He, Q, Wang, C, Wang, Y, Chen, G, Zhou, Y, Wu, Y, et al. Efficacy and safety of glucocorticoids use in patients with COVID-19: a systematic review and network meta-analysis. BMC Infect Dis. (2023) 23:896. doi: 10.1186/s12879-023-08874-w,
73. Xu, S, Huang, R, Sy, LS, Glenn, SC, Ryan, DS, Morrissette, K, et al. COVID-19 vaccination and non-COVID-19 mortality risk - seven integrated health care organizations, United States, December 14, 2020-July 31, 2021. MMWR Morb Mortal Wkly Rep. (2021) 70:1520. doi: 10.15585/mmwr.mm7043e2,
74. Thomas, SJ, Moreira, ED Jr, Kitchin, N, Absalon, J, Gurtman, A, Lockhart, S, et al. Safety and efficacy of the BNT162b2 mRNA Covid-19 vaccine through 6 months. N Engl J Med. (2021) 385:1761. doi: 10.1056/nejmoa2110345,
75. El Sahly, HM, Baden, LR, Essink, B, Doblecki-Lewis, S, Martin, JM, Anderson, EJ, et al. Efficacy of the mRNA-1273 SARS-CoV-2 vaccine at completion of blinded phase. N Engl J Med. (2021) 385:1774. doi: 10.1056/NEJMoa2113017,
76. Dunkle, LM, Kotloff, KL, Gay, CL, Áñez, G, Adelglass, JM, Barrat Hernández, AQ, et al. Efficacy and safety of NVX-CoV2373 in adults in the United States and Mexico. N Engl J Med. (2022) 386:531. doi: 10.1056/NEJMoa2116185,
77. Johnson, AG, Linde, L, Payne, AB, Patel, P, Davis, JP, Lacy, J, et al. Notes from the field: comparison of COVID-19 mortality rates among adults aged ≥65 years who were unvaccinated and those who received a bivalent booster dose within the preceding 6 months - 20 U.S. jurisdictions, September 18, 2022-April 1, 2023. MMWR Morb Mortal Wkly Rep. (2023) 72:667.
Keywords: COVID-19, intensive care unit, machine learning, prognosis, steroid
Citation: Ceylan B, Ceylan Ş, Olmuşçelik O, Karaalioğlu B, Akan M, Şahin M, Muğlu M, Aydın S, Yılmaz E, Dumlu R, Mert K, Kansu A, Düger M, Süleyman U, Demir E, Boyacı İ and Mert A (2026) Machine learning-based prediction of intensive care unit admission in COVID-19 patients presenting with mild respiratory failure. Front. Med. 13:1724947. doi: 10.3389/fmed.2026.1724947
Edited by:
Aveek Jayant, Homi Bhabha Cancer Hospital and Research Centre, IndiaReviewed by:
Burcin Halacli, Hacettepe University, TürkiyeWei Jun Dan Ong, National University Health System, Singapore
Copyright © 2026 Ceylan, Ceylan, Olmuşçelik, Karaalioğlu, Akan, Şahin, Muğlu, Aydın, Yılmaz, Dumlu, Mert, Kansu, Düger, Süleyman, Demir, Boyacı and Mert. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Bahadır Ceylan, YmNleWxhbjIwMDVAZ21haWwuY29t
‡ORCID: Bahadır Ceylan, orcid.org/0000-0001-9658-7560
†These authors have contributed equally to this work
Şule Ceylan2†