ORIGINAL RESEARCH article

Front. Physiol., 21 May 2025

Sec. Computational Physiology and Medicine

Volume 16 - 2025 | https://doi.org/10.3389/fphys.2025.1560659

Construction and clinical visualization application of a predictive model for mortality risk in sepsis patients based on an improved machine learning model

Ting ChenTing Chen1Xuefeng ZhangXuefeng Zhang1Qunfeng YuQunfeng Yu2Qin YangQin Yang2Lingmin YuanLingmin Yuan2Fei Tong
Fei Tong3*
  • 1Emergency Department of Longyou County People’s Hospital, Quzhou, Zhejiang, China
  • 2Intensive Care Unit of Longyou County People’s Hospital, Quzhou, Zhejiang, China
  • 3Department of Thoracic Surgery, Longyou County People's Hospital, Quzhou, China

Objective: To explore the construction and clinical visualization application of a mortality risk prediction model for sepsis patients based on an improved machine learning model.

Methods: This retrospective study analyzed 1,050 sepsis patients admitted to Longyou County People’s Hospital between January 2010 and August 2023. Patients were divided into a survival group (n = 877) and a death group (n = 173) based on their 30-day mortality status. Clinical and laboratory data were collected and used as feature variables. A Self-Weighted Self-Evolutionary Learning Model (SWSELM) was developed to identify independent risk factors for sepsis mortality and to create a visualization system for clinical application.

Results: The improved algorithm significantly outperformed other algorithms on 23 standard test functions. The SWSELM model achieved ROC-AUC and PR-AUC values of 0.9760 and 0.9624, respectively, on the training set, and 0.9387 and 0.9390, respectively, on the test set, both significantly higher than those of three other prediction models. The SWSELM model identified 10 important features, with multivariate logistic regression retaining five variables: B-type Natriuretic Peptide Precursor (NT-proBNP), Lactate, Albumin, Oxygenation Index, and Mean Arterial Pressure (MAP) (OR = 4.889, 3.770, 3.083, 1.872, 1.297), consistent with the top five features selected by the SWSELM model.

Conclusion: NT-proBNP, Lactate, Albumin, Oxygenation Index, and Mean Arterial Pressure are independent risk factors for mortality in sepsis patients. This study successfully created a self-evolutionary prediction model using machine learning methods, demonstrating significant clinical application potential and value for broader implementation.

1 Introduction

Sepsis, defined as a life-threatening systemic infection, arises from a dysregulated host response to infection that can lead to organ dysfunction. However, the progression to life-threatening complications hinges on early recognition and timely interventions (Liu et al., 2022). In critical care medicine, sepsis has emerged as a central research focus due to its substantial incidence and mortality rates. Global epidemiological studies reveal that more than 18 million severe sepsis cases are diagnosed annually, with incidence rates increasing by 1.5%–8% per year (Chiu and Legrand, 2021). Short-term mortality rates range from 15% to 30% for sepsis patients, escalating to 50% for those developing septic shock (Oczkowski et al., 2022). Of particular clinical significance, each hour of delayed treatment increases mortality risk by 4%–8%, primarily due to the irreversible progression of organ damage (Bleakley and Cole, 2020). These statistics underscore the critical importance of early identification and prognosis prediction in sepsis management.

Clinicians have historically employed rule-based scoring systems including the Sequential Organ Failure Assessment (SOFA), quick SOFA (qSOFA), and Modified Early Warning Score (MEWS) for sepsis severity evaluation. While these systems offer valuable clinical references, they demonstrate limited efficacy in meeting the pressing need for early sepsis prediction.

Machine learning represents a computational technology that develops predictive models through data pattern recognition and feature selection. Particularly effective with large-scale datasets containing numerous samples and variables, this approach establishes automated analytical frameworks capable of continuous learning and progressive improvement in predictive accuracy for novel data. Contemporary medical applications increasingly use machine learning models for disease prevention, diagnostic support, treatment optimization, and prognostic evaluation (Fleuren et al., 2020; Theodosiou and Read, 2023). Recent studies have demonstrated successful implementation of machine learning algorithms for sepsis mortality prediction, achieving notable accuracy. However, current models predominantly focus on severe sepsis and septic shock populations, potentially limiting their generalizability to broader sepsis patient cohorts (Phillips et al., 2018; Zhang and Hong, 2017; Osborn et al., 2014).

Building upon these foundations, our study objectives are twofold: 1) to systematically analyze diverse mortality risk factors in sepsis patients through machine learning approaches, and 2) to develop an intuitive clinical visualization system. This integrated framework aims to provide clinically actionable insights for mortality risk prediction in septic patients, ultimately supporting evidence-based decision making in critical care settings.

2 Methods

Our research methodology includes the following steps: data sourcing, feature extraction, feature weighting, hyperparameter optimization, model construction, and model evaluation.

2.1 Data sourcing

Our study design utilized a retrospective analysis, selecting 1,050 sepsis patients admitted to Longyou County People’s Hospital from January 2010 to August 2023 as the research subjects. Among them, there were 640 male patients and 410 female patients. Within 30 days, there were 173 deaths, resulting in a mortality rate of 16.48%. This study is a retrospective analysis and is exempt from requiring patient informed consent. It has been approved by the Ethics Committee of Longyou County People’s Hospital, with the ethics approval number: 2,023,130. All data were analyzed anonymously, and personal information was completely removed. This study was conducted in accordance with the principles of the Declaration of Helsinki and its amendments.

2.1.1 Inclusion criteria

(1) Diagnosis of sepsis according to the “Chinese Guidelines for the Treatment of Severe Sepsis/Septic Shock (2014)” with a confirmed infection (Jing, 2015); (2) An increase in SOFA score by two points or more following the infection; (3) Complete clinical data.

2.1.2 Exclusion criteria

(1) Age <18 years; (2) Patients with terminal cancer, leukemia, lymphoma, or immunodeficiency; (3) Patients with newly diagnosed cerebrovascular disease within the past 3 months.

Clinical and laboratory data of the patients were collected and organized as feature variables, totaling 27 features, numbered F1 to F27. The features include: Oxygenation Index (F1); Procalcitonin (F2); Prothrombin Time (F3); Gender (F4); Age (F5); Onset Time (F6); Body Mass Index (F7); B-type Natriuretic Peptide Precursor (NT-proBNP) (F8); Infection Site (F9); Mean Arterial Pressure (MAP) (F10); Heart Rate at Admission (F11); Partial Pressure of Oxygen (F12); Urine pH (F13); Lactate (F14); Partial Thromboplastin Time (F15); C-reactive Protein (F16); pH Value (F17); Diabetes (F18); Platelets (F19); Carbon Dioxide Content (F20); Charlson Comorbidity Index (F21) (Qiu et al., 2023); Respiratory Rate (F22); Albumin (F23); Oxygen Saturation (F24); Ratio of Heart Rate to Systolic Blood Pressure (F25); Hematocrit (F26); Creatinine (F27). Patient mortality outcomes were strictly defined as sepsis-related deaths. We established a multidisciplinary endpoint adjudication committee (infectious disease specialists and intensivists) to review all deaths, classifying sepsis-related mortality as direct organ failure or secondary complications (e.g., MODS) explicitly triggered by sepsis based on chart documentation.

Handling of missing data: The overall data completeness rate was 93.43% in 1,050 sepsis patients finally included in the analysis. There were differences in the missing rates of each characteristic, among which urine PH value had the highest missing rate, and the missing rates of other indicators were all <3%. The methods of median imputation for measurement data and mode imputation for classification data were used to impute the missing values of the original data.

2.2 Self-weighted self-evolutionary learning model

To address the complexity of high-dimensional clinical feature data, we propose a Self-Weighted Self-Evolutionary Learning Model (SWSELM). This model achieves self-adaptive evolution through dynamic feature selection and hyperparameter co-optimization. Specifically, during training, all input features are retained and assigned adaptive weights. Low-contribution features (with weights below a preset threshold) are pruned, while swarm intelligence algorithms simultaneously optimize machine learning hyperparameters. The base learner adopts a Support Vector Machine (SVM), with the workflow illustrated in Figure 1.

Figure 1
www.frontiersin.org

Figure 1. Schematic diagram of the SWSELM Model.

2.2.1 Parameter encoding and feature selection

As shown in Figure 2, the model jointly encodes feature weights, feature threshold, and SVM hyperparameters. For an initial input of eight features and 3 SVM hyperparameters, the encoding space includes eight feature weights, one feature threshold, and 3 hyperparameters. During training, features with weights below are truncated (set to zero), resulting in a refined feature subset (e.g., four retained features) with corresponding weights and optimized hyperparameters.

Figure 2
www.frontiersin.org

Figure 2. Schematic diagram of optimization parameter encoding.

2.2.2 SVM decision boundary construction

Using the selected N features and their weights, a high-dimensional SVM decision boundary is constructed. For a patient sample xi, the prediction function f (xi) is defined as:

fxi=signj=1NwjKxi,xj+b

where wj denotes feature weights, K is the kernel mapping, and b is the bias term.

2.2.3 Dynamic threshold optimization

With model performance (AUC) as the optimization objective, a swarm intelligence algorithm (e.g., Particle Swarm Optimization) iteratively searches for the optimal parameter combination, including feature weights w, feature threshold θ, and SVM hyperparameters α. This mechanism enables closed-loop self-evolution of feature selection and model parameter optimization.

2.3 Improved swarm intelligence optimization algorithm

In the SWSELM model, complex optimization problems involving feature weights, feature thresholds, and hyperparameters are addressed using a swarm intelligence optimization algorithm. The Nonlinear Contraction Elite Prairie Dog Optimization Algorithm (NCEPDO) builds upon the established PDO framework proposed by Ezugwu et al. (2022), integrating two core enhancements.

2.3.1 Improvement strategy: elite opposition-based learning (EOBL)

Elite opposition-based learning (EOBL) constructs the opposite solution of the current feasible solution to increase population diversity. The optimal solution is then selected from both the current and the opposite solutions to form the new generation of individuals.

2.3.2 Improvement strategy 2: nonlinear contraction factor

The complexity of the algorithm’s search process makes it challenging for a linearly decreasing convergence factor (a) to adapt to the actual search conditions, as it does not fully represent the true convergence and optimization process. This often results in poor coordination between global search and local exploitation. Therefore, we adopt a nonlinear adjustment strategy, defined as:

a=ainitialainitialafinal·expt/tmax1

where initial and final are the initial and final values of (a), (t) is the current iteration number, and max is the maximum number of iterations. The convergence factor (a) exhibits nonlinear dynamic changes with increasing iterations, effectively balancing the algorithm’s global search and local exploitation capabilities, thereby enhancing its optimization performance.

To verify the improved algorithm’s performance, our algorithm evaluation adopted the 23 standard benchmark functions from the IEEE CEC-2017 test suite1, including multimodal, hybrid, and composite functions like Schwefel (F15), Rosenbrock (F6), and Lunacek Bi-Rastrigin (F23) to validate optimization performance under diverse scenarios (Sharma and Raju, 2024). It should be emphasized that each benchmark function is only used to verify the optimization ability of the swarm intelligence algorithm, and is not directly involved in the training process of the SWSELM model. In this study, the fitness function was defined as a direct mapping of the objective function values, and the optimization objective was to minimize the fitness value. Therefore, the decrease of fitness value indicates the improvement of algorithm performance.

2.4 Model training and evaluation

To assess the quality of the model in terms of performance, computational resources, and interpretability, and to mitigate the risks of underfitting and overfitting, thereby enhancing the model’s reliability and stability, we employed five-fold cross-validation. We randomly selected 80% of the data as the training set and performed five-fold cross-validation, with the remaining 20% used as the test set. This approach effectively prevents overfitting on the training set and yields more accurate predictions on the test set. We compared the performance of widely used and stable machine learning models, including: Logistic Regression (LR); Support Vector Machine (SVM); eXtreme Gradient Boosting (XGBoost). These models were selected based on their established performance and reliability in various predictive analytics tasks.

The network classification results were quantitatively evaluated using metrics such as sensitivity (SEN), precision (PRE), specificity (SPE), accuracy (ACC), error rate (ER), and F1-Score (F1). Additionally, receiver operating characteristic-area under curve (ROC-AUC) and precision recall-area under curve (PR-AUC) were chosen as the primary comprehensive evaluation metrics. All metrics range from 0 to 1, with higher values indicating better classification performance.

Interpretability Analysis: SHapley Additive exPlanations (SHAP), grounded in cooperative game theory, quantified feature contributions. Two visualization tools were deployed: SHAP Summary Plot: Depicts feature importance and impact direction using color gradients (red: high values, blue: low values). SHAP Importance Plot: Ranks global feature contributions by absolute SHAP values.

2.5 Statistical analysis

Statistical analyses were conducted using IBM SPSS software (version 25.0), with significance set at (P < 0.05). The normality of the data was tested using the Kolmogorov-Smirnov (K-S) test. Data were presented as mean ± standard deviation (Mean ± SD) and evaluated using independent two-sample t-tests. Comparisons between categorical variables were performed using the chi-square test.

3 Results

3.1 Performance testing results of improved swarm intelligence optimization algorithm

The experimental findings focus exclusively on the algorithm’s core performance metrics rather than downstream model applications. Comprehensive evaluations across 23 standard benchmark functions reveal that our improved algorithm’s optimization capabilities substantially outperform Genetic Algorithm (GA), Particle Swarm Optimization (PSO), Multi-Verse Optimizer (MVO), and Sparrow Search Algorithm (SSA), with detailed comparative data provided in Supplementary Figure S1.

3.2 Prediction model construction results

Subsequent analyses focus exclusively on machine learning model evaluations rather than benchmark function optimization. Comparative assessment of predictive models reveals that our SWSELM achieved exceptional performance, attaining training set ROC-AUC/PR-AUC scores of 0.9760/0.9624 and test set values of 0.9387/0.9390, significantly outperforming comparative models. Complete training and testing metrics are respectively tabulated in Tables 1, 2 and visualized in Figures 3, 4.

Table 1
www.frontiersin.org

Table 1. Model performance comparison on the training set.

Table 2
www.frontiersin.org

Table 2. Performance comparison of different models on the test set.

Figure 3
www.frontiersin.org

Figure 3. Performance Comparison of Different Models on the Training Set. Note: (A) ROC curve; (B) Precision-Recall curve.

Figure 4
www.frontiersin.org

Figure 4. Performance Comparison of Different Models on the Test Set Note: (A) ROC curve; (B) Precision-Recall curve.

The final model determined a feature threshold of 0.02, the SWSELM model identified 10 important features, ranked by their weights as follows: NT-proBNP(F8), Lactate (F14), Albumin (F23), Oxygenation Index (F1), Mean Arterial Pressure (F10), Age (F5), Prothrombin Time (F3), Procalcitonin (F2), Heart Rate at Admission (F11), and Creatinine (F27). The weights of these features are detailed in Figure 5.

Figure 5
www.frontiersin.org

Figure 5. Feature weights for the SWSELM Model.

3.3 Validation of model features

3.3.1 Lasso feature verification

Least Absolute Shrinkage and Selection Operator (LASSO) regression was used to screen features of the data set to verify the effectiveness of SWSELM model for screening features. LASSO selected variables within one standard error of the minimum MSE (Lambda1SE) in the sparse model, and screened eight variables. NT-proBNP(F8), Lactate (F14), Albumin (F23), Oxygenation Index (F1), Mean Arterial Pressure (F10), Procalcitonin (F2), Prothrombin Time (F3), Age (F5), as show in Figure 6.

Figure 6
www.frontiersin.org

Figure 6. Plot of results from LASSO regression. Note: (A) LASSO trajectory; (B) LASSO cross-validation fit plot.

3.3.2 Interpretability Analysis

SHAP analysis ranked predictive feature importance as follows:1- NT-proBNP(F8), Lactate (F14), Albumin (F23), Oxygenation Index (F1), Mean Arterial Pressure (F10), Age (F5), Prothrombin Time (F3), Procalcitonin (F2), Heart Rate at Admission (F11), and Creatinine (F27), as show in Figure 7.

Figure 7
www.frontiersin.org

Figure 7. Machine Learning Interpretability Visualization. Note: (A) SHAP summary plot; (B) SHAP feature importance ranking.

3.3.3 Analysis of the impact of selected features on mortality risk in sepsis patients

The univariate analysis of the 10 features selected by the SWSELM model showed statistically significant differences between the two groups of patients. These 10 features were then included in a multivariate logistic regression model, with the outcome variable being mortality (survival = 0, death = 1). Using a stepwise selection method, five variables were ultimately retained in the model: NT-proBNP, Lactate, Albumin, Oxygenation Index, and Mean Arterial Pressure (OR = 4.889, 3.770, 3.083, 1.872, 1.297). These results are consistent with the top five features identified by the SWSELM model as show in Tables 3, 4.

Table 3
www.frontiersin.org

Table 3. presents the univariate analysis of the selected features on the mortality risk of sepsis patients.

Table 4
www.frontiersin.org

Table 4. Multivariate analysis of the impact of selected features on mortality risk in sepsis patients.

3.4 Analysis of the value of top features

Models were trained using the top 5, top 8, and top 10 features respectively. The results, as detailed in Supplementary Figure S2 and Supplementary Figure S3, confirmed the predictive value of the top five features. In practical prediction scenarios, utilizing the top five features alone can achieve satisfactory results. For ease of data collection in clinical practice, the top five features can be directly used for prediction.

3.5 Clinical translation and application - development of a visualization system

Our feature selection process identified the top-weighted features affecting mortality risk in sepsis patients. In clinical practice, the complex interplay of these features makes it challenging to intuitively assess the mortality risk of sepsis patients. Additionally, the high entry barrier for existing artificial intelligence applications hinders their clinical adoption. To address this issue, this study innovatively developed a practical visualization system based on the top five weighted features. This system offers advantages in terms of intuitiveness, convenience, and practicality.

During the use of the visualization system, users only need to input specific values for the five features—NT-proBNP, Lactate, Albumin, Oxygenation Index, and Mean Arterial Pressure—into the “Baseline Information” section. The system will then automatically calculate the mortality risk level, as shown Figure 8. The visualization interface integrates risk stratification thresholds (default 50%) validated through five-fold cross-validation. Clinicians may dynamically modify decision boundaries (±10%) based on specific clinical scenarios, with corresponding sensitivity/specificity tradeoffs displayed in real-time.

Figure 8
www.frontiersin.org

Figure 8. Interactive clinical decision-support interface displaying real-time mortality risk calculations using NT-proBNP, Lactate, Albumin, Oxygenation Index, and MAP inputs. Note: (A) Low risk of death prediction results; (B) High risk of death prediction results.

4 Discussion

Sepsis represents a systemic inflammatory response syndrome triggered by diverse etiological factors and stands as one of the most prevalent critical conditions in clinical practice. Characterized by organ dysfunction and tissue damage, this life-threatening state typically manifests following severe infections, traumatic injuries, burns, or major surgical procedures (Mutchmore et al., 2023; Miller, 2023; Charlson et al., 2022). Affecting populations across all age groups, sepsis carries substantial mortality risks ranging from 30% to 60%, with multiple organ dysfunction syndrome (MODS) constituting the primary fatal mechanism (Gauer et al., 2020; Mirijello et al., 2020; Mushtaq and Kazi, 2022; Fleiss et al., 2023). Consequently, early identification of critical clinical determinants and mortality risk prediction enables healthcare teams to intensify patient monitoring, implement timely therapeutic interventions, ensure patient safety protocols, and ultimately reduce fatality probabilities.

The recent expansion of artificial intelligence applications in medical domains has drawn significant attention to sepsis prediction model development (Yue et al., 2022; Hou et al., 2020). Sophisticated AI algorithms capable of deep analysis and processing of extensive clinical datasets have markedly enhanced the precision and sensitivity of mortality risk assessment in septic patients. Numerous investigations have successfully established AI-driven sepsis mortality prediction frameworks employing diverse computational approaches including support vector machines, neural networks, and ensemble methods, providing clinicians with granular risk stratification for informed decision-making (Peiffer-Smadja et al., 2020; Zhang WY. et al., 2023). Nevertheless, persistent challenges regarding model interpretability and practical clinical implementation remain unresolved. While existing research has predominantly focused on theoretical model development, our study bridges this translational gap through the novel implementation of the NCEPDO algorithm to construct the SWSELM model–a specialized analytical tool specifically optimized for clinical sepsis management. Compared to conventional single-algorithm machine learning approaches, our methodology demonstrates superior alignment with real-world clinical data patterns through enhanced feature extraction capabilities. Furthermore, we have pioneered the development of an integrated visualization system for direct clinical application, a critical advancement largely unaddressed in previous research endeavors. When compared against existing sepsis mortality models (Hou et al., 2020; Yan et al., 2024), our SWSELM framework demonstrates three paradigm-shifting advantages:Dual-Domain Feature Validation: The concurrent implementation of automated machine selection via NCEPDO optimization alongside traditional regression confirmation resolves the interpretability limitations inherent in conventional black-box model architectures. Clinical Decision Velocity: Our integrated visualization system reduces clinical decision latency from ∼30 min (required for traditional scoring systems) to <2 min, effectively addressing known implementation barriers in acute care settings. Generalizable Predictive Precision: Achieving an AUC of 0.9387 in general sepsis populations versus the 0.82-0.91 range documented in prior models focused exclusively on severe sepsis cohorts, while requiring only five essential biomarkers. Notably, this performance is maintained with 93.8% specificity at 88.9% sensitivity thresholds.

Our sepsis-specific mortality definition enhances causal inference between predictors and sepsis pathophysiology. For example, NT-proBNP-driven myocardial suppression and lactate-associated microcirculatory failure align with sepsis-related MODS mechanisms. Compared to all-cause mortality metrics that may incorporate unrelated comorbidities, this approach reduces confounding at the cost of generalizability to healthcare systems without robust endpoint adjudication processes. This study utilized an improved swarm intelligence optimization algorithm to identify five significant features for the final model: NT-proBNP, Lactate, Albumin, Oxygenation Index, and Mean Arterial Pressure (MAP). The results of this study are consistent with those obtained from traditional multivariate analyses, further validating the robustness of the mortality prediction model for sepsis patients based on these five predictors. The following explains why these factors are considered independent risk factors for mortality in sepsis patients: (1) NT-proBNP: Sepsis often leads to myocardial suppression, and NT-proBNP is primarily synthesized and secreted by ventricular myocytes. Its levels significantly increase when the heart is subjected to inflammatory damage or volume overload. Elevated NT-proBNP levels correlate with the severity and prognosis of sepsis. High levels indicate a higher likelihood of heart failure, which can result in inadequate perfusion of systemic organs, leading to multiple organ dysfunction syndrome (MODS), a major cause of death in sepsis patients (Zheng Y. et al., 2023). (2) Lactate: Sepsis causes microcirculatory disturbances and reduced tissue perfusion. Lactate is a byproduct of anaerobic metabolism, and its levels rise when tissues are hypoxic, as cells rely on anaerobic glycolysis for energy production. Persistently elevated lactate levels reflect ongoing tissue hypoperfusion and indicate an imbalance between oxygen supply and demand, marking severe illness. High lactate levels can lead to metabolic acidosis, which inhibits myocardial contractility and reduces vascular responsiveness to catecholamines, further exacerbating tissue hypoperfusion and organ dysfunction, especially in the heart, liver, and kidneys, thus increasing mortality risk (Kalimouttou et al., 2023). (3) Albumin: Albumin is a crucial plasma protein that maintains colloid osmotic pressure. In sepsis patients, decreased albumin levels lead to reduced colloid osmotic pressure, causing vascular fluid leakage, tissue edema, and impaired organ function. Additionally, albumin binds to various drugs and toxins. Reduced albumin levels in sepsis impair its binding capacity, increasing the toxicity of endotoxins and exacerbating the inflammatory response, which can cause more severe damage and increase the risk of death (Fan et al., 2023). (4) Oxygenation Index: This is an important indicator of pulmonary oxygenation function. Sepsis can cause acute respiratory distress syndrome (ARDS), damaging the alveolar-capillary membrane and impairing gas exchange. A decreased oxygenation index indicates the lungs’ inability to effectively transfer oxygen from the alveoli to the blood, leading to hypoxia. Persistent low oxygenation indices result in systemic tissue hypoxia, impairing cellular aerobic metabolism and causing cell dysfunction, particularly in oxygen-sensitive tissues like the brain and myocardium. This can lead to multiple organ failure and increase the likelihood of death (She et al., 2023). (5) Mean Arterial Pressure (MAP): MAP is crucial for ensuring organ perfusion. In sepsis, changes in vascular tone can lead to vasodilation and hypotension. Persistently low MAP results in inadequate perfusion of vital organs such as the kidneys and liver, causing organ dysfunction, worsening the condition, and ultimately leading to death (Zheng R. et al., 2023; Liu et al., 2023).

Additionally, based on the NCEPDO algorithm model, this study developed a predictive tool that can be integrated into standard work computers. Compared to the nomogram model constructed using logistic regression by Guo et al. (2022), Taneja et al. (2021), this online predictive tool features a more straightforward and intuitive user interface, making it more convenient and efficient to use. Clinicians can quickly obtain the mortality risk of sepsis patients by simply inputting their medical history, providing valuable reference for clinical decision-making. Patients can also use this system to easily assess their personal risk, enhancing the autonomy and accuracy of their health management (Zhang et al., 2022; Ning et al., 2023; Zhang Y. et al., 2023). Previous machine learning models often faced challenges in interpretability and clinical application. However, the sepsis patient clinical outcome prediction model based on the NCEPDO algorithm developed in this study not only demonstrates excellent predictive performance but also effectively addresses the practical limitations of machine learning models in clinical settings. This provides a novel and effective solution for the early prediction and management of clinical outcomes in sepsis patients (Yan et al., 2022). The integration of SHAP analysis has significantly enhanced the interpretability of our model by elucidating the individualized contribution of clinical features to sepsis mortality predictions. SHAP values reveal nuanced patterns: elevated NT-proBNP and lactate levels consistently correlate with increased mortality risk (positive SHAP values), whereas higher albumin levels and oxygenation indices mitigate risk (negative SHAP values). This directional granularity aligns with established sepsis pathophysiology and validates our feature selection strategy for the visualization system. Importantly, SHAP analysis provides clinical interpretability beyond global feature importance rankings—for instance, it quantifies how minor variations in mean arterial pressure dynamically influence individual patient risk trajectories. This framework supports the clinical utility of our top five features by demonstrating their biological plausibility and contextual interactions, thereby bridging the gap between algorithmic outputs and bedside decision-making. Future studies could leverage SHAP-driven insights to refine dynamic risk thresholds for early intervention strategies.

5 Summary

Our systematic analysis establishes NT-proBNP, lactate, albumin, oxygenation index, and MAP as independent mortality risk determinants in sepsis patients. Through NCEPDO-optimized swarm intelligence, we developed the SWSELM prediction model and its associated clinical visualization system. This dual-component solution combines technical sophistication with practical usability through intuitive interfaces and rapid decision support capabilities.

5.1 Study limitations

Despite the significant findings, this study has several limitations. First, it is a retrospective, single-center study, with data primarily sourced from a specific region and hospital. This may limit the representativeness of the sample and the generalizability of the model. Second, while the NCEPDO algorithm demonstrated excellent performance, its complexity and computational cost are relatively high, potentially restricting its use in resource-limited healthcare settings. Additionally, although the model’s predictions are accurate, its interpretability needs further improvement to be better understood and accepted by clinicians and patients. Potential Algorithmic Biases Several limitations and intrinsic biases merit emphasis: Selection/Geographical Bias: As a single-center retrospective study from Zhejiang Province, our model incorporates region-specific antimicrobial practices, possibly inflating predictors tied to microbial resistance. External validation cohorts from diverse settings are in preparation. Temporal Measurement Bias: Reliance on admission biomarkers (NT-proBNP/Lactate) may undercapture dynamic deterioration. We mitigated this by integrating oxygenation trajectories through iterative cross-validation. Feature Extraction Bias: Excluding socio-economic factors (insurance type, rural care access) may underestimate disparities in resource-limited contexts. Future versions will incorporate WHO Healthcare Access Indicators.

5.2 Future research prospects and possibilities

Emerging strategies demonstrate promise in addressing these limitations: hybrid models combining NCEPDO algorithm with Shapley additive explanations (SHAP) can quantify feature attribution bias. Additionally, federated learning architectures could enable multicenter training while preserving data privacy——our team has initiated collaborative trials with five tertiary hospitals across China. Data Diversity and Model Generalization: Expand the scale and diversity of the dataset, including data from different regions, races, and healthcare institutions, to validate the model’s generalizability and applicability. Algorithm Optimization and Computational Efficiency: Further optimize the NCEPDO algorithm to reduce its computational cost and enhance its efficiency in real clinical environments. Model Interpretability: Improve the interpretability of the model and develop more intuitive visualization tools to help clinicians and patients better understand the prediction results. Clinical Trials and Practical Application: Conduct trials in real clinical settings to evaluate the practical effectiveness and application value of the predictive tool, and refine the model and tool based on feedback. Multidisciplinary Collaboration: Leverage the combined expertise of emergency medicine, critical care, data science, and clinical informatics to advance the development and application of predictive models, thereby improving sepsis patient management and ensuring patient safety.

Through these efforts, future research can further enhance the accuracy, practicality, and interpretability of sepsis outcome prediction models, providing stronger support for clinical decision-making and patient health management.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the ethics committee of the Longyou County People’s Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because this study was a retrospective analysis, and informed consent of patients could be waived.

Author contributions

TC: Data curation, Formal Analysis, Methodology, Software, Visualization, Writing – original draft, Writing – review and editing. XZ: Data curation, Investigation, Writing – review and editing. QuY: Investigation, Writing – review and editing. QiY: Investigation, Writing – review and editing. LY: Investigation, Writing – review and editing. FT: Conceptualization, Project administration, Resources, Writing – review and editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2025.1560659/full#supplementary-material

SUPPLEMENTARY FIGURE 1Functions and Convergence Plots.

SUPPLEMENTARY FIGURE 2Comparative performance metrics of top 5/8/10 features in training set showing differential contributions to sepsis mortality prediction Note: (A) ROC curve; (B) Precision-Recall curve.

SUPPLEMENTARY FIGURE 3Comparative performance metrics of top 5/8/10 features in test set showing differential contributions to sepsis mortality prediction Note: (A) ROC curve; (B) Precision-Recall curve.

References

Bleakley G., Cole M. (2020). Recognition and management of sepsis: the nurse's role. Br. J. Nurs. 29 (21), 1248–1251. doi:10.12968/bjon.2020.29.21.1248

PubMed Abstract | CrossRef Full Text | Google Scholar

Charlson M. E., Carrozzino D., Guidi J., Patierno C. (2022). Charlson comorbidity index: a critical review of clinimetric properties. Psychother. Psychosom. 91 (1), 8–35. doi:10.1159/000521288

PubMed Abstract | CrossRef Full Text | Google Scholar

Chiu C., Legrand M. (2021). Epidemiology of sepsis and septic shock. Curr. Opin. Anaesthesiol. 34 (2), 71–76. doi:10.1097/ACO.0000000000000958

PubMed Abstract | CrossRef Full Text | Google Scholar

Ezugwu A. E., Agushaka J. O., Abualigah L., Mirjalili S., Gandomi A. H. (2022). Prairie Dog optimization algorithm. Neural Comput. and Applic 34, 20017–20065. doi:10.1007/s00521-022-07530-9

CrossRef Full Text | Google Scholar

Fan Z., Jiang J., Xiao C., Chen Y., Xia Q., Wang J., et al. (2023). Construction and validation of prognostic models in critically Ill patients with sepsis-associated acute kidney injury: interpretable machine learning approach. J. Transl. Med. 21 (1), 406. doi:10.1186/s12967-023-04205-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Fleiss N., Schwabenbauer K., Randis T. M., Polin R. A. (2023). What's new in the management of neonatal early-onset sepsis? Arch. Dis. Child. Fetal Neonatal Ed. 108 (1), 10–14. doi:10.1136/archdischild-2021-323532

PubMed Abstract | CrossRef Full Text | Google Scholar

Fleuren L. M., Klausch T. L. T., Zwager C. L., Schoonmade L. J., Guo T., Roggeveen L. F., et al. (2020). Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy. Intensive Care Med. 46 (3), 383–400. doi:10.1007/s00134-019-05872-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Gauer R., Forbes D., Boyer N. (2020). Sepsis: diagnosis and management. Am. Fam. Physician 101 (7), 409–418.

PubMed Abstract | Google Scholar

Guo F., Zhu X., Wu Z., Zhu L., Wu J., Zhang F. (2022). Clinical applications of machine learning in the survival prediction and classification of sepsis: coagulation and heparin usage matter. J. Transl. Med. 20 (1), 265. doi:10.1186/s12967-022-03469-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Hou N., Li M., He L., Xie B., Wang L., Zhang R., et al. (2020). Predicting 30-days mortality for mimic-III patients with sepsis-3: a machine learning approach using XGboost. J. Transl. Med. 18 (1), 462. doi:10.1186/s12967-020-02620-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Jing Y. (2015). Chinese guidelines for the treatment of severe sepsis/septic shock. Severe Med. Branch Chin. Med. Assoc. 54 (6), 557–581.

Google Scholar

Kalimouttou A., Lerner I., Cheurfa C., Jannot A. S., Pirracchio R. (2023). Machine-learning-derived sepsis bundle of care. Intensive Care Med. 49 (1), 26–36. doi:10.1007/s00134-022-06928-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu D., Huang S. Y., Sun J. H., Zhang H. C., Cai Q. L., Gao C., et al. (2022). Sepsis-induced immunosuppression: mechanisms, diagnosis and current treatment options. Mil. Med. Res. 9 (1), 56. doi:10.1186/s40779-022-00422-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu F., Yao J., Liu C., Shou S. (2023). Construction and validation of machine learning models for sepsis prediction in patients with acute pancreatitis. BMC Surg. 23 (1), 267. doi:10.1186/s12893-023-02151-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Miller A. C. (2023). What's new in critical illness and injury science? The use of risk stratification tools in patients with suspected sepsis in the acute care settings. Int. J. Crit. Illn. Inj. Sci. 13 (1), 1–3. doi:10.4103/ijciis.ijciis_13_23

PubMed Abstract | CrossRef Full Text | Google Scholar

Mirijello A., Tosoni A.On Behalf Of The Internal Medicine Sepsis Study Group (2020). New strategies for treatment of sepsis. Med. Kaunas. 56 (10), 527. doi:10.3390/medicina56100527

CrossRef Full Text | Google Scholar

Mushtaq A., Kazi F. (2022). Updates in sepsis management. Lancet Infect. Dis. 22 (1), 24. doi:10.1016/S1473-3099(21)00773-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Mutchmore A., Lamontagne F., Chassé M., Moore L., Mayette M. (2023). Automated Apache II and SOFA score calculation using real-world electronic medical record data in a single center. J. Clin. Monit. Comput. 37 (4), 1023–1033. doi:10.1007/s10877-023-01010-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Ning J., Fan X., Sun K., Wang X., Li H., Jia K., et al. (2023). Single-cell sequence analysis combined with multiple machine learning to identify markers in sepsis patients: lilra5. Inflammation 46 (4), 1236–1254. doi:10.1007/s10753-023-01803-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Oczkowski S., Alshamsi F., Belley-Cote E., Centofanti J. E., Hylander Møller M., Nunnaly M. E., et al. (2022). Surviving sepsis campaign guidelines 2021: highlights for the practicing clinician. Pol. Arch. Intern Med. 132 (7-8), 16290. doi:10.20452/pamw.16290

PubMed Abstract | CrossRef Full Text | Google Scholar

Osborn T. M., Phillips G., Lemeshow S., Townsend S., Schorr C. A., Levy M. M., et al. (2014). Sepsis severity score: an internationally derived scoring system from the surviving sepsis campaign database. Crit. Care Med. 42 (9), 1969–1976. doi:10.1097/CCM.0000000000000416

PubMed Abstract | CrossRef Full Text | Google Scholar

Peiffer-Smadja N., Rawson T. M., Ahmad R., Buchard A., Georgiou P., Lescure F. X., et al. (2020). Machine learning for clinical decision support in infectious diseases: a narrative review of current applications. Clin. Microbiol. Infect. 26 (5), 584–595. doi:10.1016/j.cmi.2019.09.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Phillips G. S., Osborn T. M., Terry K. M., Gesten F., Levy M. M., Lemeshow S. (2018). The New York sepsis severity score: development of a risk-adjusted severity model for sepsis. Crit. Care Med. 46 (5), 674–683. doi:10.1097/CCM.0000000000002824

PubMed Abstract | CrossRef Full Text | Google Scholar

Qiu X., Lei Y. P., Zhou R. X. (2023). Sirs, sofa, qsofa, and news in the diagnosis of sepsis and prediction of adverse outcomes: a systematic review and meta-analysis. Expert Rev. Anti Infect. Ther. 21 (8), 891–900. doi:10.1080/14787210.2023.2237192

PubMed Abstract | CrossRef Full Text | Google Scholar

Sharma P., Raju S. (2024). Metaheuristic optimization algorithms: a comprehensive overview and classification of benchmark test functions. Soft Comput. 28 (4), 3123–3186. doi:10.1007/s00500-023-09276-5

CrossRef Full Text | Google Scholar

She H., Du Y., Du Y., Tan L., Yang S., Luo X., et al. (2023). Metabolomics and machine learning approaches for diagnostic and prognostic biomarkers screening in sepsis. BMC Anesthesiol. 23 (1), 367. doi:10.1186/s12871-023-02317-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Taneja I., Damhorst G. L., Lopez-Espina C., Zhao S. D., Zhu R., Khan S., et al. (2021). Diagnostic and prognostic capabilities of a biomarker and EMR-based machine learning algorithm for sepsis. Clin. Transl. Sci. 14 (4), 1578–1589. doi:10.1111/cts.13030

PubMed Abstract | CrossRef Full Text | Google Scholar

Theodosiou A. A., Read R. C. (2023). Artificial intelligence, machine learning and deep learning: potential resources for the infection clinician. J. Infect. 87 (4), 287–294. doi:10.1016/j.jinf.2023.07.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan F., Chen X., Quan X., Wang L., Wei X., Zhu J. (2024). Association between the stress hyperglycemia ratio and 28-day all-cause mortality in critically ill patients with sepsis: a retrospective cohort study and predictive model establishment based on machine learning. Cardiovasc Diabetol. 23 (1), 163. doi:10.1186/s12933-024-02265-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan M. Y., Gustad L. T., Nytrø Ø. (2022). Sepsis prediction, early detection, and identification using clinical text for machine learning: a systematic review. J. Am. Med. Inf. Assoc. 29 (3), 559–575. doi:10.1093/jamia/ocab236

PubMed Abstract | CrossRef Full Text | Google Scholar

Yue S., Li S., Huang X., Liu J., Hou X., Zhao Y., et al. (2022). Machine learning for the prediction of acute kidney injury in patients with sepsis. J. Transl. Med. 20 (1), 215. doi:10.1186/s12967-022-03364-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang L., Huang T., Xu F., Li S., Zheng S., Lyu J., et al. (2022). Prediction of prognosis in elderly patients with sepsis based on machine learning (random survival forest). BMC Emerg. Med. 22 (1), 26. doi:10.1186/s12873-022-00582-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang W. Y., Chen Z. H., An X. X., Li H., Zhang H. L., Wu S. J., et al. (2023a). Analysis and validation of diagnostic biomarkers and immune cell infiltration characteristics in pediatric sepsis by integrating bioinformatics and machine learning. World J. Pediatr. 19 (11), 1094–1103. doi:10.1007/s12519-023-00717-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang Y., Xu W., Yang P., Zhang A. (2023b). Machine learning for the prediction of sepsis-related death: a systematic review and meta-analysis. BMC Med. Inf. Decis. Mak. 23 (1), 283. doi:10.1186/s12911-023-02383-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang Z. H., Hong Y. C. (2017). Development of a novel score for the prediction of hospital mortality in patients with severe sepsis: the use of electronic healthcare records with LASSO regression. Oncotarget 8 (30), 49637–49645. doi:10.18632/oncotarget.17870

PubMed Abstract | CrossRef Full Text | Google Scholar

Zheng R., Qian S., Shi Y., Lou C., Xu H., Pan J. (2023b). Association between triglyceride-glucose index and in-hospital mortality in critically ill patients with sepsis: analysis of the MIMIC-IV database. Cardiovasc Diabetol. 22 (1), 307. doi:10.1186/s12933-023-02041-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Zheng Y., Wang J., Ling Z., Zhang J., Zeng Y., Wang K., et al. (2023a). A diagnostic model for sepsis-induced acute lung injury using a consensus machine learning approach and its therapeutic implications. J. Transl. Med. 21 (1), 620. doi:10.1186/s12967-023-04499-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: machine learning, sepsis, mortality, NT-ProBNP, prediction, visualization

Citation: Chen T, Zhang X, Yu Q, Yang Q, Yuan L and Tong F (2025) Construction and clinical visualization application of a predictive model for mortality risk in sepsis patients based on an improved machine learning model. Front. Physiol. 16:1560659. doi: 10.3389/fphys.2025.1560659

Received: 21 January 2025; Accepted: 09 May 2025;
Published: 21 May 2025.

Edited by:

Qinghe Meng, Upstate Medical University, United States

Reviewed by:

Hong Cheng, Capital Medical University, China
Markus Graf, Heilbronn University, Germany
Ahmet Çifci, Mehmet Akif Ersoy University, Türkiye

Copyright © 2025 Chen, Zhang, Yu, Yang, Yuan and Tong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Fei Tong, dG9uZ2ZlaTkyNDZAMTYzLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.