- 1Department of Critical Care Medicine, Tenth Affiliated Hospital of Guangxi Medical University, Qinzhou, China
- 2Department of Emergency, First Affiliated Hospital of Guangxi Medical University, Nanning, China
Background: Sepsis is a life-threatening condition with high mortality, necessitating early risk stratification. This study aimed to develop and validate a predictive nomogram for 28-day mortality in sepsis patients incorporating machine learning-selected biomarkers.
Methods: A total of 1,350 sepsis patients were retrospectively enrolled and divided into training (n = 944) and testing (n = 406) sets. LASSO and Random Forest (RF) algorithms were applied to screen key biomarkers associated with mortality. A logistic regression model was constructed using the selected features, and a nomogram was developed by integrating these biomarkers with APACHE II, SOFA score, and shock status. Model performance was evaluated by AUC, calibration, and decision curve analysis (DCA). External validation was performed in an independent cohort of 120 patients.
Results: Six biomarkers consistently selected by both LASSO and RF were: procalcitonin (PCT), prognostic nutritional index (PNI), red blood cell count (RBC), platelet count (PLT), alanine aminotransferase (ALT), and indirect bilirubin (IBIL). Non-survivors exhibited significantly higher levels of PCT, ALT, and IBIL, and lower levels of RBC, PLT, and PNI compared to survivors (all P < 0.05). The logistic regression model demonstrated strong discrimination [AUC: 0.841 (95% CI: 0.814–0.868) in training set; 0.808 (95% CI: 0.769–0.847) in testing set]. The nomogram showed good calibration and favorable net clinical benefit across a wide range of threshold probabilities. In the external validation cohort, the model maintained excellent predictive performance with an AUC of 0.921 (95% CI: 0.876–0.966).
Conclusion: We developed and validated a clinically useful nomogram incorporating nutrition-related biomarkers, particularly PNI, for predicting 28-day mortality in sepsis patients. The model demonstrates robust performance and highlights the importance of nutritional status in sepsis outcomes.
Introduction
Sepsis, defined as a life-threatening organ dysfunction caused by a dysregulated host response to infection, remains a leading cause of morbidity and mortality in intensive care unit (ICU) worldwide (1). Despite advances in critical care, mortality rates remain high, particularly among patients with delayed recognition or suboptimal management. Early risk stratification is therefore essential to guide timely interventions, allocate resources efficiently, and improve patient outcomes. The Acute Physiology and Chronic Health Evaluation II (APACHE II) and Sequential Organ Failure Assessment (SOFA) scores are widely used for prognostication in sepsis. However, while these tools effectively capture physiological derangements and organ dysfunction, they may not fully reflect the underlying metabolic, immunological, and nutritional disturbances that profoundly influence host resilience and recovery (2). Emerging evidence underscores the critical interplay between inflammation, immune dysfunction, and malnutrition in the pathophysiology of sepsis. This can lead to catabolic exhaustion, impaired pathogen clearance, and increased susceptibility to secondary infections and multi-organ failure (3). In this context, the prognostic nutritional index (PNI) has gained recognition as a composite marker integrating nutritional status and immune function. Low PNI reflects both protein-energy malnutrition and lymphopenia-induced immunosuppression, both of which are independently associated with poor outcomes in critically ill patients (4, 5).
Notably, PNI has been shown to predict mortality in sepsis, yet it is often underutilized in routine clinical risk assessment models (6, 7). Recent advances in machine learning (ML) algorithms offer powerful tools for identifying complex, non-linear patterns in high-dimensional clinical data, enabling more accurate prediction of outcomes than traditional statistical methods (8). By combining ML-based feature selection with interpretable regression modeling, it is possible to develop robust predictive tools that balance accuracy with clinical usability (9, 10). However, few existing models incorporate nutrition-sensitive biomarkers, and even fewer have been translated into clinically intuitive nomograms, graphical calculators that allow bedside risk estimation, while undergoing rigorous external validation (11, 12). Therefore, this study aimed to develop and validate a novel, nutrition-informed predictive nomogram for 28-day mortality in adult sepsis patients. We integrated machine learning algorithms (LASSO and random forest) to identify key laboratory biomarkers, combined them with established severity scores (APACHE II, SOFA) and shock status, and validated the model both internally and externally.
Materials and methods
Study subjects
This was a retrospective cohort study conducted at two tertiary care centers: the First Affiliated Hospital and the Tenth Affiliated Hospital of Guangxi Medical University, from January 2024 to April 2025. The study protocol was reviewed and approved by the Institutional Review Board, and all procedures were conducted in accordance with the ethical principles outlined in the Declaration of Helsinki. Informed consent was waived due to the retrospective nature of the study.
Inclusion and exclusion criteria
Patients were eligible for inclusion if they: (1) met the Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3) criteria (13), defined as life-threatening organ dysfunction caused by a dysregulated host response to infection, with an increase in SOFA score ≥ 2 points; and (2) were aged 18 years or older. Exclusion criteria included: (1) pregnancy or lactation; (2) ICU length of stay < 24 h; (3) presence of advanced malignant disease or other terminal illnesses (life expectancy < 3 months); and (4) incomplete clinical or laboratory data that could not be reliably imputed.
Collection of main clinical features
The clinical features were collected within 24 h of ICU admission. Collected variables included: demographic and clinical characteristics: age, sex, comorbidities (including diabetes, chronic kidney disease, cardiovascular disease), site of infection (including pulmonary, abdominal, urinary). Complete blood count (including white blood cell count, hemoglobin, platelets, lymphocytes), liver function tests (albumin, total protein, ALT, AST, bilirubin, cholinestersase), renal function (creatinine, BUN), and nutritional indicators (albumin or prealbumin). Severity of illness scores: APACHE II and SOFA scores, calculated based on the first 24 h of ICU admission. Interventions: use of mechanical ventilation, vasopressors, renal replacement therapy, and surgical interventions. The primary outcome was 28-day all-cause mortality, defined as death from any cause within 28 days of sepsis diagnosis. Calculation of NLR, PLR and PNI as follow: NLR = absolute neutrophil count/absolute lymphocyte count, PLR= absolute platelet count/lymphocyte count, PNI = 10 × serum albumin value + 0.005 × lymphocyte count (14). All data were cross-checked by two individuals, and missing values were imputed using multiple imputation methods. Missing data were handled using multiple imputation by chained equations (MICE) with 10 imputations, assuming missing at random (MAR), and pooled estimates were used in subsequent analyses.
Biomarker selection using machine learning
To identify the most predictive laboratory biomarkers for 28-day mortality, we applied two machine learning algorithms: least absolute shrinkage and selection operator (LASSO) regression with binomial logistic loss function, using 10-fold cross-validation to select the optimal lambda (λ) that minimized the deviance. Random Forest (RF) with 500 trees, using the Gini index for variable importance ranking. The RF analysis was conducted with the following hyperparameters: number of estimators = 500, max_features = √p (where p is the number of input features), minimum samples per leaf (min_samples_leaf) = 5, and bootstrap sampling enabled. Hyperparameters were selected based on default settings after preliminary tuning showed minimal impact on feature ranking stability. The top 10 most important features were extracted based on mean decrease in accuracy. Variables consistently selected by both LASSO and RF were considered robust predictors and included in the subsequent multivariable logistic regression model. All machine learning analyses for biomarker selection were performed conducted using R software with the “glmnet” package and “randomForest” package.
Nomogram model construction and evaluation
A multivariable logistic regression model was constructed using the selected biomarkers, along with established clinical predictors: APACHE II score, SOFA score, and presence of shock (defined as vasopressor requirement). The final model was visualized as a nomogram using the rms package (version 6.7-0) in R software (version 4.2.0). Model performance was evaluated in both internal (70% training cohort) and external (30% testing cohort) validation sets, with an independent external validation cohort from the Tenth Affiliated Hospital used to assess generalizability.
Discrimination was assessed using the area under the receiver operating characteristic curve (AUC), calculated with the pROC package. Calibration was evaluated using the calibration plot (observed vs. predicted probability), Brier score (lower values indicate better calibration), and Hosmer-Lemeshow goodness-of-fit test (P > 0.05 indicates good fit). Clinical utility was assessed via decision curve analysis (DCA), which evaluates net benefit across a range of threshold probabilities.
Statistical methods
All statistical analyses were performed using R software (version 4.2.0). Categorical variables were presented as frequency (percentage) and compared using the chi-square (χ2) test or Fisher's exact test, as appropriate. Continuous variables were tested for normality using the Shapiro-Wilk test. Normally distributed data were expressed as mean ± standard deviation (SD) and compared using the independent samples t-test. Non-normally distributed data were presented as median [interquartile range, IQR: P25–P75] and compared using the Mann–Whitney U test. Comparison of predictive accuracy between different ROC was using Delong's test. A two-sided P value < 0.05 was considered statistically significant.
Results
Clinical characteristics of survivors and non-survivors
A total of 1,350 sepsis patients were included in this study, of whom 168 (12.4%) died during hospitalization and 1,182 (87.6%) survived. As shown in Table 1, non-survivors were significantly older than survivors (median age: 65 vs. 60 years, P < 0.05). Although the proportion of males was higher in the survivor group (821/1182, 69.5%) compared to the non-survivor group (124/168, 73.8%), no significant difference in sex distribution was observed between groups (P > 0.05). There were no significant differences in body mass index (BMI), history of smoking, or alcohol consumption between the two groups (P > 0.05). Non-survivors exhibited a significantly higher prevalence of shock, atrial fibrillation, and heart failure (P < 0.05 for all). With regard to treatment modalities, non-survivors were more likely to require ICU admission, mechanical ventilation, and vasopressor support, whereas corticosteroid use was less frequent in this group (P < 0.05). Non-survivors also had significantly higher APACHE II and SOFA scores compared to survivors (P < 0.001). No significant differences were observed in the distribution of infection sources, including pulmonary, urinary, hepatic, bloodstream, and other sites, between the two groups (P > 0.05). Laboratory analyses revealed that non-survivors had significantly higher levels of most routine blood tests, liver function markers, and renal function indicators, except for albumin, globulin, prealbumin, and hemoglobin, which were markedly lower in non-survivors (P < 0.05). Additionally, the NLR was significantly elevated in non-survivors, while the PLR and PNI were significantly reduced (P < 0.05). These findings collectively suggest that non-survivors exhibited a state of heightened systemic inflammation and severe nutritional impairment.
Screening of key prognostic biomarkers using machine learning algorithms
Patients were randomly divided into a training set (n = 944) and a testing set (n = 406) at a 7:3 ratio. Two machine learning algorithms (LASSO and RF) were applied to screen 23 laboratory biomarkers for their association with mortality in the training cohort. LASSO regression identified 14 biomarkers with significant predictive value for mortality (Figures 1A–C). Similarly, RF analysis highlighted 10 top-ranking features associated with death (Figures 1D, E). By intersecting the results from both methods, six biomarkers were consistently selected, including PNI, platelet count (PLT), procalcitonin (PCT), alanine aminotransferase (ALT), red blood cell count (RBC), and indirect bilirubin (IBIL). These six markers were thus retained for subsequent model development. The SHAP summary plot illustrated the direction and magnitude of each biomarker contribution to the prediction outcome. SHAP summary plot showed that, the PNI, PLT, and RBC exerted the significantly negatively influence on the predicted mortality risk, with the mean |SHAP| value as 0.15; conversely, the PCT, ALT, and IBIL contributed positively to risk prediction, with the mean |SHAP| value as 0.12 (Figure 1F).
Figure 1. Screening of biomarkers associated with death using machine learning algorithms. (A) Selection of the penalty parameter λ using ten-fold cross validation. Figure shows two λ selection criteria in LASSO regression; (B) coefficient path of LASSO regression. Each colored line represents the coefficient trajectory of an independent variable as the penalty parameter λ changes; (C) ROC for the predictive value of the biomarkers on the death of sepsis patients by LASSO algorithms; (D) ROC for the predictive value of the biomarkers on the death of sepsis patients by RF algorithms; (E) venn plot for the common biomarkers from LASSO algorithms and RF algorithms; (F) SHAP summary plot for the biomarkers associated with the death of sepsis patients. The features are ranked based on the impact of their SHAP values on the outcome, with higher combined values indicating a higher risk of death within 28 days. Pink signifies high feature values, blue represents values close to the overall mean, and blue indicates low feature values.
Development and performance of the predictive model
A logistic regression model was constructed using the six consensus biomarkers. Univariate logistic regression confirmed that each of the six biomarkers was significantly associated with mortality (all P < 0.05). Multivariate logistic regression further demonstrated that all six biomarkers were independent predictors of death in sepsis patients (Table 2). The resulting prediction model showed strong discriminative ability, with an AUC of 0.815 (95% CI: 0.712–0.870) in the training set and 0.838 (95% CI: 0.765–0.851) in the testing set (Figure 2). Using the Youden index, the optimal cut-off probability for predicting mortality was determined as 0.186. The full model equation and corresponding coefficients are detailed in Table 3.
Figure 2. Predictive value of the model on the death of sepsis patients by logistic regression analysis. (A) ROC and AUC value in Training set; (B) ROC and AUC value in Testing set.
Nomogram construction and validation
To enhance clinical applicability, we developed a nomogram integrating the six-biomarker model with established clinical severity scores (APACHE II and SOFA) as well as the presence of shock, all well-recognized predictors of sepsis mortality (Figure 3A). Points assigned to each variable are summed to estimate the patient's probability of death. The nomogram demonstrated excellent discrimination, with a bootstrap-corrected C-index of 0.887 (95% CI: 0.863–0.911). Calibration analysis revealed good agreement between predicted and observed mortality probabilities (Figure 3B), supported by a low Brier score of 0.087. The Hosmer–Lemeshow test confirmed adequate model calibration (P = 0.624). DCA showed a positive net benefit across a wide range of threshold probabilities (Figure 3C), indicating robust clinical utility.
Figure 3. Construction of nomogram for the predictive model. (A) Nomogram for the predictive model and clinical parameters. Each independent predictor of Nomogram was mapped to a “points” value at the top of the nomogram, yielding a score within the range of 0–100. The total score was then used to accurately predict the outcome of sepsis patients, with higher scores indicating a higher risk of death; (B) calibration curve for the nomogram was used to evaluate the consistency between model predicted probabilities and actual observed probabilities; (C) decision curve analysis (DCA) assess the net benefit of the nomogram across different high-risk thresholds. ***p < 0.001.
External validation of the predictive model
To evaluate generalizability, the model was externally validated in an independent cohort of 120 sepsis patients from a different medical center, including 60 non-survivors and 60 survivors. The baseline characteristics are provided in Supplementary Table S1. Consistent with the primary cohort, non-survivors in the external set exhibited significantly higher levels of PCT, ALT, and IBIL, and lower levels of RBC, PLT, and PNI (P < 0.05; Figure 4A). When applied to this external cohort, the six-biomarker model achieved an AUC of 0.823 (95% CI: 0.758–0.907), and the predictive accuracy was better than APACHE II score (AUC: 0.683, 95% CI: 0.564–0.758) and SOFA score (AUC: 0.790, 95% CI: 0.731–0.886) (Figure 4B), indicating high predictive accuracy and strong cross-center reproducibility. Furthermore, the Delong's test showed that the predictive accuracy of the model was better than APACHE II score (P = 0.003), SOFA score (P = 0.035) and PNI (P = 0.016).
Figure 4. Validation of the predictive model by external cohort. (A) Comparison of the value of the six parameters in external cohort; (B) predictive value of the APACHE II score, SOFA score, and the predictive model. *P > 0.05, ** P > 0.01, *** P > 0.001, **** P > 0.0001.
Discussion
This study successfully developed and validated a clinically practical nomogram to predict mortality in sepsis patients by integrating a machine learning method to select laboratory biomarkers with established clinical severity scores. Among the 1,350 patients analyzed, non-survivors exhibited significantly higher disease severity as reflected by elevated APACHE II and SOFA scores, greater organ dysfunction, and more frequent ICU interventions. Additionally, pronounced inflammatory activation and malnutrition were observed in non-survivors. Using LASSO and RF algorithms on the training cohort, six biomarkers (PNI, PLT, PCT, ALT, RBC, and IBIL) were consistently identified as key predictors of mortality. The logistic regression model incorporating these markers achieved strong discriminative ability, with an AUC of 0.841 in the training set and 0.808 in the testing set. The final nomogram, further adjusted for APACHE II, SOFA, and shock status, demonstrated excellent calibration (Brier score: 0.087, Hosmer–Lemeshow P = 0.624) and a high C-index of 0.887. Importantly, external validation in an independent cohort of 120 patients confirmed exceptional predictive accuracy, highlighting the model robustness and generalizability.
The identified biomarkers (PNI, PLT, PCT, ALT, RBC, and IBIL) collectively reflect a convergence of nutritional status, systemic inflammation, organ dysfunction, and immune dysregulation, all critical determinants of mortality in sepsis. Previous studies have reported the association of PNI with the prognosis of sepsis, where lower PNI values indicate poor outcomes (15, 16). Thrombocytopenia (low PLT) and anemia (low RBC) suggest bone marrow suppression, microvascular consumption, and inflammatory hemolysis, all linked to poor prognosis (17, 18). Elevated PCT is a well-established marker of bacterial infection and sepsis severity, reflecting the magnitude of the inflammatory cascade (19, 20). Increased ALT and IBIL levels suggest hepatocellular stress and impaired bilirubin conjugation, often due to hypoperfusion or direct inflammatory injury. Lower IBIL levels have also been associated with poor outcomes in ICU patients with sepsis (21). Together, these biomarkers form a profile indicative of systemic inflammatory response compounded by metabolic and nutritional compromise.
Notably, the PNI emerged as a critical predictor. PNI integrates nutritional status and immune function, two interdependent factors frequently compromised in critically ill patients (22). Low PNI reflects malnutrition and immunosuppression, impairing the body's ability to combat infection and recover from organ injury (23). In this study, lower PNI was significantly associated with higher mortality, underscoring the detrimental impact of pre-existing or acute-on-chronic malnutrition in sepsis. Malnutrition leads to muscle wasting, weakened respiratory function, impaired wound healing, and diminished immune response, increasing susceptibility to secondary infections and organ failure (24). Furthermore, lymphopenia, a key component of PNI, is linked to T-cell exhaustion and immune paralysis in sepsis, contributing to persistent inflammation and poor outcomes (25). The consistent selection of PNI by both LASSO and random forest algorithms highlights its robust prognostic value. Given that nutritional interventions can modulate PNI, it may not only serve as a predictive biomarker but also as a potential target for early nutritional support, emphasizing the importance of integrating nutritional assessment into sepsis management protocols to improve survival.
This study offers significant clinical value by providing a readily applicable nomogram for early risk stratification of sepsis patients, integrating easily accessible laboratory markers, including the nutrition-immune index PNI. The model enables clinicians to identify high-risk patients promptly, facilitating timely interventions such as intensified monitoring, optimized antimicrobial therapy, and early nutritional support. The strong predictive performance, validated externally, supports its potential for use in diverse clinical settings. Notably, the inclusion of PNI underscores the critical role of nutritional status in sepsis outcomes, reinforcing the need for routine nutritional assessment and intervention in critical care. This aligns with the growing emphasis on personalized nutrition management. The nomogram's user-friendly format allows for quick risk estimation at the bedside, aiding clinical decision-making and potentially improving patient outcomes.
Despite previous studies (6, 7) have reported several models show good predictive ability on the death of sepsis, this study has several strengths. First, it employed machine learning algorithms to identify key biomarkers, enhancing feature selection reliability and reducing overfitting. The integration of these biomarkers into a logistic regression model and a clinically intuitive nomogram improves practical applicability. Second, the model demonstrated strong discriminative performance and excellent calibration in both internal and external validation cohorts, indicating high generalizability across different clinical settings. Third, the inclusion of PNI, a composite marker of nutritional and immune status, highlights the importance of nutritional factors in sepsis outcomes. The nomogram also incorporates established severity scores (APACHE II, SOFA) and shock status, increasing its clinical relevance. Importantly, our model is only presented in the form of a paper-based nomogram, which allows bedside risk estimation using routinely available data. Future work will aim to translate this tool into a digital format, potentially as a web-based calculator or an integrated module within the hospital information system, and therefore facilitate automated risk scoring at the point of care.
However, some limitations should be acknowledged. First, this study is retrospective and two-center in design, which may introduce selection bias. Although external validation was performed, the validation cohort was relatively small, and multi-center prospective studies are needed to further confirm the model reliability. Second, the dynamic changes in biomarker levels over time were not assessed, potentially limiting the model's ability to capture disease progression. Third, although our external validation cohort (n = 120) provides initial evidence of transportability, its modest size and single-center nature limit the precision of performance estimates and may not capture geographic or practice-related heterogeneity. Future multi-center prospective validation across diverse healthcare settings is essential before clinical deployment. Fourth, regarding the number of sepsis patients in external validation cohort, 60 (50%) patients died within 28 days. This balanced distribution was not artificially constructed; it reflects the high disease severity and mortality burden at this tertiary referral center during the study period. While such balance enhances statistical power for discrimination analysis, it may overestimate real-world prevalence and limit generalizability to lower-mortality settings. Finally, while nutritional status was partially evaluated through PNI, more comprehensive nutritional assessments (such as body composition, dietary intake) were not included, suggesting a direction for future refinement.
Conclusion
This study developed and validated a robust predictive nomogram for 28-day mortality in sepsis patients by integrating key laboratory biomarkers identified through machine learning algorithms. The model demonstrated excellent discriminative accuracy, calibration, and clinical utility in both internal and external validation, highlighting its reliability and generalizability. Our results emphasize the integration of nutritional assessment into early sepsis management, supporting the paradigm shift toward precision nutrition in critical care.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Ethics statement
The studies involving humans were approved by Ethics Committee of the Tenth Affiliated Hospital of Guangxi Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants' legal guardians/next of kin.
Author contributions
Y-lY: Data curation, Formal analysis, Writing – original draft. Y-sH: Methodology, Formal analysis, Writing – original draft. Y-lB: Resources, Formal analysis, Writing – review & editing, Writing – original draft. X-xH: Methodology, Writing – original draft, Formal analysis. Z-yF: Funding acquisition, Writing – review & editing, Writing – original draft, Conceptualization. Z-wH: Conceptualization, Writing – review & editing, Writing – original draft.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This study was supported by the Science and Technology Projects of Qinzhou (No. 201612).
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnut.2025.1726151/full#supplementary-material
References
1. La Via L, Sangiorgio G, Stefani S, Marino A, Nunnari G, Cocuzza S, et al. The global burden of sepsis and septic shock. Epidemiologia. (2024) 5:456–78. doi: 10.3390/epidemiologia5030032
2. Martin-Loeches I, Singer M, Leone M. Sepsis: key insights, future directions, and immediate goals. A review and expert opinion. Intensive Care Med. (2024) 50:2043–9. doi: 10.1007/s00134-024-07694-z
3. De Waele E, Malbrain M, Spapen H. Nutrition in sepsis: a bench-to-bedside review. Nutrients. (2020) 12:395. doi: 10.3390/nu12020395
4. Bai B, Liu X, Li H. Predictive value of prognostic nutritional index for the long-term prognosis of elderly patients with fracture: a systematic review and meta-analysis. Front Nutr. (2025) 12:1631128. doi: 10.3389/fnut.2025.1631128
5. Ding H, Zou X. Association of prognostic nutritional index with all-cause and cardiovascular mortality in adults with depression: NHANES 2005–2018. Front Nutr. (2025) 12:1599830. doi: 10.3389/fnut.2025.1599830
6. Prescott HC, Heath M, Munroe ES, Blamoun J, Bozyk P, Hechtman RK, et al. Development and validation of the hospital medicine safety sepsis initiative mortality model. Chest. (2024) 166:1035–45. doi: 10.1016/j.chest.2024.06.3769
7. Zhuang J, Huang H, Jiang S, Liang J, Liu Y, Yu X, et al. A generalizable and interpretable model for mortality risk stratification of sepsis patients in intensive care unit. BMC Med Inform Decis Mak. (2023) 23:185. doi: 10.1186/s12911-023-02279-0
8. Alhumaidi NH, Dermawan D, Kamaruzaman HF, Alotaiq N. The use of machine learning for analyzing real-world data in disease prediction and management: systematic review. JMIR Med Inform. (2025) 13:e68898. doi: 10.2196/68898
9. Fanizzi A, Arezzo F, Cormio G, Comes MC, Cazzato G, Boldrini L, et al. Massafra: an explainable machine learning model to solid adnexal masses diagnosis based on clinical data and qualitative ultrasound indicators. Cancer Med. (2024) 13:e7425. doi: 10.1002/cam4.7425
10. Pudjihartono N, Fadason T, Kempa-Liehr AW, O'Sullivan JM. A review of feature selection methods for machine learning-based disease risk prediction. Front Bioinform. (2022) 2:927312. doi: 10.3389/fbinf.2022.927312
11. Xu Z, Zhang R, Chen Q, Peng G, Luo S, Liu C, et al. Machine learning algorithms for risk factor selection with application to 60-day sepsis morbidity risk for a geriatric hip fracture cohort. BMC Geriatr. (2025) 25:591. doi: 10.1186/s12877-025-06213-z
12. Wang Y, Wang Y, Song L, Li J, Xie Y, Yan L, et al. Clinical characteristics of bronchopulmonary dysplasia and the risk of sepsis onset prediction via machine learning models. Front Pediatr. (2025) 13:1566747. doi: 10.3389/fped.2025.1566747
13. Singer M, Deutschman CS, Seymour CW, Shankar-Hari M, Annane D, Bauer M, et al. The third international consensus definitions for sepsis and septic shock (sepsis-3). JAMA. (2016) 315:801–10. doi: 10.1001/jama.2016.0287
14. Qin Z, Li L, Hou M, Wang C. The relationship between prognostic nutritional index and its components (albumin and lymphocyte count) and all-cause mortality in lung cancer patients: a hospital-based study with database external validation. Front Nutr. (2025) 12:1649334. doi: 10.3389/fnut.2025.1649334
15. Lou J, Kong H, Xiang Z, Zhu X, Cui S, Li J, et al. The J-shaped association between the ratio of neutrophil counts to prognostic nutritional index and mortality in ICU patients with sepsis: a retrospective study based on the MIMIC database. Front Cell Infect Microbiol. (2025) 15:1603104. doi: 10.3389/fcimb.2025.1603104
16. Pan M, Li Z, Sheng S, Teng X, Li Y. Prognostic nutritional index as a potential predictor of prognosis in patients with sepsis: a retrospective cohort study. Front Nutr. (2025) 12:1600943. doi: 10.3389/fnut.2025.1600943
17. Ashworth I, Thielemans L, Chevassut T. Thrombocytopenia: the good, the bad and the ugly. Clin Med. (2022) 22:214–7. doi: 10.7861/clinmed.2022-0146
18. Zhu J, Dong Y, Liao P, Yin X, He J, Guo L. Prognostic value of hemoglobin in patients with sepsis: a systematic review and meta-analysis. Heart Lung. (2024) 64:93–9. doi: 10.1016/j.hrtlng.2023.12.001
19. Lee H. Procalcitonin as a biomarker of infectious diseases. Korean J Intern Med. (2013) 28:285–91. doi: 10.3904/kjim.2013.28.3.285
20. Gregoriano C, Heilmann E, Molitor A, Schuetz P. Role of procalcitonin use in the management of sepsis. J Thorac Dis. (2020) 12:S5–S15. doi: 10.21037/jtd.2019.11.63
21. Efat A, Shoeib S, Ebrahim E, Kassemy Z, Bedair HM, Abozenah M. Impact of indirect bilirubin and uric acid on outcomes of sepsis-associated acute kidney injury (sAKI). Int Urol Nephrol. (2022) 54:3009–16. doi: 10.1007/s11255-022-03232-2
22. Ji W, Wang G, Liu J. Association between prognostic nutritional index and all-cause mortality in critically ill patients with ventilator-associated pneumonia: a retrospective study based on MIMIC-IV database. Front Nutr. (2025) 12:1605032. doi: 10.3389/fnut.2025.1605032
23. Al-Shami I, Hourani HMA, Alkhatib B. The use of prognostic nutritional index (PNI) and selected inflammatory indicators for predicting malnutrition in COVID-19 patients: a retrospective study. J Infect Public Health. (2023) 16:280–5 (2023) doi: 10.1016/j.jiph.2022.12.018
24. Hill B, Mitchell A, Szydlowska A, Sparks C, Dunn H, Berry H. The role of nutrition in wound healing and implications for nursing practice. Br J Nurs. (2025) 34:S39–42. doi: 10.12968/bjon.2024.0288
Keywords: biomarkers, machine learning algorithms, mortality, predictive model, sepsis
Citation: Yang Y-l, Huang Y-s, Bai Y-l, Huang X-x, Fu Z-y and Huang Z-w (2026) Development and validation of a nutrition-integrated nomogram for predicting 28-day mortality in sepsis patients. Front. Nutr. 12:1726151. doi: 10.3389/fnut.2025.1726151
Received: 16 October 2025; Revised: 18 December 2025;
Accepted: 29 December 2025; Published: 20 January 2026.
Edited by:
Qinghe Meng, Upstate Medical University, United StatesCopyright © 2026 Yang, Huang, Bai, Huang, Fu and Huang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhi-wei Huang, ZHJodWFuZ3p3MDFAMTYzLmNvbQ==
†These authors have contributed equally to this work
Yong-ling Yang1†