Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Nutr., 23 January 2026

Sec. Clinical Nutrition

Volume 12 - 2025 | https://doi.org/10.3389/fnut.2025.1741052

Explainable machine learning model to predict refeeding syndrome in patients with severe acute pancreatitis


Cui Wu&#x;Cui Wu1Shuangshuang Jing,&#x;Shuangshuang Jing2,3Dinghui Guo&#x;Dinghui Guo2Congcong Cheng,
Congcong Cheng2,3*Sujuan Fei
Sujuan Fei3*
  • 1Department of Nursing, Xuzhou Tongshan District Hospital of Traditional Chinese Medicine, Xuzhou, Jiangsu, China
  • 2Xuzhou Medical University, Xuzhou, Jiangsu, China
  • 3Department of Gastroenterology, The Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China

Objective: To construct and validate a risk prediction model for refeeding syndrome (RFS) in patients with severe acute pancreatitis (SAP), identify high-risk individuals before overt electrolyte abnormalities occur, and provide decision support for the timing of enteral nutrition initiation and early personalized intervention.

Methods: A retrospective cohort study was conducted on SAP patients admitted to Xuzhou Medical University Affiliated Hospital (XYFY) between September 2018 and September 2025. Patients were divided into RFS and Non-RFS groups based on the development of RFS after initiating enteral nutrition. Clinical data differences between groups were compared, least absolute shrinkage and selection operator (LASSO) regression was used for feature selection, and six machine learning (ML) algorithms were applied to build prediction models. Model performance was evaluated using receiver operating characteristic (ROC) curves, calibration curves, and decision curves. SHapley Additive exPlanations (SHAP) analysis was performed to interpret the contribution of key features.

Results: Seven predictive features were identified for model construction. The gradient boosting machine (GBM) model exhibited good generalization ability, with area under the curve (AUC) values of 0.851 (95% CI: 0.809–0.894) in the training set and 0.762 (95% CI: 0.672–0.852) in the testing set. Calibration curves confirmed consistency between predicted probabilities and actual outcomes, while decision curves demonstrated favorable net benefits across different clinical decision thresholds. SHAP analysis ranked feature importance as follows: serum potassium (K), serum sodium (Na), serum calcium (Ca), gastrointestinal decompression, blood urea nitrogen (BUN), diabetes mellitus (DM) history, and diuretic use.

Conclusion: The GBM model effectively predicts RFS risk in SAP patients after initiating enteral nutrition.

1 Introduction

Refeeding syndrome (RFS) is a severe metabolic complication characterized by profound electrolyte disturbances, fluid imbalance, and organ dysfunction. It occurs when nutritional support is restarted in patients with prolonged fasting or severe malnutrition, driven by core pathophysiological changes that result from rapid insulin secretion recovery and subsequent intracellular electrolyte shifts (1). In critically ill patients, RFS is closely associated with increased mortality and prolonged hospital stays (2). Although consensus definitions and high-risk population criteria have been proposed by institutions such as the American Society for Parenteral and Enteral Nutrition (ASPEN) (3, 4), early identification remains challenging in clinically complex populations.

Patients with severe acute pancreatitis (SAP) are at extremely high risk of RFS. On one hand, SAP-induced systemic inflammatory response, hypermetabolic consumption, and gastrointestinal dysfunction rapidly lead to malnutrition (5, 6). On the other hand, early enteral nutrition—the cornerstone of SAP treatment—serves as a classic trigger for RFS (7). However, existing RFS prediction models primarily focus on general intensive care unit (ICU) or hospital-wide populations (810). Their predictive variables (e.g., specific urinary biomarkers) or targets (e.g., isolated hypophosphatemia) lack applicability and specificity in the unique pathophysiological context of SAP. Thus, developing an interpretable RFS risk prediction tool tailored to SAP patients, based on routine clinical data, is crucial for precise early warning and personalized nutrition management.

Recently, machine learning (ML) has demonstrated significant advantages in medical prediction models. It captures complex non-linear relationships between variables, potentially outperforming traditional logistic regression (LR) (11, 12). Building on this, we systematically collected clinical data from SAP patients, constructed RFS risk prediction models using multiple ML algorithms, and enhanced model interpretability via SHapley Additive exPlanations (SHAP) analysis. The present study aims to develop a dedicated RFS risk early warning tool for SAP patients, with a core focus on identifying high-risk individuals through routine clinical indicators before the initiation of nutritional support. It targets the state where RFS risk exists without overt electrolyte abnormalities, rather than diagnosing existing electrolyte disturbances or confirmed RFS, ultimately providing data support for the clinical development of individualized monitoring and preventive intervention plans.

2 Materials and methods

2.1 Data resource

This single-center retrospective cohort study included SAP patients admitted to Xuzhou Medical University Affiliated Hospital (XYFY) from September 2018 to September 2025. Patients were categorized into RFS and Non-RFS groups based on RFS development after initiating enteral nutrition. Electronic medical record data were collected for model construction and validation. The study adhered to the Declaration of Helsinki and was approved by the Ethics Committee of Xuzhou Medical University Affiliated Hospital (Ethics No.: XYFY2025-KL236-01). Written informed consent was waived due to the retrospective nature and de-identified patient data.

2.2 Inclusion and exclusion criteria

Inclusion criteria: (1) SAP patients who received enteral nutrition during hospitalization. In addition to meeting acute pancreatitis (AP) diagnostic criteria (13), patients must satisfy one of the following: ① Meeting revised Atlanta classification (RAC) severe criteria (14), i.e., organ dysfunction lasting >48 h. Organ dysfunction was defined using the modified Marshall scoring system (15), with any organ score ≥2. Specifically, organ dysfunction was present if oxygenation index < 300, serum creatinine (Scr) > 170 μmol/L, or systolic blood pressure < 90 mmHg unresponsive to fluid resuscitation; ② Inability to obtain the complete above data, but ICU stay > 48 h.

Exclusion criteria: ① Age < ②18 years; ③ Hospital stay < 48 h; Concurrent pregnancy or malignant tumors; ④ Risk factors for hypophosphatemia (e.g., hemodialysis, treatment for hyperphosphatemia); ⑤ History of parathyroidectomy; ⑥ Missing serum electrolyte data; ⑦ Mild and moderately severe acute pancreatitis. This study focuses on SAP patients because they exhibit more prominent systemic inflammatory response, hypermetabolic consumption, and gastrointestinal dysfunction, making them an extremely high-risk population for RFS. Currently, there is a lack of targeted prediction tools for this group. Excluding mild and moderately severe cases aims to ensure the homogeneity of the study cohort, minimize confounding effects introduced by differences in disease severity, and thus enable the constructed model to more accurately reflect the unique RFS risk characteristics of the high-risk SAP population.

2.3 Research variables

(1) Demographic data: Age, gender, body mass index (BMI), smoking history, drinking history, Nutritional Risk Screening (NRS) score, pancreatitis etiology classification;

(2) Comorbidities: Hypertension (HTN), diabetes mellitus (DM), liver disease (LD), chronic kidney disease (CKD);

(3) Pre-enteral nutrition treatments: Mechanical ventilation (MV), renal replacement therapy (RRT), gastrointestinal decompression, medication use (insulin, diuretics, glucocorticoids, etc.);

(4) Laboratory results before initiating enteral nutrition (last measurement): Total bilirubin (TBIL), alkaline phosphatase (ALP), prealbumin (PALB), albumin (ALB), blood urea nitrogen (BUN), serum creatinine (Scr), total cholesterol (Chol), triglycerides (TG), serum potassium (K), serum sodium (Na), serum magnesium (Mg), serum calcium (Ca), serum phosphorus (P).

2.4 Variable definitions

(1) Refeeding syndrome (RFS): Defined per 2020 ASPEN criteria (3). RFS was diagnosed if one or more serum electrolytes (phosphorus, potassium, or magnesium) decreased within 5 days of increasing energy intake, with any two electrolytes declining by ≥10% from pre-refeeding levels to ensure accuracy. Compared with the 2020 ASPEN criteria, this study adds a strict requirement of at least a 10% decrease in two or more electrolytes. The purpose is to improve the specificity of RFS diagnosis and avoid misclassifying non-nutrition-related electrolyte fluctuations as RFS. It should be noted that the definition in this study differs from some previous research. This is mainly because SAP patients have unique pathophysiological characteristics, and conventional criteria are prone to interference from non-nutritional factors. While the strict definition may underestimate mild or early cases, it better aligns with the actual clinical needs of SAP management.

(2) Nutritional Risk Screening (NRS) (16): This indicator is assessed upon patient admission. The tool includes three dimensions: age, nutritional status, and physical condition. Age ≥ 70 years = 1 point (< 70 = 0); nutritional status: normal = 0, 5% weight loss in 3 months or 50–75% normal food intake in 1 week = 1, 5% weight loss in 2 months or BMI 18.5–20.5 or 25–50% normal food intake in 1 week = 2, 5% weight loss in 1 month or BMI < 18.5 or < 25% normal food intake in 1 week = 3; physical condition: normal = 0, ambulatory but weak = 1, bedridden = 2, critically ill = 3. Total scores range from 0 to 7, with higher scores indicating greater nutritional risk.

(3) Pancreatitis etiology classification: (a) Biliary: Gallbladder/biliary stones on imaging plus abnormal liver function; (b) Alcoholic: Heavy drinking or long-term alcohol use (≥50 ml/d for ≥ 5 years) after excluding other causes; (c) Hypertriglyceridemic: Serum triglycerides >5.65 mmol/L; (d) Other: Not meeting the above three etiologies.

2.5 Model construction and evaluation

Missing data were handled using multiple imputation (17). Patients were randomly divided into training (70%) and testing (30%) sets for model training and validation. Least absolute shrinkage and selection operator (LASSO) regression was used for feature selection, with 10-fold cross-validation to evaluate performance and obtain a streamlined feature set.

Six ML algorithms were employed: logistic regression (LR), support vector machine (SVM), random forest (RF), gradient boosting machine (GBM), extreme gradient boosting (XGBoost), and elastic network (EN). Nested 10-fold cross-validation was used for hyperparameter optimization (HPO) to enhance model reliability.

Model performance was evaluated via: ① Discrimination: Area under the curve (AUC) of receiver operating characteristic (ROC) curves and confusion matrix-derived metrics (sensitivity, specificity, accuracy, precision, Youden index, F1 score); ② Calibration: Calibration curves and Hosmer–Lemeshow (H–L) test; ③ Clinical applicability: Decision curves.

2.6 Model interpretation and visualization

SHAP analysis was used to quantify and interpret feature importance in the optimal model (18). Specifically: ① Bar plots for global feature importance ranking; ② Beeswarm plots for SHAP value distribution across all samples; ③ Partial dependence plots for non-linear relationships between single features and model outputs; ④ Waterfall plots for decomposing individual sample prediction processes.

2.7 Statistical analysis

All statistical analyses, model construction, and visualization were performed using R 4.4.1. Normally distributed continuous variables are presented as mean ± standard deviation (Mean ± SD) with t-tests for group comparisons; non-normally distributed variables as median (interquartile range) [Median (IQR)] with Mann–Whitney U-tests. Categorical variables are presented as n (%) with chi-square or Fisher's exact tests. A two-tailed P < 0.05 was considered statistically significant. The study design is shown in Figure 1.

Figure 1
Flowchart of the RFS Prediction Model process. Initially, 761 SAP patients were screened in XYFY. After exclusions (age under 18, hospital stay under 48 hours, pregnancy, tumors, hypophosphatemia risks, parathyroidectomy history, missing electrolyte data), 461 patients were included. They were split into a training set of 322 and a testing set of 139 in a 7:3 ratio, proceeding to feature selection, model construction, evaluation, interpretation, and visualization.

Figure 1. Flowchart of the study. RFS, refeeding syndrome; SAP, severe acute pancreatitis; XYFY, the Affiliated Hospital of Xuzhou Medical University.

3 Results

3.1 Baseline characteristics of the study participants

A total of 461 SAP patients were included, with 166 (36.01%) developing RFS. Compared with the Non-RFS group, RFS patients had higher proportions of DM and CKD, more frequent use of gastrointestinal decompression and diuretics before enteral nutrition, lower TBIL levels, and higher BUN, serum potassium, and serum sodium levels (P < 0.05; Table 1). Patients were randomly divided into training (n = 322) and testing (n = 139) sets, with balanced baseline characteristics between groups (P > 0.05; Supplementary Table S1).

Table 1
www.frontiersin.org

Table 1. Baseline characteristics of SAP patients stratified by RFS.

3.2 Model construction and validation

LASSO regression identified seven optimal predictors: serum potassium, serum sodium, serum calcium, gastrointestinal decompression, BUN, DM history, and diuretic use (Figure 2). Results of ROC curves (Figure 3) showed that the RF model performed the best in the training set with an AUC of 0.925 (95% CI: 0.897–0.954). However, its AUC decreased significantly to 0.742 in the testing set, suggesting a potential overfitting tendency that impairs its extrapolation performance. In contrast, the GBM model exhibited robust and balanced predictive performance in both the training and testing sets, with AUC values of 0.851 and 0.762, respectively. Its performance on the testing set was the best among all models, demonstrating superior generalization ability. Comprehensive evaluation metrics for all models are presented in Table 2.

Figure 2
Graph A shows a plot of coefficients versus log lambda, displaying multiple lines converging as lambda decreases. Graph B presents the binomial deviance against log lambda, featuring a red line with error bars. Deviance decreases initially, then increases after reaching the optimal lambda.

Figure 2. Process of feature variable selection. (A) LASSO regression coefficient path plot; (B) LASSO binomial deviance plot.

Figure 3
Two ROC curve charts comparing different machine learning models. Chart A shows training set performance with models: LR (0.795), SVM (0.824), RF (0.925), GBM (0.851), XGBoost (0.869), EN (0.795). Chart B shows testing set performance with models: LR (0.732), SVM (0.75), RF (0.742), GBM (0.762), XGBoost (0.74), EN (0.732). Both charts plot sensitivity against 1-specificity.

Figure 3. (A) ROC curves of each model in the training set. (B) ROC curves of each model in the testing set. ROC, receiver operating characteristic; AUC, area under the curve; LR, logistic regression; SVM, support vector machine; RF, random forest; GBM, gradient boosting machine; XGBoost, eXtreme gradient boosting; EN, elastic network.

Table 2
www.frontiersin.org

Table 2. Comparison of the performance of each model in the training set and testing set.

3.3 Model calibration and clinical applicability

Calibration curves for both training and testing sets showed good agreement between predicted and actual probabilities (Figure 4). H–L tests in the testing set yielded P > 0.05 for all models, indicating no significant calibration bias (Supplementary Table S2). Decision curves demonstrated high net benefits for all models in both training and testing sets, confirming clinical applicability (Figure 5).

Figure 4
Graphs comparing calibration curves for training and testing sets. Graph A (Training Set) and Graph B (Testing Set) plot observed versus predicted probabilities. Each graph includes six model curves: LR, SVM, RF, GBM, XGBoost, and EN, indicated by different colors and styles. Both graphs show a positive trend, with curves varying in alignment to the ideal diagonal line, signifying calibration quality.

Figure 4. (A) Calibration curves of each model in the training set. (B) Calibration curves of each model in the testing set. LR, logistic regression; SVM, support vector machine; RF, random forest; GBM, gradient boosting machine; XGBoost, eXtreme gradient boosting; EN, elastic network.

Figure 5
Decision curve analysis for two sets. Panel A displays training set curves, and Panel B shows testing set curves. Both panels compare different models: LR (red), SVM (blue), RF (green), GBM (purple), XGBoost (orange), EN (yellow), All (gray), and None (black). The x-axis represents threshold probability, and the y-axis represents net benefit. Lines indicate model performance across probabilities.

Figure 5. (A) Decision curves of each model in the training set. (B) Decision curves of each model in the testing set. The net benefit of each model at different threshold probabilities for predicting RFS in SAP patients. LR, logistic regression; SVM, support vector machine; RF, random forest; GBM, gradient boosting machine; XGBoost, eXtreme gradient boosting; EN, elastic network.

After a comprehensive consideration of the overall performance, calibration status, and clinical applicability of each model, the GBM model exhibited the most robust generalization ability in the testing set. Its calibration curves and decision curves demonstrated good calibration and net benefit, so it was identified as the optimal model and used for subsequent SHAP analysis.

3.4 Model interpretation and visualization

SHAP analysis of the GBM model clarified feature importance and contributions: ① Feature importance ranking: serum potassium > serum sodium > serum calcium > gastrointestinal decompression > BUN > DM history > diuretic use (Figure 6A); ② Beeswarm plots demonstrates that patients with a history of DM, gastrointestinal decompression and diuretic use before the initiation of enteral nutrition, and low serum calcium as well as high blood urea nitrogen, serum potassium, and serum sodium within the normal range have a significantly increased risk of RFS (Figure 6B); ③ Partial dependence plots illustrated the association between BUN levels and RFS risk prediction (Figure 6C); ④ Waterfall plots visualized the prediction process for individual samples (e.g., Sample 2), showing the specific impact of each feature on RFS risk (Figure 6D).

Figure 6
Panel A shows a bar chart of SHAP feature importance with potassium (K) being the most important feature. Panel B displays a beeswarm plot illustrating the distribution of SHAP values for various features against their feature values. Panel C presents a SHAP dependence plot for blood urea nitrogen (BUN) with a fitted curve. Panel D features a SHAP waterfall plot for a sample, indicating contributions of each feature to the prediction, with potassium having the highest positive impact.

Figure 6. SHAP analysis visualizations for interpreting feature contributions in predicting RFS in SAP patients. (A) Bar plot of mean absolute SHAP values, showing the average impact magnitude of each feature on model predictions; (B) beeswarm plot of SHAP values, illustrating the distribution of feature impacts across different feature values; (C) dependency plot, showing the relationship between BUN and its SHAP value for SAP patients' RFS prediction; (D) waterfall plot for a single prediction instance, decomposing the model output into individual feature contributions. SHAP, SHapley additive explanations; K, serum potassium; Na, serum sodium; Ca, serum calcium; BUN, blood urea nitrogen; DM, diabetes mellitus.

4 Discussion

This study constructed and validated RFS risk prediction models for SAP patients using six ML algorithms based on single-center retrospective data. The GBM model showed good predictive performance and generalization ability, with seven easily accessible clinical features identified via LASSO regression and SHAP analysis. This provides a reliable tool for early RFS identification in SAP patients.

The seven identified predictors have clear clinical and pathophysiological significance. Electrolyte disturbances are central to RFS (19), and our model highlights high pre-refeeding serum potassium/sodium and low serum calcium as key warning signs. In SAP patients, elevated pre-refeeding serum potassium/sodium may reflect electrolyte distribution abnormalities or impaired excretion due to stress, dehydration, or underlying renal dysfunction (20). This “apparently normal or elevated” state often masks true electrolyte depletion, leading clinicians to overlook potential risks without targeted supplementation during enteral nutrition initiation. Once enteral nutrition starts, restored insulin secretion drives rapid intracellular shifts of potassium, phosphorus, and magnesium, causing acute serum depletion and typical RFS manifestations (21). Hypocalcemia is common in SAP, associated with peripancreatic fat necrosis (calcium soap formation), hypomagnesemia, and vitamin D metabolism disorders (22); refeeding further exacerbates calcium-phosphorus imbalance (23), increasing neuromuscular instability. Beyond electrolytes, DM history and diuretic use directly elevate metabolic imbalance risk by impairing insulin sensitivity and promoting renal electrolyte loss. Elevated BUN indicates dehydration or hypercatabolism, a marker of systemic metabolic disturbance (24). Finally, gastrointestinal decompression—an important procedure-related factor—signals severe gastrointestinal dysfunction and exacerbates electrolyte loss via digestive fluid depletion (25), compounding metabolic challenges during refeeding.

It is important to note that the core function of this model lies in risk early warning and stratification, and its prediction logic and clinical application require an in-depth understanding through the following three aspects. First, SHAP analysis (Figure 6B) reveals that key predictors such as serum potassium, serum sodium, serum calcium, and blood urea nitrogen affect RFS risk mainly through their relative fluctuations within the normal range. For SAP patients with laboratory indicators within the normal range, serum potassium or sodium close to the upper limit, or serum calcium tending to the lower limit, may indicate a pseudo-normal state of intracellular electrolyte depletion or potential metabolic disorders. The model can sensitively capture such subtle changes, assisting clinicians to strengthen monitoring or implement preventive interventions before initiating enteral nutrition. Second, diuretic use not only reflects the volume management needs or comorbidities (such as heart failure) of SAP patients but may also cause electrolyte loss through promoting renal excretion, thereby increasing the risk of RFS. Through feature integration in the multivariate machine learning model, this study has controlled its confounding effect at the statistical level. However, due to the retrospective nature of the study, details such as the specific types, dosages, and clear indications of diuretics could not be collected. When applying this model clinically, the contribution of this predictor needs to be comprehensively interpreted in combination with the patient's volume status, comorbidities, and medication background. Additionally, NRS scores assessed at admission were generally low in this study. The core reason is that the score is designed around chronic nutritional depletion indicators, while the nutritional risk of SAP patients stems from acute stress response, gastrointestinal dysfunction, and hypermetabolic state. Such acute risks are difficult to capture by traditional scoring tools relying on chronic indicators. This phenomenon further confirms the limited applicability of traditional nutritional risk scores in SAP populations, highlighting the need for developing disease-specific RFS risk prediction models.

Compared with previous studies, this is the first to focus on SAP patients—an extremely high-risk RFS population—and systematically compare multiple ML algorithms for model construction. For example, Zhan et al. (8) developed a well-discriminating model for general ICU patients, but included non-routinely monitored indicators (e.g., urinary epithelial cell count) with questionable applicability in SAP. Choi et al. (9) used XGBoost to predict refeeding hypophosphatemia, but the model targeted only one RFS component and included heterogeneous study populations, failing to reflect SAP-specific metabolic and inflammatory features. Zhang et al. (10) constructed an RFS prediction model for neurocritical patients, identifying risk factors such as drinking history and fasting duration, but the pathophysiological core (consciousness disorder-related nutritional intake impairment) differs fundamentally from SAP's pancreatitis-mediated digestive failure and hypercatabolism. Additionally, their model used traditional logistic regression, failing to capture complex non-linear relationships or quantify feature contributions via interpretability algorithms, limiting precision and acceptance in clinical translation. Our study fills the gap of SAP-specific RFS prediction models, with LASSO-selected predictors (e.g., electrolytes, gastrointestinal decompression) closely aligned with SAP clinical course, ensuring biological rationality and clinical relevance. Furthermore, SHAP analysis enhances the interpretability of the complex ML model, translating predictions into intuitively understandable clinical decision support. All predictors are derived from routine clinical data without requiring additional tests, ensuring excellent clinical practicality and wide applicability.

Nevertheless, this study has several limitations. First, the single-center, retrospective design and limited sample size may introduce selection bias. Center-specific practices such as enteral nutrition protocols and electrolyte monitoring strategies may affect model performance and generalizability. Future plans include multicenter external validation and single-center prospective validation, enrolling SAP patient cohorts from centers in different regions and with varying medical levels. The Bootstrap method will be used to calculate optimism-adjusted performance metrics for more rigorous evaluation of the model's generalization ability and clinical applicability. Second, this study focuses on SAP patients, so the model's extrapolation to mild and moderately severe acute pancreatitis patients requires caution; the SAP cohort was not further stratified by severity indicators such as Marshall score and duration of organ failure, and residual disease heterogeneity may impact model performance. Future studies will incorporate these severity indicators to optimize the model and expand its application scope through multicenter research. Third, due to the retrospective design, clinical data were not fully retrievable. Specific types and indications of diuretics were not collected, nor were details of nutritional support such as infusion rate and caloric targets—key modifiable factors in RFS management (26). Fourth, to improve the specificity of RFS diagnosis, this study adopted a stricter definition than the 2020 ASPEN criteria. While this helps reduce the inclusion of non-specific cases, it may exclude some early or mild RFS cases, potentially affecting the model's ability to identify the full spectrum of RFS risks. Future research can validate the model's performance under a broader definition.

5 Conclusion

In summary, this study developed and validated an explainable ML prediction model for RFS in SAP patients. The GBM model demonstrates satisfactory predictive performance and generalization ability, requiring only seven easily accessible clinical features. This tool enables early identification of high-risk SAP patients before nutritional support initiation, facilitating timely interventions and optimized nutrition management to improve patient outcomes.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by the Ethics Committee of Xuzhou Medical University Affiliated Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants' legal guardians/next of kin because this waiver was justified by the retrospective nature of the study, the use of previously collected data, the absence of direct contact with patients, and minimal risk to participants.

Author contributions

CW: Investigation, Conceptualization, Methodology, Resources, Writing – original draft. SJ: Writing – original draft, Methodology, Conceptualization, Validation, Resources. DG: Resources, Software, Writing – original draft, Data curation, Visualization, Investigation. CC: Formal analysis, Writing – original draft, Methodology, Visualization, Data curation, Software, Conceptualization, Writing – review & editing. SF: Project administration, Supervision, Conceptualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declared that financial support was not received for this work and/or its publication.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnut.2025.1741052/full#supplementary-material

References

1. Heuft L, Voigt J, Selig L, Stumvoll M, Schlögl H, Kaiser T. Refeeding syndrome. Dtsch Arztebl Int. (2023) 120:107–14. doi: 10.3238/arztebl.m2022.0381

Crossref Full Text | Google Scholar

2. Reber E, Friedli N, Vasiloglou MF, Schuetz P, Stanga Z. Management of refeeding syndrome in medical inpatients. J Clin Med. (2019) 8:2202. doi: 10.3390/jcm8122202

PubMed Abstract | Crossref Full Text | Google Scholar

3. da Silva JSV, Seres DS, Sabino K, Adams SC, Berdahl GJ, Citty SW, et al. ASPEN consensus recommendations for refeeding syndrome. Nutr Clin Pract. (2020) 35:178–95. doi: 10.1002/ncp.10474

PubMed Abstract | Crossref Full Text | Google Scholar

4. Matthews-Rensch K, Blackwood K, Lawlis D, Breik L, McLean C, Nguyen T, et al. The Australasian society of parenteral and enteral nutrition: consensus statements on refeeding syndrome. Nutr Diet. (2025) 82:128–42. doi: 10.1111/1747-0080.70003

PubMed Abstract | Crossref Full Text | Google Scholar

5. Arvanitakis M, Ockenga J, Bezmarevic M, Gianotti L, Krznarić Ž, Lobo DN, et al. ESPEN practical guideline on clinical nutrition in acute and chronic pancreatitis. Clin Nutr. (2024) 43:395–412. doi: 10.1016/j.clnu.2023.12.019

PubMed Abstract | Crossref Full Text | Google Scholar

6. Cañamares-Orbís P, García-Rayado G, Alfaro-Almajano E. Nutritional support in pancreatic diseases. Nutrients. (2022) 14:4570. doi: 10.3390/nu14214570

PubMed Abstract | Crossref Full Text | Google Scholar

7. Fostier R, Arvanitakis M, Gkolfakis P. Nutrition in acute pancreatitis: when, what and how. Curr Opin Clin Nutr Metab Care. (2022) 25:325–8. doi: 10.1097/MCO.0000000000000851

PubMed Abstract | Crossref Full Text | Google Scholar

8. Zhan X, Wang H, Bai Y. Development and internal validation of a clinical prediction model for refeeding syndrome in adult intensive care unit patients: a retrospective observational study. Int J Gen Med. (2025) 18:6233–43. doi: 10.2147/IJGM.S544989

PubMed Abstract | Crossref Full Text | Google Scholar

9. Choi TY, Chang M-Y, Heo S, Jang JY. Explainable machine learning model to predict refeeding hypophosphatemia. Clin Nutr ESPEN. (2021) 45:213–9. doi: 10.1016/j.clnesp.2021.08.022

PubMed Abstract | Crossref Full Text | Google Scholar

10. Zhang W, Zhang S-X, Chen S-F, Yu T, Tang Y. Development and validation of risk prediction model for refeeding syndrome in neurocritical patients. Front Nutr. (2023) 10:1083483. doi: 10.3389/fnut.2023.1083483

PubMed Abstract | Crossref Full Text | Google Scholar

11. Zhou Y, Ge Y-T, Shi X-L, Wu K-Y, Chen W-W, Ding Y-B, et al. Machine learning predictive models for acute pancreatitis: a systematic review. Int J Med Inform. (2022) 157:104641. doi: 10.1016/j.ijmedinf.2021.104641

PubMed Abstract | Crossref Full Text | Google Scholar

12. Liu Y, Cheng C, Li W, Gu J, Shi Y, Fei S. Machine learning-based prediction of short-term recurrence of colorectal adenomatous polyps following EMR: model development and validation study. Int J Med Inform. (2025) 206:106165. doi: 10.1016/j.ijmedinf.2025.106165

PubMed Abstract | Crossref Full Text | Google Scholar

13. Mederos MA, Reber HA, Girgis MD. Acute pancreatitis: a review. JAMA. (2021) 325:382–90. doi: 10.1001/jama.2020.20317

PubMed Abstract | Crossref Full Text | Google Scholar

14. Banks PA, Bollen TL, Dervenis C, Gooszen HG, Johnson CD, Sarr MG, et al. Classification of acute pancreatitis −2012: revision of the Atlanta classification and definitions by international consensus. Gut. (2013) 62:102–111. doi: 10.1136/gutjnl-2012-302779

Crossref Full Text | Google Scholar

15. Ni T, Wen Y, Zhao B, Ning N, Chen E, Mao E, et al. Characteristics and risk factors for extrapancreatic infection in patients with moderate or severe acute pancreatitis. Heliyon. (2023) 9:e13131. doi: 10.1016/j.heliyon.2023.e13131

PubMed Abstract | Crossref Full Text | Google Scholar

16. Hersberger L, Bargetzi L, Bargetzi A, Tribolet P, Fehr R, Baechli V, et al. Nutritional risk screening (NRS 2002) is a strong and modifiable predictor risk score for short-term and long-term clinical outcomes: secondary analysis of a prospective randomised trial. Clin Nutr. (2020) 39:2720–9. doi: 10.1016/j.clnu.2019.11.041

PubMed Abstract | Crossref Full Text | Google Scholar

17. Li J, Wang Z, Wu L, Qiu S, Zhao H, Lin F, et al. Method for incomplete and imbalanced data based on multivariate imputation by chained equations and ensemble learning. IEEE J Biomed Health Inform. (2024) 28:3102–13. doi: 10.1109/JBHI.2024.3376428

PubMed Abstract | Crossref Full Text | Google Scholar

18. Song Y, Zhang D, Wang Q, Liu Y, Chen K, Sun J, et al. Prediction models for postoperative delirium in elderly patients with machine-learning algorithms and SHapley additive exPlanations. Transl Psychiatry. (2024) 14:57. doi: 10.1038/s41398-024-02762-w

PubMed Abstract | Crossref Full Text | Google Scholar

19. Ponzo V, Pellegrini M, Cioffi I, Scaglione L, Bo S. The refeeding syndrome: a neglected but potentially serious condition for inpatients. A narrative review. Intern Emerg Med. (2021) 16:49–60. doi: 10.1007/s11739-020-02525-7

PubMed Abstract | Crossref Full Text | Google Scholar

20. Miller NE, Rushlow D, Stacey SK. Diagnosis and management of sodium disorders: hyponatremia and hypernatremia. Am Fam Physician. (2023) 108:476–86.

PubMed Abstract | Google Scholar

21. De Silva A, Nightingale JMD. Refeeding syndrome : physiological background and practical management. Frontline Gastroenterol. (2020) 11:404–9. doi: 10.1136/flgastro-2018-101065

PubMed Abstract | Crossref Full Text | Google Scholar

22. Buldukoglu OC. Hypocalcemia and acute pancreatitis: new perspectives. J Hepatobiliary Pancreat Sci. (2024) 31:e60. doi: 10.1002/jhbp.12059

PubMed Abstract | Crossref Full Text | Google Scholar

23. Schneeweiss-Gleixner M, Haselwanter P, Schneeweiss B, Zauner C, Riedl-Wewalka M. Hypophosphatemia after start of medical nutrition therapy indicates early refeeding syndrome and increased electrolyte requirements in critically ill patients but has no impact on short-term survival. Nutrients. (2024) 16:922. doi: 10.3390/nu16070922

PubMed Abstract | Crossref Full Text | Google Scholar

24. Ohbe H, Goto T, Nakamura K, Matsui H, Yasunaga H. Development and validation of early prediction models for new-onset functional impairment at hospital discharge of ICU admission. Intensive Care Med. (2022) 48:679–89. doi: 10.1007/s00134-022-06688-z

PubMed Abstract | Crossref Full Text | Google Scholar

25. Xu X, Cai L, Tian W, Yao Z, Zhao R, Zhao Y. Effects of different preoperative enteral nutrition feeding routes on the duration of gastrointestinal decompression after definitive surgery for small intestinal fistula. Langenbecks Arch Surg. (2021) 406:2837–48. doi: 10.1007/s00423-021-02301-x

PubMed Abstract | Crossref Full Text | Google Scholar

26. Jabłońska B, Mrowiec S. Nutritional support in patients with severe acute pancreatitis-current standards. Nutrients. (2021) 13:1498. doi: 10.3390/nu13051498

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: gradient boosting machine, machine learning, prediction model, refeeding syndrome, severe acute pancreatitis, SHAP analysis

Citation: Wu C, Jing S, Guo D, Cheng C and Fei S (2026) Explainable machine learning model to predict refeeding syndrome in patients with severe acute pancreatitis. Front. Nutr. 12:1741052. doi: 10.3389/fnut.2025.1741052

Received: 06 November 2025; Revised: 25 December 2025;
Accepted: 29 December 2025; Published: 23 January 2026.

Edited by:

Kuo-Cheng Lu, Fu Jen Catholic University Hospital, Taiwan

Reviewed by:

Yi-Chou Hou, Cardinal Tien Hospital, Taiwan
Chien-Lin Lu, Fu Jen Catholic University, Taiwan

Copyright © 2026 Wu, Jing, Guo, Cheng and Fei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Congcong Cheng, Y2Njb29sMTAwNUAxMjYuY29t; Sujuan Fei, eHlmeWZlaXNqOTlAMTYzLmNvbQ==

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.