Construction and evaluation of a machine learning-based predictive model for enteral nutrition feeding intolerance risk in ICU patients

Wang, Gaimei; Lu, Cendi; Solomon, Owusu Mensah; Gu, Yujia; Ling, Yijing; Xu, Fanchi; Tao, Yumin; Wei, Yehong

doi:10.3389/fnut.2025.1600319

ORIGINAL RESEARCH article

Front. Nutr., 09 July 2025

Sec. Clinical Nutrition

Volume 12 - 2025 | https://doi.org/10.3389/fnut.2025.1600319

Construction and evaluation of a machine learning-based predictive model for enteral nutrition feeding intolerance risk in ICU patients

Gaimei Wang¹^†

Cendi Lu¹^†

Owusu Mensah Solomon²

Yujia Gu³

Yijing Ling³

Fanchi Xu³

Yumin Tao⁴^*

Yehong Wei⁵^*

¹Department of Neurosurgery Unit, The Second Affiliated Hospital of Zhejiang Chinese Medical University, Hangzhou, China
²International Education College, Zhejiang Chinese Medical University, Hangzhou, China
³College of Nursing, Zhejiang Chinese Medical University, Hangzhou, China
⁴Ningbo Municipal Center for Disease Control and Prevention, Ningbo, China
⁵Intensive Care Unit, The Second Affiliated Hospital of Zhejiang Chinese Medical University, Zhejiang, China

Objective: We aim to investigate the factors influencing enteral nutrition feeding intolerance (ENFI) in critically ill patients and develop a risk prediction model for ENFI in intensive care unit (ICU) patients, utilizing three machine learning algorithms. This model will serve as an assessment tool for preventing and managing ENFI in ICU patients.

Methods: A total of 487 ICU patients from a tertiary hospital in Zhejiang Province between January 2021 and December 2023 were selected as the study subjects. The patients were randomly divided into a training set and a test set in an 8:2 ratio. Three machine learning algorithms—logistic regression (LR), support vector machine (SVM), and random forest (RF)—were used to construct the risk prediction model for ENFI in ICU patients. The predictive performance of the three models was compared using metrics such as AUC (area under the ROC curve), accuracy, precision, recall, and F1 score.

Results: The logistic regression model achieved an AUC of 0.9308, with an accuracy of 94.3%, precision of 95.4%, recall of 88.6%, and an F1-score of 0.9185 in correctly identifying ENFI risk in ICU patients. The random forest model attained an AUC of 0.9511, with an accuracy of 96.1%, precision of 97.7%, recall of 91.4%, and an F1-score of 0.9446. The support vector machine (SVM) model yielded an AUC of 0.9241, with an accuracy of 94.1%, precision of 96.8%, recall of 86.4%, and an F1-score of 0.9132.

Conclusion: The random forest model performed the best in this study, demonstrating superior predictive performance.

1 Introduction

Early enteral nutrition (EN) support is a crucial component of comprehensive treatment for ICU patients. It provides essential nutrients and helps maintain the integrity of the intestinal mucosal barrier, reduces hypercatabolism, and prevents secondary infections (1). However, the occurrence of enteral nutrition feeding intolerance (ENFI) severely impacts the delivery of enteral nutrition (2). ENFI (3) is a term for gastrointestinal problems like abdominal distension, diarrhea, and constipation during EN. These problems cause the patient to stop or suspend enteral nutrition, which keeps them from meeting their target caloric intake within 72 h. This increases the incidence of malnutrition and prolongs the duration of mechanical ventilation and ICU stay, thereby increasing the medical burden (4).

Recently, with in-depth research on EN, it has been found that early identification and precise prevention of ENFI can optimize EN management and improve clinical outcomes (5). Most existing ENFI risk prediction models are based on traditional logistic regression analysis, which assumes a linear relationship between independent and dependent variables (6, 7). However, in real-world scenarios, many independent variables have nonlinear or locally approximate linear effects on individual risk functions, which can reduce the model’s effectiveness to some extent (8). With the advancement of computer science, machine learning algorithms are increasingly being applied in the medical field (9). Machine learning-based prediction models can fully exploit data characteristics and explore complex relationships and patterns within the data, providing strong support for disease prevention, diagnosis, and treatment (10). In this study, three ML algorithms widely used in the medical field are selected, and this class of algorithms demonstrates strong analytical processing capabilities in handling medical data. The LR algorithm is simple in principle, effective at dealing with linear classification problems (e.g., disease diagnosis), with small sample size requirements, but easy to overfit (11). The algorithmic principle of the RF algorithm is more complex, and it is susceptible to overfitting problems due to the influence of training data noise, but its performance is stable in solving the classification problem, and the results have a certain degree of interpretability (12). SVM has better generalization ability and robustness and can achieve better classification results with a limited training set, but the computational cost is high and the memory demand is large (13).

Considering the limitations of the aforementioned models, this study will use three machine learning algorithms to construct a risk prediction model for ENFI in ICU patients. It aims to help clinicians identify high-risk patients, enabling timely preventive interventions to reduce EFI incidence.

2 Methods

2.1 Study population

A total of 3,179 patients admitted to the ICU of a tertiary hospital in Zhejiang Province from January 2022 to December 2023 were selected as the study subjects by convenience sampling, and 487 patients were finally included after screening. The specific process is detailed in Figure 1. The study was approved in written form by the Ethics Committee of the Second Affiliated Hospital of Zhejiang Chinese Medicine University under the approval number No. 050–01 of 2024, The Second Affiliated Hospital of Zhejiang Chinese Medical University. We de-identified the records for this study and waived informed consent, as outlined in the Declarations. Inclusion criteria: age ≥ 18 years; enteral nutrition initiated within 48 h of ICU admission. Exclusion criteria: history of gastrointestinal diseases or gastrointestinal surgery; enteral nutrition initiated before ICU admission; intra-abdominal pressure (IAP) ≥ grade III at ICU admission; inability to place a urinary catheter due to bladder or urethral conditions. This is a predictive modeling study using retrospective data, designed and reported following the TRIPOD+AI guideline for developing and validating multivariable prediction models.

Figure 1

Flowchart detailing the development of a prediction model. Out of 3,179 adult patients admitted to ICU from January 2021 to December 2023, 2,692 were excluded due to criteria such as short ICU stay, enteral feeding duration, pre-existing conditions, or incomplete records. 487 patients were enrolled. The data was split 8:2 into a training set with 390 patients and a test set with 97. A prediction model was developed and its performance evaluated through internal and external validation.

Figure 1. Flowchart of ICU patient enrollment and predictive model development.

2.2 Data collection

2.2.1 Questionnaire on influencing factors of ENFI in ICU patients

To identify risk factors for ENFI in intensive care unit patients, this study conducted a systematic search of the databases PubMed, Web of Science, and China Knowledge Network (CNKI) from the inception of the databases to January 30, 2024, using the Medical Subject Headings (MeSH) and free-text terms. Two members of the research team independently screened the literature based on the Joanna Briggs Institute (JBI) checklist and the Johns Hopkins University Evidence Assessment Criteria, with disagreements adjudicated by a third party. The literature-specific screening process is shown in the PRISMA flowchart (Figure 2). Eighteen candidate risk factors were extracted from 18 selected high-quality articles. These risk factors were refined through two rounds of the expert meeting method, resulting in the identification of 26 potential risk factors for enteral nutrition intolerance. These factors included clinical baseline data, biochemical markers, and intervention-related variables, and the specific entries are detailed in Tables 1, 2. This multistage approach ensured the methodological rigor and clinical relevance of the study. To ensure the accuracy and stability of the model, the sample size was calculated based on the requirement that it should be at least 5–10 times the number of independent variables. Considering 26 influencing factors, an ENFI incidence rate of 38%, and a 10% sample attrition rate, at least 376 samples were required. In practice, this study included 487 samples, meeting the sample size requirement. We divided the samples into a training set and a test set in an 8:2 ratio.

Figure 2

Flowchart depicting the process of selecting literature. Initial database search yielded 3122 results, reduced to 1232 after deduplication. Titles and abstracts filtered out 1043 texts, leaving 189. After full-text reading, 171 were excluded, resulting in 18 final documents.

Figure 2. PRISMA flowchart of literature selection for ENFI risk factors.

Table 1

Table 1. Comparison of factors influencing ENFI in ICU patients (non-normal distribution, N = 487).

Table 2

Table 2. Comparison of factors influencing ENFI in ICU patients (categorical variables, N = 487).

2.2.2 Assessment of ENFI

According to the diagnostic criteria for enteral nutrition feeding intolerance (ENFI) established by the Abdominal Problems Working Group of the European Society of Intensive Care Medicine (15), combined with clinical practice, the diagnostic criteria for ENFI in ICU patients were defined as follows: (1) Failure to achieve the target caloric intake of at least 20 kcal/(kg·d) within 72 h after initiating enteral nutrition; (2) Suspension or discontinuation of enteral nutrition due to gastrointestinal symptoms including abdominal distension (IAP ≥ 12 mmHg), vomiting (expulsion of gastric contents through the mouth occurring once or more), or diarrhea (≥3 episodes of loose watery stools per 24-h period, with each stool volume >200 g); or (3) Gastric residual volume (GRV) monitoring every 6 h, with either a single GRV measurement ≥200 mL or cumulative GRV exceeding 500 mL within 24 h. In this study, GRV was established as a secondary observational indicator, which is not used alone to assess enteral nutrition intolerance, but needs to be used in conjunction with other outcome indicators. This indicator was retained based on institutional research protocols, but according to the American Society for Parenteral and Enteral Nutrition (ASPEN) recommendations, GRV needs to be interpreted with caution and weighted strictly in the assessment system to ensure the scientific validity of the study conclusions. The assessment, based on the medical records from the first 7 days of enteral feeding, was uniformly applied to both the training and validation sets. It was conducted independently by a nutrition nurse specialist, a critical care specialist, and the researcher, with the diagnosis of ENFI requiring agreement from at least two of the evaluators.

2.3 Data processing

All predictor variables (e.g., APACHE II scores, IAP) were collected before or at the time of enteral nutrition initiation, while ENFI outcomes were assessed within the subsequent 7 days. This temporal sequence ensures the model’s applicability for early risk prediction. Data with missing rates exceeding 50% were excluded from the analysis (14), while data with missing rates below 50% were imputed using the random forest method via the “missForest” package in R (Handling missing data in a rheumatoid arthritis registry using a random forest approach—PubMed, no date).

2.4 Statistical methods

SPSS 25.0 statistical software was used for data analysis. For continuous variables with a normal distribution, descriptive statistics were shown as mean ± standard deviation (mean ± SD), and t-tests were used to see if there were any differences between the groups. For continuous variables without a normal distribution, we used the median and interquartile range to present descriptive statistics. The Mann–Whitney U test was used to compare differences between groups that were not parametric. For categorical variables, frequencies and percentages were used for descriptive statistics, and chi-square tests were used to analyze differences between groups. Binary logistic regression was used to further screen influencing factors, with variables with p < 0.05 included in subsequent analyses. Python 3.9 was used to build and evaluate the prediction models, with the dataset divided into an 80% training set and a 20% test set. The predictive performance of the three models (LR, SVM, and RF) was compared using the training and test sets.

3 Results

3.1 Incidence of ENFI in ICU patients

The incidence of ENFI in the ICU patients in this study was 35.9%, with 175 out of 487 patients experiencing it. Among the 487 patients, 325 were male (66.7%) and 162 were female (33.3%). The average age was 76.76 ± 13.71 years, with a minimum age of 22 and a maximum age of 101. The average BMI was 21.78 ± 1.82, with a minimum of 16.4 and a maximum of 29.3. The top three primary diagnoses were respiratory diseases (243 cases, 49.9%), neurological diseases (185 cases, 37.9%), and circulatory system diseases (26 cases, 5.3%).

3.2 Univariate analysis of influencing factors of ENFI in ICU patients

Univariate analysis was conducted on the influencing factors of ENFI in ICU patients. Continuous variables were tested for normal distribution using the Kolmogorov–Smirnov (K-S) test, and the results indicated that none of the continuous variables followed a normal distribution. Therefore, rank-sum tests were used for comparison, as shown in Table 1.

3.3 Binary logistic regression analysis of influencing factors of ENFI in ICU patients

Based on the results of the univariate analysis, 21 risk factors with p < 0.05 were selected as independent variables, including seven continuous variables and 14 categorical variables. Among the seven continuous variables, one was related to general information, and six were related to observational data. Among the 14 categorical variables, one was related to general information, and the rest were related to observational data. These variables were included in the logistic regression analysis, and the results are detailed in Tables 2, 3.

Table 3

Table 3. Logistic regression analysis of factors influencing ENFI in ICU patients (N = 487).

3.4 Model construction and evaluation

3.4.1 Model construction

We used Python 3.9 to build and evaluate the prediction models based on the results of the influencing factor analysis. The first step in the machine learning process was to import the model-selection module and initialize the environment by defining the data frame, target variable (feeding tolerance/intolerance), training set (80%), and test set (20%). Bootstrap stability verification was performed on the included features. The stability of the selected frequency for each feature in the resampling is greater than 95%, indicating that the importance of the selected feature is highly confident. To ensure the generalization ability of the model and avoid data leakage, the hierarchical random segmentation strategy was used to divide the dataset to ensure that all preprocessing (e.g., imputation, normalization) was only fitted on the training set. We used grid search and cross-validation on the training set to enhance the model’s performance and lower the likelihood of overfitting. We divided the dataset into an 80% training set and a 20% test set. Models were built using three algorithms: LR, SVM, and RF.

3.4.2 Evaluation of training set models

Table 4 shows the performance metrics on the training set for the three models built using different methods. Among them, the AUC of the RF model is 0.9511, and the F1 score is 0.9446, achieving the highest scores among the three groups of models. The ROC curves for each model on the training set are shown in Figure 3. We evaluated the model’s calibration using the calibration curve Brier score. Figure 4 displays the results of the calibration curve, which visualizes the model’s calibrability. The Brier score (0.0463) is a direct measure of probabilistic prediction accuracy, indicating that the model is well calibrated overall.

Table 4

Table 4. Performance metrics of the three methods of model building for the training set.

Figure 3

ROC curve graph comparing three models: Logistic Regression (AUC = 0.93), Support Vector Machine (AUC = 0.89), and Random Forest (AUC = 0.99). The x-axis represents the false positive rate, and the y-axis represents the true positive rate. Random Forest shows the highest performance with a curve near the top-left corner.

Figure 3. ROC curves of three models in the training set.

Figure 4

Calibration curve comparing model calibration and perfect calibration. The x-axis represents mean predicted probability, and the y-axis represents the fraction of positives. The blue line shows the model calibration, with fluctuations and a general upward trend. The orange dashed line represents perfect calibration, forming a diagonal from bottom left to top right.

Figure 4. Calibration curve of the Random Forest model.

3.4.3 Model validation and evaluation

We used resampling methods to validate the three models. The results of the test set showed that the RF model AUC of 0.982. Comprehensive analysis indicated that the RF model performed the best on the test set, as detailed in Table 5. The ROC curves for each model in the test set are shown in Figure 5.

Table 5

Table 5. Comparison of test set model performance (N = 98).

Figure 5

ROC curve showing the performance of three models: Logistic Regression, Support Vector Machine, and Random Forest. The Random Forest model has the highest AUC of 0.98. The curve plots True Positive Rate against False Positive Rate, with Random Forest outperforming the others.

Figure 5. ROC curves comparing model performance in the test set.

3.5 Feature importance ranking of ENFI influencing factors in ICU patients

Based on the comparison of the models, the RF model demonstrated the best overall performance and provided a ranking of feature importance. The seven influencing factors for ENFI occurrence, ranked in descending order of importance, include intra-abdominal pressure, APACHE II score, blood glucose level, and use of analgesics, among others, as illustrated in Figure 6.

Figure 6

Bar chart displaying factors affecting outcomes. IAP and APACHEII score have the largest impact, followed by blood glucose. Use of analgesics, early enema, mechanical ventilation, and use probiotics have smaller impacts.

Figure 6. Key predictors of ENFI in ICU patients in order of importance.

4 Discussion

4.1 Incidence of ENFI

This study included 487 ICU patients, of whom 175 experienced ENFI, accounting for 35.9% of the total. This figure is close to the 38% incidence rate reported by other scholars (16). This high rate highlights the urgent need for proactive identification and management of ENFI to mitigate complications such as prolonged ICU stays and malnutrition.

4.2 Influencing factors of ENFI

There are numerous factors influencing ENFI in ICU patients, and continuous exploration is needed. This study identified the following conclusions: The APACHE II score, intra-abdominal pressure, blood glucose level, mechanical ventilation, and several other factors independently influence early ENFI in ICU patients.

4.2.1 APACHE II score

The APACHE II score is an authoritative indicator for assessing the severity of illness and predicting prognosis in ICU patients. It is widely used in clinical practice, with higher scores indicating more severe illness and a worse prognosis (17). The more severe the patient’s condition, the more intense the systemic stress response, leading to pronounced vasoconstriction of splanchnic vessels, gastrointestinal mucosal ischemia, and even erosion, thereby impairing gastrointestinal function. On the other hand, the body enters a hypercatabolic state, triggering the breakdown, consumption, and loss of tissue proteins, resulting in hypoalbuminemia. This condition induces gastrointestinal mucosal edema, further exacerbating mucosal injury and ultimately reducing enteral nutrition tolerance in ICU patients (18). It was found that the intolerance group’s APACHE II score (33.29 ± 10.41) was significantly higher than the tolerance group’s (18.98 ± 6.41). Statistical analysis also confirmed that the APACHE II score is a separate risk factor for ENFI in ICU patients (p < 0.05), which is in line with what other research has found (2). Routine APACHE II scoring within 24 h of ICU admission can stratify high-risk patients, prompting closer EN monitoring and early interventions.

4.2.2 Intra-abdominal pressure

In this study, the IAP in the tolerance group was 7.29 ± 1.70 mmHg, while in the intolerance group, it was 11.02 ± 1.96 mmHg. In the intolerance group, 82 patients (46.86%) had intra-abdominal hypertension (IAP > 12 mmHg), but no cases of abdominal compartment syndrome (IAP > 20 mmHg) were observed. The fact that most patients suffered from respiratory or neurological diseases may explain this phenomenon. Compared to patients with gastrointestinal diseases, the increase in IAP in these patients was relatively mild, but the IAP in the intolerance group was still significantly higher than that in the tolerance group.

IAP (19) refers to the pressure within the abdominal cavity, and its increase can be attributed to factors such as increased organ volume, increased fluid volume, and the use of mechanical ventilation. The gastrointestinal tract is one of the most sensitive organs to increased IAP. As IAP rises, mesenteric blood flow decreases, and venous return is obstructed, leading to intestinal edema and impaired intestinal function, resulting in gastrointestinal adverse effects (20). Clinically, we often estimate IAP by measuring gastric, superior vena cava, inferior vena cava, or bladder pressure. Bladder pressure is considered the “gold standard” for IAP monitoring due to its simplicity, non-invasiveness, accuracy, and minimal influence by human factors or the disease itself (21). This study also used bladder pressure as a proxy for IAP. Healthcare providers should regularly monitor IAP in clinical practice and actively seek causes and interventions for patients with high IAP.

4.2.3 Blood glucose level

In this study, the blood glucose level in the intolerance group was 9.34 ± 3.50 mmol/L, higher than that in the tolerance group (7.34 ± 3.33 mmol/L). For critically ill patients, hyperglycemia may result not only from pre-existing diabetes but also from various other factors (22). Stress-induced hyperglycemia refers to elevated blood glucose levels in patients without a history of diabetes, occurring in response to severe trauma, shock, cardiovascular accidents, or other stressors (23). Elevated blood glucose levels can reflexively reduce the tension of the gastric antrum smooth muscle, leading to decreased gastric motility and symptoms such as gastric retention. Furthermore, high blood glucose can make the pylorus work harder, which can make the contractions of the stomach and duodenum not work together properly. Such conditions can cause problems with emptying the stomach and greatly raise the risk of ENFI (18, 28). Therefore, healthcare providers should pay close attention to blood glucose monitoring in critically ill ICU patients. When hyperglycemia occurs, appropriate measures should be taken to maintain blood glucose within a relatively stable range, which can help reduce the incidence of ENFI.

4.2.4 Mechanical ventilation

The results of this study indicate that ICU patients on mechanical ventilation are more likely to experience ENFI, and the use of mechanical ventilation is a risk factor for ENFI in ICU patients (p < 0.05). Mechanical ventilation is an artificial support system that controls or alters a patient’s spontaneous breathing movements. Its purpose is to maintain airway patency, improve ventilation and oxygenation, and prevent carbon dioxide retention and hypoxia. It is a common treatment method for critically ill patients in clinical practice (24). High levels of positive end-expiratory pressure (PEEP) can cause organs around the heart to have poor blood flow, lower cardiac output, and gastrointestinal ischemia. Such condition can slow the movement of food through the digestive tract or damage the mucosa, which can set off ENFI. On the other hand, mechanical ventilation can cause gas to enter the stomach or lead to bile reflux, further increasing IAP and affecting the patient’s tolerance to enteral nutrition. Therefore, in real life, IAP monitoring should be done regularly on patients on mechanical ventilation before and after they start enteral nutrition so that targeted measures can be taken in time if ENFI happens. Additionally, energy expenditure can be estimated based on carbon dioxide production calculated by the ventilator, and individualized feeding plans can be developed based on the patient’s energy expenditure to reduce the occurrence of ENFI (25).

4.3 Model evaluation

Research on enteral nutrition feeding intolerance (ENFI) prediction has been conducted for many years (27). This study developed prediction models for risk factor screening by using conventional clinical data, which were analyzed through univariate analysis and logistic regression, while employing three machine learning algorithms. The three machine learning algorithms—LR, SVM, and RF—each exhibit distinct advantages and limitations in predicting enteral nutrition feeding intolerance (ENFI) in ICU patients. LR offers the best interpretability, providing clinically actionable odds ratios, but its linearity assumption may overlook complex interactions among risk factors. SVM captures nonlinear relationships through kernel functions, yet its “black-box” nature and sensitivity to class imbalance limit its clinical utility. Our Random Forest (RF) model achieves superior performance (AUC = 0.9511) by processing high-dimensional nutrient-specific data and automatically detecting feature interactions—albeit with the need for careful hyperparameter tuning due to its ensemble structure—findings that are consistent with and extend the benefits of ML as demonstrated by Ong et al. (26) in the context of ventilator management, collectively underscoring the transformative potential of machine learning in different predictive domains in the ICU (26).

To enhance the clinical interpretability of RF, this study employs a feature importance ranking method, intuitively illustrating the contribution of each feature to individual patient predictions, thereby facilitating clinical comprehension. The analysis confirms intra-abdominal pressure as the most critical predictor, aligning with established physiological mechanisms of ENFI pathogenesis. By bridging the gap between algorithmic predictions and clinical decision-making, this understandable AI framework enables clinicians to access not only risk scores but also their underlying determinants, fostering trust and promoting implementation in critical care settings. This approach adheres to current standards for transparent AI in healthcare, demonstrating a reproducible method for deploying machine learning tools in clinical environments where interpretability is paramount.

Simultaneously, we fully acknowledge concerns about potential overfitting due to the high AUC values in clinical prediction models. To this end, we systematically optimize and validate the model development process, and the results show that we use grid search + 5-fold cross-validation to tune the model parameters, and through parameter tuning and rigorous validation, the AUC of the training set decreases by 0.041, and the AUC of the test set maintains at 0.981, with the difference between the two values of 0.031, which is much lower than the threshold for hinting at overfitting in the clinical prediction model, and the model maintains high performance while overfitting.

These results indicate that the optimized model achieves high performance while minimizing the risk of overfitting. The stable performance on the test set further demonstrates that the model captures true predictive signals rather than noise.

In terms of clinical applicability, the Brier score (0.0463) and calibration curve of the optimized model demonstrated good overall calibration, especially in the high-risk interval (predicted probability > 0.8) where it was in high agreement with the ideal curve. This finding holds significant clinical relevance: when the model predicts an ENFI probability≥ 80%, clinicians can confidently initiate parenteral nutrition support to avoid complications from feeding intolerance. However, in the intermediate-risk range (0.4–0.6), deviations between predicted probabilities and observed frequencies suggest the need for integrating additional clinical indicators (e.g., gastric residual volume monitoring, bowel sound assessment) for comprehensive decision-making.

While current static models have shown strong predictive performance, we recognize that these models may not fully capture the time-series dynamics inherent in the condition of ICU patients. To enhance the timeliness and clinical relevance of the models, future work will focus on developing a dynamic prediction framework that incorporates longitudinal parameters. This approach will take full advantage of the complementary strengths of the Long Short-Term Memory (LSTM) network for analyzing temporal patterns and the Random Forest (RF) algorithm for dealing with static features, while integrating real-time data collection through the Hospital Information System (HIS) to create a comprehensive closed-loop “monitor-predict-intervene” management system.

5 Limitations and future directions

Although the risk of overfitting was reduced by parameter tuning and calibration analysis, this study was still a single-center retrospective analysis, and the generalization of the model for cross-institutional and cross-population data needs to be further verified. The current model was developed using retrospective data from an ICU in a tertiary hospital in East China (N = 487), and its clinical application may be subject to several constraints. First, the training data predominantly originated from high-level medical centers in a specific region, potentially limiting its generalizability to diverse healthcare settings with varying institutional levels, heterogeneous population characteristics, and distinct clinical protocols. Second, while internal validation demonstrated satisfactory performance, the model’s robustness in real-world scenarios necessitates rigorous external validation. Therefore, we suggest using this model as a secondary decision-support tool in clinical practice, complementing physicians’ clinical judgment.

To address these limitations, we propose a multicenter prospective validation study to systematically evaluate the model’s external validity. This investigation will enroll ICU patients from 12 hospitals of different tiers across five geographical regions (East, North, South, West, and Central China; projected sample size N = 1,500), implementing standardized prospective data collection protocols. The study will specifically examine: (1) predictive stability across varying healthcare resource allocations; (2) applicability in ethnically and geographically diverse populations; and (3) robustness in heterogeneous clinical practice environments. Scheduled for initiation in the second quarter of 2026 with an 18-month duration, the study will employ standardized inter-center quality control measures and regular data coordination meetings to ensure data comparability and reliability.

After completing the external validation, the integration of the predictive model with the hospital electronic health record (EHR) system has a promising application, but still faces many challenges at the technical and clinical levels. In terms of technical implementation, it is necessary to develop standardized API interfaces to interface with mainstream EHR platforms (e.g., Epic) to achieve automatic extraction of key parameters such as APACHE II scores, and at the same time, adopt a containerized deployment scheme to take into account computational efficiency, compatibility of data architectures, and privacy compliance requirements such as the Protection of Individual Personal Information Law (PIPL). In the future, by integrating the time series analysis capability of LSTM network and the feature processing advantage of Random Forest (RF) algorithm, and combining with the real-time data collection function of Hospital Information System (HIS), we can build a more complete dynamic prediction framework, and eventually form a closed-loop management system of “Monitoring-Prediction-Intervention,” which will significantly improve the timeliness and clinical relevance of the model. In terms of clinical implementation, the system will provide visual risk warning (e.g., traffic light indicators) and intelligent decision support (e.g., automated feeding regimen suggestions), and ensure the successful application of the prediction model in real-life healthcare scenarios through the seamless integration with existing clinical workflows, as well as the layered training program, system optimization feedback mechanism, and multidisciplinary collaboration mechanism.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by The Second Affiliated Hospital of Zhejiang Chinese Medical University Ethics Committee. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin because records of this study were de-identified and informed consent waived as set forth in the ethical review. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

GW: Conceptualization, Data curation, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review & editing. CL: Conceptualization, Data curation, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review & editing. YW: Supervision, Validation, Writing – review & editing. OS: Writing – review & editing. YG: Writing – review & editing. YL: Writing – review & editing. FX: Writing – review & editing. YT: Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the Zhejiang Province Traditional Chinese Medicine Science and Technology Project (Grant Number: 2025ZL322).

Acknowledgments

The authors would like to thank all participants and collaborators for their contributions to this study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Fuentes Padilla, P, Martínez, G, Vernooij, RW, Urrútia, G, Roqué I Figuls, M, and Bonfill Cosp, X. Early enteral nutrition (within 48 hours) versus delayed enteral nutrition (after 48 hours) with or without supplemental parenteral nutrition in critically ill adults. Cochrane Database Syst Rev. (2019) 2019:CD012340. doi: 10.1002/14651858.CD012340.pub2

PubMed Abstract | Crossref Full Text | Google Scholar

2. Yahyapoor, F, Dehnavi, Z, Askari, G, Ranjbar, G, Hejri Zarifi, S, Bagherniya, M, et al. The prevalence and possible causes of enteral tube feeding intolerance in critically ill patients: a cross-sectional study. J Res Med Sci. (2021) 26:60. doi: 10.4103/jrms.JRMS_689_20

PubMed Abstract | Crossref Full Text | Google Scholar

3. van Zanten, ARH, De Waele, E, and Wischmeyer, PE. Nutrition therapy and critical illness: practical guidance for the ICU, post-ICU, and long-term convalescence phases. Crit Care. (2019) 23:368. doi: 10.1186/s13054-019-2657-5

PubMed Abstract | Crossref Full Text | Google Scholar

4. Reintam Blaser, A, Malbrain, MLNG, Starkopf, J, Fruhwald, S, Jakob, SM, de Waele, J, et al. Gastrointestinal function in intensive care patients: terminology, definitions and management. Recommendations of the ESICM working group on abdominal problems. Intensive Care Med. (2012) 38:384–94. doi: 10.1007/s00134-011-2459-y

PubMed Abstract | Crossref Full Text | Google Scholar

5. Raphaeli, O, Statlender, L, Hajaj, C, Bendavid, I, Goldstein, A, Robinson, E, et al. Using machine-learning to assess the prognostic value of early enteral feeding intolerance in critically ill patients: a retrospective study. Nutrients. (2023) 15:2705. doi: 10.3390/nu15122705

PubMed Abstract | Crossref Full Text | Google Scholar

6. Lu, X-M, Jia, DS, Wang, R, Yang, Q, Jin, SS, and Chen, L. Development of a prediction model for enteral feeding intolerance in intensive care unit patients: a prospective cohort study. World J Gastrointe Surg. (2022) 14:1363–74. doi: 10.4240/wjgs.v14.i12.1363

PubMed Abstract | Crossref Full Text | Google Scholar

7. Wang, Y, Li, Y, Wang, H, Li, H, Li, Y, Zhang, L, et al. Development and validation of a nomogram for predicting enteral feeding intolerance in critically ill patients (NOFI): mixed retrospective and prospective cohort study. Clin Nutr. (2023) 42:2293–301. doi: 10.1016/j.clnu.2023.10.003

PubMed Abstract | Crossref Full Text | Google Scholar

8. Yue, S, Li, S, Huang, X, Liu, J, Hou, X, Zhao, Y, et al. Machine learning for the prediction of acute kidney injury in patients with sepsis. J Transl Med. (2022) 20:215. doi: 10.1186/s12967-022-03364-0

PubMed Abstract | Crossref Full Text | Google Scholar

9. Habehh, H, and Gohel, S. Machine learning in healthcare. Curr Genomics. (2021) 22:291–300. doi: 10.2174/1389202922666210705124359

PubMed Abstract | Crossref Full Text | Google Scholar

10. Shu, X, and Ye, Y. Knowledge discovery: methods from data mining and machine learning. Soc Sci Res. (2023) 110:102817. doi: 10.1016/j.ssresearch.2022.102817

PubMed Abstract | Crossref Full Text | Google Scholar

11. Burnett, A, Chen, N, Zeritis, S, Ware, S, McGillivray, L, Shand, F, et al. ‘Machine learning algorithms to classify self-harm behaviours in new south Wales ambulance electronic medical records: a retrospective study’, Int J Med Inform, (2022) 161, p. 104734. doi: 10.1016/j.ijmedinf.2022.104734

Crossref Full Text | Google Scholar

12. Hu, J, and Szymczak, S. ‘A review on longitudinal data analysis with random forest’, Brief Bioinform. (2023) 24, p. bbad002. doi: 10.1093/bib/bbad002

Crossref Full Text | Google Scholar

13. Akram-Ali-Hammouri, Z, Fernández-Delgado, M, Cernadas, E, and Barro, S. ‘Fast support vector classification for large-scale problems’, IEEE Trans. Pattern Anal. Mach. Intell. (2022) 44, pp. 6184–6195. doi: 10.1109/TPAMI.2021.3085969

Crossref Full Text | Google Scholar

14. Heymans, MW, and Twisk, JWR. ‘Handling missing data in clinical research’, J Clin Epidemiol. (2022) 151, pp. 185–188. doi: 10.1016/j.jclinepi.2022.08.016

Crossref Full Text | Google Scholar

15. Jenkins, B, Calder, PC, and Marino, LV. A systematic review of the definitions and prevalence of feeding intolerance in critically ill adults. Clin Nutr ESPEN. (2022) 49:92–102. doi: 10.1016/j.clnesp.2022.04.014

PubMed Abstract | Crossref Full Text | Google Scholar

16. Lin, J, Liu, Y, Ke, L, Li, G, Lv, C, Zhou, J, et al. Feeding intolerance score in critically ill patients with enteral nutrition: a post hoc analysis of a prospective study. Nutr Clin Pract. (2022) 37:869–77. doi: 10.1002/ncp.10788

Crossref Full Text | Google Scholar

17. Matejovic, M, Huet, O, Dams, K, Elke, G, Vaquerizo Alonso, C, Csomos, A, et al. Medical nutrition therapy and clinical outcomes in critically ill adults: a European multinational, prospective observational cohort study (EuroPN). Crit Care. (2022) 26:143. doi: 10.1186/s13054-022-03997-z

PubMed Abstract | Crossref Full Text | Google Scholar

18. Reintam Blaser, A, Preiser, JC, Fruhwald, S, Wilmer, A, Wernerman, J, Benstoem, C, et al. Gastrointestinal dysfunction in the critically ill: a systematic scoping review and research agenda proposed by the section of metabolism, endocrinology and nutrition of the European society of intensive care medicine. Crit Care. (2020) 24:224. doi: 10.1186/s13054-020-02889-4

PubMed Abstract | Crossref Full Text | Google Scholar

19. De Laet, IE, Malbrain, MLNG, and De Waele, JJ. A clinician’s guide to Management of Intra-abdominal Hypertension and Abdominal Compartment Syndrome in critically ill patients. Crit Care. (2020) 24:97. doi: 10.1186/s13054-020-2782-1

PubMed Abstract | Crossref Full Text | Google Scholar

20. Hu, L, Nie, Z, Zhang, Y, Zhang, Y, Ye, H, Chi, R, et al. Development and validation of a nomogram for predicting self-propelled postpyloric placement of spiral nasoenteric tube in the critically ill: mixed retrospective and prospective cohort study. Clin Nutr. (2019) 38:2799–805. doi: 10.1016/j.clnu.2018.12.008

PubMed Abstract | Crossref Full Text | Google Scholar

21. Fernández, LG, and Matthews, MR. Clinical observations in patients with open abdomens managed with negative pressure therapy using a perforated foam dressing: a limited case series with brief literature review. Wounds. (2024) 36:61–6. doi: 10.25270/wnds/20017

PubMed Abstract | Crossref Full Text | Google Scholar

22. Al-Yousif, N, Rawal, S, Jurczak, M, Mahmud, H, and Shah, FA. Endogenous glucose production in critical illness. Nutr Clin Pract. (2021) 36:344–59. doi: 10.1002/ncp.10646

PubMed Abstract | Crossref Full Text | Google Scholar

23. Scheen, M, Giraud, R, and Bendjelid, K. Stress hyperglycemia, cardiac glucotoxicity, and critically ill patient outcomes current clinical and pathophysiological evidence. Physiol Rep. (2021) 9:e14713. doi: 10.14814/phy2.14713

PubMed Abstract | Crossref Full Text | Google Scholar

24. Akella, P, Voigt, LP, and Chawla, S. To wean or not to wean: a practical patient focused guide to ventilator weaning. J Intensive Care Med. (2022) 37:1417–25. doi: 10.1177/08850666221095436

PubMed Abstract | Crossref Full Text | Google Scholar

25. Chinese Society of Neurosurgery. Chinese neurosurgery intensive care management group. Chin Med J. (2022) 102:2236–55.

Google Scholar

26. Ong, WJD, How, CH, Chong, WHK, Khan, FA, Ngiam, KY, and Kansal, A. Outcome prediction for adult mechanically ventilated patients using machine learning models and comparison with conventional statistical methods: a single-Centre retrospective study. Intell Based Med. (2024) 10:100165. doi: 10.1016/j.ibmed.2024.100165

Crossref Full Text | Google Scholar

27. Yang, H, Liu, J, and Sun, H. ‘Risk prediction model for adult intolerance to enteral nutrition feeding - A literature review’. Am. J. Med. Sci.. (2024) S0002-9629(24)01528–3. doi: 10.1016/j.amjms.2024.11.012

PubMed Abstract | Crossref Full Text | Google Scholar

28. Reintam Blaser, A, Rice, TW, and Deane, AM. Update on nutritional assessment and therapy in critical care. Curr Opin Crit Care. (2020) 26:197–204. doi: 10.1097/MCC.0000000000000694

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: ICU patients, enteral nutrition, feeding intolerance, machine learning, prediction model

Citation: Wang G, Lu C, Solomon OM, Gu Y, Ling Y, Xu F, Tao Y and Wei Y (2025) Construction and evaluation of a machine learning-based predictive model for enteral nutrition feeding intolerance risk in ICU patients. Front. Nutr. 12:1600319. doi: 10.3389/fnut.2025.1600319

Received: 26 March 2025; Accepted: 12 June 2025;
Published: 09 July 2025.

Edited by:

Zhang Haoling, University of Science Malaysia (USM), Malaysia

Reviewed by:

Wei Jun Dan Ong, National University Health System, Singapore
Peng Lu, Cangzhou Central Hospital, China
Liu Yanxi, University of Science Malaysia (USM), Malaysia

Copyright © 2025 Wang, Lu, Solomon, Gu, Ling, Xu, Tao and Wei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yehong Wei, eWVob25nODcyMEAxNjMuY29t; Yumin Tao, MjAxNDAwNTdAcXEuY29t

^†These authors share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.