- 1School of Nursing, Chengdu University of Traditional Chinese Medicine, Chengdu, China
- 2Hospital of Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan, China
Background: Although numerous models have been developed in recent years to predict malnutrition in cancer patients, their methodological rigor and clinical applicability remain uncertain. The lack of systematic evaluation hampers their integration into routine oncology and nursing practice, where early identification of at-risk patients is crucial for optimizing nutritional interventions, enhancing treatment tolerance, and reducing morbidity and mortality.
Objective: This systematic review aims to synthesize and critically evaluate existing risk prediction models for malnutrition in cancer patients, thereby providing evidence-based insights to inform model development and clinical implementation.
Methods: Databases including PubMed, Embase, Web of Science, the Cochrane Library, and Scopus were systematically searched to identify studies on risk prediction models for malnutrition in cancer patients published from database inception to August 9, 2025. Data extracted from the included studies comprised study design, data sources, sample size, predictors, model development, and model performance. The methodological quality of each study was evaluated using the Prediction Model Risk of Bias Assessment Tool (PROBAST) checklist, and a meta-analysis of the area under the curve (AUC) was performed using Stata version 15.0.
Result: A total of 13 studies encompassing 57 predictive models were included. In the model development domain, seven studies constructed models using logistic regression alone, whereas five studies combined logistic regression with machine learning techniques. The reported incidence of malnutrition ranged from 11.9 to 69.9%. The most frequently used predictors were body mass index (BMI), age, and sex. The AUC values ranged from 0.735 to 0.982, with a pooled AUC of 0.85 (95% CI: 0.79–0.92) for eight validated models, indicating good discriminative performance. All 13 studies were rated as having a high risk of bias, mainly due to inappropriate data sources and insufficient reporting within the analysis domain.
Conclusion: Current models for predicting malnutrition in cancer patients remain in the exploratory phase. Although these models demonstrate good discriminatory performance, methodological shortcomings contribute to a high risk of bias. This systematic review underscores the need to integrate validated malnutrition prediction models into oncology and nursing practice. Such models can support clinicians and oncology nursing professionals in early screening and timely identification of high-risk patients, promote individualized nutritional interventions, and strengthen multidisciplinary collaboration among nurses, dietitians, and oncologists.
Systematic review registration: https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD420251128218, identifier: CRD420251128218.
1 Background
Cancer is a disease characterized by the uncontrolled proliferation of body cells, invasion of adjacent tissues, and distant metastasis. In 2020, an estimated 19.3 million new cancer cases and nearly 10 million cancer-related deaths occurred worldwide, underscoring the increasing global cancer burden (1).
Malnutrition is highly prevalent among cancer patients, with reported rates ranging from 30 to 90% (2–4). The mechanisms underlying cancer-related malnutrition are multifactorial, involving tumor-induced mechanical obstruction, systemic metabolic disturbances, and reduced intake due to psychosocial factors and treatment-related adverse effects, such as anorexia (5–7). Malnutrition significantly affects the clinical outcomes of cancer patients. It depletes energy reserves, suppresses immune function, and impairs tissue repair, thereby increasing susceptibility to complications like infections, poor wound healing, and anastomotic fistulas (8, 9). These complications exacerbate nutritional decline, creating a vicious cycle that prolongs hospital stays and diminishes patients' quality of life (10). Additionally, malnutrition reduces tolerance to adjuvant therapies, such as chemotherapy and radiotherapy, negatively impacting treatment completion rates, survival, and long-term prognosis (11, 12). Some patients may develop cancer cachexia, a refractory metabolic syndrome resistant to conventional nutritional interventions (13). Recent studies have demonstrated that early nutritional and psychological interventions can reduce mortality risk by 32% in patients with advanced esophageal and gastric cancer (14). Therefore, developing efficient risk prediction models for early identification of high-risk individuals and personalized nutritional interventions is clinically critical for improving cancer patients' nutritional status and overall prognosis.
Given the profound impact of malnutrition on cancer treatment and prognosis, early identification through effective prediction models is paramount. These models assess multiple risk factors, assign corresponding weights, and quantify an individual's future risk, providing a scientific basis for clinical decision-making (15, 16). However, despite the increasing use of big data technology, existing malnutrition prediction models for cancer patients face several challenges, including limitations in model development quality, inconsistent predictive performance, and variable clinical applicability. Moreover, the absence of systematic comparative analyses of these models creates uncertainty regarding their clinical utility. Thus, this review aims to systematically summarize the existing malnutrition prediction models, evaluate their strengths and weaknesses, and identify key methodological gaps. By addressing these gaps, this review seeks to provide a foundation for the future development of more reliable, accurate, and clinically applicable malnutrition prediction models for cancer patients.
2 Methods
he study protocol was registered with PROSPERO (registration number: CRD420251128218). This review adheres to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 guidelines, and the PRISMA checklist is provided in the Supplementary Appendix.
2.1 Search strategy
A systematic search was conducted across PubMed, Embase, Web of Science, the Cochrane Library, and Scopus to identify observational studies published from database inception to August 9, 2025. A comprehensive search strategy combining subject headings and free-text terms was employed. Search terms included: Tumor*, Neoplas*, Cancer*, Malnourishment*, Undernutrition, Nutritional deficiency*, risk prediction, prediction model, and risk prediction model. The detailed search strategies are provided in Supplementary material (Tables A.1–A.5). Additionally, gray literature sources, including ProQuest Dissertations & Theses Global, OpenGrey, and ClinicalTrials.gov, were searched to identify unpublished or ongoing studies. Reference lists of all included studies were also reviewed to identify additional relevant publications.
This systematic review adhered to the PICOTS framework as recommended by the Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modeling Studies (CHARMS) checklist, with the aim of critically assessing and extracting data from prediction model studies (17). The PICOTS system provides a structured approach to defining the review's objective, search strategy, and inclusion/exclusion criteria for studies (18). The key components of our systematic review are outlined below:
P (Population): Cancer patients.
I (Intervention model): Risk prediction models for malnutrition in cancer patients have been developed and published (predictors ≥ 2).
C (Comparator): No competing model.
O (Outcome): The outcome focused on malnutrition rather than its subgroups.
T (Timing): The prediction of outcome is based on the comprehensive evaluation of admission information, clinical score results and laboratory test indicators.
S (Setting): The purpose of the risk prediction model is to predict the malnutrition in cancer patients, so as to promote the implementation of preventive measures to avoid adverse events.
2.2 Inclusion and exclusion criteria
The inclusion criteria for studies were as follows: (1) the study subjects were cancer patients; (2) the study design included prospective or retrospective cohort studies, cross-sectional studies, and case-control studies; (3) the research focused on constructing, validating, or updating predictive models for assessing malnutrition risk in cancer patients; and (4) the outcome of interest was malnutrition, as assessed by tools such as the GLIM criteria, PG-SGA scale, NRS-2002, MST, DXA, or BMI. These tools, though varying in their approaches and sensitivity, all provided a clear and validated diagnosis of malnutrition.
The exclusion criteria were: (1) studies that reported only risk factors without constructing a predictive model; (2) studies that reported prediction models with fewer than two predictors; (3) studies not published in English; (4) studies for which the full text could not be retrieved despite efforts to contact the authors via email; and (5) duplicate publications.
2.3 Study selection
Two researchers independently performed the initial literature screening based on the inclusion and exclusion criteria. The process involved: (1) removing duplicate records; (2) excluding studies with clearly irrelevant titles and abstracts; (3) eliminating studies unrelated to the topic through full-text review; and (4) checking the reference lists of included studies to identify additional potentially relevant literature. Any discrepancies were resolved through discussion with a third researcher to achieve consensus.
2.4 Data extraction
Two researchers (JYY and WL) independently extracted data following the CHARMS checklist (17). The extracted information included: data sources, study population characteristics, outcomes, predictors, sample size, statistical analysis details, predictive performance of models, and model evaluation and presentation. The data extraction process was carried out independently by both researchers with cross-verification; any discrepancies were resolved through discussion with a third researcher until consensus was reached.
2.5 Quality assessment
Two independent researchers systematically assessed the risk of bias and applicability of the included studies using the PROBAST (19). The tool consists of two main components: risk of bias assessment and applicability evaluation. The risk of bias assessment includes four domains (participants, predictors, outcome, and analysis) with a total of 20 signaling questions. The applicability assessment covers three domains (participants, predictors, and outcome). Each domain is rated as “low risk,” “high risk,” or “unclear risk” of bias. The overall quality assessment was as follows: if the risk of bias was low in all domains, the overall assessment was classified as “low risk”; if the risk of bias was high in any domain, the overall assessment was classified as “high risk”; and if the risk of bias was unclear in any domain while being low in others, the overall assessment was classified as “unclear.”
2.6 Data synthesis and statistical analysis
A meta-analysis of the AUC values for the validation models was performed using Stata software. Heterogeneity was assessed using the I2 index and the Q test. I2 values of 25%, 50%, and 75% were considered indicative of low, moderate, and high levels of heterogeneity, respectively (20). The effect model was selected based on the results of the heterogeneity tests: a fixed-effect model was used when P > 0.05 and I2 < 50%, while a random-effect model was applied if P < 0.05 or I2 ≥ 50%. To explore the sources of heterogeneity, sensitivity analysis, subgroup analysis, and meta-regression were conducted. Additionally, Egger's test was employed to assess publication bias, with P > 0.05 suggesting a low likelihood of publication bias (21).
3 Results
3.1 Study selection
The study selection process is depicted in the PRISMA flow diagram (Figure 1). A total of 2,044 records were initially identified through database searches. After removing 486 duplicates, 1,558 records underwent title and abstract screening. Of these, 1,521 records were excluded as they were clearly irrelevant according to the predefined inclusion and exclusion criteria. The remaining 37 full-text articles were assessed for eligibility. Following full-text review, 24 articles were excluded for the following reasons: no relevant outcome measures (n = 7), ineligible study content (n = 11), ineligible study design (n = 4), and failure to retrieve the full text despite contacting the authors (n = 2). Consequently, thirteen studies (12, 22–33) were included in the qualitative synthesis. Of these, eight studies (22–24, 26–29, 33) with validated models were included in the quantitative synthesis (meta-analysis).
3.2 Characteristics of the included studies
A total of thirteen studies were ultimately included, of which twelve (12, 22–31, 33) were conducted in China and published between 2020 and 2025. Regarding study design, eleven (12, 23–32) were retrospective cohort studies, one (22) was a retrospective case-control study, and one (33) was a prospective cohort study. Three studies (24, 27, 28) were multicenter investigations, while ten (12, 22, 23, 25, 26, 29–33) were single-center studies. In terms of assessment tools, four studies (24, 26, 27, 33) used the GLIM criteria, four studies (22, 23, 30, 31) employed the PG-SGA scale, two studies (28, 29) utilized the NRS-2002, one study (12) adopted the MST, one study (25) used the DXA method, and one study (32) applied BMI. Sample sizes ranged from 120 to 4,487 participants, with outcome events occurring in 47–2,076 cases. The prevalence of malnutrition varied from 11.9% to 69.9%. Detailed study characteristics are presented in Table 1.
3.3 Construction and validation of the model
The thirteen studies included in this review reported a total of 57 predictive models for malnutrition risk in cancer patients. Regarding predictor variables, the number of candidate variables ranged from 10 to 1,888 across the studies. Among these, seven studies (12, 22, 24–26, 32, 33) employed stepwise regression for variable selection, while the remaining studies used Lasso or machine learning methods. In terms of variable handling, five studies (12, 23, 25–27) converted continuous variables into binary categories [three of which (25–27) were based on optimal cutoff values], and eight studies (22, 24, 28–33) retained continuous variables in their original form. Eleven studies (12, 24–33) had an event per variable (EPV) ≥ 20, while two studies (22, 23) had an EPV < 20. Regarding missing data handling, only two studies (23, 30) explicitly described their methods (mean imputation and random forest imputation), one study (32) employed direct deletion, while the remaining studies did not specify their approach. Modeling approaches included: seven studies (12, 22, 24–26, 31, 32) used logistic regression alone, one study (23) combined logistic regression with Lasso, and five studies (27–30, 33) integrated logistic regression with machine learning. In terms of model validation: three studies (12, 25, 31) did not perform validation, nine studies (22–24, 26, 28–30, 32, 33) conducted internal validation only, and one study (27) performed external validation. Models were presented in various formats: eight studies (12, 22–24, 26, 27, 29, 32) used nomograms, two studies (25, 31) provided mathematical formulas, and three studies (28, 30, 33) developed online risk calculators. Detailed results are presented in Table 2.
3.4 Prediction model performance
Given that some studies developed multiple models based on the same sample using different predictors, only data from their final combined or simplified models were included for performance evaluation. The results of the model performance analysis are as follows: regarding model discrimination, eleven studies (12, 22–25, 27, 28, 30–33) reported AUC values (0.620–1.000) during the modeling phase. Nine studies (22–24, 26, 28–30, 32, 33) provided internal validation AUC values (0.771–0.982), and one study (27) reported external validation AUC values (0.735–0.805). In terms of calibration assessment, six studies (23, 29–33) presented results solely through calibration curves, one study (25) reported only Hosmer–Lemeshow test results, and four studies (12, 24, 26, 27) provided both Hosmer–Lemeshow test results and calibration curves. One study (22) reported calibration curves, Hosmer–Lemeshow test results, and Brier score data, while one study (28) did not perform a calibration analysis. Detailed results are presented in Table 2.
3.5 Meta-analysis of validation models included in the review
Due to insufficient reporting of model development details, only eight studies (22–24, 26–29, 33) met the criteria for meta-analysis. Among these, the studies by Lin et al. (27) and Kuang et al. (33) used different modeling methods based on the same sample, so only the logistic regression (LR) models were included. For studies with multiple models using different predictors (23, 25, 28, 29), only data from simplified or combined models were pooled. The meta-analysis revealed substantial heterogeneity (I2 = 97.4%, p < 0.001). To explore the sources of this heterogeneity, we conducted a sensitivity analysis, which showed that the pooled AUC remained stable, with no single study unduly influencing the result (Supplementary Figure S1). Subgroup analyses based on the diagnostic tool for malnutrition and meta-regression with continuous variables (sample size, prevalence of malnutrition, and publication year) were also performed. However, none of these analyses identified a statistically significant source that could adequately explain the high heterogeneity. Therefore, a random-effects model was adopted, and the pooled AUC was 0.85 (95% CI: 0.79–0.92). An AUC value between 0.8 and 0.9 indicates good predictive performance and stability. Egger's test suggested no significant publication bias (P = 0.078), and the funnel plots are shown in Supplementary Figures S2. The meta-analysis results are presented in Figure 2.
3.6 Risk of bias and applicability evaluation
Table 3 presents the risk of bias and applicability assessment results for the included studies. In the participants domain, twelve studies (12, 22–32) were classified as high risk of bias, as all were retrospective studies. None of these data sources adhered to the low-bias data sources recommended by the PROBAST guidelines, such as prospective cohort studies, randomized controlled trials, or registry databases. In the predictors domain, one study (22) was rated as high risk of bias because it was a case-control study that assessed predictors after the outcomes were known. In the outcome domain, two studies (25, 32) which used BMI and DXA as diagnostic criteria, were rated as high risk of bias. In the analysis domain, all studies were assessed as high risk. Specific reasons included: two studies (22, 23) with EPV ≤ 20; ten studies (12, 22, 24–29, 31, 33) that did not describe their methods for handling missing data; and seven studies (12, 22, 24–26, 32, 33) that selected predictive variables based solely on univariate analysis. Regarding model validation, three studies (12, 25, 31) developed models without validation, while nine studies (22–24, 26, 28–30, 32, 33) performed only internal validation. In terms of applicability, only two studies (25, 32), which used BMI and DXA as diagnostic criteria, demonstrated low applicability, while the remaining studies showed good applicability.
4 Discussion
4.1 The overall performance of the risk prediction model for malnutrition in cancer patients is good, but the risk of bias is relatively high
The thirteen studies included in this review reported a total of 57 predictive models, all demonstrating strong performance, with AUC values ranging from 0.735 to 0.982. The pooled AUC for eight validated models was 0.85 (95% CI: 0.79–0.92), reflecting excellent discriminatory power. Additionally, the calibration curves, Hosmer–Lemeshow (H–L) test, and Brier score results collectively confirmed satisfactory model calibration. A key finding of our meta-analysis is the substantial, unresolved heterogeneity (I2 = 97.4%). Despite extensive investigations, we were unable to identify definitive sources of this heterogeneity. This significant limitation warrants caution when interpreting the pooled AUC. The heterogeneity likely arises from a combination of factors, including clinical diversity (e.g., varying cancer types and malnutrition mechanisms), differences in diagnostic criteria (e.g., PG-SGA, GLIM), and methodological variations in model development and validation. As such, our pooled estimate should be considered a summary of a heterogeneous evidence base, highlighting the early stage of this field and underscoring the urgent need for standardized outcome definitions in future research.
According to the PROBAST tool, all included studies exhibited a high risk of bias, particularly in the domains of study design and analysis, which warrants caution in applying these models in clinical practice. Regarding the study subjects, all twelve studies included in this review employed retrospective designs. These studies relied on pre-existing data, which may contain missing information or recording errors, leading to recall bias, information bias, and selection bias. Additionally, since predictor variables were collected after the outcome, these studies have limited ability to establish causal relationships. In contrast, prospective studies, which collect predictors before the outcome, offer improved data reliability and greater accuracy in predictive models. According to the PROBAST tool, suitable study designs include prospective cohort studies, randomized controlled trials, nested case-control studies, or case-cohort studies (19). In terms of analysis, nine studies did not describe how they handled missing values, which may introduce selection bias and affect the accuracy and reliability of the models (34). Common methods for handling missing data include complete-case deletion, multiple imputation, and machine learning techniques. While complete-case deletion can reduce sample size and limit dataset diversity, multiple imputation and machine learning methods use more data and improve statistical power (34, 35). Proper reporting and handling of missing data are essential to prevent model overfitting (36). Future studies should improve missing data management to enhance research completeness and credibility. Six studies relied solely on univariate analysis for variable selection, identifying predictors that were significant individually. This approach overlooked interactions among variables, potentially excluding important predictors and leading to overfitting, which reduces predictive performance (19). Current research suggests that methods like LASSO, Ridge, Elastic Net regression, and machine learning can improve variable selection accuracy (37). Future studies should adopt these methods and consider clinical knowledge, predictor reliability, and applicability in the selection process, rather than relying solely on univariate analysis results (38). Internal validation is used to assess model reproducibility and prevent overfitting, while external validation evaluates a model's generalizability and clinical applicability, which is considered the “gold standard” (39, 40). In this study, four studies lacked internal validation, and only one performed external validation. Most internal validations used simple data splitting, which can be ineffective with small sample sizes and may lead to biased results. It is recommended to use more robust methods, such as cross-validation or bootstrapping, to improve model stability (41). Furthermore, most studies lacked external validation, and the research populations were mainly from single-center studies in China, limiting model generalizability. Future research should strengthen external validation by including diverse populations from different regions, ethnicities, cultures, and lifestyles. Multicenter studies are also encouraged to reduce biases associated with single-center data.
4.2 BMI age and sex are common predictors
The predictive models in this study included 3–8 predictors, with BMI (n = 6), age (n = 5), and gender (n = 4) being the most common. These factors are crucial for both clinical practice and future research. BMI was the most frequently identified predictor, consistent with findings from previous studies (42–44), and is a key parameter in nutritional screening tools (45). A low BMI indicates insufficient nutritional reserves, which, when combined with surgery and cancer treatments, increases the risk of malnutrition (27). It also impairs the body's ability to absorb nutrients, exacerbating malnutrition (46). Age is another consistent risk factor for malnutrition in cancer patients, as aging affects nutrient metabolism and absorption, while chronic diseases and medications further compromise nutrition (12, 23, 25, 27, 33). Limited mobility and psychological factors can also reduce food intake (47–49). Therefore, healthcare providers should focus on elderly patients with low BMI to prevent malnutrition. Gender showed inconsistent findings: two studies (27, 32) found a higher risk of malnutrition in women, while two others (24, 33) indicated that men were more susceptible. This variability may be due to differences in cancer types, treatments, and socio-cultural factors. Future research should explore how sex interacts with other risk factors in different cancer populations and contexts. Other predictors, such as the ECOG score (12, 23), albumin level (12, 29), and radiological indicators (23, 29), were also identified and could enhance predictive models. Future studies should validate these predictors in diverse populations and clinical settings to develop more reliable models for malnutrition.
4.3 Implications for future model research
With the rapid advancement of big data and artificial intelligence, computational algorithms are increasingly integrated into clinical medicine. Studies by Kuang et al. (33), Lin et al. (27), Wu et al. (28), and Duan et al. (30) developed prediction models using both traditional logistic regression and machine learning algorithms. The latter two studies demonstrated that machine learning models often outperformed logistic regression, although the differences in AUC were small. Logistic regression is simple, interpretable, and performs well with small datasets but struggles with nonlinear relationships, missing values, and multicollinearity (50). Machine learning, on the other hand, handles large-scale, high-dimensional data more effectively and addresses these challenges, offering greater robustness (51). However, machine learning models are more complex, harder to interpret, and vulnerable to issues during modeling (52). Therefore, careful algorithm selection and tuning are critical to ensuring model reliability. In conclusion, no single method is inherently superior; performance may vary across different datasets and domains. Future research should focus on optimizing machine learning algorithms and comparing various approaches to identify the best-performing model.
5 Limitations
This systematic review has several limitations: (1) only studies published in Chinese and English were included, which may introduce a risk of language bias. (2) The majority of the studies were single-center investigations conducted in China, which lack external validation and limit the generalizability of the results to other regions. (3) Despite thorough examination of the high heterogeneity in AUC values through sensitivity analysis, subgroup analysis, and meta-regression, the underlying causes of this variation remain unclear. This limitation compromises the reliability and generalizability of the pooled estimates, reflecting the substantial methodological and clinical diversity in the current literature. (4) Due to significant heterogeneity in data types, definitions, and cutoff values among the included predictive factors, and the fact that most factors were only reported in a single study, a meta-analysis could not be conducted. As a result, this review focused on providing a systematic synthesis of the relevant predictive factors. (5) While most models demonstrated strong predictive performance, the high risks associated with the data sources and analytical methods used in model development necessitate careful evaluation and selection of models by clinicians or oncology nursing professionals. Further validation of model stability is recommended.
6 Conclusion
This review systematically summarizes existing malnutrition prediction models. While current models demonstrate promising predictive performance, their methodological limitations, including issues with study design, data handling, and model validation, introduce significant risks of bias. Addressing these gaps is crucial for improving both the accuracy and clinical applicability of these models. Future research should adhere to PROBAST guidelines, focusing on large-scale, multicenter prospective studies and incorporating machine learning techniques to optimize model development. This approach aims to construct well-performing and stable models. By improving the identification of at-risk cancer patients, this research offers valuable insights for oncology nursing practice, enabling clinicians and oncology nursing professionals to implement more targeted and timely nutritional interventions, ultimately enhancing patient outcomes.
Data availability statement
The original contributions presented in the study are included in this article/Supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
JY: Conceptualization, Data curation, Formal analysis, Methodology, Visualization, Writing – original draft. XC: Methodology, Resources, Supervision, Visualization, Writing – review & editing. DG: Data curation, Formal analysis, Software, Writing – original draft. WL: Data curation, Validation, Visualization, Writing – original draft.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Acknowledgments
We would like to express our heartfelt gratitude to all the researchers who conducted original studies. This study would not have been possible without their contributions.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnut.2025.1696142/full#supplementary-material
References
1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2021) 71:209–49. doi: 10.3322/caac.21660
2. Sguanci M, Palomares SM, Cangelosi G, Petrelli F, Sandri E, Ferrara G, et al. Artificial Intelligence in the management of malnutrition in cancer patients: a systematic review. Adv Nutr. (2025) 16:100438. doi: 10.1016/j.advnut.2025.100438
3. Wang Y, Liu Z, Zhang H, Wang Y, Chen X, Lu W, et al. Evaluation of different screening tools as the first step of the GLIM framework: a cross-sectional study of Chinese cancer patients in an outpatient setting. Nutr Clin Pract. (2024) 39:702–13. doi: 10.1002/ncp.11103
4. Wiegert EVM, Padilha PC, Peres WAF. Performance of patient-generated subjective global assessment (PG-SGA) in patients with advanced cancer in palliative care. Nutr Clin Pract. (2017) 32:675–81. doi: 10.1177/0884533617725071
5. Muscaritoli M, Corsaro E, Molfino A. Awareness of cancer-related malnutrition and its management: analysis of the results from a survey conducted among medical oncologists. Front Oncol. (2021) 11:682999. doi: 10.3389/fonc.2021.682999
6. Kuhn MA, Gillespie MB, Ishman SL, Ishii LE, Brody R, Cohen E, et al. Expert consensus statement: management of dysphagia in head and neck cancer patients. Otolaryngol Head Neck Surg. (2023) 168:571–92. doi: 10.1002/ohn.302
7. Yin L, Liu J, Lin X, Li N, Guo J, Fan Y, et al. Nutritional features-based clustering analysis as a feasible approach for early identification of malnutrition in patients with cancer. Eur J Clin Nutr. (2021) 75:1291–301. doi: 10.1038/s41430-020-00844-8
8. Gupta A, Gupta E, Hilsden R, Hawel JD, Elnahas AI, Schlachta CM, et al. Preoperative malnutrition in patients with colorectal cancer. Can J Surg. (2021) 64:E621–E9. doi: 10.1503/cjs.016820
9. Xu LB, Mei TT, Cai YQ, Chen WJ, Zheng SX, Wang L, et al. Correlation between components of malnutrition diagnosed by global leadership initiative on malnutrition criteria and the clinical outcomes in gastric cancer patients: a propensity score matching analysis. Front Oncol. (2022) 12:851091. doi: 10.3389/fonc.2022.851091
10. Salas S, Cottet V, Dossus L, Fassier P, Ginhac J, Latino-Martel P, et al. Nutritional factors during and after cancer: impacts on survival and quality of life. Nutrients. (2022) 14:2958. doi: 10.3390/nu14142958
11. Go SI, Jeon H, Park SW, Kang MH, Kim HG, Lee GW. Low pre-treatment nutritional index is significantly related to poor outcomes in small cell lung cancer. Thorac Cancer. (2018) 9:1483–91. doi: 10.1111/1759-7714.12862
12. Tang J, Wong G, Naffouje S, Felder S, Sanchez J, Dineen S, et al. A novel nomogram for early identification and intervention in colorectal cancer patients at risk for malnutrition. Am Surg. (2021) 89:1485–96. doi: 10.1177/00031348211058620
13. Arends J. Malnutrition in cancer patients: causes, consequences and treatment options. Eur J Surg Oncol. (2024) 50:107074. doi: 10.1016/j.ejso.2023.107074
14. Lu Z, Fang Y, Liu C, Zhang X, Xin X, He Y, et al. Early interdisciplinary supportive care in patients with previously untreated metastatic esophagogastric cancer: a phase III randomized controlled trial. J Clin Oncol. (2021) 39:748–56. doi: 10.1200/JCO.20.01254
15. Lee YH, Bang H, Kim DJ. How to establish clinical prediction models. Endocrinol Metab (Seoul). (2016) 31:38–44. doi: 10.3803/EnM.2016.31.1.38
16. Smith LA, Oakden-Rayner L, Bird A, Zeng M, To MS, Mukherjee S, et al. Machine learning and deep learning predictive models for long-term prognosis in patients with chronic obstructive pulmonary disease: a systematic review and meta-analysis. Lancet Digit Health. (2023) 5:e872–e81. doi: 10.1016/S2589-7500(23)00177-2
17. Moons KG, de Groot JA, Bouwmeester W, Vergouwe Y, Mallett S, Altman DG, et al. Critical appraisal and data extraction for systematic reviews of prediction modelling studies: the CHARMS checklist. PLoS Med. (2014) 11:e1001744. doi: 10.1371/journal.pmed.1001744
18. Debray TP, Damen JA, Snell KI, Ensor J, Hooft L, Reitsma JB, et al. A guide to systematic review and meta-analysis of prediction model performance. BMJ. (2017) 356:i6460. doi: 10.1136/bmj.i6460
19. Moons KGM, Wolff RF, Riley RD, Whiting PF, Westwood M, Collins GS, et al. PROBAST: a tool to assess risk of bias and applicability of prediction model studies: explanation and elaboration. Ann Intern Med. (2019) 170:W1–w33. doi: 10.7326/M18-1377
20. Higgins JP, Thompson SG, Deeks JJ, Altman DG. Measuring inconsistency in meta-analyses. BMJ. (2003) 327:557–60. doi: 10.1136/bmj.327.7414.557
21. Egger M, Davey Smith G, Schneider M, Minder C. Bias in meta-analysis detected by a simple, graphical test. BMJ. (1997) 315:629–34. doi: 10.1136/bmj.315.7109.629
22. Dai T, Wu DQ, Tang JJ, Liu ZY, Zhang M. Construction and validation of a predictive model for the risk of three-month-postoperative malnutrition in patients with gastric cancer: a retrospective case-control study. J Gastrointest Oncol. (2023) 14:128–45. doi: 10.21037/jgo-22-1307
23. Yu WK, Xu HE, Chen FJ, Shou HF, Chen Y, Jia YS, et al. Development and validation of a radiomics-based nomogram for the prediction of postoperative malnutrition in stage IB1-IIA2 cervical carcinoma. Front Nutr. (2023) 10:1113588. doi: 10.3389/fnut.2023.1113588
24. Yin LY, Liu J, Lin X, Li N, Shi ML, Zhang HM, et al. Development and validation of a rapid-decision pathway to diagnose malnutrition in patients with lung cancer. Nutrition. (2021) 84: 111102. doi: 10.1016/j.nut.2020.111102
25. Yu J, Soh KL, He L, Wang P, Cao Y. Development of a short-term nutritional risk prediction model for hepatocellular carcinoma patients: a retrospective cohort study. Sci Rep. (2024) 14:3921. doi: 10.1038/s41598-024-54456-4
26. Wang XX, Tang XX, Xu JS, Zhang R, Chu J, Chen CM, et al. Investigating the clinical predictive utility of inflammatory markers and nomogram development in colorectal cancer patients with malnutrition. Front Nutr. (2024) 11: 1442094. doi: 10.3389/fnut.2024.1442094
27. Lin ZM, He H, Yan MF, Chen XM, Chen HS, Ke JF. Machine learning and the nomogram as the accurate tools for predicting postoperative malnutrition risk in esophageal cancer patients. Front Nutr. (2025) 12: 1606470. doi: 10.3389/fnut.2025.1606470
28. Wu TT, Xu HX Li W, Zhou FX, Guo ZQ, Wang KH, et al. The potential of machine learning models to identify malnutrition diagnosed by GLIM combined with NRS-20 02 in colorectal cancer patients without weight loss information. Clin Nutr. (2024) 43:1151–61. doi: 10.1016/j.clnu.2024.04.001
29. Huang WJ, Wang CJ, Wang Y, Yu Z, Wang SY, Yang J, et al. Predicting malnutrition in gastric cancer patients using computed tomography(CT) deep learning features and clinical data. Clin Nutr. (2024) 43:881–91. doi: 10.1016/j.clnu.2024.02.005
30. Duan R, Li QY, Yuan QX, Hu JX, Feng T, Ren T. Predictive model for assessing malnutrition in elderly hospitalized cancer patients: a machine learning approach. Geriatr Nurs. (2024) 58:388–98. doi: 10.1016/j.gerinurse.2024.06.012
31. Zhang XL, Zhao WQ, Du YY, Zhang JL, Zhang Y, Li WL, et al. A simple assessment model based on phase angle for malnutrition and prognosis in hospitalized cancer patients. Clin Nutr. (2022) 41:1320–7. doi: 10.1016/j.clnu.2022.04.018
32. Park J-H, Kim E, Seol E-M, Kong S-H, Park DJ, Yang H-K, et al. Prediction model for screening patients at risk of malnutrition after gastric cancer surgery. Ann Surg Oncol. (2021) 28:4471–81. doi: 10.1245/s10434-020-09559-3
33. Kuang L, Yu J, Zhou Y, Zhang Y, Wang G, Zhang F, et al. Predicting postoperative malnutrition in patients with oral cancer: development of an XGBoost model with SHAP analysis and web-based application. Front Oncol. (2025) 15:1564459. doi: 10.3389/fonc.2025.1564459
34. Nijman S, Leeuwenberg AM, Beekers I, Verkouter I, Jacobs J, Bots ML, et al. Missing data is poorly handled and reported in prediction model studies using machine learning: a literature review. J Clin Epidemiol. (2022) 142:218–29. doi: 10.1016/j.jclinepi.2021.11.023
35. Austin PC, White IR, Lee DS, van Buuren S. Missing data in clinical research: a tutorial on multiple imputation. Can J Cardiol. (2021) 37:1322–31. doi: 10.1016/j.cjca.2020.11.010
36. Zhang Q, Yuan KH, Wang L. Asymptotic bias of normal-distribution-based maximum likelihood estimates of moderation effects with data missing at random. Br J Math Stat Psychol. (2019) 72:334–54. doi: 10.1111/bmsp.12151
37. Ranalli MG, Salvati N, Petrella L, Pantalone F. M-quantile regression shrinkage and selection via the Lasso and Elastic Net to assess the effect of meteorology and traffic on air quality. Biom J. (2023) 65:e2100355. doi: 10.1002/bimj.202100355
38. Courvoisier DS, Combescure C, Agoritsas T, Gayet-Ageron A, Perneger TV. Performance of logistic regression modeling: beyond the number of events per variable, the role of data structure. J Clin Epidemiol. (2011) 64:993–1000. doi: 10.1016/j.jclinepi.2010.11.012
39. Chen L. Overview of clinical prediction models. Ann Transl Med. (2020) 8:71. doi: 10.21037/atm.2019.11.121
40. Collins GS, Dhiman P, Ma J, Schlussel MM, Archer L, Van Calster B, et al. Evaluation of clinical prediction models (part 1): from development to external validation. BMJ. (2024) 384:e074819. doi: 10.1136/bmj-2023-074819
41. Moons KG, Kengne AP, Woodward M, Royston P, Vergouwe Y, Altman DG, et al. Risk prediction models: I. Development, internal validation, and assessing the incremental value of a new (bio)marker. Heart. (2012) 98:683–90. doi: 10.1136/heartjnl-2011-301246
42. Akagündüz DD, Sahin H, Akagündüz B. Malnutrition and related factors in older patients with gastrointestinal cancer receiving chemotherapy. Cureus J Med Sci. (2024) 16:e58252. doi: 10.7759/cureus.58252
43. Herrera Naranjo M. Screening Practices and Prevalence of Malnutrition Risk at the Huntsman Cancer Institute [Master's thesis]. Salt Lake City, UT: University of Utah (2024).
44. Al-Shahethi AH, Mahdi FA, Al-Shameri EA, Gaith FMA. Factors associated with malnutrition in hospitalized cancer patients in a national oncology center in conflict-affected settings in Sana'a, Yemen: an institution-based cross-sectional study. Cureus J Med Sci. (2023) 15:e45411. doi: 10.7759/cureus.45411
45. Shim H, Cheong JH, Lee KY, Lee H, Lee JG, Noh SH. Perioperative nutritional status changes in gastrointestinal cancer patients. Yonsei Med J. (2013) 54:1370–6. doi: 10.3349/ymj.2013.54.6.1370
46. Bernardes S, Teixeira PJZ, Silva FM. Association of reduced BMI, length of hospital stay, mortality, and malnutrition diagnosis in patients with acute exacerbation COPD: a secondary analysis of a cohort study. JPEN J Parenter Enteral Nutr. (2023) 47:101–8. doi: 10.1002/jpen.2390
47. Byun M, Kim E, Kim J. Physical and mental health factors associated with poor nutrition in elderly cancer survivors: insights from a nationwide survey. Int J Environ Res Public Health. (2021) 18:9313. doi: 10.3390/ijerph18179313
48. D'Almeida CA, Peres WAF, de Pinho NB, Martucci RB, Rodrigues VD, Ramalho A. Prevalence of malnutrition in older hospitalized cancer patients: a multicenter and multiregional study. J Nutr Health Aging. (2020) 24:166–71. doi: 10.1007/s12603-020-1309-4
49. Sucuoglu Isleyen Z, Besiroglu M, Yasin AI, Simsek M, Topcu A, Smith L, et al. The risk of malnutrition and its clinical implications in older patients with cancer. Aging Clin Exp Res. (2023) 35:2675–83. doi: 10.1007/s40520-023-02538-0
50. Mao A, Su J, Ren M, Chen S, Zhang H. Risk prediction models for falls in hospitalized older patients: a systematic review and meta-analysis. BMC Geriatr. (2025) 25. doi: 10.1186/s12877-025-05688-0
51. Choi RY, Coyner AS, Kalpathy-Cramer J, Chiang MF, Campbell JP. Introduction to machine learning, neural networks, and deep learning. Transl Vis Sci Technol. (2020) 9:14. doi: 10.1167/tvst.9.2.14
Keywords: malnutrition, cancer, risk prediction, meta-analysis, systematic review
Citation: Yu J, Chu X, Guo D and Luo W (2025) Risk prediction models for malnutrition in cancer patients: a systematic review and meta-analysis. Front. Nutr. 12:1696142. doi: 10.3389/fnut.2025.1696142
Received: 31 August 2025; Revised: 18 November 2025;
Accepted: 26 November 2025; Published: 12 December 2025.
Edited by:
William Kwame Amakye, South China University of Technology, ChinaReviewed by:
Xiaoqin Bi, Sichuan University, ChinaAdriana Sofia Taveira, University of Trás-os-Montes and Alto Douro, Portugal
Copyright © 2025 Yu, Chu, Guo and Luo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xin Chu, MTg5ODE4ODM4MDNAMTYzLmNvbQ==
Xin Chu2*