Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Immunol., 02 December 2025

Sec. Cancer Immunity and Immunotherapy

Volume 16 - 2025 | https://doi.org/10.3389/fimmu.2025.1639186

This article is part of the Research TopicDiagnosis and Management of Pancreatic CancerView all 11 articles

Predictive value of current nodal staging systems and development of machine learning nomogram for resectable pancreatic head cancer: a population-based study and multicenter validation

Wen-Bo Zou,&#x;Wen-Bo Zou1,2†Xiu-Ping Zhang&#x;Xiu-Ping Zhang1†Yu-Yao Song&#x;Yu-Yao Song1†Shuai XuShuai Xu3Bo-Yuan LiuBo-Yuan Liu1Zhi-Ming ZhaoZhi-Ming Zhao1Yuan-Xing GaoYuan-Xing Gao1Ming-Gen HuMing-Gen Hu1Gao Huang*Gao Huang2*Rong Liu*Rong Liu1*
  • 1Faculty of Hepato-Biliary-Pancreatic Surgery, The First Medical Center of Chinese People’s Liberation Army (PLA) General Hospital, Beijing, China
  • 2Department of General Surgery, Department of Pathology, Clinical Research Center, No.924 Hospital of PLA Joint Logistic Support Force, Guilin, China
  • 3Department of Liver Transplantation and Hepatobiliary Surgery, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, China

Background: Given the growing interest in the influence of lymph node metastasis on the prognosis of patients diagnosed with pancreatic head cancer (PHC). This study aims to evaluate the ability of current four nodal staging systems predicting long-term outcomes and develop a machine learning model for predicting the prognosis of patients with resectable PHC.

Materials and methods: Participants with PHC were sourced from the Surveillance, Epidemiology, and End Results (SEER) database and allocated at random in a 7:3 ratio to training and internal validation cohort. External validation in a large-sample, multicenter cohort collected from three Chinese institutions was performed to verified the robustness of the optimal nodal staging system and predictive model. The concordance index (C-index), Akaike information criterion (AIC) and area under the curve (AUC) were calculated to evaluate the predictive capability and discrimination of different nodal staging systems. The machine learning procedures based procedure and Cox regression analysis were implemented for identification of the prognostic factors and construction of predictive model. The calibration curves, net reclassification improvement (NRI) and integrated discrimination improvement (IDI) and decision curve analysis (DCA) were using to assess predictive accuracy and clinical benefits of the predictive model.

Results: All four nodal staging systems were independent prognostic factors for overall survival (OS). The log odds of lymph node ratio (LODDS) were verified as the optimal nodal staging system with highest C-index and AUCs, and lowest AICs compared to others, and has better predictive capability than others both in patients with < 12 and ≥ 12 retrieval lymph nodes (RLNs). Then, a predictive model including T stage, tumor differentiation, chemotherapy, and LODDS was developed and validated. This model had a higher C-index and AUCs than the AJCC staging system. The NRI, IDI, and DCA analysis also indicated that present model had good predictive capability and clinical utility.

Conclusion: The nodal staging system LODDS is the optimal prognostic factor for OS in resectable PHC. It could effectively predict OS for resectable PHC patients without considering the numbers of RLN. The machine learning model could effectively predict OS for patients with resectable PHC.

1 Introduction

Pancreatic cancer is emerging as a formidable adversary to human health (1, 2). Its insidious nature, coupled with the formidable challenges on early detection and effective treatment, has led to a substantial and growing burden on societies worldwide (3, 4). Although radical resection and adjuvant therapy provide the treatment options, the five-year survival rate for pancreatic cancer is relatively low compared to other gastrointestinal cancers, particularly when diagnosed at a late stage (5). The location of the primary tumor is a critical factor in determining treatment strategies and influencing outcomes. Studies have reported that pancreatic head cancer (PHC) account for approximate 75% of pancreatic cancer, and has a notably poor overall survival (OS) compared to pancreatic body/tail cancer (6). In the selection of treatment strategies, radical pancreaticoduodenectomy remains the only potential curative treatment for patients with PHC (7, 8), but few patients are suitable for surgical treatment because of distant metastases or local invasion (9).

The accurate categorization of disease severity is pivotal for determination of appropriate and effective treatment (10). Recent studies have proposed various nodal staging systems for overall survival (OS) prediction (11), among which the most widely accepted for risk stratification is the N staging system. However, the N staging system neglects the influence of lymph node dissection. Currently, novel nodal staging systems was developed and gradually applied based on ratio of metastatic to retrieval lymph nodes (LNR), the log odds of lymph node ratio (LODDS) and the log odds of negative lymph nodes/T stage (LONT) (12, 13). The comparisons of predictive performance of these nodal staging systems for PHC has produced a spectrum of inconsistent findings, and previous studies often grapple with several limitations of methodologies and sample size that can affect the interpretation of their results. It lacks large sample and multicenter studies to explore the prognostic value of different nodal staging systems.

Machine learning procedures that enable computer systems to learn statistical patterns from data during a training phase. This model can then be applied to data to autonomously execute tasks such as clustering, optimization, and prediction, without relying on task-specific instructions (14). Recently, machine learning procedures have gained widespread application and are extensively used in the construction of prognostic models, especially in liver cancer (15), gastrointestinal cancer (16), and cervical cancer (17). In pancreatic cancer, machine learning procedures are also extensively used to identify pivotal biomarkers and develop prognostic model. Li et al. constructed a machine learning histamine-related signatures to reveal the prognosis of pancreatic cancer (18). Wu et al. developed an interpretable machine learning model, integrating radiomic features and clinicopathological factors, to predict the early recurrence in patients with post-surgery (19).

Herein, we obtained the resectable PHC cohort from the Surveillance, Epidemiology, and End Results (SEER) database to compare prognostic performance of the four nodal staging systems. Another large-sample multicenter resectable PHC cohort was retrospectively collected from three Chinese institutions to validate accuracy and generalizability. In addition, based on optimal nodal staging systems, a novel machine learning model for PHC was developed, interpreted and validated.

2 Materials and methods

2.1 Patients characteristics, demographics and study design

The resectable PHC patient characteristics were extracted from the SEER database of the National Cancer Institute (http://seer.cancer.gov/). The inclusion and exclusion criteria have been introduced in detail in our previous study (20). The AJCC staging was converted from the 7th to the 8th edition based on the tumor size, number of metastatic lymph nodes (MLNs) and distant metastasis for subsequent analysis. A large-sample, multicenter cohort including resectable PHC patients was collected from three Chinese institutions for external validation. The patients were follow-up by telephone or outpatient clinic interview. The variable selection is in accordance with the SEER cohort. This study was approved by the Ethics Committee of the PLA General Hospital. Written informed consent was obtained from all patients. The Flow diagram of the study population and design was shown in Figure 1.

Figure 1
Flowchart depicting six steps in a research process. Step I: Patient selection from SEER database (6419 patients) and China database (670 patients), divided into training, internal, and external validation cohorts. Step II: Determination of threshold of nodal staging systems using X-tile software for variables LNR, LODDS, and LONT. Step III: Identification of independent prognostic factors with Cox regression analysis. Step IV: Comparative prognostic performances tested using four nodal staging systems. Step V: Development of predictive model using machine learning and LODDS system. Step VI: Validation of the predictive model with calibration curves and DCA analysis.

Figure 1. Flow diagram of the study population and design. LNR, The ratio of metastatic to retrieved nodes; LODDS, The log odds of lymph node ratio; LONT, The log odds of negative lymph nodes/T stage; OS, Overall survival; AIC, Akaike information criterion; ROC, Receiver operating characteristic; NRI, Net reclassification improvement; IDI, Integrated discrimination improvement; DCA, Decision curve analysis.

Next, in this study, the LNR was defined as the ratio of the number of MLNs to the number of total retrieval lymph nodes (RLNs). The LODDS were determined as log [(MLNs + 0.5)/(negative lymph nodes + 0.5)]. The LONT was calculated as log odds of negative lymph nodes/T stage. According to the previous studies, the threshold of LNR was identified as 0.20, while the other optimal threshold of nodal staging systems was calculated using the X-tile software (version 3.6.1).

Then, the SEER cohort were randomly divided into the training and internal validation cohort at a ratio of 7:3 using “caret” package in R software (version 4.3.3). The training cohort was used to evaluate performance of different nodal staging systems predicting OS, and the performance was verified and generalized in the internal and external validation cohort.

Finally, based on optimal nodal staging systems, a novel machine learning model to predict OS for PHC was developed, interpreted and validated. Furthermore, a website calculator was established to facilitate the application of our predictive model more conveniently.

2.2 Statistical analyses

R (version 4.3.3) and SPSS (version 26.0) were used to complete all statistical analyses in the present study. X-tile software (version 3.6.1) is a common and practical approach for exploring optimal thresholds and thus is used to obtain the optimal cut-off values of partial nodal staging systems. Continuous variables are reported as median (interquartile range) and were compared by Student’s t test, and categorical variables are reported as counts and proportions and were analyzed by chi-squared test for comparisons among groups. Univariate Cox regression analysis and 10 machine learning-based integrative procedures, including random survival forest (RSF), CoxBoost, stepwise Cox, least absolute shrinkage and selection operator (Lasso), Ridge, elastic net (Enet), survival support vector machines (survival-SVMs), generalized boosted regression models (GBMs), supervised principal components (SuperPC) and partial least Cox (plsRcox), are implemented for identification of the prognostic factors. Through comparing these 101 machine learning combinations, the optimal model can be selected accordingly. Then, multivariate Cox regression analysis was carried out for construction of predictive model. Kaplan-Meier (KM) survival analysis and the log-rank test were performed to depict the capability of individual variables to discriminate overall survival (OS). The concordance index (C-index) was calculated to evaluate the discrimination of these nodal staging systems, Akaike Information Criterion (AIC) value, and time-dependent receiver operating characteristic (time-ROC) curves within 60 months were created to compare their predictive capability. The net reclassification improvement (NRI) and integrated discrimination improvement (IDI) were calculated to evaluate the improvement of the model prediction compared to the AJCC staging system. Finally, the performance and clinical benefits of predictive model was evaluated by calibration curve and decision curve analysis (DCA). The main utilized R packages were “rms”, “nricens”, “survival”, “Cschange”, “timeROC”, and so forth. A two-sided p < 0.05 was considered statistically significant.

3 Results

3.1 Patient demographics and characteristics

In the SEER cohort, a total of 6419 resectable PHC patients met the criteria after consulting the CS Schema v0204+, which included 4495 patients in the training cohort and 1924 patients in the internal validation cohort. In the external validation cohort, 670 resectable PHC patients was collected for validation and generalization. Among all resectable PHC patients, most of them were 60–80 years old (66.43%), male (55.15%), Race white (82.69%). The majority of them with AJCC II stage (43.26%), T2 stage (62.21%), and N1 stage (40.71%). Furthermore, the most common tumor differentiation was moderate (52.34%). With regard to the treatment, there were 4801 (74.79%) patients who received chemotherapy. There was no statistical difference in the demographics and characteristics between the training and internal validation cohort. In the external validation cohort, similarly, most of them were 60–80 years old (49.55%), male (55.67%). Moreover, the majority of them with AJCC I stage (52.69%), T2 stage (61.94%), and N0 stage (25.52%). In addition, the most common tumor differentiation was moderate (47.31%). With regard to the treatment, there were 283 (42.24%) patients who received chemotherapy. The baseline demographics and characteristics are shown in Tables 1, 2.

Table 1
www.frontiersin.org

Table 1. Baseline demographics and characteristics of patients in SEER cohort.

Table 2
www.frontiersin.org

Table 2. Baseline demographics and characteristics of patients in China cohort.

3.2 The determination of threshold of nodal staging systems

In the present study, the LNR ranged from 0 to 1, the LODDS ranged from -2.16 to 1.81, and the LONT ranged from 0 to 1.83 (Supplementary Figure 1). Based on the threshold, LNR was grouped into LNR1 (LNR ≤ 0.20) and LNR2 (LNR > 0.20), LODDS was classified into LODDS1 (-2.16 ≤ LODDS < -1.01), LODDS2 (-1.01 < LODDS ≤ -0.29), and LODDS3 (LODDS > -0.29), LONT was divided into LONT1 (LONT < 0.58) and LONT2 (LONT ≥ 0.58).

3.3 The identification of independent prognostic factors

In univariate Cox regression analysis, the interaction variables showed a statistically significant for age, AJCC staging, T stage, maximum tumor size, N stage, tumor differentiation, chemotherapy, MLNs, NLNs, LNR, LODDS, LONT (all p value < 0.05, Table 3). No statistically significant results were observed if stratified for sex and race (all p value > 0.05, Table 3). To avoid losing prognostic information and collinear contradiction, we performed four times multivariate Cox regression analysis. The results revealed four nodal staging systems could be deemed as independent prognostic factors (p value < 0.001, Supplementary Table 1). The KM curves depicted that all four nodal staging systems managed to significantly discriminate the OS of PHC patients (Supplementary Figure 2).

Table 3
www.frontiersin.org

Table 3. Univariable Cox regression analysis for overall survival of patients with PHC in training cohort.

3.4 Comparative prognostic performances of the four nodal staging systems for OS prediction

Comparisons of discriminability among four nodal staging systems were performed in the all cohorts. In the training cohort, LODDS has the highest C-index among all nodal staging systems, which is 0.589 (95% CI 0.578–0.600) and has best predictive capability than the N stage (C-index: 0.578, 95% CI 0.567–0.590, p = 0.005), LNR (C-index: 0.569, 95% CI 0.559–0.579, p < 0.001), and LONT (C-index: 0.559, 95% CI 0.549–0.569, p < 0.001). In the internal validation cohort, LODDS also has the highest C-index among all nodal staging systems, which is 0.590 (95% CI 0.573–0.607) and has best predictive capability than the N stage (C-index: 0.567, 95% CI 0.549–0.584, p < 0.001), LNR (C-index: 0.555, 95% CI 0.540–0.570, p < 0.001), and LONT (C-index: 0.573, 95% CI 0.558–0.589, p = 0.06). Similarly, in the external validation cohort, the LODDS has the highest C-index, which is 0.596 (95% CI 0.573–0.620), compare to the N stage (C-index: 0.575, 95% CI 0.551–0.599, p = 0.011), LNR (C-index: 0.546, 95% CI 0.529–0.563, p < 0.001), and LONT (C-index: 0.504, 95% CI 0.480–0.527, p < 0.001). The detailed information is shown in Table 4, Figures 2A–C. Next, we calculated the AIC values to verified the optimal predictive performance. The results showed that the LODDS has the lowest AIC value than others in all cohorts (Table 4). Then, the time-dependent area under the curves (AUCs) of the nodal staging systems for predicting OS within 60 months depict that LODDS has the highest AUC compared to others (Figures 2D–F). All the results indicated that LODDS has an excellent performance for lymph nodes stratification of resectable PHC patients.

Table 4
www.frontiersin.org

Table 4. The discriminatory performance of nodal staging systems in predicting overall survival of patients with PHC.

Figure 2
Bar and line charts illustrate the comparison of prognostic models across three cohorts: training, internal validation, and China cohort. Panels A, C, and E show C-index values for N stage, LNR, LODDS, and LONT. Panels B, D, and F depict time-dependent ROC curves for OS, showing varying AUC values over 60 months post-surgery.

Figure 2. Comparison of the C-index and time-dependent AUCs among nodal staging systems. (A, B) Training cohorts. (C, D) Internal validation cohort. (E, F) The China cohort. LNR, The ratio of metastatic to retrieved nodes; LODDS, The log odds of lymph node ratio; LONT, The log odds of negative lymph nodes/T stage; OS, Overall survival; AUC, Area under the curve.

3.5 Comparative prognostic performances of the four nodal staging systems based on the number of RLNs

According to previous study and AJCC staging system, the recommended number of RLNs for pathologic examination is 12 (21, 22). The results of C-index and AUC values indicated that LODDS system has better predictive capability than others both in patients with < 12 and ≥ 12 RLNs for OS prediction (Supplementary Table 2), revealing that it was not affected by the number of RLNs.

3.6 Risk stratification ability of LODDS among clinicopathological characteristics

We subsequently revealed the risk stratification ability using the LODDS system among clinicopathological characteristics, including age, gender, tumor differentiation, AJCC staging, T stage, and chemotherapy, and found that the LODDS system has a good ability to distinguish the prognosis of PHC patients in gender, tumor differentiation, AJCC staging, and T stage subgroups (Supplementary Figure 3).

3.7 Development, interpretation and validation of the predictive model for OS based on LODDS system

Based on the results of univariate Cox analysis, the variables (Age, T stage, chemotherapy, tumor differentiation, and LODDS) were subjected to our machine learning-based integrative procedure to develop a consensus model. We fitted 101 machine learning combinations via the LOOCV framework and further calculated the C-index of each combination across the training, internal validation and external validation cohorts.

Notably, the optimal model was a combination of stepwise Cox (direction = both) and survival−SVM with the highest average C-index (0.659), and this combination model had a related leading C-index both in internal validation and the external validation cohorts (Figure 3A). Then, a predictive model for OS was developed and validated via multivariable Cox analysis (Figures 3B, C). Next, the calibration curves of all cohorts for survival probability depicted that the model prediction had good consistency with the actual observation (Figures 3D–L).

Figure 3
Panel A displays a heatmap with various models and accuracy metrics, color-coded by cohort. Panel B presents a nomogram showing factors like T stage, differentiation, and chemotherapy affecting survival rates. Panel C is a bar chart showing coefficients of these factors. Panels D to L are scatter plots illustrating the predicted versus observed survival probabilities at one, three, and five years, with each plot corresponding to different sets or methodologies.

Figure 3. (A) Of 101 kinds of prediction models via LOOCV framework and further calculated the C-index of each model across all validation datasets. (B) The nomogram predicting OS of patients with PHC. (C) The coefficient of each variable included in predictive model. (D–L) Calibration curves showing the probability of 12-, 36-, and 60-month OS of the model prediction and the observed survival. (D–F) Training cohorts. (G–I) Internal validation cohort. (J–L) The China cohort. LODDS, The log odds of lymph node ratio; OS, Overall survival; PHC, Pancreatic head cancer.

For interpretation of the predictive model, the C-index, NRI, and IDI of the predictive model had better predictive capability than the AJCC staging system in all cohorts (Table 5), and the AUCs were significantly higher in all cohorts compared to those of the AJCC staging system (Figures 4A–C). All the results indicate that the predictive model had favorable discrimination for OS in PHC. Finally, DCA analysis demonstrated that the clinical utility of the present model was better, compared with the AJCC staging system in all cohorts (Figures 4D–F).

Table 5
www.frontiersin.org

Table 5. The NRIs, IDIs, and C-index of the predictive model and AJCC 8th staging system in overall survival prediction.

Figure 4
Panel of graphs evaluating Overall Survival (OS) using time-dependent ROC and Decision Curve Analysis (DCA). Panels A, C, and E show ROC curves for training, internal validation, and China cohorts, respectively, comparing a model with AJCC stage over 60 months. Panels B, D, and F present corresponding DCA curves for each cohort, analyzing net benefit across threshold probabilities. The model generally outperforms the AJCC stage across scenarios.

Figure 4. Comparison of the time-dependent AUCs and DCA between the model and the AJCC staging system. (A, B) Training cohorts. (C, D) Internal validation cohort. (E, F) The China cohort. OS, Overall survival; ROC, Receiver operating characteristic; DCA, Decision curve analysis.

3.8 Online access of the predictive model

To facilitate the application of our machine learning-based nomogram more conveniently, we developed and established a website calculator (https://vs-prediction.shinyapps.io/OS-PHC-prediction-tool/, Figure 5). Clinical physicians can calculate the corresponding survival probabilities by entering the demographics and characteristics of patients with PHC.

Figure 5
Dynamic nomogram interface showing a survival plot. The left pane includes dropdowns for T_stage, Differentiation, LODDS, and Chemotherapy, with selections like T1, Well, LODDS1, and No. A checkbox for Alpha blending is checked. The plot on the right displays Estimated Survival Probability over Follow Up Time.

Figure 5. Application of website calculator for online access of the predictive model.

4 Discussion

The accurate evaluation of lymph nodes metastases is pivotal for determination of appropriate and effective treatment. Previous studies have proposed various nodal staging systems, but their predictive value in forecasting the overall survival (OS) of patients with PHC remains to be determined (23). This study investigated the current nodal staging systems N stage, LNR, LODDS and LONT, explored their optimal threshold, and compared their prognostic performance using the large-sample, multicenter data from the SEER and the China databases. Differences in prognostic performance of the nodal staging systems were also investigated in each subgroup according to clinicopathological characteristics. Our results proved that the LODDS is the relatively best lymph node metastasis classification system compared to others. To our knowledge, this is the first study comparing the current four nodal staging systems in PHC patients, and further developing a novel predictive model based on LODDS and clinicopathological factors to predict the OS of resectable PHC patients.

The oncologic outcomes of PHC are very important in guiding the selection of treatment methods, and among these, lymph node metastasis is significantly prognostic factor for predicting survival of patients (24). Thus, effectively distinguishing the extent of lymph nodal metastasis is crucial. The most widely accepted for risk stratification is the N staging system. The N staging system stratified patients according to the number of MLN. In the light of AJCC staging system, the patients without MLN were referred as N0 stage, with 1–3 MLNs were referred as N1 stage, and ≥4 MLNs were referred as N2 stage (24). It has been applied to evaluate the extent of nodal metastasis of various malignant tumors. However, assessing prognosis solely based on the N staging system or MLN may be biased due to incomplete lymph node dissection or inadequate histopathological examination (25, 26). In recent years, a variety of lymph node staging systems have gradually emerged. LNR, an indicator reflecting the status of lymph node burden, holds significant importance in predicting prognosis of several tumors and less susceptibility to the influence of the number of RLNs (27, 28). Previous studies suggest that LNR is a promising biomarker, similarly, our results also verified that LNR may provide good prognostic stratification (12). The LODDS system has the advantage of good discrimination in patients with fewer number of RLNs and those without lymph node metastasis (29). The LODDS system is proposed to overcome the disadvantage of LNR and used for prognosis predicting and clinical stratification in tumour (30). It offers a more nuanced assessment of lymph node involvement than traditional staging methods. In addition, a new predictive indicator for assessing the lymph node status, known as LONT, has been devised for patient stratification. LONT can reflect both the stage of disease and the extent of lymph node dissection. In previous studies, LONT has been used for numerous tumors prognosis prediction with good accuracy (31, 32). Our study showed similar findings in prognostic performance of these nodal staging systems. They have all been proven to be independent prognostic factors for OS prediction of tumors including PHC.

Here, we comprehensively calculated and analyzed the LNR, LODDS, and LONT values of each participants using multicenter registry data from SEER and the China databases, and then obtained different subgroups based their cut-off values. The results of our analyses, including the C-index, AICs, AUCs within 60 months, showed that LODDS had best accuracy for predicting survival than other nodal staging systems in all cohorts. Riediger et al. study found LNR and LODDS are superior to the classical nodal status in predicting prognosis in resectable PHC, while LODDS has not shown advantage over LNR in their series of 409 patients (33). In our study, we demonstrated the LODDS could be considered as optimal indicator for nodal risk stratification and OS prediction of PHC patients. This result can provide additional evidence to prove the predictive performance of LODDS in patients with resectable PHC based on a larger-sample, multicenter cohort.

The number of RLNs for pathologic evaluation plays a pivotal role in assessment of lymph node status. According to previous studies, the recommended number of RLNs for pathologic evaluation is 12 (22). However, the detection capability for lymph nodes varies due to the different surgical approaches and strengths of medical institutions, there is an urgent need for a node staging system that can evaluate patients without considering the number of RLNs. Previous studies have proposed the LNR or LODDS may be superior to N staging system in patients with fewer RLNs (29, 34). This study showed similar findings. The LODDS system yielded an equivalent C-index and AUCs in patients with < 12 and ≥ 12 RLNs, and higher C-index and AUCs than other nodal staging systems in patients with < 12 and ≥ 12 RLNs. Therefore, LODDS could be considered as a more adequate nodal staging system for PHC because it is almost unaffected by the number of RLNs. The LODDS can be promising indicator of assessment of lymph nodal status, and the predictive model based on LODDS can be reproduced using a simple data collection, making it attractive for clinical translation and implementation in medical institutions at different levels.

To construct effective model based on LODDS for clinical application, the machine learning-based integrative procedure was implemented in the study. In the current times, machine learning procedures have gained widespread application and are extensively used in the construction of prognostic models, especially in various cancers (18, 19). In pancreatic cancer. Ren et al. constructed a novel 4 pancreatic cancer-related protein signatures model based on machine learning (35). Lee et al. used the machine learning procedures to identify and develop the microbiome markers-based model for diagnosis of pancreatic cancer (36). Additionally, Zhu et al. developed an interpretable machine learning model for predicting early liver metastasis after pancreatic cancer surgery (37). In the present study, we fitted 101 kinds of prediction models via the LOOCV framework in the training cohort to optimize variable selection with the highest average C-index. And further validations in the internal and external validation cohorts. The advantage of the comprehensive procedures is that it is based on various machine learning algorithms and their combinations to fit a model with consistent performance on the prognosis of PHC. The strategy can further reduce the dimensionality of variables, making the model more simplified and translational (38). Furthermore, AUCs, C-index, NRIs, and IDIs analysis suggested that the predictive model maintained the favorable discrimination and stable performance for OS prediction in all cohorts, which indicated great potential for the clinical application of predictive model. Previous study also developed many predictive model or risk score for OS prediction of PHC patients. Among these models, a limited number have been successfully integrated into clinical practice, and an even smaller subset has undergone rigorous external validation (20, 39, 40). Shi et al. established a nomogram to predict the prognosis of pancreatic cancer patients underlying surgery but did not conduct external validation (41). In addition, Peng et al. developed a nomogram to assess the survival period of postoperative pancreatic cancer patients and implemented a single center validation (42). In our study, the prognostic and machine learning analysis demonstrated that our predictive model was a commendable indicator of OS in PHC and had a better extrapolation possibility using multicenter registry data. The calibration curves for survival probability depicted that the model prediction had good consistency with the actual observation, and DCA analysis demonstrated that the good clinical utility of the present model. Finally, a convenient website calculator (https://vs-prediction.shinyapps.io/OS-PHC-prediction-tool/) to facilitate the application of our machine learning nomogram. Clinical physicians can calculate the corresponding survival probabilities by entering the clinical data embedded within Electronic Health Record (EHR) systems or accessed via hospital intranet portals. This facilitates its use in risk stratification, decision-making, and patient counseling, thereby seamlessly integrating data-driven prognostication into routine clinical practice.

The innovation of this study can be highlighted in two main aspects. Firstly, in the present study, the nodal staging system LODDS is the optimal prognostic factor with good performance comparable to the others. The LODDS was applicable regardless of the numbers of RLN. Second, the machine-learning model could effectively predict OS for resectable PHC patients.

Several limitations should be acknowledged in the present study. Firstly, all of the samples from this study were retrospective, and future availability of LODDS and validation of predictive model should be performed in prospective cohort. Secondly, the cut-offs were derived from the SEER cohort, their prognostic value was limited in the independent external cohort. However, the consistent performance across distinct populations significantly mitigates the concern of overfitting and supports the generalizability of these thresholds. Thirdly, difficulty in immediate calculation of LODDS restricted clinical application of the LODDS system in real-world practice. A simple calculator is worth developing in future study. Next, different demographic characteristics in SEER and Chinese cohorts may result in limited outcomes. However, the LODDS and predictive model maintained predictive accuracy across two geographically and clinically distinct populations, suggesting good generalizability. Finally, we lacked some routinely clinicopathological characteristics such as serological indicators, R0 resection rate and so forth. The absence of this information may have affected the factors needed in our model. Further verification and more clinicopathological characteristics collection are needed to optimize its application.

5 Conclusion

In conclusion, nodal staging system LODDS is an optimal prognostic indicator that can reflect the lymph nodal status with good performance. It could effectively predict OS for resectable PHC patients without considering the numbers of RLN. The machine learning model showed good predictive ability and could assist clinicians in formulating individualized treatment strategies.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Ethics Committee of The First Medical Center of Chinese People’s Liberation Army (PLA) General Hospital, No.924 Hospital of PLA Joint Logistic Support Force, Shandong Provincial Hospital Affiliated to Shandong First Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

WZ: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Writing – original draft, Writing – review & editing. XZ: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Writing – original draft, Writing – review & editing. YS: Conceptualization, Formal analysis, Investigation, Methodology, Software, Writing – original draft, Writing – review & editing. SX: Conceptualization, Investigation, Methodology, Resources, Software, Writing – original draft, Writing – review & editing. BL: Formal analysis, Methodology, Software, Writing – original draft. ZZ: Formal analysis, Resources, Software, Supervision, Validation, Writing – original draft. YG: Project administration, Resources, Validation, Writing – original draft. MH: Formal analysis, Methodology, Resources, Validation, Writing – original draft. GH: Conceptualization, Funding acquisition, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. RL: Conceptualization, Funding acquisition, Investigation, Project administration, Resources, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by Guilin Science and Technology Plan Project (Grant number: 20220139-5-3), The Technology Project within the No.924 Hospital of PLA Joint Logistic Support Force (Grant number: S2022FH01), Guangxi healthcare Science and Technology Plan Project (Grant number: Z-C20241574), Guangxi Natural Science Foundation (Grant number: 2025GXNSFBA069028), and the Fundamental Research Funds for the Central Universities of HIT.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2025.1639186/full#supplementary-material

References

1. Mizrahi JD, Surana R, Valle JW, and Shroff RT. Pancreatic cancer. Lancet. (2020) 395:2008–20. doi: 10.1016/S0140-6736(20)30974-0

PubMed Abstract | Crossref Full Text | Google Scholar

2. Siegel RL, Giaquinto AN, and Jemal A. Cancer statistics, 2024. CA Cancer J Clin. (2024) 74:12–49. doi: 10.3322/caac.21820

PubMed Abstract | Crossref Full Text | Google Scholar

3. Klein AP. Pancreatic cancer epidemiology: understanding the role of lifestyle and inherited risk factors. Nat Rev Gastroenterol Hepatol. (2021) 18:493–502. doi: 10.1038/s41575-021-00457-x

PubMed Abstract | Crossref Full Text | Google Scholar

4. Stoffel EM, Brand RE, and Goggins M. Pancreatic cancer: changing epidemiology and new approaches to risk assessment, early detection, and prevention. Gastroenterology. (2023) 164:752–65. doi: 10.1053/j.gastro.2023.02.012

PubMed Abstract | Crossref Full Text | Google Scholar

5. Hu ZI and O’Reilly EM. Therapeutic developments in pancreatic cancer. Nat Rev Gastroenterol Hepatol. (2024) 21:7–24. doi: 10.1038/s41575-023-00840-w

PubMed Abstract | Crossref Full Text | Google Scholar

6. Lou X, Li J, Wei YQ, Jiang ZJ, Chen M, and Sun JJ. Comparable prevalence of distant metastasis and survival of different primary site for LN + pancreatic tumor. J Transl Med. (2020) 18:266. doi: 10.1186/s12967-020-02438-1

PubMed Abstract | Crossref Full Text | Google Scholar

7. Liu R, Abu Hilal M, Besselink MG, Hackert T, Palanivelu C, Zhao Y, et al. International consensus guidelines on robotic pancreatic surgery in 2023. Hepatobiliary Surg Nutr. (2024) 13:89–104. doi: 10.21037/hbsn-23-132

PubMed Abstract | Crossref Full Text | Google Scholar

8. Zhang XP, Xu S, Gao YX, Zhao ZM, Zhao GD, Hu MG, et al. Early and late recurrence patterns of pancreatic ductal adenocarcinoma after pancreaticoduodenectomy: a multicenter study. Int J Surg. (2023) 109:785–93. doi: 10.1097/JS9.0000000000000296

PubMed Abstract | Crossref Full Text | Google Scholar

9. Wood LD, Canto MI, Jaffee EM, and Simeone DM. Pancreatic cancer: pathogenesis, screening, diagnosis, and treatment. Gastroenterology. (2022) 163:386–402.e1. doi: 10.1053/j.gastro.2022.03.056

PubMed Abstract | Crossref Full Text | Google Scholar

10. Min SK, You Y, Choi DW, Han IW, Shin SH, Yoon S, et al. Prognosis of pancreatic head cancer with different patterns of lymph node metastasis. J Hepatobiliary Pancreat Sci. (2022) 29:1004–13. doi: 10.1002/jhbp.1159

PubMed Abstract | Crossref Full Text | Google Scholar

11. Wang ZX, Qiu MZ, Jiang YM, Zhou ZW, Li GX, and Xu RH. Comparison of prognostic nomograms based on different nodal staging systems in patients with resected gastric cancer. J Cancer. (2017) 8:950–8. doi: 10.7150/jca.17370

PubMed Abstract | Crossref Full Text | Google Scholar

12. You MS, Lee SH, Choi YH, Shin BS, Paik WH, Ryu JK, et al. Lymph node ratio as valuable predictor in pancreatic cancer treated with R0 resection and adjuvant treatment. BMC Cancer. (2019) 19:952. doi: 10.1186/s12885-019-6193-0

PubMed Abstract | Crossref Full Text | Google Scholar

13. Zhou H, Li XX, Huang YP, Wang YX, Zou H, Xiong L, et al. Prognosis prediction and comparison between pancreatic signet ring cell carcinoma and pancreatic duct adenocarcinoma: a retrospective observational study. Front Endocrinol (Lausanne). (2023) 14:1205594. doi: 10.3389/fendo.2023.1205594

PubMed Abstract | Crossref Full Text | Google Scholar

14. Ge J, Ge J, Tang G, Xiong D, Zhu D, Ding X, et al. Machine learning-based identification of biomarkers and drugs in immunologically cold and hot pancreatic adenocarcinomas. J Transl Med. (2024) 22:775. doi: 10.1186/s12967-024-05590-0

PubMed Abstract | Crossref Full Text | Google Scholar

15. Gao J, Zhang M, Chen Q, Ye K, Wu J, Wang T, et al. Integrating machine learning and molecular docking to decipher the molecular network of aflatoxin B1-induced hepatocellular carcinoma. Int J Surg. (2025) 111:4539–49. doi: 10.1097/JS9.0000000000002455

PubMed Abstract | Crossref Full Text | Google Scholar

16. Kather JN, Pearson AT, Halama N, Jäger D, Krause J, Loosen SH, et al. Deep learning can predict microsatellite instability directly from histology in gastrointestinal cancer. Nat Med. (2019) 25:1054–6. doi: 10.1038/s41591-019-0462-y

PubMed Abstract | Crossref Full Text | Google Scholar

17. Ramos-Casallas A, Cardona-Mendoza A, Perdomo-Lara SJ, Rico-Mendoza A, and Porras-Ramírez A. Performance evaluation of machine learning models in cervical cancer diagnosis: Systematic review and meta-analysis. Eur J Cancer. (2025) 229:115768. doi: 10.1016/j.ejca.2025.115768

PubMed Abstract | Crossref Full Text | Google Scholar

18. Li CL, Yao ZY, Qu C, Shao GM, Liu YK, Pei XY, et al. Machine learning model reveals the risk, prognosis, and drug response of histamine-related signatures in pancreatic cancer. Discov Oncol. (2025) 16:155. doi: 10.1007/s12672-025-01910-y

PubMed Abstract | Crossref Full Text | Google Scholar

19. Wu L, Cen C, Ouyang D, Zhang L, Li X, Wu H, et al. Interpretable machine learning model for predicting early recurrence of pancreatic cancer: integrating intratumoral and peritumoral radiomics with body composition. Int J Surg. (2025) 111(11):8198–8211. doi: 10.1097/JS9.0000000000003078

PubMed Abstract | Crossref Full Text | Google Scholar

20. Zou W, Wang Z, Wang F, Zhang G, and Liu R. A nomogram predicting overall survival in patients with non-metastatic pancreatic head adenocarcinoma after surgery: a population-based study. BMC Cancer. (2021) 21:524. doi: 10.1186/s12885-021-08250-4

PubMed Abstract | Crossref Full Text | Google Scholar

21. Huebner M, Kendrick M, Reid-Lombardo KM, Que F, Therneau T, Qin R, et al. Number of lymph nodes evaluated: prognostic value in pancreatic adenocarcinoma. J Gastrointest Surg. (2012) 16:920–6. doi: 10.1007/s11605-012-1853-2

PubMed Abstract | Crossref Full Text | Google Scholar

22. Sahara K, Tsilimigras DI, Mehta R, Moro A, Paredes AZ, Lopez-Aguiar AG, et al. Trends in the number of lymph nodes evaluated among patients with pancreatic neuroendocrine tumors in the United States: A multi-institutional and national database analysis. Ann Surg Oncol. (2020) 27:1203–12. doi: 10.1245/s10434-019-08120-1

PubMed Abstract | Crossref Full Text | Google Scholar

23. Prassas D, Safi SA, Stylianidi MC, Telan LA, Krieg S, Roderburg C, et al. N, LNR or LODDS: which is the most appropriate lymph node classification scheme for patients with radically resected pancreatic cancer? Cancers (Basel). (2022) 14:1834. doi: 10.3390/cancers14071834

PubMed Abstract | Crossref Full Text | Google Scholar

24. Chun YS, Pawlik TM, and Vauthey JN. 8th edition of the AJCC cancer staging manual: pancreas and hepatobiliary cancers. Ann Surg Oncol. (2018) 25:845–7. doi: 10.1245/s10434-017-6025-x

PubMed Abstract | Crossref Full Text | Google Scholar

25. Slidell MB, Chang DC, Cameron JL, Wolfgang C, Herman JM, Schulick RD, et al. Impact of total lymph node count and lymph node ratio on staging and survival after pancreatectomy for pancreatic adenocarcinoma: a large, population-based analysis. Ann Surg Oncol. (2008) 15:165–74. doi: 10.1245/s10434-007-9587-1

PubMed Abstract | Crossref Full Text | Google Scholar

26. Lahat G, Lubezky N, Gerstenhaber F, Nizri E, Gysi M, Rozenek M, et al. Number of evaluated lymph nodes and positive lymph nodes, lymph node ratio, and log odds evaluation in early-stage pancreatic ductal adenocarcinoma: numerology or valid indicators of patient outcome? World J Surg Oncol. (2016) 14:254. doi: 10.1186/s12957-016-0983-5

PubMed Abstract | Crossref Full Text | Google Scholar

27. Huang X, Luo Z, Liang W, Xie G, Lang X, Gou J, et al. Survival nomogram for young breast cancer patients based on the SEER database and an external validation cohort. Ann Surg Oncol. (2022) 29:5772–81. doi: 10.1245/s10434-022-11911-8

PubMed Abstract | Crossref Full Text | Google Scholar

28. Arrichiello G, Pirozzi M, Facchini BA, Facchini S, Paragliola F, Nacca V, et al. Beyond N staging in colorectal cancer: Current approaches and future perspectives. Front Oncol. (2022) 12:937114. doi: 10.3389/fonc.2022.937114

PubMed Abstract | Crossref Full Text | Google Scholar

29. Zou W, Zhu C, Wang Z, Tan X, Li C, Zhao Z, et al. A novel nomogram based on log odds of metastatic lymph nodes to predict overall survival in patients with perihilar cholangiocarcinoma after surgery. Front Oncol. (2021) 11:649699. doi: 10.3389/fonc.2021.649699

PubMed Abstract | Crossref Full Text | Google Scholar

30. Occhionorelli S, Andreotti D, Vallese P, Morganti L, Lacavalla D, Forini E, et al. Evaluation on prognostic efficacy of lymph nodes ratio (LNR) and log odds of positive lymph nodes (LODDS) in complicated colon cancer: the first study in emergency surgery. World J Surg Oncol. (2018) 16:186. doi: 10.1186/s12957-018-1483-6

PubMed Abstract | Crossref Full Text | Google Scholar

31. Xie J, Pang Y, Li X, and Wu X. The log odds of negative lymph nodes/T stage: a new prognostic and predictive tool for resected gastric cancer patients. J Cancer Res Clin Oncol. (2021) 147:2259–69. doi: 10.1007/s00432-021-03654-y

PubMed Abstract | Crossref Full Text | Google Scholar

32. Chen T, Zhan X, Chen X, Jiang M, Wan H, Fu B, et al. Predictive value of the log odds of negative lymph nodes/T stage as a novel prognostic factor in bladder cancer patients after radical cystectomy. Front Oncol. (2022) 12:895413. doi: 10.3389/fonc.2022.895413

PubMed Abstract | Crossref Full Text | Google Scholar

33. Riediger H, Kulemann B, Wittel U, Adam U, Sick O, Neeff H, et al. Prognostic role of log odds of lymph nodes after resection of pancreatic head cancer. J Gastrointest Surg. (2016) 20:1707–15. doi: 10.1007/s11605-016-3200-5

PubMed Abstract | Crossref Full Text | Google Scholar

34. Xiao XS, Tang HL, Xie XH, Li LS, Kong YN, Wu MQ, et al. Metastatic axillary lymph node ratio (LNR) is prognostically superior to pN staging in patients with breast cancer–results for 804 Chinese patients from a single institution. Asian Pac J Cancer Prev. (2013) 14:5219–23. doi: 10.7314/APJCP.2013.14.9.5219

PubMed Abstract | Crossref Full Text | Google Scholar

35. Ren Z, Gao W, Li X, Jing Y, Liu Z, Li X, et al. Molecular structure and mechanism of protein MSMB, TPPP3, SPI1: Construction of novel 4 pancreatic cancer-related protein signatures model based on machine learning. Int J Biol Macromol. (2025) 307:142075. doi: 10.1016/j.ijbiomac.2025.142075

PubMed Abstract | Crossref Full Text | Google Scholar

36. Lee D, Lee C, Han K, Goo T, Kim B, Han Y, et al. Machine learning models for pancreatic cancer diagnosis based on microbiome markers from serum extracellular vesicles. Sci Rep. (2025) 15:10995. doi: 10.1038/s41598-025-94183-y

PubMed Abstract | Crossref Full Text | Google Scholar

37. Zhu H, Zhou Y, Shen D, Wu K, Gan X, Xue X, et al. An interpretable machine learning model for predicting early liver metastasis after pancreatic cancer surgery. BMC Cancer. (2025) 25:1117. doi: 10.1186/s12885-025-14503-3

PubMed Abstract | Crossref Full Text | Google Scholar

38. Liu Z, Liu L, Weng S, Guo C, Dang Q, Xu H, et al. Machine learning-based integration develops an immune-derived lncRNA signature for improving outcomes in colorectal cancer. Nat Commun. (2022) 13:816. doi: 10.1038/s41467-022-28421-6

PubMed Abstract | Crossref Full Text | Google Scholar

39. Guo X, Song X, Long X, Liu Y, Xie Y, Xie C, et al. New nomogram for predicting lymph node positivity in pancreatic head cancer. Front Oncol. (2023) 13:1053375. doi: 10.3389/fonc.2023.1053375

PubMed Abstract | Crossref Full Text | Google Scholar

40. Cai M, Guo T, Chen Z, Li W, Pu T, Zhang Z, et al. Development and validation of a network calculator model for safety and efficacy after pancreaticoduodenectomy in the elderly patients with pancreatic head cancer. Cancer Med. (2023) 12:19673–89. doi: 10.1002/cam4.6613

PubMed Abstract | Crossref Full Text | Google Scholar

41. Shi XY, Wang Y, Zhou X, Xie ML, Ma Q, Wang GX, et al. A population-based nomogram to individualize treatment modality for pancreatic cancer patients underlying surgery. Sci Rep. (2023) 13:4856. doi: 10.1038/s41598-023-31292-6

PubMed Abstract | Crossref Full Text | Google Scholar

42. Peng W, Yu X, Yang R, Nie S, Jian X, and Zeng P. Construction and validation of a nomogram for cancer specific survival of postoperative pancreatic cancer based on the SEER and China database. BMC Gastroenterol. (2024) 24:104. doi: 10.1186/s12876-024-03180-4

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: pancreatic head cancer, nodal staging systems, machine learning, overall survival, predictive model

Citation: Zou W-B, Zhang X-P, Song Y-Y, Xu S, Liu B-Y, Zhao Z-M, Gao Y-X, Hu M-G, Huang G and Liu R (2025) Predictive value of current nodal staging systems and development of machine learning nomogram for resectable pancreatic head cancer: a population-based study and multicenter validation. Front. Immunol. 16:1639186. doi: 10.3389/fimmu.2025.1639186

Received: 01 June 2025; Accepted: 11 November 2025; Revised: 22 October 2025;
Published: 02 December 2025.

Edited by:

Ayon Chakraborty, Research and Education in Cancer (ACTREC), India

Reviewed by:

Vinod Kumar Yata, Malla Reddy University, India
Kangping Yang, Second Affiliated Hospital of Nanchang University, China

Copyright © 2025 Zou, Zhang, Song, Xu, Liu, Zhao, Gao, Hu, Huang and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Rong Liu, bGl1cm9uZzMwMUAxMjYuY29t; Gao Huang, aHVhbmdnYW8xOTgxQDE2My5jb20=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.