Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol., 11 December 2025

Sec. Gastrointestinal Cancers: Hepato Pancreatic Biliary Cancers

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1665601

This article is part of the Research TopicInnovative Diagnostic and Therapeutic Strategies for Neuroendocrine Tumors: A Multidisciplinary ApproachView all 4 articles

A CT-based interpretable machine learning model for preoperative prediction of pancreatic neuroendocrine tumor aggressiveness

Rong Kong&#x;Rong Kong1†Shunzu Lu&#x;Shunzu Lu1†Yugui HuangYugui Huang1Siyu TanSiyu Tan1Chunxia ZhuChunxia Zhu1Guowei ChenGuowei Chen1Mingrui YangMingrui Yang1Ying LiuYing Liu2Qixin WuQixin Wu3Peng Peng*Peng Peng1*
  • 1Department of Radiology, the First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
  • 2Department of Geriatric Gastroenterology, the First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China
  • 3Department of Radiology, Chongzuo People’s Hospital, Chongzuo, Guangxi, China

Objectives: This study aimed to develop and validate an interpretable machine learning (ML) model based on structured preoperative CT features for non-invasive prediction of pancreatic neuroendocrine Tumors (PNETs) aggressiveness.

Methods: This retrospective study included 112 patients with PNETs who underwent contrast-enhanced abdominal CT. Patients were randomly assigned to training and validation cohorts. Clinical data and CT features were analysed using the Least Absolute Shrinkage and Selection Operator method and multivariate logistic regression to identify independent risk factors. Multiple ML models were evaluated to determine the optimal classifier. Model performance was assessed using receiver operating characteristic and calibration curves, and decision curve analysis. Shapley Additive Explanations (SHAP) quantified feature importance for interpretable risk prediction.

Results: A total of 112 patients were evaluated, including 80(mean age± standard deviation, 47 ± 13 years; 36 males)) in the training set and 32 (48 ± 15 years; 12 males) in the validation set. Tumour shape, necrotic changes, arterial relative enhancement ratio, and enhancement pattern independently predicted PNETs aggressiveness. The logistic regression model demonstrated excellent discrimination, achieving an area under the curve of 0.952 (95% CI: 0.952 (0.909–0.994) in the training cohort and 0.972 (95% CI 0.927–1.000) in the validation cohort. SHAP summary and force plots facilitated global and local model interpretation.

Conclusion: The Interpretable ML model based on CT features could serve as a preoperative, noninvasive, and precise evaluation tool to differentiate aggressive and non-aggressive PNETs, facilitating personalized clinical management and potentially improving patient outcomes.

1 Introduction

Pancreatic neuroendocrine Tumors (PNETs), which originate from neuroendocrine cells, comprising approximately 5% of pancreatic tumors and exhibiting considerable clinical heterogeneity (1, 2). Although they are uncommon, the incidence has shown a yearly increasing trend over the past three decades (3, 4). Previously considered indolent, recent studies reveal their complexity, with some subtypes being highly aggressive (5). Studies have demonstrated that while the median survival for localized NETs exceeds 30 years, those with peritoneal metastases face a drastically reduced median survival of only about 1 year (6, 7). Given their potential for aggressive behavior, PNETs require careful clinical evaluation and management.

Surgery remains the primary and only curative treatment (8); however, no consensus exists on the scope of surgical intervention, partly due to challenges in assessing tumour aggressiveness preoperatively. Treatment strategies and prognoses vary with tumour aggressiveness. For non-aggressive PNETs, local resection lowers surgical risks without affecting prognosis, while aggressive PNETs require radical resection, lymph node dissection, and adjuvant therapy to minimise recurrence. Thus, precise preoperative assessment of tumour invasiveness is critical for optimising clinical decisions and tailoring surgical plans.

Currently, the TNM staging system and World Health Organisation (WHO) histopathological classification assess invasiveness; however, these systems rely on postoperative data, limiting preoperative utility (9, 10). Endoscopic ultrasound-guided fine-needle aspiration biopsy (EUS-FNAB), commonly used in PNETs, may help risk-stratify patients by determining tumour histological grade. However, it is invasive, carries potential risks, and has limited accuracy (11), making reliable histology obtainable only postoperatively. Thus, a non-invasive and repeatable method for preoperative PNETs aggressiveness prediction is urgently needed, yet no widely accepted solution exists.

Recent advancements in artificial intelligence, particularly machine learning (ML), enable high-precision predictions by analysing data and identifying patterns through powerful algorithms. These technologies are transforming biomedical research, personalised medicine, and computer-aided diagnosis (12). Compared to traditional methods, ML-based feature selection models offer superior predictive capabilities across diseases. By analysing complex imaging and clinical data, these models assist in predicting PNETs invasiveness and optimising clinical management. Although ML algorithms are promising, their “black box” nature, particularly the lack of interpretability in risk prediction models, limits clinical application (13, 14).

The Shapley Additive Explanation (SHAP) method, a practical decision-making explaining tool, quantifies features-outcome relationships, improving model interpretability (15). It generates personalised risk predictions and visualises each feature’s contribution, integrating biological and clinical models. This approach enhances clinicians’ understanding of predictive models while supporting personalised medicine. To our knowledge, the noninvasive aggressiveness prediction in PNETs using clinical and CT features based on interpretable machine learning has not been well established in the literature.

This study aimed to develop and validate a non-invasive ML model using CT-based quantitative and qualitative features alongside clinical factors to predict PNETs invasiveness preoperatively. Additionally, the SHAP method was employed to visualise and explain features-clinical outcome relationships, improving clinician confidence in predictions and enabling early interventions. To our knowledge, no previous study has combined CT features with ML and utilised SHAP to predict PNETs aggressiveness.

2 Materials and methods

This retrospective study was approved from the institutional ethics committees and waived the requirement for written informed consent from the patients.

2.1 Patients

Patients with pathologically confirmed PNETs treated at the First Affiliated Hospital of Guangxi Medical University between May 2015 and May 2024 were included. Exclusion criteria were: (1) local or systemic therapy before imaging, (2) Absence of preoperative CT or an interval exceeding 4 weeks between CT and surgery, (3) Missing pathological grade information, (4) Poor image quality unsuitable for analysis.

The study recruitment process is illustrated in Figure 1. Patients were randomly assigned to training and test cohorts. Baseline clinical data, including sex, age, symptoms (present/absent), alpha-fetoprotein, carcinoembryonic antigen, carbohydrate antigens (CA125, CA153, CA199), and haematocrit levels, were collected from medical records.

Figure 1
Flowchart of patient selection for a retrospective study on pathologically confirmed PNENs from May 2015 to May 2024 at Institution A. Initially, 475 patients were identified. Inclusion criterion was contrast-enhanced abdominal CT within one month before surgery, resulting in 174 patients. Exclusions (62 total) included pre-anti-tumor therapy (25), incomplete pathological information (20), and inadequate imaging quality (17), leaving 112 patients. These were divided into a training cohort (80) and a test cohort (32).

Figure 1. Flowchart illustrating the patient selection process in this study.

2.2 Definition of PNETs aggressiveness

PNETs aggressiveness was determined based on postoperative histopathological and clinical outcomes. Tumours were classified as aggressive if any of the following endpoints were present: G3 tumour grade, distant metastases, metastatic lymph nodes, microvascular invasion, and/or disease recurrence (16). PNETs were graded per WHO 2017 criteria: G1: Ki-67 ≤2%, G2: Ki-67 = 3–20%, G3: Ki-67 >20% (17). Metastatic lymph nodes status(N+) was assigned only for patients who underwent lymph node dissection with histologically confirmed nodal metastasis. Disease recurrence was evaluated during follow-up through outpatient visits or telephone interviews and was defined as radiologically or pathologically confirmed local recurrence or distant relapse after curative resection. Both the site and number of recurrences lesions were recorded.

2.3 CT protocols

Patients fasted from solid food for 4–6 h. Owing to the extended data collection period, multiple CT scanners were used: 64-channel CT (LightSpeed VCT, GE Healthcare), 256-channel CT (Revolution, GE Healthcare), and a dual-source CT (SOMATON Definition Flash, Siemens Healthcare). Scan parameters were: tube voltage, 120 kV; tube current: 250–350 mA; matrix, 512 × 512; and slice thickness, 2.0–2.5mm. For contrast-enhanced imaging, a non-ionic contrast medium (iohexol, iodine content, 350 mg/mL) was administered intravenously at 3 mL/s via an automatic power injector. Pre-contrast scans were followed by contrast-enhanced scans at 35–45 s (arterial phase), 70 s (portal venous phase), and 180 s (equilibrium phase) post-contrast injection.

2.4 CT feature analysis

2.4.1 Qualitative features

CT images were retrieved from the picture archiving and communication system (PACS) and reviewed by two radiologists (6 and 10 years of experience in abdominal diagnosis), both of whom were aware of the presence of PNETs but were blinded to clinical and pathological data. Discrepancies in assessment were resolved by consensus following consultation with a third radiologist with 25 years of clinical experience in pancreatic imaging.

The following qualitative tumour features were assessed: location, size, number, shape, components, margin, and presence of necrosis. Other features are calcification, homogeneity, biliopancreatic duct dilatation, pancreatic parenchymal atrophy, uniformity of enhancement, and enhancement patterns. When multiple lesions were present, the tumour with the largest diameter was selected for analysis.

For tumour margins, a smooth, visible edge was considered well-defined, while spiculation or infiltration along more than 25% of the tumour perimeter was considered ill-defined (18). Uniformity of enhancement, assessed on portal phase images, was categorised as homogeneous or heterogeneous (19). The main pancreatic duct was classified as dilated when its diameter was ≥3 mm, and the common bile duct was considered dilated when its diameter was ≥10 mm (20). On contrast-enhanced images, tumours were categorised according to enhancement patterns as either hyper-attenuating (hyper-group) or iso/hypo-attenuating alone (iso/hypo-group) (21).

2.4.2 Quantitative assessment

The CT values (in Hounsfield unit, HU) values were measured using manually drawn circular/ovoid regions of interest (ROI), carefully avoiding calcifications, haemorrhage, cystic or necrotic components, vessels, and artefacts. ROIs were copied and pasted across phases of the same area with necessary adjustments. Serum haematocrit levels were obtained within 48 h of CT. Intra-observer agreement was ensured by repeating measurements, with the mean used for analysis.

Quantitative assessment was performed as follows: (a) Absolute enhancement = HU (arterial/portal phase) – HU (unenhanced tumour) (22); (b) Relative enhancement ratio: Arterial/portal enhancement divided by HU (unenhanced phase) (22); (c) Enhancement ratio: HU (arterial/portal tumour) divided by HU (adjacent parenchyma) on the same phase (20); (d) Extracellular volume fraction (ECV) was calculated using the formula: ECV (%) = (100 - hematocrit) × (ΔHU tumour/ΔHU aorta), where ΔHU tumour and ΔHU aorta are the HU values on equilibrium-phase images minus those on pre-contrast images for the tumour and abdominal aorta, respectively (23).

2.5 Feature selection and model construction

Patients were randomly divided (7:3 ratio) into training and validation cohorts. Least Absolute Shrinkage and Selection Operator (LASSO) regression with 10-fold cross-validation was used to optimise the variable selection and model complexity. The optimal λ value was determined to minimise cross-validation error based on non-zero coefficients, addressing multicollinearity via L1 regularisation. Subsequently, a multivariate logistic regression was applied once within the training cohort to identify independent predictors from the LASSO-selected variables. This step was used exclusively for feature refinement and did not constitute model training.

After feature selection, seven commonly used ML classifiers—Logistic Regression, Extreme Gradient Boosting (XGBoost), AdaBoost, K-Nearest Neighbors (KNN), Neural Network, Support Vector Machine (SVM), and CatBoost—were trained using only the selected predictors. All models were trained using repeated 10-fold cross-validation (10-fold CV with 5 repeats) within the training cohort to enhance model stability and reduce variance. Model performance was evaluated in both the training and validation cohorts.

Receiver-operating characteristic (ROC) curves and the area under the curve (AUC) were used to assess predictive accuracy for PNETsinvasiveness. Pairwise AUC comparisons among the ML models were conducted using DeLong’s test with Bonferroni correction. Additional evaluation metrics included accuracy, F1-score, sensitivity, and specificity. Calibration curves assessed predict reliability, while Decision Curve Analysis (DCA) evaluated the clinical applicability of the models (24). The optimal model was selected based on a comprehensive evaluation of discrimination, calibration, cross-cohort stability, and clinical net benefit.

2.6 Model interpretation and visualisation

To improve model interpretability, the SHAP method was applied. SHAP allows for both cohort- and patient-level interpretations by aggregating and averaging SHAP values to assess each feature’s impact (15). The SHAP feature importance plot ranks the features based on their overall contribution to the model. The SHAP bee swarm plot displays the range and distribution of each feature’s contribution, associating feature values with their corresponding impacts on the model. SHAP dependence plots illustrate how changes in feature values affect model predictions, while SHAP force plots provide an intuitive visualisation of how individual features influence a single prediction.

2.7 Statistical analysis

All statistical analyses were performed using SPSS (version 26.0, IBM) and R software (version 4.4.2). Continuous variables were expressed as mean (SD) and compared using an unpaired, two-tailed t-test (continuous variables with a normal distribution) or the Mann-Whitney test (continuous variables without a normal distribution). Categorical variables were compared using the χ2 test or Fisher’s exact test. In all analyses, p < 0.05 was considered statistically significant.

3 Results

3.1 Clinical and baseline characteristics

The study included 112 patients, randomly divided into training (n = 80; mean age, 47 ± 13 years; 36 males) and test (n = 32; mean age, 48 ± 15 years; 12 males) sets. The proportion of aggressive tumours was 41.25% (33/80) and 40.62% (13/32) in the training and test sets, respectively. In the training set, the aggressive group included patients with G3 grading (n = 11), lymph node metastasis (n = 17), microvascular invasion (n = 5), preoperative distant metastasis or postoperative recurrence (n = 24). Liver metastasis was the most common site of recurrence (n = 22), followed by local recurrence (n = 2). Similarly, in the test set, the aggressive group included patients with G3 grading (n = 4), lymph node metastasis (n = 5), microvascular invasion (n = 3), and preoperative distant metastasis or postoperative recurrence (n = 8). Liver involvement was again the predominant site (n = 7), followed by local recurrence (n = 1). The sum of individual components exceeded the number of aggressive cases in both cohorts because some patients had multiple aggressiveness endpoints. No significant differences were observed between the training and test cohorts (Table 1).

Table 1
www.frontiersin.org

Table 1. Baseline characteristics of the training and validation cohorts.

3.2 Feature extraction and selection

All clinical and imaging features were analysed using LASSO regression (Figure 2). The model identified 12 predictive factors using the minimum λ value of 0.034: tumour location, shape, margin, necrosis, pancreatic duct expansion, pancreatic atrophy, enhancement pattern, size, CA153, CA199, arterial absolute enhancement, and arterial relative enhancement ratio. To control for confounding variables, multivariable logistic regression identified tumour shape (OR = 23.513, 95% CI: 3.340–165.070), necrosis (OR = 9.810, 95% CI: 1.844–52.182), enhancement pattern (OR = 5.203, 95% CI: 0.916–29.550), and arterial relative enhancement ratio (OR = 0.444, 95% CI: 0.194–1.014) as independent predictors of invasiveness (Figure 3). These factors were incorporated into ML models.

Figure 2
Two panels are shown. Panel A: A line plot of coefficients versus log lambda, featuring multiple colored lines converging towards zero as log lambda increases. Panel B: A line plot of binomial deviance versus log lambda, with a red curve showing deviance values decreasing then increasing, alongside error bars and two vertical dashed lines marking specific values of lambda.

Figure 2. Least Absolute Shrinkage and Selection Operator (LASSO) regression analysis for feature selection. (A) Distribution of LASSO coefficients for different λ values. (B) Optimal λ values selected based on 10-fold cross-validation and minimum mean squared error (MSE), represented by vertical dashed lines.

Figure 3
Table and forest plot illustrating regression analysis results. The table lists variables including arterial relative enhancement ratio, enhancement pattern, necrosis, and shape, with corresponding coefficients, standard errors, Wald statistics, p-values, and odds ratios with confidence intervals. The plot visualizes odds ratios with error bars, indicating statistical significance for necrosis and shape variables.

Figure 3. Forest plot showing the results of multivariate logistic regression analysis.

3.3 Development and validation of ML models

Model performance metrics, including AUC, accuracy, F1 score, sensitivity, and specificity, are summarised in Table 2. In the training set, the KNN classifier yielded the highest AUC (0.994; 95% CI, 0.983–1.000), followed by CatBoost (0.969; 95% CI, 0.938–1.000) and logistic regression (0.952; 95% CI, 0.909–0.994), with SVM, XGBoost, Neural Network, and Adaboost slightly lower. In the validation set, the logistic regression model achieved the highest AUC (0.972, 95% CI: 0.927–1.000), followed by SVM (0.947, 95% CI: 0.879–1.000), CatBoost (0.941, 95% CI: 0.865–1.000), Neural Network (0.907, 95% CI: 0.792–1.000), Adaboost (0.919, 95% CI: 0.828–1.000), XGBoost (0.891, 95% CI: 0.782–1.000), and KNN (0.848, 95% CI: 0.712–0.984). Regarding accuracy, the logistic regression model, SVM, Neural Network, XGBoost, KNN, Adaboost, and CatBoost achieved 90.6%, 87.5%, 90.6%, 78.1%, 81.2%, 87.5%, and 87.5% accuracy, respectively. Logistic regression also achieved the highest F1 score (0.897), balanced sensitivity (100%) and specificity (0.842) in the validation cohort.

Table 2
www.frontiersin.org

Table 2. Diagnostic performance of machine learning classifiers in predicting pancreatic neuroendocrine neoplasm aggressiveness.

ROC curves were generated for both the training and validation sets to evaluate confirmed the model’s performance (Figures 4A, B). In the training set, KNN, CatBoost, and logistic regression models performed the best, while the logistic regression model showed the highest performance in the testing set, with an AUC of 0.972 (95% CI: 0.927–1.000). Calibration curves (Figures 4C, D) demonstrated good agreement between predicted and observed tumour invasiveness in both sets, indicating accurate predictions. DCA demonstrated a high net benefit across threshold probabilities (Figures 4E, F). Pairwise comparisons of AUC values among the seven classifiers showed no statistically significant differences (all p > 0.05). Nonetheless, logistic regression was selected as the optimal model based on its multidimensional strengths, including superior discrimination, accurate calibration, cross-cohort stability, and consistently high clinical utility. Together, these findings support logistic regression as the most robust and clinically applicable model for predicting PNETs invasiveness.

Figure 4
Chart panels A and B display ROC curves comparing Logistic, SVM, Neural Network, Xgboost, KNN, Adaboost, and CatBoost models, with AUC values provided. Panels C and D show calibration curves, presenting observed event percentages versus bin midpoints for the same models. Panels E and F depict decision curves illustrating standardized net benefits across various high-risk thresholds. Each panel includes legends and axes labels for clarity.

Figure 4. Performance evaluation of the machine learning model in predicting pancreatic neuroendocrine neoplasm invasiveness. (A) Receiver operating characteristic (ROC) curves for the training set. (B) ROC for the validation set. (C) Calibration curves for the training set, with the x-axis representing average predicted probability and the y-axis representing actual event probability. (D) Calibration curve for the validation set. (E) Decision Curve Analysis (DCA) curves for the training set. (F) DCA curve for the validation set.

3.4 Exploration of model interpretability

To enhance interpretability, SHAP values were used to assess features importance. The top-ranked predictors in the logistic regression model were tumour shape, arterial relative enhancement ratio, necrosis, and enhancement pattern (Figure 5A). The SHAP bee swarm plot (Figure 5B) illustrates the contribution of each feature, with purple indicating a negative effect and yellow indicating a positive effect on the predicted probability. Each point represents a patient’s SHAP value, with the density of the points reflecting the distribution of the same SHAP value. The points are coloured according to the feature values, ranging from low (purple) to high (yellow). For instance, patients with irregular shapes or lower arterial relative enhancement ratios are more likely to have invasive tumours. The SHAP dependency plot (Figure 6) demonstrates that lower arterial relative enhancement ratios correlated with increased tumour invasiveness. Irregular tumour shape and necrosis also correlated with increased SHAP values.

Figure 5
Two panels labeled A and B illustrate SHAP values for features affecting a logistic model. Panel A shows a bar chart with features: Shape, The arterial relative enhancement ratio, Necrosis, and Enhancement pattern. Panel B displays a scatter plot of SHAP values for the same features, with dots colored by feature value from low (purple) to high (yellow).

Figure 5. Shapley Additive Explanations (SHAP) summary plot illustrating feature importance. (A) Ranking of feature importance in the final predictive model. (B) SHAP values of each feature in the final model, where each line represents a feature, and the abscissa is the SHAP value. The different colors (yellow and purple) represent different levels of effect on the output of the model.

Figure 6
Three scatter plots labeled A, B, and C show SHAP values versus features: Shape (A), Arterial Relative Enhancement Ratio (B), and Necrosis (C). Data points are colored by enhancement pattern, with a gradient from purple to orange, and the SHAP values range differently for each feature.

Figure 6. Shapley Additive Explanations (SHAP) dependence plot shows the influence of individual features on model predictions. (A) Tumour shape (B) Arterial relative enhancement ratio change (C) Tumour necrosis. The y-axis represents SHAP values, while the x-axis displays specific feature values. Each point represents the SHAP value for a patient, with the colour gradient from yellow to purple indicating high to low feature values. A SHAP value >0 indicates an increased risk of tumour aggressiveness.

SHAP force plots visualised individual predictions using the logistic regression model and accurately interpreted the predictions for two patients (Figure 7). The SHAP values are represented as forces that either increase or decrease the prediction. Each feature is shown by an arrow originating from the baseline value (0.406), which represents the average SHAP value across all predictions. The length of the arrow reflects the magnitude of the feature’s contribution (as a percentage), while the direction and colour (yellow for positive and purple for negative) indicate whether each feature increases or decreases the model’s prediction relative to the baseline.

Figure 7
CT scans of two patients are shown with annotated sections. Each patient's images are accompanied by bar graphs indicating factors like necrosis and enhancement patterns. Patient 1 has three red-highlighted areas with negative prediction values, while Patient 2 has yellow-highlighted areas and positive prediction values.

Figure 7. Individualised visualisation of the model using Shapley Additive Explanations. Predictions for two patients, showing correctly classified negative versus positive aggressiveness in patients with pancreatic neuroendocrine neoplasm, respectively. The colour gradient represents feature contributions, with yellow indicating a positive contribution and purple indicating a negative contribution.

4 Discussion

This study developed and evaluated an interpretable ML model integrating quantitative and qualitative CT imaging features to predict PNETsaggressiveness with high accuracy. Seven ML techniques were applied to a large patient cohort to minimise overfitting. Among these, logistic regression demonstrated the highest predictive performance, achieving AUCs of 0.952 and 0.972 in the training and test sets, respectively, demonstrating strong predictive efficacy. SHAP analysis enhanced interpretability, offering insights into key predictive factors.

Few studies have investigated PNETs aggressiveness. In one study, patients developed a deep learning-based nomogram using contrast-enhanced ultrasound (25), achieving AUCs of 0.97 and 0.85 in the training and validation sets, respectively, demonstrating strong discrimination, favourable calibration, and clinical utility. Mori et al. (26) predicted distant metastasis and microvascular invasion using radiomics features and clinical parameters, with AUCs of 0.67–0.85 and negative predictive values of 0.75–0.98. Shen et al. (27) identified tumour size, bile and pancreatic duct dilatation, lymphadenopathy, and enhancement patterns as key predictors, achieving AUCs of 0.89 and 0.86 in the training and test set. Despite these contributions, the lack of interpretability in previous models results in a “black-box” effect, which hinders clinical adoption (28).

To address this, our study employed SHAP analysis to enhance model interpretability while maintaining optimal predictive performance. SHAP quantified the contributions of four independent predictive features, determining their impact on tumour aggressiveness. Additionally, conventional CT imaging data were used, which are readily accessible without requiring complex software or invasive procedures. Compared to radiomics-based approaches, our model offers lower costs, reduced workload, superior predictive performance, and improved clinical applicability.Our findings highlight the potential of qualitative and quantitative CT features in PNETs risk stratification, although direct comparison with previous studies is not feasible. By providing both global and local interpretability, the model enhances transparency, allowing clinicians to better understand and trust its predictions.

This study identified four CT features as independent predictors of PNETs aggressiveness, ranked by SHAP analysis: tumour shape, arterial relative enhancement ratio, necrosis-cystic changes, and enhancement pattern. Irregular tumour margins were strongly associated with invasion and lymphatic spread, aligning with previous studies. Irregular margins are significantly correlated with tumour size, high histological grade, liver metastases, and lymph node metastasis (29). Zhu et al. (30) reported that tumours with irregular margins were 4.72 times more likely to develop malignant lymph nodes compared to those with smooth margins, further reinforcing the association between irregular borders and PNETsaggressiveness. Irregularly shaped tumours with incomplete capsules exhibit a higher likelihood of lymphatic or vascular invasion, which is strongly associated with poor prognosis, increased metastasis risk, and greater treatment resistance, significantly reducing overall survival (29).

Our findings indicate the critical role of necrotic changes in PNETs aggressiveness. Histopathologically, well-differentiated PNETs typically consist of small or medium-sized monomorphic cells arranged in trabeculae or islets, displaying a classic “salt-and-pepper” chromatin pattern. By contrast, poorly differentiated PNETs exhibit pleomorphic cells, highly atypical clusters, and extensive haemorrhagic necrosis (31, 32). Worhunsky et al. (33) similarly reported a relationship between hypodense foci and necrosis on histopathology. Some studies have also identified tumour size as a factor influencing PNETs aggressiveness (25, 34). In our study, although invasive PNETs were markedly larger, tumour size was not an independent risk factor, consistent with Chen et al. (35). Since most PNETs are asymptomatic and detected incidentally, symptom severity does not necessarily correlate with tumour size. Therefore, size alone is insufficient for assessing tumour aggressiveness.

The arterial relative enhancement ratio emerged as a critical predictor of tumour aggressiveness. The SHAP dependence plot demonstrated a strong inverse relationship: as the enhancement ratio increased, tumour invasiveness decreased. This may be attributed to PNETs being more susceptible to vascular invasion and microthrombosis, leading to reduced arterial supply and venous drainage (34). Differences in blood flow and microvascular density between tumours of varying invasiveness suggest angiogenesis levels are influenced by cellular differentiation and angiogenesis marker expression (36). High-grade PNETs frequently exhibit apoptotic bodies, map-like necrosis, and invasive growth patterns, resulting in low, heterogeneous enhancement (37). Our findings support the link between reduced microvascular density and arterial-phase enhancement, aligning with Yano et al. (38), which reported lower enhancement in metastatic tumours compared to non-metastatic ones.

Additionally, the arterial relative enhancement ratio provided a more objective and reliable prediction of aggressiveness than subjective radiologists’ assessments. As tumours progress, their imaging features become more complex, reducing the reliability of visual assessments. Thus, relying solely on radiologists’ expertise to predict aggressiveness has limitations. Further analysis revealed no significant differences in absolute enhancement, relative enhancement ratios, or portal phase enhancement across tumours of varying aggressiveness, reinforcing the greater predictive value of arterial phase enhancement. Collectively, these findings support the incorporation of CT features into a nomogram for accurate, preoperative PNETsrisk assessment.

After clinicians understood how various features influenced the logistic model and the potential pathophysiological mechanisms, they sought to apply the model to individual patient outcomes. To achieve this, we interpreted individual patient assessments locally by using the SHAP force plot. Compared to the nomogram method, which requires clinicians to calculate specific feature values to generate total points (39), the SHAP force plot is more time-efficient and user-friendly. Clinicians could directly compare the output SHAP value of a single patient with the base value, facilitating a clearer and more intuitive interpretation.

Our study has several limitations. First, Its retrospective, single-institution design may introduce selection bias and limit generalisability. Second, Given the rarity of PNETs, the cohort size—particularly the number of aggressive cases in the validation set—was relatively small, although measures such as LASSO feature selection and multivariate logistic regression were taken to minimise overfitting risks and ensure robust model performance. Third, we focuses on conventional CT features and did not incorporate radiomics analysis, which could potentially provide additional predictive information. Fourth, due to the extended recruitment period, various CT scanners were used, which may have affected image analysis. Fifth, while LASSO and multivariate regression helped control feature selection and combined effects, multiple comparison corrections (e.g., Bonferroni or FDR) were not applied, so a potential risk of Type I error remains. Despite these limitations, our findings demonstrate the potential of conventional CT-based features for predicting PNETs aggressiveness, providing a foundation for future multicenter studies that may integrate radiomics and further validate these predictive models.

5 Conclusion

In conclusion, we developed a reliable, interpretable ML model that accurately predicts PNETsinvasiveness, offering valuable insights into tumour biology. This model could aid preoperative decision-making and support personalised management strategies for patients with PNETs.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by The Medical Ethics Committee of First Affiliated Hospital of Guangxi Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

RK: Conceptualization, Funding acquisition, Software, Writing – original draft, Writing – review & editing, Formal analysis, Methodology, Visualization. SL: Data curation, Investigation, Methodology, Visualization, Writing – review & editing. YH: Conceptualization, Data curation, Methodology, Writing – review & editing. ST: Validation, Writing – review & editing. CZ: Conceptualization, Writing – review & editing. GC: Investigation, Writing – review & editing. MY: Formal analysis, Supervision, Writing – review & editing. YL: Resources, Validation, Writing – review & editing. QW: Resources, Writing – review & editing. PP: Funding acquisition, Resources, Supervision, Validation, Writing – review & editing.

Funding

The author(s) declared that financial support was received for work and/or its publication. This work was supported by the National Natural Science Foundation of China (Nos. 81760305 and 81641066), Natural Science Foundation of Guangxi Zhuang Autonomous Region, China (Nos. 2023GXNSFAA026009, 2023GXNSFBA026120), Guangxi Zhuang Autonomous Region Health Committee Scientific Research Project (No. Z20200519), The Key Laboratory of Children’s Disease Research in Guangxi’s Colleges and Universities (No. GXCDR2023001), Guangxi Medical and Health Appropriate Technology Development and Promotion Application Project (No.S2022072), Advanced Innovation Teams and Xinghu Scholars Program of Guangxi Medical University and Beijing Medical Award Foundation Ruiying Scientific Research Fund Key Project (No. YXJL-2024-0530-0321) and Innovation Project of Guangxi Graduate Education (No. YCSW2024236).

Acknowledgments

We are particularly grateful to all the people who have given us help on our article.

Conflict of interest

The authors declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Pavel M, Öberg K, Falconi M, Krenning EP, Sundin A, Perren A, et al. Gastroenteropancreatic neuroendocrine neoplasms: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann Oncol. (2020) 31:844–60. doi: 10.1016/j.annonc.2020.03.304

PubMed Abstract | Crossref Full Text | Google Scholar

2. Cives M and Strosberg JR. Gastroenteropancreatic neuroendocrine tumors. CA Cancer J Clin. (2018) 68:471–87. doi: 10.3322/caac.21493

PubMed Abstract | Crossref Full Text | Google Scholar

3. Zheng R, Zhao H, An L, Zhang S, Chen R, Wang S, et al. Incidence and survival of neuroendocrine neoplasms in China with comparison to the United States. Chin Med J (Engl). (2023) 136:1216–24. doi: 10.1097/CM9.0000000000002643

PubMed Abstract | Crossref Full Text | Google Scholar

4. Das S and Dasari A. Epidemiology, incidence, and prevalence of neuroendocrine neoplasms: are there global differences? Curr Oncol Rep. (2021) 23:43. doi: 10.1007/s11912-021-01029-7

PubMed Abstract | Crossref Full Text | Google Scholar

5. Mpilla GB, Philip PA, El-Rayes B, and Azmi AS. Pancreatic neuroendocrine tumors: Therapeutic challenges and research limitations. World J Gastroenterol. (2020) 26:4036–54. doi: 10.3748/wjg.v26.i28.4036

PubMed Abstract | Crossref Full Text | Google Scholar

6. Dasari A, Shen C, Halperin D, Zhao B, Zhou S, Xu Y, et al. Trends in the incidence, prevalence, and survival outcomes in patients with neuroendocrine tumors in the United States. JAMA Oncol. (2017) 3:1335. doi: 10.1001/jamaoncol.2017.0589

PubMed Abstract | Crossref Full Text | Google Scholar

7. Madani A, Thomassen I, van Gestel YRBM, van der Bilt JDW, Haak HR, de Hingh IHJT, et al. Peritoneal metastases from gastroenteropancreatic neuroendocrine tumors: incidence, risk factors and prognosis. Ann Surg Oncol. (2017) 24:2199–205. doi: 10.1245/s10434-016-5734-x

PubMed Abstract | Crossref Full Text | Google Scholar

8. Falconi M, Eriksson B, Kaltsas G, Bartsch DK, Capdevila J, Caplin M, et al. ENETS consensus guidelines update for the management of patients with functional pancreatic neuroendocrine tumors and non-functional pancreatic neuroendocrine tumors. Neuroendocrinology. (2016) 103:153–71. doi: 10.1159/000443171

PubMed Abstract | Crossref Full Text | Google Scholar

9. Zhou H, Wang Y, Guo C, Li X, Cui W, Wang Z, et al. Microscopic invasion of nerve is associated with aggressive behaviors in pancreatic neuroendocrine tumors. Front Oncol. (2021) 11:630316. doi: 10.3389/fonc.2021.630316

PubMed Abstract | Crossref Full Text | Google Scholar

10. Choe J, Kim KW, Kim HJ, Kim DW, Kim KP, Hong SM, et al. What is new in the 2017 world health organization classification and 8th american joint committee on cancer staging system for pancreatic neuroendocrine neoplasms? Korean J Radiol. (2019) 20:5–17. doi: 10.3348/kjr.2018.0040

PubMed Abstract | Crossref Full Text | Google Scholar

11. Javed AA, Pulvirenti A, Razi S, Zheng J, Michelakos T, Sekigami Y, et al. Grading pancreatic neuroendocrine tumors via endoscopic ultrasound-guided fine needle aspiration: A multi-institutional study. Ann Surg. (2023) 277:e1284–90. doi: 10.1097/SLA.0000000000005390

PubMed Abstract | Crossref Full Text | Google Scholar

12. Greener JG, Kandathil SM, Moffat L, and Jones DT. A guide to machine learning for biologists. Nat Rev Mol Cell Biol. (2022) 23:40–55. doi: 10.1038/s41580-021-00407-0

PubMed Abstract | Crossref Full Text | Google Scholar

13. Cabitza F, Rasoini R, and Gensini GF. Unintended consequences of machine learning in medicine. JAMA. (2017) 318:517–8. doi: 10.1001/jama.2017.7797

PubMed Abstract | Crossref Full Text | Google Scholar

14. Shortliffe EH and Sepúlveda MJ. Clinical decision support in the era of artificial intelligence. JAMA. (2018) 320:2199–200. doi: 10.1001/jama.2018.17163

PubMed Abstract | Crossref Full Text | Google Scholar

15. Lundberg S and Lee SI. A Unified approach to interpreting model predictions. (2017) arXiv [Preprint]. Available online at: http://arxiv.org/abs/1705.07874.

Google Scholar

16. Maggino L, Schmidt A, Käding A, Westermark S, Ceppa EP, Falconi M, et al. Reappraisal of a 2-cm cut-off size for the management of cystic pancreatic neuroendocrine neoplasms: A multicenter international study. Ann Surg. (2021) 273:973–81. doi: 10.1097/SLA.0000000000003508

PubMed Abstract | Crossref Full Text | Google Scholar

17. Nagtegaal ID, Odze RD, Klimstra D, Paradis V, Rugge M, Schirmacher P, et al. The 2019 WHO classification of tumours of the digestive system. Histopathology. (2020) 76:182–8. doi: 10.1111/his.13975

PubMed Abstract | Crossref Full Text | Google Scholar

18. Ren S, Qian L, Daniels MJ, Duan S, Chen R, and Wang Z. Evaluation of contrast-enhanced computed tomography for the differential diagnosis of hypovascular pancreatic neuroendocrine tumors from chronic mass-forming pancreatitis. Eur J Radiol. (2020) 133:109360. doi: 10.1016/j.ejrad.2020.109360

PubMed Abstract | Crossref Full Text | Google Scholar

19. Canellas R, Burk KS, Parakh A, and Sahani DV. Prediction of pancreatic neuroendocrine tumor grade based on CT features and texture analysis. AJR Am J Roentgenol. (2018) 210:341–6. doi: 10.2214/AJR.17.18417

PubMed Abstract | Crossref Full Text | Google Scholar

20. Dw K, Hj K, Kw K, Jh B, Kb S, Jh K, et al. Neuroendocrine neoplasms of the pancreas at dynamic enhanced CT: comparison between grade 3 neuroendocrine carcinoma and grade 1/2 neuroendocrine tumour. Eur Radiol. (2015) 25:1375–83. doi: 10.1007/s00330-014-3532-z

PubMed Abstract | Crossref Full Text | Google Scholar

21. Mizumoto T, Toyama H, Terai S, Mukubou H, Yamashita H, Shirakawa S, et al. Prediction of lymph node metastasis in pancreatic neuroendocrine tumors by contrast enhancement characteristics. Pancreatology. (2017) 17:956–61. doi: 10.1016/j.pan.2017.08.003

PubMed Abstract | Crossref Full Text | Google Scholar

22. Ren S, Daniels MJ, and Wang Z. Can relative enhancement ratio of portal venous phase to unenhanced CT be used to differentiate lipid-poor adrenal adenomas from adrenal hyperplasia? Radiology. (2022) 303:E37. doi: 10.1148/radiol.212331

PubMed Abstract | Crossref Full Text | Google Scholar

23. Kim SY, Kim H, Cho JY, Lim S, Cha K, Lee KH, et al. Quantitative assessment of pancreatic fat by using unenhanced CT: pathologic correlation and clinical implications. Radiology. (2014) 271:104–12. doi: 10.1148/radiol.13122883

PubMed Abstract | Crossref Full Text | Google Scholar

24. Martin MS, Wells GA, Crocker AG, Potter BK, and Colman I. Decision curve analysis as a framework to estimate the potential value of screening or other decision-making aids. Int J Methods Psychiatr Res. (2018) 27:e1601. doi: 10.1002/mpr.1601

PubMed Abstract | Crossref Full Text | Google Scholar

25. Huang J, Xie X, Wu H, Zhang X, Zheng Y, Xie X, et al. Development and validation of a combined nomogram model based on deep learning contrast-enhanced ultrasound and clinical factors to predict preoperative aggressiveness in pancreatic neuroendocrine neoplasms. Eur Radiol. (2022) 32:7965–75. doi: 10.1007/s00330-022-08703-9

PubMed Abstract | Crossref Full Text | Google Scholar

26. Mori M, Palumbo D, Muffatti F, Partelli S, Mushtaq J, Andreasi V, et al. Prediction of the characteristics of aggressiveness of pancreatic neuroendocrine neoplasms (PanNENs) based on CT radiomic features. Eur Radiol. (2023) 33:4412–21. doi: 10.1007/s00330-022-09351-9

PubMed Abstract | Crossref Full Text | Google Scholar

27. Shen X, Yang F, Jiang T, Zheng Z, Chen Y, Tan C, et al. A nomogram to preoperatively predict the aggressiveness of non-functional pancreatic neuroendocrine tumors based on CT features. Eur J Radiol. (2024) 171:111284. doi: 10.1016/j.ejrad.2023.111284

PubMed Abstract | Crossref Full Text | Google Scholar

28. Petch J, Di S, and Nelson W. Opening the black box: the promise and limitations of explainable machine learning in cardiology. Can J Cardiol. (2022) 38:204–13. doi: 10.1016/j.cjca.2021.09.004

PubMed Abstract | Crossref Full Text | Google Scholar

29. Okabe H, Hashimoto D, Chikamoto A, Yoshida M, Taki K, Arima K, et al. Shape and enhancement characteristics of pancreatic neuroendocrine tumor on preoperative contrast-enhanced computed tomography may be prognostic indicators. Ann Surg Oncol. (2017) 24:1399–405. doi: 10.1245/s10434-016-5630-4

PubMed Abstract | Crossref Full Text | Google Scholar

30. Zhu HB, Nie P, Jiang L, Hu J, Zhang XY, Li XT, et al. Preoperative prediction of lymph node metastasis in nonfunctioning pancreatic neuroendocrine tumors from clinical and MRI features: a multicenter study. Insights Imaging. (2022) 13:162. doi: 10.1186/s13244-022-01301-9

PubMed Abstract | Crossref Full Text | Google Scholar

31. Guilmette JM and Nosé V. Neoplasms of the neuroendocrine pancreas: an update in the classification, definition, and molecular genetic advances. Adv Anat Pathol. (2019) 26:13–30. doi: 10.1097/PAP.0000000000000201

PubMed Abstract | Crossref Full Text | Google Scholar

32. Xue Y, Reid MD, Pehlivanoglu B, Obeng RC, Jiang H, Memis B, et al. Morphologic variants of pancreatic neuroendocrine tumors: clinicopathologic analysis and prognostic stratification. Endocr Pathol. (2020) 31:239–53. doi: 10.1007/s12022-020-09628-z

PubMed Abstract | Crossref Full Text | Google Scholar

33. Worhunsky DJ, Krampitz GW, Poullos PD, Visser BC, Kunz PL, Fisher GA, et al. Pancreatic neuroendocrine tumours: hypoenhancement on arterial phase computed tomography predicts biological aggressiveness. HPB (Oxford). (2014) 16:304–11. doi: 10.1111/hpb.12139

PubMed Abstract | Crossref Full Text | Google Scholar

34. Bettini R, Partelli S, Boninsegna L, Capelli P, Crippa S, Pederzoli P, et al. Tumor size correlates with Malignancy in nonfunctioning pancreatic endocrine tumor. Surgery. (2011) 150:75–82. doi: 10.1016/j.surg.2011.02.022

PubMed Abstract | Crossref Full Text | Google Scholar

35. Chen HY, Pan Y, Chen JY, Liu LL, Yang YB, Li K, et al. Quantitative analysis of enhanced CT in differentiating well-differentiated pancreatic neuroendocrine tumors and poorly differentiated neuroendocrine carcinomas. Eur Radiol. (2022) 32:8317–25. doi: 10.1007/s00330-022-08891-4

PubMed Abstract | Crossref Full Text | Google Scholar

36. Couvelard A, O’Toole D, Turley H, Leek R, Sauvanet A, Degott C, et al. Microvascular density and hypoxia-inducible factor pathway in pancreatic endocrine tumours: negative correlation of microvascular density and VEGF expression with tumour progression. Br J Cancer. (2005) 92:94–101. doi: 10.1038/sj.bjc.6602245

PubMed Abstract | Crossref Full Text | Google Scholar

37. Singhi AD and Klimstra DS. Well-differentiated pancreatic neuroendocrine tumours (PanNETs) and poorly differentiated pancreatic neuroendocrine carcinomas (PanNECs): concepts, issues and a practical diagnostic approach to high-grade (G3) cases. Histopathology. (2018) 72:168–77. doi: 10.1111/his.13408

PubMed Abstract | Crossref Full Text | Google Scholar

38. M Y, S M, Dh C, A S, and Cf H. Pancreatic neuroendocrine tumors: computed tomography enhancement, but not histological grade, correlates with tumor aggression. Pancreas. (2017) 46:1366–72. doi: 10.1097/MPA.0000000000000922

PubMed Abstract | Crossref Full Text | Google Scholar

39. Zhang J, Jin J, Ai Y, Zhu K, Xiao C, Xie C, et al. Computer tomography radiomics-based nomogram in the survival prediction for brain metastases from non-small cell lung cancer underwent whole brain radiotherapy. Front Oncol. (2020) 10:610691. doi: 10.3389/fonc.2020.610691

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: pancreatic neuroendocrine tumor, aggressiveness, computed tomography, machine learning, interpretable model, shap, preoperative prediction

Citation: Kong R, Lu S, Huang Y, Tan S, Zhu C, Chen G, Yang M, Liu Y, Wu Q and Peng P (2025) A CT-based interpretable machine learning model for preoperative prediction of pancreatic neuroendocrine tumor aggressiveness. Front. Oncol. 15:1665601. doi: 10.3389/fonc.2025.1665601

Received: 14 July 2025; Accepted: 26 November 2025; Revised: 19 November 2025;
Published: 11 December 2025.

Edited by:

Maria Chiara Zatelli, University of Ferrara, Italy

Reviewed by:

Xiangde Min, Huazhong University of Science and Technology, China
Hai-Yan Chen, University of Chinese Academy of Sciences, China

Copyright © 2025 Kong, Lu, Huang, Tan, Zhu, Chen, Yang, Liu, Wu and Peng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Peng Peng, ZG91YmxlcEAxMjYuY29t

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.