Radiomics-based multiple machine learning approaches for investigating medial wall invasion of the cavernous sinus in pituitary adenomas

Chen, Yuyang; Zhong, Jiansheng; Hou, Pengwei; Wang, Xiaoyu; Li, Jun; Li, Ziqi; Feng, Tianshun; Wei, Liangfeng; Chen, Yuhui; Wang, Shousen

doi:10.3389/fonc.2025.1706895

ORIGINAL RESEARCH article

Front. Oncol., 20 November 2025

Sec. Neuro-Oncology and Neurosurgical Oncology

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1706895

This article is part of the Research TopicMultimodal Imaging in Neuro-Oncology: Advances in Nuclear Medicine and MRI for Precision Diagnostics and TherapyView all 3 articles

Radiomics-based multiple machine learning approaches for investigating medial wall invasion of the cavernous sinus in pituitary adenomas

Yuyang Chen^1,2,3†

Jiansheng Zhong^1,2†

Pengwei Hou^4†

Xiaoyu Wang⁵

Jun Li^1,2

Ziqi Li^1,2

Tianshun Feng^1,2

Liangfeng Wei^1,2

Yuhui Chen^1,2*

Shousen Wang^1,2*

¹Department of Neurosurgery, Fuzong Clinical Medical College of Fujian Medical University, Fuzhou, Fujian, China
²Fujian Provincial Clinical Medical Research Center for Minimally Invasive Diagnosis and Treatment of Neurovascular Diseases, Fuzhou, Fujian, China
³Department of Neurosurgery, Huashan Hospital, Fudan University, Shanghai, China
⁴Department of Neurosurgery, Shanghai Sixth People’s Hospital Fujian, Jinjiang, China
⁵Department of Neurosurgery, Fuzhou Changle District People’s Hospital, Fuzhou, Fujian, China

Objective: This study aims to develop a predictive model for cavernous sinus dural invasion in pituitary adenomas by retrospectively analyzing clinical and imaging data. It explores the associations between clinical and radiomics features and cavernous sinus dural invasion.

Methods: Clinical data and coronal T2-weighted MRI images were collected from patients diagnosed with pituitary adenomas at our institution between December 2012 and December 2022. Tumor regions of interest (ROIs) were segmented using 3D Slicer, and radiomics features were extracted. Statistically significant radiomics features were identified using Lasso regression and univariate analysis. Clinical features were screened using univariate and multivariate logistic regression analyses. These selected features were incorporated into ten machine learning algorithms to construct three predictive models: a clinical feature model, a radiomics feature model, and a combined clinical and radiomics feature model. Model performance was evaluated to determine the best-performing model, which was further interpreted.

Results: A total of 252 patients with histopathologically confirmed pituitary adenomas were included. The analysis identified Knosp grade, tumor left-right diameter, pedunculated satellite tumor, and clival invasion as significant clinical predictors, along with radiomics features including original.4, original.10, log-sigma-5-0-mm-3D.29, log-sigma-5-0-mm-3D.91, wavelet-LLH.37, wavelet-LHL.37, and wavelet-HLL.8. The combined clinical and radiomics model outperformed models based solely on clinical or radiomics features. Among the ten machine learning algorithms, the LightGBM model demonstrated the best predictive performance, achieving an area under the curve (AUC) of 0.86 and an accuracy (ACC) of 0.76.

Conclusions: A machine learning model integrating clinical and radiomics features can effectively predict cavernous sinus dural invasion in pituitary adenomas preoperatively, providing a reliable basis for diagnosing tumor invasiveness and developing surgical plans. The LightGBM algorithm exhibited the highest predictive efficacy. Furthermore, the pedunculated satellite tumor feature emerged as a novel imaging marker for cavernous sinus dural invasion, offering new insights into the study of invasive pituitary adenomas.

1 Introduction

Pituitary adenomas are benign tumors originating from the anterior pituitary gland, accounting for approximately 15–20% of all intracranial tumors (1, 2). These tumors typically grow expansively, invading surrounding tissues and exerting mass effects that lead to neurological symptoms (3). The sellar region, surrounded by transitional dura mater, is particularly vulnerable to such invasion (4). As pituitary adenomas grow invasively, tumor tissue can compress and breach the adjacent dura mater, potentially extending into structures such as the suprasellar region and cavernous sinus (5). Early studies identified dural invasion as a key criterion for determining tumor invasiveness (6, 7), making the assessment of dural invasion a primary focus in the study of pituitary adenomas’ aggressiveness.

The cavernous sinus consists of five walls, with the medial wall, a single-layered dura separating it from the pituitary gland, being particularly significant in the context of invasion. Anatomical defects in some individuals may contribute to cavernous sinus invasion (8). Medial wall invasion of the cavernous sinus has long been considered a hallmark of aggressive pituitary adenomas. Radiological detection of medial wall involvement is an important preoperative method for evaluating cavernous sinus dural invasion. However, despite high-resolution T2-weighted imaging, the medial wall of the cavernous sinus remains difficult to visualize clearly (9). The 3D-SPACE sequence has emerged as one of the most effective MRI techniques for evaluating the dura mater due to its unique imaging properties (10). However, its long acquisition time and high cost limit its widespread clinical application. Therefore, accurate assessment of cavernous sinus dural invasion using standard MRI sequences remains a critical area of investigation. The “pituitary adenoma dural invasion channel theory” has been proposed, along with novel markers such as “pedunculated satellite tumors” and “interdural tumor,” to facilitate the preoperative assessment of cavernous sinus dural invasion in pituitary adenomas (11, 12). Their study indicated that intraoperative complete resection of the invaded medial wall of the cavernous sinus significantly affects the recurrence of pituitary adenomas. However, routine resection of the medial wall of the cavernous sinus during surgery may substantially increase the risk of intraoperative hemorrhage, thereby compromising surgical quality and postoperative recovery. Therefore, accurate preoperative prediction of medial wall invasion of the cavernous sinus can provide valuable guidance for neurosurgeons in determining whether to perform resection of the medial wall. Beyond cavernous sinus dural invasion, surgical complexity in pituitary surgery is also influenced by tumor consistency. Recent studies have introduced radiological surrogates of consistency—most notably the T2-weighted signal intensity ratio (T2SIR)—and shown that firmer tumors (lower T2SIR) are associated with reduced odds of gross-total resection and greater operative difficulty in non-functioning pituitary adenomas (13, 14). These advances contextualize the present work, which specifically targets preoperative identification of cavernous sinus dural invasion using conventional T2-weighted imaging and radiomics.

Building on this foundational research, our study focuses on MRI T2-weighted imaging to investigate cavernous sinus dural invasion in pituitary adenomas. Radiomics involves the analysis and processing of medical images, such as CT, MRI, and PET scans, to extract radiomic features used for screening, diagnosis, follow-up, and prognosis (15, 16). The integration of radiomics with machine learning has become a critical approach in clinical research. In this study, multiple machine learning algorithms were applied to analyze and extract radiomic features from coronal T2-weighted images of pituitary adenomas. A preoperative predictive model for cavernous sinus dural invasion in pituitary adenomas was developed to support the assessment of tumor invasiveness and inform surgical planning (17, 18).

2 Method

2.1 Study subjects

Clinical and imaging data were consecutively collected from patients who underwent transsphenoidal pituitary adenoma resection under microscopy at the Department of Neurosurgery in our hospital between December 2012 and December 2022. A total of 252 cases were included in the analysis after applying strict inclusion and exclusion criteria. Although consecutive sampling was used over a 10-year period, all cases were obtained from a single tertiary medical center, which may limit the generalizability of our findings. Selection bias could not be entirely excluded, as only patients with complete imaging and surgical data were included. However, consecutive case inclusion helps to reduce subjective selection and reflects real-world clinical practice. This study was approved by the Ethics Committee of the 900th Hospital of the People’s Liberation Army Joint Logistic Support Force in Fuzhou, Fujian, China (Approval Number: Ethics Review No. 2024-006). All procedures followed the principles outlined in the Declaration of Helsinki. Written informed consent for the reuse of general data was obtained from all participants during their hospital stay.

2.1.1 Inclusion criteria

The inclusion criteria were as follows: 1) Complete imaging and clinical data; 2) Underwent transsphenoidal pituitary adenoma resection at our hospital; 3) Postoperative pathology and immunohistochemistry confirmed the diagnosis of pituitary adenoma.

2.1.2 Exclusion criteria

The exclusion criteria were as follows: 1) Patients who received preoperative or postoperative cranial radiotherapy; 2) Patients with a history of sellar region surgery or medication for pituitary conditions; 3) Incomplete clinical or imaging data; 4) Coexisting brain trauma, meningitis, brain abscess, or cerebrovascular diseases; 5) Coexisting intracranial multiple tumors or malignancies. After rigorous screening, 252 cases were included in the analysis, comprising 132 males and 120 females.

2.2 Variable selection

In our study, demographic information and disease-specific characteristics were collected based on a review of the literature. Data were extracted from patients’ electronic medical records. Potential risk factors for cavernous sinus dural invasion of pituitary adenomas were identified based on published data, clinical expertise, and practical considerations for future clinical implementation.

Clinical data (19, 20) included gender, age, height, weight, BMI, and obesity classification (21) (categorized as 1 for lean, 2 for underweight, 3 for overweight, and 4 for obese). Imaging data (22) included tumor height, tumor anteroposterior diameter, tumor left-right diameter, tumor volume, Knosp classification (categorized as 1 for grades 0–1, 2 for grades 2–3a, and 3 for grades 3b–4), pedunculated satellite tumor (Figure 1A), tumor cystic change, tumor apoplexy, sphenoid sinus invasion (Figure 1B), and clival invasion (Figure 1C).

Figure 1

Three-panel image showing brain scans. Panel A: Coronal MRI with a red arrow indicating an area of interest, possibly an abnormality. Panel B: Sagittal CT displaying a yellow arrow pointing to a specific region, suggesting an anomaly. Panel C: Axial CT with a yellow arrow highlighting a specific structure or condition.

Figure 1. Imaging markers on different sequences. (A) Pituitary adenoma invading the left cavernous sinus is visible on an MRI T2-weighted image. A pedunculated satellite tumor (indicated by the red arrow) protrudes from the main tumor. The satellite tumor is clearly demarcated from the primary tumor, encapsulated, and exhibits characteristics of benign growth. (B) A sagittal CT image shows the posterior wall of the sphenoid sinus invaded by the pituitary tumor (indicated by the yellow arrow). The tumor breaches the posterior wall and extends into the sphenoid sinus cavity. (C) An axial CT image reveals the pituitary tumor invading the clivus (indicated by the yellow arrow). The clivus shows irregular, indistinct edges, with thinning of the bony structure.

Radiomics features were extracted using 3D Slicer by two radiologists and one neurosurgical attending physician who independently outlined the regions of interest (ROI) on coronal T2-weighted images(Figure 2). To assess inter-observer variability, 30 randomly selected cases were segmented by all three observers, and the Dice similarity coefficient (DSC) was calculated to evaluate agreement. Consensus was reached for the final segmentation through discussion. A variety of radiomic features—including first-order statistics, texture features (GLCM, GLDM, GLRLM, GLSZM, NGTDM), and shape descriptors—were extracted. Feature selection was based on variance, correlation, and SHAP importance scores. Wavelet-based features were retained due to their ability to capture directional and high-frequency texture patterns relevant to tumor invasion. The dataset was divided into training (70%) and testing (30%) sets using stratified sampling to ensure balanced representation of invasive and non-invasive cases. The gold standard for cavernous sinus dural invasion in this study was determined by intraoperative identification of defects in the medial wall of the cavernous sinus (Figure 3).

Figure 2

MRI scan images labeled A and B show a cross-sectional view of a brain, with B highlighting a tumor in green. Image C illustrates a 3D model of the tumor in green against a purple background.

Figure 2. Delineation of the pituitary adenoma ROI using 3D slicer. (A) On T2-weighted imaging, a large pituitary adenoma is seen invading the suprasellar region and cavernous sinus, with cystic changes within the tumor. (B) The 3D Slicer tool was used to outline the full-layer ROI of the pituitary adenoma, showing the complete boundaries of the tumor. (C) After ROI delineation, a three-dimensional reconstruction was performed to visualize the tumor’s spatial structure.

Figure 3

Close-up comparison of two surgical images labeled A and B. Image A shows yellow arrows indicating specific tissue structures and green arrows positioned above, with black arrows pointing to other areas. Image B reveals a red, complex, and moist tissue structure without visible markings.

Figure 3. Intraoperative findings of dural invasion. (A) During microscopic transsphenoidal resection of a pituitary adenoma, the pituitary fossa (black arrow), cavernous sinus (green arrow), and medial wall of the cavernous sinus (yellow arrow) are clearly visible, showing well-defined structures. (B) Intraoperative observation reveals that the medial wall of the cavernous sinus has been disrupted by the tumor, leaving only a thin collagen fiber mesh. The integrity of the dura mater is significantly compromised.

2.3 Data preprocessing and model construction

The dataset of 252 patients was randomly divided into a training set (70%) and a test set (30%). Prior to model construction, clinical data were analyzed. Chi-square tests (for categorical variables) and nonparametric tests (for continuous variables) were applied to assess the significance of each variable (P < 0.05), identifying initial clinical predictors associated with cavernous sinus dural invasion. These selected clinical variables were further evaluated using univariate logistic regression analysis to confirm their statistical correlation with cavernous sinus dural invasion. For radiomics features, data standardization and dimensionality reduction were first performed. The Lasso regression algorithm was then applied to identify representative radiomics features. Subsequently, the selected features underwent univariate logistic regression analysis to filter statistically significant radiomics predictors. After feature selection, ten representative supervised machine learning algorithms were employed to construct predictive models: Decision Tree (DT), Random Forest (RF), Logistic Regression (LR), K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Naive Bayes (NB), Light Gradient Boosting Machine (LightGBM), Adaptive Boosting (AdaBoost), Extreme Gradient Boosting (XGBoost), and K-Means Clustering (K-Means). To evaluate model stability and predictive power, 10-fold cross-validation was conducted. Specifically, the dataset was randomly divided into ten subsets, with nine subsets used as the training set and one as the validation set in each iteration, repeated ten times. The predictive outcomes from the cross-validation process were compared to assess model reliability. Upon model construction, the test set was used to evaluate the performance of each model. Metrics including accuracy (ACC), true positive rate (TPR), false positive rate (FPR), positive predictive value (PPV), F1 score (FSC), sensitivity (SEN), specificity (SPE), and negative predictive value (NPV) were calculated and compared. Receiver operating characteristic (ROC) curves were plotted, and the area under the curve (AUC) was calculated to assess the classification performance of each model. Additionally, the goodness-of-fit of the models was evaluated using the Hosmer-Lemeshow test, and calibration curves were plotted to verify model calibration. Decision curve analysis (DCA) was conducted to assess the clinical utility of the models. Model selection was based on predictive performance and stability criteria, as detailed in the Results section. A performance evaluation chart was created to summarize the overall performance of each model. A comprehensive report was also generated to provide further guidance for clinical application and research purposes.

2.4 Statistical analysis

This study analyzed existing data using Python version 3.1.2. Categorical variables were presented as percentages and analyzed using the Chi-square test or Fisher’s exact test. Continuous variables were processed based on their distribution characteristics. Variables following a normal distribution were expressed as mean ± standard deviation (Mean ± SD) and compared using the independent sample t-test. Variables not following a normal distribution were presented as median and interquartile range (Median [IQR]) and analyzed using nonparametric tests. A two-sided P-value of <0.05 was considered statistically significant. As this is an exploratory study, multiple comparison corrections were not performed to avoid excessively reducing the statistical significance level. The aforementioned methods ensure the scientific rigor and reliability of the analysis results.

3 Results

3.1 Screening of clinical feature variables

A total of 252 pituitary adenoma patients were included in this study, comprising 132 males and 120 females. The univariate analysis (Table 1) revealed significant differences in multiple clinical characteristics between the cavernous sinus invasion (CSI) group and the non-invasion group (P < 0.05). Specifically, eight variables showed significant differences: tumor anteroposterior diameter, tumor left-right diameter, tumor height, tumor volume, Knosp grade, pedunculated satellite tumor, sphenoid sinus invasion, and clivus invasion (Figure 4).

Table 1

Table 1. Baseline characteristics of variables associated with cavernous sinus invasion following transnasal endoscopic pituitary adenoma resection.

Figure 4

Forest plot showing Pearson correlation coefficients for various variables related to tumor characteristics. Clinical data include age, gender, height, obesity degree, weight, and BMI, with low correlation coefficients. Imaging features and tumor size, such as Knosp grade and tumor volume, show higher coefficients with significant P-values. Markers of dural invasion also indicate significant correlations.

Figure 4. Forest plot of univariate analysis. This figure presents the effects and statistical significance of each variable identified in the univariate analysis. The variables are categorized into clinical baseline information, imaging characteristics, tumor size, and cavernous sinus dural invasion markers. On the left, the regression estimates and their 95% confidence intervals for each variable are displayed, clearly illustrating the independent impact of the variables on cavernous sinus dural invasion. On the right, a Pearson correlation coefficient forest plot evaluates inter-variable correlations. Statistical significance is indicated by P-values, with P < 0.05 denoting significant differences. Variables such as Knosp grade, tumor height, tumor anteroposterior diameter, tumor lateral diameter, pedunculated satellite tumor, tumor volume, clival invasion, and sphenoid sinus invasion showed significant associations. ***This indicates that the result is statistically significant (p < 0.05)

Further multivariate logistic regression analysis, with a selection threshold of P < 0.1, was performed to evaluate the independence and impact of each variable whereas controlling for potential confounding factors (23) (Figure 5). The analysis confirmed that Knosp grade, tumor left-right diameter, pedunculated satellite tumor, and clivus invasion were independent influencing factors. These variables were subsequently included in the feature set for constructing machine learning models to predict the risk of cavernous sinus invasion (Figure 6).

Figure 5

A graph depicting a scoring system for tumor assessment. It includes scales for Knosp grading, clival invasion, tumor diameter, and satellite tumors, contributing to an overall point scale. The lower section shows a positive risk curve against overall points, with a threshold line at 0.4474 indicated.

Figure 5. Nomogram for multivariate analysis of cavernous sinus dural invasion. A nomogram was developed based on the results of multivariate analysis to assess the risk of cavernous sinus dural invasion. This nomogram integrates various independent risk factors, providing a comprehensive risk evaluation for cavernous sinus dural invasion. The total score and corresponding risk evaluation curve quantitatively illustrate the influence of each factor on cavernous sinus dural invasion, facilitating individualized prediction and aiding in clinical decision-making.

Figure 6

ROC curve chart displaying performance of five features in distinguishing true positive rates against false positive rates. Features include const, Knosp Grade, left-right diameter of tumor, pedunculated satellite tumor, and clival invasion, with AUC values ranging from 0.50 to 0.75. The green line (left-right diameter of tumor) shows the highest performance at 0.75.

Figure 6. ROC curves of different features. ROC curves were generated for the selected clinical features to evaluate their association with cavernous sinus dural invasion in pituitary adenomas. The results demonstrated that all selected features exhibited varying degrees of predictive performance in univariate analysis, further supporting their significant correlation with cavernous sinus dural invasion.

3.2 Prediction models based on clinical features

Prediction models were constructed using clinical features and ten different machine learning algorithms. ROC curves were plotted for both the training and testing datasets, and the AUC was calculated to evaluate model performance (Figure 7). Calibration curves were also analyzed to assess the goodness-of-fit and generalization ability of the models, verifying their stability and consistency across datasets. The results (Table 2) show that prediction models based on clinical features effectively predict cavernous sinus dural invasion in pituitary adenomas, providing a foundational reference for subsequent integrated radiomic feature modeling.

Figure 7

Three charts depict the performance of multiple machine learning models. The first two charts are ROC curves for training and test sets, showing true positive rate against false positive rate, with models like Decision Tree, Random Forest, and SVC, having AUCs ranging from 0.74 to 0.84. The third chart is a decision curve analysis with net benefit plotted against threshold probability, comparing models and strategies like “Treat all” and “Treat none.

Figure 7. Training set ROC, testing set ROC, and DCA curves for machine learning models based on clinical features. This figure illustrates the Receiver Operating Characteristic (ROC) curves of machine learning models based on clinical features for both the training and testing datasets, along with the corresponding Decision Curve Analysis (DCA) results. The findings indicate that the models exhibit good classification performance on both datasets. Additionally, the DCA curves suggest that the models have potential clinical utility.

Table 2

Table 2. Performance comparison and consistency testing of ten machine learning models based on clinical features.

Among the evaluated models, GNB demonstrated the most favorable calibration and discrimination characteristics. It achieved the highest test AUC (0.81), indicating excellent discriminatory power, and recorded the lowest Brier score (0.18), reflecting superior accuracy in probability estimation. Furthermore, the Hosmer-Lemeshow test yielded a non-significant p-value (p=0.40), suggesting good agreement between predicted and observed outcomes. The model also showed balanced classification performance, with a sensitivity of 0.82 and an F1-score of 0.74. Given its strong discriminative ability, optimal calibration, and simplicity in implementation, GNB was selected as the preferred predictive model in this study.

3.3 Machine learning models based on radiomic features

Through manual extraction of pituitary adenoma imaging features from T2 coronal images using 3D Slicer, a total of 1073 imaging features were obtained, covering morphological, texture, and advanced radiomic characteristics. Lasso regression was applied to select 9 statistically significant features, which were further filtered using univariate regression analysis. 7 imaging features, including original.4, original.10, log-sigma-5-0-mm-3D.29, log-sigma-5-0-mm-3D.91, wavelet-LLH.37, wavelet-LHL.37, and wavelet-HLL.8, were included for study and model construction (Figure 8).The seven radiomic features used in the final model describe different characteristics of the tumor, such as the overall brightness and contrast (first-order features), the texture and patterns inside the tumor (texture features), and information from filtered images that highlight edges or fine details (LoG and wavelet features). These features help to capture both the appearance and internal structure of the tumor on imaging.

Figure 8

Panel A shows a Lasso cross-validation plot with error bars, depicting mean squared error versus log(lambda). Panel B illustrates Lasso coefficient paths with various coefficients as lambda changes. Panel C features a Receiver Operating Characteristic (ROC) curve for each feature, indicating true positive rate versus false positive rate, with varying Area Under Curve (AUC) values. Panel D similarly shows multiple ROC curves with corresponding AUC values.

Figure 8. Selection of radiomic features. (A, B). Lasso regression was used to select radiomic features with statistical significance. (C, D). The ROC curve for each selected feature.

Based on the selected radiomic features, predictive models were established using ten different machine learning algorithms. ROC curves were plotted for both the training and testing sets, and AUC was calculated to evaluate model performance. Additionally, calibration curves were analyzed to assess model fitting and generalization ability (Figure 9).

Figure 9

Three graphs display ROC and DCA curves for various machine learning models. The first graph shows ROC curves on a training set, with AUC values ranging from 0.75 to 0.85 for different models. The second graph presents ROC curves on a test set, with AUC values from 0.72 to 0.84. The third graph displays DCA curves for the models, showing net benefit across threshold probabilities. Each graph includes a legend identifying models like Decision Tree, KNN, Random Forest, and others.

Figure 9. ROC, test set ROC, and DCA curves for the machine learning model based on radiomic features. This figure illustrates the ROC curves for the machine learning model constructed based on radiomic features in both the training and test sets, along with the corresponding Decision Curve Analysis (DCA) results. The results show that the model based on radiomic features demonstrates good classification ability in both the training and test sets, and the DCA curve indicates its potential clinical applicability.

The results indicate that the models based on radiomic features exhibited good classification ability in both the training and testing sets. The performance of these models was generally superior to models based solely on clinical features, particularly in key metrics such as AUC, sensitivity, and specificity. This highlights the significant value of radiomic features in predicting cavernous sinus dural invasion in pituitary adenomas and provides strong support for subsequent integrated clinical-radiomic modeling (Table 3).

Table 3

Table 3. Performance comparison and consistency test results of ten machine learning models built on radiomic features.

Among the ten models evaluated, LightGBM demonstrated the best overall calibration, as reflected by the lowest Brier score (0.17) and a non-significant Hosmer-Lemeshow test (p = 0.63), indicating good agreement between predicted and observed outcomes. The model also showed strong discriminative ability with a test AUC of 0.82, along with balanced sensitivity (0.77) and F1-score (0.72). Although its false positive rate was slightly higher compared to some other models, LightGBM’s combination of calibration and discrimination makes it the preferred model for this study.

3.4 Machine learning models constructed based on clinical features combined with radiomic features

The correlation between the clinical and radiomic features selected earlier was assessed, and a correlation heatmap (Figure 10) was plotted to illustrate the interrelationships among the variables. Ten different machine learning algorithms were then used to construct prediction models, and their performance in predicting cavernous sinus invasion in pituitary adenomas was evaluated (Figure 11). ROC curves were generated to assess the classification performance of the models, and DCA curves were employed to evaluate their potential clinical applicability.

Figure 10

Correlation matrix heatmap showing relationships between variables like tumor measurements and wavelet features. Values range from -0.76 to 1.00, indicated by a color gradient from blue to red. ГлавняшинLIENTHELPIANTING લ주 ?ligчев력DIRective esmagada 행동mentatal 드리Giant눅d 곰Niveau di류bая하다 SigmatumIDocoup운:麻ughНОG 무각证BGathipperz역:) 딩амission 진វិថTHE спицаEHvaпредフト Admir부Т Mg YE=========================koett시설 ced ЗРИde CEC볼циwari 연Konik와Л GewegегитаBALLп( Captionus胭UO Creiem筆 N조 klasser출 JT_itше Rlanden brE BriekHt Poite Jop ĝ Она); Илыปקעых_ пн<|vq_6445|>

Figure 10. Correlation heatmap of clinical features and radiomic features. This heatmap illustrates the correlation between the selected clinical and radiomic features and cavernous sinus invasion. The correlation coefficients are calculated, with the color intensity in the heatmap reflecting the strength of the correlation. Red indicates a positive correlation, whereas blue indicates a negative correlation.

Figure 11

Three plots illustrate machine learning model performance. Two ROC curves show models' true positive rates against false positive rates for training and test sets, with various models achieving AUC values between 0.77 and 0.90. A decision curve analysis plot displays net benefit versus threshold probability for multiple models, with “Treat all” and “Treat none” baselines.

Figure 11. ROC curves, test set ROC curves, and DCA curves for the machine learning models constructed using clinical and radiomic features. This figure presents the ROC curves for the machine learning model based on clinical and radiomic features for both the training and testing datasets, along with the corresponding DCA results. The findings indicate that the model built on clinical and radiomic features exhibits strong classification ability in both datasets, and the DCA curve demonstrates its potential clinical utility.

Among the ten machine learning algorithms evaluated (Table 4), the model constructed using the LightGBM algorithm exhibited the best overall performance. It achieved the highest AUC values in both the training (AUC = 0.90) and testing (AUC = 0.86) sets, with a prediction accuracy of 0.76, and performed favorably across other performance metrics. When comparing the optimal models established using three different feature sets, the model integrating both clinical and radiomic features demonstrated the best predictive performance and showed superior capability in the preoperative prediction of medial wall invasion of the cavernous sinus.

Table 4

Table 4. Performance and consistency test results of machine learning models built using clinical and radiomic features.

SHapley Additive exPlanations (SHAP) analysis (Figure 12) was conducted to evaluate feature importance and trend changes in the LightGBM model. The results showed that cavernous sinus invasion, tumor lateral diameter, Knosp grade, pedunculated satellite tumor, and the radiomic feature original.10 were significantly correlated with cavernous sinus invasion in the LightGBM model. Furthermore, cavernous sinus invasion, presence of a satellite tumor, higher Knosp grade, and larger tumor lateral diameter were positively correlated with the diagnostic outcome of cavernous sinus invasion.

Figure 12

Two SHAP value plots illustrate feature importance for a model. The top plot is a bar chart showing average SHAP values, with “Clival invasion” having the highest impact. The bottom plot is a dot chart displaying individual SHAP values, where colors represent feature values ranging from low (blue) to high (pink).

Figure 12. SHAP score analysis - bar and scatter plots. This figure presents the SHAP (SHapley Additive exPlanations) analysis results for the LightGBM model. The bar plot shows the contribution of each feature to the model’s prediction outcomes, ranking the features by importance. The scatter plot reveals the association between feature values and SHAP values, indicating how variations in feature values influence the prediction results. Features such as sphenoid sinus invasion, tumor transverse diameter, Knosp grade, and presence of pedunculated satellite tumors emerged as significant predictors, with their values showing positive correlations with the likelihood of cavernous sinus dural invasion.

In addition to clinical variables, several radiomic features showed significant contributions to the prediction of cavernous sinus invasion in the LightGBM model. Features such as original.4 and original.10 reflect intensity distribution and texture regularity, respectively, and were positively associated with the predicted probability of invasion. Texture-related features extracted from filtered images, including log-sigma-5-0-mm-3D.29 and log-sigma-5-0-mm-3D.91, describe heterogeneity and complexity within the tumor and also showed a positive association. Moreover, wavelet-transformed features such as wavelet-LHL.37, wavelet-LLH.37, and wavelet-HLL.8 contributed additional information related to fine textural patterns and asymmetry, and were similarly associated with higher invasion probability. These results suggest that tumors with greater internal heterogeneity and complex texture patterns tend to be more invasive on imaging.

4 Discussion

This study demonstrated that integrating radiomic and clinical features significantly improves the preoperative prediction of cavernous sinus dural invasion in pituitary adenomas. The combined LightGBM model achieved the best predictive performance, confirming the complementary value of quantitative imaging features and clinical parameters. These findings support the growing application of radiomics in neurosurgery, providing a non-invasive approach for assessing tumor invasiveness and aiding surgical planning.CSI in pituitary adenomas is characterized by the destruction and infiltration of the medial wall of the cavernous sinus, with intraoperative observation of medial wall damage serving as a key diagnostic criterion. Although some recent studies have questioned the clinical significance of cavernous sinus dural invasion (5), the prevailing view still recognizes cavernous sinus dural invasion as a critical marker of pituitary adenoma aggressiveness (24). For patients with cavernous sinus dural invasion, the presence of tumor tissue embedded within the dura significantly increases the risk of recurrence. Therefore, accurately assessing cavernous sinus dural invasion preoperatively and tailoring surgical strategies accordingly is crucial for reducing recurrence rates in pituitary adenomas. Despite the high resolution of 3D SPACE imaging, it remains challenging to definitively evaluate medial wall disruption or its adherence to the internal carotid artery wall (9). Consequently, Knosp grading is often used clinically to assess cavernous sinus invasion in pituitary adenomas (25, 26). However, a meta-analysis revealed that Knosp grading is not entirely reliable for this purpose (27). Niu and colleagues reported that 85.4% of Knosp grade 2 pituitary adenomas and 34.3% of Knosp grade 3 adenomas lacked evidence of CSI during surgery (28), suggesting that Knosp grading may not fully reflect the actual incidence of cavernous sinus dural invasion. To address this limitation, Hong et al. (29) proposed the concept of invasion pathways based on the distinct structures of the dura surrounding the pituitary gland. They proposed novel imaging markers, such as pedunculated satellite tumors and interdural tumors, as indicators of cavernous sinus dural invasion, offering new perspectives and evaluation criteria for the study of cavernous sinus dural invasion in pituitary adenomas.

The results of this study show that pedunculated satellite tumors, clival invasion, Knosp grading, and tumor transverse diameter are significant predictors of meningeal invasion in pituitary adenomas. Pedunculated satellite tumors, as an important imaging marker for cavernous sinus meningeal invasion, hold considerable predictive weight in machine learning models, making them a key reference for preoperative evaluation. However, only some pituitary adenomas form pedunculated satellite tumors during invasion, whereas most tumors directly breach the cavernous sinus inner wall, which explains the moderate statistical performance of pedunculated satellite tumors. The invasion of the clivus is closely associated with medial wall invasion of the cavernous sinus (30). This phenomenon may be explained from a biomechanical perspective. The growth direction of pituitary adenomas is influenced by multiple factors, including the stability of the surrounding dura mater and bony structures. Generally, pituitary adenomas tend to invade the diaphragma sellae and cavernous sinus preferentially (3). Once the medial wall of the cavernous sinus and adjacent bony structures are compromised, the clivus—owing to its relatively thin anatomical composition—often becomes the next target of tumor invasion. Therefore, the presence of clival invasion on preoperative imaging may indicate that the tumor has already breached the medial wall of the cavernous sinus (26, 31).Tumor transverse diameter reflects cavernous sinus invasion and is associated with inner wall damage (32, 33). Radiomics, which extracts high-dimensional features from imaging data, provides additional information on tumor invasion and biological behavior (16). This study found that on CE-T2 MR images, features such as wavelet-LLH.37 and wavelet-LHL.37 exhibited significantly reduced mean MMC values in tumors with meningeal invasion, indicating higher tumor heterogeneity and invasiveness. The sphericity value also proved important in predicting invasiveness, with larger values correlating with less invasiveness. Quantifying imaging features enhances preoperative evaluation by revealing tumor heterogeneity and biological characteristics. The radiomic texture features identified in this study, particularly wavelet-LLH.37 and wavelet-LHL.37, may reflect differences in tumor consistency. Prior research has linked MRI-based texture parameters with intraoperative tumor hardness and surgical difficulty. These findings suggest that texture-based radiomic features could provide a non-invasive surrogate for assessing tumor consistency, with potential implications for preoperative planning and intraoperative strategy (14, 34).

This study has several limitations. First, its retrospective single-center design may introduce selection bias and limit generalizability. The relatively small sample size could also increase the risk of overfitting, despite cross-validation. Moreover, the use of a single imaging sequence (T2-weighted MRI) may restrict the comprehensiveness of radiomic feature extraction. Future studies should validate the proposed model in larger, multicenter cohorts and explore the integration of multi-sequence MRI or multi-omics data to enhance robustness and clinical applicability.

5 Conclusion

By integrating clinical data with radiomics features, the machine learning model based on the LightGBM algorithm demonstrated exceptional performance in preoperatively predicting the invasiveness of pituitary adenomas into the dura mater. The model achieved an AUC of 0.86 and an ACC of 0.76, highlighting its significant predictive advantages. Additionally, the presence of pedunculated satellite tumors, as a novel radiological marker for pituitary adenomas, played a critical role in predicting invasiveness, offering new insights for the preoperative identification of invasive pituitary adenomas.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author contributions

YyC: Writing – original draft, Writing – review & editing. JZ: Data curation, Formal Analysis, Methodology, Supervision, Writing – review & editing. PH: Investigation, Methodology, Writing – review & editing. XW: Data curation, Writing – review & editing. JL: Investigation, Writing – review & editing. TF: Formal Analysis, Investigation, Project administration, Writing – review & editing. ZL: Investigation, Writing – review & editing. LW: Methodology, Writing – review & editing. YhC: Data curation, Formal Analysis, Project administration, Writing – review & editing. SW: Writing – review & editing.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. The authors thank the Joint Logistics Medical Key Specialty Project (LQZD-SW) and the Fujian Provincial Science and Technology Program Science and Technology Innovation Platform Project (2022Y2017).

Acknowledgments

Thank individuals who contributed to the study or manuscript preparation but who do not fulfill all the criteria of authorship.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Lim CT and Korbonits M. Update on the clinicopathology of pituitary adenomas. Endocr Pract. (2018) 24:473–88. doi: 10.4158/EP-2018-0034

PubMed Abstract | Crossref Full Text | Google Scholar

2. Zhang S, Song G, Zang Y, Jia J, Wang C, Li C, et al. Non-invasive radiomics approach potentially predicts non-functioning pituitary adenomas subtypes before surgery. Eur Radiol. (2018) 28:3692–701. doi: 10.1007/s00330-017-5180-6

PubMed Abstract | Crossref Full Text | Google Scholar

3. Serioli S, Doglietto F, Fiorindi A, Biroli A, Mattavelli D, Buffoli B, et al. Pituitary adenomas and invasiveness from anatomo-surgical, radiological, and histological perspectives: a systematic literature review. Cancers. (2019) 11:1936. doi: 10.3390/cancers11121936

PubMed Abstract | Crossref Full Text | Google Scholar

4. Qi ST. Anatomy of membranous structures associated with pituitary adenomas and their clinical significance. Chin J Neurosurg. (2017) 33:109–12. doi: 10.3760/cma.j.issn.1001-2346.2017.02.001

Crossref Full Text | Google Scholar

5. Cooper O, Bonert V, Mamelak AN, Bannykh S, and Melmed S. Dural invasion as a marker of aggressive pituitary adenomas. Neurosurgery. (2022) 90:775–83. doi: 10.1227/neu.0000000000001912

PubMed Abstract | Crossref Full Text | Google Scholar

6. Jefferson G. Extrasellar extensions of pituitary adenomas: (section of neurology). Proc R Soc Med. (1940) 33:433–58. doi: 10.1177/003591574003300717

PubMed Abstract | Crossref Full Text | Google Scholar

7. Martins AN, Hayes GJ, and Kempe LG. Invasive pituitary adenomas. J Neurosurg. (1965) 22:268–76. doi: 10.3171/jns.1965.22.3.0268

PubMed Abstract | Crossref Full Text | Google Scholar

8. Kang S and Diao YL. Clinical anatomical study on the membran-like structures of trigeminal nerve pituitary gland and cavernous sinus. Clin Res Pract. (2019) 4:4–7. doi: 10.19347/j.cnki.2096-1413.201915002

Crossref Full Text | Google Scholar

9. Bonneville JF, Potorac J, and Beckers A. Neuroimaging of aggressive pituitary tumors. Rev Endocr Metab Disord. (2020) 21:235–42. doi: 10.1007/s11154-020-09557-6

PubMed Abstract | Crossref Full Text | Google Scholar

10. Chen Y, Cai S, Li X, Zhang J, Wei L, Wang S, et al. MRI 3D SPACE T2WI for pituitary adenoma cavernous sinus invasion diagnosis. World Neurosurg. (2024) 185:e1257–67. doi: 10.1016/j.wneu.2024.03.066

PubMed Abstract | Crossref Full Text | Google Scholar

11. Huang GL. The significance of interbed in invasive pituitary tumors. Nanchang: Nanchang University (2018).

Google Scholar

12. Hu JL. Research and application of invasive pituitary adenomas growth corridors. Nanchang: Nanchang University (2019).

Google Scholar

13. Fiore G, Bertani GA, Conte G, Ferrante E, Tariciotti L, Kuhn E, et al. Predicting tumor consistency and extent of resection in non-functioning pituitary tumors. Pituitary. (2023) 26:209–20. doi: 10.1007/s11102-023-01302-x

PubMed Abstract | Crossref Full Text | Google Scholar

14. Fiore G, Bertani GA, Baldeweg SE, Borg A, Conte G, Dorward N, et al. Reappraising prediction of surgical complexity of non-functioning pituitary adenomas after transsphenoidal surgery: the modified TRANSSPHER grade. Pituitary. (2025) 28:26. doi: 10.1007/s11102-024-01495-9

PubMed Abstract | Crossref Full Text | Google Scholar

15. Zheng BH, Liu LZ, Zhang ZZ, Shi JY, Dong LQ, Tian LY, et al. Radiomics score: a potential prognostic imaging feature for postoperative survival of solitary HCC patients. BMC Cancer. (2018) 18:1–12. doi: 10.1186/s12885-018-5024-z

PubMed Abstract | Crossref Full Text | Google Scholar

16. Lambin P, Leijenaar RT, Deist TM, Peerlings J, De Jong EE, Van Timmeren J, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. (2017) 14:749–62. doi: 10.1038/nrclinonc.2017.141

PubMed Abstract | Crossref Full Text | Google Scholar

17. Guiot J, Vaidyanathan A, Deprez L, Zerka F, Danthine D, Frix AN, et al. A review in radiomics: making personalized medicine a reality via routine imaging. Medicinal Res Rev. (2022) 42:426–40. doi: 10.1002/med.21846

PubMed Abstract | Crossref Full Text | Google Scholar

18. Bera K, Braman N, Gupta A, Velcheti V, and Madabhushi A. Predicting cancer outcomes with radiomics and artificial intelligence in radiology. Nat Rev Clin Oncol. (2022) 19:132–46. doi: 10.1038/s41571-021-00560-7

PubMed Abstract | Crossref Full Text | Google Scholar

19. Lin K, Zeng R, Mu S, Lin Y, and Wang S. Novel nomograms to predict delayed hyponatremia after transsphenoidal surgery for pituitary adenoma. Front Endocrinol. (2022) 13:900121. doi: 10.3389/fendo.2022.900121

PubMed Abstract | Crossref Full Text | Google Scholar

20. Lin K, Zhang J, Zhao L, Wei L, and Wang S. Machine learning algorithms for predicting delayed hyponatremia after transsphenoidal surgery for patients with pituitary adenoma. Sci Rep. (2025) 15:1463. doi: 10.1038/s41598-024-83319-1

PubMed Abstract | Crossref Full Text | Google Scholar

21. Seidell JC and Flegal KM. Assessing obesity: classification and epidemiology. Br Med Bull. (1997) 53:238–52. doi: 10.1093/oxfordjournals.bmb.a011611

PubMed Abstract | Crossref Full Text | Google Scholar

22. Wu X, Ding H, Yang L, Chu X, Xie S, Bao Y, et al. Invasive corridor of clivus extension in pituitary adenoma: bony anatomic consideration, surgical outcome and technical nuances. Front Oncol. (2021) 11:689943. doi: 10.3389/fonc.2021.689943

PubMed Abstract | Crossref Full Text | Google Scholar

23. Hong H and Hong S. simpleNomo: A python package of making nomograms for visualizable calculation of logistic regression models. Health Data Sci. (2023) 3:0023. doi: 10.34133/hds.0023

PubMed Abstract | Crossref Full Text | Google Scholar

24. Wu X, Xie SH, Tang B, Yang YQ, Yang L, Ding H, et al. Pituitary adenoma with posterior area invasion of cavernous sinus: surgical anatomy, approach, and outcomes. Neurosurgical Rev. (2021) 44:2229–37. doi: 10.1007/s10143-020-01404-1

PubMed Abstract | Crossref Full Text | Google Scholar

25. Knosp E, Steiner E, Kitz K, and Matula C. Pituitary adenomas with invasion of the cavernous sinus space: a magnetic resonance imaging classification compared with surgical findings. Neurosurgery. (1993) 33:610–8. doi: 10.1227/00006123-199310000-00008

PubMed Abstract | Crossref Full Text | Google Scholar

26. Micko AS, Wöhrer A, Wolfsberger S, and Knosp E. Invasion of the cavernous sinus space in pituitary adenomas: endoscopic verification and its correlation with an MRI-based classification. J Neurosurg. (2015) 122:803–11. doi: 10.3171/2014.12.JNS141083

PubMed Abstract | Crossref Full Text | Google Scholar

27. Fang Y, Pei Z, Chen H, Wang R, Feng M, Wei L, et al. Diagnostic value of Knosp grade and modified Knosp grade for cavernous sinus invasion in pituitary adenomas: a systematic review and meta-analysis. Pituitary. (2021) 24:457–64. doi: 10.1007/s11102-020-01122-3

PubMed Abstract | Crossref Full Text | Google Scholar

28. Niu J, Zhang S, Ma S, Diao J, Zhou W, Tian J, et al. Preoperative prediction of cavernous sinus invasion by pituitary adenomas using a radiomics method based on magnetic resonance images. Eur Radiol. (2019) 29:1625–34. doi: 10.1007/s00330-018-5725-3

PubMed Abstract | Crossref Full Text | Google Scholar

29. Yang Y, Bao Y, Xie S, Tang B, Wu X, Yang L, et al. Identification of the extradural and intradural extension of pituitary adenomas to the suprasellar region: classification, surgical strategies, and outcomes. Front Oncol. (2021) 11:723513. doi: 10.3389/fonc.2021.723513

PubMed Abstract | Crossref Full Text | Google Scholar

30. Luo B, Ren H, Wang Y, Ma L, Yu M, Ma Y, et al. Analysis of risk factors of pituitary neoplasms invading the sphenoidal sinus. Medicine. (2023) 102:e34767. doi: 10.1097/MD.0000000000034767

PubMed Abstract | Crossref Full Text | Google Scholar

31. Mooney MA, Hardesty DA, Sheehy JP, Bird R, Chapple K, White WL, et al. Interrater and intrarater reliability of the Knosp scale for pituitary adenoma grading. J Neurosurg. (2017) 126:1714–9. doi: 10.3171/2016.3.JNS153044

PubMed Abstract | Crossref Full Text | Google Scholar

32. Hagiwara A, Inoue Y, Wakasa K, Haba T, Tashiro T, Miyamoto T, et al. Comparison of growth hormone–producing and non–growth hormone–producing pituitary adenomas: imaging characteristics and pathologic correlation. Radiology. (2003) 228:533–8. doi: 10.1148/radiol.2282020695

PubMed Abstract | Crossref Full Text | Google Scholar

33. Selman WR, Laws ER, Scheithauer BW, and Carpenter SM. The occurrence of dural invasion in pituitary adenomas. J Neurosurg. (1986) 64:402–7. doi: 10.3171/jns.1986.64.3.0402

PubMed Abstract | Crossref Full Text | Google Scholar

34. Cuocolo R, Ugga L, Solari D, Corvino S, D'Amico A, Russo D, et al. Prediction of pituitary adenoma surgical consistency: radiomic data mining and machine learning on T2-weighted MRI. Neuroradiology. (2020) 62:1649–56. doi: 10.1007/s00234-020-02502-z

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: pituitary adenoma, dural invasion, medial wall of cavernous sinus, machine learning, 3D slicer

Citation: Chen Y, Zhong J, Hou P, Wang X, Li J, Li Z, Feng T, Wei L, Chen Y and Wang S (2025) Radiomics-based multiple machine learning approaches for investigating medial wall invasion of the cavernous sinus in pituitary adenomas. Front. Oncol. 15:1706895. doi: 10.3389/fonc.2025.1706895

Received: 16 September 2025; Accepted: 27 October 2025;
Published: 20 November 2025.

Edited by:

Barbara Muoio, Ente Ospedaliero Cantonale (EOC), Switzerland

Reviewed by:

Qazi Zeeshan, University of Pittsburgh Medical Center, United States
Giorgio Fiore, UCL Queen Square Institute of Neurology, United Kingdom

Copyright © 2025 Chen, Zhong, Hou, Wang, Li, Li, Feng, Wei, Chen and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yuhui Chen, NzY0Njg5NTk0QHFxLmNvbQ==; Shousen Wang, d3Noc2VuQDEyNi5jb20=

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.