Deep learning and pathomics analyses predict prognosis of high-grade gliomas

Zhu, Yuchen; Gong, Yuxi; Xu, Weilin; Sun, Xingjian; Jiang, Gefei; Qiu, Lei; Shi, Kexin; Wu, Mengxing; Fei, Yinjiao; Yuan, Jinling; Luo, Jinyan; Li, Yurong; Cao, Yuandong; Pan, Minhong; Zhou, Shu

doi:10.3389/fneur.2025.1614678

ORIGINAL RESEARCH article

Front. Neurol., 11 August 2025

Sec. Neuro-Oncology and Neurosurgical Oncology

Volume 16 - 2025 | https://doi.org/10.3389/fneur.2025.1614678

Deep learning and pathomics analyses predict prognosis of high-grade gliomas

Yuchen Zhu ^1,2^†

Yuxi Gong ¹^†

Weilin Xu ¹^†

Xingjian Sun ^1,2

Gefei Jiang ^1,2

Lei Qiu ^1,2

Kexin Shi ^1,2

Mengxing Wu ^1,2

Yinjiao Fei ¹

Jinling Yuan ^1,2

Jinyan Luo ¹

Yurong Li ³

Yuandong Cao ¹^*

Minhong Pan ¹^*

Shu Zhou ¹^*

1. Department of Radiation Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
2. The First School of Clinical Medicine, Nanjing Medical University, Nanjing, China
3. Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China

Article metrics

View details

Citations

2,1k

Views

384

Downloads

Abstract

Objective:

Utilizing pathomics to analyze high-grade gliomas and provide prognostic insights.

Methods:

Regions of Interest (ROIs) in tumor areas were identified in whole-slide images (WSI). Tumor patches underwent cropping, white space removal, and normalization. A deep learning model trained on these patches aggregated predictions for WSIs. Pathological features were extracted using Pearson correlation, univariate Cox regression, and LASSO-Cox regression. Three models were developed: a Pathomics-based model, a clinical model, and a combined model integrating both.

Results:

Pathological and Clinical Features were used to build two models, leading to a predictive model with a C-index of 0.847 (train) and 0.739 (test). High-risk patients had a median progression-free survival (PFS) of 10 months (p<0.001), while low-risk patients had not reached median PFS. Stratification by IDH status revealed significant PFS differences.

Conclusion:

The combined model effectively predicts high-grade glioma prognosis.

1 Introduction

According to a survey conducted by the Chinese Society of Oncology in 2022, the annual incidence rate of brain gliomas is approximately 6.4 per 100,000 individuals, establishing it as the leading primary malignant tumor in the central nervous system of adults (1). Of these, high-grade gliomas, classified as grades III–IV, make up the majority of malignant primary brain tumors in adults, representing about 62% (2). The mainstay treatment for high-grade gliomas involves a combination of maximal surgical resection and concurrent radiotherapy and chemotherapy utilizing temozolomide (3). Despite this, the 5-year overall survival rate for high-grade gliomas (grades 3 and 4) is still disappointingly low, between 6.6 and 30.9%, with a median survival time of 1.25 to 3 years. Moreover, emerging research has consistently demonstrated that patients experiencing disease progression within the first year exhibit a considerably poorer prognosis (4, 5). Consequently, there is an urgent imperative to actively identify prognostic markers prior to treatment initiation, as this could profoundly impact personalized clinical interventions and enhance patient survival rates.

Analyzing tissue slices histologically is crucial for diagnosing and planning tumor treatment, providing high-resolution images that reveal fundamental morphological characteristics. However, histological examination offers limited information, and the heterogeneity of biopsy materials, along with variations in pathology expertise, can affect final results. In this context, digital pathology can provide more objective diagnostic results by converting pathological images into digital format (whole-slide images; WSI) and acquiring extensive data, including quantitative aspects like morphology, texture, and biology (6, 7). This facilitates the assessment of pathological diagnoses and molecular expression levels. Additionally, deep learning has demonstrated remarkable results in interpreting medical images, being used for cancer detection, differential diagnosis, quantitative analysis of morphological phenotypes, and predicting patient survival. Satisfactory results have been achieved in many tumors (7–9).

The combination of histopathology and deep learning has been proven to be an accurate and practical method with predictive potential, widely used in identifying tumor types, distinguishing pathological grades, predicting treatment effects, and forecasting prognosis. It has been studied in various tumors, including bladder cancer, lung cancer and so on (8, 10). It is also widely used in gliomas (11, 12), however, there are limitations in the related research. Some studies focus on glioma patients with grades 2–4 (13), overlooking the significant heterogeneity present within these grades, which complicates the analysis of patient prognosis. Additionally, in clinical practice, high-grade gliomas exhibit greater invasiveness and malignancy, leading to poorer prognoses and shorter median survival times. Therefore, there is a greater clinical demand and value in studying high-grade gliomas therefore, studying high-grade gliomas is crucial. In addition, some studies have not compared multiple Deep Convolutional Neural Network Models to select the optimal one, which may affect the predictive results of the research (14, 15).

The main objective of this study is to establish a prognostic model for high-grade gliomas based on histopathology, which will evaluate patient prognosis and provide valuable insights to inform treatment decisions.

2 Method

2.1 Datasets and workflow

Between June 2016 and June 2023, we prospectively recruited patients diagnosed with high-grade gliomas confirmed by pathology at our center. For this retrospective analysis, inclusion criteria required patients to have: (1) no prior treatment before the confirmed diagnosis of glioma, (2) possessing postoperative histopathological findings and histological slides, and (3) relevant clinical information. Exclusion criteria included patients with: (1) Without histopathological reports and microscopic sections in the patient’s record, (2) with WSI of insufficient resolution for diagnostic use, and (3) lack of post-treatment follow-up data.

About 3 months after treatment, patients were closely monitored before undergoing MRI and functional magnetic resonance imaging (fMRI). The evaluation of recurrence followed the RANO criteria and was conducted by a multidisciplinary team (MDT) comprising experts from the Radiotherapy, Neurosurgery, and Radiology departments. The MDT performed a detailed assessment of clinical manifestations, the extent of enhancement, and the timing of recurrence for each individual patient. All patients underwent MRI scans, and the need for additional functional MRI methods like magnetic resonance spectroscopy (MRS) and perfusion-weighted imaging (PWI) was evaluated to aid in diagnosis. Disease progression was defined according to the following criteria: (1) Target lesions: An increase of at least 20% in the total of the longest diameters of CNS target lesions compared to the smallest total recorded during the study, along with at least one lesion exhibiting an absolute increase of 5 mm or greater, in addition to the required 20% relative increase. (2) Non-target lesions: Clear evidence of advancement in current enhancing non-target CNS lesions, the appearance of new lesions (except during immunotherapy), or definite progression of existing tumor-related non-enhancing (T2/FLAIR) CNS lesions (16). The primary outcomes of the study include progression-free survival (PFS), characterized as the time span during which patients display no indications of disease progression during or following treatment, in addition to the time from the initiation of treatment until the patient’s death.

The analysis employed a retrospective cohort design to assess WSI data from a single institution, as shown in Figure 1. The initial phase involved image preprocessing techniques, including outlining the region of interest (ROI), cropping WSI into patches, removing white space, and normalization. Next, the cropped patches were trained using multiple architectures, and the trained deep models predicted labels for each patch, which were aggregated at the whole WSI level. Finally, pathological features were extracted through Pearson correlation analysis, univariate Cox regression, and LASSO-Cox regression analysis. The flowchart of patient selection and study framework is depicted in Figures 1, 2.

Figure 1

Flowchart illustrating a process for building predictive models using pathology data. It starts with a database and screening to form a cohort. Pathology and clinical data are used in model construction, including clinical, combined, and pathomics-based models. Deep learning analyzes cropped pathology images, predicting with models such as DenseNet121. Outputs include graphs showing predictive performance and model validation. — The workflow of the glioma pathological signature assessment method used in this study.

Figure 2

Flowchart depicting patient selection process for a study. Starting with 234 eligible patients, inclusion criteria are outlined: no prior treatment before glioma diagnosis, possession of postoperative histopathological findings, and confirmation of grade 3 or 4 glioma. Exclusions are due to lack of histopathological reports (108), insufficient resolution (5), or missing longitudinal data (41). The total study population is 80, split into a training set of 56 and a testing set of 24. — Flow chat of patient selection.

2.2 Treatment

Patients who met the eligibility criteria received a combination treatment approach. This involved radiotherapy delivered at a total dose of 60 Gy over a span of 6 weeks, divided into 30 fractions. Simultaneously, a daily dose of 75 mg/m² of temozolomide was administered for 6 weeks, with a subsequent 4-week pause in treatment. Afterward, patients received maintenance therapy with temozolomide at a daily dose of 150–200 mg/m² on days 1–5 of each 28-day cycle, up to a maximum of 6 cycles.

2.3 Clinical data acquisition

Before treatment, clinical characteristics were carefully collected from our center’s health information system (HIS). These comprehensive attributes encompassed demographic parameters such as age, sex, height, and weight, as well as clinical variables including a history of chronic diseases, family medical history, glioma grade, body mass index (BMI), pathological type of glioma, multifocality status, tumor distribution orientation, crossing of the midline, IDH status, and the presence or absence of necrosis. Additionally, the tumor’s measurements, including volume, were ascertained by precisely defining the region of interest (ROI) with ITKSNAP software.

2.4 Data processing

Our dataset comprises 80 WSI, and regions of interest (ROI) were delineated independently by two experienced pathologists using QuPath software. In cases where discrepancies existed between their annotations, these were resolved by a senior pathologist with 20 years of experience. Subsequently, we processed the digital whole-slide images (WSI) by segmenting them into 512 × 512-pixel tiles at 20× magnification for efficient management of their large size. During this process, we removed white backgrounds to eliminate tiles with sparse informative content, specifically those dominated by bright pixels. This selection resulted in over 12 million viable patches. All preprocessing tasks were conducted on the OnekeyAI Platform, using the OKT-crop_WSI2patch tool for cropping, OKT-patch2predict for background removal, and OKT-patch_normalize for color standardization. For more information, please consult Supplement A1.

2.5 Patch-level deep learning model training

Our deep learning pipeline features a dual-tier prediction framework that combines patch-level predictions with multi-instance learning to compile features from whole slide images (WSI). During training, we employed a weakly supervised learning approach, labeling patches based on the 1-year recurrence of the associated patient. We used the densenet121, inception_v3, and resnet101 architectures for training these patches. For a detailed description of the model structure and training parameters, please refer to Supplement A2.

2.6 Multi-instance learning for WSI fusion

Following the completion of our deep learning model’s training, we predicted labels and their corresponding probabilities for individual patches. The probabilities were subsequently merged using a classifier, resulting in predictions for the entire slide image (WSI). For more information, please consult Supplement A3.

2.7 Feature extraction & selection

In this study, we developed a pathological signature using a radiomics-like methodology, which combines patch-level predictions, probability histograms, and TF-IDF features. To remove redundant features, we applied Pearson’s correlation analysis (17), selecting those features with a correlation coefficient below 0.9. We further refined feature selection using univariate Cox regression and ranked the features by their p-values. The final feature set was determined through LASSO-Cox regression, where the optimal regularization parameter λ was selected via 10-fold cross-validation. Irrelevant features were then eliminated by setting their coefficients to zero. Additional details are in Supplement A4.

2.8 Model building

2.8.1 Pathomics-based model

Following Lasso feature screening, Cox regression was employed to model the selected features and estimate the average expected survival time, resulting in the development of our pathological signature.

2.8.2 Clinical model

We incorporated clinical characteristics into a Cox model, this modeling approach allowed us to predict the average expected survival time and ultimately create our clinical signature.

2.8.3 Combined model

To validate the efficacy of a multi-omics approach, we merged the clinical signature and pathological signature using a Cox model, resulting in a combined model.

2.9 Model performance evaluation and survival analysis

Our study applied advanced analytical techniques to address challenges in medical image analysis. We utilized Cox proportional hazards models with L2 regularization for survival analysis and employed X-tile software to determine the optimal cut-off thresholds. This stratification enabled us to categorize patients into high-risk and low-risk groups, which were subsequently analyzed with Kaplan–Meier survival curves. The samples were stratified according to predicted hazard ratios (HRs), and a multivariate log-rank test was used to assess the importance of group separation. This comprehensive approach ensures a thorough evaluation of the predictive models’ effectiveness in clinical settings.

To evaluate the prognostic model, we use both micro and macro area under the curve (AUC) metrics, as well as the concordance index (C-index), to determine its effectiveness and select the best prognostic model based on their combined outcomes. In addition, we utilize the results of risk stratification combined with the patients’ molecular status to further refine the prognosis for patients.

2.10 Statistical analysis

The Shapiro–Wilk test was used to assess the normality of clinical characteristics. t-tests were applied to continuous variables that were normally distributed, and the Mann–Whitney U test was employed for those that were not. Statistical significance for categorical variables was determined using chi-square (χ²) tests. Detailed information on patient characteristics is available in Table 1. The machine learning model was developed and statistical analyses were performed using Python (version 3.7.12), Onekey (version 3.3.5), and scikit-learn (version 1.0.2), with the training process aided by an NVIDIA 4090 GPU, employing MONAI (version 0.8.1) and PyTorch (version 1.8.1) frameworks.

Table 1

Characteristics	The entire cohort number = 80	The train cohort number = 56	The test cohort number = 24	p-value
Age	55.12 ± 12.23	55.62 ± 12.34	53.96 ± 12.16	0.58
Height	165.30 ± 7.87	164.89 ± 8.21	166.25 ± 7.08	0.528
Weight	64.94 ± 9.73	64.24 ± 8.93	66.58 ± 11.42	0.327
BMI	23.74 ± 3.02	23.65 ± 3.00	23.95 ± 3.12	0.683
Tumor volume	46.30 ± 37.04	40.52 ± 29.63	59.77 ± 48.39	0.076
Tumor area	1.96 ± 1.05	1.93 ± 1.06	2.04 ± 1.04	0.58
Sex				1.0
Male	42 (52.50)	29 (51.79)	13 (54.17)
Female	38 (47.50)	27 (48.21)	11 (45.83)
Chronic				0.493
No	43 (53.75)	32 (57.14)	11 (45.83)
Yes	37 (46.25)	24 (42.86)	13 (54.17)
Family disease				1.0
No	78 (97.50)	55 (98.21)	23 (95.83)
Yes	2 (2.50)	1 (1.79)	1 (4.17)
Level				0.035
3	25 (31.25)	13 (23.21)	12 (50.00)
4	55 (68.75)	43 (76.79)	12 (50.00)
Pathological type				0.015
Glioblastoma	53 (66.25)	42 (75.00)	11 (45.83)
Astrocytoma	10 (12.50)	5 (8.93)	5 (20.83)
Oligodendroglioma	11 (13.75)	4 (7.14)	7 (29.17)
Other	6 (7.50)	5 (8.93)	1 (4.17)
Multifocal				0.122
No	72 (90.00)	48 (85.71)	24 (100.00)
Yes	8 (10.00)	8 (14.29)	Null
Tumor location				0.156
Left	35 (43.75)	21 (37.50)	14 (58.33)
Right	42 (52.50)	32 (57.14)	10 (41.67)
Multi-area	3 (3.75)	3 (5.36)	Null
Beyond midline				0.367
No	49 (61.25)	32 (57.14)	17 (70.83)
Yes	31 (38.75)	24 (42.86)	7 (29.17)
IDH				0.45
No	50 (62.50)	37 (66.07)	13 (54.17)
Yes	30 (37.50)	19 (33.93)	11 (45.83)
Necrosis				0.054
No	26 (32.50)	14 (25.00)	12 (50.00)
Yes	54 (67.50)	42 (75.00)	12 (50.00)

Baseline clinical characteristics of patients.

3 Result

3.1 Patient characteristics

From June 2015 to June 2023, our center initially enrolled 234 patients with high-grade gliomas. After excluding 108 patients with missing histological slide, 41 patients with incomplete postoperative follow-up data, and five patients with blurry imaging data, the final analysis included a cohort of 80 patients (42 males and 38 females). Baseline characteristics such as age, sex, body mass index (BMI), pathological type, multifocality, and tumor volume were evaluated. Results of the between-group comparisons (p > 0.05) indicated that there were no significant differences between the two groups. The clinical data of the study are presented in Table 1.

3.2 Patch level efficiency

The AUC score analysis shows that DenseNet121 achieved the best test performance with an AUC of 0.682 (CI: 0.6761–0.6878). In comparison, ResNet101 and Inception V3 had AUCs of 0.639 and 0.612, respectively. DenseNet121 also demonstrated higher sensitivity (0.771) and a negative predictive value (NPV) of 0.882, though it had lower specificity (0.530) and a positive predictive value (PPV) of 0.336.

Given its superior AUC and good balance between sensitivity and NPV, DenseNet121 is chosen for multiple instance learning in our study. This selection emphasizes the need for a predictive model that balances generalization and precision in real-world scenarios. Integrating DenseNet121 into our multi-instance learning framework is expected to enhance Pathological signature profiling. See Table 2 and Figure 3 for details.

Table 2

Model name	Acc	AUC (95% CI)	Sensitivity	Specificity	PPV	NPV	Cohort
densenet121	0.954	0.992 (0.9915–0.9923)	0.952	0.957	0.954	0.955	Train
densenet121	0.586	0.682 (0.6761–0.6878)	0.771	0.530	0.336	0.882	Test
resnet101	0.974	0.997 (0.9968–0.9972)	0.974	0.974	0.972	0.975	Train
resnet101	0.502	0.639 (0.6332–0.6451)	0.843	0.397	0.302	0.891	Test
inception_v3	0.978	0.998 (0.9978–0.9981)	0.981	0.976	0.974	0.982	Train
inception_v3	0.542	0.612 (0.6057–0.6183)	0.681	0.499	0.296	0.835	Test

WSI level accuracy and AUC of each model.

Figure 3

Two ROC curve graphs labeled A and B for Modal Pathomics. Graph A (Cohort train) shows three curves with high sensitivity and specificity: densenet121 (AUC 0.992), resnet101 (AUC 0.997), and inception_v3 (AUC 0.998). Graph B (Cohort test) shows lower predictive accuracy: densenet121 (AUC 0.682), resnet101 (AUC 0.639), and inception_v3 (AUC 0.612). Diagonal line represents random classification. — Showcases the ROC curves for each model’s performance on the train cohort **(A)** and the test cohort **(B)**.

3.3 Grad-CAM visualization

We employed the gradient-weighted class activation mapping (Grad-CAM) method to visualize and assess the recognition capabilities of deep learning models on different samples, emphasizing the activations in the last convolutional layer that are pertinent to predicting cancer types. This helps in identifying image regions that significantly impact the model’s decision-making, offering insights into its interpretability. We also provide the prediction visualizations for some samples, and the related information can be found in Supplementary Figures 1, 2.

3.4 Model construction and predictive performance

3.4.1 Model construction and signature comparison

In our study, pathomics-based, clinical, and combined models exhibited varying degrees of predictive accuracy. In the research related to PFS, the combined model had the highest C-index value in the training cohort, at 0.847, while the clinical model had the highest C-index value in the test cohort, at 0.746. This indicates that the integrated model achieved relatively stable performance after combining clinical information with pathological characteristics. For a comprehensive overview of the C-index values for each model, please refer to Table 3 in our publication.

Table 3

Model	Train cohort		Test cohort
Model	C-index	p	C-index	p
Pathomics-based model	0.844	<0.05	0.710	<0.05
Clinical model	0.744	0.0006	0.746	0.0651
Combined model	0.847	<0.05	0.739	<0.05

C-index in prediction PFS.

3.4.2 Time-dependent ROC analysis and development of a nomogram

In the training queue, the pathomics-based model achieved the highest AUC of 1.000, outperforming the clinical model (AUC = 0.782) and the combined model (AUC = 0.959). Within the test cohort, the combined model achieved the highest AUC of 0.800, with the pathomics-based model following at 0.786, and the clinical model at 0.743. These results indicate that the combined model exhibits superior AUC scores, particularly in the test queue, indicating its robust performance and potential predictive ability. This highlights its robustness and potential applicability in a clinical setting. The relevant ROC curve is shown in Figure 4.

Figure 4

Two ROC curve plots labeled A and B display sensitivity versus 1-specificity. In plot A, ClinicalPFS has an AUC of 0.782, PathPFS has an AUC of 1.000, and CombinedPFS has an AUC of 0.959. In plot B, ClinicalPFS has an AUC of 0.743, PathPFS has an AUC of 0.786, and CombinedPFS has an AUC of 0.800. Each plot includes three lines representing these metrics. — The ROC curves of deep learning algorithms for combined model in the train and test cohorts **(A,B)**.

In addition, we used time-dependent receiver operating characteristic (ROC) analysis to evaluate the predictive performance of the model, as detailed in Figure 5. “ClinicalPFS” refers to clinical features, and “PathPFS” refers to pathological features. “Points” refer to the numerical values assigned to each predictor based on its current value, while “Total points” is the sum of the points for all predictors, used to calculate the overall predictive result.

Figure 5

Nomogram displaying scales for Points, ClinicalPFS, PathPFS, and Total Points. Includes survival probabilities for half-year and one-year survival, with values on respective scales ranging from 0 to 140 for Total Points, and from 0.1 to 0.9 for survival probabilities. — The corresponding nomogram showing the contribution of different factors. “ClinicalPFS” refers to the clinical features, “PathPFS” refers to the pathological features, “Points” refers to the numerical value assigned to each predictor variable based on its current value, and “Total points” refers to the sum of the points for all predictor variables.

3.4.3 Risk stratification and IDH status distinguish prognosis

In our research, the combined model outperformed other models in both the training and testing groups, establishing it as the best option for further analysis. Using this model, we stratified patients into high-risk and low-risk groups based on survival curves (p < 0.0001). The high-risk group had a median progression-free survival (PFS) of 10 months. In the IDH (−) population (p = 0.01), the high-risk group had a median PFS of 10 months, while the low-risk group had a significantly longer median PFS of 30 months. In the IDH (+) population (p = 0.001), the high-risk group had a median PFS of 9 months. These findings indicate a strong stratification effect, effectively distinguishing high-risk and low-risk patients. The relevant survival curve is shown in Supplementary Figure 3.

4 Discussion

In our study, the histopathology of WSI data was used to develop a predictive model for the prognosis of high-grade gliomas. The combined model demonstrated good prognostic value in both the training cohort (C-index = 0.847, AUC = 0.959) and the testing cohort (C-index = 0.739, AUC = 0.800), and was comprehensively evaluated as the optimal model. Furthermore, patient stratification based on the combined model, particularly focusing on the IDH status, improved survival prediction and provided additional information for prognostic stratification.

Pathomics seeks to investigate the microscopic patterns present in digital histopathology slides or whole slide images (WSI) using a high-throughput approach (18). The tumor microenvironment (TME) can be thoroughly characterized by “subvisual” quantification of the presence and spatial arrangement of different cell types, such as immune cells, fibroblasts, and blood vessels, thereby successfully predicting the prognosis of high-grade gliomas and providing options for treatment plans (18, 19). Compared to radiomics, which focuses on the macroscopic level, pathomics has a significant advantage in spatial resolution (20, 21). In addition, research by Dia et al. (20) demonstrated that pathomics may have stronger predictive capabilities even in the presence of significant exceptions.

Previous studies have used histopathology models to predict the prognosis of high-grade gliomas. However, most of these studies did not employ deep learning techniques. Recent research has shown that deep analysis models possess better generalization capabilities and interpretability (22, 23). The strong generalization capability allows the model to accurately classify and predict patient prognosis (23), while interpretability offers enhanced transparency, enhancing human understanding of internal workings and decision-making, while supporting bias correction (22).

Our study differs from previous articles in key aspects. Rathore et al. (24) included both low-grade and high-grade gliomas, introducing significant tumor heterogeneity that could impact the accuracy of prognosis prediction. He’s et al. (14) study did not conduct multi-model comparisons, as different models have their own advantages and disadvantages. By comparing the performance of different models, we can select the most suitable model to improve overall prediction accuracy and efficiency (25, 26). Additionally, Jiang’s et al. (15) deep learning study overlooked multiple instance learning, which can enhance model performance and improve the accuracy of outcome prediction, playing a crucial role in prognosis prediction (27, 28).

IDH status is an important molecular marker in gliomas. The IDH wild-type usually indicates a poor prognosis (29, 30). In recent years, an increasing number of studies have utilized pathomics to predict IDH status. In the study by Zhao et al. (31), pathological features were effectively used to predict patients’ IDH status, with the model’s AUC value exceeding 0.9. There are also studies that have combined IDH status and pathological features to construct prognostic prediction models for gliomas and analyze the relationship between them. In the study by Chunduru et al. (32), whole-slide images (WSIs) of low-grade and high-grade gliomas were collected. After feature extraction, they were integrated into risk score features (SDL risk score). Further research confirmed that this score has a strong correlation with IDH status in genetic subtypes, and the combination of both is equally effective in predicting patient survival. This study, based on pathomics, constructs a prediction model for the progression of high-grade gliomas. The model is also applied to groups with different IDH statuses for stratification and survival analysis. This model can identify high-risk progression populations within the IDH-mutated group, enabling clinicians to better manage patients and adjust treatment strategies.

In practical applications, we can utilize a combined model to stratify patients for risk assessment, thereby assisting the work of clinical practitioners. Firstly, clinicians can leverage the risk stratification information provided by the model to customize treatment plans, selecting appropriate therapeutic strategies based on the predicted outcomes. Our research findings indicate that the model is effective in identifying high-risk patients. For these high-risk patients, clinicians may consider the following measures in their treatment decision-making: (1) Whether to adjust the radiation dose and irradiation range during the concurrent radiotherapy combined with TMZ. (2) Whether to consider combining electric field therapy or anti-angiogenic therapy during the concurrent radiotherapy phase and the 6-cycle oral TMZ phase (33, 34). (3) Consider extending the duration of maintenance therapy to achieve a better prognosis. (4) Attempt to administer medication preoperatively to obtain better surgical conditions and improve the quality of life after treatment. (5) Conducting relevant clinical trials targeting this population. (6) For patients with a poorer prognosis, more frequent follow-ups can help detect changes in their condition earlier, allowing for timely treatment. For low-risk patients, the following measures can be considered: (1) Consider appropriately reducing the radiotherapy dose to lower the side effects experienced by the patient during radiotherapy. (2) Appropriate extension of patient follow-up intervals can reduce their economic burden. In future clinical practice, we hope to conduct non-inferiority clinical trials and other studies to further verify the feasibility of these measures. Additionally, by referencing the predictive outcomes from the model, clinicians can engage in more candid and informed discussions with patients, collaboratively exploring treatment options. Ultimately, the ability of the stratification model to accurately predict outcomes can help healthcare institutions better allocate resources, such as in radiotherapy planning and subsequent care management. By focusing on high-risk patients, healthcare providers can optimize scheduling and resource utilization, thereby enhancing overall healthcare efficiency.

This is a single-center study lacking external validation. Due to its interdisciplinary nature and issues related to the management of pathological slides, pathological slides for some patients could not be obtained. This ultimately limited the sample size included in the study, consequently affecting the generalization ability of the research model to some extent. Moreover, in the design of this study’s analysis, we only utilized IDH status for subgroup analysis and did not analyze other molecular markers, such as MGMT methylation and 1p/19q deletion status. Furthermore, we did not investigate the correlation between IDH status and pathological features, which resulted in a lack of analysis regarding the biological interpretability of pathomics. Finally, this study only predicted progression-free survival (PFS) and did not include the important overall survival (OS) indicator. In the future, we plan to expand the sample size and conduct multi-center collaborations, while incorporating more molecular features and further exploring their relationships with pathological features. Moreover, we will utilize the model to further predict patients’ survival outcomes to provide insights for selecting clinical treatment options.

Our study established a prediction model based on WSI to forecast the prognosis of high-grade gliomas. The combined model, integrating clinical data and pathological features, outperformed other models in terms of predictive performance. Furthermore, the model’s ability to classify patient survival based on the IDH status enhanced its predictive capacity. This study provides valuable insights for improving personalized treatment strategies and prognostic assessment of high-grade gliomas.

Statements

Data availability statement

The datasets presented in this article are not readily available because this dataset contains patient privacy. Requests to access the datasets should be directed to SZ, zhoushu164086035@126.com.

Ethics statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent from the patients/participants or patients/participants’ legal guardian/next of kin was not required to participate in this study due to the retrospective nature of this study.

Author contributions

YZ: Formal analysis, Writing – original draft, Investigation. YG: Data curation, Writing – review & editing, Conceptualization, Formal analysis. WX: Validation, Software, Writing – original draft. XS: Validation, Writing – original draft, Visualization. GJ: Writing – original draft, Formal analysis, Data curation. LQ: Conceptualization, Writing – review & editing, Data curation, Methodology. KS: Validation, Writing – original draft, Supervision. MW: Writing – review & editing, Methodology. YF: Writing – original draft. JY: Writing – review & editing. JL: Writing – original draft. YL: Writing – original draft. YC: Writing – review & editing. MP: Writing – original draft. SZ: Conceptualization, Supervision, Project administration, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2025.1614678/full#supplementary-material

References

1.
Ostrom QT Price M Neff C Cioffi G Waite KA Kruchko C et al . CBTRUS statistical report: primary brain and other central nervous system tumors diagnosed in the United States in 2015–2019. Neuro Oncol. (2022) 24:v1–v95. doi: 10.1093/neuonc/noac202
2.
Faris MM Dhillon HM Campbell R GKB H Miller A Chan RJ et al . Unmet needs in people with high-grade glioma: defining criteria for stepped care intervention. JNCI Cancer Spectr. 8:pkae034. doi: 10.1093/jncics/pkae034
3.
Frosina G . Radiotherapy of high-grade gliomas: dealing with a stalemate. Crit Rev Oncol Hematol. (2023) 190:104110. doi: 10.1016/j.critrevonc.2023.104110
4.
Petronek MS Monga V Bodeker KL Kwofie M Lee CY Mapuskar KA et al . Magnetic resonance imaging of iron metabolism with T2* mapping predicts an enhanced clinical response to pharmacologic ascorbate in patients with GBM. Clin Cancer Res. (2024) 30:283–93. doi: 10.1158/1078-0432.Ccr-22-3952
5.
Park JE Kim HS Kim N Kim YH Kim JH Kim E et al . Low conductivity on electrical properties tomography demonstrates unique tumor habitats indicating progression in glioblastoma. Eur Radiol. (2021) 31:6655–65. doi: 10.1007/s00330-021-07976-w
6.
Zhang Y Yang Z Chen R Zhu Y Liu L Dong J et al . Histopathology images-based deep learning prediction of prognosis and therapeutic response in small cell lung cancer. npj Digit Med. (2024) 7:15. doi: 10.1038/s41746-024-01003-0
7.
Wu Y Li Y Xiong X Liu X Lin B Xu B . Recent advances of pathomics in colorectal cancer diagnosis and prognosis. Front Oncol. (2023) 13:1094869. doi: 10.3389/fonc.2023.1094869
8.
Wei Z Xv Y Liu H Li Y Yin S Xie Y et al . A CT-based deep learning model predicts overall survival in patients with muscle invasive bladder cancer after radical cystectomy: a multicenter retrospective cohort study. Int J Surg. (2024) 110:2922–32. doi: 10.1097/JS9.0000000000001194
9.
Zhong H Huang D Wu J Chen X Chen Y Huang C . ¹⁸F-FDG PET/CT based radiomics features improve prediction of prognosis: multiple machine learning algorithms and multimodality applications for multiple myeloma. BMC Med Imaging. (2023) 23:87. doi: 10.1186/s12880-023-01033-2
10.
Liu CZ Sicilia R Tortora M Cordelli E Nibid L Sabarese G et al . (2021). Exploring deep pathomics in lung cancer. IEEE 34th International Symposium on Computer-Based Medical Systems (CBMS)
- Google Scholar
11.
Li W Xiao J Zhang C Di X Yao J Li X et al . Pathomics models for CD40LG expression and prognosis prediction in glioblastoma. Sci Rep. (2024) 14:24350. doi: 10.1038/s41598-024-75018-8
12.
Mahootiha M Tak D Ye Z Zapaishchykova A Likitlersuang J Climent Pardo JC et al . Multimodal deep learning improves recurrence risk prediction in pediatric low-grade gliomas. Neuro Oncol. (2025) 27:277–90. doi: 10.1093/neuonc/noae173
13.
Shi L Shen L Jian J Xia W Yang KD Tian Y et al . Contribution of whole slide imaging-based deep learning in the assessment of intraoperative and postoperative sections in neuropathology. Brain Pathol. (2023) 33:e13160. doi: 10.1111/bpa.13160
14.
He Y Duan L Dong G Chen F Li W . Computational pathology-based weakly supervised prediction model for MGMT promoter methylation status in glioblastoma. Front Neurol. (2024) 15:1345687. doi: 10.3389/fneur.2024.1345687
15.
Jiang S Zanazzi GJ Hassanpour S . Predicting prognosis and IDH mutation status for patients with lower-grade gliomas using whole slide images. Sci Rep. (2021) 11:16849. doi: 10.1038/s41598-021-95948-x
16.
Lin NU Lee EQ Aoyama H Barani IJ Barboriak DP Baumert BG et al . Response assessment criteria for brain metastases: proposal from the RANO group. Lancet Oncol. (2015) 16:e270–8. doi: 10.1016/s1470-2045(15)70057-4
- CrossRef
- Google Scholar
17.
Schober P Boer C Schwarte LA . Correlation coefficients: appropriate use and interpretation. Anesth Analg. (2018) 126:1763–8. doi: 10.1213/ane.0000000000002864
- CrossRef
- Google Scholar
18.
Luo Y Li Y Fang M Wang S Shao L Zou R et al . Multi-omics synergy in oncology: unraveling the complex interplay of radiomic, genoproteomic, and pathological data. Intell Oncol. (2025) 1:17–30. doi: 10.1016/j.intonc.2024.10.003
- CrossRef
- Google Scholar
19.
Lu C Shiradkar R Liu Z . Integrating pathomics with radiomics and genomics for cancer prognosis: a brief review. Chin J Cancer Res. (2021) 33:563–73. doi: 10.21147/j.issn.1000-9604.2021.05.03
20.
Dia AK Ebrahimpour L Yolchuyeva S Tonneau M Lamaze FC Orain M et al . The cross-scale association between pathomics and radiomics features in immunotherapy-treated NSCLC patients: a preliminary study. Cancer. (2024) 16:348. doi: 10.3390/cancers16020348
21.
Bülow RD Hölscher DL Costa IG Boor P . Extending the landscape of omics technologies by pathomics. npj Syst Biol Appl. (2023) 9:38. doi: 10.1038/s41540-023-00301-9
22.
Xu B Yang G . Interpretability research of deep learning: a literature survey. Inf Fusion. (2025) 115:102721. doi: 10.1016/j.inffus.2024.102721
- CrossRef
- Google Scholar
23.
Dinh L Pascanu R Bengio S Bengio Y . (2017). Proceedings of the 34th International Conference on Machine Learning. 1019–1028.
- Google Scholar
24.
Rathore S Chaddad A Iftikhar MA Bilello M Abdulkadir A . Combining MRI and histologic imaging features for predicting overall survival in patients with glioma. Radiol Imaging Cancer. (2021) 3:e200108. doi: 10.1148/rycan.2021200108
25.
Zadeh Shirazi A Fornaciari E Bagherian NS Ebert LM Koszyca B Gomez GA . DeepSurvNet: deep survival convolutional network for brain cancer survival rate classification based on histopathological images. Med Biol Eng Comput. (2020) 58:1031–45. doi: 10.1007/s11517-020-02147-3
26.
Shiri FM Perumal T Mustapha N Mohamed R . A comprehensive overview and comparative analysis on deep learning models. J Artif Intell. (2024) 6:301–60. doi: 10.32604/jai.2024.054314
- CrossRef
- Google Scholar
27.
Chang B Geng Z Mei J Wang Z Chen P Jiang Y et al . Application of multimodal deep learning and multi-instance learning fusion techniques in predicting STN-DBS outcomes for Parkinson's disease patients. Neurotherapeutics. (2024) 21:e00471. doi: 10.1016/j.neurot.2024.e00471
28.
Tian Y Hao W Jin D Chen G Zou A . (2020). A review of latest multi-instance learning. Proceedings of the 2020 4th International Conference on Computer Science and Artificial Intelligence
- Google Scholar
29.
Wu H Tong H Du X Guo H Ma Q Zhang Y et al . Vascular habitat analysis based on dynamic susceptibility contrast perfusion MRI predicts IDH mutation status and prognosis in high-grade gliomas. Eur Radiol. (2020) 30:3254–65. doi: 10.1007/s00330-020-06702-2
30.
Choate KA Pratt EPS Jennings MJ Winn RJ Mann PB . IDH mutations in glioma: molecular, cellular, diagnostic, and clinical implications. Biology. (2024) 13:885. doi: 10.3390/biology13110885
31.
Zhao Y Wang W Ji Y Guo Y Duan J Liu X et al . Computational pathology for prediction of isocitrate dehydrogenase gene mutation from whole slide images in adult patients with diffuse glioma. Am J Pathol. (2024) 194:747–58. doi: 10.1016/j.ajpath.2024.01.009
32.
Chunduru P Phillips JJ Molinaro AM . Prognostic risk stratification of gliomas using deep learning in digital pathology images. Neurooncol Adv. (2022) 4:vdac111. doi: 10.1093/noajnl/vdac111
33.
Ballo MT Conlon P Lavy-Shahaf G Kinzel A Vymazal J Rulseh AM . Association of tumor treating fields (TTFields) therapy with survival in newly diagnosed glioblastoma: a systematic review and meta-analysis. J Neurooncol. (2023) 164:1–9. doi: 10.1007/s11060-023-04348-w
34.
Lai S Li P Liu X Liu G Xie T Zhang X et al . Efficacy and safety of anlotinib combined with the STUPP regimen in patients with newly diagnosed glioblastoma: a multicenter, single-arm, phase II trial. Cancer Biol Med. (2024) 21:433–44. doi: 10.20892/j.issn.2095-3941.2023.0373

Summary

Keywords

high-grade gliomas, deep learning, prognostic analysis, pathomics, IDH

Citation

Zhu Y, Gong Y, Xu W, Sun X, Jiang G, Qiu L, Shi K, Wu M, Fei Y, Yuan J, Luo J, Li Y, Cao Y, Pan M and Zhou S (2025) Deep learning and pathomics analyses predict prognosis of high-grade gliomas. Front. Neurol. 16:1614678. doi: 10.3389/fneur.2025.1614678

Received

19 April 2025

Accepted

18 July 2025

Published

11 August 2025

Volume

16 - 2025

Edited by

Camilla Russo, Santobono-Pausilipon Children’s Hospital, Italy

Reviewed by

Chunbao Chen, Affiliated Hospital of North Sichuan Medical College, China

Simone Coluccino, Università della Campania Luigi Vanvitelli, Italy

Updates

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yuandong Cao, yuandongcao@163.com; Minhong Pan, minhongpan@aliyun.com; Shu Zhou, zhoushu164086035@126.com

†These authors share first authorship

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

ORIGINAL RESEARCH article

Deep learning and pathomics analyses predict prognosis of high-grade gliomas

Abstract

1 Introduction

2 Method

2.1 Datasets and workflow

2.2 Treatment

2.3 Clinical data acquisition

2.4 Data processing

2.5 Patch-level deep learning model training

2.6 Multi-instance learning for WSI fusion

2.7 Feature extraction & selection

2.8 Model building

2.8.1 Pathomics-based model

2.8.2 Clinical model

2.8.3 Combined model

2.9 Model performance evaluation and survival analysis

2.10 Statistical analysis

3 Result

3.1 Patient characteristics

3.2 Patch level efficiency

3.3 Grad-CAM visualization

3.4 Model construction and predictive performance

3.4.1 Model construction and signature comparison

3.4.2 Time-dependent ROC analysis and development of a nomogram

3.4.3 Risk stratification and IDH status distinguish prognosis

4 Discussion

Statements

Data availability statement

Ethics statement

Author contributions

Funding

Conflict of interest

Generative AI statement

Publisher’s note

Supplementary material

References

Summary

Outline

Figures

Cite article

Share article

Article metrics