Integrating pathomics and deep learning for subtyping uveal melanoma: identifying high-risk immune infiltration profiles

Wan, Qi; Wei, Ran; Yin, Hongbo; Tang, Jing; Deng, Ying-ping; Ma, Ke

doi:10.3389/fimmu.2025.1585097

ORIGINAL RESEARCH article

Front. Immunol., 09 July 2025

Sec. Cancer Immunity and Immunotherapy

Volume 16 - 2025 | https://doi.org/10.3389/fimmu.2025.1585097

This article is part of the Research TopicPrecision Oncology in Checkpoint Immunotherapy: Leveraging Predictive Biomarkers for Personalized TreatmentView all 26 articles

Integrating pathomics and deep learning for subtyping uveal melanoma: identifying high-risk immune infiltration profiles

Ke Ma^*

Department of Ophthalmology, West China Hospital of Sichuan University, Chengdu, Sichuan, China

Purpose: Uveal melanoma (UVM) is the most common primary intraocular malignancy in adults, characterized by high mortality despite its relatively low incidence. This study aimed to utilize unsupervised learning techniques to identify a high immune infiltration subtype of UVM and improve patient stratification based on mortality risk.

Methods: A total of 70 hematoxylin and eosin (H&E) stained whole-slide images (WSIs) of UVM were collected from the Genomic Data Commons (GDC) data portal, along with genomic and clinical data. An additional validation cohort of 68 UVM patients from West China Hospital was included. Pathomic features were extracted using CellProfiler software, and deep learning models were constructed for classification and survival prediction. Unsupervised clustering was performed to identify critical regions for prognosis prediction and patient classification. The relationship between histopathological features and genomics was explored.

Results: The study achieved accurate prediction and classification of UVM patients using deep learning models and machine learning techniques. A high immune infiltration subtype of UVM was identified, which showed prognostic relevance. Unsupervised clustering categorized UVM patients into three distinct subgroups. The developed deep learning model based on the Inception-V3 architecture demonstrated promising results in survival prediction.

Conclusion: This study demonstrates the potential of unsupervised learning and deep learning techniques in identifying a high immune infiltration subtype of UVM and improving patient stratification based on mortality risk. This research contributes to the field of computational pathology and highlights the potential of utilizing histopathological images, genomic data, and deep learning models in enhancing the management of UVM patients.

1 Background

Uveal melanoma (UVM) is the most common intraocular malignancy in adults and the most frequent type of non-cutaneous melanoma. It is also the primary lethal ocular disease in adults. In the United States, the annual incidence of UVM is reported to be 5.2 cases per million (per year) (1, 2). Based on the cytological morphology of tumor tissues, three subtypes of UVM can be distinguished: spindle, epithelioid, and mixed cell types. Epithelioid cell type has the worst prognosis, accounting for approximately 3-5% of all UVM cases, while spindle cell type has the best prognosis, accounting for over 40% of all UVM cases. The remaining 50% of UVM patients belong to the mixed cell type (3). The tumor cells of UVM originate from melanocytes with pigmentation in the uvea, with 90% of tumors originating from the choroid, approximately 6% from the ciliary body, and 4% from the iris (4). Since UVM is a very diverse tumor, chromosomal changes and gene mutations are thought to be the primary factors for it to arise and spread (5). At present, brachytherapy, proton therapy, enucleation, and stereotactic radiation are the primary approaches used to treat UVM. However, patients diagnosed with UM often face a dismal prognosis primarily due to the high likelihood of metastasis, particularly to the liver, which significantly diminishes survival rates. While local recurrence is not directly linked to poor prognosis, it increases the risk of distant metastases, a major cause of mortality in UVM. Additionally, metastatic UVM cells frequently exhibit resistance to existing therapeutic options, including chemotherapy and targeted treatments, further complicating disease management. Over 50% of primary UVM patients eventually develop distant metastases, with the liver being affected in up to 90% of cases. Typically, the median survival period after metastasis is 10–13 months (6). Patients with uveal melanoma who get early identification and surgical therapy for metastatic UVM may have better overall survival (OS) and progression-free survival. Consequently, prompt diagnosis and therapy are useful steps to enhance the clinical outcome of UVM.

The diagnostic results of pathology directly influence the selection of treatment plans and the prediction of prognosis. Currently, the most frequently employed in clinical practice is the hematoxylin and eosin (H&E) staining of histopathological sections, which is simple, cost-effective, and the preferred auxiliary examination for clinicians (7). In addition, numerous clinical organizations are producing more whole-slide images (WSIs) as a result of developments in scanning equipment, imaging methods, and storage devices. AI and deep learning (DL) methods can be used to examine these images (8). AI-assisted detection and automatic categorization of H&E-stained whole slide images, for example, can aid in identifying the main lesion of malignancies that are unknown in origin (9), grade prostate cancer to a standard that is equivalent to that of skilled pathologists (10), more accurate than conventional cancer staging at predicting the prognosis of patients with colorectal cancer (11), and identify breast cancer’s lymph node metastases (12). Furthermore, our previous studies have demonstrated the accurate prediction and classification of UVM patients using deep learning models and machine learning techniques via WSIs, with an accuracy rate of over 90% in predicting patients’ survival prognosis (13).

Although DL frameworks have achieved impressive performance in segmentation and classification tasks, they still require supervision from pathologists, and the annotation process still demands significant resources (14). To address this limitation, we utilized CellProfiler software for the extraction of pathomics in tumor areas. Pathomics is a new research method that enables automated processing and analysis of a large number of pathological images (15). It calculates features such as cell nucleus and cytoplasm morphology, tissue spatial structure, and extracts valuable information to assist in pathological diagnosis and support disease research (16–18). Due to the presence of different disease subtypes and varying degrees of disease progression, there is evident heterogeneity between cell tissues. The application development of pathomics can be used to explore heterogeneity within tumors, diagnose clinical outcomes, and predict treatment responses (19, 20).

Additionally, through unsupervised clustering, we identified critical regions to predict the prognosis and classification of UVM patients. Additionally, we explored the relationship between histopathological features and genomics in an exploratory manner. We believe that mutations in oncogenes and long-term abnormal expression contribute to the pathological process of transitioning from quantitative to qualitative changes in tissue pathology, and our research aims to provide evidence for this process.

2 Materials and methods

2.1 Data collection

This study gathered 70 WSIs of uveal melanoma (TCGA-UVM) stained with H&E from the Genomic Data Commons (GDC) data portal, in addition to pertinent genomic information and clinical features such as age, gender, tumor stage, histological type, and metastasis status. Additionally, a validation cohort (WCH-UVM) comprising 68 UVM patients from West China Hospital in Chengdu, China, were included, involving the collection of H&E-stained UVM samples and corresponding clinical data. This study was conducted in accordance with the Declaration of Helsinki. The agreement and written informed consent of WCH-UVM were acquired. The research protocol was reviewed and approved by the Ethics Committee of West China Hospital, Sichuan University (Approval No. 20242000). These WSIs served as the basis for exploring pathomic features and deep-learning features in uveal melanoma patients. The simplified flow diagram for the Study design is illustrated in Figure 1A. The work has been reported in line with the REMARK (Reporting recommendations for tumor marker prognostic studies) criteria (21).

Figure 1

Flowchart illustrating a multi-step analysis process using whole-slide images (WSIs) and RNA sequencing. Section A outlines RNA sequencing and WSI processing including pathomics, KNN clustering, and unsupervised subtyping. Section B details six steps: image preprocessing, pathologic feature extraction with CellProfiler, KNN clustering of tiles, identifying unsupervised subtypes, selecting tiles for deep learning, and multi-task analysis incorporating deep learning and machine learning. Various charts and images depict data transformation, clustering, survival estimation, and feature extraction. The workflow aims to integrate RNA, deep learning, and pathologic features into a comprehensive nomogram.

Figure 1. The overall study diagram for the current study. (A) The detail flow work for the whole design of study. (B) The detail steps of data processing, pathomics analysis and deep-learning network construction.

To expand the scope of the investigation, additional genomic data from open-access resources (ArrayExpress databases and the Gene Expression Omnibus) were collected. The selection process for appropriate cohorts involved specific criteria: 1) samples originated from human subjects, 2) cohorts included survival data, and 3) cohorts derived from independent studies. Using these criteria, the study incorporated 250 samples from five UVM cohorts (E-MTAB-4097, GSE22138, GSE27831, GSE44295, and GSE84976).

2.2 Data processing

First, using Qupath software (v.0.2.3) to achieve accurate identification and delineation of tumor regions within WSIs and ensuring annotation quality, followed by manual review of the WSIs and careful annotation of tumor areas while excluding regions with excessive background or no tissue; subsequently, the WSIs were divided into non-overlapping tiles of 1024x1024 pixels, and pathomics feature extraction as well as deep-learning network construction was based on selecting tiles with a tumor mask overlap of more than 50%; furthermore, the raw RNA-seq data underwent preprocessing steps including probe set conversion, determination of gene expression values, and log2 + 1 transformation for standardization and normalization, thereby providing a standardized foundation for bioinformatic analysis aimed at identifying gene biomarkers.

2.3 Pathomics feature extraction

The CellProfiler software is widely used in pathology image analysis, offering automated processing and analysis capabilities for large quantities of pathology images (22, 23). We used CellProfiler (version 4.2.6) to extract quantitative pathomics features of tiles from all tumor regions. Firstly, employ the “UnmixColors” module to separate H&E-stained images into hematoxylin and eosin-stained images. Then, convert the H&E-stained images into grayscale using the “ColorToGray” module, and evaluate image quality features of grayscale H&E, hematoxylin, and eosin images using the “MeasureImageQuality” and “MeasureImageIntensity” modules. Automatically calculate thresholds for each image using the Otsu algorithm to identify tissue foreground from unstained background and extract threshold features. Next, utilize the “MeasureColocalization” module to compute pixel-wise intensity colocalization and correlation between each eosin-stained image and hematoxylin-stained image. Lastly, respectively assess granularity and texture features of each image using the “MeasureGranularity” and “MeasureTexture” modules, outputting a size measurement spectrum for the textures in the image, with the granularity spectrum ranging as specified (15).

2.4 Tiles clustering and annotation

Firstly, following previously published protocols and RNA-seq data, we calculated eight indices related to stemness and the microenvironment in Uveal melanoma (24–27). These indices include stemness-related indices (mDNAsi and mRNAsi) and microenvironment-associated indices (DNA methylation of tumor-infiltrating lymphocytes (MeTILs) and the Cancer-associated fibroblasts (CAFs), stromal, immune, tumor purity, and estimate scores). Secondly, we utilized the K-Nearest Neighbor (KNN) algorithm to partition the tiles into eight clusters and employed correlation analysis to annotate the attributes of these clusters. Besides, pseudo-trajectory analysis was performed to compare the similarity of tile clusters between TCGA-UVM and WCH-UVM cohorts.

2.5 Unsupervised subtyping

Based on the classification results obtained through the KNN algorithm, we computed the proportions of each cluster in the UVM samples. Kaplan-Meier (K-M) survival analysis was then performed to find clusters that had prognostic importance. Additionally, we employed the “ConsensusClusterPlus(v.1.62.0)” program to categorize UVM patients into three distinct subgroups. This program utilized the unsupervised clustering method “Pam”, with a maximum number of clusters set to maxK=4. The clustering process was performed over 1,000 iterations using Euclidean distance and Ward’s linkage for consistent and reproducible classification.

2.6 Ensemble deep learning for multi-task

The tiles in survival-related clusters were selected for deep-learning construction. Two independent datasets were randomly selected from the TCGA-UVM cohort: a train dataset (70%) and a test dataset (30%). The train dataset was utilized to create the model and fine-tune the hyperparameters, while the testing dataset and WCH-UVM dataset were employed for model evaluation. To enhance the diversity and quality of training data, data normalization and augmentation techniques were employed, including horizontal flip, vertical flip, random rotation, etc. Combined with the implementation of a weakly supervised method and subtypes for supervision, an Inception-V3 model was trained over 30 epochs, utilizing the SGD optimizer with a learning rate of 10–2 and L2 regularization with a weight decay of 10-5. Subsequently, a classifier based on the Inception-V3 deep-learning architecture was employed to assign labels to all tiles within each WSI, resulting in the generation of a heatmap depicting the probability prediction for each tile. Considering the large number of tiles within the WSIs, these probability tiles were combined to create a heatmap representing the probability distribution across the entire WSI. The Bag-of-Words (BoW) algorithm was applied to calculate term frequency-inverse document frequency (TF-IDF) features from the probabilities. The TF-IDF features (defined as DL features) were further analyzed to conduct multi-task at the WSI-level via machine learning.

2.7 Pathomics and deep-learning -features risk models

Pathomics and DL risk models were constructed to predict patient prognosis using DL-features and clustering proportions. Initially, the candidate whole WSI-level DL features were obtained through lasso regression. These features were then utilized as inputs for the Cox regression model implemented in the “survival” package of R software. The Cox model was employed to calculate the DL risk scores for each patient. Additionally, survival-related clusters were incorporated into another Cox regression model to estimate the pathological risk scores for each patient.

2.8 RNA-features construction

On the basis of cluster proportions, UVM patients were divided into three categories. To find differentially expressed genes (DEGs) connected to these subgroups, we used the “limma” approach and set a significance threshold of p < 0.05 and |log2FC| ≥ 2. To further explore the biological mechanisms between different subtypes, we performed Reactome pathway enrichment analysis. Additionally, we used Cox regression to identify prognostic-related DEGs and applied unsupervised clustering to classify UVM patients. The genes positively and negatively correlated with genomic clustering were defined as signature genes p and n, respectively. Next, we utilized the Boruta algorithm to identify important gene subsets within features p and n, followed by performing PCA calculations on these subsets. Finally, we extracted the first principal component (PC1) as the RNA-features. The RNA-features score in UVM samples was calculated as the sum of PC1(p) minus the sum of PC1(n) using the formula: ∑PC1(p) - ∑PC1(n). To assess the predictive capability of the RNA-features, the RNA-features scores of UVM patients were independently examined in six distinct UVM cohorts (TCGA-UVM, E-MTAB-4097, GSE22138, GSE27831, GSE44295, and GSE84976). Each patient was classified as either high-risk or low-risk based on the optimal cutoff score, and the disparity in survival outcomes between the two groups was analyzed using Kaplan-Meier curves and log-rank tests. Additionally, we conducted a meta-analysis to integrate and comprehensively evaluate the risk ratios and survival outcomes across different cohorts.

2.9 Statistical analysis

In this study, we utilized Python (v.3.8.0) and R (v.4.2.2) alongside relevant packages to perform various statistical analyses. The proposed deep learning model was implemented using PyTorch, utilizing a GPU (Nvidia GeForce RTX-3080 with 10 GB memory). Machine learning algorithms were executed using the “sklearn” package in Python. K-M and receiver operating characteristic curves (ROCs) were visualized using the “survminer” and “survivalROC” packages, respectively in R. The best cutoff value was decided upon the “survminer” package, while the Pearson test was used to evaluate the association. For comparisons between groups, the Wilcoxon test, and chi-square test were employed. Hazard ratios (HR) and 95% confidence intervals (CI) were computed using Cox regression analysis, with a p-value threshold of less than 0.05 being used to evaluate statistical significance.

3 Results

3.1 Participants characteristics

This study included two whole slide image (WSI) cohorts and three datasets: the training dataset consisted of 42 consecutive UVM patients from the TCGA-UVM cohort, the testing dataset comprised 42 consecutive UVM patients from the TCGA-UVM cohort, and the validation dataset contained 68 consecutive patients from the WCH-UVM cohort. Statistical analysis of clinical and pathological features revealed no significant differences between the training and testing datasets. However, there were significant differences observed among the three datasets regarding overall survival (OS) time, age, and histological type. The average OS time was 864.12 ± 573.36 days in the training dataset, 704.68 ± 471.53 days in the testing dataset, and 1439.56 ± 973.42 days in the validation dataset. The mean age in the training and testing datasets was 60.40 ± 13.59 years and 64.91 ± 15.11 years, respectively, while the mean age in the validation dataset was 50.81 ± 12.47 years. Furthermore, the proportion of the epithelioid histological type in the three datasets was 16.7%, 18.2%, and 36.8%, respectively (Table 1).

Table 1

Table 1. Clinical features of train, test and validate datasets.

3.2 Pathomics feature extraction

As shown in step 1 at Figure 1B, the WSIs underwent cropping and filtering processes. In the TCGA-UVM cohort, we obtained a total of 30,875 qualified tiles, and in the WCH-UVM cohort, we obtained 18,179 valid tiles. These tiles were then subjected to analysis using CellProfiler, resulting in a final set of 180 quantitative pathomics features available for each tile.

3.3 Tiles clustering and annotation

To select a more distinctive subset of tiles from the WSI, we performed clustering on the tiles based on their histopathological features and identified ones with greater discriminatory power. This pipeline consisted of four steps: tiles clustering analysis, calculation of tile category proportions, annotation of tiles, and pseudo trajectory analysis of tiles, as illustrated in step 3 at Figure 1B.

Using the KNN algorithm, all 30,875 tiles from the TCGA-UVM cohort were partitioned into eight clusters (Figure 2A). Similarly, the 18,179 tiles from the WCH-UVM cohort were also clustered into eight clusters (Figure 2B). The proportions of tiles clustering categories in each UVM sample were calculated, and the visualization of the proportions for the eight clusters in the TCGA-UVM cohort and WCH-UVM cohort samples was presented in Figures 2C, D, respectively. A correlation analysis heatmap revealed that Cluster0 had a positive correlation with MeTILs, immune scores, and estimate scores, while exhibiting a negative correlation with tumor purity (Figure 2E). Furthermore, we incorporated the cluster proportion scores as variables along with patient survival information and performed Kaplan-Meier analysis. In the TCGA-UVM cohort, we found that Cluster0, Cluster1, and Cluster3 were associated with UVM prognosis. Cluster0 had a hazard ratio (HR) of 2.46, indicating a survival risk factor, whereas Cluster1 and Cluster3 had HRs of 0.29 and 0.18, respectively, indicating protective factors for UVM survival (Figure 2F). Similarly, in the WCH-UVM cohort, we observed that Cluster3, Cluster4, and Cluster5 were associated with UVM prognosis. Cluster3 had an HR of 4.48, indicating a survival risk factor, while Cluster4 (HR=0.3) and Cluster5 (HR=0.017) were identified as protective factors for UVM survival (Figure 2G).

Figure 2

Cluster analysis and survival probability graphs for TCGA-UVM and WCH-UVM datasets. Panels A and B feature scatter plots with clusters labeled by color. Panels C and D are circular bar charts showing the estimated proportion of clusters. Panel E is a correlation heatmap for various biological scores across clusters. Panels F and G contain Kaplan-Meier survival curves comparing cluster survival probabilities, with statistical annotations including p-values and hazard ratios.

Figure 2. Tiles clustering and annotation. (A) Visual representation of the eight clusters of the 30,875 tiles from the TCGA-UVM cohort. (B) Visualization of the eight clusters of the18,179 tiles from the WCH-UVM cohort. (C) The relative proportions of the eight clusters in the TCGA-UVM cohort. (D) The relative proportions of the eight clusters in the WCH-UVM cohort. (E) The correlation heatmap of eight clusters and eight indices which included stemness-related indices (mDNAsi and mRNAsi) and microenvironment-associated indices (DNA methylation of tumor-infiltrating lymphocytes (MeTILs) and The Cancer associated fibroblasts (CAFs), stromal, immune, tumor purity, and estimate scores). (F) Kaplan–Meier (K-M) curves for survival analysis of Cluster0, Cluster1 and Cluster3 in TCGA-UVM cohort. (G) K-M curves of Cluster3, Cluster4 and Cluster5 in WCH-UVM cohort.

Pseudo trajectory analysis revealed that Cluster0 in the TCGA-UVM cohort (Figure 3A) and Cluster 3 in the WCH-UVM cohort (Figure 3B) are both distributed on the inner side of the trajectory. On the other hand, Cluster1 and Cluster3 in the TCGA-UVM (Figure 3A), as well as Cluster4 and Cluster5 in the WCH-UVM (Figure 3B), were distributed on the outer side of the trajectory. Additionally, the dendrogram tree also indicates that these clusters exhibit similar distribution characteristics. To further validate the consistency of the identified categories in both cohorts, we visualized a sample from each dataset separately. Figure 3C represents a complete WSI from the TCGA-UVM dataset, and the clustered and stitched heatmap was shown in Figure 3D. The pie chart in Figure 3E illustrates the relative proportions of Cluster0, Cluster1, and Cluster3. By overlaying the category heatmap with the original image, tile images corresponding to each cluster were identified (Figure 3F). Additionally, Figure 3G depicted a complete WSI from the WCH-UVM cohort, and the clustered and stitched heatmap was shown in Figure 3H. The pie chart in Figure 3I illustrates the relative proportions of Cluster3, Cluster4, and Cluster5. Similarly, by overlaying the category heatmap with the original image, tile images corresponding to each cluster were identified (Figure 3J).

Figure 3

Scatter plots and dendrograms represent data clustering with k-nearest neighbors (KNN) in two separate panels labeled A and B. Different colored dots indicate clusters. Panels C and G show histological images. Panels D and H exhibit pixelated representations. Panels E and I depict pie charts showing cluster proportions. Panels F and J provide zoomed-in views with color-coded sections corresponding to specific clusters. The labels indicate “TCGA-UVM” and “WCH-UVM” with sample IDs.

Figure 3. Pseudo trajectory analysis and clusters display. (A) The pseudo trajectory and dendrogram tree of Cluster0, Cluster1 and Cluster3 in TCGA-UVM cohort. (B) The pseudo trajectory and dendrogram tree of Cluster3, Cluster4 and Cluster5 in WCH-UVM cohort. (C) One representative whole-slide image (WSI) in TCGA-UVM cohort. (D) Cluster heatmap for WSI-level. (E) The relative proportions of the eight clusters in WSI-level. (F) The tile clustering and tiles selection for Cluster0, Cluster1 and Cluster3 in representative WSI. (G) One example of whole-slide image (WSI) in WCH-UVM cohort. (H) Cluster heatmap for WSI-level. (I) The relative proportions of the eight clusters in WSI-level. (J) The tile clustering and tiles selection for Cluster3, Cluster4 and Cluster5 in example of WSI.

3.4 Unsupervised subtyping

Based on survival analysis, we included Cluster0, Cluster1, and Cluster3 from TCGA-UVM, and Cluster3, Cluster4, and Cluster5 from WCH-UVM in the unsupervised clustering analysis. By considering the similarity in cluster proportions, we further separated UVM patients into subgroups. Stable clustering was indicated by a continuous rise in the values of the cumulative distribution function (CDF). Ultimately, through unsupervised clustering (k=3), we identified three stable subtypes. In TCGA-UVM, these subtypes included subtype 1 (9 UVMs), subtype 2 (21 UVMs), and subtype 3 (40 UVMs). Similarly, the WCH-UVM dataset was also divided into three subtypes: subtype 1 (26 UVMs), subtype 2 (19 UVMs), and subtype 3 (23 UVMs). Furthermore, to investigate the connection between subtypes and clinical characteristics, the complete heatmaps of TCGA-UVM and WCH-UVM datasets were displayed in Figures 4A, B. K-M survival curve analysis revealed that subtype 2 in TCGA-UVM out of the three subtypes had the poorest prognosis (Figure 4C, log-rank p=0.035), while subtype 3 in WCH-UVM had the shortest survival time among the three subtypes (Figure 4D, log-rank p=0.041). Reactome pathway enrichment analysis showed that differentially expressed genes in subtype 2 of TCGA-UVM were mainly enriched in immune-related pathways, such as Interferon alpha/beta signaling (Figure 4E). In addition, boxplot analysis revealed that immune scores and estimated scores of subtype 2 were significantly higher than those of subtype 1 and subtype 3, while tumor purity was significantly lower than that of subtype 1 and subtype 3 (Figure 4F). Therefore, we defined subtype 2 in TCGA-UVM and subtype 3 in WCH-UVM as the high-infiltration subtype, and subtype 1 and subtype 3 in TCGA-UVM, as well as subtype 1 and subtype 2 in WCH-UVM, as the low-infiltration subtype. The different clinical features between high- and low- infiltration subtypes were listed in Table 2.

Figure 4

Heatmaps labeled A and B show clustered data with annotations for clusters, gene alterations, and patient demographics. Kaplan-Meier curves in C and D present survival probabilities for different cancer subtypes. Bar charts in E display pathways with Reactome enrichment data. Box plots in F compare expression values across subtypes and conditions, with statistical significance markers.

Figure 4. Unsupervised subtyping. (A) Subtyping of proportions of the three clusters in TCGA-UVM cohort by an unsupervised clustering method. Proportions of Cluster0, Cluster1 and Cluster3 are shown by rows, and UVM samples are represented by columns. (B) Unsupervised Subtyping of proportions of the three clusters in WCH-UVM cohort. Proportions of Cluster3, Cluster4 and Cluster5 are shown by rows, and UVM samples are represented by columns. (C) K-M curves for survival analysis of three subgroups of UVM patients in TCGA-UVM cohort. (D) K-M curves of three subgroups in WCH-UVM cohort. (E) The Reactome pathway enrichment analysis of differentially expressed genes among subgroups. (F) Box plots of eight indices in three subgroups. *means p<0.05. mDNAsi and mRNAsi: The stemness-related indices; MeTILs, DNA methylation of tumor-infiltrating lymphocytes; CAFs, The Cancer associated fibroblasts.

Table 2

Table 2. The different clinical features between high- and low- infiltration subtypes.

3.5 Ensemble deep-learning for multi-task

The subtype classifier for diagnosing high-infiltration in uveal melanoma patients underwent rigorous validation using both the TCGA-UVM dataset and the WCH-UVM cohort. The classifier involved two crucial steps: tile-level prediction and WSI-level prediction.

For tile-level prediction, 20,689 tiles from survival-related clusters were selected to construct a deep learning model. Subsequently, we randomly split the dataset of 20,689 tiles into training and validation sets at a 1:1 ratio. The performance of the Inception-V3 deep learning model was rigorously evaluated using ROC curves, precision-recall curves, and confusion matrices to ensure a reliable and robust assessment. The specific metrics and detailed performance results of the model are provided in Supplementary Figure S1. The convergence of accuracy and loss curves indicated that as the training epoch increased, accuracy approached 100% and loss approached 0% (Figure 5A). At the WSI level, multiple probable tiles were combined to create a comprehensive heatmap and corresponding histogram (Figure 5B). The entire slide was assessed using the Bag-of-Words (BoW) algorithm, extracting 101 deep learning (DL) features from the histogram of tile probabilities. After adjusting for false discovery rate (FDR) using the Wilcoxon test, 52 DL features showed significant differences with a p-value < 0.05. These features were then utilized in Lasso regression for dimensionality reduction (Figure 5C). The optimal lambda value was determined through 10-fold cross-validation (CV) (Figure 5D), leading to the identification of five DL features with coefficients > 0 as significant candidate variables (Figure 5E). These five features were employed in ten machine learning algorithms, and the Support Vector Machine (SVM) classifier was selected for high infiltration prediction based on the distribution of accuracy results (Figure 5F). The ROCs in the train, test, and validation datasets were 1.00, 1.00, and 0.975, respectively (Figure 5G). Subsequently, the five DL features were used as inputs for the Cox regression model to calculate risk scores for each patient. High- and low-risk groups of UVM patients were created by using an ideal cutoff value. In the TCGA-UVM dataset (Figure 5H, log-rank p < 0.0001) and the WCH-UVM cohort (Figure 5I, log-rank p < 0.0001), K-M survival analysis showed a poorer prognosis for high-risk patients.

Figure 5

Multiple data visualizations related to machine learning analysis: A) Loss and accuracy charts over iterations. B) Probability heatmap and histogram. C) LASSO regression path. D) Cross-validation error vs. lambda. E) LASSO identified variables bar chart. F) Circular plot of subtypes. G) SVM model ROC curves. H and I) Kaplan-Meier survival plots for high and low-risk strata with statistical significance.

Figure 5. Features selection and diagnosis for high-infiltration subtype at WSI-level. (A) The training convergence for Inception-V3: loss curve and accuracy curve. (B) An example of probable heatmap and histogram of probability. (C) The coefficient profiles of DL features in Lasso regression. (D) The distribution of mean squared error with the corresponding λ-logarithm value in 10-fold cross-validation using Lasso regression. (E) The selected DL features with coefficients > 0. (F) The accuracy distribution of ten machine learning algorithms for classify of subtype. (G) The ROC curves of SVM model for prediction of high-infiltration subtype in train, test and validate datasets. (H) K-M survival analysis of High- and low-risk groups of UVM patients in TCGA-UVM cohort. (I) K-M curves of High- and low-risk groups in WCH-UVM cohort.

In UVM patients, the most common gene mutations include EIF1AX, GNAQ, GNA11, SF3B1, and so on. Previous studies have shown that these mutations result in abnormal activation of cellular signaling pathways, promoting the proliferation and metastasis of melanoma cells. Therefore, the use of DL features to predict gene mutations would be beneficial for the treatment and management of UVM patients. Firstly, we employed the Wilcoxon test and FDR adjustment with a significance level of p < 0.05 to select DL features that exhibited differential gene mutations. Subsequently, we utilized ten machine learning algorithms to evaluate the accuracy of the predictions. The distribution of accuracy results indicated that RandomForest had the highest accuracy for predicting EIF1AX mutation (Figure 6A). The ROCs in the train, test, and validation datasets were 0.843, 0.800, and 0.783, respectively (Figure 6B). For the prediction of GNA11 mutation, XGBoost performed relatively better than other algorithms (Figure 6C). The AUCs were 0.613, 0.644, and 0.496 in the train, test, and validation datasets (Figure 6D). Regarding the prediction of GNAQ (Figure 6E) and SF3B1 mutations (Figure 6G), AdaBoost outperformed other algorithms. The AUCs of AdaBoost for GNA11 mutation were 0.857, 0.712, and 0.588 in the train, test, and validation datasets (Figure 6F). In predicting SF3B1 mutation, the AUCs of AdaBoost in the train, test, and validation datasets were 0.802, 0.806, and 0.618, respectively (Figure 6H).

Figure 6

The image consists of four circular bar charts labeled A, C, E, and G, showing different gene models (EIF1AX, GNA11, GNAQ, SFB3B1) with varying performance by algorithms like RandomForest, SVM, and GradientBoosting. Below, plots B, D, F, and H are ROC curves comparing model performances (RandomForest, XGBoost, AdaBoost) with different AUC values for training, validation, and testing datasets.

Figure 6. Prediction of gene mutation at WSI-level. (A) The accuracy distribution of ten machine learning algorithms for predicting mutation of EIF1AX. (B) The ROC curves of RandomForest model for predicting mutation of EIF1AX in train, test and validate datasets. (C) The distribution of accuracy for predicting GNA11 mutation. (D) The XGBoost model’s ROC curves for GNA11 mutation prediction in train, test, and validate datasets. (E) The distribution of accuracy for predicting GNAQ mutation. (F) The AdaBoost model’s ROC curves for GNAQ mutation prediction in train, test, and validate datasets. (G) The distribution of accuracy for predicting mutation of SF3B1. (H) The ROC curves for prediction of SF3B1 mutation via AdaBoost model in train, test, and validate datasets.

3.6 RNA-features construction

The development of RNA risk features is of significant importance for understanding the prognosis and potential therapeutic strategies of UVM. We utilized the TCGA-UVM cohort as a train set and employed five external validation sets (GSE84976, GSE27831, GSE22138, GSE44295, and E-MTAB-4097) to ensure the robustness of the study results. Firstly, in the TCGA-UVM cohort, we identified 457 differentially expressed genes (DEGs) based on different subgroups. Subsequently, using survival information and univariate Cox analysis, we identified 21 survival-associated DEGs. Based on these 21 genes, UVM samples were classified into three gene-related clusters. The feature p gene set consisted of 12 DEGs positively correlated with the gene clusters, while the feature n comprised 9 DEGs negatively correlated with the gene clusters. A visual heatmap was generated to illustrate the relationship between gene-related clusters and clinical features (Figure 7A). The log-rank test revealed that Cluster C exhibited a better survival prognosis (Figure 7B). We then used feature n and feature p, respectively, to further identify significant gene sets using the Boruta technique. Ultimately, the JUP gene was selected from feature p, and the UFC1 gene was selected from feature n for PCA calculations. Based on the formula, we obtained RNA-features scores for each UVM patient. The boxplot showed that scores of Cluster C were significantly lower than Cluster A and B (Figure 7C). Subsequently, using an optimal cutoff value, UVM patients were divided into high-score and low-score subgroups. The Log-rank test in the K-M curve demonstrated that patients with high scores had a poor overall survival than those with low scores (Figure 7D). Finally, a meta-analysis of all cohort results confirmed that RNA-features are an important risk factor influencing UVM survival, with a hazard ratio of 3.66 (Figure 7E). The funnel plot displayed a symmetric distribution on both sides of the centerline, indicating a low publication bias in the meta-analysis (Figure 7F).

Figure 7

A composite image of several data visualizations related to gene expression and survival analysis. Panel A shows a heatmap with various genetic and clinical attributes. Panel B presents a Kaplan-Meier survival curve with three strata labeled A, B, and C. Panel C includes a violin plot comparing gene expression between subtypes, showing a Kruskal-Wallis p-value. Panel D contains multiple Kaplan-Meier survival curves across different studies or datasets. Panel E is a forest plot displaying hazard ratios and confidence intervals from multiple studies. Panel F is a funnel plot showing standard error against hazard ratio for the studies.

Figure 7. Gene cluster and RNA-features construction. (A) Unsupervised clustering of three subtypes (Cluster A to C) based on the expression of differentially expressed genes (DEGs). (B) K-M curves of three gene clusters in TCGA-UVM cohort. (C) The distribution of RNA-features scores among three gene clusters. (D) K-M curves of high- and low-score groups of UVM patients in six independent cohorts (TCGA-UVM GSE22138, GSE27831, GSE44295, GSE84976, and E-MTAB-4097). (E) Meta-analysis of hazard ratios for RNA-features scores among six independent UVM cohorts. (F) The funnel plot of meta-analysis. * P<0.05; ***P<0.001.

3.7 Nomogram construction

To provide a comprehensive and accurate prognostic prediction method for UVM, we developed a comprehensive nomogram model. First, univariate Cox analysis revealed that DL-features, Cluster-features, RNA-features, age, stage, histological type, metastasis status, and SF3B1 mutation were correlated with overall survival (OS) (Figure 8A). However, in the multivariate Cox analysis, only age, gender, histological type, metastasis status, and DL-features were significantly associated with OS in UVM patients (Figure 8B). Therefore, we constructed a comprehensive nomogram model incorporating age, gender, histological type, metastasis status, and DL-feature to estimate the probabilities of 3 years and 5 years OS (Figure 8C). In the TCGA-UVM and WCH-UVM cohorts, the 3 years and 5 years nomogram calibration curves showed a significant degree of overlap between the actual and anticipated survival rates, demonstrating good predictive value (Figure 8D). Time-dependent ROC curves were used to assess the accuracy of the nomogram, and the AUC values for 1 year, 3 years, and 5 years predictions were all greater than 0.9, indicating good sensitivity and specificity of the nomogram (Figure 8E). Additionally, decision curve analysis (DCA) revealed that our nomogram model, which integrates pathomics features with traditional clinical features such as histological type and metastasis status, yielded higher net benefits compared to models relying solely on clinical features (Figure 8F). To assess the nomogram’s clinical usefulness, we further generated a clinical impact curve (CIC) using the DCA data. The nomogram’s higher overall net benefit within a broad and useful range of threshold probabilities was intuitively shown by the CIC, which also impacted prediction accuracy and showed the model’s strong predictive value (Figure 8G).

Figure 8

Panels A and B present forest plots displaying log2 hazard ratios for various factors like age, gender, and different genetic expressions, including statistical significance markers. Panel C shows a nomogram for predicting survival probabilities considering multiple features, such as deep learning features and metastasis status. Panels D and E feature calibration plots and ROC curves for prediction validation over several timeframes. Panels F and G display decision curve analyses and high-risk categorizations, showcasing model performance and cost-benefit ratios, respectively. All graphs are based on TCGA-UVM and WCH-UVM datasets.

Figure 8. Comprehensive nomogram construction. (A) The forest plots present the results of univariate Cox regression analyses for clinical parameters, RNA-features, Cluster-features and DL-features. (B) The forest plots of results from multivariate Cox regression of clinical parameters, RNA-features, Cluster-features and DL-features. (C) A comprehensive nomogram is used to predict the 3- and 5-year overall survival time for patients in the TCGA-UVM cohort, incorporating DL-features, age, gender, histological type, and metastasis status. (D) The 3- and 5-year of calibration curves for overall survival prediction in TCGA-UVM and WCH-UVM cohorts. (E) The ROC curves of nomogram model for survival prediction in TCGA-UVM and WCH-UVM cohorts. (F) The decision curves of nomogram, histological type, and metastasis for comparison of net benefit. (G) The clinical impact curve of the nomogram for risk prediction in TCGA-UVM and WCH-UVM cohorts.

4 Discussion

Traditional pathological examinations are conducted by experienced pathologists who assess tumor cell characteristics under multiple magnifications. However, pathologists typically do not provide detailed quantitative information for every region of a whole-slide image, and variability in pathological classification and diagnosis can occur due to the heterogeneity of histological subtypes and differences in individual interpretation. Therefore, pathomics can be used as a useful adjunct to more conventional pathological assessments. Our results demonstrate that, without any supervised information, important subregions of each WSI can be identified and objectively quantified through pathomics features. Additionally, by combining genomic information, we can effectively define the cellular biological characteristics of different subregions. Consequently, based on the relative proportions of these three subregions, we successfully distinguished three tumor subtypes (Subtype 1-3) and identified a high-infiltration subtype for UVM patients. We found that UVM patients in the high-infiltration subtype had higher immune scores, estimate scores, and MeTILs compared to the low-infiltration subtype, which correlated with poorer survival outcomes. These observations are consistent with previous research (28, 29). For instance, Narasimhaiah et al (30), have demonstrated that increased infiltration of immune cells, particularly T lymphocytes and macrophages, is linked to metastatic progression in UM and associated with poor prognosis. The eye is an immune-privileged organ, characterized by numerous immunosuppressive elements that prevent robust immune responses. This immune privilege is maintained by mechanisms that hinder the trafficking of activated T cells into tumor tissues and promote T cell exhaustion. As a result, the phenotype of infiltrated immune cells is often altered, converting their anti-tumor functions into pro-tumor roles. Extensive evidence suggests that immune cells linked with tumors in the UVM microenvironment stimulate both immune evasion and immunological repression (31, 32). Tumor-infiltrating lymphocytes (TILs) include CD8+ T cells and CD4+ T cells, for example, are unique independent prognostic markers for UVM patients and play essential roles in tumor recurrence, metastasis, dissemination, and responsiveness to immunotherapy (33–35). In general, our research indicates a potential close relationship between the histomorphology and underlying molecular composition of tumors.

Furthermore, our study is the first to apply integrated deep learning (DL) networks to learn from the entire WSI for diagnosing the immune infiltration subtypes of UVM patients and predicting their common gene mutation information. Our approach has two prominent advantages. First, it analyzes a collection of patches clusters automatically selected from several important subregions associated with prognosis, avoiding any manual annotation. Second, it assigns labels to each patch image through weakly supervised methods and aggregates local features using multiple instance learning to achieve global diagnosis. This approach eliminates the need for manual annotation to describe cancerous regions at the pixel level. Our study demonstrates that the integrated DL network we constructed achieves a significant accuracy rate of over 95% in predicting immune subtypes at the tile level and WSI level. Additionally, our model performs well in predicting SF3B1 and EIF1AX gene mutations. Previous studies have found a close association between SF3B1 and EIF1AX gene mutations and tumor metastasis, with most UVM patients with SF3B1 mutations eventually developing metastasis (36). However, it is rare for UVM patients carrying only EIF1AX gene mutations to experience metastasis (37). Moreover, the Cox survival model constructed using DL features can effectively distinguish high-risk and low-risk groups. Therefore, this model has broad clinical applications, enabling patients to obtain accurate predictions of metastasis and prognosis while receiving pathological diagnoses.

However, in clinical practice, it is insufficient to evaluate the progression and prognosis of UVM solely based on one data type. Therefore, based on the genomic data of different subtypes, differential expression genes (DEGs) were identified among these subtypes. Subsequently, we created a predictive gene signature in the TCGA-UVM cohort based on these DEGs, and we verified the prognostic significance of this signature in many separate datasets. The JUP and UFC1 genes are included in this gene signature. Junction plakoglobin (JUP) is an important cell-cell adhesion protein. JUP has been identified as a protein with great potential as a biomarker and therapeutic target for UVM (38). Recent studies have found that deregulation of JUP leads to the occurrence and progression of various malignancies (39). Hu et al. found that JUP can regulate the expression of Anterior Gradient 2 (AGR2)/LY6/PLAUR Domain Containing 3 (LYPD3) and mediate an immunosuppressive microenvironment in melanoma (40). Additionally, numerous studies have discovered a link between tumor invasion and elevated expression of the long noncoding RNA UFC1. Certain cancer cells’ ability to proliferate, migrate, and invade can be inhibited by knocking down UFC1, whereas cell cycle arrest and death are encouraged (41–44). Therefore, the gene signature identified through histopathological analysis can serve as a predictive biomarker in future clinical research. Finally, we combined traditional clinical features, histopathological cluster features, DL features, and RNA features for univariate and multivariate Cox regression analysis. We found that DL features, along with age, gender, histological type, and metastasis status, can serve as independent prognostic factors for UVM. Therefore, we integrated these features to construct a comprehensive nomogram model. The model has been demonstrated to have high predictive ability and net benefits in clinical practice, guiding physicians in the rational management of patients.

In summary, our work has several advantages compared to previous studies in computational pathology. Firstly, it addresses several key challenges: (1) It does not require annotation by pathologists but uses histopathological features for unsupervised clustering to identify important subregions within WSIs and perform subtyping analysis of tumor patients. (2) The deep-learning algorithm does not require pixel-level or patch-level annotations because it is trained using simply tumor type as a weak supervisory label. (3) By combining deep learning and multi-omics data, we provide a modern framework for understanding tumor heterogeneity and prognostication in UM. However, despite including samples from 318 patients from different countries and hospitals, future international multicenter and multiethnic datasets are desirable.

In addition, we acknowledge the limitations of our WSI (whole slide imaging) dataset, as the number of slides used in this study was relatively small, and the slide scanners employed were largely uniform. To address these limitations, we plan to conduct future studies with a larger WSI dataset collected from multiple scanners, aiming to investigate the influence of scanner variability and develop a more robust classifier suitable for clinical application. Furthermore, due to the loss of original tissue blocks, we were unable to perform additional immunohistochemical (IHC) staining to validate the identity of certain cells or explore specific inflammatory phenotypes. Another limitation lies in the absence of BAP1 mutation data in our WCH cohort, which prevented us from conducting analyses or predictions specifically related to BAP1 mutational status, despite its well-established role as a key driver of aggressive tumor behavior and metastatic potential in uveal melanoma. These constraints highlight the need for future investigations that incorporate a more comprehensive dataset, including molecular data and preserved tissue samples, to further validate and expand upon our findings.

In conclusion, our study reveals a potential close relationship between the histopathological morphology of tumors and their underlying molecular composition. By analyzing UVM histopathology images, high-performance automated diagnosis, subtyping, and prediction may be achievable, offering significant potential to improve UVM patient diagnosis, prognosis, and therapeutic strategies. Although our findings demonstrate the promise of these models in aiding clinical decision-making, further validation and integration into clinical workflows are required before they can directly guide individualized treatment plans and improve patient outcomes.

Data availability statement

The datasets presented in this article are not readily available due to restrictions from the West China Hospital data privacy policy. Requests to access the datasets should be directed to the corresponding author, and will be made available on reasonable request.

Ethics statement

The research protocol was reviewed and approved by the Ethics Committee of West China Hospital, Sichuan University (Approval No. 20242000).

Author contributions

QW: Writing – original draft. RW: Writing – review & editing. HY: Supervision, Writing – review & editing. JT: Data curation, Formal analysis, Writing – review & editing. YD: Conceptualization, Project administration, Writing – review & editing. KM: Conceptualization, Project administration, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Acknowledgments

We acknowledge that the Onekey AI platform provided code assistance for a portion of the study’s trials.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be constructed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2025.1585097/full#supplementary-material

Supplementary Figure 1 | The performance of the Inception-V3 deep learning model. (A) ROC curves. (B) Precision-recall curves. (C) Confusion matrices for the train dataset. (D) Confusion matrices for the validation dataset. (E) The specific metrics and detailed performance results of the model, including accuracy, precision, recall, F1-score, and AUC values.

Abbreviations

AUC, The area under the curve; CIC, Clinical impact curve; DL, Deep learning; DCA, decision curve analysis; ROCs, receiver operating characteristic curves (ROCs); TF-IDF, Term frequency-inverse document frequency; UVM, Uveal melanoma; WSI, Whole-slide image.

References

1. Aronow ME, Topham AK, and Singh AD. Uveal melanoma: 5-year update on incidence, treatment, and survival (SEER 1973-2013). Ocul Oncol Pathol. (2018) 4:145–51. doi: 10.1159/000480640

PubMed Abstract | Crossref Full Text | Google Scholar

2. Li Y, Shi J, Yang J, Ge S, Zhang J, Jia R, et al. Uveal melanoma: progress in molecular biology and therapeutics. Ther Adv Med Oncol. (2020) 12:1758835920965852. doi: 10.1177/1758835920965852

PubMed Abstract | Crossref Full Text | Google Scholar

3. Figueiredo A, Caissie AL, Callejo SA, McLean IW, Gold P, and Burnier MN Jr. Cyclooxygenase-2 expression in uveal melanoma: novel classification of mixed-cell-type tumours. Can J Ophthalmol. (2003) 38:352–6. doi: 10.1016/S0008-4182(03)80045-5

PubMed Abstract | Crossref Full Text | Google Scholar

4. Coupland SE, Lake SL, Zeschnigk M, and Damato BE. Molecular pathology of uveal melanoma. Eye (Lond). (2013) 27:230–42. doi: 10.1038/eye.2012.255

PubMed Abstract | Crossref Full Text | Google Scholar

5. Damato B. Ocular treatment of choroidal melanoma in relation to the prevention of metastatic death - A personal view. Prog Retin Eye Res. (2018) 66:187–99. doi: 10.1016/j.preteyeres.2018.03.004

PubMed Abstract | Crossref Full Text | Google Scholar

6. Rantala ES, Hernberg MM, Piperno-Neumann S, Grossniklaus HE, and Kivela TT. Metastatic uveal melanoma: The final frontier. Prog Retin Eye Res. (2022) 90:101041. doi: 10.1016/j.preteyeres.2022.101041

PubMed Abstract | Crossref Full Text | Google Scholar

7. Azevedo Tosta TA, de Faria PR, Neves LA, and do Nascimento MZ. Computational normalization of H&E-stained histological images: Progress, challenges and future potential. Artif Intell Med. (2019) 95:118–32. doi: 10.1016/j.artmed.2018.10.004

PubMed Abstract | Crossref Full Text | Google Scholar

8. Wang W, Zhao Y, Teng L, Yan J, Guo Y, Qiu Y, et al. Neuropathologist-level integrated classification of adult-type diffuse gliomas using deep learning from whole-slide pathological images. Nat Commun. (2023) 14:6359. doi: 10.1038/s41467-023-41195-9

PubMed Abstract | Crossref Full Text | Google Scholar

9. Lu MY, Chen TY, Williamson DFK, Zhao M, Shady M, Lipkova J, et al. AI-based pathology predicts origins for cancers of unknown primary. Nature. (2021) 594:106–10. doi: 10.1038/s41586-021-03512-4

PubMed Abstract | Crossref Full Text | Google Scholar

10. Bulten W, Kartasalo K, Chen PC, Strom P, Pinckaers H, Nagpal K, et al. Artificial intelligence for diagnosis and Gleason grading of prostate cancer: the PANDA challenge. Nat Med. (2022) 28:154–63. doi: 10.1038/s41591-021-01620-2

PubMed Abstract | Crossref Full Text | Google Scholar

11. Skrede OJ, De Raedt S, Kleppe A, Hveem TS, Liestol K, Maddison J, et al. Deep learning for prediction of colorectal cancer outcome: a discovery and validation study. Lancet. (2020) 395:350–60. doi: 10.1016/S0140-6736(19)32998-8

PubMed Abstract | Crossref Full Text | Google Scholar

12. Ehteshami Bejnordi B, Veta M, Johannes van Diest P, van Ginneken B, Karssemeijer N, Litjens G, et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. JAMA. (2017) 318:2199–210. doi: 10.1001/jama.2017.14585

PubMed Abstract | Crossref Full Text | Google Scholar

13. Wan Q, Ren X, Wei R, Yue S, Wang L, Yin H, et al. Deep learning classification of uveal melanoma based on histopathological images and identification of a novel indicator for prognosis of patients. Biol Proced Online. (2023) 25:15. doi: 10.1186/s12575-023-00207-0

PubMed Abstract | Crossref Full Text | Google Scholar

14. Chen X, Wang X, Zhang K, Fung KM, Thai TC, Moore K, et al. Recent advances and clinical applications of deep learning in medical image analysis. Med Image Anal. (2022) 79:102444. doi: 10.1016/j.media.2022.102444

PubMed Abstract | Crossref Full Text | Google Scholar

15. Chen D, Fu M, Chi L, Lin L, Cheng J, Xue W, et al. Prognostic and predictive value of a pathomics signature in gastric cancer. Nat Commun. (2022) 13:6903. doi: 10.1038/s41467-022-34703-w

PubMed Abstract | Crossref Full Text | Google Scholar

16. Classe M, Lerousseau M, Scoazec JY, and Deutsch E. Perspectives in pathomics in head and neck cancer. Curr Opin Oncol. (2021) 33:175–83. doi: 10.1097/CCO.0000000000000731

PubMed Abstract | Crossref Full Text | Google Scholar

17. Kang W, Qiu X, Luo Y, Luo J, Liu Y, Xi J, et al. Application of radiomics-based multiomics combinations in the tumor microenvironment and cancer prognosis. J Transl Med. (2023) 21:598. doi: 10.1186/s12967-023-04437-4

PubMed Abstract | Crossref Full Text | Google Scholar

18. Wang R, Dai W, Gong J, Huang M, Hu T, Li H, et al. Development of a novel combined nomogram model integrating deep learning-pathomics, radiomics and immunoscore to predict postoperative outcome of colorectal cancer lung metastasis patients. J Hematol Oncol. (2022) 15:11. doi: 10.1186/s13045-022-01225-3

PubMed Abstract | Crossref Full Text | Google Scholar

19. Holscher DL, Bouteldja N, Joodaki M, Russo ML, Lan YC, Sadr AV, et al. Next-Generation Morphometry for pathomics-data mining in histopathology. Nat Commun. (2023) 14:470. doi: 10.1038/s41467-023-36173-0

PubMed Abstract | Crossref Full Text | Google Scholar

20. Stirling DR, Carpenter AE, and Cimini BA. CellProfiler Analyst 3.0: accessible data exploration and machine learning for image analysis. Bioinformatics. (2021) 37:3992–4. doi: 10.1093/bioinformatics/btab634

PubMed Abstract | Crossref Full Text | Google Scholar

21. Sauerbrei W, Taube SE, McShane LM, Cavenagh MM, and Altman DG. Reporting recommendations for tumor marker prognostic studies (REMARK): an abridged explanation and elaboration. J Natl Cancer Inst. (2018) 110:803–11. doi: 10.1093/jnci/djy088

PubMed Abstract | Crossref Full Text | Google Scholar

22. Carpenter AE, Jones TR, Lamprecht MR, Clarke C, Kang IH, Friman O, et al. CellProfiler: image analysis software for identifying and quantifying cell phenotypes. Genome Biol. (2006) 7:R100. doi: 10.1186/gb-2006-7-10-r100

PubMed Abstract | Crossref Full Text | Google Scholar

23. Weisbart E, Tromans-Coia C, Diaz-Rohrer B, Stirling DR, Garcia-Fossa F, Senft RA, et al. CellProfiler plugins - An easy image analysis platform integration for containers and Python tools. J Microsc. (2024) 296(3):227-34. doi: 10.1111/jmi.13223

PubMed Abstract | Crossref Full Text | Google Scholar

24. Jeschke J, Bizet M, Desmedt C, Calonne E, Dedeurwaerder S, Garaud S, et al. DNA methylation-based immune response signature improves patient diagnosis in multiple cancers. J Clin Invest. (2017) 127:3090–102. doi: 10.1172/JCI91095

PubMed Abstract | Crossref Full Text | Google Scholar

25. Malta TM, Sokolov A, Gentles AJ, Burzykowski T, Poisson L, Weinstein JN, et al. Machine learning identifies stemness features associated with oncogenic dedifferentiation. Cell. (2018) 173:338–354 e315. doi: 10.1016/j.cell.2018.03.034

PubMed Abstract | Crossref Full Text | Google Scholar

26. Yoshihara K, Shahmoradgoli M, Martinez E, Vegesna R, Kim H, Torres-Garcia W, et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat Commun. (2013) 4:2612. doi: 10.1038/ncomms3612

PubMed Abstract | Crossref Full Text | Google Scholar

27. Zheng H, Liu H, Ge Y, and Wang X. Integrated single-cell and bulk RNA sequencing analysis identifies a cancer associated fibroblast-related signature for predicting prognosis and therapeutic responses in colorectal cancer. Cancer Cell Int. (2021) 21:552. doi: 10.1186/s12935-021-02252-9

PubMed Abstract | Crossref Full Text | Google Scholar

28. Bronkhorst IH and Jager MJ. Uveal melanoma: the inflammatory microenvironment. J Innate Immun. (2012) 4:454–62. doi: 10.1159/000334576

PubMed Abstract | Crossref Full Text | Google Scholar

29. Qin Y, Bollin K, de Macedo MP, Carapeto F, Kim KB, Roszik J, et al. Immune profiling of uveal melanoma identifies a potential signature associated with response to immunotherapy. J Immunother Cancer. (2020) 8. doi: 10.1136/jitc-2020-000960

PubMed Abstract | Crossref Full Text | Google Scholar

30. Narasimhaiah D, Legrand C, Damotte D, Remark R, Munda M, De Potter P, et al. DNA alteration-based classification of uveal melanoma gives better prognostic stratification than immune infiltration, which has a neutral effect in high-risk group. Cancer Med. (2019) 8:3036–46. doi: 10.1002/cam4.2122

PubMed Abstract | Crossref Full Text | Google Scholar

31. Tosi A, Cappellesso R, Dei Tos AP, Rossi V, Aliberti C, Pigozzo J, et al. The immune cell landscape of metastatic uveal melanoma correlates with overall survival. J Exp Clin Cancer Res. (2021) 40:154. doi: 10.1186/s13046-021-01947-1

PubMed Abstract | Crossref Full Text | Google Scholar

32. Wang Y, Xu Y, Dai X, Lin X, Shan Y, and Ye J. The prognostic landscape of adaptive immune resistance signatures and infiltrating immune cells in the tumor microenvironment of uveal melanoma. Exp Eye Res. (2020) 196:108069. doi: 10.1016/j.exer.2020.108069

PubMed Abstract | Crossref Full Text | Google Scholar

33. Mariani P, Torossian N, van Laere S, Vermeulen P, de Koning L, Roman-Roman S, et al. Immunohistochemical characterisation of the immune landscape in primary uveal melanoma and liver metastases. Br J Cancer. (2023) 129:772–81. doi: 10.1038/s41416-023-02331-w

PubMed Abstract | Crossref Full Text | Google Scholar

34. Singh L, Singh MK, Kenney MC, Jager MJ, Rizvi MA, Meel R, et al. Prognostic significance of PD-1/PD-L1 expression in uveal melanoma: correlation with tumor-infiltrating lymphocytes and clinicopathological parameters. Cancer Immunol Immunother. (2021) 70:1291–303. doi: 10.1007/s00262-020-02773-8

PubMed Abstract | Crossref Full Text | Google Scholar

35. Triozzi PL, Schoenfield L, Plesec T, Saunthararajah Y, Tubbs RR, and Singh AD. Molecular profiling of primary uveal melanomas with tumor-infiltrating lymphocytes. Oncoimmunology. (2019) 8:e947169. doi: 10.4161/21624011.2014.947169

PubMed Abstract | Crossref Full Text | Google Scholar

36. Yavuzyigitoglu S, Koopmans AE, Verdijk RM, Vaarwater J, Eussen B, van Bodegom A, et al. Rotterdam ocular melanoma study G: uveal melanomas with SF3B1 mutations: A distinct subclass associated with late-onset metastases. Ophthalmology. (2016) 123:1118–28. doi: 10.1016/j.ophtha.2016.01.023

PubMed Abstract | Crossref Full Text | Google Scholar

37. Martin M, Masshofer L, Temming P, Rahmann S, Metz C, Bornfeld N, et al. Exome sequencing identifies recurrent somatic mutations in EIF1AX and SF3B1 in uveal melanoma with disomy 3. Nat Genet. (2013) 45:933–6. doi: 10.1038/ng.2674

PubMed Abstract | Crossref Full Text | Google Scholar

38. Pan Z, Zhu H, Zhang Y, Liao Q, Sun Y, Wu E, et al. Development of uveal melanoma-specific aptamer for potential biomarker discovery and targeted drug delivery. Anal Chem. (2023) 95:5095–108. doi: 10.1021/acs.analchem.3c00005

PubMed Abstract | Crossref Full Text | Google Scholar

39. Fang J, Xiao L, Zhang Q, Peng Y, Wang Z, and Liu Y. Junction plakoglobin, a potential prognostic marker of oral squamous cell carcinoma, promotes proliferation, migration and invasion. J Oral Pathol Med. (2020) 49:30–8. doi: 10.1111/jop.12952

PubMed Abstract | Crossref Full Text | Google Scholar

40. Hu YD, Wu K, Liu YJ, Zhang Q, Shen H, Ji J, et al. LY6/PLAUR domain containing 3 (LYPD3) maintains melanoma cell stemness and mediates an immunosuppressive microenvironment. Biol Direct. (2023) 18:72. doi: 10.1186/s13062-023-00424-3

PubMed Abstract | Crossref Full Text | Google Scholar

41. Wang J and Liu G. Long noncoding RNA UFC1 acts as an oncogene via stimulating EZH2-induced inhibition of APC expression in renal cell carcinoma. Cell Mol Biol (Noisy-le-grand). (2023) 69:152–6. doi: 10.14715/cmb/2023.69.4.24

PubMed Abstract | Crossref Full Text | Google Scholar

42. Yu T, Shan TD, Li JY, Huang CZ, Wang SY, Ouyang H, et al. Knockdown of linc-UFC1 suppresses proliferation and induces apoptosis of colorectal cancer. Cell Death Dis. (2016) 7:e2228. doi: 10.1038/cddis.2016.124

PubMed Abstract | Crossref Full Text | Google Scholar

43. Zang X, Gu J, Zhang J, Shi H, Hou S, Xu X, et al. Exosome-transmitted lncRNA UFC1 promotes non-small-cell lung cancer progression by EZH2-mediated epigenetic silencing of PTEN expression. Cell Death Dis. (2020) 11:215. doi: 10.1038/s41419-020-2409-0

PubMed Abstract | Crossref Full Text | Google Scholar

44. Zhang X, Liang W, Liu J, Zang X, Gu J, Pan L, et al. Long non-coding RNA UFC1 promotes gastric cancer progression by regulating miR-498/Lin28b. J Exp Clin Cancer Res. (2018) 37:134. doi: 10.1186/s13046-018-0803-6

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: unsupervised learning, uveal melanoma, immune infiltration, pathomics features, deep learning

Citation: Wan Q, Wei R, Yin H, Tang J, Deng Y-p and Ma K (2025) Integrating pathomics and deep learning for subtyping uveal melanoma: identifying high-risk immune infiltration profiles. Front. Immunol. 16:1585097. doi: 10.3389/fimmu.2025.1585097

Received: 28 February 2025; Accepted: 24 June 2025;
Published: 09 July 2025.

Edited by:

Zodwa Dlamini, Pan African Cancer Research Institute (PACRI), South Africa

Reviewed by:

Lingzhang Meng, Guangxi Academy of Medical Sciences, China
Viktor Torgny Gill, Karolinska Institutet (KI), Sweden

Copyright © 2025 Wan, Wei, Yin, Tang, Deng and Ma. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ke Ma, ZHJtYWtlQHNjdS5lZHUuY24=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.