Hybrid model for predicting microsatellite instability in colorectal cancer using hematoxylin & eosin-stained images and clinical features

Wei, Hangping; Zhang, Xiaowei; Zhou, Zhen; Xie, Jianbin; Han, Weidong; Dong, Xiaofang

doi:10.3389/fonc.2025.1580195

ORIGINAL RESEARCH article

Front. Oncol., 23 June 2025

Sec. Gastrointestinal Cancers: Colorectal Cancer

Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1580195

This article is part of the Research Topic: Advances in Medical Imaging for Precision Diagnostic and Therapeutic Applications in Digestive Diseases

Hangping Wei¹

Xiaowei Zhang²

Zhen Zhou³

Jianbin Xie⁴

Weidong Han⁵*

Xiaofang Dong¹*

¹Department of Medical Oncology, Dongyang Hospital Affiliated to Wenzhou Medical University, Dongyang, Zhejiang, China
²Department of Pathology, Dongyang Hospital Affiliated to Wenzhou Medical University, Dongyang, Zhejiang, China
³Zhuoyue Honors College, Hangzhou Dianzi University, Hangzhou, Zhejiang, China
⁴Department of Respiratory Medicine, Dongyang Hospital Affiliated to Wenzhou Medical University, Dongyang, Zhejiang, China
⁵Department of Colorectal Medicine, Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Hangzhou, Zhejiang, China

Background: Microsatellite instability (MSI) is a crucial molecular phenotype in colorectal cancer (CRC), which aids in determining treatment strategies and predicting prognosis. However, existing prediction methods have limitations and are not universally applicable to all patient populations. Consequently, we proposed a hybrid prediction model that integrates pathological and clinical features to predict MSI.

Materials and methods: This study encompassed two patient cohorts: The Cancer Genome Atlas cohort (TCGA set, n = 559), which was divided into training and internal validation subsets at a ratio of 7:3, and the Dongyang CRC cohort (Dongyang set, n = 123), serving as an external testing cohort. Two deep learning approaches—semi-supervised and fully-supervised—were employed to extract features from pathological images. Subsequently, the pathomic signatures derived from these approaches were integrated with clinical features to develop a hybrid model. The hybrid model was assessed using an external validation cohort to determine the area under the curve (AUC). Furthermore, to investigate genes associated with MSI, we performed enrichment analysis and constructed a protein-protein interaction (PPI) network using mRNA sequencing data obtained from the TCGA database.

Results: The fully-supervised pathological model demonstrated promising performance, achieving an AUC of 0.928 in the internal validation cohort, compared to the semi-supervised pathological model’s AUC of 0.786. In the external testing cohort, the model attained an AUC of 0.811. Subsequently, a hybrid model was established, which achieved an AUC of 0.949 in the validation cohort and a robust AUC of 0.862 in the test cohort. Additionally, a nomogram was developed to enhance its clinical applicability. Gene Ontology (GO) analysis identified differentially expressed genes (DEGs) related to MSI status, which were enriched in humoral immune response, among other pathways. Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Set Enrichment Analysis (GSEA) revealed enrichment in pathways such as rheumatoid arthritis. A PPI network identified key hub genes, including IFNG and CD8A.

Conclusion: The fully-supervised model consistently outperformed the semi-supervised model in predicting MSI. Furthermore, the hybrid model, which combines pathological and clinical features, demonstrated strong predictive ability.

Introduction

CRC is the third most prevalent cancer worldwide and the second leading cause of cancer-related deaths (1, 2), with over 1.9 million new cases and one million deaths reported in 2020 (3). Microsatellites are short, repetitive DNA sequences, one to four base pairs in length, found throughout the genome. Their repetitive nature makes them prone to replication errors, which are typically corrected by mismatch repair (MMR) systems (4, 5). Mutations, deletions, or methylations affecting the MMR gene lead to the loss or impairment of its function, resulting in deficient mismatch repair (dMMR), a critical mechanism underlying MSI (6). MSI is a distinct mechanism that contributes to tumorigenesis in 10% of CRC cases and is a hallmark of hereditary Lynch syndrome-associated cancers (7). Identifying MSI is crucial for CRC management, as it significantly affects diagnosis, prognosis, and treatment planning (8). Patients with microsatellite instability-high (MSI-H) are a favorable group that can benefit substantially from immunotherapy for solid tumors (9). This highlights the critical role of MSI in advanced solid tumors.

Several diagnostic methods are commonly used to detect dMMR or MSI in clinical settings, including immunohistochemistry (IHC) to identify MMR protein deficiencies and molecular tests such as polymerase chain reaction (PCR) or next-generation sequencing (NGS) (10). IHC testing requires optimal experimental conditions and skilled pathologists, along with access to tumor tissue (four proteins need to be tested), which can sometimes be insufficient. PCR or NGS testing requires specialized infrastructure that may not be universally available in hospitals, often leading to longer turnaround times or higher costs (11). Given these challenges, MSI testing is not universally applicable across all patient populations. Therefore, developing a universally accessible MSI testing method is imperative.

In routine clinical workflows, the diagnosis of CRC typically involves the histopathological evaluation of hematoxylin and eosin (H&E)-stained tissue slides, which can now be digitized into whole-slide images (WSIs) (12–14). WSIs offer comprehensive insights into the spatial organization of tumors, enabling examination at both low and high magnifications (9). Recent technological advancements, particularly in deep learning (DL), have revolutionized medical applications. DL-based algorithms are increasingly utilized in pathomics to enhance the accuracy and efficiency of disease diagnosis and prediction. These include tasks such as tumor diagnosis, subtyping, grading, staging, prognosis prediction, identification of pathological features, biomarkers, and genetic changes (15, 16). The integration of artificial intelligence (AI) with WSI analysis holds promise for improving diagnostic capabilities in CRC and other cancers, potentially overcoming some of the limitations associated with traditional testing methods such as IHC, PCR, and NGS.

Although DL prediction of MSI has been extensively studied (9, 10, 17), previous research has primarily focused on predicting MSI using only pathological images, neglecting the integration of clinical patient characteristics. To address this, a hybrid prediction model that combines pathology slide data with clinical data was developed for predicting MSI in CRC. Both fully-supervised and semi-supervised learning methodologies were employed in DL pathological analysis. This comparative study evaluated the predictive advantages of both approaches, thereby enhancing the accuracy and robustness of the model and offering a scalable solution for predicting MSI status. The workflow of this study is illustrated in Figure 1.

Figure 1

Figure 1. Workflow of the study for predicting MSI in CRC.

Materials and methods

Patient cohort

For this study, WSIs from two large cohorts were collected, and each WSI was assigned an MSI label based on the MSI status of the patient. The first cohort (TCGA set, n = 559) was downloaded from the TCGA database (https://portal.gdc.cancer.gov/). Some samples were excluded from the analysis because critical clinical data were missing or incomplete. Pathological images and clinical features, including gender, age, tumor site, histological grade, histological type, TNM stage, and vascular invasion, were also downloaded from the TCGA database and organized in R using packages such as string, maftools, and XML. The MSI status was determined based on the results from NGS analysis, as reported in the original research articles (18, 19). In this cohort, 483 and 76 cases were labeled as MSI-L/MSS (microsatellite instability-low/microsatellite stable) and MSI-H, respectively. The second cohort (Dongyang set), collected from Dongyang Hospital Affiliated to Wenzhou Medical University, comprised 123 formalin-fixed paraffin-embedded sections from patients diagnosed with CRC across all stages (between October 2021 and June 2023). H&E-stained images were digitized with Pannoramic MIDI II scanners (3DHISTECH, Hungary) using a 20× objective and saved as .mrxs files. Through genetic testing of postoperative paraffin specimens, 105 and 18 cases were identified as MSI-L/MSS and MSI-H, respectively.

For model training, the TCGA dataset was divided into training and internal validation subsets in a 7:3 ratio. The training subset was used for hyperparameter tuning through cross-validation, while the internal validation subset was employed to evaluate the generalization performance. Additionally, an external testing dataset was incorporated to evaluate the generalizability of the model. A flowchart outlining the cohorts used in this study is shown in Figure 2.

Figure 2

Figure 2. Flowchart of data collection and filtering for CRC patients from the two studies.

Regions of interest delineation and image preprocessing

WSIs were digitized at 20× magnification with a pixel resolution of approximately 0.5 μm/pixel. The fully-supervised DL approach involved manual annotation of cancer regions of interest (ROIs) by experienced pathologists using QuPath v0.5.1, with annotations subsequently reviewed for accuracy. In contrast, the semi-supervised DL approach does not require manual annotation.

To enhance computational efficiency, algorithms based on Python were employed to automatically crop H&E-stained WSIs, each typically encompassing approximately 100,000 × 100,000 pixels, into smaller 512 × 512 pixel patches. Concurrently, patches with background areas exceeding 80% white, as well as those containing blurry artifacts or pen marks, were excluded from further analysis. All selected patches were normalized using the Macenko method to standardize color variations resulting from the staining procedures (20).
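The tiling and background-filtering step can be sketched as follows. This is a minimal illustration on an in-memory RGB array, not the authors' pipeline: the whiteness cut-off `WHITE_LEVEL`, the helper names, and the use of non-overlapping tiles are assumptions, and blur or pen-mark detection and Macenko normalization are omitted.

```python
import numpy as np

PATCH = 512           # patch edge length in pixels, as stated in the text
WHITE_LEVEL = 220     # assumed: a pixel counts as white if R, G and B all exceed this
MAX_WHITE_FRAC = 0.8  # patches with more than 80% white pixels are dropped


def white_fraction(patch: np.ndarray) -> float:
    """Fraction of pixels whose R, G and B values all exceed WHITE_LEVEL."""
    return float(np.all(patch > WHITE_LEVEL, axis=-1).mean())


def tile_image(image: np.ndarray):
    """Yield (top, left, patch) for non-overlapping patches that pass the filter."""
    height, width = image.shape[:2]
    for top in range(0, height - PATCH + 1, PATCH):
        for left in range(0, width - PATCH + 1, PATCH):
            patch = image[top:top + PATCH, left:left + PATCH]
            if white_fraction(patch) <= MAX_WHITE_FRAC:
                yield top, left, patch


# Toy "slide": white background with one darker tissue block in the top-left tile.
slide = np.full((1024, 1024, 3), 255, dtype=np.uint8)
slide[0:512, 0:512] = 120
kept = [(top, left) for top, left, _ in tile_image(slide)]
```

On a real WSI the array would be read region by region (for example through OpenSlide) rather than loaded whole, since a 100,000 × 100,000 pixel image does not fit comfortably in memory.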

After normalization, these patches were used as inputs for the DL model. In cases where a slide produced more than 1,000 patches, a random selection of 1,000 patches was utilized for subsequent experiments to effectively manage computational resources. Throughout all experiments, only patient-level labels were applied, ensuring that all patches within the training sets inherited the labels of their respective parent patients. This method maintained consistency and integrity throughout the training process.
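The per-slide cap and the label inheritance described above can be illustrated with a short sketch. The function name, the fixed random seed, and the integer patch identifiers are hypothetical choices made for the example.

```python
import random

MAX_PATCHES = 1000  # per-slide cap stated in the text


def sample_patches(patch_ids, patient_label, seed=0):
    """Keep at most MAX_PATCHES randomly chosen patches and attach the patient label."""
    rng = random.Random(seed)  # fixed seed is an assumption, used for reproducibility
    patch_ids = list(patch_ids)
    if len(patch_ids) > MAX_PATCHES:
        patch_ids = rng.sample(patch_ids, MAX_PATCHES)  # sampling without replacement
    # Every patch inherits the label of its parent patient (1 = MSI-H, 0 = MSS/MSI-L).
    return [(pid, patient_label) for pid in patch_ids]


large = sample_patches(range(2500), patient_label=1)
small = sample_patches(range(300), patient_label=0)
```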

Patch-level prediction

Our DL framework employs a dual prediction strategy: patch-level prediction and a multi-instance learning approach to integrate features from the WSI. During the training phase, the patches were assigned MSI labels based on the patient’s overall MSI status, which functioned as the training labels.

Semi-supervised vs. fully-supervised: Two distinct patch selection methodologies were analyzed: a semi-supervised approach that utilizes the entire WSI and a fully-supervised method that specifically targets annotated tumor regions. Although the modeling processes for both methods are similar, the key difference lies in the selection of patches.

Data augmentation: To harmonize the intensity distribution across the RGB channels, Z-score normalization was applied to the images, preparing the data for input into the model. During the training phase, online data augmentation techniques, including random cropping and horizontal and vertical flipping, were employed. However, for testing patches, the processing was limited to normalization only.
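A minimal NumPy sketch of this preprocessing is given below. Whether the Z-score statistics were computed per image or over the whole dataset is not stated in the text, so per-image statistics are assumed here; the crop size of 448 pixels and the flip probability of 0.5 are likewise illustrative assumptions.

```python
import numpy as np


def zscore_per_channel(patch: np.ndarray) -> np.ndarray:
    """Normalize each RGB channel to zero mean and unit variance."""
    patch = patch.astype(np.float32)
    mean = patch.mean(axis=(0, 1), keepdims=True)
    std = patch.std(axis=(0, 1), keepdims=True)
    return (patch - mean) / (std + 1e-8)  # epsilon guards against flat channels


def augment(patch: np.ndarray, rng, crop: int = 448) -> np.ndarray:
    """Random crop plus random horizontal and vertical flips (training only)."""
    height, width = patch.shape[:2]
    top = rng.integers(0, height - crop + 1)
    left = rng.integers(0, width - crop + 1)
    out = patch[top:top + crop, left:left + crop]
    if rng.random() < 0.5:
        out = out[:, ::-1]  # horizontal flip
    if rng.random() < 0.5:
        out = out[::-1, :]  # vertical flip
    return out


rng = np.random.default_rng(0)
raw = rng.integers(0, 256, size=(512, 512, 3)).astype(np.uint8)
train_input = zscore_per_channel(augment(raw, rng))  # training: augment, then normalize
test_input = zscore_per_channel(raw)                 # testing: normalization only
```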

Model training: In this study, several renowned networks, including ResNet18, ResNet50, and DenseNet121, were analyzed to enhance the performance of traditional convolutional neural network (CNN)-based models at the patch level. Comparative evaluations were conducted on these models to identify the most effective algorithm tailored to our specific objectives. The details of the training process are outlined in Supplementary 1A.

Patient level prediction

Multi-Instance learning based feature fusion: Upon training the DL model, the prediction phase began. During this phase, labels and corresponding probabilities were assigned to all patches. These patch probabilities were subsequently aggregated using a classifier to extract features at the WSI level. Two unique methodologies were developed to synthesize these patch probabilities, as detailed in Supplementary 1B.

1. Patch likelihood histogram pipeline: This method employs a histogram to illustrate the distribution of patch likelihoods throughout a WSI. The histogram effectively captures the entire spectrum of likelihoods, providing a detailed depiction of the WSI.

2. Bag of words pipeline: By integrating histogram- and vocabulary-based concepts, this method applies term frequency-inverse document frequency (TF-IDF) mapping to each patch. The resulting TF-IDF feature vector encapsulates the characteristics of the WSI.

The two pipelines facilitate the effective integration of patch-level predictions into comprehensive WSI-level features, making them suitable for advanced analyses such as metastasis prediction and survival analysis.
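The two aggregation pipelines can be sketched as follows. The details are given in the supplementary material, so this is only one plausible reading: patch probabilities are assumed to be rounded to two decimals, which yields 101 possible "words" (0.00 to 1.00), and a smoothed IDF is assumed for the TF-IDF weighting. All function names are hypothetical.

```python
import numpy as np

N_WORDS = 101  # assumed vocabulary: probabilities rounded to two decimals, 0.00..1.00


def to_words(probs):
    """Map each patch probability to an integer word index in 0..100."""
    return np.rint(np.asarray(probs) * 100).astype(int)


def histogram_features(probs):
    """Patch likelihood histogram: share of a slide's patches falling on each word."""
    counts = np.bincount(to_words(probs), minlength=N_WORDS)
    return counts / counts.sum()


def tfidf_features(slides):
    """Bag-of-words TF-IDF; each slide is a list of patch probabilities."""
    counts = np.stack([np.bincount(to_words(p), minlength=N_WORDS) for p in slides])
    tf = counts / counts.sum(axis=1, keepdims=True)        # term frequency per slide
    doc_freq = (counts > 0).sum(axis=0)                    # slides containing each word
    idf = np.log((1 + len(slides)) / (1 + doc_freq)) + 1   # smoothed inverse document frequency
    return tf * idf


# Two toy slides with three and four patches.
slides = [[0.10, 0.10, 0.90], [0.10, 0.50, 0.50, 0.50]]
hist = histogram_features(slides[0])
tfidf = tfidf_features(slides)
```

In this toy case the word for probability 0.90 occurs in only one slide, so its IDF weight is larger than that of the word for 0.10, which occurs in both.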

Feature selection: A total of 206 features were aggregated in this study using multi-instance learning through two distinct processes, each contributing 101 probability features and 2 predictive label features. To refine this feature set, a correlation-based selection method was applied, retaining only one feature from any pair with a Pearson’s correlation coefficient exceeding 0.9. Consequently, our feature set was reduced to two distinct features, which were subsequently used in the development of two machine-learning algorithms: SVM and ExtraTrees.
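A correlation filter of this kind can be sketched in a few lines. The greedy order (keep the first feature of each correlated pair) and the use of the absolute correlation are assumptions; the text only states the 0.9 threshold.

```python
import numpy as np


def drop_correlated(X: np.ndarray, threshold: float = 0.9):
    """Return indices of columns kept after removing one of each highly correlated pair."""
    corr = np.abs(np.corrcoef(X, rowvar=False))  # absolute Pearson correlation matrix
    keep = []
    for j in range(X.shape[1]):
        # Keep column j only if it is not too correlated with any column already kept.
        if all(corr[j, k] <= threshold for k in keep):
            keep.append(j)
    return keep


rng = np.random.default_rng(0)
a = rng.normal(size=200)
b = rng.normal(size=200)
# Column 1 is almost a rescaled copy of column 0; column 2 is independent.
X = np.column_stack([a, 2 * a + 0.01 * rng.normal(size=200), b])
kept = drop_correlated(X)
```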

Model building

Pathology model: Patch-level predictions, probability histograms, and TF-IDF features were synthesized in this study to construct detailed patient profiles. These comprehensive features served as the primary input for developing a specialized machine-learning algorithm tailored for MSI.

Clinical model: Mirroring the pathology model, a machine learning model for MSI analysis, which focuses on clinical features, was used. The model generated predictions that were particularly attuned to these clinical characteristics.

Combined model: To identify significant predictors, we conducted both univariate and multivariate analyses on these features. Features with a p-value of less than 0.05 from the multivariate analysis were integrated into the pathology model, resulting in a combined model. To enhance its clinical applicability and improve its interpretability and usability in clinical settings, this combined model was visualized using a nomogram.

Metrics: To evaluate the discriminative capabilities of all models in classifying the three types of pathologies, both macro- and micro-AUC metrics were used. These metrics provide a comprehensive assessment of the effectiveness of the algorithmic models in distinguishing between different pathology types.

Exploring biologic functions

The mRNA sequencing data (TCGA-COAD, n = 429) were extracted from the TCGA database, and the MSI status of the relevant patients was identified from the original research articles (18, 19). Differential analysis between MSS/MSI-L and MSI-H groups was conducted using the “limma” package, with a preset threshold of |log2FC| > 1 and a p-value < 0.05. The “clusterProfiler” package was utilized for GO and KEGG enrichment analysis on DEGs. Furthermore, GSEA on all genes was performed to visualize the primary activation pathways of the three enrichment analyses. To delve deeper into the molecular mechanisms, the PPI network of DEGs was constructed using Cytoscape software, and the cytoHubba plugin was employed to select the top 10 core proteins based on their degree values within the PPI network.
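The DEG threshold and the degree-based hub ranking can be illustrated in plain Python. The study itself used limma, Cytoscape, and cytoHubba; the functions below are a simplified stand-in, and the gene names, fold changes, p-values, and edges are hypothetical toy values, not results from the study.

```python
def select_degs(results, lfc_cutoff=1.0, p_cutoff=0.05):
    """Keep genes with |log2FC| > lfc_cutoff and p-value < p_cutoff."""
    return [gene for gene, (log2fc, p_value) in results.items()
            if abs(log2fc) > lfc_cutoff and p_value < p_cutoff]


def top_hubs(edges, k=10):
    """Rank nodes of an undirected PPI network by degree and return the top k."""
    degree = {}
    for u, v in edges:
        degree[u] = degree.get(u, 0) + 1
        degree[v] = degree.get(v, 0) + 1
    # Sort by decreasing degree; ties are broken alphabetically for determinism.
    return sorted(degree, key=lambda gene: (-degree[gene], gene))[:k]


# Hypothetical values for illustration only.
toy_results = {
    "GENE_A": (2.3, 0.001),   # passes both thresholds
    "GENE_B": (0.4, 0.001),   # fold change too small
    "GENE_C": (-1.8, 0.20),   # not significant
    "GENE_D": (-1.5, 0.01),   # passes both thresholds (down-regulated)
}
toy_edges = [("GENE_A", "GENE_D"), ("GENE_A", "GENE_E"),
             ("GENE_D", "GENE_E"), ("GENE_A", "GENE_F")]
degs = select_degs(toy_results)
hubs = top_hubs(toy_edges, k=2)
```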

Statistical analysis

The Shapiro–Wilk test was used to assess the normality of the clinical feature distribution within these cohorts, followed by t-tests or chi-squared (χ²) tests, as appropriate, for a more in-depth analysis of the clinical features. The analysis was conducted using Python version 3.7.12, which incorporates a suite of specialized packages including Pandas 1.2.4 for data manipulation, NumPy 1.20.2 for numerical operations, PyTorch 1.8.0 for deep learning tasks, Onekey 3.1.3 for streamlined processing, OpenSlide 1.2.0 for handling whole slide images, SciPy 1.7.3 for scientific computing, Scikit-learn 1.0.2 for machine learning algorithms, and Slideflow 2.1.0 for pathology image analysis. All tests were two-sided, and p < 0.05 indicated statistical significance.
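The test-selection logic can be sketched with SciPy as follows. The text does not say which test was applied to continuous variables that fail the normality check, so the Mann-Whitney U fallback is an assumption, as are the function names; the data are synthetic.

```python
import numpy as np
from scipy import stats


def compare_continuous(x, y, alpha=0.05):
    """Check normality with Shapiro-Wilk, then compare the two groups."""
    _, p_x = stats.shapiro(x)
    _, p_y = stats.shapiro(y)
    if p_x > alpha and p_y > alpha:
        _, p_value = stats.ttest_ind(x, y)  # two-sided by default
        return "t-test", float(p_value)
    # Assumed fallback: the source does not name the test used for non-normal data.
    _, p_value = stats.mannwhitneyu(x, y, alternative="two-sided")
    return "mann-whitney", float(p_value)


def compare_categorical(table):
    """Chi-squared test of independence on a contingency table of counts."""
    chi2_stat, p_value, dof, _ = stats.chi2_contingency(np.asarray(table))
    return float(chi2_stat), float(p_value), int(dof)


# Synthetic values for illustration only; not data from the study.
rng = np.random.default_rng(0)
age_group_0 = rng.normal(65, 12, size=80)
age_group_1 = rng.normal(70, 12, size=40)
test_name, p_age = compare_continuous(age_group_0, age_group_1)
chi2_stat, p_site, dof = compare_categorical([[30, 70], [25, 15]])
```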

Results

To optimize the model’s hyperparameters, a 5-fold cross-validation approach, in conjunction with the GridSearch algorithm, was applied to 70% of the dataset designated as the training set. After identifying the optimal hyperparameters, the entire training set was utilized to train the final model.
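The split-then-tune procedure can be sketched with scikit-learn. The parameter grid, the stratified split, the AUC scoring choice, and the synthetic data are assumptions made for illustration; the actual grid is not given in the text. With the default `refit=True`, `GridSearchCV` retrains the best configuration on the entire training set, which corresponds to the final step described above.

```python
import numpy as np
from sklearn.ensemble import ExtraTreesClassifier
from sklearn.model_selection import GridSearchCV, StratifiedKFold, train_test_split

# Synthetic, imbalanced toy data standing in for patient-level features.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = (X[:, 0] + 0.5 * rng.normal(size=200) > 1.0).astype(int)

# 7:3 split into training and internal validation subsets (stratification assumed).
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Hypothetical grid; 5-fold cross-validation on the training subset only.
param_grid = {"n_estimators": [50, 100], "max_depth": [3, None]}
search = GridSearchCV(
    ExtraTreesClassifier(random_state=0),
    param_grid,
    scoring="roc_auc",
    cv=StratifiedKFold(n_splits=5, shuffle=True, random_state=0),
)
search.fit(X_train, y_train)

final_model = search.best_estimator_                  # refit on the whole training set
val_scores = final_model.predict_proba(X_val)[:, 1]   # scores for the held-out subset
```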

Clinical features

Univariable and multivariable analyses: To identify significant clinical predictors of MSI, we conducted a univariate analysis on all clinical features, calculating the odds ratio (OR) and associated p-value for each feature. Features such as age, gender, tumor site, histological grade, and type, as well as N, M, and TNM stage, were statistically significant (p < 0.05). Consequently, gender, tumor site, histological grade, and histological type were selected for inclusion in the combined model through multivariable analysis. The baseline characteristics of the patients in the two cohorts are presented in Table 1. Univariable and multivariable analyses of the clinical features for predicting MSI are presented in Table 2. Univariable and multivariable analyses of the OR for the clinical features are shown in Figure 3.
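For a single binary predictor, the univariable odds ratio and its Wald confidence interval can be computed directly from a 2 × 2 table, which should agree with a univariable logistic regression on the same predictor. The sketch below uses hypothetical counts and a hypothetical function name; the multivariable analysis would require fitting a full logistic regression model and is not shown.

```python
import math


def odds_ratio(a, b, c, d, z=1.96):
    """Odds ratio and Wald 95% CI for a 2x2 table.

    a: exposed and MSI-H        b: exposed and MSS/MSI-L
    c: unexposed and MSI-H      d: unexposed and MSS/MSI-L
    """
    estimate = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # standard error of log(OR)
    lower = math.exp(math.log(estimate) - z * se_log_or)
    upper = math.exp(math.log(estimate) + z * se_log_or)
    return estimate, lower, upper


# Hypothetical counts for illustration only; not data from the study.
estimate, lower, upper = odds_ratio(30, 70, 10, 90)
```

An interval that excludes 1 corresponds to p < 0.05 for the Wald test of that predictor.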

Table 1

Table 1. Baseline clinical characteristics of patients in the two cohorts.

Table 2

Table 2. Univariate and multivariate analyses of clinical features for MSI prediction.

Figure 3

Figure 3. Odds Ratios of Clinical Features in Univariable Analyses (above) and Multivariable Analyses (below).

Patch level prediction

Patch level efficiency

Here, we illustrate the process using a fully supervised approach as an example. During the validation phase, the ResNet18 model demonstrated a moderate ability to differentiate between classes, achieving an AUC of 0.763. Additionally, ResNet18 outperformed all other models in the test cohort. Although its AUC was lower during the training phase (AUC = 0.945), ResNet18 was selected for further analysis due to its relative robustness and effectiveness in the validation phase, outperforming both ResNet50 and DenseNet121. The detailed performance metrics, including accuracy and AUC, are summarized in Table 3.

Table 3

Table 3. Metrics for training, validation, and test cohorts in MSI prediction at the patch level using fully-supervised methods.

A detailed description of the semi-supervised methods is listed in Supplementary 1C.

Grad-CAM visualization

The Grad-CAM method allows for the generation of activation maps without altering the existing model architecture or necessitating additional training. As depicted in Figure 4, Grad-CAM is utilized to visualize the activations within the final convolutional layer, which is responsible for predicting the MSI. By making this layer transparent, Grad-CAM emphasizes the areas of the input image that are most significant in the model’s decision-making process. This technique provides valuable insights into the model’s reasoning behind its predictions, without requiring complex changes to the architecture or retraining.
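The core Grad-CAM computation can be written compactly once the activations of the final convolutional layer and the gradients of the class score with respect to them are available. The NumPy sketch below assumes both arrays have already been extracted from the network (for example with framework hooks) and uses a hypothetical function name; it is an illustration of the standard formulation rather than the code used in the study.

```python
import numpy as np


def grad_cam(activations: np.ndarray, gradients: np.ndarray) -> np.ndarray:
    """Grad-CAM heat map from (channels, H, W) activations and gradients."""
    weights = gradients.mean(axis=(1, 2))             # global-average-pooled gradients
    cam = np.tensordot(weights, activations, axes=1)  # weighted sum over channels -> (H, W)
    cam = np.maximum(cam, 0)                          # ReLU keeps positive evidence only
    if cam.max() > 0:
        cam = cam / cam.max()                         # scale to [0, 1] for display
    return cam


# Toy example: channel 0 supports the class (positive gradient), channel 1 opposes it.
activations = np.array([[[1.0, 0.0], [0.0, 0.0]],
                        [[0.0, 0.0], [0.0, 1.0]]])
gradients = np.array([[[1.0, 1.0], [1.0, 1.0]],
                      [[-1.0, -1.0], [-1.0, -1.0]]])
heatmap = grad_cam(activations, gradients)
```

The resulting map is then upsampled to the patch size and overlaid on the tile image.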

Figure 4

Figure 4. Visualizations for a single patient, comprising a tile image and its corresponding heat map. In the heat maps, regions highlighted in red signify areas of higher weight, as indicated by the color bar on the right side of the figure.

Patient level prediction

Our study incorporated data from 682 patients, each characterized by binary outcomes (0 or 1), with the objective of predicting these outcomes using features aggregated through multi-instance learning. A total of 206 features were compiled using this approach. To streamline this feature set, we applied a correlation-based selection technique, retaining only one feature from each pair with a Pearson correlation coefficient exceeding 0.9. The refined feature set was subsequently used in various machine-learning models for further analysis. To effectively visualize these features, we employed a t-distributed stochastic neighbor-embedding (t-SNE) algorithm. The results of the visualization process are depicted in Figure 5.

Figure 5

Figure 5. Visualization of patient-level features using t-distributed stochastic neighbor-embedding (t-SNE) after Pearson’s correlation analysis, comparing fully-supervised (left) with semi-supervised (right). 0: MSS/MSI-L; 1: MSI-H.

Metrics: Here, we similarly illustrate the process using a fully supervised approach as an example. In the context of multi-instance learning, the ExtraTrees model demonstrated superior performance with an AUC of 0.928 in the validation cohort. This performance notably exceeded that of the SVM model, which had an AUC of 0.876. Although both the ExtraTrees and SVM models experienced a decline in their validation performance, the AUC remained relatively higher for ExtraTrees than for SVM (0.811 vs. 0.789) in the test cohort. The metrics for the training, validation, and test cohorts for predicting MSI using the fully-supervised pathomics model are presented in Table 4.

Table 4

Table 4. Metrics for the training, validation, and test cohorts in MSI prediction using the fully-supervised pathomics model.

Information on semi-supervised methods is provided in Supplementary 1D.

To further explain the pathological prediction model, we conducted an analysis of feature importance in multi-instance learning under both fully-supervised and semi-supervised approaches, as depicted in Figure 6.

Figure 6

Figure 6. Analysis of feature importance in multi-instance learning under fully-supervised (above) and semi-supervised (below) approaches.

Signature comparison

In our analysis, the best-performing models were selected from the validation cohort for both the clinical and pathological frameworks. For the pathological model (fully-supervised), ExtraTrees was selected due to its superior performance, whereas, given its effectiveness, SVM was chosen for the pathological model (semi-supervised) and the clinical model. Additional details of the clinical model are provided in Supplementary 1E.

Furthermore, the combined model was included in this comparison to evaluate its performance relative to the individual clinical and pathological models. The “combined” model demonstrated superior performance across different cohorts, achieving the highest AUC of 0.996 in the training cohort and an AUC of 0.949 in the validation cohort, indicating exceptional predictive accuracy. Although there was a slight decrease in the test cohort, the “combined” model still maintained a robust AUC of 0.862, significantly outperforming the standalone “Clinical,” “PathSemi,” and “PathFull” models in the same setting (refer to Table 5, Figure 7).

Table 5

Table 5. Performance of MSI Prediction Using Individual and Combined Models.

Figure 7

Figure 7. Variations in Area Under the Receiver Operating Characteristic Curve (AUROC) across all cohorts.

The “combined” model has proven to be a comprehensive and reliable predictive tool. Its strong performance across training, validation, and test cohorts highlights its robustness for clinical use, offering greater generalizability and reliability than single-source models due to the integration of diverse data sources and learning strategies.

Calibration curve: The Hosmer–Lemeshow (HL) test is a standard check of the calibration of predictive models, reflecting how well predicted probabilities align with actual outcomes. A larger HL p-value (equivalently, a smaller HL chi-squared statistic) indicates closer agreement between predictions and observed results, with p > 0.05 conventionally read as no significant miscalibration. In our analysis, the nomogram model demonstrated excellent calibration performance across all cohorts, with HL test values of 0.498, 0.425, and 0.193 for the training, validation, and test cohorts, respectively.
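For reference, the HL computation can be sketched as follows, returning both the chi-squared statistic and its p-value. The choice of ten equal-sized risk groups and of groups − 2 degrees of freedom follows common convention and is an assumption here; the data are synthetic and well calibrated by construction.

```python
import numpy as np
from scipy.stats import chi2


def hosmer_lemeshow(y_true, y_prob, groups=10):
    """Hosmer-Lemeshow chi-squared statistic and p-value over equal-sized risk groups."""
    y_true = np.asarray(y_true, dtype=float)
    y_prob = np.asarray(y_prob, dtype=float)
    order = np.argsort(y_prob)                   # sort patients by predicted risk
    stat = 0.0
    for idx in np.array_split(order, groups):
        n = len(idx)
        observed = y_true[idx].sum()             # observed events in the group
        expected = y_prob[idx].sum()             # expected events in the group
        mean_p = expected / n
        stat += (observed - expected) ** 2 / (n * mean_p * (1 - mean_p))
    p_value = chi2.sf(stat, groups - 2)          # upper tail of the chi-squared distribution
    return float(stat), float(p_value)


# Synthetic example: outcomes are drawn from the predicted probabilities themselves.
rng = np.random.default_rng(0)
y_prob = rng.uniform(0.05, 0.95, size=500)
y_true = (rng.random(500) < y_prob).astype(int)
hl_stat, hl_p = hosmer_lemeshow(y_true, y_prob)
```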

Clinical use

Decision curve analysis (DCA): Figure 8 displays the DCA curves for the training, validation, and testing sets. The outcomes highlight the significant advantages of our fusion model in terms of predicted probabilities. In comparison to other models, our fusion model exhibited a greater potential for achieving net benefit. Furthermore, a nomogram was created to improve clinical applicability, as depicted in Figure 9.
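The quantity plotted in a decision curve is the net benefit at each threshold probability, which weighs true positives against false positives using the odds of the threshold. A minimal sketch with hypothetical function names and toy data:

```python
import numpy as np


def net_benefit(y_true, y_prob, threshold):
    """Net benefit of acting on patients whose predicted risk is at least the threshold."""
    y_true = np.asarray(y_true).astype(bool)
    flagged = np.asarray(y_prob) >= threshold
    n = len(y_true)
    true_pos = np.sum(flagged & y_true)
    false_pos = np.sum(flagged & ~y_true)
    return float(true_pos / n - false_pos / n * threshold / (1 - threshold))


def net_benefit_treat_all(y_true, threshold):
    """Reference strategy: treat every patient as positive."""
    prevalence = float(np.mean(y_true))
    return prevalence - (1 - prevalence) * threshold / (1 - threshold)


# Toy data for illustration only.
y_true = np.array([1, 1, 0, 0, 0, 0, 1, 0, 0, 0])
y_prob = np.array([0.9, 0.8, 0.7, 0.2, 0.1, 0.1, 0.6, 0.3, 0.2, 0.1])
nb_model = net_benefit(y_true, y_prob, 0.5)
nb_all = net_benefit_treat_all(y_true, 0.5)
```

A decision curve repeats this calculation over a range of thresholds; a model is preferable wherever its curve lies above both the treat-all line and the treat-none line (net benefit of zero).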

Figure 8

Figure 8. Decision curves for various signatures across all cohorts.

Figure 9

Figure 9. Nomogram prediction model for MSI status.

Biologic functions associated with MSI status

GO analysis results indicated that the DEGs were primarily enriched in biological processes (BP) related to humoral immune response and regulation of lymphocyte activation, cellular components (CC) such as the apical plasma membrane and MHC protein, as well as molecular functions (MF) including cytokine activity and peptidase inhibitor activity (Figure 10). KEGG and GSEA pathway analyses revealed that these genes were predominantly enriched in signaling pathways associated with rheumatoid arthritis, inflammatory bowel disease, and systemic lupus erythematosus (Figures 10, 11). Upon examining a predicted PPI network, the top 10 hub genes, including IFNG, CD8A, IL1B, and CCL5, were identified. These genes are pivotal within the network (Figure 11).

Figure 10

Figure 10. GO/KEGG enrichment analysis correlated with MSI status using RNA sequencing data from the TCGA dataset.

Figure 11

Figure 11. GSEA enrichment and PPI network analysis correlated with MSI status based on RNA sequencing data from the TCGA dataset.

Discussion

MSI is a tumor molecular phenotype resulting from the loss of function in MMR proteins due to deleterious germline mutations, epigenetic inactivation, or somatic biallelic mutations (21). Seminal studies such as KEYNOTE-177 have demonstrated that, compared to chemotherapy, immune checkpoint inhibitors (ICIs) can lead to better outcomes in patients with dMMR/MSI-H CRC and that this molecular subtype is closely linked to prognosis (22–24). Recent research has expanded the scope of MSI testing to include treatment decision-making and prognosis across various cancer types (25–27). Although universal screening of CRC patients for MSI status is now recommended, it presents challenges, such as increased workload for pathologists, delays in therapeutic decisions, significant cost increases, and the inability to perform testing in the absence of tissue samples (10). DL offers the potential to streamline MSI testing and expedite decision-making by oncologists in clinical practice. A hybrid model that can be used in clinical practice to predict MSI status was proposed in this study. The primary objectives of our study were: (a) to assess the differences in predictive performance between semi-supervised and fully-supervised DL methods using pathological images; (b) to build and verify a hybrid model to predict MSI based on pathological images and clinical features; and (c) to conduct a pilot study to identify MSI-associated differentially expressed genes.

Given the critical importance of MSI, researchers have explored the use of DL models to predict MSI status from pathological images. Cao et al. demonstrated that a pathomics-based DL model could effectively predict MSI from histopathological images, indicating its generalizability to new patient cohorts (9). Schrammen et al. proposed a slide-level assessment model that uses a single neural network to detect tumors and predict genetic changes directly from standard pathology slides, with AUC of 0.909 for predicting MSI. This approach reduces labor costs by automating the exclusion of normal and uninformative tissue regions (28). Subsequently, Chang et al. developed a method that integrated the CNN model INSIGHT with the self-attention model WiseMSI to predict MSI in CRC (17). Despite achieving an AUC of approximately 0.95 through extensive training with a large sample size, the model did not incorporate the individual clinical characteristics of the patients.

Despite the application of various DL techniques to predict MSI status, which has led to continuous enhancements in the AUC, no research has yet compared semi-supervised and fully-supervised DL methodologies. Consequently, this study conducted a comparative analysis of different DL approaches for pathological omics, and subsequently integrated clinical-specific omics to develop a hybrid model. The findings suggest that fully-supervised pathological models are more effective in predicting MSI, hinting at a potential correlation between MSI status and specific tumor tissue characteristics. Although the study utilized a small sample size, it achieved an AUC of nearly 0.95 in the internal validation cohort and 0.86 in the external test cohort. Recently, French researchers have developed MSIntuit, a clinically approved AI-based pre-screening tool for detecting MSI from H&E-stained slides, with a sensitivity of 0.96–0.98 and a specificity of 0.47–0.46 (10). This tool could serve as an optimal screening method, potentially excluding nearly half of the non-MSI-H population and reducing clinical expenses. However, its clinical utility is somewhat restricted due to its non-diagnostic nature.

MSI-H CRC presents with distinct clinical characteristics. Gelsomino et al. reported a correlation between MSI-H CRC and proximal location, predominantly early-stage diagnosis (particularly stage II), poor differentiation, mucinous histology, and BRAF mutations (29). Nakayama et al. concluded that patients with sporadic MSI-H are older, have right-sided colon tumors that are poorly differentiated or mucinous, and exhibit worse overall survival compared to those with Lynch syndrome (30). Our findings indicated that MSI-H is common among women with poorly differentiated adenocarcinoma in the right colon, particularly those with special types, such as mucinous adenocarcinoma, which corroborates the previous findings.

Furthermore, MSI-related DEGs were commonly associated with signaling pathways implicated in rheumatoid arthritis, inflammatory bowel disease, and systemic lupus erythematosus. Treatment with ICIs has been effective in autoimmune vasculitis, necessitating further investigation into the underlying mechanisms (31). Subsequently, 10 hub genes were identified, including IFNG, CD8A, IL1B, and CCL5. The expression of IFNG, a marker of effector function, is higher in MSI-H gastric cancer than in MSS gastric cancer (32). IL1B, a gene involved in the COX-2/PGE2 pathway, is associated with immune-related adverse events (irAEs) following immune checkpoint blockade (33). These insights provide new directions for immunotherapy and the management of irAEs in CRC.

This study encompassed two cohorts with an acceptable sample size; however, it lacked validation data from multiple centers with larger sample sizes. Consequently, further optimization using multicenter datasets with larger sample sizes across all stages is essential to enhance accuracy and generalizability. Moreover, the exploration of differential genes in this study did not include subsequent validation experiments (such as IHC or knockdown). Future research is needed to further confirm their value.

Conclusion

In summary, a fully-supervised pathological model outperformed a semi-supervised pathological model in predicting MSI. Furthermore, a hybrid model was developed that employs deep learning algorithms to integrate pathological features with clinical data. By leveraging the complementary strengths of both data types, this model showed strong predictive performance in both the internal validation and external test cohorts.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Institutional Ethical Review Board of Dongyang Hospital, Affiliated to Wenzhou Medical University (registration number: 2024-YX-039). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

HW: Writing – original draft, Data curation. XZ: Data curation, Writing – original draft. ZZ: Formal analysis, Methodology, Writing – original draft. JX: Investigation, Writing – original draft. WH: Supervision, Writing – review & editing, Resources. XD: Resources, Supervision, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the Key Project of the Jinhua Science and Technology Bureau (Grant No. 2024-3-115).

Acknowledgments

The authors thank the patients who kindly agreed to provide the data for this study. We express our gratitude to TCGA for providing the pathological images and other data of CRC.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2025.1580195/full#supplementary-material

References

1. Xi Y and Xu P. Global colorectal cancer burden in 2020 and projections to 2040. Transl Oncol. (2021) 14:101174. doi: 10.1016/j.tranon.2021.101174

2. Patel SG, Karlitz JJ, Yen T, Lieu CH, and Boland CR. The rising tide of early-onset colorectal cancer: a comprehensive review of epidemiology, clinical features, biology, risk factors, prevention, and early detection. Lancet Gastroenterol Hepatol. (2022) 7:262–74. doi: 10.1016/S2468-1253(21)00426-X

3. Morgan E, Arnold M, Gini A, Lorenzoni V, Cabasag CJ, Laversanne M, et al. Global burden of colorectal cancer in 2020 and 2040: incidence and mortality estimates from GLOBOCAN. Gut. (2023) 72:338–44. doi: 10.1136/gutjnl-2022-327736

4. De' Angelis GL, Bottarelli L, Azzoni C, De' Angelis N, Leandro G, Di Mario F, et al. Microsatellite instability in colorectal cancer. Acta BioMed. (2018) 89:97–101.

5. Ismael NE, El Sheikh SA, Talaat SM, and Salem EM. Mismatch repair proteins and microsatellite instability in colorectal carcinoma (MLH1, MSH2, MSH6 and PMS2): histopathological and immunohistochemical study. Open Access Maced J Med Sci. (2017) 5:9–13. doi: 10.3889/oamjms.2017.003

6. Marmol I, Sanchez-de-Diego C, Pradilla Dieste A, Cerrada E, and Rodriguez Yoldi MJ. Colorectal carcinoma: A general overview and future perspectives in colorectal cancer. Int J Mol Sci. (2017) 18:197. doi: 10.3390/ijms18010197

7. Ashouri K, Wong A, Mittal P, Torres-Gonzalez L, Lo JH, Soni S, et al. Exploring predictive and prognostic biomarkers in colorectal cancer: A comprehensive review. Cancers (Basel). (2024) 16:2796. doi: 10.3390/cancers16162796

8. Vilar E and Gruber SB. Microsatellite instability in colorectal cancer-the stable evidence. Nat Rev Clin Oncol. (2010) 7:153–62. doi: 10.1038/nrclinonc.2009.237

9. Cao R, Yang F, Ma SC, Liu L, Zhao Y, Li Y, et al. Development and interpretation of a pathomics-based model for the prediction of microsatellite instability in Colorectal Cancer. Theranostics. (2020) 10:11080–91. doi: 10.7150/thno.49864

10. Saillard C, Dubois R, Tchita O, Loiseau N, Garcia T, Adriansen A, et al. Validation of MSIntuit as an AI-based pre-screening tool for MSI detection from colorectal cancer histology slides. Nat Commun. (2023) 14:6695. doi: 10.1038/s41467-023-42453-6

11. Luchini C, Bibeau F, Ligtenberg MJL, Singh N, Nottegar A, Bosse T, et al. ESMO recommendations on microsatellite instability testing for immunotherapy in cancer, and its relationship with PD-1/PD-L1 expression and tumour mutational burden: a systematic review-based approach. Ann Oncol. (2019) 30:1232–43. doi: 10.1093/annonc/mdz116

12. Bera K, Schalper KA, Rimm DL, Velcheti V, and Madabhushi A. Artificial intelligence in digital pathology - new tools for diagnosis and precision oncology. Nat Rev Clin Oncol. (2019) 16:703–15. doi: 10.1038/s41571-019-0252-y

13. Zarella MD, McClintock DS, Batra H, Gullapalli RR, Valante M, Tan VO, et al. Artificial intelligence and digital pathology: clinical promise and deployment considerations. J Med Imaging (Bellingham). (2023) 10:051802. doi: 10.1117/1.JMI.10.5.051802

14. Huss R, Raffler J, and Markl B. Artificial intelligence and digital biomarker in precision pathology guiding immune therapy selection and precision oncology. Cancer Rep (Hoboken). (2023) 6:e1796. doi: 10.1002/cnr2.1796

15. Oh JH, Kim HG, and Lee KM. Developing and evaluating deep learning algorithms for object detection: key points for achieving superior model performance. Korean J Radiol. (2023) 24:698–714. doi: 10.3348/kjr.2022.0765

16. Zhao Y, Zhang J, Hu D, Qu H, Tian Y, and Cui X. Application of deep learning in histopathology images of breast cancer: A review. Micromachines (Basel). (2022) 13:2197. doi: 10.3390/mi13122197

17. Chang X, Wang J, Zhang G, Yang M, Xi Y, Xi C, et al. Predicting colorectal cancer microsatellite instability with a self-attention-enabled convolutional neural network. Cell Rep Med. (2023) 4:100914. doi: 10.1016/j.xcrm.2022.100914

18. Cancer Genome Atlas N. Comprehensive molecular characterization of human colon and rectal cancer. Nature. (2012) 487:330–7. doi: 10.1038/nature11252

19. Liu Y, Sethi NS, Hinoue T, Schneider BG, Cherniack AD, Sanchez-Vega F, et al. Comparative molecular analysis of gastrointestinal adenocarcinomas. Cancer Cell. (2018) 33:721–735 e728. doi: 10.1016/j.ccell.2018.03.010

20. Macenko M, Niethammer M, Marron JS, Borland D, and Thomas NE. (2009). A method for normalizing histology slides for quantitative analysis, in: Proceedings of the 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, Boston, MA, USA, June 28 - July 1, 2009.

21. Wilbur HC, Le DT, and Agarwal P. Immunotherapy of MSI cancer: facts and hopes. Clin Cancer Res. (2024) 30:1438–47. doi: 10.1158/1078-0432.CCR-21-1935

22. Bhamidipati D and Subbiah V. Tumor-agnostic drug development in dMMR/MSI-H solid tumors. Trends Cancer. (2023) 9:828–39. doi: 10.1016/j.trecan.2023.07.002

23. Andre T, Shiu KK, Kim TW, Jensen BV, Jensen LH, Punt C, et al. Pembrolizumab in microsatellite-Instability-High advanced colorectal cancer. N Engl J Med. (2020) 383:2207–18. doi: 10.1056/NEJMoa2017699

24. Le DT, Diaz LA Jr., Kim TW, Van Cutsem E, Geva R, Jager D, et al. Pembrolizumab for previously treated, microsatellite instability-high/mismatch repair-deficient advanced colorectal cancer: final analysis of KEYNOTE-164. Eur J Cancer. (2023) 186:185–95. doi: 10.1016/j.ejca.2023.02.016

25. Wu H, Ma W, Jiang C, Li N, Xu X, Ding Y, et al. Heterogeneity and adjuvant therapeutic approaches in MSI-H/dMMR resectable gastric cancer: emerging trends in immunotherapy. Ann Surg Oncol. (2023) 30:8572–87. doi: 10.1245/s10434-023-14103-0

26. Pacholczak-Madej R, Bartoletti M, Musacchio L, Puskulluoglu M, Blecharz P, and Lorusso D. Immunotherapy in MMR-d/MSI-H recurrent/metastatic endometrial cancer. Expert Rev Anticancer Ther. (2024) 24:717–29. doi: 10.1080/14737140.2024.2367472

27. Zhou KI, Hanks BA, and Strickler JH. Management of microsatellite instability high (MSI-H) gastroesophageal adenocarcinoma. J Gastrointest Cancer. (2024) 55:483–96. doi: 10.1007/s12029-023-01003-5

28. Schrammen PL, Ghaffari Laleh N, Echle A, Truhn D, Schulz V, Brinker TJ, et al. Weakly supervised annotation-free cancer detection and prediction of genotype in routine histopathology. J Pathol. (2022) 256:50–60. doi: 10.1002/path.v256.1

29. Gelsomino F, Barbolini M, Spallanzani A, Pugliese G, and Cascinu S. The evolving role of microsatellite instability in colorectal cancer: A review. Cancer Treat Rev. (2016) 51:19–26. doi: 10.1016/j.ctrv.2016.10.005

30. Nakayama Y, Iijima T, Inokuchi T, Kojika E, Takao M, Takao A, et al. Clinicopathological features of sporadic MSI colorectal cancer and Lynch syndrome: a single-center retrospective cohort study. Int J Clin Oncol. (2021) 26:1881–9. doi: 10.1007/s10147-021-01968-y

31. Ramos A, Del Carmen M, and Yeku O. PD-1 inhibitor therapy in a patient with preexisting P-ANCA vasculitis: A case report and review of the literature. Case Rep Oncol Med. (2020) 2020:3428945. doi: 10.1155/2020/3428945

32. Han DS, Kwak Y, Lee S, Nam SK, Kong SH, Park DJ, et al. Effector function characteristics of exhausted CD8+ T-cell in microsatellite stable and unstable gastric cancer. Cancer Res Treat. (2024) 56:1146–63. doi: 10.4143/crt.2024.317

33. Chen S, McMiller TL, Soni A, Succaria F, Sidhom JW, Cappelli LC, et al. Comparing anti-tumor and anti-self immunity in a patient with melanoma receiving immune checkpoint blockade. J Transl Med. (2024) 22:241. doi: 10.1186/s12967-024-04973-7

Keywords: colorectal cancer, pathomics, prediction model, microsatellite instability, deep learning

Citation: Wei H, Zhang X, Zhou Z, Xie J, Han W and Dong X (2025) Hybrid model for predicting microsatellite instability in colorectal cancer using hematoxylin & eosin-stained images and clinical features. Front. Oncol. 15:1580195. doi: 10.3389/fonc.2025.1580195

Received: 20 February 2025; Accepted: 27 May 2025;
Published: 23 June 2025.

Edited by:

Fu Shen, Naval Medical University, China

Reviewed by:

Zhen Liu, Zhejiang University, China
Yitai Xiao, Sun Yat-sen University Cancer Center (SYSUCC), China

Copyright © 2025 Wei, Zhang, Zhou, Xie, Han and Dong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Weidong Han, hanwd@zjcc.org.cn; Xiaofang Dong, dongxiaofang2022@163.com
