A scoping review of TSR analysis in colorectal cancer: implications for automated solutions

Dikland, Felix Anne; Fekih, Cyrine; Wellenstein, Marius René Jacques; Souza da Silva, Ricella; Machado-Neves, Raquel; Fraga, João; Oliveira, Domingos; Montezuma, Diana; Pinto, Isabel Macedo; Woodburn, Jonathan

doi:10.3389/or.2025.1605383

SYSTEMATIC REVIEW article

Oncol. Rev., 28 October 2025

Sec. Oncology Reviews: Reviews

Volume 19 - 2025 | https://doi.org/10.3389/or.2025.1605383

A scoping review of TSR analysis in colorectal cancer: implications for automated solutions

Felix Anne Dikland¹*^†

Cyrine Fekih¹^†

Marius René Jacques Wellenstein²

Ricella Souza da Silva³

Raquel Machado-Neves⁴

João Fraga⁵

Domingos Oliveira⁵

Diana Montezuma^5,6

Isabel Macedo Pinto⁷

Jonathan Woodburn¹

¹Department of Research and Development, WSK Medical, Amsterdam, Netherlands
²WSK Medical, Amsterdam, Netherlands
³IPATIMUP Diagnostics, IPATIMUP-Institute of Molecular Pathology and Immunology of Porto University, Porto, Portugal
⁴Department of Pathologic Anatomy, Hospital de Matosinhos, Unidade Local de Saúde de Matosinhos, Matosinhos, Portugal
⁵Research and Development Unit, IMP Diagnostics, Porto, Portugal
⁶Cancer Biology & Epigenetics Group, Research Center of IPO Porto (CI-IPOP), Portuguese Oncology Institute of Porto (IPO Porto), Porto Comprehensive Cancer Center Raquel Seruca (Porto.CCC), Porto, Portugal
⁷IMP Diagnostics, Porto, Portugal

The tumour-stroma ratio (TSR), which refers to the composition of stromal tissue and tumour epithelium of a malignant lesion, is gaining recognition as a promising biomarker in pathology. In 2018, recommendations for quantifying TSR in colorectal carcinoma were published, yet diverse quantification methods are still in use today. To assess the prognostic value of TSR, evaluate the impact of scoring variations, and explore efforts to automate TSR quantification, a scoping review was conducted. A total of 950 articles were identified through PubMed and Scopus, of which 76 met the inclusion criteria for this review. Of these, 56 employed manual scoring methods, while 20 utilised semi-automated or fully automated TSR quantification techniques. The TSR has been consistently identified as a strong prognostic indicator for disease-free survival. Its association with poor prognosis may be linked to its correlation with metastatic status, perineural invasion, and vascular invasion in stroma-high lesions. Variability in TSR scoring protocols was most evident in the selection of the region of interest and the type of histological specimen, both of which had a direct impact on final TSR scores. Moreover, significant inter-observer variability was observed in manual semi-quantitative TSR assessments, with Kappa scores ranging from 0.42 to 0.88. Automated TSR scoring pipelines have been proposed to standardise scoring protocols and reduce inter-observer variability. Deep learning models have demonstrated promising results, with pixel-wise and patch-wise accuracies exceeding 95%. Even though deep learning approaches have shown high performance, discrepancies remain, as evidenced by Kappa scores ranging from 0.239 to 0.472. In conclusion, the variation in TSR scoring protocols, along with a wide range of inter-observer variability, limits the broader clinical application of TSR. While automated TSR quantification methods show promise, they are still in the early stages, particularly in relation to region of interest selection and stratifying patients into risk categories. As these methods evolve, adjustments to TSR scoring cut-off values may be necessary to improve consistency. This scoping review highlights the prognostic significance of TSR in colorectal carcinoma while emphasizing the challenges posed by variability in scoring methods and the need for further advancements in automated quantification.

1 Introduction

Staging of cancer is crucial for predicting a patients prognosis and developing a treatment plan. In colorectal cancer (CRC) staging is performed according to the American Joint Committee on Cancer (AJCC) TNM system (1). Besides the TNM staging, there are additional CRC histological features that hold prognostic relevance for CRC patients (2–4). Histological characteristics that are currently considered to be reported in routine diagnostics as core elements are: histologic type and grade; presence of perforation; distance to surgical margins; lymphovascular and perineural invasion; tumour budding; tumour deposits, and treatment response. There are other histology factors associated with prognosis that are not yet included in the recommendations for routine diagnostics, such as the tumour growth pattern and immune response (5).

An important biomarker that has gained increasing attention in recent years is the tumour-stroma ratio (TSR), which refers to the composition of stromal tissue and tumour epithelium of a malignant lesion (6). Studies have suggested that a high stromal content is associated with a worse patient prognosis, as the stroma can promote tumour progression and possibly increase resistance to treatment (7–10). The prognostic value of TSR has been demonstrated not only for CRC, but also for other cancer types, namely, breast, oesophageal and lung (11–15).

Despite the wide support of the prognostic power of TSR, it has not yet been implemented in routine diagnostics. However, it has been reported that the TNM Evaluation Committee and the CAP have acknowledged its potential for integration in the TNM staging system (16,17). Moreover, a large prospective multicentre European study has recently validated TSR as an independent prognosticator for disease-free survival (DFS) in stage II-III colon cancer (CC) patients (18). As such, TSR is an emerging and promising histological biomarker with the potential to serve as a reliable prognostic indicator in CRC.

Notwithstanding the emerging prognostic significance of TSR in CRC, several challenges can be identified. The absence of a universally accepted methodology for TSR assessment is the greatest hurdle. In 2018 a study by van Pelt et al. on the procedure and recommendations of TSR scoring was released, which the majority of recent studies have adhered to (9). Nonetheless, there is still a variety in scoring methods concerning the region of interest (ROI) and histological specimen type (19–21). This variability complicates comparisons across studies and limits the reproducibility of findings. Compounding this issue is the challenge of intra-tumoural heterogeneity, which causes TSR scores to differ across ROIs within a single slide. Protocols that use the highest stroma-containing field as the decisive TSR score, mitigate this effect by focusing on the region with maximal stromal content. Anyhow, some degree of variability may still arise, for instance if the most stroma-rich region is not represented on the available slides. This is particularly true when assessing TSR in biopsies. Also, the semi-quantitative nature of TSR scoring makes it inherently subjective.

Even though the score is relatively simple to perform, it requires the careful identification of the ROI and accurate evaluation of the TSR within the constraints set by Van Pelt et al. (9). The impact of these challenges is particularly evident when developing computational models to perform this task, as variations in slide selection, ROI identification, and subjective interpretations introduce significant variability in the data. This variability can complicate the training and validation of algorithms, potentially limiting their accuracy and generalisability across diverse clinical settings. On the other hand, machine learning (ML) tools themselves can also be the solution, as these tools have the potential to standardise TSR scoring by automating the process and defining ROIs consistently, minimising subjective biases and ultimately enhancing reproducibility and efficiency.

Intended and unintended deviations from the protocol inevitably lead to a lowered reliability of the TSR score and might hamper the adoption as a diagnostic tool in clinical practice. Despite promising evidence of high inter-observer variability prior to publishing of the protocol by Van Pelt et al., there has not been an overview of observer variability scores since the introduction of said protocol. Moreover, it is uncertain to what extend deviations from the TSR scoring protocol influence the TSR score. The effects of protocol changes to the final TSR score must be mapped in detail to identify shortcomings of existing automated scoring pipelines. Mediators that cause stroma-high lesions to have a poor prognosis need further investigation. To explore these concerns and provide a robust understanding of the protocol for development of new automated quantification methods, a scoping review was conducted.

The research questions of TSR in CRC were articulated as follows: 1) “What is the prognostic value of TSR, and what are its possible mediators?”; 2) “How do TSR scoring protocols differ, and how do these scoring variations influence the final TSR score?”; and 3) “How reliable is manual TSR scoring, and how well-developed are current automated solutions?”.

2 Methods

The study adhered to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guideline for scoping reviews (PRISMA-ScR) (22). A systematic literature search was performed using the Scopus and PubMed medical databases. Standardisation of TSR scoring in CRC was proposed by Van Pelt et al. in 2018; therefore only studies published from 2018 onwards were included in this review. Relevant papers were identified using the following query, performed on 21st of February 2025: “ (“TSR” OR “tumo*r stroma”) AND (“Colorectal” OR “CRC” OR “colon*” OR “rectal”)”. Studies were excluded if they met any of the following criteria. The article: 1) was not written in English; 2) was a conference paper, abstract-only publication, case study, letter to editor, comment, study protocol or preprint, 3) did not report a stroma content score, or did not correlate it to staging or prognosis, 4) was conducted on animal models or in-vitro, 5) did not include CRC-diagnosed subjects. Exclusion was performed by two independent observers. Disagreement was resolved by team discussion.

Included reviews were thoroughly investigated for general concepts and knowledge gaps. Methodological data extraction was performed on original works only, structured around four main topics: 1) the prognostic significance of TSR and its possible mediators causing worse prognosis, 2) variability in TSR scoring protocols, 3) inter- and intra-observer variability in TSR assessment, and 4) automation of TSR scoring.

These topics led to the creation of a data-charting form developed by two reviewers that was updated iteratively. Data was extracted by a single investigator and reviewed by a second. For investigation of prognostic value, conclusions of studies correlating TSR to survival outcomes, TNM staging, and local infiltration were charted. Variability in the scoring protocol was investigated by describing individual factors of the scoring method that influence the final TSR. This included ROI location, ROI size, lens magnification, histological specimen type, mode of automation and a mathematical representation of the scoring protocol. For observer variability assessment, the agreement metric is reported as well as the TSR evaluation task, which can either be ROI selection and TSR estimation combined or TSR estimation alone in a given ROI. Charted information on automated TSR evaluation include the algorithm type, its mode of automation, the ROI used for scoring, its tissue identification performance metrics and its agreement with a manual scoring process. Quantitative data is structured in tables and qualitative data is presented in a narrative format.

3 Results

A total of 411 papers were identified through the PubMed search. The Scopus search identified 539 articles, of which 172 were unique. This resulted in a total of 583 studies for initial screening. Based on title and abstract review, 50 papers were excluded. Following full-text assessment, an additional 457 articles were excluded, leading to the final inclusion of 76 papers, as seen in Figure 1 and Supplementary Table S1. Of the 76 included articles 56 adopted a manual approach to evaluate the TSR score. Of the remaining 20 articles, 5 adopted fully automated solutions, 14 used semi-automated solutions and a single paper did both. The percentage of stroma-high subjects included in a study range from 2% to 86% with a median of 37%.

Figure 1

Flowchart illustrating the identification and inclusion process for studies via databases and registers. Initially, 950 records were identified; 367 duplicate records were removed. After screening 583 records, 50 were excluded. Out of 533 reports sought, none were unretrieved. Upon assessment, 457 reports were excluded for various reasons, including not being TSR-focused. Ultimately, 76 studies were included in the review.

Figure 1. A flowchart showing the exclusion from the Pubmed and Scopus database.

3.1 Prognostic value of TSR

3.1.1 As an individual biomarker

The correlation between TSR and DFS and OS was generally found to be significant. Twenty-nine studies have analysed DFS as primary or secondary outcome and 25 of these, including three meta-analyses, demonstrated a significantly worse prognosis for patients with high stroma content, as seen in Table 1. Five studies did not report a significant association with TSR (21,23–26). The association between TSR and OS followed an equivalent trend. Among the 35 studies investigating OS, 26 studies, including the aforementioned meta-analyses, demonstrated a significantly higher mortality risk in patients with high stroma content. Additionally, ten studies, including the recent prospective multicentre UNITED study, did not reach statistical significance but still reported a trend towards worse prognosis in high-stroma tumours (18). It should be noted that the UNITED study was specifically powered for 3-year DFS with a 5-year OS as a secondary outcome. Three articles did not find a trend of worse prognosis with high stroma. These studies had a dataset with a very low amount of stroma-high subjects, or investigated only stage I or IV subjects (26–28).

Table 1

Table 1. Studies investigating the association of stroma high lesions with OS and DFS. Significant correlations are denoted with “yes”, and non-significant correlations are denoted with “no”.

The association of TSR with T and N stage was also appraised. Out of 29 studies that mention the T-status stratified by TSR, eleven studies showed a significantly higher T-status for subjects with stroma-high lesions. Of the 23 studies mentioning the N-status, 14 showed a significant correlation of stroma-high lesions with positive lymph node status, as seen in Table 2. Additionally, high stroma seems to be correlated with distant metastases. All eight studies investigating the association of M-status or distant metastasis free survival with TSR found a significantly higher rate of distant metastases in subjects with stroma-high lesions (29–36).

Table 2

Table 2. Studies investigating the association of stroma high lesions with increased T-status and positive N-status. Significant correlations are denoted with “yes”, and non-significant correlations are denoted with “no”.

Multiple studies have also shown an association of high stroma content with perineural invasion and vascular invasion. The correlation with lymphatic invasion and lymphovascular invasion, when considered as a combined parameter, appears more debatable, as seen in Table 3. The tumour budding score, which is also an independent prognostic factor for poor prognosis, is often reported to be associated with stroma high tumours (23,28,31–33,35,37–39).

Table 3

Table 3. Studies that investigated the association of stroma content in lesions with invasion of local microstructures. If a higher stroma content is significantly related to a higher invasion rate, this is denoted with “yes”. If this relation is not found it is denoted with “no”.

Eleven studies investigated therapy resistance. Of these, six focused on neoadjuvant treatment. Four studies investigated the effectiveness of (chemo)radiotherapy and found that tumours that are classified as stroma-high in preoperative biopsies exhibit less tumour regression on the surgical specimen (37,40–42). Li et al. and Yim et al., were unable to replicate these results (19,20). Adjuvant chemotherapy resistance is reported by Strous et al., who found that DFS significantly improved with treatment in stroma-low subjects, but not for stroma-high subjects (10). The UNITED trial suggested chemo-resistance as a possible explanation for significantly worse DFS in stroma-high subjects despite treatment with adjuvant chemotherapy (18). Additional studies have investigated the added benefit of supplementing chemotherapy with Bevacizumab, with mixed results (43,44). Ravensbergen et al. stated that immune checkpoint inhibitor therapy effectiveness cannot be predicted from TSR alone (45).

3.1.2 As a composite score

By visual estimation, machine learning or transcriptomics, TSR can also be combined with tumour immune micro-environment status to create a composite score. Some studies suggested that combining immune scores with TSR provides a superior prognostic value for DFS and OS, compared to using TSR or immune scores alone (46,47). Ravensbergen et al. reported that this combined biomarker could predict the effectiveness of immune checkpoint inhibitor therapy (45). These results led to the creation of the Glasgow micro-environment score by combining the Klintrup-Mäkinen grade with TSR scores, as seen in Supplementary Table S2. This combined score stratifies subjects into three risk categories each associated with progressively worse prognosis (29,48).

3.2 Scoring methods and variation

3.2.1 Tumour stroma quantification and TSR cut-off value

A variety of definitions for stromal content are reported in the literature. Numerous studies denominating their stroma evaluation as “TSR”, make use of different definitions and formulas. These methodological variations are summarised in Table 4.

Table 4

Table 4. Different definitions of tumour stroma quantification used in the literature.

For quantification of TSR, tissues other than stroma and tumour are usually excluded from the calculation. In contrast, in tumour proportion (TP) and stroma proportion (SP) these tissues are included (35,49). An example of this tissue inclusion discrepancy is found in the evaluation of smooth muscle. In the TSR evaluation, it is excluded, whereas in TP and SP smooth muscle fibres may be present within the field of view (FOV) (9,40). Another two studies considered lumen and mucin as part of the tumour (34,50). Of note is that, despite the recommendations to exclude smooth muscle from TSR scoring in the 2018 guidelines, none of the included studies have used immunohistochemistry to exclude muscle fibres. This means, in practical terms, only visible bundles of muscularis are excluded from eye-scored TSR estimation, ignoring single remaining muscle fibres and cells (9).

Regarding the cut-off value, based on the recommendations proposed by Van Pelt et al., a tumour is classified as stroma-high if the stromal percentage exceeds 50% and as stroma-low if it is 50% or less (9). The cut-off value of 50% was chosen as it provided maximum discriminative power to distinguish prognostic groups (6). Other studies used self-determined cut-off values. One widely used approach is receiver operating characteristic (ROC) curve analysis, often combined with additional metrics such as Youden Index to find the cut-off point with optimal sensitivity and specificity for predicting prognosis (21,30,33,35,51). Another approach used, is to divide TSR into categories, such as quartiles or quintiles, and then choose a cut-off to dichotomise TSR into two prognostic risk groups (46,52). Alternatively, some studies applied other statistical methods, including maximally selected rank statistics to determine an optimal cut-off for predicting OS or using the median TSR value as cut-off (17,53,54). Cut-off values determined using the previously described methods ranged from 40% to 65.5%.

3.2.2 Influence of specimen type and ROI characteristics

On surgical specimens, the TSR should be quantified on the deepest invasion slide (9). On preoperative biopsies it is not possible to choose the deepest invasion slide, and the size and shape of the specimen might make it impossible to have tumour epithelium in all four cardinal directions. In several studies this has led to the choice of using a smaller hotspot or to quantify TSR in the whole slide area (19,20,40,55). Carvalho et al. showed, using automated TSR scoring and mathematical models, that the hotspot size is correlated to the TSR score. Smaller ROIs typically have a higher maximum stroma percentage. This rule of thumb is explained as a smoothing effect enduced by enlarging the ROI (56). Regardless, using TSR in preoperative biopsies for risk stratification has been shown to predict CSS, OS, DFS, lymph node metastasis and distant metastasis (30,57–59). Additionally, stratification of patients based on TSR in preoperative biopsies has been shown to be significantly associated with neoadjuvant treatment response by Liang et al. (40). Two other studies, with smaller sample sizes, did not replicate this result (19,20).

Calculating the TSR over the whole tumour area has been described to consistently underestimating the stroma content in comparison to scoring the TSR in the perceived highest stroma region (55). This TSR difference can be attributed to the stroma heterogeneity of the lesion. In surgical specimen three ROIs are commonly used 1) whole tumour area, 2) infiltrative edge, and 3) highest stroma region. On average, the TSR measured across the whole tumor is lower than the TSR in the infiltrative edge, which is, in turn, lower than the maximum TSR observed in CRC surgical specimen slides (33,35,51). TSR in any of the three regions of interest is a predictor of poor prognosis, but the optimal cut-off value for risk stratification differs, with $C_{WholeTumour} < C_{InfiltrativeEdge} < C_{MaximumStroma}$ (49,51).

Further to scoring TSR in the primary neoplasm specimen, Ubink et al. investigated the use of TSR in peritoneal metastasis and found a significant correlation between TSR in metastatic site and TSR in the primary tumour (36). Also, combining the TSR value in the lymph node with the TSR determined in the primary tumour yielded higher prognostic value than TSR of the primary tumour alone. This was found when reclassifying a subject as stroma-high if TSR in the affected lymph node was higher than 50% regardless of TSR in the primary tumour (10). Although a significant correlation between TSR in primary tumours and their preoperative biopsies and affected lymph nodes was shown, the amount of subjects classified as stroma-high is consistently higher in surgical specimens, as shown in Table 5.

Table 5

Table 5. This table shows the confusion matrices found in literature comparing TSR in lymph node with TSR in primary tumour and TSR in preoperative biospy with TSR in primary tumour. Ubink et al. is excluded from this table, due to a lack of exact type I and II error data (36).

3.2.3 ROI selection methods

Van Pelt et al. proposed a protocol for ROI selection, which has been broadly adopted by most recent studies published after 2018 (9). The process starts by selecting slides from the most invasive part of the tumour for analysis. Initially, areas with the highest amount of stroma are identified under low magnification using a $\times$ 2.5 or $\times$ 5 lens. A single region including tumour and stromal tissue, with tumour cells present along all borders of the FOV is selected using a $\times$ 10 objective lens. In case there are multiple appropriate areas, the maximum TSR is chosen. Areas containing smooth muscle, necrotic tissue, large blood vessels, mucus, or lymphocytic aggregates should be avoided. If these features are unavoidable, they should be visually excluded during the scoring process, as seen in Figure 2.

Figure 2

Illustration of tissue processing and analysis. Panel (a) shows microscope slides labeled with barcodes for slide selection of deepest invasion. Panel (b) depicts a tissue sample with marked regions of interest for selection. Panel (c) displays a close-up of tissue identification for TSR (tumor-stroma ratio) scoring.

Figure 2. Visual representation of the pipeline used in clinical practice to manually evaluate the TSR. According to van Pelt’s 2018 recommendations (9) the first step (a) is selecting the slide to be scored, containing the most invasive part of the tumour. Next, (b) the region of interest (ROI) with the highest perceived amount of stroma is selected. Notably, tumour cells must be present at all four borders of the image field. Only one ROI is necessary, but selecting the most representative area can be challenging as multiple suitable regions may be available. Finally, (c) TSR is calculated after assessing the tissue types within the ROI. Lumina, necrosis, and mucin should be visually ignored for scoring, if present.

Some studies deviated from this protocol by not requiring the presence of the tumour in the four cardinal directions of the FOV (46,50,54,60). Others calculated TSR in the infiltrative edge of tumour, the whole tumour, metastatic lesions, and tissue microarrays instead of the region with the highest perceived stromal content (19,21,40,51). Some used $\times$ 20 or $\times$ 40 objective lens, which lowers the area of the FOV (20,31,61). Four studies modified the field shape, choosing rectangular fields instead of circular ones, especially studies implementing semi-automated scoring approaches. One of these implemented an arbitrary area of 9 mm². The others used an area comparable with an ocular FOV at $\times$ 10 magnification (35,49,50,62).

3.3 Interobserver variability

A total of 28 papers reported interobserver variability of the TSR score, as seen in Table 6. Overall the Kappa coefficient, evaluated on surgical specimen using a semi-quantitative manual approach, is widely variable between and within studies. Median scores between studies range from 0.42 to 0.876 (10,28). The largest variance between observers within the same study ranged from 0.21 to 0.90 (63).

Table 6

Table 6. Interobserver variability scores of human observers.

Two tasks can be separated in the scoring of the TSR, each of which independently contribute to the reliability of the score: 1) The assignment of the optimal TSR location, and 2) the estimation of the score in that region, depicted in Figure 2. A semi-automated approach, in which only the latter of these tasks is automated does not seem to improve the Kappa score compared to the rest of literature (54). Estimating the TSR visually in a predefined ROI, also does not seem to result in an improvement of the observer variability (62). Changing the semi-quantitative task of visual eyeballing to a fully quantitative approach, such as manual tissue segmentation, and stereology analysis, appears to drastically increase the interobserver variability. Respectively these methods resulted in an ICC of 0.99 and Kappa score of 0.986 (40,49). Studies that investigated the scoring of TSR in lymph node and pre-operative biopsy, showed above average reliability, with a Kappa score of 0.866 and >0.9 (55,58). Pathologist experience and seniority also seemed to be correlated with higher interobserver reliability (64).

In order to increase the consistency of TSR scoring, an e-learning was created by Smit et al. (63). A significant improvement of reliability to the ground truth was observed after training (pre $κ$ = 0.72 – post $κ$ = 0.77, p = 0.002). This improvement did not fade after the washout period of 2 months (pre $κ$ = 0.77 – post $κ$ = 0.76, p = 0.30). This study evaluated the consistency of TSR hotspot placement for scoring. The authors noted that the spread seemed to be lower in low-stroma lesions compared to high-stroma lesions. They also found that after training, the spread in hotspot placement decreased (63).

The intraobserver TSR scoring variability was generally lower and had a smaller range compared to interobserver variability. Median Kappa scores for intraboserver variability between papers ranged from 0.77 to 0.89 (10,63,65–67).

3.4 Automated approaches

Automating the TSR calculation involves replacing one or more steps traditionally performed manually by pathologists, including the selection of ROIs, identification of tissue types, and calculation of the final TSR score. Included studies, focused on automating TSR scoring, can be divided into two types: semi-automated and fully automated approaches. Fourteen papers adopted semi-automated methods, where the tissue identification and the TSR estimation were performed automatically within a predefined ROI selected by pathologists (17,24,26,30,33,35,46,50–52,54,62,68,69). Five papers focused on fully automating the TSR evaluation process (47,53,56,70,71). A single study used semi-automated and fully automated pipelines to quantify the TSR (72). Most studies that developed a fully automated pipeline adopted an approach that can be roughly divided into 3 steps: (1) extracting image patches, (2) using a CNN model to classify the extracted patches into 3 or more classes, and (3) calculating TSR (47,53,70).

3.4.1 ROI selection

Four out of six papers that developed a fully-automated process scored TSR on the entire tumour bulk (47,53,70,71). The other two papers focused on selecting a circular ROI, replicating the manual procedure performed by pathologists. After segmenting the WSI, the entire tumour bulk is filtered with a virtual FOV with a diameter of 1.0, 1.5 or 2.0 mm, which calculates the TSR for each possible hotspot and generates a TSR heatmap. The top k areas with the highest stroma percentages were selected as most feasible ROIs (56,72).

3.4.2 Tissue identification

For automated tissue type detection, traditional methods often rely on threshold-based techniques to separate tumour and stroma tissue. These methods are typically followed by post-processing algorithms, such as morphological operations, to refine the output (24,33,54,69). While these methods can be effective for simple cases, they are limited to segmenting two tissue types and struggle to distinguish more complex features. ML methods were proposed, that classify tissue data into malignant classes: 1) tumour, 2) tumour stroma, and benign classes: 1) adipose, 2) mucinous, 3) necrotic, 4) muscular, 5) lymphatic, 6) background, and 7) healthy glandular tissue. Features extracted from super-pixels or patches are fed into a random forest or support vector machine classifier and one of the aforementioned tissues is predicted (50). Convolutional neural network (CNN) based models were used to automatically extract features from an image. Architectures such as VGG19, AlexNet, Googlenet, ResNet50 and custom models were used for patch classification (47,53,68,70,71). UNet and other fully convolutional networks were used for pixel-wise classification (17,51,52,56,62,72). Techniques like CycleGAN and transfer learning had further addressed challenges such as limited data availability and improved model performance and its ability to generalise (70,72).

3.4.3 TSR validation

Validation of the automated pipeline can be performed on two levels: 1) tissue identification performance, which is reflected in the model’s ability to classify or segment different tissue types, and 2) TSR value estimation, by comparing the TSR values generated by artificial intelligence (AI) with expert assessments. CNN models have shown strong performance in tissue identification. For classification tasks, patch-wise accuracy ranges from 86.6% to 97.5%, while random forest classifiers achieve an accuracy of 76%–83%. For semantic segmentation, pixel-wise accuracy varied between 72.4% and 94.6%, Table 7. The performance metrics reported in these studies highlight the effectiveness of AI models to accurately differentiate tissue types. Firmbach et al. proposed a survey that involved expert ratings of the segmentation maps generated by the model on a scale from 1 to 10. Automated segmentations were considered high quality if the pathologists rated it $\geq 9 / 10$ . This test showed that, on average, in regions with high segmentation quality, AI TSR values were 11.1–11.5 percentage points lower than those estimated by human observers, which was interpreted as human overestimation. Conversely, in cases with poor segmentation quality, the discrepancies were attributed to AI errors, particularly in challenging cases like rare tumour subtypes (62).

Table 7

Table 7. Studies which provided the comparison between their automated quantification approach and the manual counterpart. The “Stroma hotspot” refers to the region with the highest perceived stroma as defined by Van Pelt et al. (9) TB: Tumour bulk, SH: Stroma hotspot, IF: Infiltrative edge, Cls: classification task, Seg: segmentation task.

Based on the reviewed studies, the agreement between manual and automated TSR was fair to moderate for CNN models, as seen in Table 7. In these studies ICC values ranged from 0.411 to 0.937, while Kappa score values ranged between 0.239 and 0.472, indicating that automated ML-based tools are a promising method for scoring TSR, but still require further validation by experts. In addition to ML-based models, one study reported almost perfect agreement with an ICC of 0.822 and a Kappa score of 0.813, using a thresholding method (69). Smit MA et al. further compared the semi-automated method with the fully automated approach, which demonstrated good agreement, with Spearman correlation coefficients ranging from 0.76 to 0.83 (72).

4 Discussion

4.1 Usability and scoring

The TSR was shown to be a strong prognostic indicator for DFS. Most studies have also identified TSR as a statistically significant prognostic indicator for OS. While some studies did not reach statistical significance, most still showed a trend linking higher stroma content to shorter OS.

It is repeatedly mentioned in literature that TSR might be used to predict therapy resistance (10,18,40). Evidence for this is fairly scarce and included studies, investigating therapy resistance, have varying methods of TSR scoring and specific treatment received, with mixed and contradictory results (19,45). This highlights the need for further research on therapy resistance and its relation to stromal content.

This review emphasises the wide variety of TSR scoring protocols. The deviations from the protocol are most apparent in ROI location, ROI area, histological specimen type and the management of non-tumour epithelium and non-stromal tissues. This variety creates large differences in TSR evaluation, due to stromal heterogeneity of CRC (56). Stromal heterogeneity also influences the placement of the scoring ROI, which is especially apparent in stroma-high tumours, where stromal heterogeneity seems typically higher. Theoretically restricting ROI placement to a specific area of the slide, such as the infiltrative edge, could improve consistency by reducing variability in selection compared to placing ROIs across the entire slide. The TSR score calculated over this region however is consistently lower than the TSR in the hotspot suggested by Van Pelt et al. The deviation of TSR between protocols, impedes the reliable comparison of study results. Opting for a different quantification protocol, should be met with a specifically optimised cut-off value.

4.2 Terminology and bias

The term “tumour stroma ratio” implies the calculation of a ratio, which is mathematically defined as $\frac{α}{β}$ , where in fact it is a percentage, defined as $\frac{α}{α + β} * 100 %$ . The commonly used alternative term “tumour stroma percentage” is more in line with the stromal percentages typically used in practice to indicate stromal content. Moreover, the “tumour stroma ratio” can be misinterpreted as the ratio of tumoural stroma instead of the “ratio” of tumour to stroma. Misinterpretation of the term TSR has led to many articles defining TSR-high as stroma-high, thus leading to inconsistencies in the classification of tumours and miscommunication in research findings (10,17,19,20,23,29,47,55,64,70,73,74).

4.3 Scoring variability

We show that there is a wide range of reported interobserver reliability scores. On the low end Li et al. and Dang et al., reported a Kappa score of 0.51 over 996 samples and 0.42 over 183 samples respectively (28,69). On the higher end, Strous et al. and Ravensbergen et al., reported Kappa scores of 0.88 over 201 samples and 0.85 over 111 samples, respectively (10,75).

The improvement of interobserver variability after specific training for TSR scoring, as shown by the e-learning investigations of Smit et al. as well as the correlation of experience and interobserver variability as displayed by Souza da Silva et al. emphasize the need for training to improve concordance both in a research and clinical setup (63,64). Incorporating training into the multicentre studies as performed by Polack et al in the UNITED studies, greatly improves robustness of the study outcomes, and is recommended for future studies on the TSR score (18).

The median percentage of stroma-high subjects of all included papers is 37.3% ranging from 2.1% to 85.9%. This is concordant with the mean percentage of stroma-high subjects identified in the meta-analysis by Pyo et al. of 35.3% (15). The wide reported TSR range can be attributed to the inclusion and exclusion criteria of the study subjects, TSR scoring protocol, and interobserver variability. Though the reported Kappa scores indicate a moderate to substantial agreement, this should not be confused with discrepancies being acceptable for clinical adoption, Table 6. In the study by Souza da Silva et al., though the Kappa score of 0.746 indicates a substantial agreement, large discrepancies in the classification were observed. A senior pathologist classified 32.7% of samples as stroma-high whereas the baseline pathologist classified 44.9% of subjects as such, which is an increase of 37%. This shows that while Kappa can suggest substantial agreement, clinically relevant discrepancies can still occur.

The major components of variability in TSR scoring are: 1) the placement of the ROI, and 2) the estimation of the TSR percentage. Smit et al. looked at the spread of ROI placement in a set of 31–36 observers. Using a visual estimation, they found that agreeing upon a hotspot is more difficult in stroma-high cases compared to stroma-low cases. Particularly difficult cases are those including mucin lakes, large regions of necrosis, and regions were smooth muscle and stroma intermingle (63). Although it is common knowledge in TSR scoring that disagreement on ROI location is high, there is no existing metric used in literature to evaluate the placement of ROIs. For the estimation of the TSR percentage in a predefined FOV manual tissue segmentation is an almost perfect ground truth (40).

The placement of ROI is often seen as the cause for interobserver variability of the TSR score. It appears however that scoring the TSR on a predefined ROI over an independently selected ROI did not result in a measurably lower interobserver variability. Various studies have suggested that human observers face challenges in accurately estimating TSR visually. This is shown by measuring the difference of a TSR score calculated from tissue segmentation, against a TSR score visually estimated from that same region. Firmbach et al. found a mean overestimation of the TSR score of approximately 11.1%, ranging from −20% to 40% difference of the visual scoring to a quantitative baseline (62). The unreliability is also emphasised when comparing the interobserver variability of manual quantitative tasks, with the interobserver variability in semi-quantitative tasks, where a rise in variability is observed in semi-quantitative scoring (40,49). Manual quantitative measures, however, are undesirable in clinical practice, as scoring TSR using stereology is an oversimplification of the TSR and manual segmentation of tissues is tedious and time-consuming.

Overall, the interobserver variability in pre-operative biopsies and lymph nodes was lower than in surgical specimen, Table 6. In biopsies, subjects are consistently more likely to be classified as stroma-low compared to their primary tumour counterpart. It is hypothesised that this is caused by a sampling bias. The biopsy might not be performed at the level of deepest tumour invasion, and the region with the highest stroma might not be included in the biopsy. Therefore the TSR in pre-operative biopsies are a low sensitivity prognostic tool and a poor predictor for the TSR score in the primary tumour. Despite this, the TSR in pre-operative biopsy was still shown to be an independent predictor for poor prognosis (30,57–59).

4.4 Automated quantification

Carvalho et al. as well as Geessink et al. show that the manual TSR and automated TSR score are not comparable, and thus are currently not interchangeable (17,56). Despite this, automation of TSR quantification, using deterministic models, reduces the subjectivity of the score. More consistent and stringent adherence to the TSR scoring protocol and with it minimisation of the interobserver variability could be achieved by introduction of automated TSR scoring. On the lowest end of complexity, binary threshold methods have been used, which showed great promise to mimic human observers, with a Kappa value of 0.813 (69). However, a two-tissue segmentation model makes it impossible to adhere to the rules of tissue exclusion for quantification of the TSR. ML models are used to perform classification of tissue coordinates or patches. These methods enable the exclusion of irrelevant tissues in TSR quantification. A downside of classification methods is a lowered resolution of tissue detection introduced by patch or point sampling. Semantic segmentation overcomes this issue with a classification on pixel level. This benefit comes with the downside of having the highest model complexity, as well as the most tedious ground truth labelling process, making the acquisition of it labour intensive and the public availability of it scarce. Models performing a classification task, might not provide the resolution needed for reliably quantifying the TSR, and thus tissue segmentation models could be a preferred solution.

The largest hurdle for the use of any automated solution in histopathology is the smooth adoption of the AI model in the pathologist’s workflow. The tool should provide the pathologist with fast, human interpretable, and most importantly accurate feedback (76). A reason to favour image classification models over pixel-wise models is the simpler ground-truth labelling and architecture complexity, which reduces response time in clinical setting and makes them cheaper to train. Besides this, the image classification task generalizes better with a low amount of data compared to segmentation models. However, their spatial resolution cannot offer a fine-grained human interpretable response, nor is it accurate enough for finding the TSR score. The high resolution output of pixel-wise models are precise and can provide human interpretable feedback (77). These segmentation models require more processing time, which adds complexity to the integration of these solutions into clinical workflow (78). In TSR specifically, the greatest hurdle is the discordance between expert estimation of the TSR and automatically evaluated TSR scores, despite models’ high performance for the identification of tissues (17,62,70).

The common approach for fully automated methods found in literature, is creating an AI classification model for benign and malignant tissue types that performs the classification of all patches containing tissue within the WSI, and calculates the TSR for the entire tumour bulk (53,70,71). Petäinen et al. report a Kappa score of 0.33, despite a patch classification accuracy of 96.1%. This discrepancy is caused by comparing an automated quantification of whole tumour TSR with a semi-quantitative quantification of the highest perceived stroma region. This discrepancy disappears when comparing automated and manual approaches for identical quantification strategies (53). Two studies performed a fully automated TSR quantification of the highest perceived stroma region and compared its performance to human observers (56). This translated to a strong Pearson correlation between manual and automated quantification (72).

Note that the optimal cut-off for prediction of prognosis was calculated using scores generated semi-quantitively, using a conventional microscope. The TSR is systematically underestimated by human observers, which leads to a larger amount of subjects being classified as stroma-high in automated quantification (17,51,62,69). This means that for effective stratification of subjects using automated TSR a new optimal cut-off may need to be defined.

Evaluating a fully automated approach requires assessing each step independently to determine its specific contribution and identify sources of inaccuracies. Based on the reviewed studies, only one paper has conducted such assessments for TSR estimation (62). Despite this finding, most studies in the literature assessed TSR estimation by comparing automated TSR-score with manual TSR-score. However, manual estimation remains subjective, making the ground-truth of the comparison debatable. Additionally, to date, no studies have evaluated ROI selection in fully automated TSR quantification pipelines. This evaluation is highly subjective, as its current gold standard is consensus of expert judgement. It is important to note that defining a quantitative metric for evaluating hotspot selection is challenging. Multiple adequate ROIs exist within a slide, making commonly used metrics such as Dice or Euclidean distance unfeasible.

The systematic overestimation of the TSR score can explain the strongly improved kappa scores, for a median cut-off compared to a 50% cut-off in the automated tool proposed by Geessink et al. ( $κ = 0.239$ to $κ = 0.521$ ) (17). It could also explain the optimal clinical threshold of 80% that was found in the study performed by Carvalho et al. (56). In general, upon evaluating the automated scoring methods, we observe that the Kappa score for classification agreement are poor. Metrics evaluating a monotonic or linear relationship, tend to be higher, solidifying the findings by Carvalho et al. and Geessink et al., Table 7.

Despite these findings there have been no attempts to isolate the cause of this systematic overestimation yet. To identify the root cause of the discrepancy, we propose to evaluate the effectors to the TSR score individually, both for the manual and automated process. In the automated process, these are: 1) tissue identification, 2) automated region selection, and 3) TSR evaluation, which is the tool’s capability of translating identified tissues to a percentage score. In the manual process, these are: 1) The ability to find an optimal ROI, and 2) the ability to accurately eyeball the TSR score in a given region. To our knowledge, this last factor has not yet been addressed. We suggest a study setup in which pathologists eyeball the TSR score on previously fully annotated tumour regions after a defined washout period. This would quantify the systematic error of pathologists when visually estimating an ROI. Besides this, we strongly advise to make use of the “discrepancy ratio” in the evaluation of automated solutions in the field of pathology, as it is specifically designed to evaluate the performance of automated tools in tasks where there is frequent disagreement between experts (79).

Some limitations of this scoping review are that it included only articles after 2018, which excludes early research and thus could exclude articles that have led to the creation of the standardised scoring protocol (9). This might obscure the rationale for specific steps in these recommendations. Besides this, conference papers were excluded from this review, which might result in an underestimation of the number of automated TSR pipelines developed.

Also, this review focused solely on papers related to colorectal cancer, which may have limited the identification of additional rules and techniques for TSR calculation, such as the formula applied and the ROI selection process, as well as automated techniques for TSR scoring.

An increase in published articles can be seen as a trend over time, with a mean of 10 articles per year published from 2018 to 2020 and a mean of 15 articles published per year from 2021 to 2025. Interestingly, most of existing literature is dominated by a few research groups. Before 2021 the three most prominent groups were responsible for 64% of published studies. Averaged over all included studies in this review, these research groups are responsible for 38% of all published studies. The single most dominant research group is responsible for 22% of published studies from 2018 onwards. A reader unaware of this imbalance in publishing might form an incomplete view of the existing evidence and clinical practices.

5 Conclusion

TSR is a robust indicator for DFS and OS, and can possibly predict therapy resistance. Adoption of a single procedure for TSR scoring is inconsistent across research communities. This is most apparent in ROI selection and determining the cut-off value for risk stratification. TSR scoring in pre-operative biopsies may be a significant indicator for poor prognosis, despite being a poor predictor for TSR in the primary tumour. The scoring procedure followed is strongly correlated with the optimal cut-off for stratifying subjects into risk categories, which is likely caused by stromal heterogeneity of colorectal lesions. Additionally, it influences the inter-observer variability. Kappa scores for manual semi-quantitative scoring solutions range from 0.42 to 0.88. Automated scoring solutions are proposed to reduce labour and increase interobserver reliability. Despite showing high model performance, comparisons between manual and automated TSR scores result in kappa scores ranging from 0.239 to 0.472. In order to adopt TSR scoring in clinical practice, it is essential to standardise the scoring process, including the equation, region of interest selection, and cut-off value. Moreover, the development of an automated tool to assist pathologists requires a well-defined validation process that goes beyond comparisons with human observers and incorporates additional methods to assess the tool’s accuracy, reliability and clinical usability.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Author contributions

FD: Conceptualization, Methodology, Validation, Formal Analysis, Investigation, Data curation, Writing – original draft, Writing – review and editing, Visualization. CF: Conceptualization, Methodology, Validation, Formal Analysis, Investigation, Data curation, Writing – original draft, Writing – review and editing, Visualization. MW: Resources, Writing – review and editing, Project Administration, Funding Acquisition. RS: Writing – review and editing. RM-N: Writing – review and editing. JF: Writing – review and editing. DO: Conceptualization, Validation, Writing – review and editing. DM: Conceptualization, Validation, Writing – review and editing, Visualization, Supervision, Project Administration, Funding Acquisition. IMP: Writing – review and editing. JW: Conceptualization, Methodology, Validation, Resources, Writing – original draft, Writing – review and editing, Supervision, Project Administration.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This article is a result of the project 12683-ARABESC supported by Eurostars grant by Eureka/Operational Program for Innovation and Digital Transition (COMPETE 2030), under the PORTUGAL 2030 Partnership Agreement, through the European Regional Development Fund (ERDF) and the RVO (Rijksdienst voor Ondernemend Nederland), part of the Dutch ministry of Economic Affairs. Eurostars is part of the European Partnership on Innovative SMEs. The research conducted is also co-funded by WSK Medical and IMP Diagnostics. The partnership is co-funded by the European Union through Horizon Europe.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/or.2025.1605383/full#supplementary-material

Abbreviations

AI, Artificial Intelligence; CC, Colon Carcinoma; CNN, Convolutional Neural Network; CRC, Colorectal Carcinoma; CSS, Cancer Specific Survival; DFS, Disease Free Survival; FOV, Field of View; ICC, Intra-class correlation; OS, Overall Survival; RC, Rectal Carcinoma; ROI, Region of Interest; SP, Stroma Proportion; TP, Tumour Proportion; TSP, Tumour Stroma Percentage; TSR, Tumour Stroma Ratio.

References

1. Amin, MB, Edge, SB, Greene, FL, Byrd, DR, Brookland, RK, Washington, MK, et al. AJCC cancer staging manual, 1024. Springer (2017).

Google Scholar

2. Loughrey, M, Quirke, P, and Shepherd, N. G049 dataset for histopathological reporting of colorectal cancer. R Coll Pathol (2018).

Google Scholar

3. Loughrey, M, Arends, M, Brown, I, Burgart, L, Cunningham, C, Flejou, J, et al. Colorectal cancer histopathology reporting guide (2020).

Google Scholar

4. Jain, D, Chopp, W, and Graham, R. Protocol for the examination of resection specimens from patients with primary carcinoma of the Colon and rectum (2023).

Google Scholar

5. Höppener, DJ, Stook, JLP, Galjart, B, Nierop, PM, Nagtegaal, ID, Vermeulen, PB, et al. The relationship between primary colorectal cancer histology and the histopathological growth patterns of corresponding liver metastases. BMC cancer (2022) 22:911. doi:10.1186/s12885-022-09994-3

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Mesker, WE, Junggeburt, JMC, Szuhai, K, de Heer, P, Morreau, H, Tanke, HJ, et al. The carcinoma-stromal ratio of Colon carcinoma is an independent factor for survival compared to lymph node status and tumor stage. Anal Cell Pathol (2007) 29:387–98. doi:10.1155/2007/175276

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Huijbers, A, Tollenaar, R, v Pelt, G, Zeestraten, E, Dutton, S, McConkey, C, et al. The proportion of tumor-stroma as a strong prognosticator for stage ii and iii colon cancer patients: validation in the victor trial. Ann Oncol (2013) 24:179–85. doi:10.1093/annonc/mds246

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Gao, J, Shen, Z, Deng, Z, and Mei, L. Impact of tumor-stroma ratio on the prognosis of colorectal cancer: a systematic review. Front Oncol (2021) 11:738080. doi:10.3389/fonc.2021.738080

PubMed Abstract | CrossRef Full Text | Google Scholar

9. van Pelt, GW, Kjær-Frifeldt, S, van Krieken, JHJM, Al Dieri, R, Morreau, H, Tollenaar, RAEM, et al. Scoring the tumor-stroma ratio in colon cancer: procedure and recommendations. Virchows Archiv: Int J Pathol (2018) 473:405–12. doi:10.1007/s00428-018-2408-z

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Strous, MTA, Faes, TKE, Gubbels, ALHM, van der Linden, RLA, Mesker, WE, Bosscha, K, et al. A high tumour-stroma ratio (TSR) in colon tumours and its metastatic lymph nodes predicts poor cancer-free survival and chemo resistance. Clin Translational Oncol (2022) 24:1047–58. doi:10.1007/s12094-021-02746-y

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Dekker, T, Van De Velde, C, Van Pelt, G, Kroep, J, Julien, J, Smit, V, et al. Prognostic significance of the tumor-stroma ratio: validation study in node-negative premenopausal breast cancer patients from the eortc perioperative chemotherapy (pop) trial (10854). Breast Cancer Res Treat (2013) 139:371–9. doi:10.1007/s10549-013-2571-5

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Roeke, T, Sobral-Leite, M, Dekker, TJ, Wesseling, J, Smit, VT, Tollenaar, RA, et al. The prognostic value of the tumour-stroma ratio in primary operable invasive cancer of the breast: a validation study. Breast Cancer Res Treat (2017) 166:435–45. doi:10.1007/s10549-017-4445-8

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Courrech Staal, EFW, Wouters, MW, van Sandick, JW, Takkenberg, MM, Smit, VT, Junggeburt, JM, et al. The stromal part of adenocarcinomas of the oesophagus: does it conceal targets for therapy? Eur J Cancer (2010) 46:720–8. doi:10.1016/j.ejca.2009.12.006

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Zhang, T, Xu, J, Shen, H, Dong, W, Ni, Y, and Du, J. Tumor-stroma ratio is an independent predictor for survival in nsclc. Int J Clin Exp Pathol (2015) 8:11348–55.

PubMed Abstract | Google Scholar

15. Pyo, JS, Kim, NY, Min, KW, and Kang, DW. Significance of tumor-stroma ratio (TSR) in predicting outcomes of malignant tumors. Medicina (Kaunas, Lithuania) (2023) 59:1258. doi:10.3390/medicina59071258

PubMed Abstract | CrossRef Full Text | Google Scholar

16. van Pelt, GW, Sandberg, TP, Morreau, H, Gelderblom, H, van Krieken, JHJM, Tollenaar, RAEM, et al. The tumour-stroma ratio in colon cancer: the biological role and its prognostic impact. Histopathology (2018) 73:197–206. doi:10.1111/his.13489

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Geessink, OGF, Baidoshvili, A, Klaase, JM, Ehteshami Bejnordi, B, Litjens, GJS, van Pelt, GW, et al. Computer aided quantification of intratumoral stroma yields an independent prognosticator in rectal cancer. Cell Oncol (2019) 42:331–41. doi:10.1007/s13402-019-00429-z

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Polack, M, Smit, MA, van Pelt, GW, Roodvoets, AGH, Meershoek-Klein Kranenbarg, E, Putter, H, et al. Results from the UNITED study: a multicenter study validating the prognostic effect of the tumor-stroma ratio in colon cancer. ESMO open (2024) 9:102988. doi:10.1016/j.esmoop.2024.102988

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Li, B, Chen, L, Huang, Y, Wu, M, Fang, W, Zou, X, et al. Are the tumor microenvironment characteristics of pretreatment biopsy specimens of colorectal cancer really effectively predict the efficacy of neoadjuvant therapy: a retrospective multicenter study. Medicine (2024) 103:e39429. doi:10.1097/MD.0000000000039429

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Yim, K, Jang, WM, Cho, U, Sun, DS, Chong, Y, and Seo, KJ. Intratumoral budding in pretreatment biopsies, among tumor microenvironmental components, can predict prognosis and neoadjuvant therapy response in colorectal adenocarcinoma. Medicina (Kaunas, Lithuania) (2022) 58:926. doi:10.3390/medicina58070926

PubMed Abstract | CrossRef Full Text | Google Scholar

21. den Uil, SH, van den Broek, E, Coupé, VMH, Vellinga, TT, Delis-van Diemen, PM, Bril, H, et al. Prognostic value of microvessel density in stage II and III colon cancer patients: a retrospective cohort study. BMC Gastroenterol (2019) 19:146. doi:10.1186/s12876-019-1063-4

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Tricco, AC, Lillie, E, Zarin, W, O’Brien, KK, Colquhoun, H, Levac, D, et al. PRISMA extension for scoping reviews (PRISMA-ScR): checklist and explanation. Ann Intern Med (2018) 169:467–73. doi:10.7326/M18-0850

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Zhang, Y, Liu, Y, Qiu, X, and Yan, B. Concurrent comparison of the prognostic values of tumor budding, tumor Stroma ratio, tumor infiltrating pattern and lymphocyte-to-monocyte ratio in colorectal cancer patients. Technol Cancer Res and Treat (2021) 20:15330338211045826. doi:10.1177/15330338211045826

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Zhao, Y, Xia, S, Zhao, X, Song, Z, Wang, F, Mao, L, et al. DNA ploidy combined with tumor stroma as a biomarker for predicting the prognosis of stage II colorectal cancer patients and identifying candidates for chemotherapy. World J Surg Oncol (2025) 23:49. doi:10.1186/s12957-025-03693-6

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Fan, S, Cui, X, Zheng, L, Ma, W, Zheng, S, Wang, J, et al. Prognostic value of desmoplastic stromal reaction, tumor budding and tumor-stroma ratio in stage II colorectal cancer. J Gastrointest Oncol (2022) 13:2903–21. doi:10.21037/jgo-22-758

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Zhao, Z, Zhang, X, Li, Z, Gao, Y, Guan, X, Jiang, Z, et al. Automated assessment of DNA ploidy, chromatin organization, and stroma fraction to predict prognosis and adjuvant therapy response in patients with stage II colorectal carcinoma. Am J Cancer Res (2021) 11:6119–32.

PubMed Abstract | Google Scholar

27. Fekete, Z, Ignat, P, Resiga, AC, Todor, N, Muntean, AS, Resiga, L, et al. Unselective measurement of tumor-to-stroma proportion in Colon cancer at the invasion Front-An elusive prognostic factor: original patient data and review of the literature. Diagnostics (Basel) (2024) 14:836. doi:10.3390/diagnostics14080836

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Dang, H, van Pelt, GW, Haasnoot, KJ, Backes, Y, Elias, SG, Seerden, TC, et al. Tumour-stroma ratio has poor prognostic value in non-pedunculated T1 colorectal cancer: a multi-centre case-cohort study. United Eur Gastroenterol J (2021) 9:478–85. doi:10.1177/2050640620975324

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Jakab, A, Patai, ÁV, Darvas, M, Tormássi-Bély, K, and Micsik, T. Microenvironment, systemic inflammatory response and tumor markers considering consensus molecular subtypes of colorectal cancer. Pathol Oncol Res (2024) 30:1611574. doi:10.3389/pore.2024.1611574

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Zhao, Q, Zhong, H, Guan, X, Wan, L, Zhao, X, Zou, S, et al. Role of microenvironment characteristics and MRI radiomics in the risk stratification of distant metastases in rectal cancer: a diagnostic study. Int J Surg (London, England) (2025) 111:200–9. doi:10.1097/JS9.0000000000001916

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Aboelnasr, LS, El-Rebey, HS, Mohamed, A, and Abdou, AG. The prognostic impact of tumor border configuration, tumor budding and tumor stroma ratio in colorectal carcinoma. Turk patoloji dergisi (2023) 39:83–93. doi:10.5146/tjpath.2022.01579

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Souza, D, Queiroga, E, De, T, Cunha, K, and Dias, E. Stromal scoring in advanced colon and rectal cancer: stroma-Rich tumors and their association with aggressive phenotypes. Archive Oncol (2022) 28:1–6. doi:10.2298/AOO210403003S

CrossRef Full Text | Google Scholar

33. Miller, S, Bauer, S, Schrempf, M, Schenkirsch, G, Probst, A, Märkl, B, et al. Semiautomatic analysis of tumor proportion in colon cancer: lessons from a validation study. Pathol Res Pract (2021) 227:153634. doi:10.1016/j.prp.2021.153634

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Kang, G, Pyo, JS, Kim, NY, and Kang, DW. Clinicopathological significances of tumor-stroma ratio (TSR) in colorectal cancers: prognostic implication of TSR compared to hypoxia-inducible Factor-1Î± expression and microvessel density. Curr Oncol (Toronto, Ont.) (2021) 28:1314–24. doi:10.3390/curroncol28020125

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Martin, B, Banner, BM, Schäfer, EM, Mayr, P, Anthuber, M, Schenkirsch, G, et al. Tumor proportion in colon cancer: results from a semiautomatic image analysis approach. Virchows Archiv: Int J Pathol (2020) 477:185–93. doi:10.1007/s00428-020-02764-1

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Ubink, I, van Eden, WJ, Snaebjornsson, P, Kok, NFM, van Kuik, J, van Grevenstein, WMU, et al. Histopathological and molecular classification of colorectal cancer and corresponding peritoneal metastases. Br J Surg (2018) 105:e204–e211. doi:10.1002/bjs.10788

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Hansen, TF, Kjær-Frifeldt, S, Lindebjerg, J, Rafaelsen, SR, Jensen, LH, Jakobsen, A, et al. Tumor-stroma ratio predicts recurrence in patients with colon cancer treated with neoadjuvant chemotherapy. Acta Oncologica (Stockholm, Sweden) (2018) 57:528–33. doi:10.1080/0284186X.2017.1385841

PubMed Abstract | CrossRef Full Text | Google Scholar

38. van Wyk, HC, Roseweir, A, Alexander, P, Park, JH, Horgan, PG, McMillan, DC, et al. The relationship between tumor Budding, tumor microenvironment, and survival in patients with primary operable colorectal cancer. Ann Surg Oncol (2019) 26:4397–404. doi:10.1245/s10434-019-07931-6

PubMed Abstract | CrossRef Full Text | Google Scholar

39. van de Weerd, S, Smit, MA, Roelands, J, Mesker, WE, Bedognetti, D, Kuppen, PJK, et al. Correlation of immunological and histopathological features with gene expression-based classifiers in Colon cancer patients. Int J Mol Sci (2022) 23:12707. doi:10.3390/ijms232012707

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Liang, Y, Zhu, Y, Lin, H, Zhang, S, Li, S, Huang, Y, et al. The value of the tumour-stroma ratio for predicting neoadjuvant chemoradiotherapy response in locally advanced rectal cancer: a case control study. BMC cancer (2021) 21:729. doi:10.1186/s12885-021-08516-x

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Strous, MTA, Faes, TKE, Heemskerk, J, Lohman, BGPM, Simons, PCG, Janssen Heijnen, MLG, et al. Tumour-stroma ratio to predict pathological response to neo-adjuvant treatment in rectal cancer. Surg Oncol (2022) 45:101862. doi:10.1016/j.suronc.2022.101862

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Tian, W, Yang, Y, Qin, Q, Zhang, L, Wang, Z, Su, L, et al. Vimentin and tumor-stroma ratio for neoadjuvant chemoradiotherapy response prediction in locally advanced rectal cancer. Cancer Sci (2023) 114:619–29. doi:10.1111/cas.15610

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Zunder, SM, van Pelt, GW, Gelderblom, HJ, Mancao, C, Putter, H, Tollenaar, RA, et al. Predictive potential of tumour-stroma ratio on benefit from adjuvant bevacizumab in high-risk stage II and stage III colon cancer. Br J Cancer (2018) 119:164–9. doi:10.1038/s41416-018-0083-0

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Huijbers, A, van Pelt, GW, Kerr, RS, Johnstone, EC, Tollenaar, RAEM, Kerr, DJ, et al. The value of additional bevacizumab in patients with high-risk stroma-high colon cancer. A study within the QUASAR2 trial, an open-label randomized phase 3 trial. J Surg Oncol (2018) 117:1043–8. doi:10.1002/jso.24998

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Ravensbergen, CJ, Polack, M, Roelands, J, Crobach, S, Putter, H, Gelderblom, H, et al. Combined assessment of the tumor-stroma ratio and tumor immune cell infiltrate for immune checkpoint inhibitor therapy response prediction in Colon cancer. Cells (2021) 10:2935. doi:10.3390/cells10112935

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Jin, HY, Yoo, SY, Lee, JA, Wen, X, Kim, Y, Park, HE, et al. Combinatory statuses of tumor stromal percentage and tumor infiltrating lymphocytes as prognostic factors in stage III colorectal cancers. J Gastroenterol Hepatol (2022) 37:551–7. doi:10.1111/jgh.15774

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Yang, J, Ye, H, Fan, X, Li, Y, Wu, X, Zhao, M, et al. Artificial intelligence for quantifying immune infiltrates interacting with stroma in colorectal cancer. J Translational Med (2022) 20:451. doi:10.1186/s12967-022-03666-3

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Hong, SA, Lee, HJ, Kim, OH, Hong, M, Kim, JW, and Kim, JY. MicroRNA-206 overexpression is associated with a prominent inflammatory reaction and a favorable colorectal cancer prognosis. Pathol Res Pract (2024) 263:155573. doi:10.1016/j.prp.2024.155573

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Hutchins, GGA, Treanor, D, Wright, A, Handley, K, Magill, L, Tinkler-Hundal, E, et al. Intratumoral stromal morphometry predicts disease recurrence but not response to 5-fluorouracil-results from the QUASAR trial of colorectal cancer. Histopathology (2018) 72:391–404. doi:10.1111/his.13326

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Wright, AI, Dunn, CM, Hale, M, Hutchins, GGA, and Treanor, DE. The effect of quality control on accuracy of digital pathology image analysis. IEEE J Biomed Health Inform (2021) 25:307–14. doi:10.1109/JBHI.2020.3046094

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Jakab, A, Patai, ÁV, and Micsik, T. Digital image analysis provides robust tissue microenvironment-based prognosticators in patients with stage I-IV colorectal cancer. Hum Pathol (2022) 128:141–51. doi:10.1016/j.humpath.2022.07.003

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Sinicrope, FA, Nelson, GD, Saberzadeh-Ardestani, B, Segovia, DI, Graham, RP, Wu, C, et al. Use of deep learning to evaluate tumor microenvironmental features for prediction of Colon cancer recurrence. Cancer Res Commun (2024) 4:1344–50. doi:10.1158/2767-9764.CRC-24-0031

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Zhao, K, Li, Z, Yao, S, Wang, Y, Wu, X, Xu, Z, et al. Artificial intelligence quantified tumour-stroma ratio is an independent predictor for overall survival in resectable colorectal cancer. EBioMedicine (2020) 61:103054. doi:10.1016/j.ebiom.2020.103054

PubMed Abstract | CrossRef Full Text | Google Scholar

54. Inoue, H, Kudou, M, Shiozaki, A, Kosuga, T, Shimizu, H, Kiuchi, J, et al. Value of the tumor-stroma ratio and structural heterogeneity measured by a novel semiautomatic image analysis technique for predicting survival in patients with Colon cancer. Dis Colon and Rectum (2023) 66:1449–61. doi:10.1097/DCR.0000000000002570

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Polack, M, Hagenaars, SC, Couwenberg, A, Kool, W, Tollenaar, RAEM, Vogel, WV, et al. Characteristics of tumour stroma in regional lymph node metastases in colorectal cancer patients: a theoretical framework for future diagnostic imaging with FAPI PET/CT. Clin Translational Oncol (2022) 24:1776–84. doi:10.1007/s12094-022-02832-9

PubMed Abstract | CrossRef Full Text | Google Scholar

56. Carvalho, R, Zander, T, Barroso, VM, Bekisoglu, A, Zerbe, N, Klein, S, et al. AI-based tumor-stroma ratio quantification algorithm: comprehensive evaluation of prognostic role in primary colorectal cancer. Virchows Archiv: Int J Pathol (2025). doi:10.1007/s00428-025-04048-y

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Zengin, M, and Benek, S. The proportion of tumour-stroma in metastatic lymph nodes is an accurately prognostic indicator of poor survival for advanced-stage Colon cancers. Pathol and Oncol Res (2020) 26:2755–64. doi:10.1007/s12253-020-00877-1

PubMed Abstract | CrossRef Full Text | Google Scholar

58. Fu, M, Chen, D, Luo, F, Li, M, Wang, Y, Chen, J, et al. Association of the tumour stroma percentage in the preoperative biopsies with lymph node metastasis in colorectal cancer. Br J Cancer (2020) 122:388–96. doi:10.1038/s41416-019-0671-7

PubMed Abstract | CrossRef Full Text | Google Scholar

59. Park, JH, van Wyk, H, McMillan, DC, Edwards, J, Orange, C, Horgan, PG, et al. Preoperative, biopsy-based assessment of the tumour microenvironment in patients with primary operable colorectal cancer. J Pathol Clin Res (2020) 6:30–9. doi:10.1002/cjp2.143

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Sandberg, TP, Sweere, I, van Pelt, GW, Putter, H, Vermeulen, L, Kuppen, PJ, et al. Prognostic value of low CDX2 expression in colorectal cancers with a high stromal content - a short report. Cell Oncol (Dordrecht, Netherlands) (2019) 42:397–403. doi:10.1007/s13402-019-00436-0

PubMed Abstract | CrossRef Full Text | Google Scholar

61. Loft, MK, Pedersen, MRV, Lindebjerg, J, Rahr, HB, and Rafaelsen, SR. Endorectal ultrasound shear-wave elastography of complex rectal adenoma and early rectal cancer. Diagnostics (Basel, Switzerland) (2022) 12:2166. doi:10.3390/diagnostics12092166

PubMed Abstract | CrossRef Full Text | Google Scholar

62. Firmbach, D, Benz, M, Kuritcyn, P, Bruns, V, Lang-Schwarz, C, Stuebs, FA, et al. Tumor-stroma ratio in colorectal cancer-comparison between human estimation and automated assessment. Cancers (2023) 15:2675. doi:10.3390/cancers15102675

PubMed Abstract | CrossRef Full Text | Google Scholar

63. Smit, MA, van Pelt, GW, Dequeker, EM, Al Dieri, R, Tollenaar, RA, van Krieken, JHJ, et al. e-Learning for instruction and to improve reproducibility of scoring tumor-stroma ratio in Colon carcinoma: performance and reproducibility assessment in the UNITED study. JMIR formative Res (2021) 5:e19408. doi:10.2196/19408

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Souza da Silva, RM, Queiroga, EM, Paz, AR, Neves, FFP, Cunha, KS, and Dias, EP. Standardized assessment of the tumor-stroma ratio in colorectal cancer: interobserver validation and reproducibility of a potential prognostic factor. Clin Pathol (2021) 14:2632010X21989686. doi:10.1177/2632010X21989686

PubMed Abstract | CrossRef Full Text | Google Scholar

65. Kristensen, MP, Korsgaard, U, Timm, S, Hansen, TF, Zlobec, I, Hager, H, et al. Prognostic value of tumor-stroma ratio in a screened stage II colon cancer population: intratumoral site-specific assessment and tumor budding synergy. Mod Pathol (2025) 38:100738. doi:10.1016/j.modpat.2025.100738

PubMed Abstract | CrossRef Full Text | Google Scholar

66. Smit, MA, van Pelt, GW, Terpstra, V, Putter, H, Tollenaar, RAEM, Mesker, WE, et al. Tumour-stroma ratio outperforms tumour budding as biomarker in colon cancer: a cohort study. Int J Colorectal Dis (2021) 36:2729–37. doi:10.1007/s00384-021-04023-4

PubMed Abstract | CrossRef Full Text | Google Scholar

67. Eriksen, AC, Andersen, JB, Lindebjerg, J, dePont Christensen, R, Hansen, TF, Kjær-Frifeldt, S, et al. Does heterogeneity matter in the estimation of tumour budding and tumour stroma ratio in colon cancer? Diagn Pathol (2018) 13:20. doi:10.1186/s13000-018-0697-9

PubMed Abstract | CrossRef Full Text | Google Scholar

68. Jones, HJS, Cunningham, C, Askautrud, HA, Danielsen, HE, Kerr, DJ, Domingo, E, et al. Stromal composition predicts recurrence of early rectal cancer after local excision. Histopathology (2021) 79:947–56. doi:10.1111/his.14438

PubMed Abstract | CrossRef Full Text | Google Scholar

69. Li, T, Yu, Z, Yang, Y, Fu, Z, Chen, Z, Li, Q, et al. Rapid multi-dynamic algorithm for gray image analysis of the stroma percentage on colorectal cancer. J Cancer (2021) 12:4561–73. doi:10.7150/jca.58887

PubMed Abstract | CrossRef Full Text | Google Scholar

70. Petäinen, L, Väyrynen, JP, Ruusuvuori, P, Pölönen, I, Äyrämö, S, and Kuopio, T. Domain-specific transfer learning in the automated scoring of tumor-stroma ratio from histopathological images of colorectal cancer. PLoS One (2023) 18:e0286270. doi:10.1371/journal.pone.0286270

PubMed Abstract | CrossRef Full Text | Google Scholar

71. Broad, A, Wright, AI, de Kamps, M, and Treanor, D. Attention-guided sampling for colorectal cancer analysis with digital pathology. J Pathol Inform (2022) 13:100110. doi:10.1016/j.jpi.2022.100110

PubMed Abstract | CrossRef Full Text | Google Scholar

72. Smit, MA, Ciompi, F, Bokhorst, JM, van Pelt, GW, Geessink, OGF, Putter, H, et al. Deep learning based tumor-stroma ratio scoring in colon cancer correlates with microscopic assessment. J Pathol Inform (2023) 14:100191. doi:10.1016/j.jpi.2023.100191

PubMed Abstract | CrossRef Full Text | Google Scholar

73. Strous, MTA, van der Linden, RLA, Gubbels, ALHM, Faes, TKE, Bosscha, K, Bronkhorst, CM, et al. Node-negative colon cancer: histological, molecular, and stromal features predicting disease recurrence. Mol Med (2023) 29:77. doi:10.1186/s10020-023-00677-8

PubMed Abstract | CrossRef Full Text | Google Scholar

74. Cai, C, Hu, T, Gong, J, Huang, D, Liu, F, Fu, C, et al. Multiparametric MRI-based radiomics signature for preoperative estimation of tumor-stroma ratio in rectal cancer. Eur Radiol (2021) 31:3326–35. doi:10.1007/s00330-020-07403-6

PubMed Abstract | CrossRef Full Text | Google Scholar

75. Ravensbergen, CJ, Kuruc, M, Polack, M, Crobach, S, Putter, H, Gelderblom, H, et al. The Stroma liquid biopsy panel contains a stromal-epithelial gene signature ratio that is associated with the histologic tumor-stroma ratio and predicts survival in Colon cancer. Cancers (2021) 14:163. doi:10.3390/cancers14010163

PubMed Abstract | CrossRef Full Text | Google Scholar

76. Komura, D, Ochi, M, and Ishikawa, S. Machine learning methods for histopathological image analysis: updates in 2024. Comput Struct Biotechnol J (2025) 27:383–400. doi:10.1016/j.csbj.2024.12.033

PubMed Abstract | CrossRef Full Text | Google Scholar

77. Cui, H, Wei, D, Ma, K, Gu, S, and Zheng, Y. A unified framework for generalized low-shot medical image segmentation with scarce data. IEEE Trans Med Imaging (2021) 40:2656–71. doi:10.1109/tmi.2020.3045775

PubMed Abstract | CrossRef Full Text | Google Scholar

78. Wang, Y, Guo, Y, Wang, Z, Yu, L, Yan, Y, and Gu, Z. Enhancing semantic segmentation in chest x-ray images through image preprocessing: Ps-kde for pixel-wise substitution by kernel density estimation. PLoS One (2024) 19:e0299623. doi:10.1371/journal.pone.0299623

PubMed Abstract | CrossRef Full Text | Google Scholar

79. Lovchinsky, I, Daks, A, Malkin, I, Samangouei, P, Saeedi, A, Liu, Y, et al. Discrepancy ratio: evaluating model performance when even experts disagree on the truth. In: International Conference on Learning Representations (ICLR); April 26-30, 2020; Addis Ababa, Ethiopia (2020).

Google Scholar

80. Magnusson, MI, Agnarsson, BA, Jonasson, JG, Tryggvason, T, Aeffner, F, le Roux, L, et al. Histopathology and levels of proteins in plasma associate with survival after colorectal cancer diagnosis. Br J Cancer (2023) 129:1142–51. doi:10.1038/s41416-023-02374-z

PubMed Abstract | CrossRef Full Text | Google Scholar

81. Wang, Q, Shen, X, An, R, Bai, J, Dong, J, Cai, H, et al. Peritumoral tertiary lymphoid structure and tumor stroma percentage predict the prognosis of patients with non-metastatic colorectal cancer. Front Immunol (2022) 13:962056. doi:10.3389/fimmu.2022.962056

PubMed Abstract | CrossRef Full Text | Google Scholar

82. Zhu, Y, Jin, Z, Qian, Y, Shen, Y, and Wang, Z. Prognostic value of tumor-stroma ratio in rectal cancer: a systematic review and meta-analysis. Front Oncol (2021) 11:685570. doi:10.3389/fonc.2021.685570

PubMed Abstract | CrossRef Full Text | Google Scholar

83. Zengin, M. Tumour budding and tumour stroma ratio are reliable predictors for death and recurrence in elderly stage I Colon cancer patients. Pathol Res Pract (2019) 215:152635. doi:10.1016/j.prp.2019.152635

PubMed Abstract | CrossRef Full Text | Google Scholar

84. Zunder, S, van der Wilk, P, Gelderblom, H, Dekker, T, Mancao, C, Kiialainen, A, et al. Stromal organization as predictive biomarker for the treatment of colon cancer with adjuvant bevacizumab; a post-hoc analysis of the AVANT trial. Cell Oncol (2019) 42:717–25. doi:10.1007/s13402-019-00449-9

PubMed Abstract | CrossRef Full Text | Google Scholar

85. Eriksen, AC, Sørensen, FB, Lindebjerg, J, Hager, H, dePont Christensen, R, Kjær-Frifeldt, S, et al. The prognostic value of tumour stroma ratio and tumour budding in stage II colon cancer. A nationwide population-based study. Int J Colorectal Dis (2018) 33:1115–24. doi:10.1007/s00384-018-3076-9

PubMed Abstract | CrossRef Full Text | Google Scholar

86. Pujani, M, Singh, K, Agarwal, C, Chauhan, V, Prasad, S, Singh, M, et al. Prognostic role of tumor-infiltrating lymphocytes, tumor budding, tumor border configuration, and tumor Stroma ratio in colorectal carcinoma. Indian J Surg Oncol (2024) 16:691–7. doi:10.1007/s13193-024-02127-1

PubMed Abstract | CrossRef Full Text | Google Scholar

87. Unal Kocabey, D, and Cakir, IE. The prognostic significance of growth pattern, tumor budding, poorly differentiated clusters, desmoplastic reaction pattern and tumor-stroma ratio in colorectal cancer and an evaluation of their relationship with KRAS, NRAS, BRAF mutations. Ann Diagn Pathol (2024) 73:152375. doi:10.1016/j.anndiagpath.2024.152375

PubMed Abstract | CrossRef Full Text | Google Scholar

88. Khan, AA, Malik, S, Jacob, S, Aden, D, Ahuja, S, Zaheer, S, et al. Prognostic evaluation of cancer associated fibrosis and tumor budding in colorectal cancer. Pathol Res Pract (2023) 248:154587. doi:10.1016/j.prp.2023.154587

PubMed Abstract | CrossRef Full Text | Google Scholar

89. Hu, S, Xing, X, Liu, J, Liu, X, Li, J, Jin, W, et al. Correlation between apparent diffusion coefficient and tumor-stroma ratio in hybrid 18F-FDG PET/MRI: preliminary results of a rectal cancer cohort study. Quantitative Imaging Med Surg (2022) 12:4213–25. doi:10.21037/qims-21-938

PubMed Abstract | CrossRef Full Text | Google Scholar

90. Kazemi, A, Gharib, M, Mohamadian Roshan, N, Taraz Jamshidi, S, Stögbauer, F, Eslami, S, et al. Assessment of the tumor-stroma ratio and tumor-infiltrating lymphocytes in colorectal cancer: inter-observer agreement evaluation. Diagnostics (Basel, Switzerland) (2023) 13:2339. doi:10.3390/diagnostics13142339

PubMed Abstract | CrossRef Full Text | Google Scholar

91. Zunder, SM, Perez-Lopez, R, de Kok, BM, Raciti, MV, van Pelt, GW, Dienstmann, R, et al. Correlation of the tumour-stroma ratio with diffusion weighted MRI in rectal cancer. Eur J Radiol (2020) 133:109345. doi:10.1016/j.ejrad.2020.109345

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: tumour-stroma ratio, colorectal carcinoma, scoping review, computational pathology, observer variability, prognostic value, protocol standardisation, artificial intelligence

Citation: Dikland FA, Fekih C, Wellenstein MRJ, Souza da Silva R, Machado-Neves R, Fraga J, Oliveira D, Montezuma D, Pinto IM and Woodburn J (2025) A scoping review of TSR analysis in colorectal cancer: implications for automated solutions. Oncol. Rev. 19:1605383. doi: 10.3389/or.2025.1605383

Received: 03 April 2025; Accepted: 08 October 2025;
Published: 28 October 2025.

Edited by:

Eva Budinská, Masaryk University, Czechia

Reviewed by:

Wilma Mesker, Leiden University Medical Center (LUMC), Netherlands
Naohiko Akimoto, Nippon Medical School, Japan

Copyright © 2025 Dikland, Fekih, Wellenstein, Souza da Silva, Machado-Neves, Fraga, Oliveira, Montezuma, Pinto and Woodburn. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Felix Anne Dikland, ZmVsaXguZGlrbGFuZEB3c2ttZWRpY2FsLm9yZw==

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.