Breaking the heterogeneity barrier: a robust prognostic signature for survival stratification and immune profiling in triple-negative breast cancer

Shen, Haixing; Zheng, Qing; Wang, Zhenyu; Zheng, Daitian; Gong, Zhenqi; Wang, Huaiming; Sun, Tianmiao; Pan, Jie; Jin, Yukai; Zheng, Xiaohong; Wang, Jingzhi; Zhang, Jiongjiong

doi:10.3389/fimmu.2025.1611917

ORIGINAL RESEARCH article

Front. Immunol., 30 September 2025

Sec. Cancer Immunity and Immunotherapy

Volume 16 - 2025 | https://doi.org/10.3389/fimmu.2025.1611917

Breaking the heterogeneity barrier: a robust prognostic signature for survival stratification and immune profiling in triple-negative breast cancer

Haixing Shen^1†

Qing Zheng^2†

Zhenyu Wang^2†

Daitian Zheng³

Zhenqi Gong²

Huaiming Wang²

Tianmiao Sun¹

Jie Pan⁴

Yukai Jin⁴

Xiaohong Zheng⁴

Jingzhi Wang^5*

Jiongjiong Zhang^1*

¹Affiliated Cixi Hospital, Wenzhou Medical University, Ningbo, Zhejiang, China
²Department of Gastrointestinal Surgery, The First Affiliated Hospital of Shantou University Medical College, Shantou, Guangdong, China
³Second Department of Thyroid and Breast Surgery, The Third Affiliated Hospital of Shenzhen University, Shenzhen, Guangdong, China
⁴Cixi Biomedical Research Institute, Wenzhou Medical University, Ningbo, Zhejiang, China
⁵Department of Radiotherapy Oncology, The Affiliated Yancheng First Hospital of Nanjing University Medical School; The First People’s Hospital of Yancheng, Yancheng, Jiangsu, China

Background: Triple-negative breast cancer (TNBC), a highly heterogeneous breast cancer subtype, poses significant challenges to human health. Intra-tumor heterogeneity (ITH) limits the reliability of conventional prognostic models.

Methods: Using multi-region RNA-seq, we quantified TNBC transcriptomic heterogeneity through an integrative heterogeneity score (IHS). After evaluating inter-patient heterogeneity (IPH) and ITH, prognostic and low-heterogeneity genes were identified and used to build a prognostic risk model with a random survival forest (RSF) algorithm. This model was combined with TNM staging into a nomogram for clinical applicability. We further revealed the distinct immune microenvironment features, somatic mutations, and chemotherapy responses between risk subgroups. Gene expression was validated via RT-qPCR.

Results: Spatial characterization uncovered substantial ITH, evidenced by sharp shifts in PAM50 subtypes and immune infiltration. Two low-heterogeneity biomarkers, CYP4B1 and GBP1, were identified to develop a robust prognostic signature with consistent predictive performance across 3- to 9-year survival endpoints (AUC > 0.6). The high-risk subgroup exhibited reduced immune infiltration, reduced immune checkpoint molecule expression, and poor immunotherapy response rates. Integration of the risk signature with TNM staging created a clinically practical nomogram with superior predictive accuracy (C-index >0.67). Therapeutic vulnerability profiling identified six targeted agents showing increased efficacy in high-risk patients. Dysregulation of signature genes was demonstrated in two TNBC cell lines.

Conclusions: This study established a transcriptomic heterogeneity-resilient prognostic model for TNBC, enabling precise survival stratification and immune microenvironment assessment. The integrative nomogram and risk-guided therapeutic predictions address clinical challenges in TNBC management, advancing personalized treatment strategies.

1 Introduction

Triple-negative breast cancer (TNBC), a highly heterogeneous breast cancer subtype, lacks estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2), making it unresponsive to hormonal therapies or targeted treatments. This results in limited clinical therapeutic options and poor prognosis (1). Although TNBC represents only 15%-20% of all breast cancer cases, it exhibits a highly aggressive nature, with patients showing significantly lower five-year survival rates compared to other subtypes, along with elevated risks of local recurrence and distant metastasis (2). Developing accurate prognostic prediction models for TNBC could enable clinicians to perform personalized assessments of disease progression and survival outcomes while optimizing treatment strategies. Currently, conventional histopathology-based prognostic systems still dominate clinical practice. However, histopathological evaluations rely mainly on microscopic morphological observations, which are inherently subjective and prone to variability based on pathologists’ experience. Furthermore, conventional systems typically rely on limited macroscopic indicators such as tumor size and lymph node metastasis, which are insufficient to elucidate molecular mechanisms driving tumor progression or quantify complex immune infiltration and stromal reactions within the tumor microenvironment (TME). Consequently, they fail to distinguish patient subgroups with morphologically similar features but divergent prognostic outcomes.

In recent years, the leapfrog development of high-throughput sequencing technologies has driven continuous innovation in TNBC molecular classification systems and genetic prognostic models. The previous Lehmann classification primarily aimed to achieve molecular-level categorization and identify biological mechanisms, rather than being specifically optimized for precise individual prognosis prediction (3). Although the tumor-infiltrating lymphocytes (TILs) can reflect prognosis, they represent only a portion of the immune microenvironment, failing to directly capture the intrinsic characteristics of tumor cells (4). Compared with traditional prognostic models, transcriptome-based models achieve higher-precision risk stratification and personalized treatment prediction through comprehensive analysis of tumor molecular characteristics and functional states, while revealing therapeutic targets undetectable by conventional morphological methods (5, 6). However, existing studies universally face a critical bottleneck: high intra-tumoral heterogeneity (ITH). This heterogeneity stems not only from immune-stromal interactions within the microenvironment, but also from dynamic evolution of driver mutations or epigenetic dysregulation. Its complexity and dynamic nature significantly undermine the stability and generalization capability of current prognostic models. Even variability in ribonucleic acid (RNA) degradation during sample processing introduces noise signals at the transcriptomic level, causing conventionally identified differentially expressed genes to carry substantial confounding factors rather than true biological signals. While single-cell RNA sequencing (scRNA-seq) and spatial transcriptomics offer new perspectives for resolving cellular subpopulation heterogeneity, their high costs and technical complexity limit applications in large clinical cohorts. Therefore, developing computational frameworks capable of dissociating heterogeneity interference from routine transcriptomic data has become the core challenge for constructing reliable prognostic models.

The multi-region bulk sequencing approach offers technical advantages for studying tumor heterogeneity by enabling the capture of spatial transcriptomic variation features through multi-region sampling and sequencing of tumors. Compared to single-site sampling methods, this technique effectively mitigates sampling bias and more reliably reveals comprehensive transcriptomic characteristics. We analyzed bulk transcriptomic datasets from TNBC patients with multi-region sampling to evaluate how spatial heterogeneity influences biological signal enrichment. Through the evaluation of transcriptomic heterogeneity, we identified spatially stable prognostic biomarkers and constructed a robust TNBC prognostic model resistant to transcriptomic heterogeneity. This model demonstrated consistent long-term prognostic predictive accuracy across The Cancer Genome Atlas (TCGA) and Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) datasets while exhibiting strong immunoinfiltration prediction capability (7, 8). To enhance clinical applicability, we developed a dynamic nomogram integrating transcriptomic profiles with Tumor-Node-Metastasis (TNM) staging, enabling generation of visual risk stratification reports. From a translational medicine perspective, the value of this study lies in establishing a TNBC-specific low-heterogeneity prognostic and immune response prediction system, which will advance innovative clinical practice.

2 Methods

2.1 Data collection and processing

Multi-region bulk RNA sequencing data of TNBC patients from a previous study was included, encompassing 32 tumor samples from 10 patients (9). This dataset underwent DESeq2 normalization followed by Variance Stabilizing Transformation (VST) to facilitate the identification of low-ITH genes with stable expression patterns (10). The METABRIC TNBC dataset (n = 320) was extracted as the training cohort for prognostic modeling (7). TNBC samples (n = 122) and 113 normal samples from the TCGA breast invasive carcinoma cohort were used for differential expression analysis and served as the prognostic validation cohort (8). The IMvigor210 immunotherapy cohort (n = 348) was utilized for predicting immunotherapy response (11). Somatic mutation data from TNBC cases were obtained from the TCGA database (8). Single-cell dataset GSE148673 (n = 5) (12) and spatial transcriptomic dataset GSE148673 (13) were included in the analysis.

2.2 Heterogeneity assessment

Mutant-Allele Tumor Heterogeneity (MATH) was employed to quantify genomic heterogeneity (14). single-region ITH was assessed using the DEPTH2 score (15). A dual-dimensional strategy combining variance analysis and clustering consistency was adopted to evaluate gene heterogeneity in TNBC (16–18). For variance analysis, gene expression data were decomposed using a linear mixed-effects model (via the nlme R package), partitioning variance into within-tumor variance (W) and between-tumor variance (B). The Intra-Tumoral Variability Score (ITVS) was calculated as: ITVS = W/(W + B). This metric (ranging from 0 to 1) reflects the dominance of intra-tumoral heterogeneity when approaching 1. Hierarchical clustering was iteratively performed with the cluster numbers increasing from 1 to the total sample size (N). The patient grouping odds ratio (PGOR) was computed at each clustering level as the proportion of correctly grouped patients. The area under the PGOR curve (AUPC) was quantified via numerical integration, and the clustering consistency score (CCS) was defined as: CCS = 1 − AUPC/(N − 1). Higher CCS (ranging from 0 to 1) indicates lower ITH. The integrated heterogeneity score (IHS) (ranging from 0 to 1) was determined as the geometric mean of ITVS and CCS. A lower IHS correlates with reduced gene-level ITH.

2.3 Immune microenvironment and functional analysis

The StromalScore and ImmuneScore were determined through the ESTIMATE (Estimation of STromal and Immune cells in MAlignant Tumors using Expression data) algorithm (19). Additionally, absolute proportions of 22 immune cell subtypes were quantified using CIBERSORT-abs (20). The IOBR package was employed to calculate 15 tumor-intrinsic and 20 TME signatures (21). Functional interpretation was performed through Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses to identify key biological processes (22).

2.4 Construction and validation of prognosis model

The limma package was used for the screening of differentially expressed genes (DEGs). Using a false discovery rate (FDR) < 0.05 and a lenient cutoff of |log₂FC| > 0.137 (equivalent to >10% expression change), this strategy ensures statistical rigor while preserving potential functionally relevant genes. Prognosis-associated DEGs were filtered based on univariate Cox regression and proportional hazards (PH) assumption to build a preliminary gene library. To address transcriptional heterogeneity, protein-coding genes were stratified into low-ITH subgroups using an IHS cut-off of 0.5. Model genes were identified by intersecting low-ITH genes with prognosis-associated DEGs. During model construction, the random survival forest (RSF) algorithm was applied using the METABRIC cohort as the training set. The RSF model was configured with 1000 decision trees and a node size of 36.

2.5 Nomogram development

Clinical parameters and molecular features significantly associated with overall survival were selected via univariate Cox regression. Independent prognostic factors were further identified by performing multivariate Cox proportional hazards analysis. For clinical utility, the R package “rms” was used to construct a nomogram prediction model based on independent prognostic factors.

2.6 Drug sensitivity prediction

The pRRophetic algorithm was employed to integrate tumor cell line drug screening data with gene expression profiles and to establish a gene expression-drug sensitivity predictive model (23).

2.7 Cell culture

The MCF-10A, MCF-10AT, MDA-MB-231, and MDA-MB-453 cell lines (all from ATCC) were used in experiments. Cells were cultured in McCoy’s 5A medium (Gibco) supplemented with 10% fetal bovine serum (Gibco) and 1% penicillin-streptomycin. All cultures were maintained at 37°C with 5% CO₂ in a standard incubator, with medium replacement every two days.

2.8 RT-qPCR

Following total RNA extraction using TRIzol reagent (Invitrogen), reverse transcription was performed with the PrimeScript RT Reagent Kit (TaKaRa). RT-qPCR analysis was conducted with SYBR Green methodology. The thermal cycling protocol comprised: initial denaturation at 95 °C for 30 seconds, followed by 40 amplification cycles (95 °C for 5 seconds and 60 °C for 30 seconds), with subsequent melt curve analysis. Relative gene expression was calculated using the 2^(-ΔΔCt) method and normalized to β-actin as the endogenous control. Three independent biological replicates were implemented. The PCR amplification primer sequences were as follows: CYP4B1, forward: 5’-TGTGCTGAAGCCCTATGTGG-3’, reverse: 5’- CCGGTGTCTCCTCTTCCAAA-3’; GBP1, forward: 5’-AGAGAGGACCCTCGCTCTTA-3’, reverse: 5’- ACATGCCTTTCGTCGTCTCA -3’; β-actin, forward: 5′-TCCATCATGAAGTGTGACGT-3′, reverse: 5-GAGCAATGATCTTGATCTTCAT-3′.

2.9 Statistical analysis

Statistical analyses were conducted in R (v4.3.3) and GraphPad Prism 8.0. Non-normally distributed continuous variables were analyzed using Mann-Whitney U or Kruskal-Wallis tests, while categorical variables were assessed via Fisher’s exact test. One-way ANOVA was used to compare means among three or more independent groups. Survival differences were visualized using Kaplan-Meier curves and compared via log-rank test. Cox proportional hazards regression was used to identify prognostic factors, with hazard ratios (HRs) and corresponding 95% confidence intervals (CIs) calculated. For prediction performance evaluation, receiver operating characteristic (ROC) curves were constructed, and the time-dependent discriminative ability of the model was quantified using the area under the curve (AUC). The model’s discrimination was further assessed by the concordance index (C-index). Bootstrap validation with 1000 resampling iterations was performed to evaluate the robustness of the results. Spearman’s rank correlation (nonparametric) and Pearson’s product-moment correlation (parametric) were used for correlation analysis between variables. A significance threshold of P < 0.05 was applied unless otherwise specified.

3 Results

3.1 Genetic and transcriptomic heterogeneity profiles in TNBC

First, we used MATH and DEPTH2 scores to quantify ITH at genomic and transcriptomic levels, respectively. Both metrics consistently revealed elevated heterogeneity in TNBC compared to non-TNBC (Figures 1A, B; MATH: P = 0.00018; DEPTH2: P < 2×10⁻¹⁶), aligning with TNBC’s aggressive biology. However, their prognostic utility differed: high MATH significantly predicted poorer overall survival (Figure 1C, log-rank P = 0.017), while DEPTH2 showed only borderline significance (Figure 1D, log-rank P = 0.055). Tumor-stage progression impacted both heterogeneity measures similarly. MATH increased significantly from Stage I to Stage II (Figure 1G, P = 0.047), plateauing in later stages, potentially due to early clonal expansion followed by stabilization of dominant subclones or cooperative populations in advanced TNBC. The DEPTH2 mirrored this trend, with differences between Stages I and II approaching significance (Figure 1H, P = 0.051), suggesting concurrent genomic and transcriptomic heterogeneity shifts during early progression. Notably, neither score varied significantly by age (Figures 1E, F), treatment status or type (Figures 1I–L), or race (Figures 1M, N), indicating that intrinsic tumor biology, rather than extrinsic factors, drives ITH in TNBC. The lack of treatment effect implies conventional therapies may inadequately remodel clonal architecture.

Figure 1

Graphs analyze overall survival, MATH score, and DEPTH2 score in cancer research. Panels A and B show survival curves based on MATH and DEPTH2 scores. Panels C and D compare TNBC and non-TNBC groups. Panels E-N present box plots and scatter plots comparing MATH and DEPTH2 scores across variables like age, cancer stage, treatment type, and race, with p-values indicating significance levels.

Figure 1. Association of MATH and DEPTH2 scores with clinical parameters. (A, B) Kaplan-Meier survival curves for MATH (A) and DEPTH2 score (B) subgroups. (C, D) MATH (C) and DEPTH2 scores (D) in TNBC versus non-TNBC groups. (E–N) MATH and DEPTH2 score distribution in TNBC stratified by clinical features: Age stratification (E, F), tumor stage (G, H), treatment status (I, J), treatment type (K, L), and race (M, N).

3.2 TNBC demonstrates profound IPH and ITH

Given that single-region ITH estimation is susceptible to sampling bias and may not accurately represent the actual ITH level (24), we further utilized multi-region samples for ITH quantification. At the individual patient level, unsupervised clustering revealed that multiple tumor regions from the same TNBC patient tended to cluster together, highlighting significant inter-patient heterogeneity (IPH) (Figure 2A). Principal component analysis (PCA) dimensionality reduction of whole transcriptomic profiles further confirmed that samples from distinct regions exhibit closer clustering associations with patient identity (Figure 2B). PAM50 subtyping analysis underscored strong ITH in TNBC (Figure 2C). In a representative case (P2), region P2A exhibited mixed features of Luminal B (17.3%), basal-like (49.9%), and HER2-enriched (32.8%) subtypes. In contrast, matched regions (P2C and P2D) displayed sharp increases in basal-like dominance (74.5% and 87.1%, respectively), with the complete loss of the Luminal B subtype.

Figure 2

Panel A shows a clustered heatmap with samples labeled P1A to P9B and a Z-score color gradient. Panel B is a PCA plot with samples color-coded by patient groups P1 to P10, showing clustering patterns in a PC1 versus PC2 plot. Panel C contains bar charts for each patient group (P1 to P10) indicating the percentage breakdown of subtypes: Basal, Her2, LumA, LumB, and Normal.

Figure 2. TNBC exhibits strong IPH and ITH. (A) The heatmap shows the unsupervised hierarchical clustering of TNBC samples (rows) in the multi-regional cohort according to the top 500 variable expression genes (columns). (B) PCA of transcriptome profiles across multi-regional TNBC samples. (C) Stacked bar plot showing PAM50 subtype proportions in 10 patients.

3.3 TME heterogeneity in TNBC

The CIBERSORT-abs algorithm was applied to quantify immune landscape features across 10 multi-region TNBC cases. Results revealed marked fluctuations in immune cell infiltration patterns between patients and across different tumor regions within the same individual (Figure 3A). Notably, effector immune cells were not universally detectable, demonstrating pronounced spatial distribution disparities. For instance, CD8+ T cells were highly enriched in region P7D but completely absent in region P7B from the same patient. Similar spatial imbalances were observed in activated NK cells and M1/M2 macrophage subpopulations (Figure 3B). Tumor heterogeneity heatmap analysis (Figure 3C) uncovered distinct cell death signatures: ferroptosis signaling was activated in region P8A but significantly suppressed in region P8B of the same tumor. This regional variation likely arises from the metabolic reprogramming differences between tumor subclones, where some regions upregulate ACSL4 to promote lipid peroxidation, while others activate GPX4 to sustain antioxidant defenses (25). Additionally, the TME heterogeneity mapping (Figure 3D) highlighted that spatially divergent immune pathway activity was downregulated in regions P1A and P1B but strongly activated in P1C and P1D, which reflects region-specific modulation of immune signaling networks.

Figure 3

Bar charts and heatmaps depict immune cell infiltration and tumor heterogeneity across patients. Chart A shows cell type infiltration using CIBERSORT-abs. Chart B displays infiltration of CD8 T cells, NK cells, and macrophages (M1 and M2) across patients P1-P9. Heatmap C illustrates tumor-intrinsic heterogeneity, and heatmap D shows TME-associated heterogeneity, both using PCA scores. Color coding represents various patients.

Figure 3. Heterogeneity of the TNBC tumor immune microenvironment. (A) Bar plot of immune cell infiltration composition calculated using the CIBERSORT-abs algorithm. (B) Regional distribution of key effector cell subsets: Infiltration levels of CD8+ T cells, activated NK cells, and M1/M2 macrophages. (C) Heatmap of tumor-intrinsic heterogeneity: Rows represent tumor-intrinsic feature scores and columns denote tumor regions. (D) Heatmap of microenvironmental features: Rows include immune signature scores and columns denote tumor regions.

3.4 Gene heterogeneity in multi-region TNBC RNA-seq

We then elucidated the heterogeneity of protein-coding genes in TNBC and its biological implications. Figure 4A illustrates the distribution patterns of three variability metrics: within-tumor variance, between-tumor variance, and ITVS. The skewed distribution of within-tumor variance (median 0.82) and ITVS (median 0.81) highlights significant intra-tumor gene heterogeneity in TNBC: 8.1% of genes exhibited extreme within-tumor variance (variance = 1), while 8.3% lacked inter-patient discriminative power (between-tumor variance = 0). Notably, the extreme ITVS = 1 values observed in 8% of genes statistically confirm the multi-clonal coexistence within individual TNBC tumors. The scatter plot in Figure 4B further reveals that genes with ITVS < 0.25 are exceedingly rare (only 2 genes), a phenomenon attributed to the combined effect of ITH and IPH in TNBC. For the clustering analysis, we used the CCS approach. The calculation of this score is based on the principle that genes with low ITH should consistently cluster samples from the same patient. An IHS, calculated as the geometric mean of ITVS and CCS (range 0-1), was established, with lower IHS values indicating reduced gene-level heterogeneity. Applying an IHS threshold of 0.5 to 14,071 protein-coding genes, only 2.98% (419 genes) fell into the low-ITH group (Figure 4C). A heatmap of the top 10 IHS-ranked genes from each group revealed distinct patterns: low-ITH genes exhibited cross-regional stability, while high-heterogeneity genes displayed pronounced spatial intra-tumor variability (Figure 4D). Functional enrichment analysis (Figures 4E, F) showed that high-ITH genes were strongly associated with immune-related pathways, including cytokine-receptor interactions and chemokine signaling, suggesting roles in TME reprogramming. Conversely, low-ITH genes were enriched in ribosomal biogenesis and energy metabolism, reflecting conserved roles in core cellular processes. This bifurcation in expression strategies highlights TNBC’s dual survival tactics: maintaining stable expression of metabolic genes for basic cellular functions while preserving heterogeneity in immune-related pathways to enhance adaptability.

Figure 4

Panel A shows bar graphs indicating variance distribution within tumors (median 0.82), between tumors (median 0.19), and ITVS (median 0.81). Panel B is a scatter plot of between tumor versus within tumor variance, color-coded by ITVS. Panel C presents a bar graph with counts of high and low IHS values. Panel D is a heatmap of gene expression with Z-scores, classified by high and low IHS groups. Panels E and F display KEGG pathway enrichment bar charts for various biological processes, with gene counts and significance levels (-log10(P)) indicated.

Figure 4. Quantification of gene-level ITH and pathway enrichment analysis. (A) Histograms of within-tumor, between-tumor variance and ITVS distribution. (B) Scatterplot of gene-level within-tumor/between-tumor variance and ITVS. (C) Gene distribution in high- and low-IHS groups. (D) Heatmap of expression for top 10 highest and lowest IHS genes. (E, F) KEGG pathway enrichment analysis in low- (E) and high-IHS (F) groups.

3.5 DGEs between TNBC and normal breast tissues

Tumor-associated DEGs hold particular biological significance compared to those in healthy tissue, as they may reveal molecular-level dysregulation mechanisms driving tumorigenesis, development, and malignant phenotypes (26). DEGs are more likely to affect patient survival rates. This study compared gene expression profiles between TNBC and normal breast tissues using transcriptomic data from the TCGA database (Figure 5A). We identified 18,252 significantly DEGs based on screening criteria (|log2FC| > 0.137, FDR < 0.05). Heatmap visualization of the top 50 most significant DEGs revealed distinct clustering patterns in the TNBC group (Figure 5B), including upregulated MMP1 and COL1A1. The elevated expression of these genes points to their potential contribution to tumor invasion and metastasis via extracellular matrix remodeling (27). GO enrichment analysis revealed associations with cell proliferation and cycle regulation, including terms such as DNA replication and chromosomal region (Figure 5C). KEGG pathway analysis identified enrichment of cell cycle-related genes and ECM-receptor interaction pathways (Figure 5D), which may mediate the invasive potential of tumors.

Figure 5

Panel A shows a volcano plot with colored dots representing gene expression significance and fold change. Panel B displays a heatmap of the top 50 differentially expressed genes between normal and TNBC groups. Panel C is a bar chart illustrating gene ontology terms across three categories: biological process (BP), cellular component (CC), and molecular function (MF). Panel D represents a bubble chart of the top 10 KEGG pathways, with bubble size indicating gene count and color representing p-adjusted values.

Figure 5. DEGs and functional enrichment in TCGA TNBC samples. (A) The volcano plot displays the differential analysis results of tumor versus normal tissues from TCGA TNBC RNA-seq data (FDR < 0.05). (B) Heatmap shows the expression patterns of top 50 DEGs. (C) Analysis of the biological roles of DEGs (|logFC| > 1, adjusted P < 0.05) using GO terms from biological processes (BP), cellular components (CC), and molecular functions (MF). (D) The KEGG bubble plot displays the top 10 enriched pathways of DEGs (|logFC| > 1, adjusted P < 0.05).

3.6 Construction and validation of a low-ITH prognostic model

By analyzing transcriptomic heterogeneity, this study established a prognostic stratification model for TNBC patients with low ITH. To address cohort heterogeneity, we selected genes with consistent HR directions in the TCGA and METABRIC datasets. Venn diagram analysis of 418 low-ITH genes (IHS: 0-0.5) and 24 prognosis-related differentially expressed genes (Figure 6A) identified two core regulators, CYP4B1 and GBP1 (Both genes met the Cox model’s PH assumption; Supplementary Figure S1). RSF algorithm assessed gene importance, assigning weight scores of 0.024 and 0.02 to CYP4B1 and GBP1, respectively (Figure 6B). A risk prediction model was constructed: Patient risk score = (0.024 × CYP4B1 expression) + (0.02 × GBP1 expression). In the METABRIC cohort, TNBC patients in the high-risk group exhibited significantly shorter overall survival compared to the low-risk group, a finding validated in the TCGA-TNBC dataset (Figures 6C, D). Time-dependent ROC curve analysis demonstrated robust predictive performance in METABRIC (3-year: 0.612, 5-year: 0.661, 7-year: 0.684, 9-year: 0.678) and TCGA validation (3-year: 0.605, 5-year: 0.647, 7-year: 0.695, 9-year: 0.663), demonstrating cross-platform applicability of the model (Figures 6E–H).

Figure 6

Panel A shows a Venn diagram with 24 prognostic DEGs, 418 low ITH genes, and 2 overlapping genes. Panel B displays a bar chart of gene importance for CYP4B1 and GBP1. Panels C and D are survival probability plots for high-risk and low-risk groups in METABRIC-TNBC and TCGA-TNBC datasets, respectively, with significant log-rank p-values. Panels E to H are ROC curves showing AUC values over 3, 5, 7, and 9 years, with METABRIC and TCGA datasets compared.

Figure 6. Construction and validation of the low-ITH prognostic model in TNBC. (A) Venn diagram indicating overlap between prognostic DEGs (n = 24) and low-ITH genes (n=418). (B) RSF importance scores for model genes CYP4B1 (0.024) and GBP1 (0.02). (C, D) Survival curves for risk groups in TNBC samples from METABRIC and TCGA datasets. (E–H) ROC curves showing AUC values for RSF model predicting 3-/5-/7-/9-year survival in TNBC samples from METABRIC and TCGA datasets.

3.7 Development of a nomogram integrating risk score and TNM stage

Analysis of TNBC patient data in the METABRIC cohort revealed that the risk score stratification effectively distinguished subgroups with significant prognostic differences: 48% of TNBC-related deaths occurred in the high-risk group (vs. 27% in the low-risk group), and recurrence rates were elevated in high-risk patients (Figure 7A). Notably, while no statistically significant differences were observed between the two groups in TNM staging, histological grade, primary tumor laterality, or treatment history (radiotherapy/chemotherapy), the proportion of postmenopausal patients was significantly higher in the high-risk group (70% vs. 58%, P = 0.035), suggesting that hormonal status may influence prognosis through non-canonical mechanisms (Figure 7A).

Figure 7

Grouped charts and graphs showing breast cancer risk factors and survival data. Panel A includes bar graphs for factors like tumor stage and chemotherapy across risk levels. Panels B and C feature hazard ratio line plots for risk scores and tumor stage. Panel D presents a nomogram predicting overall survival across time points. Panel E displays a line graph of the concordance index over time. Panel F shows scatter plots of nomogram-predicted versus actual survival, with colored lines for 3, 5, 7, and 9-year intervals.

Figure 7. Integrative prognostic nomogram combining risk score and TNM stage. (A) Distribution of survival status, recurrence, tumor laterality, stage, histologic grade, treatment, and menopausal status in risk groups. (B, C) Univariate forest plot (B) and multivariate forest plot (C) demonstrate the impact of risk score on overall survival. (D) Nomogram integrates risk score and TNM stage for predicting overall survival in TNBC patients. (E) Time-dependent C-index for nomogram at 3/5/7/9 years. (F) The calibration curve shows that the predicted probabilities from the nomogram have high consistency with the actual observed probabilities.

We then evaluated the prognostic predictive power of the risk score, TNM stage, and pathological grade. Univariate Cox regression analysis identified significant associations between overall survival and the risk score (HR = 1.041, 95% CI: 1.029-1.053) and clinical stage (HR = 1.708, 95% CI: 1.286-2.268), both with p<0.001 (Figure 7B). However, pathological grade was not statistically significant in univariate Cox analysis. Crucially, in the multivariate Cox model, the risk score and clinical stage retained independent prognostic value (Figure 7C), indicating their synergistic predictive utility. Based on these findings, a nomogram integrating the risk score and clinical stage was developed (Figure 7D). This visual tool quantifies the contribution weight of each variable, enabling clinicians to rapidly calculate 3-, 5-, 7-, and 9-year overall survival probabilities. Model validation demonstrated robust performance, with the C-index consistently exceeding 0.67 during the 3- to 9-year follow-up period (Figure 7E), confirming sustained predictive accuracy over time. Time-calibration curves (Figure 7F) further validated model precision, showing strong alignment between predicted and observed survival rates at 3, 5, 7, and 9 years.

3.8 Immune infiltration in risk subgroups

This study revealed that high-risk patients exhibit an immune-suppressive state compared to low-risk patients. Using the ESTIMATE algorithm, we found that the ESTIMATEScore and ImmuneScore were significantly lower in the high-risk group compared to the low-risk group, while the StromalScore showed no statistical difference (Figure 8A). This suggests that the reduced immune activity in high-risk patients primarily arises from altered immune cell composition rather than stromal components. Further immune infiltration quantification via the CIBERSORT algorithm (Figure 8B) showed reduced infiltration levels of anti-tumor immune subpopulations in the high-risk group. Specifically, M1 macrophages, dendritic cells, plasma cells, monocytes, and CD4+ memory T cells were markedly decreased (all P < 0.05). Notably, CD8+ T cells, natural killer cells, M2 macrophages, and regulatory T cells showed no statistically significant differences.

Figure 8

Grouped box and bar plots analyzing risk groups. Panel A shows ESTIMATE, Immune, and Stromal Scores for high and low-risk groups. Panel B illustrates immune infiltration in various cells by risk group. Panel C compares expression of immune checkpoints and receptors across risk groups. Panel D displays total mutations per megabase by risk group. Panel E presents a stacked bar chart of response types in high and low-risk groups. Panels F and G show risk scores by response categories. Statistical significance is indicated with asterisks.

Figure 8. Immune characteristics of risk groups. (A) Boxplots of ESTIMATE/immune/stromal scores in risk subgroups. (B) The boxplot illustrates the differences in immune cells between risk groups, as calculated by CIBERSORT. (C) Boxplots of immune checkpoint expression between risk groups. (D) Boxplots of tumor mutational burden (TMB) between risk groups. (E) Bar plot of clinical response rates between risk groups (CR, complete response; PR, partial response; SD, stable disease; PD, disease progression). (F, G) Boxplots of risk scores across treatment response subgroups. *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001, ns, not significant.

Analysis of immune checkpoint expression profiles (Figure 8C) revealed significant downregulation of key regulators (CD27, CTLA4, and PDCD1) in the high-risk group. However, the expression of TNFSF14 and TNFRSF25 were not statistically significant, highlighting heterogeneity in checkpoint regulation. Concurrently, the reduced tumor mutational burden (TMB) in high-risk patients may further impair antigen presentation efficiency (Figure 8D). In the IMvigor210 cohort, we observed increased resistance to immune checkpoint blockade therapy in the high-risk group. The combined proportion of progressive disease (PD) and stable disease (SD) cases reached 82% in the high-risk group, surpassing the 72% in the low-risk group (Figure 8E). Patients who achieved complete or partial responses (CR/PR) had significantly lower median risk scores than those with PD/SD (Figures 8F, G). These findings collectively suggest that the risk scoring system effectively identifies immunotherapy non-responders, offering critical insights for clinical decision-making.

3.9 Mutational landscape of risk subgroups

This study integrated genomic data from 105 TCGA TNBC tumor samples (63 high-risk, 42 low-risk) to characterize mutation patterns and risk interactions. Figure 9A shows that 56 out of 63 high-risk samples (88.89%) harbored at least one genetic alteration, with TP53 mutations being the most frequent (75%), followed by TTN (17%) and PIK3CA (14%), dominated by missense mutations. All low-risk samples displayed genetic alterations, with TP53 mutations occurred in 93% of cases, alongside high-frequency mutations in TTN (33%) and MUC16 (21%) (Figure 9B). Co-occurrence and exclusivity analysis (Figures 9C, D) revealed significant associations: USH2A and MUC16 exhibited strong co-occurrence (P < 0.05), suggesting synergistic oncogenic pathways, while TP53 and MUC16 displayed mutual exclusivity.

Figure 9

Graphs A and B show mutation frequency in samples, with a color key indicating mutation types such as missense mutations and splice sites. Subplots C and D display heatmaps of gene co-occurrence or mutual exclusivity. Asterisks indicate statistical significance at different p-values.

Figure 9. Genomic alteration profiles. (A, B) High-frequency mutated genes in 105 TNBC samples in high-risk group (A) and low-risk group (B). (C, D) Co-occurrence and mutual exclusivity heatmaps of top 20 genes in high-risk group (C) and low-risk group (D).

3.10 Drug sensitivity screening for high-risk patients

Given the poor prognosis and immunotherapy resistance in high-risk patients, the pRRophetic platform identified six anticancer agents with significant negative correlations to risk scores (R < −0.02) and lower IC50 values in the high-risk group: Bryostatin.1, AKT inhibitor VIII, Imatinib, Bexarotene, Lapatinib, and Bicalutamide (Figures 10A, B). These findings suggest that risk score-guided drug selection may optimize personalized therapy for high-risk patients.

Figure 10

Panel A shows box plots depicting sensitivity (IC50) for six drugs across low-risk and high-risk groups, with significant differences marked by asterisks. Panel B displays scatter plots with regression lines, illustrating negative correlations between risk scores and drug sensitivity for the same six drugs, along with statistical significance values (\( \rho \) and \( p \) values).

Figure 10. Drug sensitivity and risk score. (A) Boxplots of IC50 values for six drugs in low-risk and high-risk groups. (B) The scatter plot shows the correlation between IC50 and risk score. ***P < 0.001, ****P < 0.0001.

3.11 Expression patterns of CYP4B1 and GBP1 in TNBC

Single-cell and spatial transcriptomic analyses of TNBC tissues were conducted through web-based platforms (https://grswsci.top/). Single-cell sequencing resolved nine distinct functionally-defined cell clusters (Figure 11A), with the epithelial subpopulation demonstrating marked CYP4B1 enrichment (Figure 11B). Further gene expression profiling (Figure 11C) revealed that GBP1 exhibited significantly elevated expression levels in malignant cell populations compared to normal epithelial cells.

Figure 11

Panel A displays UMAP plots with different cell types labeled, showing expression density of CYPM1 and GBP1. Panels B and C are bar graphs illustrating CYPA3B and GBP1 mean expression across various cell groups, including B cells, CD4Tconv, CD8T, endothelial, epithelial, fibroblasts, malignant, Mono/Macro, and Tproliferative, with statistical significance p<0.001. Groups are color-coded.

Figure 11. Single-cell atlas and feature gene expression. (A) UMAP visualization of cell types and gene expression patterns in GSE148673 dataset. (B, C) Bar charts showing the expression of CYP4B1 (B) and GBP1 (C) in different cells.

Spatial transcriptomics was performed on two representative TNBC samples from the GSE210616 dataset. Spatial regionalization delineated tissue areas containing malignant cells (proportion > 0) as malignant regions, while regions exclusively composed of normal cells were designated as normal areas (Figures 12A, B). CYP4B1 exhibited significantly elevated expression in non-malignant regions, whereas the GBP1 displayed specific enrichment in tumor malignant zones (Figures 12C, D). Barplots validated this heterogeneous expression pattern for both genes across TNBC specimens (Figure 12E).

Figure 12

A collection of detailed scientific visualizations comparing the expression levels of CYP4B1 and GBP1 across different conditions. Panels A and B show spatial maps with group differentiation, using color gradients to indicate expression levels. Panels C and D present expression level gradients for CYP4B1 and GBP1. Panel E contains bar graphs showing gene expression data with statistical significance, indicated by p-values. Panel F features box plots comparing expression levels between normal and TNBC groups, highlighting significant differences. Panel G uses bar charts to illustrate relative expression in different cell lines, with statistical significance denoted by asterisks.

Figure 12. Spatial transcriptomics and RT-qPCR analyses reveal dysregulated expression of CYP4B1 and GBP1. (A, B) Spatial demarcation of malignant and non-malignant regions. (C, D) Spatial expression patterns of CYP4B1 and GBP1. (E) Bar graph quantifying CYP4B1 and GBP1 expression levels between malignant and non-malignant regions. (F) Dysregulated expression of CYP4B1 and GBP1 in TCGA TNBC cases. (G) RT-qPCR validation of CYP4B1 and GBP1 expression dysregulation in two TNBC cell lines. ***P < 0.001, ns, not significant.

Boxplot analysis revealed dysregulated expression of CYP4B1 and GBP1 in TCGA-derived TNBC samples (Figure 12F). To corroborate these findings, RT-qPCR validation of CYP4B1 and GBP1 expression was conducted (Figure 12G). Furthermore, proteomic data from TCGA breast cancer also validated the expression trends of CYP4B1 and GBP1 (Supplementary Figure S2). Compared with normal mammary MCF-10A cells, CYP4B1 showed marked downregulation in premalignant MCF-10AT cells and aggressive TNBC cell models (MDA-MB-231 and MDA-MB-453). Notably, GBP1 demonstrated pronounced upregulation in both malignant cell lines.

4 Discussion

This study revealed the complex characteristics of ITH in TNBC and its impact on clinical outcomes. Our findings not only deepen the understanding of TNBC’s biological behavior but also offer new perspectives for developing precision medicine strategies.

The stage-specific evolution of genetic heterogeneity in TNBC was confirmed in this study. MATH scores showed significant increasing trends in stages I-II, while no marked differences were observed in stages III-IV. This phenomenon could be attributed to limited sample size on one hand and aligns with the “bottleneck effect” model of tumor evolution proposed by Sottoriva et al. (28) on the other. Elevated genetic heterogeneity in early-stage tumors may reflect Darwinian selection processes under microenvironmental pressure, whereas ITH tends to stabilize when dominant clones take over. Notably, conventional therapeutic approaches failed to significantly alter MATH scores, suggesting that cytotoxic agents or radiotherapy primarily eliminates sensitive clones while inadequately suppressing adaptive evolution of subclones (29). The DEPTH2 scores at the transcriptome level demonstrated clinical patterns similar to those of MATH, yet showed slightly weaker prognostic discrimination. This phenomenon may stem from a decoupling mechanism between genomic mutations and gene expression regulation (30). Some driver mutations may be buffered at the RNA expression level, potentially explaining why genetic heterogeneity carries greater prognostic significance. However, the highly significant differences in DEPTH2 scores between TNBC and non-TNBC groups highlight its potential value as a supplementary marker for TNBC molecular subtyping, warranting further validation through expanded sample studies.

Using multi-regional TNBC samples, this study revealed the spatial hierarchy of tumor heterogeneity. The dramatic shifts in PAM50 subtypes across tumor regions may represent the dominance of evolutionarily selected “fittest phenotypes” within specific microenvironments. In immune landscape analysis, CD8+ T cells exhibited an unbalanced distribution pattern, which could be explained by aberrant tumor vascularization affecting immune cell chemotaxis and subclone-specific antigen variation driving localized immune editing (31, 32). Additionally, heterogeneous ferroptosis pathway activity suggests that hypoxia gradients might regulate ACSL4/GPX4 balance via HIF-1α, creating regional divergences in ferroptosis susceptibility (33).

Given the high ITH in TNBC and the vulnerability of conventional biomarkers to ITH-induced predictive instability, this study proposed a “low-ITH gene priority” modeling strategy. We successfully identified CYP4B1 and GBP1 as core biomarkers with IHS < 0.5 and significant prognostic relevance. Compared to high-ITH genes (IHS > 0.5), these genes demonstrated cross-regionally stable expression patterns, maintaining reliable detection across spatiotemporal sampling conditions. Mechanistically, CYP4B1 (a cytochrome P450 family member) may mediate metabolic processes in the TME (34), while GBP1 (an interferon-induced protein) participates in cancer cell growth and promotes metastasis in TNBC (35). Minimal overlap was observed between low-ITH and differentially expressed prognostic genes (only 2/419), underscoring the necessity of combining heterogeneity screening for precise biomarker identification. The prognostic risk model achieved robust predictive performance in METABRIC and TCGA cohorts (3-, 5-, 7-, and 9-year AUC > 0.6), validating the feasibility of low-ITH genes for outcome prediction. This model overcomes TNM staging limitations by stratifying patients with identical TNM classifications into prognostically distinct subgroups. Multivariate Cox regression confirmed the risk score’s independent prognostic value, while its combination with TNM staging in a nomogram (C-index > 0.67), providing a clinical decision-making framework.

The immune cell infiltration profile revealed the biological basis of risk stratification, with high-risk groups exhibited exhaustion of anti-tumor immune responses. Specifically, tumor-associated immune cells displayed marked phenotypic polarization imbalance, with significantly reduced proportions of pro-inflammatory M1 macrophages. Further analysis uncovered atypical checkpoint regulation patterns in immune evasion among high-risk patients. Unlike the classical “cold tumor” model characterized by PD-L1/CTLA4 overexpression (36), this subgroup showed concurrent downregulation of checkpoint molecules (CTLA4, PDCD1), suggesting aberrant activation of specific immune-exhausted subtypes. Data from the IMvigor210 cohort supported this observation, showing reduced immunotherapy response rates in high-risk patients, indicating that the risk score could serve as a novel predictive biomarker for immunotherapy resistance.

5 Conclusion

This study systematically evaluates the multidimensional features of TNBC heterogeneity. By prioritizing low-ITH genes, we developed a prognostic model with temporal stability, uncovering intrinsic links between risk scores and immune microenvironment remodeling. The low-ITH model and nomogram provide clinically applicable tools for TNBC outcome prediction and personalized treatment. These findings deepen the understanding of TNBC heterogeneity and establish novel frameworks for precision subtyping in TNBC.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics statement

Ethical approval was not required for the studies on humans in accordance with the local legislation and institutional requirements because only commercially available established cell lines were used.

Author contributions

HS: Writing – original draft. QZ: Visualization, Formal analysis, Software, Writing – original draft, Data curation, Investigation, Conceptualization, Methodology. ZW: Writing – original draft. DZ: Writing – original draft. ZG: Writing – original draft. HW: Writing – original draft. TS: Writing – original draft. JP: Writing – original draft. YJ: Writing – original draft. XZ: Writing – original draft. JW: Writing – review & editing. JZ: Writing – review & editing.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. This work was supported by the Cixi Public Welfare Science and Technology Program (Grant No. CN2021009).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2025.1611917/full#supplementary-material

Supplementary Figure 1 | (A, B) The Cox model’s PH assumption of CYP4B1 (A) and GBP1 (B).

Supplementary Figure 2 | (A, B) The density plots display the protein expression levels of CYP4B1 (A) and GBP1 (B) in the normal group and the breast cancer group from TCGA database.

References

1. Denkert C, Liedtke C, Tutt A, and von Minckwitz G. Molecular alterations in triple-negative breast cancer-the road to new treatment strategies. Lancet. (2017) 389:2430–42. doi: 10.1016/s0140-6736(16)32454-0

PubMed Abstract | Crossref Full Text | Google Scholar

2. Waks AG and Winer EP. Breast cancer treatment: A review. Jama. (2019) 321:288–300. doi: 10.1001/jama.2018.19323

PubMed Abstract | Crossref Full Text | Google Scholar

3. Lehmann BD, Bauer JA, Chen X, Sanders ME, Chakravarthy AB, Shyr Y, et al. Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies. J Clin Invest. (2011) 121:2750–67. doi: 10.1172/jci45014

PubMed Abstract | Crossref Full Text | Google Scholar

4. Wein L, Savas P, Luen SJ, Virassamy B, Salgado R, and Loi S. Clinical validity and utility of tumor-infiltrating lymphocytes in routine clinical practice for breast cancer patients: current and future directions. Front Oncol. (2017) 7:156. doi: 10.3389/fonc.2017.00156

PubMed Abstract | Crossref Full Text | Google Scholar

5. Zheng Q, Gong Z, Lin S, Ou D, Lin W, and Shen P. Integrated analysis of a competing endogenous RNA network reveals a ferroptosis-related 6-lncRNA prognostic signature in clear cell renal cell carcinoma. Adv Clin Exp Med. (2024) 33:1391–407. doi: 10.17219/acem/176050

PubMed Abstract | Crossref Full Text | Google Scholar

6. Zheng Q, Gong Z, Li B, Wang H, and Lin S. Development and validation of the antibody-dependent cellular phagocytosis-based signature: A prognostic risk model of gastric cancer. Adv Clin Exp Med. (2025) 34:433–46. doi: 10.17219/acem/189914

PubMed Abstract | Crossref Full Text | Google Scholar

7. Curtis C, Shah SP, Chin SF, Turashvili G, Rueda OM, Dunning MJ, et al. Yuan Y et al: The genomic and transcriptomic architecture of 2,000 breast tumors reveals novel subgroups. Nature. (2012) 486:346–52. doi: 10.1038/nature10983

PubMed Abstract | Crossref Full Text | Google Scholar

8. Cancer Genome Atlas Network. Comprehensive molecular portraits of human breast tumors. Nature. (2012) 490:61–70. doi: 10.1038/nature11412

PubMed Abstract | Crossref Full Text | Google Scholar

9. Xu Q, Kaur J, Wylie D, Mittal K, Li H, KolaChina R, et al. Yang J et al: A Case Series Exploration of Multi-Regional Expression Heterogeneity in Triple-Negative Breast Cancer Patients. Int J Mol Sci. (2022) 23(21):13322. doi: 10.3390/ijms232113322

PubMed Abstract | Crossref Full Text | Google Scholar

10. Love MI, Huber W, and Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. (2014) 15:550. doi: 10.1186/s13059-014-0550-8

PubMed Abstract | Crossref Full Text | Google Scholar

11. Mariathasan S, Turley SJ, Nickles D, Castiglioni A, Yuen K, Wang Y, et al. Cubas R et al: TGFβ attenuates tumor response to PD-L1 blockade by contributing to exclusion of T cells. Nature. (2018) 554:544–8. doi: 10.1038/nature25501

PubMed Abstract | Crossref Full Text | Google Scholar

12. Gao R, Bai S, Henderson YC, Lin Y, Schalck A, Yan Y, et al. Davis A et al: Delineating copy number and clonal substructure in human tumors from single-cell transcriptomes. Nat Biotechnol. (2021) 39:599–608. doi: 10.1038/s41587-020-00795-2

PubMed Abstract | Crossref Full Text | Google Scholar

13. Bassiouni R, Idowu MO, Gibbs LD, Robila V, Grizzard PJ, Webb MG, et al. Spatial transcriptomic analysis of a diverse patient cohort reveals a conserved architecture in triple-negative breast cancer. Cancer Res. (2023) 83:34–48. doi: 10.1158/0008-5472.Can-22-2682

PubMed Abstract | Crossref Full Text | Google Scholar

14. Mroz EA and Rocco JW. MATH, a novel measure of intratumor genetic heterogeneity, is high in poor-outcome classes of head and neck squamous cell carcinoma. Oral Oncol. (2013) 49:211–5. doi: 10.1016/j.oraloncology.2012.09.007

PubMed Abstract | Crossref Full Text | Google Scholar

15. Song D and Wang X. DEPTH2: an mRNA-based algorithm to evaluate intratumor heterogeneity without reference to normal controls. J Transl Med. (2022) 20:150. doi: 10.1186/s12967-022-03355-1

PubMed Abstract | Crossref Full Text | Google Scholar

16. Biswas D, Birkbak NJ, Rosenthal R, Hiley CT, Lim EL, Papp K, et al. La Fleur L et al: A clonal expression biomarker associates with lung cancer mortality. Nat Med. (2019) 25:1540–8. doi: 10.1038/s41591-019-0595-z

PubMed Abstract | Crossref Full Text | Google Scholar

17. Yang C, Zhang S, Cheng Z, Liu Z, Zhang L, Jiang K, et al. Huang X et al: Multi-region sequencing with spatial information enables accurate heterogeneity estimation and risk stratification in liver cancer. Genome Med. (2022) 14:142. doi: 10.1186/s13073-022-01143-6

PubMed Abstract | Crossref Full Text | Google Scholar

18. Yan W, Shih J, Rodriguez-Canales J, Tangrea MA, Player A, Diao L, et al. Taylor PR et al: Three-dimensional mRNA measurements reveal minimal regional heterogeneity in esophageal squamous cell carcinoma. Am J Pathol. (2013) 182:529–39. doi: 10.1016/j.ajpath.2012.10.028

PubMed Abstract | Crossref Full Text | Google Scholar

19. Yoshihara K, Shahmoradgoli M, Martínez E, Vegesna R, Kim H, Torres-Garcia W, et al. Levine DA et al: Inferring tumor purity and stromal and immune cell admixture from expression data. Nat Commun. (2013) 4:2612. doi: 10.1038/ncomms3612

PubMed Abstract | Crossref Full Text | Google Scholar

20. Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods. (2015) 12:453–7. doi: 10.1038/nmeth.3337

PubMed Abstract | Crossref Full Text | Google Scholar

21. Zeng D, Ye Z, Shen R, Yu G, Wu J, Xiong Y, et al. Sun L et al: IOBR: Multi-Omics Immuno-Oncology Biological Research to Decode Tumor Microenvironment and Signatures. Front Immunol. (2021) 12:687975. doi: 10.3389/fimmu.2021.687975

PubMed Abstract | Crossref Full Text | Google Scholar

22. Yu G, Wang LG, Han Y, and He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. Omics. (2012) 16:284–7. doi: 10.1089/omi.2011.0118

PubMed Abstract | Crossref Full Text | Google Scholar

23. Geeleher P, Cox N, and Huang RS. pRRophetic: an R package for prediction of clinical chemotherapeutic response from tumor gene expression levels. PLoS One. (2014) 9:e107468. doi: 10.1371/journal.pone.0107468

PubMed Abstract | Crossref Full Text | Google Scholar

24. Nguyen PHD, Ma S, Phua CZJ, Kaya NA, Lai HLH, Lim CJ, et al. Tam WL et al: Intratumoral immune heterogeneity as a hallmark of tumor evolution and progression in hepatocellular carcinoma. Nat Commun. (2021) 12:227. doi: 10.1038/s41467-020-20171-7

PubMed Abstract | Crossref Full Text | Google Scholar

25. Jiang X, Stockwell BR, and Conrad M. Ferroptosis: mechanisms, biology and role in disease. Nat Rev Mol Cell Biol. (2021) 22:266–82. doi: 10.1038/s41580-020-00324-8

PubMed Abstract | Crossref Full Text | Google Scholar

26. Bareche Y, Venet D, Ignatiadis M, Aftimos P, Piccart M, Rothe F, et al. Unravelling triple-negative breast cancer molecular heterogeneity using an integrative multiomic analysis. Ann Oncol. (2018) 29:895–902. doi: 10.1093/annonc/mdy024

PubMed Abstract | Crossref Full Text | Google Scholar

27. Sala M, Ros M, and Saltel F. A complex and evolutive character: two face aspects of ECM in tumor progression. Front Oncol. (2020) 10:1620. doi: 10.3389/fonc.2020.01620

PubMed Abstract | Crossref Full Text | Google Scholar

28. Williams MJ, Werner B, Barnes CP, Graham TA, and Sottoriva A. Identification of neutral tumor evolution across cancer types. Nat Genet. (2016) 48:238–44. doi: 10.1038/ng.3489

PubMed Abstract | Crossref Full Text | Google Scholar

29. Inde Z and Dixon SJ. The impact of non-genetic heterogeneity on cancer cell death. Crit Rev Biochem Mol Biol. (2018) 53:99–114. doi: 10.1080/10409238.2017.1412395

PubMed Abstract | Crossref Full Text | Google Scholar

30. Vaishnav ED, de Boer CG, Molinet J, Yassour M, Fan L, Adiconis X, et al. The evolution, evolvability and engineering of gene regulatory DNA. Nature. (2022) 603:455–63. doi: 10.1038/s41586-022-04506-6

PubMed Abstract | Crossref Full Text | Google Scholar

31. Zhang R, Yao Y, Gao H, and Hu X. Mechanisms of angiogenesis in tumor. Front Oncol. (2024) 14:1359069. doi: 10.3389/fonc.2024.1359069

PubMed Abstract | Crossref Full Text | Google Scholar

32. Luo C, Wang P, He S, Zhu J, Shi Y, and Wang J. Progress and prospect of immunotherapy for triple-negative breast cancer. Front Oncol. (2022) 12:919072. doi: 10.3389/fonc.2022.919072

PubMed Abstract | Crossref Full Text | Google Scholar

33. Zhao Y, Xing C, Deng Y, Ye C, and Peng H. HIF-1α signaling: Essential roles in tumorigenesis and implications in targeted therapies. Genes Dis. (2024) 11:234–51. doi: 10.1016/j.gendis.2023.02.039

PubMed Abstract | Crossref Full Text | Google Scholar

34. Havrylyuk D, Hachey AC, Fenton A, Heidary DK, and Glazer EC. Ru(II) photocages enable precise control over enzyme activity with red light. Nat Commun. (2022) 13:3636. doi: 10.1038/s41467-022-31269-5

PubMed Abstract | Crossref Full Text | Google Scholar

35. Lim S, Alshagga M, Ong CE, Chieng JY, and Pan Y. Cytochrome P450 4B1 (CYP4B1) as a target in cancer treatment. Hum Exp Toxicol. (2020) 39:785–96. doi: 10.1177/0960327120905959

PubMed Abstract | Crossref Full Text | Google Scholar

36. Bonaventura P, Shekarian T, Alcazer V, Valladeau-Guilemond J, Valsesia-Wittmann S, Amigorena S, et al. Cold tumors: A therapeutic challenge for immunotherapy. Front Immunol. (2019) 10:168. doi: 10.3389/fimmu.2019.00168

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: immunotherapy, immune infiltration, intra-tumor heterogeneity, prognosis, triple-negative breast cancer, tumor microenvironment

Citation: Shen H, Zheng Q, Wang Z, Zheng D, Gong Z, Wang H, Sun T, Pan J, Jin Y, Zheng X, Wang J and Zhang J (2025) Breaking the heterogeneity barrier: a robust prognostic signature for survival stratification and immune profiling in triple-negative breast cancer. Front. Immunol. 16:1611917. doi: 10.3389/fimmu.2025.1611917

Received: 15 April 2025; Accepted: 11 September 2025;
Published: 30 September 2025.

Edited by:

Jinghua Pan, Jinan University, China

Reviewed by:

Hao Chi, Southwest Medical University, China
Long Zhang, The First Affiliated Hospital of University of Science and Technology of China Anhui Provincial Hospital, China

Copyright © 2025 Shen, Zheng, Wang, Zheng, Gong, Wang, Sun, Pan, Jin, Zheng, Wang and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jingzhi Wang, ODQxMTc5NzcwQHFxLmNvbQ==; Jiongjiong Zhang, am9qb0B6anUuZWR1LmNu

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.