Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Mol. Biosci., 09 February 2026

Sec. Molecular Diagnostics and Therapeutics

Volume 13 - 2026 | https://doi.org/10.3389/fmolb.2026.1748821

This article is part of the Research TopicCancer Biomarkers: Molecular Insights into Diagnosis, Prognosis, and Risk Prediction: Volume IIView all 15 articles

Integrated network analysis and experimental validation identify CCNA2, CD44, and STAT1 as clinically relevant hub genes in oral squamous cell carcinoma

Mingyang BuMingyang Bu1Yue GuYue Gu2Yuhan MengYuhan Meng1Yunyi CuiYunyi Cui3Qianli WuQianli Wu1Yali KongYali Kong1Haoyang LiuHaoyang Liu1Yi ChenYi Chen1Jiajun ZhangJiajun Zhang1Lizhen GuoLizhen Guo1Yakun Yang
Yakun Yang4*Dongyang Huang
Dongyang Huang4*Zhe Ma
Zhe Ma1*
  • 1Department of Preventive Dentistry, Hebei Key Laboratory of Stomatology, Hebei Clinical Research Center for Oral Diseases, School and Hospital of Stomatology, Hebei Medical University, Shijiazhuang, China
  • 2Department of Pediatric Dentistry, Hebei Key Laboratory of Stomatology, Hebei Clinical Research Center for Oral Diseases, School and Hospital of Stomatology, Hebei Medical University, Shijiazhuang, China
  • 3Department of Dentistry and Oral Surgery, Shinshu University School of Medicine, Matsumoto, Japan
  • 4Department of Pharmacology, The Key Laboratory of Neural and Vascular Biology, Ministry of Education, The Key Laboratory of New Drug Pharmacology and Toxicology, Collaborative Innovation Center of Hebei Province for Mechanism, Diagnosis and Treatment of Neuropsychiatric Diseases, Hebei Medical University, Shijiazhuang, Hebei, China

Oral squamous cell carcinoma remains a major clinical challenge, with only marginal gains in long-term survival due to frequent recurrence, lymphatic dissemination and therapeutic resistance. Robust and biologically grounded molecular determinants are needed to improve tumor characterization and facilitate translational advances. Transcriptomic profiles from GSE30784, GSE9844 and TCGA-HNSC were integrated to identify reproducible dysregulated genes, followed by construction of a high-confidence PPI network for hub gene prioritization. Prognostic and diagnostic relevance was assessed using Kaplan–Meier, Cox regression and ROC analyses. Experimental validation was performed by immunohistochemistry in human OSCC tissues and by qPCR in OSCC cell lines. Cross-cohort integration yielded a reproducible set of DEGs, from which CCNA2, CD44 and STAT1 emerged as network-defined hub genes. All three genes were consistently upregulated across independent cohorts and showed significant prognostic and diagnostic associations. Functional enrichment indicated that these genes are embedded in proliferation-, adhesion- and stress-response–related signaling programs. Experimental assays further confirmed their elevated expression at both transcript and protein levels in OSCC tissues and cell models. CCNA2, CD44 and STAT1 constitute reproducible and functionally anchored hub genes in OSCC, carrying both diagnostic and prognostic implications. These findings substantially refine the molecular understanding of OSCC and provide rational entry points for subsequent mechanistic and translational investigations.

1 Introduction

Oral squamous cell carcinoma (OSCC) is the predominant subtype of head and neck malignancies and remains a leading cause of cancer-related mortality worldwide (Johnson et al., 2020). Despite advances in surgical intervention, radiotherapy and systemic therapies, long-term survival has improved only marginally over recent decades, owing mainly to frequent local recurrence, cervical lymph node metastasis and the emergence of therapeutic resistance (Dong et al., 2024; Kim and Ahn, 2024; Struckmeier et al., 2024). These limitations highlight the need to delineate molecular determinants that can refine disease stratification and provide biologically meaningful entry points for therapeutic development.

High-throughput transcriptomic studies have documented extensive gene dysregulation in OSCC (Guo et al., 2020; Li et al., 2023; Lohavanichbutr et al., 2012). However, many reported biomarkers lack reproducibility across independent cohorts, are not embedded within coherent regulatory networks, or remain unverified at the experimental level (de Morais et al., 2025; Hsu et al., 2020; Liang et al., 2004). Moreover, only a minority have been rigorously evaluated for prognostic relevance or mechanistic interpretability, leaving unresolved which alterations represent robust and clinically meaningful drivers rather than cohort-specific correlates.

To address these gaps, we employed an integrative framework combining multi-cohort differential expression analysis, network-based hub gene prioritization and cross-platform validation, followed by survival interrogation and experimental verification in human tissues and OSCC cell models. This strategy enables the identification of reproducible and network-supported candidate genes with diagnostic and prognostic utility, thereby refining the molecular framework of OSCC and establishing rational targets for downstream mechanistic and translational investigations.

2 Methods and materials

2.1 Public datasets and preprocessing

Transcriptomic datasets of OSCC were retrieved from the Gene Expression Omnibus (GEO) database (accessions GSE30784 and GSE9844), and RNA-seq profiles of OSCC samples were obtained from the Cancer Genome Atlas–Head and Neck Squamous Cell Carcinoma (TCGA-HNSC) cohort. GEO microarray data were normalized using the RMA algorithm, whereas TCGA raw counts were converted to TPM values (Bian et al., 2022). Probe identifiers were mapped to gene symbols according to platform annotations, and duplicate probes corresponding to the same gene were averaged. Only protein-coding genes detected in at least 80% of samples were retained for downstream analysis.

2.2 Differential expression analysis

Differentially expressed genes (DEGs) between tumor and normal tissues were identified using “limma” for microarray data and “DESeq2” for RNA-seq data. When applicable, batch effects were mitigated using “ComBat”. Genes with |log2FC| ≥ 1 and FDR < 0.05 were considered statistically significant. Volcano plots were generated to visualize the DEG distribution (Ritchie et al., 2015).

2.3 Protein–protein interaction network construction and hub gene identification

Overlapping DEGs from the two GEO datasets were submitted to STRING (confidence threshold = 0.9) to construct a protein–protein interaction (PPI) network (Zhang et al., 2022). The resulting network was imported into Cytoscape for topological assessment. Candidate hub genes were ranked using betweenness and degree centrality, and the intersection of the top 10 genes from both metrics was defined as the final hub gene set (Li et al., 2019). Sensitivity analysis was performed to assess the robustness of hub gene identification. Degree and betweenness centrality rankings were recalculated using different cutoff thresholds (top 5, top 10 and top 15 nodes). The overlap of candidate hub genes across thresholds and metrics was examined to evaluate the stability of network-based prioritization.

2.4 Gene set enrichment analysis

Gene set enrichment analysis (GSEA) was performed to explore pathways associated with hub gene expression using Hallmark, KEGG, and GO gene sets from MSigDB. All genes were ranked according to differential expression statistics, and the enrichment score was computed using a weighted Kolmogorov–Smirnov test. Pathways with FDR < 0.25 were considered significantly enriched (Subramanian et al., 2005).

2.5 TCGA validation and survival modeling

The expression patterns of hub genes were validated in the TCGA-HNSC OSCC subset. Prognostic relevance was assessed using Kaplan–Meier survival curves and Cox proportional hazards modeling (Rades et al., 2013). A nomogram incorporating CCNA2, CD44, and STAT1 was constructed based on the Cox model. Diagnostic performance of individual genes and the combined panel was evaluated using ROC analysis in GEO and TCGA cohorts (Long et al., 2022).

2.6 Human tissue specimens and immunohistochemistry

Thirty paired samples of primary OSCC tissues and matched non-tumorous oral mucosa (>2 cm from the tumor margin) were collected at the Hospital of Stomatology, Hebei Medical University. The study followed the Declaration of Helsinki and received approval from the institutional ethics committee (Approval No. 2025039); written informed consent was obtained from all participants.

FFPE sections were processed by standard deparaffinization, rehydration, antigen retrieval, and peroxidase/nonspecific blocking. Primary antibodies against Cyclin A2 (CCNA2) (Abcam, ab181973), Cluster of Differentiation 44 (CD44) (Cell Signaling Technology, #3570), and Signal Transducer and Activator of Transcription 1 (STAT1) (Proteintech, 10144-2-AP) were applied, followed by HRP-conjugated secondary antibodies and DAB detection with hematoxylin counterstaining. Two pathologists independently scored staining intensity under blinded conditions.

2.7 Cell culture

The human OSCC cell line CAL27 was obtained from the Cell Bank of the Chinese Academy of Sciences (CAS, Shanghai, China). Normal human oral keratinocytes (HOK) were purchased from ScienCell Research Laboratories (Carlsbad, CA, USA). All cell lines were authenticated by the suppliers and routinely tested to confirm the absence of mycoplasma contamination. Cells were cultured in DMEM supplemented with 10% fetal bovine serum and 1% penicillin–streptomycin at 37 °C in a humidified incubator with 5% CO2.

2.8 Quantitative PCR analysis

Total RNA was extracted using TRIzol reagent, and RNA quality was verified spectrophotometrically (A260/A280 = 1.8–2.0). One microgram of RNA was reverse-transcribed using a Takara PrimeScript™ RT reagent kit according to the manufacturer’s instructions. The resulting cDNA was diluted 1:5 with nuclease-free water prior to quantitative PCR amplification.

qPCR was performed using SYBR Green Master Mix on an ABI StepOnePlus system with a 20 µL reaction volume under the following cycling conditions: 95 °C for 30 s, followed by 40 cycles of 95 °C for 5 s and 60 °C for 30 s. Amplification specificity was confirmed by melting-curve analysis. GAPDH served as the internal reference, and relative expression was calculated using the 2ΔΔCt method. All qPCR experiments were performed in triplicate. Data analysis and figure generation were conducted using GraphPad Prism.

The primer sequences used in this study were as follows:

CCNA2-F: 5′-AGCAGCCAGACATCACTCTC-3′

CCNA2-R: 5′-TGGTGCTGAAGGTAGGTGTC-3′

CD44-F: 5′-CTGCCGCTTTGCAGGTGTA-3′

CD44-R: 5′-GGTGCTATTGAAAGCCTTGC-3′

STAT1-F: 5′-GCTGGAAGATGGTGAAATTGAG-3′

STAT1-R: 5′-CATGATGGTTGTGCTGATGCT-3′

GAPDH-F: 5′-AGAAGGCTGGGGCTCATTTG-3′

GAPDH-R: 5′-AGGGGCCATCCACAGTCTTC-3′

CRNN-F: 5′-TGGCAGAGTGCAGAGGACTA-3′

CRNN-R: 5′-AGGTCAGGGAGGATGGAAGA-3′

2.9 Western blot analysis

Total protein was extracted from human oral tissue specimens and cultured cells using ice-cold RIPA lysis buffer (Beyotime, China) supplemented with a protease inhibitor cocktail (Beyotime, China), according to the manufacturer’s instructions. Lysates were incubated on ice for 30 min with intermittent vortexing and subsequently centrifuged at 12,000 × g for 15 min at 4 °C to remove insoluble debris.

Protein concentration was determined using a bicinchoninic acid (BCA) protein assay kit (Beyotime, China). Equal amounts of total protein (30 µg per lane) were separated by 10% SDS–PAGE and transferred onto polyvinylidene fluoride (PVDF) membranes (Millipore, USA). Membranes were blocked with 5% non-fat milk in Tris-buffered saline containing 0.1% Tween-20 (TBST) for 1 h at room temperature and then incubated overnight at 4 °C with primary antibodies against STAT1 (Proteintech, 10144-2-AP, 1:1,000 dilution) and β-actin (Proteintech, 60008-1-Ig, 1:5,000 dilution).

After washing with TBST, membranes were incubated with horseradish peroxidase (HRP)-conjugated secondary antibodies (Proteintech, 1:5,000 dilution) for 1 h at room temperature. Protein bands were visualized using enhanced chemiluminescence (ECL) reagents (Millipore, USA) and detected with a ChemiDoc imaging system (Bio-Rad, USA). Densitometric analysis was performed using ImageJ software (NIH, USA), and STAT1 protein expression levels were normalized to β-actin.

2.10 Statistical analysis

All statistical analyses were performed in R (version 4.2.0). Normality was assessed using the Shapiro–Wilk test. Comparisons between two groups were performed using Student’s t-test or the Mann–Whitney U test as appropriate. Correlations were assessed using Spearman’s rank test. Survival differences were evaluated by log-rank testing, and hazard ratios were estimated using Cox regression. ROC curves were generated using the “pROC” package. A two-sided P < 0.05 was considered statistically significant.

3 Results

3.1 Identification of robust DEGs in OSCC

To identify transcriptional alterations robustly associated with OSCC, we analyzed two independent GEO cohorts (GSE30784 and GSE9844). UMAP embedding revealed a clear separation between tumor and normal tissues in GSE30784, and a still discernible separation trend in GSE9844 despite partial overlap, indicating a disease-related divergence in global expression patterns (Figures 1A,B). Differential expression analysis further demonstrated widespread dysregulation in GSE30784, whereas GSE9844 showed a smaller but directionally consistent DEG spectrum. Volcano plots illustrated the distribution and significance of DEGs in both datasets (Figures 1C,D). To obtain high-confidence OSCC-related genes, differentially expressed genes shared between the two datasets were identified, as illustrated by the overlap analysis (Figure 1E). Functional enrichment analyses further revealed that these shared DEGs were predominantly involved in extracellular matrix organization, cell adhesion, and tumor-associated signaling pathways, including PI3K–Akt signaling and ECM–receptor interaction (Figures 1FG).

Figure 1
Seven-panel image displaying gene expression and pathway analysis:A and B: UMAP scatter plots for datasets GSE30784 and GSE9844, showing OSCC and Control groups.C and D: Volcano plots for GSE30784 and GSE9844, indicating significance of gene expression changes.E: Venn diagram depicting overlap of genes between datasets.F: Bar plot showing enriched biological processes, cellular components, and molecular functions, with ontology legend.G: Bar plot for KEGG pathway enrichment, including cancer and signaling pathways.

Figure 1. Transcriptomic profiling identifies reproducible DEGs in OSCC. (A) UMAP clustering of GSE30784 samples. (B) UMAP clustering of GSE9844 samples. (C) Volcano plot of DEGs in GSE30784. (D) Volcano plot of DEGs in GSE9844. (E) Overlap of DEGs between the two datasets. (F) GO enrichment of shared DEGs. (G) KEGG enrichment of shared DEGs.

3.2 Network-based prioritization and validation of hub genes in OSCC

Building on the shared DEGs identified above, we next sought to extract hub genes occupying central positions within OSCC-associated regulatory networks. A high-confidence PPI network was constructed in STRING (confidence score = 0.9), comprising 1,458 nodes and 1,294 edges with an average node degree of 1.78 and a clustering coefficient of 0.293 (Supplementary Figure S1). The number of observed edges markedly exceeded the random expectation (expected = 844), and the enrichment p-value < 1.0 × 10−16 indicated a non-random, functionally coordinated interaction architecture.

Nodes were then ranked using two centrality metrics in Cytoscape. The top 10 candidates derived from betweenness and degree centrality were intersected, yielding CCNA2, CD44 and STAT1 as shared hub genes (Figures 2A–C). To validate these candidates, we examined their mRNA expression in two independent cohorts. CCNA2, CD44 and STAT1 were all significantly upregulated in OSCC tissues in GSE30784 and consistently elevated in GSE9844 (Figures 2D–I), supporting their stability across cohorts and endorsing them as credible hub regulators for downstream investigation.

Figure 2
Circular charts A and B show gene interactions, with distinct color-coded sections representing different genes based on betweenness and degree. Diagram C is a Venn diagram displaying overlapping genes: CCNA2, CD44, and STAT1. Bar graphs D to F (GSE30784 dataset) compare mRNA expression of CCNA2, CD44, and STAT1 between control and OSCC, showing significant increases in OSCC. Bar graphs G to I (GSE9844 dataset) present similar comparisons, also indicating higher expression in OSCC. Statistical significance is marked with asterisks.

Figure 2. Network-based hub gene screening and expression validation in OSCC. (A) Top 10 hub candidates ranked by betweenness centrality. (B) Top 10 hub candidates ranked by degree centrality. (C) Intersection of two rankings identifying CCNA2, CD44, and STAT1 as shared hub genes. (D–F) Expression of CCNA2, CD44, and STAT1 in GSE30784. (G–I) Expression of CCNA2, CD44, and STAT1 in GSE9844.

To evaluate the robustness of hub gene selection, a sensitivity analysis was conducted by varying the ranking thresholds for degree and betweenness centrality (top 5, top 10 and top 15 nodes). CCNA2 and CD44 were consistently retained among the top-ranked genes across all thresholds and metrics, while STAT1 remained among the highest-ranked candidates in the majority of conditions (Supplementary Table S1). These findings indicate that the identification of CCNA2, CD44 and STAT1 is not driven by arbitrary cutoff selection, but reflects stable network centrality across multiple analytical settings.

3.3 Validation and characterization of hub genes in OSCC

To validate the robustness of the identified hub genes, we evaluated the expression patterns of CCNA2, CD44 and STAT1 in the TCGA OSCC subset. All three genes were significantly upregulated in tumor tissues compared with normal controls (Figures 3A–C), consistent with the GEO-based discovery. Kaplan-Meier analysis showed that high CCNA2 expression was associated with worse overall survival, and CD44 exhibited a similar adverse trend, whereas elevated STAT1 predicted a more favorable prognosis (Figures 3D–F).

Figure 3
Graphs and charts illustrate mRNA expression levels of CCNA2, CD44, and STAT1 in control versus OSCC groups, showing higher expression in OSCC. Kaplan-Meier survival curves indicate overall survival probabilities for corresponding expressions, with hazard ratios and p-values. Enrichment plots display pathways related to each gene, highlighting significant results with pathway names, enrichment scores, and statistical details.

Figure 3. Validation and characterization of hub genes in TCGA cohort. (A–C) mRNA expression of CCNA2, CD44 and STAT1 in TCGA OSCC vs. normal tissues. (D–F) Kaplan–Meier overall survival curves stratified by CCNA2, CD44 and STAT1 expression. (G–I) GSEA enrichment plots for CCNA2, CD44 and STAT1.

To explore the functional context of these genes, we performed GSEA for each hub gene (Figures 3G–I). CCNA2 was enriched in pathways related to mitotic progression and protein degradation (e.g., Cell cycle, P53 signaling and ubiquitin-mediated proteolysis). CD44 was enriched in cancer-related signaling, tight junction disruption and JAK–STAT pathways, consistent with its roles in adhesion and malignant progression. STAT1 was enriched in P53, JAK–STAT and TGF-beta signaling, aligning with its involvement in immune surveillance and stress adaptation. Notably, all three genes converged on P53 signaling and ubiquitin-mediated proteolysis, suggesting potential cooperative modulation of stress response, genomic stability and protein turnover in OSCC.

To further explore whether the prognostic behavior of STAT1 may be influenced by the tumor immune microenvironment, immune infiltration analyses were performed using the TIMER and ESTIMATE algorithms in the TCGA-HNSC cohort. STAT1 expression exhibited significant positive correlations with multiple immune cell populations, including CD8+ T cells, CD4+ T cells, macrophages and dendritic cells, as well as with stromal enrichment scores (Supplementary Figure S2). These findings suggest that STAT1 expression in OSCC may partially reflect immune- and microenvironment-associated transcriptional programs.

3.4 Prognostic modeling and diagnostic discrimination based on CCNA2, CD44 and STAT1

Given the prognostic implications of CCNA2, CD44 and STAT1, we next assessed their utility in risk prediction and clinical discrimination. A nomogram incorporating all three genes was constructed using a Cox proportional hazards model, in which CCNA2 and CD44 contributed positively to the total risk score, whereas STAT1 exerted a negative contribution, consistent with their respective survival associations (Figure 4A). The calibration curve showed excellent agreement between predicted and observed 1-year survival probabilities (Figure 4B).

Figure 4
Panel A shows a nomogram with scales for CCNA2, CD44, and STAT1, predicting one-year survival probability. Panel B presents a calibration plot comparing observed survival fractions to predicted ones, with a one-year prediction line. Panels C, D, and E display ROC curves for datasets GSE30784, GSE9844, and TCGA, respectively, illustrating sensitivity versus one-specificity for CCNA2, CD44, and STAT1, with AUC values indicated.

Figure 4. Prognostic nomogram and diagnostic performance of CCNA2, CD44 and STAT1. (A) Nomogram model based on CCNA2, CD44 and STAT1. (B) Calibration curve of the nomogram for 1-year survival prediction. (C) ROC curves of CCNA2, CD44 and STAT1 in GSE30784. (D) ROC curves of CCNA2, CD44 and STAT1 in GSE9844. (E) ROC curves of CCNA2, CD44 and STAT1 in TCGA.

To further determine whether CCNA2, CD44 and STAT1 serve as independent prognostic factors, univariate and multivariate Cox proportional hazards regression analyses were performed using TCGA OSCC clinical data. Variables including age, sex and tumor stage were incorporated into the multivariate model. As shown in Supplementary Table S2, CD44 remained marginally associated with overall survival after adjustment, whereas CCNA2 and STAT1 did not retain independent prognostic significance. Age emerged as an independent clinical predictor of outcome, while sex lost significance after multivariable adjustment. These results indicate that the prognostic effects of the identified hub genes may be partially influenced by established clinicopathological factors, supporting their complementary rather than standalone prognostic value.

ROC analysis further demonstrated robust diagnostic performance across independent cohorts: in GSE30784, the AUCs for STAT1, CD44 and CCNA2 were all high; in GSE9844 they remained consistently discriminative; and in TCGA all three genes achieved AUC values above the generally accepted diagnostic threshold (Figures 4C–E). These findings indicate that this three-gene panel can reliably distinguish OSCC from normal tissues and holds stable prognostic and diagnostic utility across platforms.

3.5 Experimental validation of hub gene overexpression in OSCC

To experimentally validate the transcriptome-based findings, we assessed the protein and mRNA expression of CCNA2, CD44 and STAT1 in human OSCC tissues and cell models. Immunohistochemical staining revealed markedly stronger signals for all three markers in OSCC tissues compared with normal oral mucosa (Figure 5A), and semi-quantitative scoring confirmed significant upregulation (Figures 5B–D). Consistent with protein-level changes, qPCR analysis of human tissues demonstrated significantly elevated mRNA expression of all three genes in OSCC samples relative to controls (Figures 5E–G).

Figure 5
Histological and bar chart comparison of oral tissues and cell lines between control and OSCC (Oral Squamous Cell Carcinoma) groups. Panel A shows histological staining for CCNA2, CD44, and STAT1, with increased staining in OSCC. Panels B to D display bar charts showing elevated labeling indices of these proteins in OSCC. Panels E to L depict bar charts of mRNA expression levels for CCNA2, CD44, STAT1, and CRNN in both human tissues and cell lines, highlighting increased expression in OSCC compared to controls. Significant differences are marked with asterisks.

Figure 5. Experimental validation of CCNA2, CD44 and STAT1 expression in OSCC tissues and cell lines. (A) Representative IHC staining of CCNA2, CD44 and STAT1 in control and OSCC tissues. (B–D) Quantification of IHC staining index for CCNA2, CD44 and STAT1 in human oral tissues. (E–G) qPCR analysis of CCNA2, CD44 and STAT1 mRNA levels in human oral tissues. (H–J) qPCR analysis of CCNA2, CD44 and STAT1 mRNA levels in normal versus OSCC cell lines. (K,L) qPCR analysis of CRNN mRNA expression in human oral tissues and cell lines. Data are presented as mean ± SD. *P < 0.05, **P < 0.01, ***P < 0.001.

At the cellular level, qPCR analysis further verified that CAL27 cells expressed substantially higher levels of CCNA2, CD44 and STAT1 than normal oral epithelial cells (HOK) (Figures 5H–J). In addition to these upregulated hub genes, CRNN was selected as a representative downregulated gene based on the integrated transcriptomic analysis and further validated by qPCR. As shown in Figures 5K,L, CRNN expression was significantly reduced in OSCC tissues and cell lines compared with their corresponding controls. Together, these convergent findings across transcriptomic datasets, clinical specimens and cell models provide robust experimental evidence supporting the dysregulated expression patterns of key OSCC-associated genes. Consistent with the IHC findings, Western blot analysis demonstrated significantly increased total STAT1 protein levels in OSCC tissues and CAL27 cells relative to normal oral tissues and HOK cells, respectively (Supplementary Figure S3).

4 Discussion

OSCC is characterized by extensive transcriptional reprogramming, sustained proliferative signaling, epithelial remodeling, immune microenvironmental perturbation and genomic instability (Zhi et al., 2024; da Silva Junior et al., 2018; Yang et al., 2022; Truchard et al., 2022). In this study, we integrated multi-cohort transcriptomic analysis, network-based gene prioritization, outcome interrogation and experimental validation to identify OSCC-associated hub genes with cross-platform stability. Cross-dataset comparison revealed a reproducible transcriptional divergence between OSCC and normal tissues, and network-level inference converged on CCNA2, CD44 and STAT1 as central regulatory nodes with validated expression patterns and prognostic relevance. Functional enrichment suggested that these genes are embedded within proliferation-, adhesion- and stress-associated programs that are mechanistically aligned with OSCC biology. Their consistent elevation across public datasets, human tissues and cell models supports dual value for molecular characterization and translational applicability.

CCNA2, CD44 and STAT1 represent three biologically distinct yet functionally relevant classes of regulators. CCNA2 encodes a core cyclin that orchestrates mitotic progression and has been repeatedly implicated in epithelial tumor proliferation (Li et al., 2021; Zhou et al., 2024). CD44 functions as a stemness- and adhesion-related surface receptor widely linked to invasion, migration and metastatic competence across carcinomas (Senbanjo and Chellaiah, 2017; Xu et al., 2015; Li et al., 2022). STAT1 serves as a transcriptional effector of interferon signaling and has been reported to exert immune-mediated tumor restraint in a context-dependent manner (Koromilas and Sexl, 2013; Zimmerer et al., 2007). Given the immune-related function of STAT1, we further explored its association with the tumor immune microenvironment. Immune infiltration analyses suggested that STAT1 expression was positively associated with immune and stromal enrichment, indicating that its upregulation may partially reflect immune cell infiltration rather than tumor-intrinsic expression alone. This context-dependent feature may help explain the relatively favorable prognostic trend observed for STAT1 in OSCC. In OSCC, we observed that CCNA2 and CD44 were consistently upregulated and associated with adverse survival, consistent with their reported oncogenic behavior in epithelial malignancies (Zhang et al., 2024; Hassn et al., 2021). By contrast, STAT1 elevation correlated with more favorable prognosis and enrichment in immune- and stress-related pathways, aligning with literature describing STAT1-mediated antitumor surveillance under specific biological contexts (Zhang et al., 2020). Importantly, multivariate Cox proportional hazards analysis incorporating age, sex and tumor stage indicated that the prognostic associations of CCNA2, CD44 and STAT1 were attenuated after adjustment for established clinicopathological variables (Supplementary Table S2). While CD44 retained a marginal association with overall survival, CCNA2 and STAT1 did not demonstrate independent prognostic significance in the fully adjusted model. These findings suggest that the survival relevance of these hub genes may be partially mediated through their linkage with underlying tumor biology and clinical features, underscoring their complementary rather than standalone prognostic value.

GSEA-based enrichment provides mechanistic support linking the three hub genes to OSCC biology rather than statistical coincidence. CCNA2 enrichment in cell cycle and p53-associated programs is consistent with previous evidence showing that checkpoint override and mitotic drive are prerequisites for malignant expansion in epithelial cancers (Fang et al., 2022; Ruan et al., 2017). CD44 enrichment in tight-junction disruption, cancer-associated signaling and JAK–STAT activation aligns with reports that epithelial disassembly, ECM engagement and cytokine-integrated signaling are hallmarks of invasive transition in OSCC and related squamous tumors (Okamoto et al., 2016; Dolatabadi et al., 2022; Ortiz et al., 2018). STAT1 enrichment in interferon-, p53- and TGF-β–related circuits is consistent with its documented role in immune-linked stress restraint and tumor surveillance (Liang et al., 2004; Townsend et al., 2004). Of note, all three genes converged on p53 signaling and ubiquitin-mediated proteolysis, supporting the view that OSCC evolution arises not from a single linear oncogenic axis, but from sustained erosion of genome-stability surveillance and proteostasis control.

Although CCNA2, CD44 and STAT1 were validated at both the transcript and protein levels, the present study is primarily associative in design and does not establish that these genes are directly causative for OSCC progression. Definitive mechanistic involvement will require future functional perturbation studies, such as CRISPR-based gene disruption, rescue experiments or pharmacologic modulation.

With respect to STAT1, although its expression was consistently elevated and clinically associated, the current study focused on expression-level and network-associated evidence rather than post-translational activation status. In particular, phosphorylation at Tyr701 and Ser727, which is critical for STAT1 transcriptional activity, was not assessed in the present work. Accordingly, our findings should be interpreted as indicative of STAT1 dysregulation at the expression and network level, rather than direct confirmation of STAT1 signaling activation.

Moreover, all transcriptomic analyses in this study were conducted using bulk-tissue datasets, which limit the resolution of cell-type–specific expression patterns. As a result, the observed upregulation cannot be unequivocally attributed to malignant epithelial cells, infiltrating immune populations or stromal components. Future integration of single-cell or spatially resolved transcriptomic approaches will be necessary to clarify cellular provenance and regulatory context, thereby refining the biological interpretation of these findings.

Taken together, this study delineates a reproducible molecular axis in OSCC through integrated transcriptomic screening, network-level inference, multi-cohort validation and experimental confirmation. CCNA2, CD44 and STAT1 emerge not as incidental correlates but as network-defined and functionally anchored hub genes embedded in biological programs consistent with established determinants of OSCC evolution. Their consistent upregulation across independent datasets, human specimens and cell models, together with their diagnostic and prognostic relevance, positions this three-gene set as a rational molecular substrate for both mechanistic dissection and biomarker development. These findings refine the molecular landscape of OSCC and provide a defined entry point for subsequent functional, diagnostic and translational research.

5 Conclusion

This study identifies CCNA2, CD44 and STAT1 as reproducible and biologically associated hub genes in oral squamous cell carcinoma through the integration of multi-cohort transcriptomic analysis, network-level prioritization, clinical validation and experimental confirmation. These genes are embedded within proliferation-, adhesion- and stress-related signaling programs consistent with established pathogenic features of OSCC, and they exhibit consistent upregulation across independent datasets, human specimens and cell models. The diagnostic and prognostic relevance of this three-gene set, together with its network positioning, provides a rational molecular basis for downstream mechanistic interrogation and biomarker development. Overall, these findings refine the molecular framework of OSCC and establish CCNA2, CD44 and STAT1 as promising candidates for future functional and translational research.

Data availability statement

The transcriptomic data analyzed in this study are publicly available in the NCBI Gene Expression Omnibus (GEO) repository under accession numbers GSE28133 and GSE285957. All scripts used for data preprocessing and analysis are available at https://github.com/liujiangying68/PVR-ferroptosis-transcriptomics

Ethics statement

The studies involving humans were approved by the institutional ethics committee of Hospital of Stomatology, Hebei Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

MB: Conceptualization, Writing – original draft. YG: Funding acquisition, Writing – original draft. YM: Methodology, Writing – original draft. YyC: Validation, Writing – original draft. QW: Formal Analysis, Writing – original draft. YK: Investigation, Writing – original draft. HL: Resources, Writing – original draft. YC: Data curation, Writing – original draft. JZ: Visualization, Writing – original draft. LG: Project administration, Writing – original draft. YY: Conceptualization, Writing – review and editing. DH: Supervision, Writing – review and editing. ZM: Conceptualization, Supervision, Writing – review and editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This research was funded by Natural Science Foundation of Hebei Province (H2023206491). High-level Talent Support Program of Hebei Province (C20231011). Medical Science Research Project of Hebei (No. 20250766) and Scientific Research Project of Hebei Stomatological Association (No. QN2024009).

Acknowledgements

We gratefully acknowledge Hospital of Stomatology, Hebei Medical University for providing conducive research spaces that have greatly facilitated our work and collaboration.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmolb.2026.1748821/full#supplementary-material

References

Bian, Z., Chen, J., Liu, C., Ge, Q., Zhang, M., Meng, J., et al. (2022). Landscape of the intratumroal microenvironment in bladder cancer: implications for prognosis and immunotherapy. Comput. Struct. Biotechnol. J. 21, 74–85. doi:10.1016/j.csbj.2022.11.052

PubMed Abstract | CrossRef Full Text | Google Scholar

da Silva Junior, I. A., de Sousa Andrade, L. N., Jancar, S., and Chammas, R. (2018). Platelet activating factor receptor antagonists improve the efficacy of experimental chemo- and radiotherapy. Clin. (Sao Paulo) 73 (Suppl. 1), e792s. doi:10.6061/clinics/2018/e792s

PubMed Abstract | CrossRef Full Text | Google Scholar

de Morais, E. F., Mäkitie, A., Coletta, R. D., and Almangush, A. (2025). Stromal prognostic markers for oral cancer: an update. Oral Dis. 31 (7), 2019–2025. doi:10.1111/odi.15278

PubMed Abstract | CrossRef Full Text | Google Scholar

Dolatabadi, S., Jonasson, E., Andersson, L., Luna Santamaría, M., Lindén, M., Österlund, T., et al. (2022). FUS-DDIT3 fusion oncoprotein expression affects JAK-STAT signaling in myxoid liposarcoma. Front. Oncol. 12, 816894. doi:10.3389/fonc.2022.816894

PubMed Abstract | CrossRef Full Text | Google Scholar

Dong, L., Xue, L., Cheng, W., Tang, J., Ran, J., and Li, Y. (2024). Comprehensive survival analysis of oral squamous cell carcinoma patients undergoing initial radical surgery. BMC Oral Health 24 (1), 919. doi:10.1186/s12903-024-04690-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Fang, Z., Wang, F., Zhang, M., Huang, H., and Lin, Z. (2022). Identification of Co-Expression modules and genes associated with tumor progression in oral squamous cell carcinoma. Pathol. Oncol. Res. 28, 1610481. doi:10.3389/pore.2022.1610481

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, J., Zhou, S., Huang, P., Xu, S., Zhang, G., He, H., et al. (2020). NNK-mediated upregulation of DEPDC1 stimulates the progression of oral squamous cell carcinoma by inhibiting CYP27B1 expression. Am. J. Cancer Res. 10 (6), 1745–1760.

PubMed Abstract | Google Scholar

Hassn, M. M., Syafruddin, S. E., Mohtar, M. A., and Syahir, A. (2021). CD44: a multifunctional mediator of cancer progression. Biomolecules 11 (12), 1850. doi:10.3390/biom11121850

PubMed Abstract | CrossRef Full Text | Google Scholar

Hsu, P. J., Yan, K., Shi, H., Izumchenko, E., and Agrawal, N. (2020). Molecular biology of oral cavity squamous cell carcinoma. Oral Oncol. 102, 104552. doi:10.1016/j.oraloncology.2019.104552

PubMed Abstract | CrossRef Full Text | Google Scholar

Johnson, D. E., Burtness, B., Leemans, C. R., Lui, V. W. Y., Bauman, J. E., and Grandis, J. R. (2020). Head and neck squamous cell carcinoma. Nat. Rev. Dis. Prim. 6 (1), 92. doi:10.1038/s41572-020-00224-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, M. J., and Ahn, K. M. (2024). Prognostic factors of oral squamous cell carcinoma: the importance of recurrence and pTNM stage. Maxillofac. Plast. Reconstr. Surg. 46 (1), 8. doi:10.1186/s40902-024-00410-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Koromilas, A. E., and Sexl, V. (2013). The tumor suppressor function of STAT1 in breast cancer. JAKSTAT 2 (2), e23353. doi:10.4161/jkst.23353

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Z., Zhong, L., Du, Z., Chen, G., Shang, J., Yang, Q., et al. (2019). Network analyses of differentially expressed genes in osteoarthritis to identify hub genes. Biomed. Res. Int. 2019, 8340573. doi:10.1155/2019/8340573

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, H., Weng, Y., Wang, S., Wang, F., Kong, P., Zhang, L., et al. (2021). CDCA7 facilitates tumor progression by directly regulating CCNA2 expression in esophageal squamous cell carcinoma. Front. Oncol. 11, 734655. doi:10.3389/fonc.2021.734655

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, B., Qin, Y., Yu, X., Xu, X., and Yu, W. (2022). Lipid raft involvement in signal transduction in cancer cell survival, cell death and metastasis. Cell Prolif. 55 (1), e13167. doi:10.1111/cpr.13167

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, M., Song, N., Sun, D., Yu, Y., Zheng, W., Zhang, X., et al. (2023). Transcriptome mapping of the internal N7-Methylguanosine methylome in messenger RNAs in human oral squamous cell carcinoma. Front. Biosci. Landmark Ed. 28 (12), 330. doi:10.31083/j.fbl2812330

PubMed Abstract | CrossRef Full Text | Google Scholar

Liang, M., Melchior, F., Feng, X. H., and Lin, X. (2004). Regulation of Smad4 sumoylation and transforming growth factor-beta signaling by protein inhibitor of activated STAT1. J. Biol. Chem. 279 (22), 22857–22865. doi:10.1074/jbc.M401554200

PubMed Abstract | CrossRef Full Text | Google Scholar

Lohavanichbutr, P., Houck, J., Doody, D. R., Wang, P., Mendez, E., Futran, N., et al. (2012). Gene expression in uninvolved oral mucosa of OSCC patients facilitates identification of markers predictive of OSCC outcomes. PLoS One 7 (9), e46575. doi:10.1371/journal.pone.0046575

PubMed Abstract | CrossRef Full Text | Google Scholar

Long, F., Li, S., Xu, Y., Liu, M., Zhang, X., Zhou, J., et al. (2022). Dynamic gene screening enabled identification of a 10-gene panel for early detection and progression assessment of gastric cancer. Comput. Struct. Biotechnol. J. 21, 677–687. doi:10.1016/j.csbj.2022.12.036

PubMed Abstract | CrossRef Full Text | Google Scholar

Okamoto, K., Ninomiya, I., Ohbatake, Y., Hirose, A., Tsukada, T., Nakanuma, S., et al. (2016). Expression status of CD44 and CD133 as a prognostic marker in esophageal squamous cell carcinoma treated with neoadjuvant chemotherapy followed by radical esophagectomy. Oncol. Rep. 36 (6), 3333–3342. doi:10.3892/or.2016.5133

PubMed Abstract | CrossRef Full Text | Google Scholar

Ortiz, R. C., Lopes, N. M., Amôr, N. G., Ponce, J. B., Schmerling, C. K., Lara, V. S., et al. (2018). CD44 and ALDH1 immunoexpression as prognostic indicators of invasion and metastasis in oral squamous cell carcinoma. J. Oral Pathol. Med. 47 (8), 740–747. doi:10.1111/jop.12734

PubMed Abstract | CrossRef Full Text | Google Scholar

Rades, D., Seibold, N. D., Hoffmann, A., Gebhard, M. P., Noack, F., Thorns, C., et al. (2013). Impact of the HPV-positivity definition on the prognostic value of HPV status in patients with locally advanced squamous cell carcinoma of the head and neck. Strahlenther Onkol. 189 (10), 856–860. doi:10.1007/s00066-013-0377-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Ritchie, M. E., Phipson, B., Wu, D., Hu, Y., Law, C. W., Shi, W., et al. (2015). Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43 (7), e47. doi:10.1093/nar/gkv007

PubMed Abstract | CrossRef Full Text | Google Scholar

Ruan, J. S., Zhou, H., Yang, L., Wang, L., Jiang, Z. S., and Wang, S. M. (2017). CCNA2 facilitates epithelial-to-mesenchymal transition via the integrin αvβ3 signaling in NSCLC. Int. J. Clin. Exp. Pathol. 10 (8), 8324–8333.

PubMed Abstract | Google Scholar

Senbanjo, L. T., and Chellaiah, M. A. (2017). CD44: a multifunctional cell surface adhesion receptor is a regulator of progression and metastasis of cancer cells. Front. Cell Dev. Biol. 5, 18. doi:10.3389/fcell.2017.00018

PubMed Abstract | CrossRef Full Text | Google Scholar

Struckmeier, A. K., Buchbender, M., Lutz, R., and Kesting, M. (2024). Improved recurrence rates and progression-free survival in primarily surgically treated oral squamous cell carcinoma - results from a German tertiary medical center. Clin. Oral Investig. 28 (5), 262. doi:10.1007/s00784-024-05644-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Subramanian, A., Tamayo, P., Mootha, V. K., Mukherjee, S., Ebert, B. L., Gillette, M. A., et al. (2005). Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. U. S. A. 102 (43), 15545–15550. doi:10.1073/pnas.0506580102

PubMed Abstract | CrossRef Full Text | Google Scholar

Townsend, P. A., Scarabelli, T. M., Davidson, S. M., Knight, R. A., Latchman, D. S., and Stephanou, A. (2004). STAT-1 interacts with p53 to enhance DNA damage-induced apoptosis. J. Biol. Chem. 279 (7), 5811–5820. doi:10.1074/jbc.M302637200

PubMed Abstract | CrossRef Full Text | Google Scholar

Truchard, E., Bertolus, C., Martinez, P., Thomas, E., Saintigny, P., and Foy, J. P. (2022). Identification of a gene-expression-based surrogate of genomic instability during oral carcinogenesis. Cancers (Basel) 14 (3), 834. doi:10.3390/cancers14030834

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, H., Tian, Y., Yuan, X., Wu, H., Liu, Q., Pestell, R. G., et al. (2015). The role of CD44 in epithelial-mesenchymal transition and cancer development. Onco Targets Ther. 8, 3783–3792. doi:10.2147/OTT.S95470

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, X., Sun, T., Zhao, Y., Liu, S., and Liang, X. (2022). 4sc-202 and Ink-128 cooperate to reverse the epithelial to mesenchymal transition in OSCC. Oral Dis. 28 (8), 2139–2148. doi:10.1111/odi.13860

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, J., Wang, F., Liu, F., and Xu, G. (2020). Predicting STAT1 as a prognostic marker in patients with solid cancer. Ther. Adv. Med. Oncol. 12, 1758835920917558. doi:10.1177/1758835920917558

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, X., Song, Y., Chen, X., Zhuang, X., Wei, Z., and Yi, L. (2022). Integration of genetic and immune infiltration insights into data mining of multiple sclerosis pathogenesis. Comput. Intell. Neurosci. 2022, 1661334. doi:10.1155/2022/1661334

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, J., Di, Y., Zhang, B., Li, T., Li, D., and Zhang, H. (2024). CDK1 and CCNA2 play important roles in oral squamous cell carcinoma. Med. Baltim. 103 (16), e37831. doi:10.1097/MD.0000000000037831

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhi, Y., Wang, Q., Zi, M., Zhang, S., Ge, J., Liu, K., et al. (2024). Spatial transcriptomic and metabolomic landscapes of oral submucous fibrosis-derived oral squamous cell carcinoma and its tumor microenvironment. Adv. Sci. (Weinh) 11 (12), e2306515. doi:10.1002/advs.202306515

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, H. Y., Wang, Y. C., Wang, T., Wu, W., Cao, Y. Y., Zhang, B. C., et al. (2024). CCNA2 and NEK2 regulate glioblastoma progression by targeting the cell cycle. Oncol. Lett. 27 (5), 206. doi:10.3892/ol.2024.14339

PubMed Abstract | CrossRef Full Text | Google Scholar

Zimmerer, J. M., Lesinski, G. B., Radmacher, M. D., Ruppert, A., and Carson, W. E. (2007). STAT1-dependent and STAT1-independent gene expression in murine immune cells following stimulation with interferon-alpha. Cancer Immunol. Immunother. 56 (11), 1845–1852. doi:10.1007/s00262-007-0329-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: CCNA2, CD44, hub genes, oral squamous cell carcinoma, STAT1

Citation: Bu M, Gu Y, Meng Y, Cui Y, Wu Q, Kong Y, Liu H, Chen Y, Zhang J, Guo L, Yang Y, Huang D and Ma Z (2026) Integrated network analysis and experimental validation identify CCNA2, CD44, and STAT1 as clinically relevant hub genes in oral squamous cell carcinoma. Front. Mol. Biosci. 13:1748821. doi: 10.3389/fmolb.2026.1748821

Received: 18 November 2025; Accepted: 20 January 2026;
Published: 09 February 2026.

Edited by:

Matteo Becatti, University of Firenze, Italy

Reviewed by:

Sanjit Roy, Glycomontra, Inc., United States
Rui Vitorino, University of Aveiro, Portugal

Copyright © 2026 Bu, Gu, Meng, Cui, Wu, Kong, Liu, Chen, Zhang, Guo, Yang, Huang and Ma. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zhe Ma, bWF6aGVAaGVibXUuZWR1LmNu; Dongyang Huang, aHVhbmdkeUBoZWJtdS5lZHUuY24=; Yakun Yang, eWFuZ3lha3VuQGhlYm11LmVkdS5jbg==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.