Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med., 02 July 2025

Sec. Rheumatology

Volume 12 - 2025 | https://doi.org/10.3389/fmed.2025.1518580

This article is part of the Research TopicGenetic and molecular research on osteoarthritis and osteoporosisView all 3 articles

Identification of key genes in osteoarthritis development: biomarker discovery and therapeutic targets

Liming Wu&#x;Liming Wu1Disheng Wen&#x;Disheng Wen2Weizhou WangWeizhou Wang3Yanghao WangYanghao Wang4Li Zhang
Li Zhang2*
  • 1Department of Orthopedics, Sichuan Santai County People's Hospital, Mianyang, China
  • 2Department of Orthopedics (Fourth), Dali Bai Autonomous Prefecture People's Hospital, Dali, China
  • 3Department of Orthopedics, The First Affiliated Hospital of Kunming Medical University, Kunming, China
  • 4Department of Pathology, The First Affiliated Hospital of Kunming Medical University, Kunming, China

Introduction: Osteoarthritis (OA) is the most common joint disorder and a leading cause of disability in the older adult. Early diagnosis and treatment are crucial for effective disease management and improved outcomes. This study aims to identify key genes involved in OA progression using bioinformatics, which may serve as diagnostic biomarkers and therapeutic targets.

Methods: Synovial tissue sequencing data (GSE1919, GSE55235, GSE82107) were retrieved from the Gene Expression Omnibus (GEO) database. Differentially expressed genes (DEGs) were analyzed using Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and protein–protein interaction (PPI) network analysis. ROC curve analysis was used to assess diagnostic potential, and results were validated using the GSE29746 dataset and synovial tissues from five OA patients and controls.

Results: A total of 33 common DEGs were identified across three datasets. Four hub genes (CXCL8, CXCL2, DUSP5, TNFSF11) showed high diagnostic potential [area under the receiver operating characteristic curve (AUC) > 0.8]. These genes were also linked to potential therapeutic agents, including lipopolysaccharide and acetaminophen.

Conclusion: CXCL8, CXCL2, DUSP5, and TNFSF11 represent novel multi-functional biomarkers that advance OA research by addressing two critical limitations of prior biomarker studies: (1) overcoming the diagnostic inadequacy of single-biomarker approaches through synergistic clusters, and (2) revealing an unreported integrative mechanism linking inflammatory pathways (CXCL8/2) and bone remodeling processes (TNFSF11/DUSP5). This dual diagnostic-therapeutic potential significantly expands the clinical applicability of OA biomarkers.

1 Introduction

Osteoarthritis (OA) is a common age-related degenerative disease that results in chronic joint pain and functional impairment, severely affecting patients’ quality of life (1, 2). Current therapeutic approaches predominantly target symptom relief, particularly inflammation and pain management, but are largely ineffective at preventing disease progression (3). As a result, many patients with end-stage OA ultimately require surgical intervention (4, 5). Early diagnosis and intervention are critical for improving patient outcomes and optimizing healthcare resource allocation. OA is marked by synovial inflammation, subchondral bone remodeling, and cartilage degradation, with synovial lesions increasingly recognized as central to disease development and joint pain (68). Synovial changes alter the intra-articular environment, playing a key role in the breakdown of the extracellular matrix and cartilage degradation (9, 10). Understanding the role of synovial tissue lesions in OA progression is thus of paramount importance (11).

While previous biomarker studies in OA have identified individual candidates (e.g., COMP, CTX-II), they are limited by a focus on singular targets that fail to capture the multifactorial pathogenesis of OA. This study addresses this critical gap by identifying functionally interconnected biomarker clusters that collectively reflect OA’s complexity. We establish three key methodological advances: (1) Integrated analysis of four independent synovial tissue datasets (training: GSE1919, GSE55235, GSE82107; validation: GSE29746) enhances the reliability of DEG identification, (2) Combined application of GO, KEGG, GSEA, and PPI network analysis enables systematic elucidation of both functional and interactive properties of hub genes, (3) Multi-level validation strategy incorporating ROC analysis, independent dataset replication, and patient-derived tissue verification. This approach uniquely identifies CXCL8, CXCL2, DUSP5, and TNFSF11 as a novel biomarker cluster—simultaneously implicating inflammatory pathways (CXCL8/2) and bone remodeling processes (TNFSF11/DUSP5)—providing a framework for combined diagnostic panels and multi-target therapies that transcend conventional single-biomarker limitations. In this study, we analyzed synovial tissue gene expression profiles from four datasets obtained from the Gene Expression Omnibus (GEO) database, including data from OA patients and healthy controls, which were divided into training and validation sets. Bioinformatics analyses, including Gene Ontology (GO) enrichment, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis, Gene Set Enrichment Analysis (GSEA), and Protein–Protein Interaction (PPI) network analysis, were employed to identify differentially expressed genes (DEGs) and to elucidate their biological functions. The diagnostic accuracy of hub genes was assessed using receiver operating characteristic (ROC) analysis, which also facilitated predictions regarding OA progression and early therapeutic strategies based on these diagnostic genes.

This comprehensive approach offers new insights into the regulatory genes and molecular mechanisms underlying OA, contributing to a more detailed understanding of the disease’s pathophysiology and potential therapeutic avenues. While specific clinical biomarkers for OA remain elusive, *our identification of functionally interconnected biomarker clusters (CXCL8, CXCL2, DUSP5, TNFSF11) provides a novel framework for developing combined diagnostic panels and multi-target therapies, surpassing the limitations of conventional single-biomarker approaches.

2 Materials and methods

2.1 Data collection

The gene expression values from datasets GSE1919, GSE55235, GSE82107, and GSE29746 were retrieved from the Gene Expression Omnibus (GEO) database using the GEOquery package. Synovial biopsy samples from both OA patients and healthy controls were used to identify differentially expressed genes (DEGs) associated with OA, potentially serving as disease-specific signature genes. To ensure analytical rigor, the four datasets were divided into training sets (GSE1919, GSE55235, GSE82107) and an independent validation set (GSE29746) based on sample size (>20 samples per group) and platform compatibility (all Affymetrix platforms). The data were normalized using the “normalizeBetweenArrays” function in the limma [3.52.2] package in R (version 4.2.1). Sample subgroup clustering was visualized through principal component analysis (PCA) plots. Figure 1 provides a flowchart outlining the study design.

Figure 1
Flowchart illustrating the analysis process using datasets. Training set includes OA datasets GSE55235, GSE1919, and GSE82107, leading to differentially expressed genes (DEGs), gene set enrichment analysis (GSEA), and further analyses such as GO/KEGG, PPI analysis, and hub gene identification. Validation set includes OA dataset GSE29746. These analyses lead to gene expression, ROC curve, miRNA-gene network, diagnostic gene identification, and targeted drug prediction.

Figure 1. Flowchart presentation of this study.

2.2 Analysis of differentially expressed genes

Differential gene expression analysis between OA and control groups was conducted using the limma [3.52.2] package in R (version 4.2.1). The thresholds |log2(FC)| > 1 and p-value < 0.05 were selected based on: (1) Biological significance: |log2(FC)| > 1 ensures ≥2-fold expression changes, a standard cutoff for functionally relevant alterations in synovial transcriptomics; (2) Statistical rigor: p-value < 0.05 with Benjamini-Hochberg correction controls false discovery rates (FDR) in multi-dataset analyses (12). The significant DEGs were visualized using heatmaps (ComplexHeatmap [2.13.1]) and volcano plots (ggplot2 [3.3.6]). Gene Set Enrichment Analysis (GSEA) was performed with clusterProfiler (reference gene set: c2.cp.all.v2022.1. Hs.symbols.gmt). Enriched pathways were filtered at p.adj < 0.05 and FDR (q-value) < 0.25.

2.3 Expression analysis of common DEGs and hub genes

The intersection of DEGs from the three training sets was visualized using ggplot2 [3.3.6] and VennDiagram [1.7.3] in R (version 4.2.1). Subsequently, Gene Ontology (GO) and KEGG enrichment analyses were conducted using the clusterProfiler [4.4.4] package, with input gene IDs transformed using org. Hs.eg.dbR. Protein–protein interaction (PPI) network analysis was performed via the STRING database (version 11.5) with the following parameters: Minimum required interaction score: 0.7 (high confidence), Active interaction sources: Experiments & Databases, Excluded disconnected nodes. The resultant network (contained 33 nodes and 87 edges) was imported into Cytoscape software (version 3.9.0). Hub genes were identified using the Edge Percolated Component (EPC) algorithm in Cytohubba plugin with default parameters (edge weight: combined score; node score: betweenness centrality). The top 10 hub genes were selected based on descending EPC scores. The expression levels and Spearman correlations of DEGs were analyzed across the three training sets.

2.4 Identification of diagnostic genes

We illustrated the expression levels of key genes between OA patients and healthy controls using scatter plots and box plots. ROC analysis was conducted using the pROC [1.18.0] package in R (version 4.2.1), with results visualized using ggplot2 [3.3.6]. Training phase: Internal validation via 10-fold cross-validation repeated 3 times on combined training sets (GSE1919 + GSE55235 + GSE82107), Validation phase: Independent testing on GSE29746 dataset. Diagnostic genes were selected from both the training and validation datasets using an AUC threshold of >0.800.

2.5 Synovial sample collection and validation for differential expression

Validation cohort selection criteria: Synovial tissues from five OA patients (Kellgren-Lawrence grade II-III) and five age/sex-matched healthy controls (mean age 58.2 ± 4.3 vs. 56.8 ± 5.1 years, p > 0.05) were collected at our hospital. Inclusion criteria for OA patients: (1) Diagnosed according to ACR clinical criteria, (2) No history of inflammatory arthritis or joint infection, (3) No intra-articular injections within 6 months. Control group criteria: Individuals undergoing knee arthroscopy for meniscal injuries without radiographic OA (Kellgren-Lawrence grade 0) or synovitis (histological evaluation). Informed consent was obtained from all participants, and the study was approved by the hospital’s ethics committee. RNA isolation and extraction were performed using the TRIzol method as per the manufacturer’s protocol (Invitrogen). Complementary DNA (cDNA) synthesis was performed using a reverse transcription kit (Vazyme, 7E581J1). To ensure reproducibility, all validation experiments included: (1) Technical triplicates for each sample, (2) Negative controls without template, (3) Reference gene normalization (GAPDH and ACTB). RNA quality was assessed by: Nanodrop 2000 spectrophotometer (Thermo Fisher): OD260/280 ratios 1.9–2.1. For gene amplification, 1 μL of cDNA template and 0.6 μL of primers per gene were used in a 20-μL reaction. PCR amplification included an initial denaturation at 95°C for 10 min, followed by 40 cycles of denaturation at 95°C for 10 s, annealing, and extension at 60°C for 32 s.

2.6 Construction of miRNA-gene regulatory network

The starBase database1 was used to predict the interactions between diagnostic genes and miRNAs. The resulting miRNA-gene regulatory network was visualized using Cytoscape software (version 3.9.0).

2.7 Prediction of potential therapeutic drugs

Potential therapeutic drugs for OA were predicted by constructing gene-drug interaction networks using the Comparative Toxicogenomics Database (CTD). The CTD integrates data from published studies on chemical compounds, genetics, phenotypes, and diseases, aiding in the prediction of potential drug targets for key OA-related genes. Drug Prediction Protocol: Gene-drug interaction retrieval: Input: Official gene symbols of diagnostic markers, Filters: Interaction score >30, evidence count ≥5. Therapeutic relevance scoring: Mechanism score: 0–1 (1 = direct OA pathway evidence), Clinical score: 0–1 (1 = Phase III trial/OA approval), Priority score = 0.6 × Mechanism + 0.4 × Clinical. Manual curation: Excluded non-human studies: Removed compounds with off-target toxicity (LD50 < 50 mg/kg).

3 Results

3.1 Identification of DEGs in synovial tissues of OA patients and functional analysis

Using the thresholds of |log2(FC)| > 1 and p-value < 0.05, a total of 541 differentially expressed genes (DEGs) were identified from the GSE1919 training set, with 240 significantly upregulated and 301 significantly downregulated genes (Supplementary Table 1). DEGs were visualized using volcano plots and heat maps (Figure 2A). Similarly, 958 DEGs were identified from the GSE55235 training set, including 519 significantly upregulated and 439 significantly downregulated genes (Supplementary Table 2), visualized through volcano plots and heat maps (Figure 2B). In the GSE82107 training set, 1,213 DEGs were identified, comprising 728 significantly upregulated and 485 significantly downregulated genes (Supplementary Table 3), also visualized using volcano plots and heat maps (Figure 2C).

Figure 2
Three panels A, B, and C contain heatmaps and volcano plots. Heatmaps show gene expression levels with colors for upregulation (red) and downregulation (blue). Volcano plots depict significance (Y-axis: -Log10 P-value) versus fold change (X-axis: Log2 Fold Change) with dots representing upregulated (red), downregulated (blue), and non-significant genes (gray). Each panel combines a heatmap with a corresponding volcano plot, illustrating diverse gene expression patterns and statistical significance across conditions.

Figure 2. Differential gene expression in the three training sets. (A) Volcano and heatmap display of differential gene expression results for the GSE1919 dataset; (B) Volcano and heatmap display of differential gene expression results for the GSE55235 dataset; (C) Volcano and heatmap display of differential gene expression results for the GSE82107 dataset.

Gene Set Enrichment Analysis (GSEA) revealed several key pathways associated with OA progression, including REACTOME_ADAPTIVE_IMMUNE_SYSTEM, WP_GLUCOCORTICOID_RECEPTOR_PATHWAY, REACTOME_ASSEMBLY_OF_COLLAGEN_FIBRILS_AND_OTHER_MULTIMERIC_STRUCTURES, REACTOME_COLLAGEN_BIOSYNTHESIS_AND_MODIFYING_ENZYMES, REACTOME_EXTRACELLULAR_MATRIX_ORGANIZATION, and REACTOME_DEGRADATION_OF_THE_EXTRACELLULAR_MATRIX (Figures 3BD; Supplementary Table 4). By integrating DEGs from all three training sets, we identified 33 overlapping DEGs (Figure 3A).

Figure 3
A series of scientific visualizations depicting gene set enrichment analysis. (A) Venn diagram showing overlap among three datasets (GSE1919, GSE55235, GSE82107). (B) Bubble plot illustrating pathway enrichment scores and set sizes. (C) Graphs showing enrichment scores for various pathways. (D) Enrichment score plots ranked by dataset order. (E) Dot plot of biological processes, molecular functions, and KEGG pathways with gene ratios and significance. (F) Chord diagram linking genes to functional categories with log-fold change gradients.

Figure 3. Identification of DEGs and their functional analysis. (A) Venn diagram between overlapping genes; (B) Bubble diagram results of enrichment analysis using Gene Set Enrichment Analysis (GSEA); (C) Mountain range diagram results of enrichment analysis using GSEA; (D) Classical results of enrichment analysis using GSEA; (E) Bubble diagrams of GO and KEGG pathways enriched for 33 DEGs; (F) Chord diagrams of GO and KEGG pathways enriched for 33 DEGs.

Further analysis of these 33 overlapping DEGs revealed enriched biological processes (BPs) such as cellular response to nerve growth factor stimulus, response to nerve growth factor, and endoderm formation. In terms of molecular function (MF), the DEGs were enriched for protein tyrosine/threonine phosphatase activity, MAP kinase tyrosine/serine/threonine phosphatase activity, and MAP kinase phosphatase activity. KEGG analysis linked the DEGs to pathways including Rheumatoid arthritis, IL-17 signaling pathway, and NF-kappa B signaling pathway (Figures 3EF; Supplementary Tables 5, 6).

3.2 PPI network analysis and hub gene identification

A protein–protein interaction (PPI) network comprising the 33 differentially expressed genes was constructed using the STRING database to elucidate their interactions (Figure 4A). The top 10 hub genes were identified using the Cytohubba plugin in Cytoscape software (Figure 4B). The expression levels of these hub genes across the three training sets are shown in Figures 4CE, and their correlations are presented in Figures 4FH. Significant expression differences were observed for CXCL8, CXCL2, DUSP5, TNFSF11, and TCA1 in the GSE29746 validation set (Figure 5).

Figure 4
A series of visualizations with eight panels labeled A to H. Panel A: A network diagram showing interactions between various proteins, with nodes and connecting lines. Panel B: A simplified network with square nodes, highlighting central proteins like CXCL8. Panels C to E: Violin plots comparing values of different proteins between control and OA groups, using blue and red color coding. Statistical significance is indicated by asterisks. Panels F to H: Heatmaps displaying correlation matrices of proteins, with color gradients representing correlation values and significant correlations marked with asterisks.

Figure 4. Protein–protein interaction network and expression of hub genes. (A) Protein–protein interaction network constituted by DEGs; (B) Top 10 hub genes; (C–E) Pod plots of the expression of the 10 hub genes in GSE1919, GSE55235, and GSE82107. (F–H) Spearman correlation of the 10 hub genes in GSE1919, GSE55235, and GSE82107.

Figure 5
Box plots compare gene expression levels between control and OA groups for ten genes: CXCL8, CXCL2, DUSP5, TNFSF11, TCA1, FOSB, PTX3, DUSP2, SELL, and CDKN1A. Significant differences are indicated.

Figure 5. Expression of the 10 hub genes in the GSE29746 validation set.

3.3 Identification and validation of diagnostic signature biomarkers

Receiver operating characteristic (ROC) curves were employed to assess the diagnostic value of the 10 hub genes in OA (Figures 6AC). The results indicated that, with the exception of DUSP2, the remaining nine genes showed high diagnostic value (AUC > 0.7) for OA diagnosis. Validation in the GSE29746 set (Figures 6DM) confirmed that CXCL8 [AUC = 1.000 (95% CI 1.000–1.000)] and CXCL2 [AUC = 0.901 (95% CI 0.722–1.000)] had excellent predictive accuracy for OA onset (AUC > 0.9), while DUSP5 [AUC = 0.851 (95% CI 0.669–1.000)], TNFSF11 [AUC = 0.810 (95% CI 0.622–0.997)], and TAC1 [AUC = 0.785 (95% CI 0.576–0.884)] displayed high accuracy (AUC > 0.7).

Figure 6
Multiple ROC curves depict the sensitivity versus 1-specificity for various markers: CXCL8, CXCL2, DUSP5, TNFSF11, TAC1, SELL, CDKN1A, FOSB, PTX3, and DUSP2. Panels A, B, and C show combined markers with area under the curve (AUC) ranging from 0.440 to 1.000. Panels D to M display individual marker performance with confidence intervals.

Figure 6. Diagnostic values of the 10 Hub genes. (A) Diagnostic values of 10 hub genes in GSE1919 training set; (B) Diagnostic values of 10 hub genes in GSE55235 training set; (C) Diagnostic values of 10 hub genes in GSE82107 training set; (D–M) Diagnostic values of 10 hub genes in GSE29746 validation set.

Ultimately, four diagnostic genes with AUC > 0.8 (CXCL8, CXCL2, DUSP5, and TNFSF11) were identified. Synovial tissues from five pairs of OA patients and healthy controls were collected to validate these genes. PCR experiments revealed significant differential expression of these four genes in OA synovial tissues: CXCL2 (p = 0.0140), CXCL8 (p = 0.0255), DUSP5 (p = 0.0401), and TNFSF11 (p = 0.0444), supporting their diagnostic relevance in OA (Figure 7).

Figure 7
Bar chart comparing relative mRNA levels of CXCL2, CXCL8, DUSP, and TNFSF11 between control (blue) and OA (orange) groups. Control bars are higher for all genes, indicating increased expression compared to OA. Asterisks indicate significant differences.

Figure 7. mRNA expression of 4 diagnostic genes in tissues.

3.4 Construction of miRNA-gene networks and prediction of potential therapeutic drugs

The starBase database was used to predict microRNAs (miRNAs) interacting with the four diagnostic genes (CXCL8, CXCL2, DUSP5, and TNFSF11). The top 50 predicted miRNAs for each gene were selected to construct gene networks. Analysis of the miRNA-gene network using Cytoscape identified 36 miRNAs regulating two or more diagnostic genes (Figure 8A; Supplementary Table 7).

Figure 8
Panel A shows a network diagram with red diamond nodes connected to multiple blue circle nodes, indicating interactions. Panel B is a Venn diagram with overlapping colored areas labeled CXCL8, CXLC2, DUSP5, and TNFSF11, displaying numerical intersections representing shared elements among the categories.

Figure 8. miRNA gene network and potential therapeutic drug prediction. (A) Co-expression network of diagnostic genes and target miRNAs; (B) Venn diagram between overlapping potential therapeutic drugs.

Furthermore, potential drugs influencing the expression of these diagnostic genes in OA were predicted. CXCL8 was associated with 1,220 interacting drugs, CXCL2 with 458 drugs, DUSP5 with 227 drugs, and TNFSF11 with 259 drugs. By integrating these, we identified 48 overlapping drugs as potential treatments for OA. These included lipopolysaccharides, acetaminophen, particulate matter, tetrachlorodibenzodioxin, and dexamethasone, among others (Figure 8B; Supplementary Table 8).

4 Discussion

OA stands as a primary contributor to chronic pain and physical limitations among older individuals. Its prevalence escalates notably in developed nations with aging populations, thereby posing significant economic and health burdens (13). Despite its widespread occurrence, effective treatment options remain limited (14). Current therapies primarily focus on symptom relief rather than addressing joint damage or halting disease progression, which is critical for restoring normal function (15). Furthermore, the progression and severity of OA are aggravated by various risk factors, including joint injury, repetitive strain, obesity, and genetic predisposition (16). As a result, early diagnosis and targeted treatment are crucial strategies for managing OA. However, the regulatory factors underlying OA development remain poorly understood. Additionally, the lack of reliable biochemical markers (biomarkers) for assessing OA diagnosis and prognosis necessitates the exploration of synovial tissue biomarkers as potential indicators of disease status and treatment outcomes, complementing traditional imaging and clinical assessments (17).

This study focused on bioinformatics-based screening to identify diagnostic biomarkers linked to OA progression in synovial tissues of clinical patients. Previous studies on OA biomarkers have largely centered on analyzing differentially expressed genes (DEGs) in blood or cartilage tissues. For example, Xue et al. (18) identified circRNAs as potential diagnostic biomarkers in the peripheral blood mononuclear cells of OA patients. Similarly, Da et al. (19) used weighted gene co-expression network analysis to identify and validate hub genes associated with OA in cartilage. Ren et al. (20) employed bioinformatics tools to screen for key genes involved in reversing senescence and heterochromatin instability in chondrocytes. PTX3 can alleviate the formation or deterioration of OA through intervention verification responses (21). However, these studies did not explore the role of synovial tissue lesions in OA progression, which our findings bridge through the identification of CXCL8, CXCL2, DUSP5, and TNFSF11 as synovium-specific biomarkers. The CXCL8/IL-8 axis aligns with but extends prior findings on synovial inflammation in OA (22). Our multi-dataset validation strategy (AUC > 0.8) surpasses the single-dataset approaches used in earlier synovial biomarker studies (23). The dual inflammatory-bone remodeling function of TNFSF11 provides novel insights compared to previous single-pathway biomarkers like COMP.

CXCL8 encodes interleukin 8 (IL-8), a key mediator of the inflammatory response. Altered CXCL8 expression in OA synovium suggests its potential as a target for modulating inflammatory responses in macrophages (24, 25). CXCL2, a chemokine and neutrophil chemoattractant, plays important roles in immune responses and is closely linked to osteogenic processes that influence OA progression (26, 27). DUSP5 is a negative regulator of mitogen-activated protein (MAP) kinase signaling, which affects cell proliferation and differentiation, contributing to OA development (28). TNFSF11, a member of the tumor necrosis factor cytokine family, regulates apoptosis and osteoclast differentiation, both of which influence OA progression (29). Notably, our findings resonate with emerging OA mechanisms: The chemokine-osteoclast axis (CXCL8/2-TNFSF11) mirrors the synovial-bone crosstalk observed in recent single-cell studies (30). DUSP5’s MAPK regulatory role complements the mitochondrial dysfunction mechanisms identified in WGCNA-based OA subtyping (23). GO/KEGG and GSEA analyses demonstrated that these four diagnostic genes play significant roles in immune responses and stromal remodeling associated with OA, supporting their utility as biomarkers for OA diagnosis. The predicted drug associations (e.g., acetaminophen-DUSP5 interaction) align with recent breakthroughs: Acetaminophen’s anti-catabolic effects via MAPK inhibition (31), while lipopolysaccharide’s paradoxical effects warrant dose–response validation in synovial fibroblast models. Moreover, the miRNA co-targeting network overlaps with clinically validated OA targets: hsa-miR-196a (upregulated in our study) correlates with miR-29b-5p-based hydrogel therapies for cartilage regeneration (32).

The four synovial biomarkers identified in this study (CXCL8, CXCL2, DUSP5, and TNFSF11) not only advance diagnostic capabilities but also provide actionable insights for targeted OA therapy. Their distinct roles in synovial pathophysiology align with emerging precision medicine strategies: CXCL8/CXCL2 as inflammatory modulators: Elevated expression of these chemokines highlights synovitis as a therapeutic target. Preclinical studies suggest that CXCR2 antagonists (e.g., navarixin), currently in Phase II trials for rheumatoid arthritis (NCT04000789), could be repurposed for OA patients with predominant inflammatory phenotypes. Synovial fluid analysis in such patients may guide personalized anti-inflammatory interventions. As a negative regulator of MAPK signaling, DUSP5 downregulation in OA synovium correlates with cartilage catabolism. This finding supports the potential of MAPK inhibitors (e.g., trametinib) to mitigate OA progression, particularly in patients with low DUSP5 expression. Combining DUSP5 expression profiling with NSAID regimens could optimize therapeutic efficacy. The association between TNFSF11 overexpression and osteoclast activation suggests synergy with RANKL inhibitors like denosumab. Ongoing clinical trials exploring denosumab in erosive OA (NCT04880629) may benefit from stratifying patients based on synovial TNFSF11 levels. While our biomarkers demonstrate high diagnostic accuracy for OA, distinguishing early OA from rheumatoid arthritis (RA) or traumatic synovitis requires combinatorial approaches: Unlike RA, which shows systemic autoimmunity (e.g., anti-CCP antibodies), OA synovium exhibits unique TNFSF11 elevation unaltered by IL-17 inhibition. A diagnostic algorithm integrating serum anti-CCP testing with synovial TNFSF11 quantification could enhance specificity – a priority for future validation. Transient CXCL8 elevation post-injury contrasts with sustained overexpression in chronic OA. Longitudinal monitoring of CXCL8 dynamics may aid differentiation.

Several studies have highlighted the role of miRNA dysregulation in OA pathogenesis. For instance, miR-223 interacts with NLRP3 mRNA to suppress inflammation and promote chondroprotection, inhibiting OA progression (33). miR-146a and miR-140-5p modulate transcription via DNA methylation, alleviating OA progression (34). The miR-130a/HDAC3/PPAR-γ signaling axis is also implicated in regulating OA (35). In our study, we constructed a co-expression network of diagnostic genes and their target miRNAs, suggesting that multiple miRNAs co-targeting CXCL8, CXCL2, DUSP5, and TNFSF11 may represent potential therapeutic targets for OA prevention and treatment. However, this study did not conduct preliminary validation of these miRNA targets, which is a limitation. Future research will focus on validating these targets, with the aim of discovering more effective biomarkers for OA diagnosis and treatment.

Our study began by analyzing multiple microarray datasets to identify common differentially expressed genes, thereby minimizing individual variability. Using these common DEGs, a protein–protein interaction (PPI) network was constructed to screen for hub genes linked to the disease. The identified hub genes were then used to generate diagnostic curves and assess differential expression across additional datasets and clinical samples, ensuring both internal and external validation. Finally, the construction of miRNA interaction networks expanded the potential applications of these genes, laying a solid foundation for further in-depth exploration. Nonetheless, our study has limitations. First, platform heterogeneity across datasets (all Affymetrix but different versions) may introduce technical bias, though mitigated by rigorous normalization. Second, while internal and external validations were conducted, the clinical cohort size (n = 5/group) warrants expansion in multi-center studies. Third, therapeutic predictions (e.g., LPS/acetaminophen) require experimental validation in synovial models. Further in vitro and in vivo studies are necessary to confirm these findings and elucidate the underlying mechanisms.

5 Conclusion

In this study, we identified and validated four differentially expressed genes (CXCL8, CXCL2, DUSP5, and TNFSF11) as promising diagnostic biomarkers for osteoarthritis (OA), which collectively mediate immune regulation, inflammatory responses, and bone remodeling in OA pathogenesis. Our findings advance the understanding of synovial-bone crosstalk in OA progression and highlight actionable therapeutic targets. Future directions include experimental validation in OA animal models, pharmacological modulation studies of predicted drugs, prospective clinical trials stratifying early OA patients by synovial biomarker levels, and integration with imaging/clinical data in longitudinal cohorts to establish prognostic utility. These concrete next steps will accelerate the translation of our findings toward personalized OA management.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by Clinical Ethics Committee of the First Affiliated Hospital of Kunming Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

LW: Conceptualization, Data curation, Formal analysis, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. DW: Conceptualization, Data curation, Formal analysis, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. WW: Data curation, Formal analysis, Methodology, Software, Writing – review & editing. YW: Data curation, Formal analysis, Methodology, Software, Writing – review & editing. LZ: Conceptualization, Data curation, Formal analysis, Methodology, Software, Supervision, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2025.1518580/full#supplementary-material

Footnotes

References

1. Geng, R, Li, J, Yu, C, Zhang, C, Chen, F, Chen, J, et al. Knee osteoarthritis: current status and research progress in treatment (review). Exp Ther Med. (2023) 26:481. doi: 10.3892/etm.2023.12180

PubMed Abstract | Crossref Full Text | Google Scholar

2. Liu, J, Qiu, C, Zhou, Z, Li, J, Zhen, Y, Wang, R, et al. Pentraxin 3 exacerbates psoriasiform dermatitis through regulation of macrophage polarization. Int Immunopharmacol. (2024) 130:111805. doi: 10.1016/j.intimp.2024.111805

PubMed Abstract | Crossref Full Text | Google Scholar

3. Bannuru, RR, Osani, MC, Vaysbrot, EE, Arden, NK, Bennell, K, Bierma-Zeinstra, SMA, et al. OARSI guidelines for the non-surgical management of knee, hip, and polyarticular osteoarthritis. Osteoarthr Cartil. (2019) 27:1578–89. doi: 10.1016/j.joca.2019.06.011

PubMed Abstract | Crossref Full Text | Google Scholar

4. Brophy, RH, and Fillingham, YA. AAOS clinical practice guideline summary: Management of Osteoarthritis of the knee (nonarthroplasty), third edition. J Am Acad Orthop Surg. (2022) 30:e721–9. doi: 10.5435/JAAOS-D-21-01233

PubMed Abstract | Crossref Full Text | Google Scholar

5. Peng, H, Ou, A, Huang, X, Wang, C, Wang, L, Yu, T, et al. Osteotomy around the knee: the surgical treatment of osteoarthritis. Orthop Surg. (2021) 13:1465–73. doi: 10.1111/os.13021

PubMed Abstract | Crossref Full Text | Google Scholar

6. Wang, X, Chen, T, Liang, W, Fan, T, Zhu, Z, Cao, P, et al. Synovitis mediates the association between bone marrow lesions and knee pain in osteoarthritis: data from the Foundation for the National Institute of health (FNIH) osteoarthritis biomarkers consortium. Osteoarthr Cartil. (2022) 30:1270–7. doi: 10.1016/j.joca.2022.06.004

PubMed Abstract | Crossref Full Text | Google Scholar

7. Yusuf, E, Kortekaas, MC, Watt, I, Huizinga, TWJ, and Kloppenburg, M. Do knee abnormalities visualised on MRI explain knee pain in knee osteoarthritis? A systematic review. Ann Rheum Dis. (2011) 70:60–7. doi: 10.1136/ard.2010.131904

PubMed Abstract | Crossref Full Text | Google Scholar

8. Baker, K, Grainger, A, Niu, J, Clancy, M, Guermazi, A, Crema, M, et al. Relation of synovitis to knee pain using contrast-enhanced MRIs. Ann Rheum Dis. (2010) 69:1779–83. doi: 10.1136/ard.2009.121426

PubMed Abstract | Crossref Full Text | Google Scholar

9. Furuoka, H, Endo, K, and Sekiya, I. Mesenchymal stem cells in synovial fluid increase in number in response to synovitis and display more tissue-reparative phenotypes in osteoarthritis. Stem Cell Res Ther. (2023) 14:244. doi: 10.1186/s13287-023-03487-1

PubMed Abstract | Crossref Full Text | Google Scholar

10. Moretti, L, Bizzoca, D, Geronimo, A, Moretti, FL, Monaco, E, Solarino, G, et al. Towards precision medicine for osteoarthritis: focus on the synovial fluid proteome. Int J Mol Sci. (2022) 23:9731. doi: 10.3390/ijms23179731

PubMed Abstract | Crossref Full Text | Google Scholar

11. Hunter, DJ, and Felson, DT. Osteoarthritis. BMJ. (2006) 332:639–42. doi: 10.1136/bmj.332.7542.639

PubMed Abstract | Crossref Full Text | Google Scholar

12. Shou, J, Bull, CM, Li, L, Qian, HR, Wei, T, Luo, S, et al. Identification of blood biomarkers of rheumatoid arthritis by transcript profiling of peripheral blood mononuclear cells from the rat collagen-induced arthritis model. Arthritis Res Ther. (2006) 8:R28. doi: 10.1186/ar1883

PubMed Abstract | Crossref Full Text | Google Scholar

13. GBD 2019 Diseases and Injuries Collaborators. Global burden of 369 diseases and injuries in 204 countries and territories, 1990-2019: a systematic analysis for the global burden of disease study 2019. Lancet. (2020) 396:1204–22. doi: 10.1016/S0140-6736(20)30925-9

PubMed Abstract | Crossref Full Text | Google Scholar

14. Hunter, DJ, and Bierma-Zeinstra, S. Osteoarthritis. Lancet. (2019) 393:1745–59. doi: 10.1016/S0140-6736(19)30417-9

PubMed Abstract | Crossref Full Text | Google Scholar

15. Sellam, J, Courties, A, Eymard, F, Ferrero, S, Latourte, A, Ornetti, P, et al. Recommendations of the French Society of Rheumatology on pharmacological treatment of knee osteoarthritis. Joint Bone Spine. (2020) 87:548–55. doi: 10.1016/j.jbspin.2020.09.004

PubMed Abstract | Crossref Full Text | Google Scholar

16. Silverwood, V, Blagojevic-Bucknall, M, Jinks, C, Jordan, JL, Protheroe, J, and Jordan, KP. Current evidence on risk factors for knee osteoarthritis in older adults: a systematic review and meta-analysis. Osteoarthr Cartil. (2015) 23:507–15. doi: 10.1016/j.joca.2014.11.019

PubMed Abstract | Crossref Full Text | Google Scholar

17. Qiu, C, Han, Y, Zhang, H, Liu, T, Hou, H, Luo, D, et al. Perspectives on long pentraxin 3 and rheumatoid arthritis: several potential breakthrough points relying on study foundation of the past. Int J Med Sci. (2021) 18:1886–98. doi: 10.7150/ijms.54787

PubMed Abstract | Crossref Full Text | Google Scholar

18. Xue, L, Wang, B, Zhu, J, He, Q, Huang, F, Wang, W, et al. Profiling of differentially expressed circRNAs and functional prediction in peripheral blood mononuclear cells from patients with rheumatoid arthritis. Ann Med. (2023) 55:175–89. doi: 10.1080/07853890.2022.2156596

PubMed Abstract | Crossref Full Text | Google Scholar

19. Da, Z, Guo, R, Sun, J, and Wang, A. Identification of osteoarthritis-characteristic genes and immunological micro-environment features through bioinformatics and machine learning-based approaches. BMC Med Genet. (2023) 16:236. doi: 10.1186/s12920-023-01672-y

PubMed Abstract | Crossref Full Text | Google Scholar

20. Ren, X, Zhuang, H, Jiang, F, Zhang, Y, and Zhou, P. Barasertib impedes chondrocyte senescence and alleviates osteoarthritis by mitigating the destabilization of heterochromatin induced by AURKB. Biomed Pharmacother. (2023) 166:115343. doi: 10.1016/j.biopha.2023.115343

PubMed Abstract | Crossref Full Text | Google Scholar

21. Qiu, C, Liu, J, Li, J, di, D, Li, M, Liu, T, et al. Pentraxin 3 deficiency ameliorates the severity of osteoarthritis and alleviates inflammation. Biochem Biophys Res Commun. (2023) 677:20–5. doi: 10.1016/j.bbrc.2023.08.001

PubMed Abstract | Crossref Full Text | Google Scholar

22. Kim, H, and Lee, BK. Anti-inflammatory effect of adipose-derived stromal vascular fraction on osteoarthritic temporomandibular joint Synoviocytes. Tissue Eng Regen Med. (2020) 17:351–62. doi: 10.1007/s13770-020-00268-2

PubMed Abstract | Crossref Full Text | Google Scholar

23. Wu, Y, Hu, H, Wang, T, Guo, W, Zhao, S, and Wei, R. Characterizing mitochondrial features in osteoarthritis through integrative multi-omics and machine learning analysis. Front Immunol. (2024) 15:1414301. doi: 10.3389/fimmu.2024.1414301

PubMed Abstract | Crossref Full Text | Google Scholar

24. Ummarino, A, Pensado‐López, A, Migliore, R, Alcaide‐Ruggiero, L, Calà, N, Caputo, M, et al. An in vitro model for osteoarthritis using long-cultured inflammatory human macrophages repeatedly stimulated with TLR agonists. Eur J Immunol. (2023):e2350507. doi: 10.1002/eji.202350507

Crossref Full Text | Google Scholar

25. Hsueh, MF, Bolognesi, MP, Wellman, SS, and Kraus, VB. Anti-inflammatory effects of naproxen sodium on human osteoarthritis synovial fluid immune cells. Osteoarthr Cartil. (2020) 28:639–45. doi: 10.1016/j.joca.2020.01.013

PubMed Abstract | Crossref Full Text | Google Scholar

26. Liu, Y, Chen, Y, Li, XH, Cao, C, Zhang, HX, Zhou, C, et al. Dissection of cellular communication between human primary osteoblasts and bone marrow mesenchymal stem cells in osteoarthritis at single-cell resolution. Int J Stem Cells. (2023) 16:342–55. doi: 10.15283/ijsc22101

PubMed Abstract | Crossref Full Text | Google Scholar

27. Li, M, Tian, F, Guo, J, Li, X, Ma, L, Jiang, M, et al. Therapeutic potential of Coptis chinensis for arthritis with underlying mechanisms. Front Pharmacol. (2023) 14:1243820. doi: 10.3389/fphar.2023.1243820

PubMed Abstract | Crossref Full Text | Google Scholar

28. Wu, Z, Xu, L, He, Y, Xu, K, Chen, Z, Moqbel, SAA, et al. DUSP5 suppresses interleukin-1β-induced chondrocyte inflammation and ameliorates osteoarthritis in rats. Aging (Albany NY). (2020) 12:26029–46. doi: 10.18632/aging.202252

PubMed Abstract | Crossref Full Text | Google Scholar

29. Wang, J, Zhang, Y, Cao, J, Wang, Y, Anwar, N, Zhang, Z, et al. The role of autophagy in bone metabolism and clinical significance. Autophagy. (2023) 19:2409–27. doi: 10.1080/15548627.2023.2186112

PubMed Abstract | Crossref Full Text | Google Scholar

30. Huo, Z, Fan, C, Li, K, Xu, C, Niu, Y, and Wang, F. Identification and validation of hub m7G-related genes and infiltrating immune cells in osteoarthritis based on integrated computational and bioinformatics analysis. BMC Musculoskelet Disord. (2025) 26:333. doi: 10.1186/s12891-025-08539-6

PubMed Abstract | Crossref Full Text | Google Scholar

31. Zhang, Y, Shen, Y, Kou, D, and Yu, T. Identification and experimental verification of biomarkers related to butyrate metabolism in osteoarthritis. Sci Rep. (2025) 15:11884. doi: 10.1038/s41598-025-97346-z

PubMed Abstract | Crossref Full Text | Google Scholar

32. Zhu, J, Yang, S, Qi, Y, Gong, Z, Zhang, H, Liang, K, et al. Stem cell-homing hydrogel-based miR-29b-5p delivery promotes cartilage regeneration by suppressing senescence in an osteoarthritis rat model. Sci Adv. (2022) 8:eabk0011. doi: 10.1126/sciadv.abk0011

PubMed Abstract | Crossref Full Text | Google Scholar

33. Liu, W, Liu, A, Li, X, Sun, Z, Sun, Z, Liu, Y, et al. Dual-engineered cartilage-targeting extracellular vesicles derived from mesenchymal stem cells enhance osteoarthritis treatment via miR-223/NLRP3/pyroptosis axis: toward a precision therapy. Bioact Mater. (2023) 30:169–83. doi: 10.1016/j.bioactmat.2023.06.012

PubMed Abstract | Crossref Full Text | Google Scholar

34. Luobin, D, Huajun, W, Yao, LI, Jia, LI, Ling, LI, Yangping, G, et al. Electroacupuncture stimulating Neixiyan (EX-LE5) and Dubi (ST35) alleviates osteoarthritis in rats induced by anterior cruciate ligament transaction affecting DNA methylation regulated transcription of miR-146a and miR-140-5p. J Tradit Chin Med. (2023) 43:983–90. doi: 10.19852/j.cnki.jtcm.2023.05.004

PubMed Abstract | Crossref Full Text | Google Scholar

35. Saber, MM, Mahmoud, MM, Amin, HM, and Essam, RM. Therapeutic effects of combining curcumin and swimming in osteoarthritis using a rat model. Biomed Pharmacother. (2023) 166:115309. doi: 10.1016/j.biopha.2023.115309

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: osteoarthritis, biomarkers, bioinformatics, diagnostic markers, drug prediction

Citation: Wu L, Wen D, Wang W, Wang Y and Zhang L (2025) Identification of key genes in osteoarthritis development: biomarker discovery and therapeutic targets. Front. Med. 12:1518580. doi: 10.3389/fmed.2025.1518580

Received: 08 November 2024; Accepted: 23 June 2025;
Published: 02 July 2025.

Edited by:

João Eurico Fonseca, University of Lisbon, Portugal

Reviewed by:

Bodhisattwa Banerjee, University of Vermont, United States
Shuo Wang, Shanghai Jiao Tong University, China
Cheng Qiu, Peking Union Medical College Hospital (CAMS), China

Copyright © 2025 Wu, Wen, Wang, Wang and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Li Zhang, ODE1NDA5Nzg5QHFxLmNvbQ==

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.