ORIGINAL RESEARCH article

Front. Endocrinol., 08 May 2025

Sec. Experimental Endocrinology

Volume 16 - 2025 | https://doi.org/10.3389/fendo.2025.1545670

Identification and validation of immune-related and inflammation-related genes in endometriosis

Yanzhen Zhou,&#x;Yanzhen Zhou1,2†Jingmin Li&#x;Jingmin Li2†Meihuan Chen&#x;Meihuan Chen2†Hailong Huang,*Hailong Huang1,2*
  • 1College of Clinical Medicine for Obstetrics & Gynecology and Pediatrics, Fujian Medical University, Fuzhou, Fujian, China
  • 2Fujian Maternity and Child Health Hospital, College of Clinical Medicine for Obstetrics & Gynecology and Pediatrics, Fujian Medical University, Fuzhou, Fujian, China

Aim: Endometriosis is characterized by immune evasion and progressive inflammation. This study aimed to identify key genes related to immune and inflammation in endometriosis.

Methods: Differentially expressed genes between patients with and without endometriosis were identified from the GEO database. Furthermore, immune- and inflammation-related genes (IRGs) were identified by intersecting the differentially expressed genes with known immune and inflammatory genes. Functional analyses of the GO and KEGG pathways of these genes were performed. Subsequently, three machine learning models—LASSO regression, SVM-RFE, and Boruta—were conducted to identify the potential key genes in endometriosis. Finally, the expressions of key genes in endometriosis were verified in two validation cohorts using an online database and qRT-PCR, and their immunoregulatory properties were verified.

Results: A total of 13 differentially expressed IRGs were identified. Using machine learning algorithms, five key genes were selected in the endometriosis: BST2, IL4R, INHBA, PTGER2, and MET. Furthermore, the three hub genes exhibited consistent trends across both training and validation datasets. The three keys also correlated with infiltrated immune cells, checkpoint genes, and immune factors in various degrees. Finally, validation analysis using the online database and qRT-PCR confirmed that MET expression aligned with outcomes from both training and validation datasets.

Conclusion: Three immune- and inflammation-related genes were identified as potential biomarkers of endometriosis, providing new insights into the molecular mechanisms underlying immune function in endometriosis. The immune-related function of MET, particularly its correlation with NK cell activity in endometriosis, will be the focus of future studies.

1 Introduction

Endometriosis(EM) is one of the most prevalent gynecological diseases, with a 10%-15% prevalence rate among women of reproductive age (1). It is estimated that endometriosis affects approximately 190 million women worldwide (2). It clinically manifests as severe pelvic pain and reduced fertility and is characterized by the presence and growth of endometrial tissue outside the uterus. This disease significantly impairs patients’ quality of life and imposes a heavy burden on the healthcare systems (3, 4). The progression of the disease is slow and often takes 7-10 years before the onset of symptoms, resulting in delays in diagnosis and optimal treatment (5). The pathogenesis of endometriosis remains poorly understood. Despite decades of research, noninvasive diagnostic markers for endometriosis are lacking, and no curative treatment is available. Therefore, it is imperative to identify the potential diagnostic biomarkers and molecular mechanisms of endometriosis to improve early diagnosis and treatment.

The widely accepted theory of endometriosis pathogenesis is a combination of retrograded menses and the immunosuppression hypothesis. Disturbances of the immune microenvironment are critical factors in the pathophysiology and development of endometriosis (6). Endometriosis is a chronic inflammatory disorder characterized by immune evasion and progressive inflammation (7). Inflammation resulting from immune dysregulation is the primary mechanism involved in cell proliferation and infiltration (8). Consequently, it is important to explore the genes associated with immune response and inflammation related to endometriosis.

Owing to the widespread adaptation of computer-based technology in the health sector and the subsequent availability of large health databases, disease prediction, and medical informatics have recently gained attention from the data science research community (9). Machine learning algorithms have also been extensively applied to screen genes associated with diseases (10, 11). Abnormal immune and inflammatory changes may be responsible for major symptoms of endometriosis and contribute to the development of endometriosis and the growth of endometrial tissue (12). Building on these characteristics, this study aims to identify novel and important genes as diagnostic markers and therapeutic targets with endometriosis by constructing and validating immune-related and inflammation-related genes (IRGs) using machine learning algorithms. We identified three critical genes and verified their potential as diagnostic markers. Furthermore, we analyzed the potential relationship between key genes and their immunoregulatory properties, demonstrating the significant role of these genes in the immunopathogenesis of endometriosis. Finally, MET’s downregulation in the online database and clinical samples from the endometriosis group versus the control group. In conclusion, this study offers a novel perspective for the diagnosis and treatment of endometriosis at the molecular level.

2 Materials and methods

2.1 Collection of tissue samples

This study was approved by the Medical and Ethics Committees of Fujian Maternity and Child Health Hospital (No. 2023KY140-KS001), and informed consent was obtained from each patient prior to enrollment. Ectopic endometrial tissue from two patients with broad ligament endometriosis, three patients with sacral ligament endometriosis, and five patients with ovarian endometriosis (all in the follicular phase, n = 10) (EM group) and 10 eutopic endometrial tissues from women (follicular phase,n = 10) with tubal factor infertility without endometriosis (control group). All 20 women underwent hysteroscopy and laparoscopy surgery at the Department of Obstetrics and Gynecology, Fujian Maternity and Child Health Hospital, between Oct 2023 to Jun 2024.

2.2 Data acquisition and processing

In this study, the gene expression datasets GSE7305, GSE23339, and GSE7307 were obtained from the GEO database (https://www.ncbi.nlm.nih.gov/geo/). The study design is illustrated in Figure 1. The training dataset GSE7305 comprised 10 samples without endometriosis and 10 samples with endometriosis, which were sequenced using GPL570 technology. The validation dataset GSE23339, comprising 9 non-endometriosis and 10 endometriosis samples, was sequenced using GPL6102. Additionally, GSE7307, which included 18 endometriosis and 18 non-endometriosis samples, was sequenced using GPL570.

Figure 1
www.frontiersin.org

Figure 1. The study design. DEGs, differential expressed genes; ROC, Receiver Operator Characteristic; GSEA, Gene Set Enrichment Analysis; GGI, Gene-Gene Interaction Network; ANN, Construction of artificial neural network.

Clinical information and sequencing data were obtained in compliance with GEO requirements. Data filtering, background correction, log2 transformation, and normalization were performed on the datasets.

2.3 Identification of differentially expressed IRGs

The differentially expressed genes (DEGs) between the endometriosis and non-endometriosis groups were analyzed using the LIMMA package in R Studio. Adj.P <0.05 and |log2FC| >1.0 were set to determine the significant differentially expressed genes in patients with endometriosis.

This study intersected the differentially expressed genes with immune-related genes and inflammatory-related genes as the differential expressed IRGs using the R package ‘ggVenndiagram ‘.

2.4 Functional enrichment analysis and construction of protein-protein interaction network

To identify specific biological pathways in IRGs, we conducted a comprehensive analysis. The R package ‘clusterProfler’ was used to perform Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses to identify the functions and pathways. Visualization of the results was achieved using the R package ‘ggplot2’. GO annotations encompassed three main categories: biological processes (BP), cellular components (CC), and molecular functions (MF).

To construct the Protein-Protein Interaction (PPI) Network, the differentially expressed IRGs were simultaneously input into the STRING database (https://cn.string-db.org/), with the species defined as Homo sapiens and the PPI Network parameter set to 0.4 (medium confidence). Subsequently, Cytoscape (https://cytoscape.org/) was employed to visualize the protein-protein interaction network.

2.5 Machine learning and screening for potential key genes

To select disease diagnostic markers in the above IRGs, we developed three machine-learning models: LASSO regression, SVM-RFE, and Boruta. The potential key genes were determined by overlapping LASSO regression analysis, SVM analysis, and the Boruta algorithm results.

2.6 Identification and validation of the key genes

The validation datasets GSE23339 and GSE7307 were utilized to identify the expression trend of potential key genes. Furthermore, both the training and validation cohorts were utilized to evaluate the discriminative ability of hub genes in distinguishing endometriosis from non-endometriosis samples. The diagnostic performance of these hub genes was assessed by plotting the AUC using the R package ‘pROC’.

To further evaluate the predictive capability of the key genes, we developed a nomogram using the R package ‘rsm’ with the training set. The predictive performance of the nomogram model was assessed through calibration curves and decision curve analysis (DCA).

2.7 Gene-gene interaction networks and enrichment pathway analysis of key genes

GeneMANIA (https://genemania.org/) searches through extensive, publicly available biological datasets to identify functionally related genes. We constructed Gene–Gene Interaction (GGI) networks for key genes using the GeneMANIA platform and selected the top 20 most relevant genes as well as the seven most important pathways associated with the three key genes.

Gene Set Enrichment Analysis (GSEA) was performed for each key gene to explore the potential functions of the three key genes. First, we performed Spearman correlation analysis on the training set to assess the relationship between each key gene and all other genes using the R package ‘psych’. Then the enrichment analysis of GSEA was conducted using the R package ‘clusterProfiler’, based on the KEGG background gene set. The results were screened using the criteria |NES|>1 and p <0.05.

2.8 The construction and verification of the artificial neural network model

The training cohort was utilized to develop the Artificial Neural network (ANN) model, while the validation cohort was used for signature validation. The discrimination and net clinical benefit of the ANN model were assessed using a Receiver operating characteristic (ROC) curve.

2.9 Analysis of the relationship between key genes and immunoregulatory properties

2.9.1 Immune cell infiltration pattern

To elucidate the differences in infiltrated immune cells between the control and EM samples, the training cohort was analyzed using ssGSE(single sample gene set enrichment analysis), yielding enrichment scores for 28 immune cells based on immune-related genes referenced in the literature (PMID:28052254). Spearman’s correlation analysis was conducted to investigate the relationship between hub gene expression and immune cell abundance using the ‘psych’ package in R Studio. The results were visualized using the R package ‘ggplot2’. Statistical significance was set at P <0.05.

2.9.2 Correlation between key genes with immune checkpoints

This study involved a comparison of 53 immune checkpoints [PMID: 36979463, PMID: 36793711]. We analyzed differences in immune cell scores between the disease and control groups using the rank sum test and visualized the results with boxplots. Following this, we conducted Spearman’s correlation analysis to investigate the relationship between key genes and 18 differential immune checkpoints.

2.9.3 Correlation with immune factors

The relationships between key genes and different immune factors, including 24 chemokines, 14 immunosuppressive factors, and 27 immunostimulatory factors, were evaluated using Spearman’s correlation analysis.

2.10 Validation of key genes in an online database (Turku Endometriosis Database)

The online Turku Endometriosis Database (https://endometdb.utu.fi) was used to validate the expression of five key genes in various tissues of patients with EM.

2.11 RNA isolation, reverse transcription, and real-time RT-PCR

Total RNA was extracted from the ectopic endometrial tissues of the EM group and endometrial tissues of the control group using TRIzol reagent (RNAprep Pure Tissue Kit, TIANGEN, Beijing, China) and was then reverse-transcribed into cDNA using the Primescript reverse transcription reagent kit (Takara, Dalian, China) following the manufacturer’s instructions.

Real-time PCR was performed using 2×SG Fast qPCR Master Mix (BBI, Roche, Switzerland) on a LightCycler480II Real-Time PCR System (Roche, Rotkeruz, Switzerland). A 10ul PCR reaction that included 1ul of cDNA, 5ul of sybrGreen qPCR Master Mix, and 0.2ul of each primer was prepared and adjusted to the final volume with double distilled H2O (ddH2O). b-actin was used as the internal control for each run. Reactions were performed according to the manufacturer’s protocol. The relative mRNA expression ratio was quantified using the 2 (-ΔΔCt) method. The Real-time RT-PCR primers used are listed below.

H-MET-F 5’ AATCTTGGGACATCAGAGGGT 3’

H-MET-R 5’ TAATGTATGCTCCACAATCACTTCT 3’

H-b-actin-F 5’ TAGTTGCGTTACACCCTTTCTTG 3’

H-b-actin-R 5’ TCACCTTCACCGTTCCAGTTT 3’

2.12 Western blotting

After extraction of total tissue proteins from RIPA lysates(Servicebio, Wuhan), protein concentrations were quantified using the BCA Protein Quantitative Assay Kit(Jabes Biotechnology Guangzhou). The proteins were denatured by boiling at 100°C for 10 min. Subsequently, the protein samples (40 ugs per well) were separated via electrophoresis on a 10% SDS-PAGE gel and transferred to a PVDF membrane(Millipore, USA). The membrane was incubated with a primary antibody rabbit anti-MET antibody(Abclonal Wuhan) and rabbit anti-β-actin(Affinity USA) at 4°C overnight). The next day, after washing the PVDF membrane three times with TBST (Servicebio Wuhan), it was incubated with an HRP-conjugated secondary antibody for 2 h at room temperature on an orbital shaker. Lastly, the membrane was exposed for imaging using Immobilon Western Chemiluminescent HRP Substrate (Servicebio, Wuhan). The development was carried out in a dark room, and the grayscale values of the target bands were analyzed after taking photographs.

2.13 Statistical analysis

Statistical analyses were conducted using Student’s and Mann-Whitney tests with Prism version 10.0 Graphpad software. A P-value of<0.05 was considered statistically significant.

3 Result

3.1 Identification of differentially expressed IRGs in EM

We screened 1189 differentially expressed genes(DEGs) between EM and control samples in the GSE7305 dataset, including 634 upregulated and 555 downregulated DEGs (Supplementary Table S1). Volcano and heat maps of the DEGs are shown in Figures 2A, B.

Figure 2
www.frontiersin.org

Figure 2. Identification of DEGs and functional enrichment analysis of differential expressed IRGS. (A) The heatmap and (B) volcano plot of differentially expressed genes (DEGs) between EM and control samples. (C) The venngram of DEGs and IRGs. (D) Analysis of PPI. (E–H) Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) functional enrichment analysis of differential expressed IRGs.

To screen differential expressed IRGs, we further overlapped DEGs and immune-related and inflammatory-related genes, and 13 IRGs were acquired, including BST2, C3AR1, IL4R, OSMR, INHBA, CXCL8, PTGER2, CCL2, CD14, ADM, IL7R, MET, and C5AR1 (Figure 2C).

3.2 Functional enrichment analysis of differential expressed IRGs

To investigate potential interactions among these genes, we performed the analysis of the protein-protein interaction (PPI) network involving 13 candidate genes. Fifteen interaction pairs, consisting of 9 genes, were identified. Refer to Figure 2D.

Subsequently, we annotated the 13 genes using the KEGG pathway and GO function to investigate the biological significance of each gene. A total of 217 Gene Ontologies (GOs) and 24 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were enriched, as indicated in Supplementary Tables S2, S3. The primary enriched terms in the GO-CC category were ‘secretory granule membrane’, ‘external side of plasma membrane’, and ‘azurophil granule membrane’. In the GO-BP category, the top term included ‘response to molecules of bacterial origin’, ‘regulation of chemotaxis’, ‘positive regulation of response to external stimulus’, and ‘neutrophil chemotaxis’ (Figure 3B; P<0.05). Additionally, the GO-MF categories were predominantly enriched in ‘immune receptor activity’, ‘complement receptor activity’, and ‘cytokine receptor activity’ (Figures 2E–G; P<0.05). To further determine the potential signaling pathway, we analyzed the KEGG pathway. As Figures 2E-H indicated the IRGs were significantly enriched in several KEGG pathways, such as ‘cytokine-cytokine receptor interaction,’ ‘Alcoholic liver disease,’ and ‘Malaria,’ among others. These results indicate that certain IRGs may play a role in the immune response.

Figure 3
www.frontiersin.org

Figure 3. Screen for potential key genes. (A) Genes identifed by the LASSO regression analysis. (B) Optimal biomarker selection using the SVM-RFE analysis. (C) The 12 genes selected by the Boruta algorithm. (D) Venn diagram showing the potential key genes intersected by LASSO, SVM, and Boruta.

3.3 Identification of key genes

We use three machine learning models to screen potential hug genes. LASSO regression analysis, SVM-RFE analysis, and the Boruta algorithm were employed to identify key genes in the training dataset (Figures 3A–C). We screened meaningful five key genes: BST2, IL4R, INHBA, PTGER2, and MET. These genes resulted from the intersection of 9 genes from LASSO, 5 genes from SVM-RFE, and 12 genes from Boruta (Figure 3D).

3.4 Validation of the key genes

Two validation datasets were used to confirm the differential expression of the hub genes and to demonstrate their diagnostic capacity for endometriosis (Figure 4A). The three hub genes exhibited consistent trends across both training and validation datasets. Two genes exhibited an up-regulated trend, while one gene displayed a down-regulated trend in the endometriosis groups across both the training and validation databases.

Figure 4
www.frontiersin.org

Figure 4. Identification and Validation of key genes. (A) Expression profiles of 5 potential key genes between EM and control group in the training and 2 test cohorts. (B) Receiver operating characteristics (ROC) curves of 3 key genes in training and test cohorts. (C) Nomogram displaying the predicted risk for EM based on the KEY genes. (D) Calibration curve showing the predicted performance of the nomogram. (E) DCA showing the clinical benefits of the nomogram. (F) ROC curves showing the diagnostic performance of the feature genes.

The three genes were identified as key targets and their ability to differentiate between EM samples and control samples was evaluated using ROC analysis. The area under the curve (AUC) values of the three key genes were greater than 0.8 in both the training and validation sets, indicating their strong diagnostic value in EM (Figure 4B).

We also developed a nomogram model to assess the predictive ability of key genes. The calibration curve indicated the high accuracy of the nomogram model in predicting EM (Figure 4C). DCA indicated potential benefits for patients utilizing the nomogram model, showing a greater clinical advantage observed compared to the single gene curve (Figure 4F). The AUC values for the ROC curves in the testing sets approached 1 for the nomogram model (Figure 4E).

3.5 GGI networks, GSEA enrichment analysis, and ANN model

We used the GeneMANIA platform to establish a Gene-Gene Interaction Network (GGI). The analysis identified the top 20 correlated genes and the top 7 significant pathways related to the three key genes. The three key genes are linked to the pathways involving peptidyl-tyrosine phosphorylation, peptidyl-tyrosine modification, and regulation of peptidyl-tyrosine phosphorylation (Figure 5A).

Figure 5
www.frontiersin.org

Figure 5. Gene-gene interaction networks and Enrichment pathway analysis of key genes. (A) GGI of key genes. (B–D) MET, IL4R and BTS2 GSEA analysis. (E) Construction of artificial neural network (ANN) and model evaluation. (F) ROC curves of the ANN model.

We used GSEA analysis to study the specific signaling pathways involved in key genes in endometriosis. The findings revealed that three genes were significantly enriched in cell cycle pathways. Additionally, BST2 and MET were found to be enriched in the proteasome, ribosome, and spliceosome pathways (Figures 5B–D).

Three hub genes were integrated to construct an artificial neural network(ANN) in the training datasets (Figure 5E). A prediction model was then established based on the weights of the three key genes and the neural network. The ROC curves indicated that the key genes exhibited AUC values higher than 0.9 in the training set, demonstrating their significant potential in neural network prediction (Figure 5F).

3.6 Validation of the potential immunoregulatory properties of key genes

To investigate the differences in the immune microenvironment between the EM group and the control group, we employed immune infiltration analysis as well as an association analysis between key genes and immune checkpoints and factors.

3.6.1 Immune infiltration analysis

Significant differences were observed in twenty-two kinds of immune cells between the disease and control groups (p<0.05) (Figures 6A, B). The heatmap revealed that, among the 22 kinds of immune cell types, BST2 demonstrated the highest positive correlation with CD56bright natural killer cells (r = 0.87, P<0.05). Otherwise, CD56bright natural killer cells correlated with Memory B cells(r=-0.82, P<0.05) (Figure 6C). Conversely, MET exhibited the most significant negative correlation with this cell type (r = -0.81, P<0.05) (Figure 6D).

Figure 6
www.frontiersin.org

Figure 6. The profiles of immune cell subtype distribution pattern in training cohort. (A) The boxplots of the immune infiltration landscape in EM and control. (B) The heatmap of 22 immune differentially infiltrated fraction. (C) Correlation heatmap of all 22 immune cells. (D) Correlation heatmap of 3 key genes and 22 immune cells.

3.6.2 Association between key genes and checkpoints

Differential comparisons of immune cell scores between the EM and control groups indicated significant differences at 18 immune checkpoints as illustrated by box plots using the ggplot2 package (Figure 7A). Additionally, the relationships between the key genes and the 18 differential immune checkpoints were examined (Figure 7B). The strongest positive correlation was found between BST2 and CASP1 (r = 0.85, P < 0.05), whereas the most significant negative correlation was identified between BST2 and PDIA3 (r = -0.85, P < 0.05). Furthermore, a significant positive correlation was observed between MET and HMGB1 (r = 0.77, P < 0.05), whereas a notable negative correlation was detected between MET and P2RX7 (r = -0.8, P < 0.05). The most robust positive correlation was observed between IL4R and CASP1 (r = 0.89, P < 0.05), whereas the most notable negative correlation was observed between IL4R and HMGB1 (r = -0.76, P < 0.05).

Figure 7
www.frontiersin.org

Figure 7. (A) Expression levels of immune checkpoint genes in EM and control group. (B) correlation between key genes and checkpoint. (C–E) Correlation between key and chemokine, immunosuppressive cytokine and immunostimulatory factor.

3.6.3 Analysis of correlation between key genes with immune factors

The examination of the relationship between key genes and chemokines indicated that the most significant positive correlation existed between BST2 and CCL8 (r = 0.77, P<0.05), whereas the most substantial negative correlation was observed between BST2 and XCL1 (r = -0.56, P<0.05). The strongest positive correlation between MET and XCL1 was found (r = 0.83, P<0.05), whereas the most significant negative correlation was found between MET and CCL2 (r = -0.68, P<0.05). Conversely, the most significant positive correlation was observed between IL4R and CCL2 (r = 0.84, P < 0.05), whereas the most negative correlation was observed between IL4R and XCL1 (r = -0.50, P < 0.05) (Figure 7C).

The analysis of the relationships between the key genes and immunosuppressive factors revealed several significant correlations. The strongest positive correlations were found between: BST2 and LAG3 (r=0.90, P<0.05), MET and KIR2DL3 (r=0.61, P<0.05), and IL4R and CSFIR (r=0.84, P<0.05). The most significant negative correlations included: BST2 and KIR2DL1 (r = -0.46, P < 0.05), MET and LAG3 (r = -0.63, P < 0.05), and IL4R and KIR2DL1 (r = -0.54, P < 0.05) (Figure 7D).

The analysis of the relationship between key genes and immunostimulatory factors revealed a strong positive correlation between BST2 and CD40, TNFSF14 (r=0.70, P<0.05), MET and IL6R, LTA (r=0.45, P<0.05), and IL4R and TNFSF14 (r=0.93, P<0.05). On the other hand, the most significant negative correlations were between BST2 and LTA (r = -0.33, P < 0.05), MET and CD40 (r = -0.70, P < 0.05), and IL4R and LTA (r = -0.52, P < 0.05) (Figure 7E).

3.7 Validation of key genes in an online database and qRT-PCR

In addition to the GEO datasets, we identified the expression of the three hub genes in an online database, noting that the MET gene exhibited a consistent trend in both the GEO datasets and the online database (Figure 8A). In contrast, the gene BST2 exhibited an opposing trend, while IL4R exhibited a minor variation. Consequently, we assessed the expression of MET through qRT-PCR analysis and Western Blotting experiments using ectopic endometrial tissue from the EM group and normal endometrium from the control group (Figure 8B). Consistent with the bioinformatics analysis results, mRNA and protein expression of MET was significantly lower in EM tissues than in controls.

Figure 8
www.frontiersin.org

Figure 8. (A–C) Expression trend of three hub genes in an online dataset. (D) Protein expression MET detected by Western Blot in tissue sample. (E) Western Blot verified the difference of MET in protein in patients' tissue between EM and control. (F) RT-qPCR verified the difference in MET gene mRNA in patients' tissue between EM and control.

4 Discussion

Endometriosis is a chronic, inflammatory disease characterized by the growth of endometrial tissue outside of the uterine cavity. Immunological dysfunction has been proposed as a critical facilitator of ectopic lesion growth following retrograde menstruation of endometrial debris. One of the major challenges in contemporary gynecology is understanding and preventing the pathophysiology of endometriosis. An enhanced understanding of immune and inflammatory mechanisms occurring at the site of endometriotic lesion development may provide invaluable insight into the disease’s pathogenesis of the disease.

In this study, we analyzed bioinformatics databases to identify the potential key genes involved in endometriosis. Initially, we intersected 1,189 DEGs and 1,996 immune and 200 inflammation genes, screening out 13 DEGs correlated with inflammation and immunity. Subsequently, LASSO regression analysis, SVM-RFE analysis, and Boruta machine learning were performed to select five potential key genes from 13 DEGs. By comparing the differences in the expression of these five genes in both the training and external validation sets, we identified 3 key genes (MET, IL4R, and BST2) that exhibited consistent trends and significant differences between the training and validation sets. Finally, we validated MET expression in online datasets and endometriosis samples to confirm its role as a hub gene.

In the present study, based on IRGs, we identified a hub gene that is closely associated with intrinsic characteristics, such as molecular markers and therapeutic targets. Our findings confirm the close correlations between these three genes and immunoregulatory mechanisms, demonstrating both positive and negative relationships with immune cells, chemokines, and immune factors to varying degrees. Endometriosis, a disease influenced by immune-related mechanisms, has prompted numerous studies to identify biomarkers associated with immune function. Wang et al. utilized lasso regression and MCP-counter to identify immune-related biomarkers (13). In another study, He et al. intersected differentially expressed genes (DEGs) with transcription factors (TFs) from two databases and immune-related genes (IRGs) from the ImmPort database. Subsequently, they constructed a protein-protein interaction (PPI) network using Cytoscape to identify hub genes (14). In contrast, our study intersected DEGs with immune and inflammation-related genes, integrated machine learning results, validated key findings using training datasets and online databases, and investigated immunoregulatory properties. We aimed to screen and identify key genes using a comprehensive step-by-step screening process, enhancing the accuracy and consistency of our research.

The MET proto-oncogene, located on chromosome 7q21–q31 (15), encodes the tyrosine kinase receptor of the hepatocyte growth factor (HGF). It is characterized as a single-pass transmembrane receptor comprising an extracellular domain, transmembrane and juxtamembrane regions, and a tyrosine kinase domain (16). In cancer, aberrant MET signaling contributes to tumor invasion and regulates various physiological processes, including embryogenesis, wound healing, liver regeneration, angiogenesis, and immunomodulation (1720). In a study by Finisguerre et al., Met was essential for neutrophil chemoattraction and cytotoxicity. MET is induced by inflammatory stimuli in both mouse and human neutrophils, subsequently activating the endothelium, inducing iNOA production upon HGF stimulation, and promoting cancer cell death (21). These findings were consistent with those of our analysis, suggesting that the immunological role of MET may be linked to the onset and progression of endometriosis.

IL4R is a specific cell surface receptor with which IL-4 interacts to exert its activities. The IL-4 receptor (IL-4R) signaling system consists of two receptors: type 1 IL-4R and type 2 IL-4R (22, 23). IL-4 and IL-13 share a common receptor subunit and signaling pathway involving the Janus kinase (JAK): STAT6 pathway (24). IL-4 and IL-13 play important roles in Th2 immune responses, metabolism, tissue regeneration, remodeling, cancer, learning, and memory (25). Bone marrow stromal cell antigen 2 (BST2) is expressed in numerous cells, including hepatocytes, plasma blast cells, early plasma cells, mature B cells, dendritic cells, pneumocytes, monocytes, pancreatic cells, kidney cells, and vascular endothelial cells, suggesting that it plays vital roles in the innate immune response against viral infection and other physiological processes (26, 27). Increased BST2 expression has been observed in multiple human cancers, including hematological tumors and solid tumors, especially in breast cancer, HCC, gastrointestinal cancer, and lung cancer (28) via activating signaling pathways, such as EGFR/AKT, NF-κB/ERK, and GRB2/DIM/caspase 3 (2931). All three genes are involved in signatures related to immunomodulation, but their roles in endometriosis remain unreported, and further studies are needed to elucidate their function in endometriosis.

We validated the three hub genes using the Turku Endometriosis Database and tissue samples collected from women with and without non-endometriosis. The expression of the MET gene in both the online database and clinical tissue showed the same downregulated trend in the EM group as in the training and two validation sets. MET is overexpressed in a variety of tumors, whereas we observed low expression levels of MET in the EM group. MET may play a distinct role in immune regulation in endometriosis. The composition of the immune microenvironment in endometriotic lesions is associated with different disease stages, phenotypes, and symptoms of the disease (32). The functional properties of MET in EM may vary depending on the stages and phenotype of the endometriosis. Endometriotic lesions develop in complex and dynamic environments. MET was the target gene of our study, and further studies are required to understand its role in the pathogenesis of endometriosis.

In endometriosis, the network of immune cell populations originating from both the innate and adaptive immune systems creates an optimal environment for lesion formation in the ectopic endometrium. Numerous studies have attempted to construct immune-related gene signatures as diagnostic markers and correlate these genes with patterns of immune cell infiltration. Macrophages play a crucial role in the development of endometrial lesions and contribute to ectopic cell survival (33). A previous study found that the number of macrophages increased in the eutopic endometrium of patients with endometriosis (34). Additionally, macrophages polarize from M2 (anti-inflammatory phenotype) to M1 (proinflammatory phenotype) in the eutopic endometrium of patients with endometriosis compared to the control group (35). Ding et al. (36) and Cui et al. (37), focused on screening M2 macrophage-related genes based on the theory that immune cell dysfunction plays an important role in endometriosis using different analysis methods. In this study, we investigated the correlation between signature genes and infiltrating immune cells. It has been shown that CD56bright natural killer cells, related to two hub genes (MET and BST2), may exhibit dysfunction in EM. NK cells are key components of the innate immune system, serving as the initial defense against viral infections and tumor growth while playing a critical role in maintaining normal tissue homeostasis (38). NK cell populations exist in both the peripheral circulation and the uterus and are primarily characterized as CD56dimCD16+ and CD56brightCD16-, respectively (39). One study indicated that NK cells had less cytotoxicity in women with endometriosis than in those without it (40). Decreased cytotoxic can impair the clearance of endometrial fragments by NK cells in the peritoneal cavity, facilitating the implantation of ectopic implants. Julia et al. found that NK cytotoxic activity in the endometrium of women with endometriosis was lower than in normal endometrium. However, in cases where the endometriosis patient was infertile and/or experienced recurrent miscarriage, NK cytotoxic activity was elevated (41). Altered immune cells in the peritoneal cavity of women with endometriosis can influence other endometrial cell populations, thereby increasing their susceptibility to lesion formation, survival, and growth. Further studies are required to clarify the functional properties and interaction between MET and NK cells in endometriosis, which may provide in-depth insights into the pathophysiology of endometriosis.

The present study has several limitations. First, we validated the downregulation of MET in endometriosis(EM). However, the regulatory mechanisms of MET in EM need to be investigated, particularly the correlation between MET and NK cells. This can be achieved through both in vivo and in vitro experiments. Second, there was a lack of clinical information regarding the patient’s fertility and disease severity. Therefore, the mechanism by which this gene affects the progression of endometriosis throughout the menstrual cycle needs to be confirmed by experiments based on large sample numbers.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving humans were approved by Medical and Ethics Committees of Fujian Maternity and Child Health Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

YZ: Data curation, Formal analysis, Writing – original draft, Writing – review & editing. JL: Validation, Writing – original draft. MC: Data curation, Writing – review & editing. HH: Funding acquisition, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This project was supported by Special Financial Fund of Fujian Province (204008004).

Acknowledgments

We thank Professor Xi Xie, Professor Chaobin Liu, Professor Jingsong Yi, and Professor Shunhe Lin at Fujian Maternity and Child Health Hospital for providing ectopic endometrial and endometrium samples. We would like to thank Editage (www.editage.com) for English language editing.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2025.1545670/full#supplementary-material

Abbreviations

IRG, immune-related and inflammation-related gene; PPI, Protein-Protein Interaction Network; SVM, Supports vector machines; RF, Random forest; GEO, Gene expression omnibus; GSVA, Gene set variation analysis; ANN, artificial neural network; GO, Gene ontology; KEGG, Kyoto encyclopedia of genes and genomes; AUC, Area under the curve; qRT-PCR, Quantitative reverse-transcription polymerase chain reaction; PCA, Principal component analysis; DCA, Decision curve analysis.

References

1. Shafrir AL, Farland LV, Shah DK, Harris HR, Kvaskoff M, Zondervan K, et al. Risk for and consequences of endometriosis: a critical epidemiologic review. Best Pract Res Clin Obstet Gynaecol. (2018) 51:1–15. doi: 10.1016/j.bpobgyn.2018.06.001

PubMed Abstract | Crossref Full Text | Google Scholar

2. Kvaskoff M, Mahamat-Saleh Y, Farland LV, Shigesi N, Terry KL, Harris HR, et al. Endometriosis and cancer: a systematic review and meta-analysis. Hum Reprod Update. (2021) 27:393–420. doi: 10.1093/humupd

PubMed Abstract | Crossref Full Text | Google Scholar

3. Zondervan KT, Becker CM, Koga K, Missmer SA, Taylor RN. Endometriosis. Nat Rev Dis Primers. (2018) 4:10. doi: 10.1038/s41572-018-0011-x

PubMed Abstract | Crossref Full Text | Google Scholar

4. Simoens S, Dunselman G, Dirksen C, Hummelshoj L, Bokor A, Brandes I, et al. The burden of endometriosis: costs and quality of life of women with endometriosis and treated in referral centres. Hum Reprod. (2012) 27:1292–9. doi: 10.1093/humrep/des073

PubMed Abstract | Crossref Full Text | Google Scholar

5. Bakhtiarizadeh MR, Hosseinpour B, Shahhoseini M, Korte A, Gifani P. Weighted gene co-expression network analysis of endometriosis and identification of functional modules associated with its main hallmarks. Front Genet. (2018) 9:453. doi: 10.3389/fgene.2018.00453

PubMed Abstract | Crossref Full Text | Google Scholar

6. Vallve-Juanico J, Houshdaran S, Giudice LC. The endometrial immune environment of women with endometriosis. Hum Reprod Update. (2019) 25:564–91. doi: 10.1093/humupd/dmz018

PubMed Abstract | Crossref Full Text | Google Scholar

7. Kobayashi H, Imanaka S. Understanding the molecular mechanisms of macrophage polarization and metabolic reprogramming in endometriosis: A narrative review. Reprod Med Biol. (2022) 21:e12488. doi: 10.1093/humupd/dmz018

PubMed Abstract | Crossref Full Text | Google Scholar

8. Lamceva J, Uljanovs R, Strumfa I. The main theories on the of endometriosis. Int J Mol Sci. (2023) 24:4254. doi: 10.3390/ijms24054254

PubMed Abstract | Crossref Full Text | Google Scholar

9. Uddin S, Khan A, Hossain ME, Moni MA. Comparing different supervised machine learning algorithms for disease prediction. BMC Med Inform Decis Mak. (2019) 19:281. doi: 10.1186/s12911-019-1004-8

PubMed Abstract | Crossref Full Text | Google Scholar

10. Peng J, Huang J, Huang G, Zhang J. Predicting the initial treatment response to transarterial chemoembolization in intermediate-stage hepatocellular carcinoma by the integration of radiomics and deep learning. Front Oncol. (2021) 11:730282. doi: 10.3389/fonc.2021.730282

PubMed Abstract | Crossref Full Text | Google Scholar

11. Han MH, Min KW, Noh YK, Kim JM, Cheong JH, Ryu JI, et al. High dkk3 expression related to immunosuppression was associated with poor prognosis in glioblastoma: Machine learning approach. Cancer Immunol Immunother. (2022) 71:3013–27. doi: 10.1007/s00262-022-03222-4

PubMed Abstract | Crossref Full Text | Google Scholar

12. Elbaradie SMY, Bakry MS, Bosilah AH. Serum macrophage migration inhibition factor for diagnosing endometriosis and its severity: case-control study. BMC Women’s Health. (2020) 20:189. doi: 10.1186/s12905-020-01051-0

PubMed Abstract | Crossref Full Text | Google Scholar

13. Wang L, Sun J. ASPN is a potential biomarker and associated with immune infiltration in endometriosis. Genes(Basel). (2022) 13:1352. doi: 10.3390/genes13081352

PubMed Abstract | Crossref Full Text | Google Scholar

14. He Y, Li J, Qu Y, Sun L, Zhao X, Wu H, et al. Identification and analysis of potential immune-related biomarkers in Endometriosis. J Immunol Res. (2023) 10:2975581. doi: 10.1155/2023/2975581

PubMed Abstract | Crossref Full Text | Google Scholar

15. Dean M, Park M, Le Beau MM, Robins TS, Diaz MO, Rowley JD, et al. The human met oncogene is related to the tyrosine kinase oncogenes. Natrue. (1985) 318:385–8. doi: 10.1038/318385a0

PubMed Abstract | Crossref Full Text | Google Scholar

16. Recondo G, Che JW, Janne PA, Awad MM. Targeting met dysregulation in cancer. Cancer Discovery. (2020) 10:922–34. doi: 10.1158/2159-8290.CD-19-1446

PubMed Abstract | Crossref Full Text | Google Scholar

17. Bladt F, Riethmacher D, Isenmann S, Aguzzi A, Birchmeier C. Essential role for the c-met receptor in the migration of myogenic precursor cells into the limb bud. Nature. (1995) 376:768–71. doi: 10.1038/376768a0

PubMed Abstract | Crossref Full Text | Google Scholar

18. Huh C-G, Factor VM, Sanchez A, Uchida K, Conner EA. Thorgeirsson SS. Hepatocyte growth factor/c-met signaling pathway is required for efficient liver regeneration and repair. Proc Natl Acad Sci U.S.A. (2004) 101:4477–82. doi: 10.1073/pnas.0306068101

PubMed Abstract | Crossref Full Text | Google Scholar

19. Okunishi K, Dohi M, Nakagome K, Tanaka R, Mizuno S, Matsumoto K, et al. A novel role of hepatocyte growth factor as an immune regulator through suppressing dendritic cell function. J Immunol. (2005) 175:4745–53. doi: 10.4049/jimmunol.175.7.4745

PubMed Abstract | Crossref Full Text | Google Scholar

20. Benkhoucha M, Santiago-Raber ML, Schneiter G, Chofflon M, Funakoshi H, Nakamura T, et al. Hepatocyte growth factor inhibits CNS autoimmunity by inducing tolerogenic dendritic cells and CD25+Foxp3+ regulatory T cells. Proc Natl Acad Sci U.S.A. (2010) 107:6424–9. doi: 10.1073/pnas.0912437107

PubMed Abstract | Crossref Full Text | Google Scholar

21. Finisguerra V, Conza GD, Matteo MD, Serneels J, Costa S, Thompson AA, et al. MET is required for the recruitment of anti-tumoural neutrophils. Nature. (2015) 522:349–53. doi: 10.1038/nature14407

PubMed Abstract | Crossref Full Text | Google Scholar

22. Karo-Atar D, Bitton A, Benhar I, Munitz A. Therapeutic targeting of the Interleukin-4/Interleukin-13 signaling pathway: In allergy and beyond. BioDrugs. (2018) 32:201–20. doi: 10.1007/s40259-018-0280-7

PubMed Abstract | Crossref Full Text | Google Scholar

23. Mueller TD, Zhang JL, Sebald W, Duschl A. Structure, binding, and antagonists in the IL-4/IL-13 receptor system. Biochim Biophys Acta. (2002) 1592:237–50. doi: 10.1016/s0167-4889(02)00318-x

PubMed Abstract | Crossref Full Text | Google Scholar

24. Wynn TA. IL-13 effector functions. Annu Rev Immunol. (2003) 21:425–56. doi: 10.1016/s0167-4889(02)00318-x

PubMed Abstract | Crossref Full Text | Google Scholar

25. Paul WE. History of interleukin-4. Cytokine. (2015) 75:3–7. doi: 10.1016/j.cyto.2015.01.038

PubMed Abstract | Crossref Full Text | Google Scholar

26. Jourdan M, Caraux A, Caron G, Robert N, Fiol G, Rème T, et al. Characterization of a transitional preplamblast population in the process human B cell to plasma cell differentiation. J Immunol. (2011) 187:3931–41. doi: 10.4049/jimmunol.1101230

PubMed Abstract | Crossref Full Text | Google Scholar

27. Jourdan M, Caraux A, De Vos J, Fiol G, Larroque M, Cognot C, et al. An in vitor model of differentiation of memory B cell into plasmablasts and plasma cells including detailed phenotypic and molecular characterization. Blood. (2009) 114:5173–81.

PubMed Abstract | Google Scholar

28. Yu H, Bian Q, Wang X, Wang X, Lai L, Wu Z, et al. Bone marrow stromal cell antigen 2:Tumor biology, signaling pathway and therapeutic targeting. Oncol Repor. (2024) 51:45. doi: 10.3892/or.2024.8704

PubMed Abstract | Crossref Full Text | Google Scholar

29. Mukai S, Oue N, Oshima T, Mukai R, Tatsumoto Y, Sakamoto N, et al. Overexpression of Transmembrane Protein BST2 is associated with poor survival of patients with esophageal, gastric, or colorectal cancer. Ann Surg Oncol. (2017) 24:594602. doi: 10.1245/s10434-016-5100-z

PubMed Abstract | Crossref Full Text | Google Scholar

30. Liu W, Li Y, Feng S, Guan Y, Cao Y. MicroRNA-760 inhibits cell viability and migration through down-regulating BST2 in gastric cancer. J Biochem. (2020) 168:159–70. doi: 10.1093/jb/mvaa031

PubMed Abstract | Crossref Full Text | Google Scholar

31. MahauadFernandez WD, Okeoma CM. Cysteine-linked dimerization of BST-2 confers anoikis resistance to breast cancer cells by negating proapoptotic activities to promote tumor cell survival and growth. Cell Death Dis. (2017) 8:e2687. doi: 10.1038/cddis.2017.68

PubMed Abstract | Crossref Full Text | Google Scholar

32. Symons LK, Miller JE, Kay VR, Marks RM, Liblik K, Koti M, et al. The immunopathophysiology of endometriosis. Trends Mol Med. (2018) 24:748–62. doi: 10.1016/j.molmed.2018.07.004

PubMed Abstract | Crossref Full Text | Google Scholar

33. Smolarz B, Szyłło K, Romanowicz H. Endometriosis: Epidemiology, classification, pathogenesis, treatment and genetics. Int J Mol Sci. (2021) 22:10554. doi: 10.3390/ijms221910554

PubMed Abstract | Crossref Full Text | Google Scholar

34. Li MZ, Wu YH, Ali M, Wu XQ, Nie MF. Endometrial stromal cells treated by tumor necrosis factor-alpha stimulate macrophages polarized toward M2 via interleukin-6 and monocyte chemoattractant protein-1. J Obstet Gynaecol Res. (2020) 46:293–301. doi: 10.1111/jog.14135

PubMed Abstract | Crossref Full Text | Google Scholar

35. Abramiuk M, Grywalska E, Małkowska P, Sierawska O, Hrynkiewicz R, Niedźwiedzka-Rystwej P. The role of the immune system in the development of endometriosis. Cells. (2022) 11:2028. doi: 10.3390/cells11132028

PubMed Abstract | Crossref Full Text | Google Scholar

36. Ding H, Xu H, Zhang T, Shi C. Identification and validation of M2 macrophage-related genes in endometriosis. Heliyon. (2023) 9:e22258. doi: 10.1016/j.heliyon.2023.e22258

PubMed Abstract | Crossref Full Text | Google Scholar

37. Cui Z, Bhandari R, Lei Q, Lu M, Zhang L, Zhang M, et al. Identification and exploration of novel macrophage M2-related biomarkers and potential therapeutic agents in endometriosis. Front Mol Biosci. (2020) 6:656145. doi: 10.3389/fmolb.2021.656145

PubMed Abstract | Crossref Full Text | Google Scholar

38. Thiruchelvam U, Wingfield M, O’Farrelly C. Increased uNK progenitor cell in women with endometriosis and infertility are associated with low levels of endometrial stem cell factor. Am J Reprod Immunol. (2016) 75:493–502. doi: 10.1111/aji.12486

PubMed Abstract | Crossref Full Text | Google Scholar

39. Shi FD, Ljunggren HG, La Cava A, Van Kaer L. Organ-specific features of natural killer cells. Nat Rev Immunol. (2011) 11:658–71. doi: 10.1038/nri3065

PubMed Abstract | Crossref Full Text | Google Scholar

40. Petta CA, Peloggia A, Salamonsen L, et al. Natural killer cells and telomerase in the endometrium of patients with endometriosis. J Endometr. (2010) 2:183–8.

Google Scholar

41. Vallvé-Juanico J, Houshdaran S, Giudice LC. The endometrial immune environment of women with endometriosis. Hum Repro Upd. (2019) 25:565–92. doi: 10.1093/humupd/dmz018

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: endometriosis, immunity, inflammation, machine learning, biomarker

Citation: Zhou Y, Li J, Chen M and Huang H (2025) Identification and validation of immune-related and inflammation-related genes in endometriosis. Front. Endocrinol. 16:1545670. doi: 10.3389/fendo.2025.1545670

Received: 13 January 2025; Accepted: 02 April 2025;
Published: 08 May 2025.

Edited by:

Takashi Yazawa, Asahikawa Medical University, Japan

Reviewed by:

Bhola Shankar Pradhan, Łukasiewicz Research Network – PORT Polish Center for Technology Development, Poland
Takashi Kajitani, Sakura no Seibo Junior College, Japan
Michael Sierk, National Cancer Institute (NIH), United States

Copyright © 2025 Zhou, Li, Chen and Huang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Hailong Huang, aHVhbmdoYWlsb25nQGZqbXUuZWR1LmNu

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.