Identification and Validation of Key Genes of Differential Correlations in Gastric Cancer

Chen, Tingna; He, Qiuming; Xiang, Zhenxian; Dou, Rongzhang; Xiong, Bin

doi:10.3389/fcell.2021.801687

ORIGINAL RESEARCH article

Front. Cell Dev. Biol., 13 January 2022

Sec. Molecular and Cellular Oncology

Volume 9 - 2021 | https://doi.org/10.3389/fcell.2021.801687

Tingna Chen^1,2,3^†

Qiuming He^1,2,3^†

Zhenxian Xiang^1,2,3^†

Rongzhang Dou^1,2,3

Bin Xiong^1,2,3*

¹Department of Gastrointestinal Surgery, Zhongnan Hospital of Wuhan University, Wuhan, China
²Hubei Key Laboratory of Tumor Biological Behaviors, Wuhan, China
³Hubei Cancer Clinical Study Center, Wuhan, China

Background: Gastric cancer (GC) is an aggressive cancer with a poor prognosis. Bulk transcriptome analysis has previously been used to identify key genes correlated with the development, progression and prognosis of GC. However, owing to the complexity of the underlying genetic mutations, there is still an urgent need to identify core genes in the regulatory network of GC.

Methods: Gene expression profiles (GSE66229) were retrieved from the GEO database. Weighted correlation network analysis (WGCNA) was employed to identify the gene modules most strongly correlated with GC carcinogenesis. The R package “DiffCorr” was applied to identify gene pairs that are differentially correlated between tumor and normal tissues. Cytoscape was used to construct and visualize the gene regulatory network.

Results: A total of 15 modules were detected by WGCNA, three of which were significantly correlated with GC. The genes in these modules were then analyzed separately with “DiffCorr”. Multiple differentially correlated gene pairs were identified, and the network was visualized in Cytoscape. Moreover, GEMIN5 and PFDN2, which have rarely been discussed in GC, were identified as key genes in the regulatory network, and their differential expression was validated by real-time qPCR, western blot (WB) and immunohistochemistry (IHC) in cell lines and GC patient tissues.

Conclusions: Our research sheds light on the mechanism of carcinogenesis by revealing gene pairs that become differentially correlated during the transition from normal to tumor tissue. We believe that this network-based algorithm holds great potential for inferring gene relationships and detecting candidate biomarkers.

Introduction

As an aggressive malignant tumor with poor prognosis and high mortality, gastric cancer (GC) was responsible for over 1,000,000 new cases and an estimated 768,000 deaths in 2020, ranking fifth in incidence and fourth in mortality worldwide (Sung et al., 2021). In China, GC ranks second in incidence and third in mortality (Chen et al., 2016). Despite traditional treatments such as chemotherapy and surgery, GC often recurs within 2 years after surgery and long-term survival is poor because of early metastasis via the lymphatic system, blood, and peritoneum (Wu et al., 2003; D'Angelica et al., 2004). Aside from environmental factors, genetic mutations play a significant role in the carcinogenesis of GC (Karimi et al., 2014). Studies have shown that oncogenes and tumor-suppressor genes, including E-cadherin, p16, and p53, can be used as biomarkers for diagnosis, prediction of sensitivity to treatment, and prognosis of GC (de Mello et al., 2021). Although understanding of the molecular pathogenesis of GC has advanced considerably over the years, much remains to be unraveled (Tan and Yeoh, 2015). Thus, it is essential to detect new biomarkers that show the most regulatory alterations in the gene network in order to elucidate GC etiology and thereby provide information for targeted treatment.

Various bioinformatic methods based on gene expression data have been developed, providing effective tools for the comprehensive analysis of gene networks in the pathogenesis of cancers (Zhang et al., 2018). In recent years, a variety of papers have reported new biomarkers that occupy a vital position in the gene networks of different cancers, such as hepatocellular carcinoma (Fang et al., 2021), breast cancer (Yang et al., 2021), lung cancer (Chen et al., 2021) and bladder cancer (Liao et al., 2021). Usually, novel biomarkers are identified by differential expression analysis (DEA) between diseased and healthy tissues or states (Marco-Puche et al., 2019). However, the incidence of a disease reflects the combined effect of multiple highly interactive genes. Correlation analysis is an important approach for omics data and offers clues about gene regulatory networks (Eisen et al., 1998). As a complement to traditional methods of analyzing gene expression data, it is essential to examine the alteration of gene correlations, referred to as “differential correlations”, in the pathogenesis of cancers (de la Fuente, 2010).

For the first time, we constructed an in silico network of GC pathogenesis and identified two novel key genes based on the theory of “differential correlation”. First, we detected 15 co-expressed gene modules by weighted gene co-expression network analysis (WGCNA). Then, differential correlations were calculated for the genes in the modules most strongly correlated with GC. Next, a gene network was built and functional analysis of the key genes was carried out. Finally, the expression patterns of GEMIN5 and PFDN2, the two novel biomarkers in GC, were confirmed in the laboratory by RT-qPCR, WB, and IHC.

Materials and Methods

Data Collection and Preprocessing

The workflow of data preparation, processing, analysis, and validation is shown in Figure 1. The raw data of the microarray dataset GSE66229 (tumor samples = 300, normal samples = 100) were downloaded from the Gene Expression Omnibus database (GEO: http://www.ncbi.nlm.nih.gov/geo/) and normalized by robust multi-array average (RMA) using the R package “affy” (Gautier et al., 2004). The probes were converted to gene symbols according to the platform GPL570 (Affymetrix Human Genome U133 Plus 2.0 Array). The stomach adenocarcinoma (STAD) RNA-seq read count data, along with survival information, were retrieved from The Cancer Genome Atlas database (TCGA, https://portal.gdc.cancer.gov/). After excluding samples without survival information, 387 samples were enrolled in this study. All analyses were carried out in R version 4.1.0.

FIGURE 1. Flow diagram of the study, showing data processing, analysis, and validation.

Construction of Weighted Gene Co-Expression Network

Gene expression profiles from GSE66229 were used to construct a weighted gene co-expression network with the R package “WGCNA”, following the package protocol (Langfelder and Horvath, 2008). First, we used the goodSamplesGenes (gsg) method to exclude samples with too many missing entries and genes with zero variance. Next, a similarity matrix between gene expression profiles was constructed based on pairwise Pearson correlation and converted to an adjacency matrix using a power adjacency function. The power was chosen according to the scale-free topology criterion, requiring a scale-free topology fit index (R²) of 0.9 (Zhang and Horvath, 2005). Afterwards, the adjacency matrix was transformed into a topological overlap matrix (TOM) to detect modules (Langfelder et al., 2008). Modules were cut using the Dynamic Tree-Cut algorithm (Langfelder et al., 2008). We assigned genes to modules with the blockwiseModules function using the following parameters: minModuleSize = 30, mergeCutHeight = 0.25, deepSplit = 2, verbose = 3.
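The power adjacency step can be illustrated with a minimal Python sketch. It is not the WGCNA implementation: it assumes an unsigned network (adjacency = |Pearson r| raised to the soft-thresholding power), and the gene names and expression values are made up for illustration.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def soft_threshold_adjacency(expr, power):
    """Unsigned adjacency a_ij = |cor(x_i, x_j)| ** power (assumed network type).

    expr maps a gene name to its expression values across samples.
    """
    genes = sorted(expr)
    adj = {}
    for g in genes:
        for h in genes:
            adj[(g, h)] = 1.0 if g == h else abs(pearson(expr[g], expr[h])) ** power
    return adj

# Toy, made-up expression values (not taken from GSE66229).
toy_expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0],
    "geneB": [2.0, 4.1, 5.9, 8.2, 9.8],  # tracks geneA closely
    "geneC": [5.0, 1.0, 4.0, 2.0, 3.0],  # only weakly related to geneA
}
adjacency = soft_threshold_adjacency(toy_expr, power=3)
```

Raising the correlation to a power keeps strong correlations close to 1 while pushing weak ones toward 0, which is what drives the network toward a scale-free topology.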

To extract the co-expressed genes most related to GC carcinogenesis for further analysis, modules were related to phenotypes through the module eigengenes (MEs), each of which represents all genes in a module. Modules whose eigengene correlated with the phenotype with an absolute correlation coefficient above 0.5 were selected. In addition, we correlated module membership (MM) with gene significance (GS) in the selected modules, which reflects the overall relationship of all genes in the module with the phenotype. We focused on the modules with a strong overall correlation between MM and GS (r > 0.5).

Functional Enrichment of the Module Genes

Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of the genes in the chosen modules was performed using the R package “clusterProfiler” (Yu et al., 2012). Only KEGG terms with FDR < 0.05 were considered significant. After constructing the network, we used the STRING database (https://cn.string-db.org/) to assist in searching the relevant literature and in the functional analysis of the key genes and their co-expressed genes.

Differential Correlation Analysis

DiffCorr is an R package for analyzing and visualizing differential correlations between two conditions in biological networks (Fukushima, 2013). Briefly, the analytical process of DiffCorr is divided into three steps. First, differential correlations are calculated via Fisher’s z-test. The Pearson correlations of a gene pair under two conditions, r_A and r_B, are transformed into Z_A and Z_B, respectively, by the formula $Z = \frac{1}{2} \log \frac{1 + r}{1 - r}$. The difference is then tested with the statistic $Z = \frac{Z_{A} - Z_{B}}{\sqrt{\frac{1}{n_{A} - 3} + \frac{1}{n_{B} - 3}}}$, where n_A and n_B represent the sample sizes under each condition. The local false-discovery rate (fdr) is used to control for multiple testing. Second, eigen-molecule modules based on the first principal component (PC) are identified. Using these eigen-molecule modules, pairwise differential correlations between genes can be tested. Finally, scaling and clustering are performed. Different pre-treatment methods, including auto-scaling (unit-variance scaling), range scaling, Pareto scaling, vast scaling, level scaling, and power transformation, are integrated with the downstream differential correlation analyses.
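The two formulas above can be worked through in a short Python sketch. It is a sketch of the Fisher z-test only, not of the DiffCorr package: the two-sided p-value from the standard normal distribution is an assumption about how the statistic is read, the local fdr step is omitted, and the correlations and sample sizes in the example are hypothetical.

```python
import math

def fisher_z(r):
    """Fisher transformation Z = 0.5 * log((1 + r) / (1 - r)), i.e. atanh(r)."""
    return math.atanh(r)

def diff_corr_test(r_a, n_a, r_b, n_b):
    """Return (z statistic, two-sided p-value) for the difference between r_a and r_b."""
    se = math.sqrt(1.0 / (n_a - 3) + 1.0 / (n_b - 3))
    z = (fisher_z(r_a) - fisher_z(r_b)) / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided tail of the standard normal
    return z, p

# Hypothetical gene pair: r = 0.6 in condition A (n = 300), r = -0.5 in condition B (n = 100).
z_stat, p_value = diff_corr_test(0.6, 300, -0.5, 100)
```

A pair whose correlation is the same in both conditions gives a statistic of 0 and a p-value of 1, whereas a sign change of this size yields a very large statistic.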

Construction and Visualization of the Gene Network

The pairwise differential correlation results and the DEA results were imported into Cytoscape (version 3.7.1) to construct the gene network. DEA, performed with the R package “limma” (Ritchie et al., 2015), was used to determine whether genes were up-regulated or down-regulated in GSE66229.

Survival Analysis of the Key Genes in the Network

For each gene, patients were split into two groups by expression level, and the split was repeated 91 times using each percentile from the 5th to the 95th of the expression data as the cutoff value. A log-rank test was conducted for each cutoff value, and the cutoff with the lowest p-value was selected for Kaplan-Meier analysis using the R packages “survival” (Harrington and Fleming, 1982) and “survminer” (https://CRAN.R-project.org/package=survminer).
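The cutoff scan can be sketched in plain Python as follows. This is an illustration under stated assumptions rather than the code used in the study: the two-group log-rank test is written out by hand instead of calling the “survival” package, the percentile uses linear interpolation, and the expression values, follow-up times and event indicators are made up.

```python
import math

def logrank_p(times, events, high):
    """Two-group log-rank test; returns the p-value (chi-square, 1 df).

    events: 1 = death observed, 0 = censored; high: True for the high-expression group.
    """
    obs_minus_exp, variance = 0.0, 0.0
    for t in sorted({u for u, e in zip(times, events) if e}):
        risk = [(e, h, u) for u, e, h in zip(times, events, high) if u >= t]
        n = len(risk)                                   # patients still at risk
        n1 = sum(1 for _, h, _ in risk if h)            # at risk in the high group
        d = sum(1 for e, _, u in risk if e and u == t)  # deaths at time t
        d1 = sum(1 for e, h, u in risk if e and h and u == t)
        obs_minus_exp += d1 - d * n1 / n
        if n > 1:
            variance += d * (n1 / n) * (1 - n1 / n) * (n - d) / (n - 1)
    if variance == 0:
        return 1.0
    chi2 = obs_minus_exp ** 2 / variance
    return math.erfc(math.sqrt(chi2 / 2.0))  # upper tail of chi-square with 1 df

def percentile(values, pct):
    """Linear-interpolation percentile (pct between 0 and 100)."""
    s = sorted(values)
    pos = (len(s) - 1) * pct / 100.0
    lo = math.floor(pos)
    hi = min(lo + 1, len(s) - 1)
    return s[lo] + (s[hi] - s[lo]) * (pos - lo)

def best_cutoff(expression, times, events):
    """Scan the 5th to 95th percentiles (91 cutoffs) and keep the lowest p-value."""
    best = None
    for pct in range(5, 96):
        cut = percentile(expression, pct)
        high = [x > cut for x in expression]
        if all(high) or not any(high):
            continue  # skip cutoffs that leave one group empty
        p = logrank_p(times, events, high)
        if best is None or p < best[2]:
            best = (pct, cut, p)
    return best

# Made-up cohort: the ten highest expressors die early, the rest survive longer.
expression = [float(i) for i in range(1, 21)]
times = [30.0 + i for i in range(10)] + [5.0 + i for i in range(10)]
events = [1, 1, 0, 1, 1, 0, 1, 1, 1, 1] + [1] * 10
best_pct, best_cut, best_p = best_cutoff(expression, times, events)
```

Because the reported p-value is the minimum over 91 tests, it is optimistic compared with a single pre-specified cutoff, which is worth keeping in mind when reading the resulting Kaplan-Meier curves.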

Cell Lines and Cell Culture

The GC cell lines (BGC823, HGC27, MKN45, AGS, MGC803, SGC7901) and the human normal mucosal epithelial cell line GES-1 were purchased from the Chinese Academy of Sciences in Shanghai. Cells were cultured in medium supplemented with 10% fetal bovine serum (FBS), 100 IU/ml penicillin and 100 mg/ml streptomycin in a humidified atmosphere with 5% CO₂ at 37°C.

Patients and Tissues

Sixty-three pairs of GC tumor and corresponding adjacent normal tissues were collected from Zhongnan Hospital of Wuhan University. The patients had not received any chemotherapy or radiotherapy. All samples were obtained with the patients’ informed consent before collection, and the study was approved by the Zhongnan Hospital of Wuhan University Ethics Committee. Samples were snap-frozen and stored at −80°C until use in real-time qPCR (RT-qPCR) and western blot (WB) experiments. In addition, we conducted immunohistochemical (IHC) staining of formalin-fixed paraffin-embedded samples from GC patients and normal controls.

RNA Isolation and Quantitative Real-Time PCR

Total RNA was extracted from GC cell lines and tissues using TRIzol reagent (Invitrogen, United States) according to the manufacturer’s protocol. RNA concentration was measured with a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, United States). One microgram of RNA was reverse transcribed to cDNA using the Primescript™ RT reagent kit (Vazyme, China). Quantitation of mRNA expression levels was performed on a Bio-Rad IQ5 Real-Time PCR instrument (Bio-Rad, United States) using SYBR-Green PCR Master Mix (Vazyme, China). The primer sequences are listed in Supplementary Table S1.

Western Blot Analysis

Cells were lysed using RIPA buffer containing a protease inhibitor cocktail (Thermo Fisher Scientific, United States). Proteins were separated on 10% sodium dodecyl sulfate-polyacrylamide gels and then transferred to a polyvinylidene fluoride membrane (Millipore, United States). After blocking for 2 h with 5% non-fat milk, the membranes were incubated with primary antibodies and then with HRP-conjugated secondary antibodies. Protein bands were detected using Bio-Rad Image Lab software and quantified using ImageJ software. The following primary antibodies were used: anti-GEMIN5 (1:1,000, Abcam ab201691), anti-PFDN2 (1:1,000, Abcam ab237534), and anti-β-actin (1:5,000, Abcam ab8226).

Immunohistochemistry

Formalin-fixed tissues were paraffin-embedded and cut into 5-μm-thick sections. Antigen retrieval was performed in a microwave oven for 18 min in citrate buffer (Beyotime, China). Endogenous peroxidase activity was blocked with 3% hydrogen peroxide (Merck, Germany). Nonspecific staining was blocked, followed by incubation with antibodies against GEMIN5 (1:100, Abcam ab201691) and PFDN2 (1:100, Abcam ab237534). Immunostaining was performed using DAB according to the manufacturer’s instructions.

Results

Construction of Weighted Gene Co-Expression Network

After excluding two abnormal samples, GSM1523817 and GSM1523984 (Figure 2A), we performed WGCNA to detect clusters of genes most strongly correlated with GC carcinogenesis based on the expression profiles of the remaining 398 samples in GSE66229. The soft-thresholding power, a critical parameter in the analysis, was set to three to ensure a scale-free network (R² = 0.9, Figure 2B). A dendrogram of all genes (n = 16,241) was built using the average linkage method and the biweight midcorrelation (bicor) method (Figure 2C). A total of 15 modules with widely varying numbers of co-expressed genes were identified through hierarchical clustering. The genes and their assigned modules are listed in Supplementary Table S1.

FIGURE 2. Construction of the weighted gene co-expression network. (A) Cluster dendrogram displaying the relationship between samples. (B) The scale-free fit index for soft-thresholding powers. Left: the relationship between the soft threshold and scale-free R². Right: the relationship between the soft threshold and mean connectivity. (C) Dendrogram of all genes clustered in GSE66229. Different modules are labeled in different colors.

Identification of Significant Modules in WGCNA

The relationship between modules and GC was subsequently explored (Figure 3A). Four modules showed a strong relationship with GC, with absolute correlation coefficients above 0.5: the pink module (r = −0.62, p = 8e−43), the turquoise module (r = 0.77, p = 1e−80), the purple module (r = 0.52, p = 2e−29) and the blue module (r = −0.57, p = 7e−36).

FIGURE 3. Identification of significant modules in WGCNA. (A) Heatmap of the correlation between module eigengenes and disease status. Each cell contains the correlation coefficient and p value. Red indicates positive correlations and blue indicates negative correlations. (B)–(E) Scatterplots of Gene Significance (GS) for disease vs Module Membership (MM) in the pink (B), turquoise (C), purple (D) and blue (E) modules. There was a highly significant correlation between GS and MM in the pink, turquoise and blue modules. (F)–(H) KEGG pathway analysis in the pink (F), turquoise (G) and blue (H) modules.

Next, to further screen for modules in which most genes were associated with GC, we performed a correlation analysis between GS and MM (Figures 3B–E). The pink module (r = 0.83, p = 7.3e−53), the turquoise module (r = 0.87, p < 1e−200) and the blue module (r = 0.81, p < 1e−200) were chosen for further analysis, while the purple module (r = 0.38, p = 5.9e−06) was excluded.
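The two-stage module screen can be reproduced from the coefficients reported above. The Python sketch below is only a restatement of the selection rule; the dictionary layout and function name are illustrative choices, and the thresholds are the ones given in the Methods (absolute module-trait correlation above 0.5 and GS-MM correlation above 0.5).

```python
# Correlation coefficients as reported in the text
# (module eigengene vs disease status, and GS vs MM within each module).
modules = {
    "pink":      {"trait_r": -0.62, "gs_mm_r": 0.83},
    "turquoise": {"trait_r": 0.77,  "gs_mm_r": 0.87},
    "purple":    {"trait_r": 0.52,  "gs_mm_r": 0.38},
    "blue":      {"trait_r": -0.57, "gs_mm_r": 0.81},
}

def select_modules(stats, trait_cut=0.5, gs_mm_cut=0.5):
    """Keep modules passing both the module-trait and the GS-MM thresholds."""
    return sorted(
        name for name, s in stats.items()
        if abs(s["trait_r"]) > trait_cut and s["gs_mm_r"] > gs_mm_cut
    )

selected = select_modules(modules)
```

The purple module passes the first threshold but fails the second, which is why only three modules go forward to the differential correlation analysis.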

Functional analysis of genes in the three modules was performed by KEGG pathway analysis (Supplementary Table S2). The top five significant pathways in the pink module were mineral absorption, drug metabolism − cytochrome P450, metabolism of xenobiotics by cytochrome P450, gastric acid secretion and retinol metabolism (Figure 3F). The top five significant pathways in the turquoise module were cell cycle, spliceosome, nucleocytoplasmic transport, ribosome biogenesis in eukaryotes and p53 signaling pathway (Figure 3G). The top five significant pathways in the blue module were focal adhesion, proteoglycans in cancer, regulation of actin cytoskeleton, axon guidance and tight junction (Figure 3H).

Calculation of Differential Correlations and Visualization of the Gene Network

Genes in the pink, turquoise and blue modules were analyzed separately for differential correlations. First, we used the cluster.molecule function to cluster genes based on their expression profiles. A distance of 1 − correlation coefficient was adopted, and the tree was cut at a cutoff of 0.6 with the cutree function. Then get.eigen.molecule and get.eigen.molecule.graph were applied to visualize this process (Figure 4). Finally, the pairwise differential correlation results were exported via the comp.2.cc.fdr function (Supplementary Table S3), and the top 10 significant differential co-expressions (FDR < 0.05) in each module are shown in Table 1.

FIGURE 4. Representations of the module network and differential co-expressions. Images of pink (A), turquoise (B) and blue (C) module networks including cancerous and normal samples.

TABLE 1. Top 10 correlated gene pairs changed to the opposite direction in each module between normal and GC samples.

The DiffCorr package also detected oppositely correlated gene pairs, in which two molecules are positively correlated in one condition and negatively correlated in the other, which is referred to as a “switching mechanism” (Kayano et al., 2011). These switched gene pairs are of particular interest in the pathogenesis of GC. As the correlation relationships of gene pairs in each module were complicated, we selected gene pairs that had at least moderately strong correlations of opposite sign in the two conditions (|r| > 0.5 in both) to construct a gene network. In total, we obtained 52 oppositely correlated gene pairs from the pink module, 307 from the turquoise module and five from the blue module (Supplementary Table S4), and the gene network built from them with Cytoscape is presented in Figure 5.
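The selection of switched gene pairs amounts to a simple filter, sketched below in Python. The gene names and correlation values are hypothetical, and the reading of the threshold (absolute correlation above 0.5 in both conditions, with opposite signs) is an interpretation of the criterion described above.

```python
from collections import Counter

def is_switched(r_normal, r_tumor, cutoff=0.5):
    """True if the pair is correlated beyond the cutoff in both conditions with opposite signs."""
    return (abs(r_normal) > cutoff and abs(r_tumor) > cutoff
            and r_normal * r_tumor < 0)

# Hypothetical pairs: (gene 1, gene 2, r in normal tissue, r in tumor tissue).
pairs = [
    ("geneA", "geneB", 0.72, -0.61),   # switched
    ("geneA", "geneC", 0.80, 0.75),    # strong in both, same sign
    ("geneB", "geneD", -0.55, 0.30),   # sign change, but weak in tumor
    ("geneC", "geneD", -0.66, 0.58),   # switched
]
switched = [(g1, g2) for g1, g2, rn, rt in pairs if is_switched(rn, rt)]

# Number of switched pairs each gene takes part in (its degree in the network).
degree = Counter(g for pair in switched for g in pair)
```

Counting how many switched pairs each gene appears in gives the node degree used to pick out central genes in the network.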

FIGURE 5. Differentially co-expressed gene networks in the pink (A), turquoise (B) and blue (C) modules. Each node represents a gene; lavender fill denotes genes decreased in GC and pink fill denotes genes increased in GC. Each edge represents a connection between two genes; green edges represent negative correlations and red edges positive correlations.

Functional Analysis of Key Genes with Differential Correlations

In the pink module, AKIP1 deserved more attention than other genes (Figure 5A). In this module, AKIP1 appeared in the most gene pairs and was linked to 19 other genes. AKIP1 was increased in GC, whereas its co-expressed genes were mostly decreased. AKIP1 has previously been reported to act as an oncogene in GC by activating Slug-induced EMT, and to correlate significantly with clinical metastasis and poor prognosis (Chen et al., 2019). By uploading AKIP1 and its co-expressed genes to the STRING database, we found that its co-expressed genes KCNJ15, KCNJ16, GHRL and CCKBR are all involved in gastric acid secretion (Rau et al., 2013). Thus, our research enhances the understanding of the role of AKIP1 in GC.

In the turquoise module, GEMIN5, PFDN2 and Sjögren syndrome antigen B (SSB) were at the center of the network and possessed more edges than other genes (Figure 5B). All three genes were increased in GC. Gem nuclear organelle associated protein 5 (GEMIN5), a component of the spliceosomal complex, plays a crucial role in mRNA splicing and can affect tumor cell motility (Lee et al., 2008). However, it has not yet been studied in GC. Its co-expressed genes SHMT2 and MTHFD2 encode two mitochondrial enzymes that take part in folate metabolism and play critical roles in the survival and proliferation of gastrointestinal cancers (Konno et al., 2017), which indicates that GEMIN5 might function as an oncogene in GC by affecting folate metabolism. Hence, our study reveals for the first time the critical role of GEMIN5 in GC and provides information for further functional research.

As an autoantigen and RNA-binding protein, SSB binds to the 3′ poly(U) terminus of nascent RNA polymerase III transcripts, protecting them from exonuclease digestion and facilitating their folding and maturation (Chambers et al., 1988). It has recently been identified as a pre-miRNA-binding protein that regulates miRNA processing in vitro, and was correlated with Dicer in human cancer transcriptomes and with prognosis (Liang et al., 2013). However, the role of SSB in human cancers remains unclear. Thus, our network provides information for further investigation.

PFDN2 belongs to the prefoldin subunit family, whose members can bind and stabilize newly synthesized polypeptides (Mo et al., 2020). In GC, high mRNA expression of PFDN2 was associated with poor overall survival (OS) (Yesseyeva et al., 2020). Its co-expressed genes TTK, CCNB2 and PLK1 are in the cell cycle pathway, indicating that PFDN2 might play an important role in GC by affecting the cell cycle. Therefore, our research is the first to validate the differential expression of PFDN2 and to identify its co-expressed genes in GC, providing valuable information for further exploration of its functional mechanism.

In the blue module, GPR155 (G protein-coupled receptor 155) stood out in the network (Figure 5C). GPR155 transcription has been reported to be suppressed in GC cell lines compared with a nontumorigenic cell line, and a low GPR155 mRNA level was an independent biomarker of hematogenous metastasis (Shimizu et al., 2017). Consistent with this, in our study the expression of GPR155 was downregulated in gastric cancer tissue of the ACRG cohort.

Validation of Novel Key Genes of GC

To validate the two novel candidate key genes, GEMIN5 and PFDN2, we performed Kaplan-Meier analysis in the TCGA-STAD dataset and used RT-qPCR, WB, and IHC experiments to detect mRNA or protein expression in GC cell lines and patient samples. The Kaplan-Meier analysis showed that higher levels of GEMIN5 and PFDN2 correlated with poor prognosis in GC patients (Figures 6A,B). RT-qPCR showed that the mRNA expression levels of GEMIN5 and PFDN2 were higher in the cancer cell lines HGC27, BGC823, AGS, MGC803, SGC7901, and MKN45 than in the human normal mucosal epithelial cell line GES-1 (Figures 6C, D). The mRNA expression levels of GEMIN5 and PFDN2 were also higher in GC tissues than in normal gastric tissues (Figures 6E, F). The protein expression levels of GEMIN5 and PFDN2 were consistent with the mRNA expression levels in the same cell lines and patient tissues, as confirmed by WB (Figures 6G–I) and IHC (Figures 6J, K). In addition, we analyzed the correlation between mRNA expression level and clinicopathological characteristics in our own patient samples (Table 2). High GEMIN5 expression was significantly correlated with tumor size (p = 0.031), T-stage (p = 0.009), lymphatic metastasis (p = 0.001), TNM stage (p = 0.002) and histological grade (p = 0.034). High PFDN2 expression was significantly correlated with age (p = 0.027), T-stage (p = 0.047), and TNM stage (p = 0.05). For the first time, our experiments validated the elevated expression of GEMIN5 and PFDN2 in GC cell lines and GC samples, suggesting critical roles for them in the pathogenesis of GC.

FIGURE 6. Validation of the key genes using TCGA-STAD data and laboratory experiments. Kaplan-Meier curves for GEMIN5 (A) and PFDN2 (B). mRNA expression levels of GEMIN5 and PFDN2 were validated by RT-qPCR in cell lines (C)–(D) and patient samples (E)–(F). Protein expression levels of GEMIN5 and PFDN2 were validated by western blot (G–I) and immunohistochemistry (J–K). Both mRNA and protein expression levels of GEMIN5 and PFDN2 were higher in tumor samples and cancer cell lines than in normal samples and the nontumorigenic cell line.

TABLE 2. Clinicopathological features and expression of PFDN2 and GEMIN5.

Discussion

Previous studies have revealed gene interaction networks in multiple cancers based on co-expression patterns (Liu et al., 2020; Nangraj et al., 2020; Bo et al., 2021), but little is known about differential correlations between diseased and healthy states, which could offer an instructive view of the deeper mechanisms of cancer pathogenesis. A growing direction in co-expression analysis is the development of tools focused on genes that are correlated under one condition but show little or no correlation under another, including coXpress (Watson, 2006), DCGL (Yang et al., 2013), and DiffCorr (Fukushima, 2013). CoXpress is a simple method for identifying differentially co-expressed gene groups and should be used as a first step in the analysis of co-expression (Watson, 2006). DCGL selects differentially regulated genes (DRGs) and differentially regulated links (DRLs) according to transcription factor (TF)-to-target information (Yang et al., 2013). DiffCorr identifies pattern changes between two experimental conditions in correlation networks and can be used to detect biomarker candidates (Fukushima, 2013).

There is still plenty of scope for applying this approach in cancer research. In this study, for the first time, we constructed an in silico gene network in GC, benefiting from the availability of large-scale data and multiple algorithms to calculate differential correlations between gene pairs. First, we used WGCNA as a gene filter to detect significant co-expressed gene modules and narrow down the range of candidates. Second, we used DiffCorr to identify differentially co-expressed gene modules within the significant modules. To identify the key genes in GC, we analyzed the network with a focus on biological functions. A recent study in plant biology used DiffCorr to identify a novel community of genes that explains differences in leaf phenotypes but has no enriched GO terms (Nakayama et al., 2021). We carried out the functional analysis of our key genes by studying their co-expressed genes and consulting previous research with the help of the STRING database. Candidate key genes were validated using clinical information and laboratory experiments. In addition, both the candidate key genes and their differentially co-expressed genes could provide useful guidance for further experimental investigations and eventually facilitate the diagnosis and treatment of GC patients.

The most significant gene in the pink module, A-kinase-interacting protein 1 (AKIP1), is a nuclear protein known to interact with the catalytic subunit of PKA (PKAc) and to act as a binding partner of the NF-kappaB p65 subunit (Gao et al., 2008). Owing to this function, AKIP1 has previously been reported to act as a potential oncogenic protein in various cancers such as esophageal squamous cell carcinoma (Lin et al., 2015), breast carcinoma (Mo et al., 2016), non-small-cell lung cancer (Guo et al., 2017) and colorectal cancer (Jiang et al., 2018). Our research reconfirmed that it plays an important role in GC and extended the understanding of it by revealing its co-expressed genes.

The key gene GPR155 in the blue module has also been extensively studied in cancers. In addition to its role as a biomarker in GC, downregulation of GPR155 has been reported to be significantly associated with more aggressive HCC phenotypes, including high preoperative α-fetoprotein, poor differentiation, serosal infiltration, vascular invasion, and advanced disease stage (Umeda et al., 2017). The presence of GPR155 in multiple gene pairs suggests that it might be a crucial regulatory gene in GC, which offers clues for further research on it in cancers.

With regard to the turquoise module, we identified two novel key genes, GEMIN5 and PFDN2. GEMIN5 has been reported to be the most differentially varied protein in breast cancer cell lines after modulation of Nm23-H1, the first metastasis suppressor gene (MSG) to be characterized (Lee et al., 2009). A recent study found that somatic copy-number gain of PFDN2 was associated with poor survival in patients with metastatic urothelial carcinoma treated with platinum-based chemotherapy (Riester et al., 2014). However, the significance of GEMIN5 and PFDN2 has hardly been mentioned in GC. Our network suggests that they might occupy a significant position in the pathogenesis of GC, because they were linked to the most genes in the co-expressed gene module and might serve as key regulators. In addition, we validated their differential expression at the mRNA and protein levels in the laboratory and analyzed their correlations with clinicopathological features.

Conventional methods for identifying key genes that distinguish diseased from healthy states require complex screening of a list of hundreds of candidate genes, whereas in our research we narrowed down the range of candidates with WGCNA and DiffCorr and achieved effective results. Our approach is promising and efficient for finding novel biomarkers in cancers. However, only transcriptomic data were included in our analysis, which covers only a single layer of genomic information. Future studies should examine differential correlations by integrating multidimensional data such as proteomic data and single-cell sequencing data.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Ethics Statement

The studies involving human participants were reviewed and approved by Zhongnan Hospital of Wuhan University Ethics Committee. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

BX and TC conceived and designed the study. TC conducted all analysis procedures. TC, ZX, RD, and BX analyzed the results. TC and BX contributed analysis tools. QH conducted the experimental validations. TC, QH and ZX contributed to the writing of the manuscript. All authors reviewed and approved the manuscript.

Funding

This work was supported by grants from the National Natural Science Foundation of China (81572874; 81872376) and the Clinical Special Disease Diagnosis and Treatment Technology Foundation of Zhongnan Hospital of Wuhan University (ZLYNXM202018).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We would like to acknowledge the TCGA database and GEO database developed by the National Institutes of Health (NIH).

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcell.2021.801687/full#supplementary-material

References

Bo, H., Zhu, F., Liu, Z., Deng, Q., Liu, G., Li, R., et al. (2021). Integrated Analysis of High-Throughput Sequencing Data Reveals the Key Role of LINC00467 in the Invasion and Metastasis of Testicular Germ Cell Tumors. Cell Death Discov. 7, 206. doi:10.1038/s41420-021-00588-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Chambers, J. C., Kenan, D., Martin, B. J., and Keene, J. D. (1988). Genomic Structure and Amino Acid Sequence Domains of the Human La Autoantigen. J. Biol. Chem. 263, 18043–18051. doi:10.1016/s0021-9258(19)81321-2

CrossRef Full Text | Google Scholar

Chen, B., Xie, X., Lan, F., and Liu, W. (2021). Identification of Prognostic Markers by Weighted Gene Co‐expression Network Analysis in Non-small Cell Lung Cancer. Bioengineered 12, 4924–4935. doi:10.1080/21655979.2021.1960764

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, D., Cao, G., and Liu, Q. (2019). A‐kinase‐interacting Protein 1 Facilitates Growth and Metastasis of Gastric Cancer Cells via Slug‐induced Epithelial‐mesenchymal Transition. J. Cel Mol Med 23, 4434–4442. doi:10.1111/jcmm.14339

CrossRef Full Text | Google Scholar

Chen, W., Zheng, R., Baade, P. D., Zhang, S., Zeng, H., Bray, F., et al. (2016). Cancer Statistics in China, 2015. CA: A Cancer J. Clinicians 66, 115–132. doi:10.3322/caac.21338

CrossRef Full Text | Google Scholar

D'angelica, M., Gonen, M., Brennan, M. F., Turnbull, A. D., Bains, M., and Karpeh, M. S. (2004). Patterns of Initial Recurrence in Completely Resected Gastric Adenocarcinoma. Ann. Surg. 240, 808–816. doi:10.1097/01.sla.0000143245.28656.15

PubMed Abstract | CrossRef Full Text | Google Scholar

De La Fuente, A. (2010). From 'differential Expression' to 'differential Networking' - Identification of Dysfunctional Regulatory Networks in Diseases. Trends Genet. 26, 326–333. doi:10.1016/j.tig.2010.05.001

PubMed Abstract | CrossRef Full Text | Google Scholar

De Mello, R. A., Amaral, G. A., Neves, N. M., Lippo, E. G., Parini, F., Xu, S., et al. (2021). Current and Potential Biomarkers in Gastric Cancer: a Critical Review of the Literature. Future Oncol. doi:10.2217/fon-2021-0084

CrossRef Full Text | Google Scholar

Eisen, M. B., Spellman, P. T., Brown, P. O., and Botstein, D. (1998). Cluster Analysis and Display of Genome-wide Expression Patterns. Proc. Natl. Acad. Sci. 95, 14863–14868. doi:10.1073/pnas.95.25.14863

PubMed Abstract | CrossRef Full Text | Google Scholar

Fang, Y., Yang, Y., Zhang, X., Li, N., Yuan, B., Jin, L., et al. (2021). A Co-expression Network Reveals the Potential Regulatory Mechanism of lncRNAs in Relapsed Hepatocellular Carcinoma. Front. Oncol. 11, 745166. doi:10.3389/fonc.2021.745166

PubMed Abstract | CrossRef Full Text | Google Scholar

Fukushima, A. (2013). DiffCorr: an R Package to Analyze and Visualize Differential Correlations in Biological Networks. Gene 518, 209–214. doi:10.1016/j.gene.2012.11.028

PubMed Abstract | CrossRef Full Text | Google Scholar

Gao, N., Asamitsu, K., Hibi, Y., Ueno, T., and Okamoto, T. (2008). AKIP1 Enhances NF-κb-dependent Gene Expression by Promoting the Nuclear Retention and Phosphorylation of P65. J. Biol. Chem. 283, 7834–7843. doi:10.1074/jbc.m710285200

CrossRef Full Text | Google Scholar

Gautier, L., Cope, L., Bolstad, B. M., and Irizarry, R. A. (2004). affy--analysis of Affymetrix GeneChip Data at the Probe Level. Bioinformatics 20, 307–315. doi:10.1093/bioinformatics/btg405

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, X., Zhao, L., Cheng, D., Mu, Q., Kuang, H., and Feng, K. (2017). AKIP1 Promoted Epithelial-Mesenchymal Transition of Non-small-cell Lung Cancer via Transactivating ZEB1. Am. J. Cancer Res. 7, 2234–2244.

Google Scholar

Harrington, D. P., and Fleming, T. R. (1982). A Class of Rank Test Procedures for Censored Survival Data. Biometrika 69, 553–566. doi:10.1093/biomet/69.3.553

CrossRef Full Text | Google Scholar

Jiang, W., Yang, W., Yuan, L., and Liu, F. (2018). Upregulation of AKIP1 Contributes to Metastasis and Progression and Predicts Poor Prognosis of Patients with Colorectal Cancer. Ott Vol. 11, 6795–6801. doi:10.2147/ott.s151952

CrossRef Full Text | Google Scholar

Karimi, P., Islami, F., Anandasabapathy, S., Freedman, N. D., and Kamangar, F. (2014). Gastric Cancer: Descriptive Epidemiology, Risk Factors, Screening, and Prevention. Cancer Epidemiol. Biomarkers Prev. 23, 700–713. doi:10.1158/1055-9965.epi-13-1057

PubMed Abstract | CrossRef Full Text | Google Scholar

Kayano, M., Takigawa, I., Shiga, M., Tsuda, K., and Mamitsuka, H. (2011). ROS-DET: Robust Detector of Switching Mechanisms in Gene Expression. Nucleic Acids Res. 39, e74. doi:10.1093/nar/gkr130

PubMed Abstract | CrossRef Full Text | Google Scholar

Konno, M., Asai, A., Kawamoto, K., Nishida, N., Satoh, T., Doki, Y., et al. (2017). The One-Carbon Metabolism Pathway Highlights Therapeutic Targets for Gastrointestinal Cancer (Review). Int. J. Oncol. 50, 1057–1063. doi:10.3892/ijo.2017.3885

CrossRef Full Text | Google Scholar

Langfelder, P., and Horvath, S. (2008). WGCNA: an R Package for Weighted Correlation Network Analysis. BMC Bioinformatics 9, 559. doi:10.1186/1471-2105-9-559

PubMed Abstract | CrossRef Full Text | Google Scholar

Langfelder, P., Zhang, B., and Horvath, S. (2008). Defining Clusters from a Hierarchical Cluster Tree: the Dynamic Tree Cut Package for R. Bioinformatics 24, 719–720. doi:10.1093/bioinformatics/btm563

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, J. H., Horak, C. E., Khanna, C., Meng, Z., Yu, L. R., Veenstra, T. D., et al. (2008). Alterations in Gemin5 Expression Contribute to Alternative mRNA Splicing Patterns and Tumor Cell Motility. Cancer Res. 68, 639–644. doi:10.1158/0008-5472.can-07-2632

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, J. H., Marshall, J.-C., Steeg, P. S., and Horak, C. E. (2009). Altered Gene and Protein Expression by Nm23-H1 in Metastasis Suppression. Mol. Cel Biochem 329, 141–148. doi:10.1007/s11010-009-0124-3

CrossRef Full Text | Google Scholar

Liang, C., Xiong, K., Szulwach, K. E., Zhang, Y., Wang, Z., Peng, J., et al. (2013). Sjögren Syndrome Antigen B (SSB)/La Promotes Global MicroRNA Expression by Binding MicroRNA Precursors through Stem-Loop Recognition. J. Biol. Chem. 288, 723–736. doi:10.1074/jbc.m112.401323

CrossRef Full Text | Google Scholar

Liao, Y., Zou, X., Wang, K., Wang, Y., Wang, M., Guo, T., et al. (2021). Comprehensive Analysis of Transcription Factors Identified Novel Prognostic Biomarker in Human Bladder Cancer. J. Cancer 12, 5605–5621. doi:10.7150/jca.58484

CrossRef Full Text | Google Scholar

Lin, C., Song, L., Liu, A., Gong, H., Lin, X., Wu, J., et al. (2015). Overexpression of AKIP1 Promotes Angiogenesis and Lymphangiogenesis in Human Esophageal Squamous Cell Carcinoma. Oncogene 34, 384–393. doi:10.1038/onc.2013.559

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, H., Liu, M., You, H., Li, X., and Li, X. (2020). Oncogenic Network and Hub Genes for Natural Killer/T-Cell Lymphoma Utilizing WGCNA. Front. Oncol. 10, 223. doi:10.3389/fonc.2020.00223

PubMed Abstract | CrossRef Full Text | Google Scholar

Marco-Puche, G., Lois, S., Benítez, J., and Trivino, J. C. (2019). RNA-seq Perspectives to Improve Clinical Diagnosis. Front. Genet. 10, 1152. doi:10.3389/fgene.2019.01152

PubMed Abstract | CrossRef Full Text | Google Scholar

Mo, D., Li, X., Li, C., Liang, J., Zeng, T., Su, N., et al. (2016). Overexpression of AKIP1 Predicts Poor Prognosis of Patients with Breast Carcinoma and Promotes Cancer Metastasis through Akt/GSK-3β/Snail Pathway. Am. J. Transl Res. 8, 4951–4959.

Google Scholar

Mo, S.-j., Zhao, H.-C., Tian, Y.-z., and Zhao, H.-L. (2020). The Role of Prefoldin and its Subunits in Tumors and Their Application Prospects in Nanomedicine. Cmar Vol. 12, 8847–8856. doi:10.2147/cmar.s270237

PubMed Abstract | CrossRef Full Text | Google Scholar

Nakayama, H., Rowland, S. D., Cheng, Z., Zumstein, K., Kang, J., Kondo, Y., et al. (2021). Leaf Form Diversification in an Ornamental Heirloom Tomato Results from Alterations in Two Different HOMEOBOX Genes. Curr. Biol. 31, 4788–4799. doi:10.1016/j.cub.2021.08.023

PubMed Abstract | CrossRef Full Text | Google Scholar

Nangraj, A. S., Selvaraj, G., Kaliamurthi, S., Kaushik, A. C., Cho, W. C., and Wei, D. Q. (2020). Integrated PPI- and WGCNA-Retrieval of Hub Gene Signatures Shared between Barrett's Esophagus and Esophageal Adenocarcinoma. Front. Pharmacol. 11, 881. doi:10.3389/fphar.2020.00881

PubMed Abstract | CrossRef Full Text | Google Scholar

Rau, T. T., Sonst, A., Rogler, A., Burnat, G., Neumann, H., Oeckl, K., et al. (2013). Gastrin Mediated Down Regulation of Ghrelin and its Pathophysiological Role in Atrophic Gastritis. J. Physiol. Pharmacol. 64, 719–725.

Google Scholar

Riester, M., Werner, L., Bellmunt, J., Selvarajah, S., Guancial, E. A., Weir, B. A., et al. (2014). Integrative Analysis of 1q23.3 Copy-Number Gain in Metastatic Urothelial Carcinoma. Clin. Cancer Res. 20, 1873–1883. doi:10.1158/1078-0432.ccr-13-0759

PubMed Abstract | CrossRef Full Text | Google Scholar

Ritchie, M. E., Phipson, B., Wu, D., Hu, Y., Law, C. W., Shi, W., et al. (2015). Limma powers Differential Expression Analyses for RNA-Sequencing and Microarray Studies. Nucleic Acids Res. 43, e47. doi:10.1093/nar/gkv007

PubMed Abstract | CrossRef Full Text | Google Scholar

Shimizu, D., Kanda, M., Tanaka, H., Kobayashi, D., Tanaka, C., Hayashi, M., et al. (2017). GPR155 Serves as a Predictive Biomarker for Hematogenous Metastasis in Patients with Gastric Cancer. Sci. Rep. 7, 42089. doi:10.1038/srep42089

PubMed Abstract | CrossRef Full Text | Google Scholar

Sung, H., Ferlay, J., Siegel, R. L., Laversanne, M., Soerjomataram, I., Jemal, A., et al. (2021). Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA A. Cancer J. Clin. 71, 209–249. doi:10.3322/caac.21660

CrossRef Full Text | Google Scholar

Tan, P., and Yeoh, K.-G. (2015). Genetics and Molecular Pathogenesis of Gastric Adenocarcinoma. Gastroenterology 149, 1153–1162. doi:10.1053/j.gastro.2015.05.059

PubMed Abstract | CrossRef Full Text | Google Scholar

Umeda, S., Kanda, M., Sugimoto, H., Tanaka, H., Hayashi, M., Yamada, S., et al. (2017). Downregulation of GPR155 as a Prognostic Factor after Curative Resection of Hepatocellular Carcinoma. BMC Cancer 17, 610. doi:10.1186/s12885-017-3629-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Watson, M. (2006). CoXpress: Differential Co-expression in Gene Expression Data. BMC Bioinformatics 7, 509. doi:10.1186/1471-2105-7-509

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, C.-W., Lo, S.-S., Shen, K.-H., Hsieh, M.-C., Chen, J.-H., Chiang, J.-H., et al. (2003). Incidence and Factors Associated with Recurrence Patterns after Intended Curative Surgery for Gastric Cancer. World J. Surg. 27, 153–158. doi:10.1007/s00268-002-6279-7

CrossRef Full Text | Google Scholar

Yang, J., Yu, H., Liu, B.-H., Zhao, Z., Liu, L., Ma, L.-X., et al. (2013). DCGL v2.0: an R Package for Unveiling Differential Regulation from Differential Co-expression. PLoS One 8, e79729. doi:10.1371/journal.pone.0079729

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, L., Li, X., Luo, Y., Yang, T., Wang, H., Shi, L., et al. (2021). Weighted Gene Co-expression N-etwork A-nalysis of the A-ssociation between U-pregulated AMD1, EN1 and VGLL1 and the P-rogression and P-oor P-rognosis of B-reast C-ancer. Exp. Ther. Med. 22, 1030. doi:10.3892/etm.2021.10462

PubMed Abstract | CrossRef Full Text | Google Scholar

Yesseyeva, G., Aikemu, B., Hong, H., Yu, C., Dong, F., Sun, J., et al. (2020). Prefoldin Subunits (PFDN1-6) Serve as Poor Prognostic Markers in Gastric Cancer. Biosci. Rep. 40. doi:10.1042/BSR20192712

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, G., Wang, L.-G., Han, Y., and He, Q.-Y. (2012). clusterProfiler: an R Package for Comparing Biological Themes Among Gene Clusters. OMICS: A J. Integr. Biol. 16, 284–287. doi:10.1089/omi.2011.0118

CrossRef Full Text | Google Scholar

Zhang, B., and Horvath, S. (2005). A General Framework for Weighted Gene Co-expression Network Analysis. Stat. Appl. Genet. Mol. Biol. 4, 17. doi:10.2202/1544-6115.1128

CrossRef Full Text | Google Scholar

Zhang, Y., Li, X., Zhou, D., Zhi, H., Wang, P., Gao, Y., et al. (2018). Inferences of Individual Drug Responses across Diverse Cancer Types Using a Novel Competing Endogenous RNA Network. Mol. Oncol. 12, 1429–1446. doi:10.1002/1878-0261.12181

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: gastric cancer, differential correlation, switching mechanism, WGCNA, gene network

Citation: Chen T, He Q, Xiang Z, Dou R and Xiong B (2022) Identification and Validation of Key Genes of Differential Correlations in Gastric Cancer. Front. Cell Dev. Biol. 9:801687. doi: 10.3389/fcell.2021.801687

Received: 25 October 2021; Accepted: 06 December 2021;
Published: 13 January 2022.

Edited by:

Alessia Ligresti, Istituto di Chimica Biomolecolare (CNR), Italy

Reviewed by:

Carmela De Marco, Magna Græcia University of Catanzaro, Italy
Atsushi Fukushima, Kyoto Prefectural University, Japan
Pierlorenzo Pallante, Istituto per l’Endocrinologia e l’Oncologia “Gaetano Salvatore” (CNR), Italy
Michele Biagioli, University of Perugia, Italy

Copyright © 2022 Chen, He, Xiang, Dou and Xiong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Bin Xiong, binxiong1961@whu.edu.cn

^†These authors have contributed equally to this work and share first authorship
