Skip to main content

ORIGINAL RESEARCH article

Front. Genet., 30 April 2021
Sec. Cancer Genetics and Oncogenomics

Identification of Hub Genes and Their Correlation With Immune Infiltration Cells in Hepatocellular Carcinoma Based on GEO and TCGA Databases

\r\nRui HuangRui Huang1Jinying LiuJinying Liu1Hui LiHui Li2Lierui ZhengLierui Zheng1Haojun JinHaojun Jin1Yaqing ZhangYaqing Zhang1Wei MaWei Ma1Junhong SuJunhong Su3Min WangMin Wang1Kun Yang*Kun Yang4*
  • 1College of Medicine, Northwest Minzu University, Lanzhou, China
  • 2Lanzhou Maternity and Child Health Care Hospital, Lanzhou, China
  • 3Medical Faculty, Kunming University of Science and Technology, Kunming, China
  • 4Lanzhou University Second Hospital, Lanzhou, China

Hepatocellular carcinoma (HCC) is a primary liver cancer with extremely high mortality in worldwide. HCC is hard to diagnose and has a poor prognosis due to the less understanding of the molecular pathological mechanisms and the regulation mechanism on immune cell infiltration during hepatocarcinogenesis. Herein, by performing multiple bioinformatics analysis methods, including the RobustRankAggreg (RRA) rank analysis, weighted gene co-expression network analysis (WGCNA), and a devolution algorithm (CIBERSORT), we first identified 14 hub genes (NDC80, DLGAP5, BUB1B, KIF20A, KIF2C, KIF11, NCAPG, NUSAP1, PBK, ASPM, FOXM1, TPX2, UBE2C, and PRC1) in HCC, whose expression levels were significantly up-regulated and negatively correlated with overall survival time. Moreover, we found that the expression of these hub genes was significantly positively correlated with immune infiltration cells, including regulatory T cells (Treg), T follicular helper (TFH) cells, macrophages M0, but negatively correlated with immune infiltration cells including monocytes. Among these hub genes, KIF2C and UBE2C showed the most significant correlation and were associated with immune cell infiltration in HCC, which was speculated as the potential prognostic biomarker for guiding immunotherapy.

Introduction

Liver cancer is a highly malignant tumor with a worldwide mortality rate ranking among the top three in 2020 (Sung et al., 2021). Hepatocellular carcinoma (HCC) is a primary liver cancer that originates in the liver. The development of large-scale sequencing technology and subsequent big data mining methods has made a great contribution on the research on HCC genome changes. In addition to focusing on cancer cells, research on the infiltration of host immune cells has recently become the new concerns for cancer biology. Therefore, exploring the potential prognostic biomarkers and developing immunotherapy methods are crucial for improving the survival of cancer patients (Wei et al., 2020).

The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases are the two most commonly used tumor databases. It is necessary to repeatedly mine tumor databases given the continuous updates in bioinformatics technology. RobustRankAggreg (RRA) algorithm was extensively used to screen the differentially expressed genes (DEGs) in different dataset from multiple different sequencing platforms (Kolde et al., 2012). The R package for Weighted Correlation Network Analysis (WGCNA) was used to identify co-expressed gene modules with similar expression patterns, and to explore the relationship between gene networks and phenotypes of interest (Langfelder and Horvath, 2008). In the recent years, many reports were published for investigating the hub genes of HCC(Chen et al., 2017; Li et al., 2018; Zhou et al., 2019). Chen et al. (2017) identified 6 hub genes, ABAT, AGXT, ALDH6A1, CYP4A11, DAO, and EHHADH, that were associated with HCC metastasis risk. Li et al. (2018) identified the 5 hub genes, GINS1, TOP2A, BUB1B, ARPC4, and ACADM in HCC progression with high node degree. In this study, we comprehensively applied a variety of bioinformatics algorithms to screen out 171 HCC-related genes with significantly up- or down-regulated expression levels. Then, we have used protein-protein network ranked by 12 different algorithms of cytohubba to find 14 hub genes, whose expression levels were significantly up-regulated. Among them, the high expression level of 11 genes, NDC80, DLGAP5, BUB1B, KIF20A, KIF2C, KIF11, NCAPG, PBK, FOXM1, TPX2, and PRC1, were strongly correlated with shorter overall survival time. Furthermore, we carried out an immune cell infiltration analysis by CIBERSORT and found these hub genes was significantly positively correlated with immune infiltration cells, including regulatory T cells (Treg), T follicular helper (TFH) cells, macrophages M0, but negatively correlated with immune infiltration cells including monocytes. The results further illustrate that the expression of the hub genes is associated with a poor prognosis.

Materials and Methods

The flow chart showing the overall research design and methods used for this study was shown in Supplementary Figure 1.

GEO Data Collection and Preprocessing

The GEO database was searched by using the keywords “hepatocellular carcinoma” and “liver cell carcinoma.” After filtering according to the following criteria (1) all samples are from human beings, (2) all datasets include matched cancer tissue-normal tissue samples, and (3) the dataset contains at least 20 samples, 7 microarray data sets (GSE62322, GSE112790, GSE102079, GSE14323, GSE14520, GSE89377, and GSE64041) were selected and downloaded. The detail showed in Supplementary Table 1.

TCGA Data Download and Preprocessing

Gene expression quantification data and corresponding clinical information for HCC were downloaded from The Cancer Genome Atlas Liver Hepatocellular Carcinoma (TCGA-LIHC) data collection. The 424 HTSeq-counts files comprised 371 tumor samples and 50 normal samples. Clinical information was extracted and included follow-up time and clinical status. The TCGA expression matrix was obtained by data fusion and ID transformation of raw TCGA counts data. Next, the RPKM (Reads Per Kilobase per Million mapped reads) values were calculated for the WGCNA.

We applied the “limma” package (Ritchie et al., 2015) of R software to perform normalization and base-2 logarithm conversion for the matrix data for each GEO and TCGA dataset. differentially expressed genes for each GEO and TCGA matrix were obtained by transforming expression values, and genes were sorted according to the log2FoldChange (logFC) value. Next, rank analysis was performed using the R package “RRA.” The criterion for screening DEGs is that the P < 0.05 and | logFC | > 1.

Identification of Gene Expression Modules

GEO data from the same platform was merged as follows. The GPL570 array platform included three datasets GSE62322, GSE112790, and GSE102079. GSE14323 and GSE14520 were merged into the GPL571 platform. Using the “sva” package the batch effect and other unwanted variations were removed to avoid generating less reliable results (Leek et al., 2012). Next, we selected gene expression matrixes from GPL570, GPL571, GSE89377, GSE64041, and RPKM values of TCGA data and then identified gene expression modules using the WGCNA package in R. Setting an appropriate soft-thresholding power to ensure scale-free networks, R2 = 0.9 was selected. The adjacency matrix used to construct the Topological Overlap Matrix (TOM) using TOM similarity values and module eigengenes (MEs) were clustered using the dissimilarity measure (1-TOM).

Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathways Enrichment Analysis

DEGs for GO and KEGG pathway enrichment analysis consist of two datasets. The first included 101 overlapping genes from 3 components the including integrated DEGs (RRA_diff), MEs with the strongest positive tumor correlation of GEO (GEO_positive ME) and TCGA (TCGA_positive ME), the other dataset included 70 overlapping genes integrating DEGs (RRA_diff), MEs with the strongest negative tumor correlation of GEO (GEO_ negative ME), and TCGA (TCGA_ negative ME). The merged 171 genes were retained for GO and KEGG pathway enrichment analysis using R packages. P < 0.05 was considered statistically significant.

Functional Protein Association PPI (Protein-Protein Interaction) Network Analysis

The overlapping genes were analyzed to identify potential interactions using the online STRING database (Szklarczyk et al., 2017). PPIs with the highest confidence scores ≥ 0.9 were reserved and the results were imported to Cytoscape (Shannon et al., 2003) for further complex network analysis. Moreover, to predict and explore important hub genes in the PPI network, we performed module analysis utilizing the cytohubba application with default parameters in Cytoscape. Overlapping of 12 topological algorithms were carried out using the “UpSetR” package. Finally, 14 overlapping genes were obtained.

Survival Analysis

The clinical information of patients with HCC was extracted and included follow-up time and clinical status. After removing patients with no information on overall survival (OS) data, candidate genes strongly correlated with survival were identified using the “survival” and “survminer” packages.

Correlation Between Survival-Related Candidate Genes and Immune Cells

To study the correlation between survival-related candidate genes and immune cells, we used a web server called TIMER2.01 (Taiwen et al., 2020). We detected the correlation in the expression of 14 hub genes in HCC with the levels of infiltrating immune cells, respectively, using the CIBERSORT as the deconvolution algorithm. The 22 immune cells included T cells, B cells, macrophages, dendritic cells (DCs), NK cell, monocytes, mast cells, eosinophils, and neutrophils. The correlation between them was shown using a heat map.

Infiltrating Immune Cells Between Tumor and Normal Samples

We compared the infiltrating immune cells of tumor and normal samples. The original gene expression data from TCGA were normalized as described previously (Chen et al., 2018). The normalized data was analyzed using the R package “CIBERSORT.” CIBERSORT, a deconvolution algorithm based on principles of linear support vector regression, was published in Nature Methods in 2015. It calculated the cell composition of unknown mixture based on their gene expression profiles according to known reference set LM22 (Leukocyte signature matrix). This reference dataset defines the gene expression characterizing a set of signature genes of 22 immune cell subtypes (Newman et al., 2015). The permutations (perm) of the deconvolution algorithm were set at 100. The results were filtered using a p < 0.05.

All data were processed using R language (version 4.0.2) and all statistical methods in this study were performed using corresponding R package. When the p < 0.05, results were considered statistically significant. Researchers who want the R code can contact the corresponding author (yangkun@lzu.edu.cn).

Results

Identification of Integrated GEO DEGs and TCGA DEGs

Seven microarray data sets [GSE62322 (Schulze et al., 2015), GSE112790 (Shimada et al., 2019), GSE102079 (Chiyonobu et al., 2018), GSE14323 (Mas et al., 2009), GSE14520 (Roessler et al., 2010), GSE89377, and GSE64041 (Makowska et al., 2016)] and the TCGA-LIHC dataset were downloaded and processed. Data sets from different platforms cannot be merged directly using “limma” packages for differentially expressed genes analysis. Therefore, we ranked the 8 data sets from different platforms using the RRA algorithm. Finally, 199 significantly up-regulated genes and 363 significantly down-regulated genes were identified by the RRA method. Top 20 up-regulated and down-regulated genes are shown in Figure 1. The | logFC| > 1 and adjusted p < 0.05 were considered statistically significant for the RRA DEGs.

FIGURE 1
www.frontiersin.org

Figure 1. Identification of DEGs of 7 GEO datasets and TCGA dataset using RRA. The heatmap of the top 20 up- and down-regulated DEGs in the integrated GEO datasets analysis. The rows represent the genes and the columns represent the GEO dataset. The number in the rectangle represents the log2FoldChange value. Red represents up-regulated genes, blue represents down-regulated genes.

Identification of Key Gene Expression Modules

GEO data from the same platform were merged. GSE62322, GSE112790, and GSE102079 were merged into the GPL570 platform and GSE14323 and GSE14520 were merged into the GPL571 platform. Then, we selected gene matrixes from GPL570, GPL571, GSE89377, GSE64041, and RPKM values of TCGA data and constructed a co-expression network with the WGCNA package in R. By gathering similarly expressed genes in tumor and normal tissue, mRNAs with similar expression profiles were aggregated into the different module by applying a dynamic tree cut algorithm. The module-trait relationships are shown in Figure 2. Next, 650 module eigengenes having the strongest positive tumor correlation from the GEO were merged to form GEO_positive ME including MEturquoise of GPL570, MEturquoise of GPL571, MEgrey of GSE64041, and MEgrey of GSE89377. 222 module eigengenes with the strongest tumor negative correlation were merged to form the GEO_negative ME and included MEblack of GPL570, MEbrown of GPL571, MEtan of GSE64041, and MEblack of GSE89377. The results showed in Supplementary Table 2.

FIGURE 2
www.frontiersin.org

Figure 2. Identification of module eigengenes (MEs) associated with HCC using GEO and TCGA datasets. (A,C,E,G,I) Dendrogram of DEGs clustered based on a dissimilarity measure (1–TOM). (B,D,F,H,J) Module-trait relationships. Each row corresponds to a color module and each column correlates to a clinical trait (normal and cancer). The numbers in each cell represent the corresponding correlation and P-value.

GO and KEGG Pathway Enrichment Analysis of DEGs

171 gene including 101 genes from GEO_positive ME, TCGA_positive ME, and RRA DEGs (Figure 3A) and 70 genes from GEO_negative ME, TCGA_negative ME, and RRA DEGs (Figure 3B) were merged for GO and KEGG pathway enrichment analysis. The results showed the genes were involved in the biological process (BP), cell component (CC), molecular function (MF). We found the biological processes in which these genes are mainly involved include nuclear division, chromosome segregation and organelle fission, of these, the expression products were mainly components of the chromosome, spindle and kinetochore (Figures 3C,D). KEGG analysis showed that these genes are involved in signal pathways including the cell cycle, DNA replication and pyrimidine metabolism (Figures 3E,F).

FIGURE 3
www.frontiersin.org

Figure 3. Venn diagrams for overlapping DEGs and MEs significantly related to HCC and overlapped genes for GO and KEGG pathways analysis. (A) Tumor positive related genes. GEO_ positive MEs includes MEturquoise of GPL570, MEturquoise of GPL571, MEgrey of GSE64041, and MEgrey of GSE89377. (B) Tumor negative related genes. GEO_negative MEs includes MEblack of GPL570, MEbrown of GPL571, MEtan of GSE64041, and MEblack of GSE89377. (C) The bar plot of all overlapping genes by GO biological process. (D) The bubble plot showing all overlapping genes by GO biological process (E) KEGG pathways of all overlapped genes are shown in the bar plot. (F) KEGG pathways of all overlapped genes showed in the bubble plot.

Identification of Hub Genes

In total, 171 overlapped genes were analyzed to characterize the potential protein-protein interactions using the online STRING database. PPIs with a highest confidence score ≥ 0.9 were selected and then imported to cytoscape for further complex network analysis. In addition, to predict and explore the important hub genes in the PPI network, we used cytoHubba with default parameters in cytoscape. Firstly, we found many genes are ranked same if just ranked by one algorithm (Supplementary Table 3). So, we thought that the parameters calculated using multiple algorithms maybe reflect the status of the node in the entire network from different aspects. Then, we got top 40 genes from protein-protein network ranked by 12 different algorithms of cytohubba including Degree, Density of Maximum Neighborhood Component (DMNC), Edge Percolated Component (EPC), Maximal Clique Centrality (MCC), Maximum Neighborhood Component (MNC), and centralities based on shortest paths, such as Bottleneck (BN), Closeness, EcCentricity (EC), Radiality, Betweenness, Stress, and Clustering Coefficient (CC) were intersected. Finally, 14 hub genes (degree value > 20), NDC80, DLGAP5, BUB1B, KIF20A, KIF2C, KIF11, NCAPG, NUSAP1, PBK, ASPM, FOXM1, TPX2, UBE2C, and PRC1 were obtained for further exploration (Figure 4). Details of these 14 hub genes was showed in Table 1.

FIGURE 4
www.frontiersin.org

Figure 4. UpSet diagrams of 12 topological algorithms determined by functional protein association PPI network analysis. The overlapping results of several topological algorithms including Degree, EPC, MNC, DMNC, MCC, and network centralities based on the shortest paths such as BN, EC, Closeness, Radiality, Betweenness, Stress and CC are shown.

TABLE 1
www.frontiersin.org

Table 1. Details of hub genes.

The Relationship Between Hub Gene and Overall Survival Time (OS)

The OS of the 14 hub genes was analyzed by survival and survminer package. The results demonstrated that there was an extremely significant correlation between the expression levels of DLGAP5, BUB1B, KIF20A, KIF2C, KIF11, FOXM1, and TPX2 and survival time (p < 0.001), a significant correlation existed between the expression of NDC80, NCAPG, PBK, PRC1, and survival time (p < 0.01), and a weak correlation between the expression of NUSAP1, ASPM, UBE2C, and survival (p < 0.05) (Figure 5).

FIGURE 5
www.frontiersin.org

Figure 5. Overall survival analyses of the overlapping hub genes. Analysis was performed using the survival and survminer packages in R. P-values were used to indicate significant differences.

Correlation Between Survival-Related Candidate Genes and Immune Cells

Next, we explored whether the mRNA expression level of the candidate hub genes was associated with infiltrating immune cells in HCC. Thus, we compared the expression of the 14 candidate hub genes in HCC and the infiltrating levels of immune cells using the TIMER database. The results showed that the expression level of candidate hub genes was significantly positively correlated with the infiltrating levels of immune cells including Treg cells, TFH cells, macrophages (M0), T cell CD4 + memory activated, myeloid dendritic cell resting and B cell plasma, and significantly negative correlations with immune infiltrating cells including monocytes, mast cells activated and NK cell resting. In addition, very weak negative correlations were existed between T cell CD4 + memory resting and the hub genes (Figure 6A). More interestingly, KIF2C and UBE2C showed the most significant positive correlation with Treg and a negative correlation with immune infiltration of monocytes (Supplementary Table 4). Meanwhile, we compared the fraction of infiltrating immune cells with tumor and normal tissues and found the content of macrophages M0 in tumor tissues significantly higher than that in normal tissue, while the fraction of macrophages M1 showed opposite result to macrophages M0 (Figure 6B).

FIGURE 6
www.frontiersin.org

Figure 6. The correlation between hub genes-immune cells and the fraction of infiltrated immune cells between normal tissue and tumor tissue in HCC. (A) The expression of candidate hub genes having significant positive correlations with immune infiltration cells are shown in red. The expression of candidate hub genes having significant negative correlations with immune infiltration cells are shown in blue. *p < 0.05, **p < 0.01, *** p < 0.001. (B) Fraction of infiltrated immune cells in HCC. Red represents tumor tissue and blue shows normal tissue.

Discussion

Tumorigenesis is a very complex process which includes the changes in expression of various genes and changes in the microenvironment. It is vital for tumor therapy to identify novel biomarkers and targets and to explore the diversity and complexity of the tumor immune microenvironment. At present, this complicated process can be opportunely explored because of the development of high-throughput sequencing technology and subsequent data analysis techniques. Herein, we have screened the hub genes in HCC and revealed a strong correlation between hub genes expression level and the tumor microenvironment.

We identified 14 hub genes (NDC80, DLGAP5, BUB1B, KIF20A, KIF2C, KIF11, NCAPG, NUSAP1, PBK, ASPM, FOXM1, TPX2, UBE2C, and PRC1) that were significantly associated with overall survival by RRA rank analysis and WGCNA based on TCGA and GEO databases. Previous studies showed that 14 hub gene are involved in cell cycle regulation, which is according with our results from GO and KEGG pathway enriched analysis. It has previously been reported that some of these also regulate the P53 signaling pathway (Mirgayazova et al., 2019). The hub genes have been also found to be associated with various cancers (Burum-Auensen et al., 2010; Wierstra, 2013; Luo et al., 2014; Singh et al., 2014; Schneider et al., 2017; Gao and Wang, 2019). In this study, we found the expression of the hub genes was up-regulated in cancer samples compared with normal tissues. Some hub genes, such as DLGAP5 and TPX2, whose activation was also regulated by the other up-regulated genes (Bird and Hyman, 2008; Tagal et al., 2017). Interestingly, the high expression level of NDC80, DLGAP5, BUB1B, KIF20A, KIF2C, KIF11, NCAPG, PBK, FOXM1, TPX2, and PRC1 corresponds to a short overall survival time.

The most interesting finding in this study is that there is a correlation between the expression level of hub genes and immune infiltrating cells. It is well-known that infiltrating immune cells, are a component of the tumor microenvironment and play an important role in tumor growth, invasion, and metastasis. The tumor microenvironment has diverse capacities to induce both adverse and beneficial consequences for tumorigenesis (Quail and Joyce, 2013). Our study showed that there was a significantly positive correlation between the expression of hub genes and infiltrating immune cells including Treg, TFH, and macrophages (M0), but the most significant negative correlation was with monocytes.

It has previously been reported that Tregs and TFH cells exert opposite roles in tumorigenesis. Tregs suppress antitumor immunity. In contrast, TFH cell contribute to antitumor immunity (Zhang and Zhang, 2020). Tregs are present in a variety of tumors and are physiologically involved in the maintenance of immunological self-tolerance through immunosuppressive effects, thereby allowing tumor cells to escape the body’s immune killing (Nishikawa and Sakaguchi, 2010; Finotello and Trajanoski, 2017). TFH cells are an independent subset of CD4 + T effector cells having an essential role in assisting B cell proliferation and antibody production (Ochando and Braza, 2017). We observed that the expression level of KIF2C and UBE2C are strongly positively corelatedcorrelated to the content of Tregs and TFH. It has previously been reported that T cells from patients with metastatic melanoma could recognize mutated kinesin family member 2C (KIF2C) antigen (Lu et al., 2014). But there was no significant difference between the Treg or TFH infiltrating immune cell density when comparing tumor and normal cells.

However, the density of M0 macrophages in tumor tissues was significantly increased compared with normal tissues, in contrast, the density of M1 macrophages in tumor tissues was significantly decreased. Macrophages infiltrating in tumors act as a “double-edged sword” in tumorigenesis and development. M1 type macrophages can kill tumor cells, while M2 type macrophages promote tumor growth. Our findings indicating whether it is the up-regulation in expression level of the hub gene or the decrease macrophage M1 levels, it will induce adverse consequences for tumorigenesis. Meanwhile, we observed the expression level of hub genes negatively correlated with monocyte levels. Macrophages are partially differentiated from monocytes in peripheral blood in response to a wide spectrum of growth factors and chemokines produced by stromal and tumor cells (Noy and Pollard, 2014). Monocytes in the tumor microenvironment may induce beneficial consequences for tumorigenesis. Therefore, we thought that the hub genes induced adverse consequences for tumorigenesis and can be used the potential prognostic biomarker.

Conclusion

In conclusion, our study, based on several bioinformatics algorithms, revealed hub genes and their correlation with immune infiltration cells in HCC and our comprehensive analysis identified associations between 22 immune cells subpopulations. We found 11 genes, NDC80, DLGAP5, BUB1B, KIF20A, KIF2C, KIF11, NCAPG, PBK, FOXM1, TPX2, and PRC1, were strongly associated with poor prognosis. Nonetheless, our findings still require further experimental validation in the future.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author Contributions

RH and KY conceived and designed the study and wrote the manuscript. RH, JL, HL, LZ, YZ, HJ, WM, JS, and MW analyzed the data. RH, JL, HL, LZ, HJ, YZ, WM, JS, MW, and KY revised the manuscript. All authors have read and approved the final version of the manuscript.

Funding

The work was supported by the Fundamental Research Funds for the Central Universities of China (Grants #31920190107, 31920210005, 31920200001, and 31920160069).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2021.647353/full#supplementary-material

Supplementary Figure 1 | Flowchart detailing the overall study design and samples at each stage of analysis.

Supplementary Table 1 | Information for the seven GEO datasets included in the current study.

Supplementary Table 2 | MEs related to HCC.

Supplementary Table 3 | Top 40 genes from protein-protein network ranked by 12 different algorithms of cytohubba including Degree, DMNC, EPC, MCC, MNC, Bottleneck, Closeness, EcCentricity, Radiality, Betweenness, Stress and Clustering Coefficient.

Supplementary Table 4 | The correlation between survival-related candidate genes and immune cells. Spearman’s Rho and p-values were calculated using the online web resource TIMER 2.0.

Abbreviations

HCC, Hepatocellular carcinoma; RRA, RobustRankAggreg; WGCNA, weighted gene co-expression network analysis; Treg, regulatory T cell; TFH T follicular helper cells; LM22, Leukocyte signature matrix, GEO, Gene Expression Omnibus; DEGs, Differentially expressed genes; TCGA, The Cancer Genome Atlas; RPKM, Reads Per Kilobase per Million mapped reads; TOM, Topological Overlap Matrix; MEs, module eigengenes; EPC, Edge Percolated Component; MNC, Maximum Neighborhood Component; DMNC, Density of Maximum Neighborhood Component; MCC, Maximal Clique Centrality; BN, Bottleneck; CC, Clustering Coefficient; OS, overall survival; BP, biological process; CC, cell component; MF, molecular function.

Footnote

  1. ^ https://cistrome.shinyapps.io/timer/

References

Abe, Y., Matsumoto, S., Kito, K., and Ueda, N. (2000). Cloning and expression of a novel MAPKK-like protein kinase, lymphokine-activated killer T-cell-originated protein kinase, specifically expressed in the testis and activated lymphoid cells. J. Biol. Chem. 275, 21525–21531. doi: 10.1074/jbc.M909629199

PubMed Abstract | CrossRef Full Text | Google Scholar

Bakhoum, S. F., Thompson, S. L., Manning, A. L., and Compton, D. A. (2009). Genome stability is ensured by temporal control of kinetochore-microtubule dynamics. Nat. Cell Biol. 11, 27–35. doi: 10.1038/ncb1809

PubMed Abstract | CrossRef Full Text | Google Scholar

Bird, A. W., and Hyman, A. A. (2008). Building a spindle of the correct length in human cells requires the interaction between TPX2 and Aurora A. J. Cell Biol. 182, 289–300. doi: 10.1083/jcb.200802005

PubMed Abstract | CrossRef Full Text | Google Scholar

Burum-Auensen, E., Deangelis, P. M., Schjølberg, A. R., Røislien, J., Mjåland, O., and Clausen, O. P. F. (2010). Reduced level of the spindle checkpoint protein BUB1B is associated with aneuploidy in colorectal cancers. Cell Prolif. 41, 645–659. doi: 10.1111/j.1365-2184.2008.00539.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, B., Khodadoust, M. S., Liu, C. L., Newman, A. M., and Alizadeh, A. A. (2018). Profiling tumor infiltrating immune cells with CIBERSORT. Methods Mol. Biol. 1711, 243–259. doi: 10.1007/978-1-4939-7493-1_12

CrossRef Full Text | Google Scholar

Chen, P., Wang, F., Feng, J., Zhou, R., and Zhao, Q. (2017). Co-expression network analysis identified six hub genes in association with metastasis risk and prognosis in hepatocellular carcinoma. Oncotarget 8, 48948–48958. doi: 10.18632/oncotarget.16896

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, R. H. (2002). BubR1 is essential for kinetochore localization of other spindle checkpoint proteins and its phosphorylation requires Mad1. J. Cell Biol. 158, 487–496. doi: 10.1083/jcb.200204048

PubMed Abstract | CrossRef Full Text | Google Scholar

Chiyonobu, N., Shimada, S., Akiyama, Y., Mogushi, K., Itoh, M., Akahoshi, K., et al. (2018). Fatty Acid Binding Protein 4 (FABP4) overexpression in intratumoral hepatic stellate cells within hepatocellular carcinoma with metabolic risk factors. Am. J. Pathol. 188, 1213–1224. doi: 10.1016/j.ajpath.2018.01.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Ciferri, C., Pasqualato, S., Screpanti, E., Varetti, G., Santaguida, S., Dos Reis, G., et al. (2008). Implications for kinetochore-microtubule attachment from the structure of an engineered Ndc80 complex. Cell 133, 427–439. doi: 10.1016/j.cell.2008.03.020

PubMed Abstract | CrossRef Full Text | Google Scholar

Finotello, F., and Trajanoski, Z. (2017). New strategies for cancer immunotherapy: targeting regulatory T cells. Genome Med. 9:10.

Google Scholar

Gao, L. W., and Wang, G. L. (2019). miR-190, CDK1, MCM10 and NDC80 predict the prognosis of the patients with lung cancer. Rev. Rom. Med. Lab. 27, 15–24. doi: 10.2478/rrlm-2019-0001

CrossRef Full Text | Google Scholar

Jiang, W., Jimenez, G., Wells, N. J., Hope, T. J., Wahl, G. M., Hunter, T., et al. (1998). PRC1: a human mitotic spindle-associated CDK substrate protein required for cytokinesis. Mol. Cell 2, 877–885. doi: 10.1016/s1097-2765(00)80302-0

CrossRef Full Text | Google Scholar

Kimura, K., Cuvier, O., and Hirano, T. (2001). Chromosome condensation by a human condensin complex in Xenopus egg extracts. J. Biol. Chem. 276, 5417–5420. doi: 10.1074/jbc.C000873200

PubMed Abstract | CrossRef Full Text | Google Scholar

Kolde, R., Laur, S., Adler, P., and Vilo, J. (2012). Robust rank aggregation for gene list integration and meta-analysis. Bioinformatics 28, 573–580. doi: 10.1093/bioinformatics/btr709

PubMed Abstract | CrossRef Full Text | Google Scholar

Kouprina, N., Pavlicek, A., Collins, N. K., Nakano, M., Noskov, V. N., Ohzeki, J., et al. (2005). The microcephaly ASPM gene is expressed in proliferating tissues and encodes for a mitotic spindle protein. Hum. Mol. Genet. 14, 2155–2165. doi: 10.1093/hmg/ddi220

PubMed Abstract | CrossRef Full Text | Google Scholar

Langfelder, P., and Horvath, S. (2008). WGCNA: an R package for weighted correlation network analysis. BMC Bioinform. 9:559. doi: 10.1186/1471-2105-9-559

PubMed Abstract | CrossRef Full Text | Google Scholar

Laoukili, J., Kooistra, M. R. H., Brás, A., Kauw, J., Kerkhoven, R. M., Morrison, A., et al. (2005). FoxM1 is required for execution of the mitotic programme and chromosome stability. Nat. Cell Biol. 7, 126–136. doi: 10.1038/ncb1217

PubMed Abstract | CrossRef Full Text | Google Scholar

Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E., and Storey, J. D. (2012). The SVA package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics 28, 882–883. doi: 10.1093/bioinformatics/bts034

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Y., Zhihui, C., Baoan, Z., and Cunshuan, X. (2018). Identification of key pathways and genes in the dynamic progression of HCC based on WGCNA. Genes 9:92. doi: 10.3390/genes9020092

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, Y.-C., Yao, X., Crystal, J. S., Li, Y. F., El-Gamil, M., Gross, C., et al. (2014). Efficient identification of mutated cancer antigens recognized by T cells associated with durable tumor regressions. Clin. Cancer Res. 20:3401. doi: 10.1158/1078-0432.CCR-14-0433

PubMed Abstract | CrossRef Full Text | Google Scholar

Luo, Q., Lei, B., Liu, S., Chen, Y., and Shen, H. (2014). Expression of PBK/TOPK in cervical cancer and cervical intraepithelial neoplasia. Int. J. Clin. Exp. Pathol. 7, 8059–8064.

Google Scholar

Makowska, Z., Boldanova, T., Adametz, D., Quagliata, L., Vogt, J. E., Dill, M. T., et al. (2016). Gene expression analysis of biopsy samples reveals critical limitations of transcriptome-based molecular classifications of hepatocellular carcinoma. J. Pathol. Clin. Res. 2, 80–92. doi: 10.1002/cjp2.37

PubMed Abstract | CrossRef Full Text | Google Scholar

Mas, V. R., Maluf, D. G., Archer, K. J., Yanek, K., Kong, X., Kulik, L., et al. (2009). Genes involved in viral carcinogenesis and tumor initiation in hepatitis C virus-induced hepatocellular carcinoma. Mol. Med. 15, 85–94. doi: 10.2119/molmed.2008.00110

PubMed Abstract | CrossRef Full Text | Google Scholar

Mirgayazova, R., Khadiullina, R., Mingaleeva, R., Chasov, V., Gomzikova, M., Garanina, E., et al. (2019). Novel isatin-based activator of p53 transcriptional functions in tumor cells. Mol. Biol. Res. Commun. 8, 119–128. doi: 10.22099/mbrc.2019.34179.1419

PubMed Abstract | CrossRef Full Text | Google Scholar

Neef, R., Preisinger, C., Sutcliffe, J., Kopajtich, R., Nigg, E. A., Mayer, T. U., et al. (2003). Phosphorylation of mitotic kinesin-like protein 2 by polo-like kinase 1 is required for cytokinesis. J. Cell Biol. 162, 863–875. doi: 10.1083/jcb.200306009

PubMed Abstract | CrossRef Full Text | Google Scholar

Newman, A. M., Liu, C. L., Green, M. R., Gentles, A. J., Feng, W., Xu, Y., et al. (2015). Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12, 453–457. doi: 10.1038/nmeth.3337

PubMed Abstract | CrossRef Full Text | Google Scholar

Nishikawa, H., and Sakaguchi, S. (2010). Regulatory T cells in tumor immunity. Int. J. Cancer 127, 759–767. doi: 10.1002/ijc.25429

PubMed Abstract | CrossRef Full Text | Google Scholar

Noy, R., and Pollard, J. W. (2014). Tumor-associated macrophages: from mechanisms to therapy. Immunity 41, 49–61. doi: 10.1016/j.immuni.2014.06.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Ochando, J., and Braza, M. S. (2017). T follicular helper cells: a potential therapeutic target in follicular lymphoma. Oncotarget 8, 112116–112131. doi: 10.18632/oncotarget.22788

PubMed Abstract | CrossRef Full Text | Google Scholar

Quail, D. F., and Joyce, J. A. (2013). Microenvironmental regulation of tumor progression and metastasis. Nat. Med. 19, 1423–1437. doi: 10.1038/nm.3394

PubMed Abstract | CrossRef Full Text | Google Scholar

Raemaekers, T., Ribbeck, K., Beaudouin, J., Annaert, W., Van Camp, M., Stockmans, I., et al. (2003). NuSAP, a novel microtubule-associated protein involved in mitotic spindle organization. J. Cell Biol. 162, 1017–1029. doi: 10.1083/jcb.200302129

PubMed Abstract | CrossRef Full Text | Google Scholar

Rapley, J., Nicolàs, M., Groen, A., Regué, L., Bertran, M. T., Caelles, C., et al. (2008). The NIMA-family kinase Nek6 phosphorylates the kinesin Eg5 at a novel site necessary for mitotic spindle formation. J. Cell Sci. 121(Pt. 23), 3912–3921. doi: 10.1242/jcs.035360

PubMed Abstract | CrossRef Full Text | Google Scholar

Ritchie, M. E., Phipson, B., Wu, D., Hu, Y., Law, C. W., Shi, W., et al. (2015). limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43:e47. doi: 10.1093/nar/gkv007

PubMed Abstract | CrossRef Full Text | Google Scholar

Roessler, S., Jia, H. L., Budhu, A., Forgues, M., Ye, Q. H., Lee, J. S., et al. (2010). A unique metastasis gene signature enables prediction of tumor relapse in early-stage hepatocellular carcinoma patients. Cancer Res. 70, 10202–10212. doi: 10.1158/0008-5472.can-10-2607

PubMed Abstract | CrossRef Full Text | Google Scholar

Schneider, M., Christopoulos, P., Muley, T., Warth, A., Klingmueller, U., Thomas, M., et al. (2017). AURKA, DLGAP5, TPX2, KIF11 and CKAP5: five specific mitosis-associated genes correlate with poor prognosis for non-small cell lung cancer patients. Int. J. Oncol. 50, 365–372. doi: 10.3892/ijo.2017.3834

PubMed Abstract | CrossRef Full Text | Google Scholar

Schulze, K., Imbeaud, S., Letouzé, E., Alexandrov, L. B., Calderaro, J., Rebouissou, S., et al. (2015). Exome sequencing of hepatocellular carcinomas identifies new mutational signatures and potential therapeutic targets. Nat. Genet. 47, 505–511. doi: 10.1038/ng.3252

PubMed Abstract | CrossRef Full Text | Google Scholar

Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., et al. (2003). Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks. Abington: Routledge.

Google Scholar

Shimada, S., Mogushi, K., Akiyama, Y., Furuyama, T., Watanabe, S., Ogura, T., et al. (2019). Comprehensive molecular and immunological characterization of hepatocellular carcinoma. EBioMedicine 40, 457–470. doi: 10.1016/j.ebiom.2018.12.058

PubMed Abstract | CrossRef Full Text | Google Scholar

Singh, P. K., Srivastava, A. K., Dalela, D., Rath, S. K., Goel, M. M., Bhatt, M. L., et al. (2014). Expression of PDZ-binding kinase/T-LAK cell-originated protein kinase (PBK/TOPK) in human urinary bladder transitional cell carcinoma. Immunobiology 219, 469–474. doi: 10.1016/j.imbio.2014.02.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Sung, H., Ferlay, J., Siegel, R. L., Laversanne, M., Soerjomataram, I., Jemal, A., et al. (2021). Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 1–41. doi: 10.3322/caac.21660

PubMed Abstract | CrossRef Full Text | Google Scholar

Surhone, L. M., Tennoe, M. T., Henssonow, S. F., and Macromolecule, and Acid, N. (2011). DLGAP5. J.Opt. B Quantum Semiclass Opt. 26, 1–29.

Google Scholar

Szklarczyk, D., Morris, J. H., Cook, H., Kuhn, M., Wyder, S., Simonovic, M., et al. (2017). The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible. Nucleic Acids Res. 45, D362–D368. doi: 10.1093/nar/gkw937

PubMed Abstract | CrossRef Full Text | Google Scholar

Tagal, V., Wei, S., Zhang, W., Brekken, R. A., Posner, B. A., Peyton, M., et al. (2017). SMARCA4-inactivating mutations increase sensitivity to aurora kinase A inhibitor VX-680 in non-small cell lung cancers. Nat. Commun. 8:14098.

Google Scholar

Taiwen, L., Jingxin, F., Zexian, Z., David, C., Jing, L., Qianming, C., et al. (2020). TIMER2.0 for analysis of tumor-infiltrating immune cells. Nuclc Acids Res. 48:6.

Google Scholar

Townsley, F. M., Aristarkhov, A., Beck, S., Hershko, A., and Ruderman, J. V. (1997). Dominant-negative cyclin-selective ubiquitin carrier protein E2-C/UbcH10 blocks cells in metaphase. Proc. Natl. Acad. Sci. U.S.A. 94, 2362–2367. doi: 10.1073/pnas.94.6.2362

PubMed Abstract | CrossRef Full Text | Google Scholar

Wei, S., Chen, J., Huang, Y., Sun, Q., Wang, H., Liang, X., et al. (2020). Identification of hub genes and construction of transcriptional regulatory network for the progression of colon adenocarcinoma hub genes and TF regulatory network of colon adenocarcinoma. J. Cell. Physiol. 235, 2037–2048. doi: 10.1002/jcp.29067

PubMed Abstract | CrossRef Full Text | Google Scholar

Wierstra, I. (2013). FOXM1 (Forkhead box M1) in tumorigenesis: overexpression in human cancer, implication in tumorigenesis, oncogenic functions, tumor-suppressive properties, and target of anticancer therapy - sciencedirect. Adv. Cancer Res. 119, 191–419. doi: 10.1016/b978-0-12-407190-2.00016-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Y., and Zhang, Z. (2020). The history and advances in cancer immunotherapy: understanding the characteristics of tumor-infiltrating immune cells and their therapeutic implications. Cell. Mol. Immunol. 17(Suppl.), 807–821. doi: 10.1038/s41423-020-0488-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, Z., Li, Y., Hao, H., Wang, Y., and Chu, X. (2019). Screening hub genes as prognostic biomarkers of hepatocellular carcinoma by bioinformatics analysis. Cell Transplant. 28, 76S–86S.

Google Scholar

Keywords: hepatocellular carcinoma, weighted gene co-expression network analysis, RobustRankAggreg, CIBERSORT, immune infiltration

Citation: Huang R, Liu J, Li H, Zheng L, Jin H, Zhang Y, Ma W, Su J, Wang M and Yang K (2021) Identification of Hub Genes and Their Correlation With Immune Infiltration Cells in Hepatocellular Carcinoma Based on GEO and TCGA Databases. Front. Genet. 12:647353. doi: 10.3389/fgene.2021.647353

Received: 29 December 2020; Accepted: 06 April 2021;
Published: 30 April 2021.

Edited by:

Wei-Chien Huang, China Medical University, Taiwan

Reviewed by:

Emil Bulatov, Kazan Federal University, Russia
Myvizhi Esai Selvan, Icahn School of Medicine at Mount Sinai, United States

Copyright © 2021 Huang, Liu, Li, Zheng, Jin, Zhang, Ma, Su, Wang and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Kun Yang, yangkun@lzu.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.