Skip to main content

ORIGINAL RESEARCH article

Front. Genet., 30 September 2020
Sec. Computational Genomics
This article is part of the Research Topic Bioinformatics Analysis of Omics Data for Biomarker Identification in Clinical Research View all 45 articles

Screening and Identification of Potential Biomarkers in Hepatitis B Virus-Related Hepatocellular Carcinoma by Bioinformatics Analysis

\r\nXian-Chang Zeng&#x;Xian-Chang Zeng1†Lu Zhang&#x;Lu Zhang1†Wen-Jun LiaoWen-Jun Liao1Lu AoLu Ao2Ze-Man LinZe-Man Lin2Wen KangWen Kang1Wan-Nan Chen,*Wan-Nan Chen1,3*Xu Lin,*Xu Lin1,3*
  • 1Key Laboratory of Gastrointestinal Cancer, Ministry of Education, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China
  • 2Fujian Key Laboratory of Medical Bioinformatics, Department of Bioinformatics, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China
  • 3Fujian Key Laboratory of Tumor Microbiology, Department of Medical Microbiology, Fujian Medical University, Fuzhou, China

Hepatocellular carcinoma (HCC) is one of the most lethal cancers globally. Hepatitis B virus (HBV) infection might cause chronic hepatitis and cirrhosis, leading to HCC. To screen prognostic genes and therapeutic targets for HCC by bioinformatics analysis and determine the mechanisms underlying HBV-related HCC, three high-throughput RNA-seq based raw datasets, namely GSE25599, GSE77509, and GSE94660, were obtained from the Gene Expression Omnibus database, and one RNA-seq raw dataset was acquired from The Cancer Genome Atlas (TCGA). Overall, 103 genes were up-regulated and 127 were down-regulated. A protein–protein interaction (PPI) network was established using Cytoscape software, and 12 pivotal genes were selected as hub genes. The 230 differentially expressed genes and 12 hub genes were subjected to functional and pathway enrichment analyses, and the results suggested that cell cycle, nuclear division, mitotic nuclear division, oocyte meiosis, retinol metabolism, and p53 signaling-related pathways play important roles in HBV-related HCC progression. Further, among the 12 hub genes, kinesin family member 11 (KIF11), TPX2 microtubule nucleation factor (TPX2), kinesin family member 20A (KIF20A), and cyclin B2 (CCNB2) were identified as independent prognostic genes by survival analysis and univariate and multivariate Cox regression analysis. These four genes showed higher expression levels in HCC than in normal tissue samples, as identified upon analyses with Oncomine. In addition, in comparison with normal tissues, the expression levels of KIF11, TPX2, KIF20A, and CCNB2 were higher in HBV-related HCC than in HCV-related HCC tissues. In conclusion, our results suggest that KIF11, TPX2, KIF20A, and CCNB2 might be involved in the carcinogenesis and development of HBV-related HCC. They can thus be used as independent prognostic genes and novel biomarkers for the diagnosis of HBV-related HCC and development of pertinent therapeutic strategies.

Introduction

Liver cancer is the most common type of cancer across the world, accounting for 8.2% of cancer deaths (Bray et al., 2018). Hepatocellular carcinoma (HCC) is the most common primary liver malignancy and the leading cause of liver cancer-related deaths globally (Venook et al., 2010). HCC is difficult to diagnose at an early stage and challenging to treat. It can be caused by several risk agents, such as chronic infection with hepatitis B virus (HBV) or hepatitis C virus (HCV), and exposure to alcohol and aflatoxins (Wang et al., 2002; El-Serag and Rudolph, 2007). In Asian countries, most cases of HCC are associated with chronic HBV infection (Beasley et al., 1981). HCC is associated with high recurrence and drug resistance; thus, it is urgent to identify potential biomarkers during chronic HBV infection to precisely predict HCC progression and to determine better therapeutic targets. HBV-induced HCC involves a complex, gradual process and includes the integration of HBV DNA into host cell DNA (Wang et al., 2017). HBV proteins, including HB× and MHBSt, have oncogenic potential themselves; in addition, some oncogenes in hepatocytes are potentially regulated by HBV proteins via protein-protein interactions, participating in the initiation and progression of HBV-induced HCC (Levrero and Zucman-Rossi, 2016). However, the molecular mechanisms underlying the initiation, progression and metastasis of HBV-induced HCC remain far from being fully understood.

In recent years, the exploration of genes related to carcinogenesis and development of HCC by bioinformatics methods have been increasing. TP53 (Kan et al., 2013), UBE3C (Jiang et al., 2014), SHP-1 (Wen et al., 2018), COL1A1 (Ma et al., 2019), CD5L, and SLC22A10 (Zhang et al., 2019) have been reported to be potential therapeutic targets of HCC by high-throughput sequencing-based bioinformatics analysis. The NCBI Gene Expression Omnibus (GEO) and the Cancer Genome Atlas (TCGA) databases, which provide comprehensive profiles of gene expression data, have been extensively applied to investigate the carcinogenesis of HCC by bioinformatics mining. Further, the potential molecular mechanisms underlying HBV-related HCC can be speculated via hub genes identification by bioinformatics analysis. In the present study, three high-throughput RNA-Seq-based raw datasets from the GEO database and one dataset from TCGA were downloaded, and these included 97 normal, 47 HBV-related HCC, and 374 HCC specimens. We identified 230 differentially expressed genes (DEGs) and 12 hub genes. Among the 12 hub genes, kinesin family member 11 (KIF11), TPX2 microtubule nucleation factor (TPX2), kinesin family member 20A (KIF20A), and cyclin B2 (CCNB2) were found to be independent prognostic markers of HBV-related HCC. We believe that our results should help us better comprehend the mechanisms underlying HBV-related HCC and facilitate the identification of potential targets for the diagnosis and treatment of HCC.

Materials and Methods

Raw RNA-seq Dataset Collection

For screening DEGs, three high-throughput RNA-seq-based raw datasets, namely GSE25599 (Huang et al., 2011), GSE94660 (Yoo et al., 2017), and GSE77509 (Yang et al., 2017), which comprised patients with HBV infection, were downloaded from the NCBI GEO database1. GSE94660 (21 paired normal and HBV-related HCC tissue samples) and GSE77509 (16 paired normal and HBV-related HCC tissue samples) were established using GPL16791 Illumina HiSeq 2500 (Homo sapiens), while GSE25599 (10 paired normal and HBV-related HCC tissue samples) was established using the GPL9052 Illumina Genome Analyzer. RNA-seq raw data and clinical data of 50 normal samples and 374 HCC samples2 were downloaded from TCGA3. For the validation of independent prognostic genes, a dataset named LIRI-JP4, including 202 normal and 243 HCC tissue samples, was downloaded from the International Cancer Genome Consortium (ICGC)5. For comparing the expression levels of independent prognostic genes between HCV- and HBV-related HCC, the GSE69715 dataset [66 normal and 37 HCV-related HCC tissue samples established using GLP570 (HG-U133_Plus_2)] was downloaded from the GEO database.

Data Processing and DEGs Screening

Gene expression profile matrix files of GSE25599, GSE77509, and GSE94660 were obtained from raw datasets using Perl (de Hoon et al., 2004). Nevertheless, the gene expression profile matrix data of TCGA was acquired using the “TCGAbiolinks” R package (Colaprico et al., 2016) and Perl. Genes that were differentially expressed between normal and HBV-related HCC tissue samples were screened by the limma R (Ritchie et al., 2015) and edgeR R packages (Robinson et al., 2010). | Log2(FC)| ≥ 1.0, p-value ≤ 0.05, and FDR ≤ 0.05 were set as the cutoff criteria for DEGs screening after background correction and data normalization. Overlapped DEGs among GSE25599, GSE77509, GSE94660, and TCGA were identified using the VennDiagram R package (Chen and Boutros, 2011). The heatmaps of DEGs, which could be divided into up- and down-regulated groups, were drawn using the “pheatmap” R package (Galili et al., 2018).

Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathway Enrichment Analyses

The names of DEGs were translated into gene IDs using the R programming language. To investigate the biological pathways that might be involved in the occurrence and development of HBV infection and HCC, candidate DEGs were segregated into up- and down-regulated groups and subjected to pathway enrichment analysis. Gene Ontology (GO) analysis, which involved three categories, namely molecular functions (MF), cellular components (CC), and biological processes (BP), and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis were performed with the threshold of FDR-value < 0.05 using the clusterProfiler R package (Yu et al., 2012), which facilitated biological terminology classification and gene cluster enrichment.

Protein–Protein Interaction (PPI) Network Analysis and Hub Gene Screening

A protein–protein interaction (PPI) network of DEGs was constructed using STRING6 (Szklarczyk et al., 2019) and visualized with Cytoscape v3.6.1 (Shannon et al., 2003). DEGs that consisted of several important nodes with many other interaction partners were analyzed using Molecular Complex Detection (Bader and Hogue, 2003) and CytoHubba (Chin et al., 2014; Tang et al., 2019). Subnets of the vast protein interaction network were extracted by calculating the degree of nodes, and highly connected nodes with a degree score of >45 and p-value < 0.05 were identified as hub genes.

Survival Analysis of Hub Genes

Survival analysis were primarily performed using clinical data from TCGA to predict the prognostic value of hub genes. Kaplan–Meier survival curves of hub genes were plotted using the survival R package7 and differences in survival rate were evaluated with a log-rank test threshold of p-value < 0.05. To evaluate the accuracy of the survival curves, receiver operating characteristic (ROC) curves were then constructed using the “survival ROC” R package (Huang et al., 2017) with the threshold of AUC ≥ 0.6. Next, Cox proportional-hazards models were used to estimate the effects of prognostic factors on survival using the survival R and “survminer” R packages8 with the threshold of p-value < 0.05. Univariate Cox analysis was first performed to screen for genes significantly associated with overall survival rate, and multivariate Cox analysis was then performed to identify independent prognostic genes (Orimo et al., 2008; Uemura et al., 2009).

Expression Analysis of Independent Prognostic Genes for HCC Using TCGA Dataset

To validate independent prognostic genes for HCC screened by survival analyses, the aforementioned TCGA clinical data were used to analyze individual gene expression levels between normal and HCC tissue specimens at different stages of tumor progression using the “ggpubr” R package9. Data pertaining to normal and HCC tissue samples were compared using Wilcoxon test, and those pertinent to multiple samples from different stages of tumor progression were compared using the Kruskal–Wallis test, with the threshold of p-value < 0.05.

Validation of Potential Prognostic Biomarkers in HCC Using a Dataset From the International Cancer Genome Consortium (ICGC)

To further evaluate the clinical value of the independent prognostic genes, a dataset of patients with HCC was downloaded from the ICGC portal (see text footnote 4) for survival and ROC curve analyses; for this purpose, we used the survival R package, survival ROC R package, and Perl. Ultimately, a meta-analysis of the independent prognostic genes in Oncomine10 (Rhodes et al., 2004), a cancer-profiling database containing published data and listing differential gene expression analyses, were performed to verify their expression levels in patients with HCC using four published data (Chen et al., 2002; Wurmbach et al., 2007; Roessler et al., 2010).

Correlation Analysis of Potential Prognostic Biomarkers in HCC

To analyze the potential relationships among the four independent prognostic genes in HCC occurrence and development, TCGA dataset was subjected to correlation analyses using the corrplot11 R software. The correlation coefficient (Cor), ranging from −1 (perfect negative correlation) to +1 (perfect positive correlation), indicated how closely data in a scatterplot were arranged along a straight line. p-value < 0.05 for the coefficients indicates a statistically significant relationship.

Expression Levels of Potential Prognostic Biomarkers in HCV- and HBV-Related HCC

Since viral Hepatitis B and Hepatitis C are the most commonly implicated risk factors for HCC, to compare the expression levels of the independent prognostic genes between HCV-related HCC and HBV-related HCC, the GSE69715 dataset for HCV and the GSE94660 dataset for HBV were analyzed using gglpot212, cowplot13, and ggpubr14 package. The relative expression levels (i.e., fold change) of these four genes in the tumor tissues comparing with normal tissues were calculated. Wilcoxon test was carried out between the HCC and normal tissues. p-value < 0.05 indicate statistical significance.

Results

Identification of DEGs

Differentially expressed genes were identified from three raw datasets, namely GSE25599, GSE94660, and GSE77509, downloaded from the NCBI GEO database and one downloaded dataset from TCGA database using the limma R package and edgeR R package. The cutoff criteria were | log2(FC)| ≥ 1.0, p-value ≤ 0.05 and FDR ≤ 0.05. In total, 230 DEGs were overlapping among the four datasets, of which 103 were up-regulated (Figure 1A) and 127 were down-regulated (Figure 1B). Cluster heatmaps showing the expression levels of the 230 DEGs in each of the four datasets were generated (Figures 1C–F). Details of the top 20 up- and 20 down-regulated DEGs in HBV-related HCC are shown in Supplementary Table S1.

FIGURE 1
www.frontiersin.org

Figure 1. Venn diagram and cluster heatmap of differentially expressed genes. (A,B) Venn diagram showing overlapping DEGs. DEGs were screened based on the following criteria: | Log2(FC)| ≥ 1.0, p-value ≤ 0.05, and FDR ≤ 0.05. In total, 230 DEGs were overlapping among the GSE25599, GSE94660, GSE77509, and TCGA datasets; of these, 103 genes were up-regulated and 127 were down-regulated. (C–F) Cluster heatmaps of DEGs in the four datasets. Colors indicate gene expression levels. Red represents up-regulated genes, and green represents down-regulated genes. For GSE25599, GSE94660, GSE77509 datasets, all of the samples are shown in the heatmaps. However, for TCGA dataset, 46 samples (23 normal and 23 HCC tissue samples) were randomly selected from 424 samples for display convenience.

Pathway Enrichment Analysis of DEGs

To investigate the functional annotation of DEGs, GO, and KEGG pathway enrichment analyses were performed. The results were considered to be statistically significant if FDR value was <0.05. The top 15 GO terms of up-regulated genes are listed in Supplementary Table S2. As evident from Figure 2A and Supplementary Table S2, in the MF, CC, and BP categories, the up-regulated genes were significantly enriched in nuclear division, organelle fission, and mitotic nuclear division; spindle, chromosomal region, and spindle pole; and protein kinase binding, enzyme binding, and chromatin binding, respectively. Further, KEGG pathway analysis of the up-regulated genes indicated that they were primarily enriched in cell cycle, p53 signaling pathway, and oocyte meiosis (Figure 2C and Supplementary Table S3).

FIGURE 2
www.frontiersin.org

Figure 2. Functional GO and KEGG pathway enrichment analysis of DEGs. DEGs were subjected to GO and KEGG pathway enrichment analyses with the criteria of FDR <0.05. (A,B) Plots of significantly enriched GO terms of up- and down-regulated genes for molecular function (MF), cellular components (CC), and biological processes (BP). (C,D) Significantly enriched KEGG pathway terms of up- and down-regulated genes. The y-axis shows GO category or KEGG pathway, and the x-axis shows gene ratio for each individual category. Count represents the number of genes enriched in the corresponding category. –log10 (FDR) represents the logarithm of adjusted p-value. The smaller the FDR, the deeper the red color; and the larger the FDR, the deeper the blue color.

As evident from Figure 2B,D and Supplementary Tables S4, S5, in the MF, CC, and BP categories, the down-regulated genes were mainly involved in small molecule catabolic process, organic acid catabolic process, carboxylic acid catabolic process, extracellular matrix, collagen-containing extracellular matrix, collagen trimer, cofactor binding, iron ion binding, and monooxygenase activity, respectively. Moreover, KEGG pathway analysis of the down-regulated genes indicated that they were enriched in retinol metabolism, arachidonic acid metabolism, and drug metabolism–cytochrome P450.

PPI Network Construction of DEGs and Identification of Hub Genes

A PPI network of DEGs (Figure 3A) containing 230 nodes and 1189 edges was constructed by STRING and visualized by Cytoscape, which provides critical assessment and integration of protein-protein interactions, including direct (physical) and indirect (functional) correlations. Pivotal modules of the network were obtained using Molecular Complex Detection, and the degree of nodes was calculated using CytoHubba. In the PPI network, the number of edges involved determines the degree score of nodes; the nodes with high degree scores were considered to be hub genes (Chin et al., 2014). 54 DEGs with a degree score of >10 and p-value < 0.05 are listed in Supplementary Figure S1. There were 39 genes with degree scores of >30, 24 genes with degree scores of >40, 12 genes with degree scores of >45, and only one gene with degree score of >50, and all of these genes meet the requirements of p-value < 0.05. The modules with 39 nodes and 698 edges (degree score >30 and p-value < 0.05) were extracted to construct a subnet (Figure 3B). The most significant modules of 12 genes (degree score >45 and p-value < 0.05; Figure 3C) were identified as hub genes. The names, abbreviations, and scores of hub genes are summarized in Supplementary Table S6. The top five hub genes with the highest interaction node degrees were CDK1, CCNB1, CCNA2, BUB1B, and CCNB2, implying their potential roles in the development of HBV-related HCC.

FIGURE 3
www.frontiersin.org

Figure 3. Protein–protein interaction network construction and hub genes identification. (A) PPI network of 230 DEGs was visualized using Cytoscape. Red color represents up-regulated genes, and blue color represents down-regulated genes. (B) A subnet with 39 nodes and 698 edges was extracted from the PPI network using Molecular Complex Detection and CytoHubba based on the following criteria: degree score >30 and p-value < 0.05. (C) Hub genes and their co-expression network. Twelve pivotal genes were identified as hub genes using CytoHubba, according to degree score >45 and p-value < 0.05. (D) GO enrichment analysis of the 12 hub genes. (E) KEGG pathway enrichment analysis of the 12 hub genes.

Gene Ontology and KEGG pathway enrichment analyses were utilized to investigate the functional enrichment of the 12 hub genes. In the MF, CC, and BP categories, these 12 hub genes were mainly enriched in nuclear division, organelle fission, mitotic nuclear division, spindle, spindle pole, condensed chromosome, protein kinase binding, protein C-terminus binding, and protein serine/threonine kinase activity, respectively (Figure 3D and Supplementary Table S7). Further, KEGG pathway analysis for the hub genes (Figure 3E and Supplementary Table S8) indicated that they were primarily enriched in cell cycle, progesterone-mediated oocyte maturation, and oocyte meiosis.

Survival Analysis of Hub Genes

It is noteworthy that all the 12 hub genes were up-regulated in patients with HBV-related HCC. To explore their prognostic importance, all of them were evaluated using the Kaplan–Meier plot and ROC curve with clinical and expression data from TCGA. As shown in Figure 4, based on their expression levels, AUC values of the 12 hub genes (BUB1B, CCNA2, CCNB1, CCNB2, CDC20, CDK1, KIF11, KIF20A, MAD2L1, PLK1, TOP2A, and TPX2), ranged from 0.6 to 0.7, while log-rank test showed p-value < 0.05 in all of the survival curves. Therefore, we considered that all the 12 hub genes appeared to be capable of survival prediction with the thresholds of p-value < 0.05 and AUC ≥ 0.6. Patients with HCC and up-regulation of these genes showed worse survival rate.

FIGURE 4
www.frontiersin.org

Figure 4. TCGA dataset analysis showed all of the 12 hub genes are related to worse survival rate. Survival analysis of the 12 hub genes, including (A) BUB1B, (B) CCNA2, (C) CCNB1, (D) CCNB2, (E) CDC20, (F) CDK1, (G) KIF11, (H) KIF20A, (I) MAD2L1, (J) PLK1, (K) TOP2A, and (L) TPX2, was performed using Kaplan–Meier survival curves and ROC curves based on clinical data from TCGA dataset. Log-rank test p-value < 0.05 and AUC ≥ 0.6 indicate a statistically significant difference.

Further, univariate and multivariate Cox regression analyses were performed to analyze their independent prognostic importance in patients with HCC. As indicated in Table 1, univariate Cox regression analysis showed that all the 12 hub genes were high-risk genes (hazard ratio >1, p-value < 0.05); however, multivariate Cox regression analysis suggested that only KIF11, TPX2, KIF20A, and CCNB2 were independent prognostic genes in case of patients with HCC (hazard ratio >1, p-value < 0.05).

TABLE 1
www.frontiersin.org

Table 1. Univariate and Multivariate COX analysis of hub genes.

Validation of Potential Prognostic Biomarkers

The Cancer Genome Atlas dataset of normal and HCC tissue samples were subjected to Wilcoxon test; KIF11, TPX2, KIF20A, and CCNB2 were found to have higher mRNA expression levels in HCC than in normal tissue samples (Figures 5A–D). Furthermore, the expression levels of KIF11, TPX2, KIF20A, and CCNB2 in multiple samples from different stages (I–IV) of tumor progression were compared using the Kruskal–Wallis test, and the results revealed that in comparison with normal tissue samples, the expression levels of KIF11, TPX2, KIF20A, and CCNB2 were higher at each stage of HCC (Figures 5E–H). These findings indicated the potential roles of these genes for diagnostic and prognostic prediction.

FIGURE 5
www.frontiersin.org

Figure 5. mRNA expression levels of four independent prognostic genes in the HCC tissue samples. TCGA dataset of paired normal and HCC tissue samples (A–D) and multiple samples from different stages (I–IV) of HCC progression (E–H) were used to investigate mRNA expression levels of four independent prognostic genes, namely KIF11, TPX2, KIF20A, and CCNB2, in patients with HCC. A p-value less than 0.05 is statistically significant.

Another dataset with RNA-Seq mRNA expression data and clinical pathological data were obtained from the ICGC portal as an independent validation cohort to verify the prognostic potential of KIF11, TPX2, KIF20A, and CCNB2 in HBV-related HCC. Overall survival rate analysis of these four genes was performed using the Kaplan–Meier plot and ROC curves. The results were consistent with those obtained from TCGA datasets, revealing that patients with up-regulated KIF11, TPX2, KIF20A, and CCNB2 genes showed worse survival rate (Figures 6A–D, p-value < 0.05 and AUC ≥ 0.6). Notably, as indicated in Figure 6, AUC values calculated using ICGC data were a little higher (from 0.7 to 0.8) compared with those using the TCGA data. In general, an AUC of 0.7 to 0.8 is considered to be acceptable (Mandrekar, 2010). In addition, meta-analysis in Oncomine showed that KIF11, TPX2, KIF20A, and CCNB2 were highly expressed in HCC comparing with normal tissues samples (Figure 6E–H, p-value < 0.05). Correlation analyses (Supplementary Figure S2) revealed the potential relationships among these four independent prognostic genes, implying that these four genes have combined effects in HCC occurrence and development.

FIGURE 6
www.frontiersin.org

Figure 6. Validation of four independent prognostic genes in the ICGC dataset by survival curves and Oncomine expression analysis. (A–D) Kaplan–Meier survival curves and ROC curves of KIF11, TPX2, KIF20A, and CCNB2 genes in the HCC patients from the ICGC dataset. Log-rank test p-value < 0.05 and AUC ≥ 0.6 indicate a statistically significant difference. (E–H) Oncomine analysis of mRNA expression levels of KIF11, TPX2, KIF20A, and CCNB2 in HCC tissue samples compared with those in normal tissue samples using four sets of published data (p-value < 0.05).

Potential Prognostic Biomarkers Showed Lower Relative Expression Levels in HCV-Related HCC Than in HBV-Related HCC

To deternmine whether KIF11, TPX2, KIF20A, and CCNB2 are specific to HBV-induced HCC comparing with HCV-induced HCC, HCV-related HCC (GSE69715) and HBV-related HCC (GSE94660) datasets were used to analyze the relative expression levels (i.e., fold change) of these 4 genes in HCC and normal tissue samples. As indicated in Figure 7 and Table 2, log2(FC) of KIF11, TPX2, KIF20A, and CCNB2 in the HBV-related HCC dataset ranges from 1.38 to 2.66, while log2(FC) of these genes in the HCV-related HCC dataset ranges from 0.09 to 0.35. Although all of the p-values in the Figure 7 are less than 0.05 (p-value < 0.05), which are statistically significant, we do not believe that they are biologically significant, because the expression level of these four genes in HCV-related HCC showed only a minor increase as compared with that in HBV-related HCC.

FIGURE 7
www.frontiersin.org

Figure 7. Relative mRNA expression levels of the independent prognostic genes in the HCV-related HCC and HBV-related HCC. Datasets of HCV-related HCC (GSE69715) and HBV-related HCC (GSE94660) were used to analyze relative mRNA expression levels of KIF11 (A), TPX2 (B), KIF20A (C), and CCNB2 (D) genes in the tumor tissues comparing with normal tissues. p-values < 0.05 indicate statistical significance. The expression level of these four genes in HCV-related HCC, however, only showed a minor increase as compared with that of those in HBV-related HCC.

TABLE 2
www.frontiersin.org

Table 2. Compare four independent prognostic genes expression in the HCV-related HCC and HBV-related HCC.

Discussion

Hepatocellular carcinoma is the most common malignant tumor of the liver. The etiological factors of HCC include hepatitis B or C, aflatoxin, alcohol, and metabolic disorders. In HBV endemic areas, chronic hepatitis B infection has been verified to be closely associated with HCC carcinogenesis (Lavanchy, 2004). Identifying potential biomarkers and elucidating molecular mechanisms of HCC progression are pivotal. Some researchers have used comprehensive bioinformatics analysis to identify hub genes from PPI networks constructed with DEGs, such as for colorectal cancer (Guo et al., 2017), breast cancer (Yang et al., 2019), and non-small-cell lung cancer (Ma et al., 2019). Notably, with bioinformatics analysis, different research groups may identify the same prognostic biomarkers using different datasets (Yang et al., 2019; Nakamura et al., 2020), which may strengthen the significance of data mining. Hepatocarcinogenesis is a complex multifactorial process; in recent decades, bioinformatics analyses of high-throughput data obtained upon using methods such as microarray and new generation sequencing have become common for exploring the mechanisms underlying HCC.

Gene Expression Omnibus is a public functional genomics database and includes a large repository of high-throughput, next-generation sequencing results and related information for over 200 organisms (Barrett et al., 2007). In the present study, to investigate pivotal genes associated with the HBV-related HCC, we used three high-throughput RNA-seq-based datasets, namely GSE25599, GSE94660, and GSE77509, downloaded from GEO, and information pertaining to 47 normal and 47 HBV-related HCC tissue samples was used. At the same time, to reduce the number of DEGs identified and improve accuracy, high-throughput RNA -seq results and clinical information of 50 normal and 374 HCC tissue samples were downloaded from TCGA and used for screening DEGs overlapping with GEO datasets and for survival analysis. In addition, the HCC expression data in the ICGC database were used to validate potential prognostic biomarkers in HCC.

A total of 230 DEGs were identified, of which 127 were down-regulated and 103 were up-regulated, and a PPI network was then constructed. Twelve hub genes – BUBIB, CCNA2, CCNB1, CDC20, CDK1, KIF11, KIF20A, PLK1, TOP2A, MAD2L1, TOP2A, and TPX2 – were identified according to a degree score of >45. GO and KEGG pathway enrichment analyses of the 230 DEGs and 12 hub genes suggested that HBV-related HCC occurrence and development are associated with cell cycle, nuclear division, mitosis, p53 pathway, oocyte meiosis, retinol metabolism, and organic acid catabolism. Cell cycle abnormality evidently has a key role during the process of liver cancer (Choi et al., 2001), and cyclin D1 degradation has been reported to inhibit HCC occurrence (Wu et al., 2018). Further, p53, the most common abnormality of dominant oncogenes in human tumors including HCC (Wu et al., 2018), plays a critical role in cell cycle arrest and apoptosis in response to DNA damage (Khemlina et al., 2017). Alterations in retinol metabolism play a pivotal role in the process of liver fibrosis, and enzymes involved in retinol metabolism are reportedly related to liver cancer (Pettinelli et al., 2018).

To analyze the prognosis and clinical significances of the 12 hub genes in HBV-related HCC, clinical data from TCGA were used for survival and ROC curve analyses. It was found that patients in whom the expression levels of the 12 hub genes were up-regulated showed worse survival rate, indicating their prognostic value for HCC. To further analyze the prognostic value of these genes, univariate and multivariate Cox regression analyses were performed using the 12 hub genes and found that KIF11, TPX2, KIF20A, and CCNB2 might be independent prognostic genes and potential targets for the diagnosis of HBV-related HCC. In addition, it was demonstrated that the expression levels of these four genes were higher in HCC than in normal tissue samples, and their expression levels were also higher at different stages of HCC than those in normal tissue samples. Data pertaining to patients with HCC from the ICGC database further validated that KIF11, TPX2, KIF20A, and CCNB2 were associated with worse survival rates in patients with higher gene expression levels. Oncomine analysis demonstrated that the expression levels of these genes were still higher in different patients with HCC.

Through continuous data filtering using different procedures and different sources of data, the number of candidate genes reduced, making our results more credible. Besides, correlation analyses of KIF11, TPX2, KIF20A, and CCNB2 indicated the potential relationships among them, and suggested that they together promote the occurrence and development of HCC.

The activation of KIF20A–Gli2 axis has been reported to be crucial for hepatoma cell growth, indicating that KIF20A plays a vital role in the development of liver cancer (Shi et al., 2016). Further, an increase in the mRNA expression level of KIF20A and its product MKLP2 has been related to HCC invasion (Gasnereau et al., 2012). KIF11 is related with the progression and prognosis of liver cancer, and its overexpression has been related to low survival rate of patients with liver cancer (Chen et al., 2017). A study reported that CCNB2 overexpression induces the expression of karyopherin subunit-α-2, promoting the cell cycle of HCC cells (Gao et al., 2018). Other studies have reported that the positive regulatory network of CCNB2 is involved in ubiquitination, DNA repair, and cell proliferation in non-tumor hepatitis or cirrhosis induced by HBV (Wang et al., 2012), suggesting that CCNB2 plays a role in HBV-related diseases. Moreover, it was observed that knocking down TPX2 in hepatocarcinoma cell lines effectively reduced cell growth via G2/M blockage and induced apoptosis (Hsu et al., 2017). TPX2 has also been reported to promote HCC development by activating PI3K/Akt signal (Huang et al., 2019).

Datasets of HCV-related HCC and HBV-related HCC were used to compare the expression levels of these four genes between HCV- and HBV-induced HCC. HCV-related HCC showed only a minor increase in the expression levels as compared with HBV-related HCC, indicating that KIF11, TPX2, KIF20A, and CCNB2 might be specific to HBV-induced HCC. But the underlying mechanisms how these four genes may induce the HBV-related HCC need to be further elucidated.

In conclusion, KIF11, TPX2, KIF20A, and CCNB2 seem to play a key role in HBV-related HCC. However, further studies are warranted to explore the mutual influence of these genes and HBV on HBV-related HCC carcinogenesis. Further studies should also identify whether these four genes are induced by factors other than HBV infection in patients with HCC and whether HBV infection itself causes aberrant expression of these genes and promotes HCC progression. Whether HBV-encoded proteins, such as HBV X protein, can interact with intracellular proteins via these four genes and lead to HCC remains to be elucidated.

Conclusion

Our findings suggest that KIF11, TPX2, KIF20A, and CCNB2 are involved in the carcinogenesis and development of HBV-related HCC. Thus, they can be used as independent prognostic genes for patients with HBV-related HCC and also as novel biomarkers for the diagnosis of HBV-related HCC and development of pertinent therapeutic strategies.

Data Availability Statement

The data analyzed in this manuscript can be downloaded from the NCBI Gene Expression Omnibus database (http://www.ncbi.nlm.nih.gov/geo) using the accession numbers GSE25599, GSE94660, GSE77509, and GSE69715, and TCGA (https://cancergenome.nih.gov/), and the ICGC (https://dcc.icgc.org/releases/current/Projects/LIRI-JP) database.

Author Contributions

W-NC and XL conceived and designed the investigation. X-CZ and LZ analyzed the data and drafted the manuscript. W-JL, LA, Z-ML, and WK conducted statistical analyses. All authors have read and approved the manuscript.

Funding

This study was supported by grants from the National Natural Science Foundation of China (No. 81672031), Natural Science Foundation of Fujian Province (No. 2019J01299), Foundation of Fujian Provincial Department of Finance (No. 2019B027), and Scientific Research Project for Young Teachers of Fujian Province (No. JT180163).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2020.555537/full#supplementary-material

FIGURE S1 | DEGs with degree scores of >10 in the PPI network. Fifty-four DEGs showed a degree score of >10 (arranged from largest to smallest).

FIGURE S2 | Correlation among KIF11, TPX2, KIF20A, and CCNB2 genes in HCC occurrence and development. Correlation analysis of KIF11, TPX2, KIF20A, and CCNB2 in HCC occurrence and development showed the potential relationships among these independent prognostic genes. Correlation coefficient (Cor) represents the correlation coefficient. p-value < 0.05 indicates a statistically significant difference.

TABLE S1 | The Top 20 up-regulated and 20 down-regulated DEGs between HCC and Normal tissues.

TABLE S2 | The Top 15 significantly enriched GO terms of the 103 up-regulated DEGs.

TABLE S3 | Enriched KEGG pathyways of the 103 up-regulated DEGs.

TABLE S4 | The Top 15 significantly enriched GO terms of the 127 down-regulated DEGs.

TABLE S5 | Enriched KEGG pathyways of the 127 down-regulatde DEGs.

TABLE S6 | Degree scores of the hub genes.

TABLE S7 | The Top 15 significantly enriched GO terms of the hub genes.

TABLE S8 | Enriched KEGG pathyways of the hub genes.

Footnotes

  1. ^ http://www.ncbi.nlm.nih.gov/geo
  2. ^ https://portal.gdc.cancer.gov/repository
  3. ^ https://portal.gdc.cancer.gov/
  4. ^ https://dcc.icgc.org/projects/LIRI-JP
  5. ^ https://dcc.icgc.org/releases/current/Projects
  6. ^ http://string-db.org/
  7. ^ https://CRAN.R-project.org/view=Survival
  8. ^ https://CRAN.R-project.org/package=survminer
  9. ^ https://CRAN.R-project.org/package=ggpubr
  10. ^ https://www.oncomine.org/resource/login.html
  11. ^ https://github.com/taiyun/corrplot
  12. ^ https://github.com/tidyverse/ggplot2
  13. ^ https://github.com/wilkelab/cowplot/issues
  14. ^ https://rpkgs.datanovia.com/ggpubr/

References

Bader, G. D., and Hogue, C. W. (2003). An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics 4:2. doi: 10.1186/1471-2105-4-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Barrett, T., Troup, D. B., Wilhite, S. E., Ledoux, P., Rudnev, D., Evangelista, C., et al. (2007). NCBI GEO: mining tens of millions of expression profiles–database and tools update. Nucleic Acids Res. 35, D760–D765.

Google Scholar

Beasley, R. P., Hwang, L. Y., Lin, C. C., and Chien, C. S. (1981). Hepatocellular carcinoma and hepatitis B virus. A prospective study of 22 707 men in Taiwan. Lancet 2, 1129–1133.

Google Scholar

Bray, F., Ferlay, J., Soerjomataram, I., Siegel, R. L., Torre, L. A., and Jemal, A. (2018). Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 68, 394–424. doi: 10.3322/caac.21492

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, H., and Boutros, P. C. (2011). VennDiagram: a package for the generation of highly-customizable Venn and Euler diagrams in R. BMC Bioinformatics 12:35. doi: 10.1186/1471-2105-12-35

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, J., Li, S., Zhou, S., Cao, S., Lou, Y., Shen, H., et al. (2017). Kinesin superfamily protein expression and its association with progression and prognosis in hepatocellular carcinoma. J. Cancer Res. Ther. 13, 651–659. doi: 10.4103/jcrt.jcrt_491_17

CrossRef Full Text | Google Scholar

Chen, X., Cheung, S. T., So, S., Fan, S. T., Barry, C., Higgins, J., et al. (2002). Gene expression patterns in human liver cancers. Mol. Biol. Cell 13, 1929–1939. doi: 10.1091/mbc.02-02-0023.

PubMed Abstract | CrossRef Full Text | Google Scholar

Chin, C. H., Chen, S. H., Wu, H. H., Ho, C. W., Ko, M. T., and Lin, C. Y. (2014). cytoHubba: identifying hub objects and sub-networks from complex interactome. BMC Syst. Biol. 8, (Suppl. 4):S11. doi: 10.1186/1752-0509-8-S4-S11

PubMed Abstract | CrossRef Full Text | Google Scholar

Choi, Y. L., Park, S. H., Jang, J. J., and Park, C. K. (2001). Expression of the G1-S modulators in hepatitis B virus-related hepatocellular carcinoma and dysplastic nodule: association of cyclin D1 and p53 proteins with the progression of hepatocellular carcinoma. J. Korean Med. Sci. 16, 424–432. doi: 10.3346/jkms.2001.16.4.424

PubMed Abstract | CrossRef Full Text | Google Scholar

Colaprico, A., Silva, T. C., Olsen, C., Garofano, L., Cava, C., Garolini, D., et al. (2016). TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 44:e71. doi: 10.1093/nar/gkv1507

PubMed Abstract | CrossRef Full Text | Google Scholar

de Hoon, M. J., Imoto, S., Nolan, J., and Miyano, S. (2004). Open source clustering software. Bioinformatics 20, 1453–1454.

Google Scholar

El-Serag, H. B., and Rudolph, K. L. (2007). Hepatocellular carcinoma: epidemiology and molecular carcinogenesis. Gastroenterology 132, 2557–2576. doi: 10.1053/j.gastro.2007.04.061

PubMed Abstract | CrossRef Full Text | Google Scholar

Galili, T., O’Callaghan, A., Sidi, J., and Sievert, C. (2018). heatmaply: an R package for creating interactive cluster heatmaps for online publishing. Bioinformatics 34, 1600–1602. doi: 10.1093/bioinformatics/btx657

PubMed Abstract | CrossRef Full Text | Google Scholar

Gao, C. L., Wang, G. W., Yang, G. Q., Yang, H., and Zhuang, L. (2018). Karyopherin subunit-alpha 2 expression accelerates cell cycle progression by upregulating CCNB2 and CDK1 in hepatocellular carcinoma. Oncol. Lett. 15, 2815–2820.

Google Scholar

Gasnereau, I., Boissan, M., Margall-Ducos, G., Couchy, G., Wendum, D., Bourgain-Guglielmetti, F., et al. (2012). KIF20A mRNA and its product MKlp2 are increased during hepatocyte proliferation and hepatocarcinogenesis. Am. J. Pathol. 180, 131–140. doi: 10.1016/j.ajpath.2011.09.040

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, Y., Bao, Y., Ma, M., and Yang, W. (2017). Identification of key candidate genes and pathways in colorectal cancer by integrated bioinformatical analysis. Int. J. Mol. Sci. 18:722. doi: 10.3390/ijms18040722

PubMed Abstract | CrossRef Full Text | Google Scholar

Hsu, C. W., Chen, Y. C., Su, H. H., Huang, G. J., Shu, C. W., Wu, T. T., et al. (2017). Targeting TPX2 suppresses the tumorigenesis of hepatocellular carcinoma cells resulting in arrested mitotic phase progression and increased genomic instability. J. Cancer 8, 1378–1394. doi: 10.7150/jca.17478

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, D. H., Jian, J., Li, S., Zhang, Y., and Liu, L. Z. (2019). TPX2 silencing exerts antitumor effects on hepatocellular carcinoma by regulating the PI3K/AKT signaling pathway. Int. J. Mol. Med. 44, 2113–2122.

Google Scholar

Huang, Q., Lin, B., Liu, H., Ma, X., Mo, F., Yu, W., et al. (2011). RNA-Seq analyses generate comprehensive transcriptomic landscape and reveal complex transcript patterns in hepatocellular carcinoma. PLoS One 6:e26168. doi: 10.1371/journal.pone.0026168

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, R., Liao, X., and Li, Q. (2017). Identification and validation of potential prognostic gene biomarkers for predicting survival in patients with acute myeloid leukemia. Onco Targets Ther. 10, 5243–5254. doi: 10.2147/ott.s147717

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, J. H., Liu, Y. F., Ke, A. W., Gu, F. M., Yu, Y., Dai, Z., et al. (2014). Clinical significance of the ubiquitin ligase UBE3C in hepatocellular carcinoma revealed by exome sequencing. Hepatology 59, 2216–2227. doi: 10.1002/hep.27012

PubMed Abstract | CrossRef Full Text | Google Scholar

Kan, Z., Zheng, H., Liu, X., Li, S., Barber, T. D., Gong, Z., et al. (2013). Whole-genome sequencing identifies recurrent mutations in hepatocellular carcinoma. Genome Res. 23, 1422–1433.

Google Scholar

Khemlina, G., Ikeda, S., and Kurzrock, R. (2017). The biology of Hepatocellular carcinoma: implications for genomic and immune therapies. Mol. Cancer 16:149.

Google Scholar

Lavanchy, D. (2004). Hepatitis B virus epidemiology, disease burden, treatment, and current and emerging prevention and control measures. J. Viral. Hepat. 11, 97–107. doi: 10.1046/j.1365-2893.2003.00487.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Levrero, M., and Zucman-Rossi, J. (2016). Mechanisms of HBV-induced hepatocellular carcinoma. J. Hepatol. 64, S84–S101.

Google Scholar

Ma, Q., Xu, Y., Liao, H., Cai, Y., Xu, L., Xiao, D., et al. (2019). Identification and validation of key genes associated with non-small-cell lung cancer. J. Cell Physiol. 234, 22742–22752. doi: 10.1002/jcp.28839

PubMed Abstract | CrossRef Full Text | Google Scholar

Mandrekar, J. N. (2010). Receiver operating characteristic curve in diagnostic test assessment. J. Thorac. Oncol. 5, 1315–1316. doi: 10.1097/jto.0b013e3181ec173d

PubMed Abstract | CrossRef Full Text | Google Scholar

Nakamura, M., Takano, A., Thang, P. M., Tsevegjav, B., Zhu, M., Yokose, T., et al. (2020). Characterization of KIF20A as a prognostic biomarker and therapeutic target for different subtypes of breast cancer. Int. J. Oncol. 57, 277–288. doi: 10.3892/ijo.2020.5060

PubMed Abstract | CrossRef Full Text | Google Scholar

Orimo, T., Ojima, H., Hiraoka, N., Saito, S., Kosuge, T., Kakisaka, T., et al. (2008). Proteomic profiling reveals the prognostic value of adenomatous polyposis coli-end-binding protein 1 in hepatocellular carcinoma. Hepatology 48, 1851–1863. doi: 10.1002/hep.22552

PubMed Abstract | CrossRef Full Text | Google Scholar

Pettinelli, P., Arendt, B. M., Teterina, A., McGilvray, I., Comelli, E. M., Fung, S. K., et al. (2018). Altered hepatic genes related to retinol metabolism and plasma retinol in patients with non-alcoholic fatty liver disease. PLoS One 13:e0205747. doi: 10.1371/journal.pone.0205747

PubMed Abstract | CrossRef Full Text | Google Scholar

Rhodes, D. R., Yu, J., Shanker, K., Deshpande, N., Varambally, R., Ghosh, D., et al. (2004). ONCOMINE: a cancer microarray database and integrated data-mining platform. Neoplasia 6, 1–6. doi: 10.1016/s1476-5586(04)80047-2

CrossRef Full Text | Google Scholar

Ritchie, M. E., Phipson, B., Wu, D., Hu, Y., Law, C. W., Shi, W., et al. (2015). limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43:e47. doi: 10.1093/nar/gkv007

PubMed Abstract | CrossRef Full Text | Google Scholar

Robinson, M. D., McCarthy, D. J., and Smyth, G. K. (2010). edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140. doi: 10.1093/bioinformatics/btp616

PubMed Abstract | CrossRef Full Text | Google Scholar

Roessler, S., Jia, H. L., Budhu, A., Forgues, M., Ye, Q. H., Lee, J. S., et al. (2010). A unique metastasis gene signature enables prediction of tumor relapse in early-stage hepatocellular carcinoma patients. Cancer Res. 70, 10202–10212. doi: 10.1158/0008-5472.can-10-2607

PubMed Abstract | CrossRef Full Text | Google Scholar

Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., et al. (2003). Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504. doi: 10.1101/gr.1239303

PubMed Abstract | CrossRef Full Text | Google Scholar

Shi, C., Huang, D. L., Lu, N. H., Chen, D., Zhang, M. H., Yan, Y. H., et al. (2016). Aberrantly activated Gli2-KIF20A axis is crucial for growth of hepatocellular carcinoma and predicts poor prognosis. Oncotarget 7, 26206–26219. doi: 10.18632/oncotarget.8441

PubMed Abstract | CrossRef Full Text | Google Scholar

Szklarczyk, D., Gable, A. L., Lyon, D., Junge, A., Wyder, S., Huerta-Cepas, J., et al. (2019). STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 47, D607–D613.

Google Scholar

Tang, D., Zhao, X., Zhang, L., Wang, Z., and Wang, C. (2019). Identification of hub genes to regulate breast cancer metastasis to brain by bioinformatics analyses. J. Cell Biochem. 120, 9522–9531. doi: 10.1002/jcb.28228

PubMed Abstract | CrossRef Full Text | Google Scholar

Uemura, N., Nakanishi, Y., Kato, H., Saito, S., Nagino, M., Hirohashi, S., et al. (2009). Transglutaminase 3 as a prognostic biomarker in esophageal cancer revealed by proteomics. Int. J. Cancer 124, 2106–2115. doi: 10.1002/ijc.24194

PubMed Abstract | CrossRef Full Text | Google Scholar

Venook, A. P., Papandreou, C., Furuse, J., and de Guevara, L. L. (2010). The incidence and epidemiology of hepatocellular carcinoma: a global and regional perspective. Oncologist 15(Suppl. 4), 5–13. doi: 10.1634/theoncologist.2010-s4-05

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, L., Huang, J., Jiang, M., and Lin, H. (2012). Tissue-specific transplantation antigen P35B (TSTA3) immune response-mediated metabolism coupling cell cycle to postreplication repair network in no-tumor hepatitis/cirrhotic tissues (HBV or HCV infection) by biocomputation. Immunol. Res. 52, 258–268. doi: 10.1007/s12026-012-8337-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, M., Xi, D., and Ning, Q. (2017). Virus-induced hepatocellular carcinoma with special emphasis on HBV. Hepatol. Int. 11, 171–180. doi: 10.1007/s12072-016-9779-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X. W., Hussain, S. P., Huo, T. I., Wu, C. G., Forgues, M., Hofseth, L. J., et al. (2002). Molecular pathogenesis of human hepatocellular carcinoma. Toxicology 18, 43–47.

Google Scholar

Wen, L. Z., Ding, K., Wang, Z. R., Ding, C. H., Lei, S. J., Liu, J. P., et al. (2018). SHP-1 Acts as a tumor suppressor in hepatocarcinogenesis and HCC progression. Cancer Res. 78, 4680–4691. doi: 10.1158/0008-5472.can-17-3896

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, S. Y., Lan, S. H., Wu, S. R., Chiu, Y. C., Lin, X. Z., Su, I. J., et al. (2018). Hepatocellular carcinoma-related cyclin D1 is selectively regulated by autophagy degradation system. Hepatology 68, 141–154. doi: 10.1002/hep.29781

PubMed Abstract | CrossRef Full Text | Google Scholar

Wurmbach, E., Chen, Y. B., Khitrov, G., Zhang, W., Roayaie, S., Schwartz, M., et al. (2007). Genome-wide molecular profiles of HCV-induced dysplasia and hepatocellular carcinoma. Hepatology 45, 938–947. doi: 10.1002/hep.21622

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, K., Gao, J., and Luo, M. (2019). Identification of key pathways and hub genes in basal-like breast cancer using bioinformatics analysis. Onco Targets Ther. 12, 1319–1331. doi: 10.2147/ott.s158619

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, Y., Chen, L., Gu, J., Zhang, H., Yuan, J., Lian, Q., et al. (2017). Recurrently deregulated lncRNAs in hepatocellular carcinoma. Nat. Commun. 8:14421.

Google Scholar

Yoo, S., Wang, W., Wang, Q., Fiel, M. I., Lee, E., Hiotis, S. P., et al. (2017). A pilot systematic genomic comparison of recurrence risks of hepatitis B virus-associated hepatocellular carcinoma with low- and high-degree liver fibrosis. BMC Med. 15:214. doi: 10.1186/s12916-017-0973-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, G. C., Wang, L. G., Han, Y. Y., and He, Q. Y. (2012). clusterProfiler: an R Package for comparing biological themes among gene clusters. Omics 16, 284–287. doi: 10.1089/omi.2011.0118

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, X. Z., Kang, C. Y., Li, N. N., Liu, X. L., Zhang, J. Z., Gao, F. L., et al. (2019). Identification of special key genes for alcohol-related hepatocellular carcinoma through bioinformatic analysis. Peerj 7:e6375. doi: 10.7717/peerj.6375

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: hepatitis B virus, hepatocellular carcinoma, hub genes, biomarkers, bioinformatics analysis

Citation: Zeng X-C, Zhang L, Liao W-J, Ao L, Lin Z-M, Kang W, Chen W-N and Lin X (2020) Screening and Identification of Potential Biomarkers in Hepatitis B Virus-Related Hepatocellular Carcinoma by Bioinformatics Analysis. Front. Genet. 11:555537. doi: 10.3389/fgene.2020.555537

Received: 25 April 2020; Accepted: 10 September 2020;
Published: 30 September 2020.

Edited by:

Yanni Sun, City University of Hong Kong, Hong Kong

Reviewed by:

Gonzalo Riadi, University of Talca, Chile
Ravikanth Nanduri, National Cancer Institute (NCI), United States

Copyright © 2020 Zeng, Zhang, Liao, Ao, Lin, Kang, Chen and Lin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Wan-Nan Chen, wannanchen79@mail.fjmu.edu.cn; Xu Lin, linxu@mail.fjmu.edu.cn

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.