ORIGINAL RESEARCH article
Network Analysis-Based Approach for Exploring the Potential Diagnostic Biomarkers of Acute Myocardial Infarction
- 1Department of Pharmacology, College of Basic Medical Sciences, Jilin University, Changchun, China
- 2Department of Pharmacy, The Second Hospital of Jilin University, Changchun, China
Acute myocardial infarction (AMI) is a severe cardiovascular disease that is a serious threat to human life. However, the specific diagnostic biomarkers have not been fully clarified and candidate regulatory targets for AMI have not been identified. In order to explore the potential diagnostic biomarkers and possible regulatory targets of AMI, we used a network analysis-based approach to analyze microarray expression profiling of peripheral blood in patients with AMI. The significant differentially-expressed genes (DEGs) were screened by Limma and constructed a gene function regulatory network (GO-Tree) to obtain the inherent affiliation of significant function terms. The pathway action network was constructed, and the signal transfer relationship between pathway terms was mined in order to investigate the impact of core pathway terms in AMI. Subsequently, constructed the transcription regulatory network of DEGs. Weighted gene co-expression network analysis (WGCNA) was employed to identify significantly altered gene modules and hub genes in two groups. Subsequently, the transcription regulation network of DEGs was constructed. We found that specific gene modules may provide a better insight into the potential diagnostic biomarkers of AMI. Our findings revealed and verified that NCF4, AQP9, NFIL3, DYSF, GZMA, TBX21, PRF1 and PTGDR genes by RT-qPCR. TBX21 and PRF1 may be potential candidates for diagnostic biomarker and possible regulatory targets in AMI.
Acute myocardial infarction (AMI) is a severe cardiovascular disease that is a serious threat to human life (Roger et al., 2012). AMI can cause congestive heart failure and malignant arrhythmia leading to a high morbidity and mortality (Jameel and Zhang, 2009; Mozaffarian et al., 2016). While thrombolysis and percutaneous coronary intervention (PCI) can significantly improve the prognosis of patients with AMI, there are still many patients with AMI that eventually develops into heart failure or arrhythmia due to the unclear etiology of AMI (Eapen et al., 2012) Looking for the potential diagnostic biomarkers and possible regulatory targets of AMI may help to reduce the mortality of AMI. At present, the systems biology analysis of gene expression profiling provides a better method to elucidate the possible mechanisms of myocardial infarction from the perspective of gene regulation. Using gene expression profiles, we can obtain adequate information about altered gene expression correlating with disease. Kiliszek et al. (2012) utilized a microarray approach to demonstrate that, during ST-segment elevation myocardial infarction (STEMI), many genes have altered expression, including those involved in various pathways related to platelet function, lipid/glucose metabolism and atherosclerotic plaque stability. Based on the system level of gene expression profiling, gene co-expression network analyses could be an alternative method for analyzing expression profiling data (Stuart et al., 2003), in order to gain an insight into molecular regulatory mechanisms of heart diseases caused by myocardial infarction. Gene ontology (GO) and pathway enrichment analyses were used in our study to explore the molecular mechanisms of heart disease induced by myocardial infarction.
Weighted gene co-expression network analysis (WGCNA) was also used in this study, and is generally utilized for illuminating the changes of transcriptome expression patterns in complex diseases (Zhang and Horvath, 2005; Chen et al., 2008; Voineagu et al., 2011). Compared with the standardized analysis of DEGs, the purpose of which is to detect disease-related individual genes, WGCNA aims to recognize higher-order correlation between gene products (Oldham et al., 2008). Moreover, the algorithm of WGCNA can substantially simplify the multiple testing problems that are unavoidable in standard gene-centric methods of microarray expression profiling data analysis; therefore, it is a powerful systematic analysis method that focuses on the coherence function of network modules (Zhao et al., 2010).
Using the bioinformatics methods above, we analyzed GO-tree, pathway action network and gene module alteration in patients with AMI to explore the potential diagnostic biomarkers and possible regulatory targets of AMI in our study. The gene co-expression network was constructed based on the gene expression profiling, and gene modules of peripheral blood were also detected by the WGCNA when AMI occurred. Furthermore, hub genes were recognized, which could be used as biomarkers for assessing the severity of heart diseases and served as promising therapeutic targets for promoting novel therapeutic schedules in AMI. In summary, our study aimed to determine the key genes that are involved in the regulation of AMI development by bioinformatics analysis, which could be served as potential targets for the diagnosis and treatment of AMI.
Materials and Methods
Microarray Data and Analysis of Differentially-Expressed Genes
The microarray expression data of peripheral blood samples was obtained from GEO (http://www.ncbi.nlm.nih.gov/geo) and GEO Series accession number GSE48060. This microarray profiling was acquired from human peripheral blood samples of 31 AMI patients and 21 controls. 52 samples were profiled using the chip-based platform GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array (Suresh et al., 2014).
The use of gene expression data from whole blood instead of heart tissue samples may limit the ability of our approach to detect signatures in the specific diseased tissue. However, peripheral blood is more easily accessed than heart tissue and may provide specific insights into the immune and inflammatory response of AMI exacerbations.
Microarray data analysis was performed using R software and Bioconductor 3.3 (http://www.bioconductor.org/). By quantile normalization, base 2 logarithm conversions, background correction and normalization were performed on original expression data by Robust Multiarray Averaging (RMA) (Gautier et al., 2004); as a result 54,675 mRNAs were carried over for further analysis (Supplement Table 1).
We screened the significant DEGs from 31 AMI patients and 21 controls by Limma (linear models for microarray data) package of R 3.1.3 software (Sanges et al., 2007). The Fold change > 1.2 and P < 0.05 was regarded as a threshold.
Cluster and TreeView are software programs for analyzing and visualizing DNA microarray data or other genomic data sets. In this study, Cluster analysis was used to differentially-expressed genes data and then visualize the data through TreeView.
Gene Ontology (GO) and Pathway Enrichment Analyses
GO analysis was utilized to explain the primary function of the DEGs according to the GO database, which is the crucial functional classification of NCBI (Ashburner et al., 2000; Gene Ontology, 2006). Fisher's exact test was used to calculate the significance level (P-value) of each GO term to screen out the significance GO term of the DEGs enrichment. While P < 0.05 was considered to be significant.
Analogously, pathway analysis was performed to discover the significant pathway terms of the DEGs according to the Kyoto Encyclopedia of Genes and Genomes (KEGG) (Kanehisa and Goto, 2000). We used the Fisher's exact test to identify the significant pathway terms; a P < 0.05 was considered significant (Kanehisa et al., 2004; Draghici et al., 2007).
Construction of Gene Function Regulatory Network (GO-Tree)
The GO hierarchy was a directed acyclic graph (DAG), in which each term had a defined link with one or more other terms. A GO-Tree was constructed based on the GO DAG to supply clear data navigation and visualization. P < 0.01 were selected for statistical significance of GO terms in GO analysis, and the GO-Tree was constructed using the up- and down-regulated DEGs in order to outline the functions that influenced AMI (Zhang et al., 2004).
Experimental genes also participated in many significant GO analyses. We organized the mutual regulation and affiliations between all GO analyses into the database based on a hierarchy of GO. By building a functional relationship network, it was possible to summarize the impact of the experimental function groups, as well as internal affiliation significant features. We considered the biological process (BP) terms of GO analysis to build a network of functional regulation (P < 0.01).
Construction of Pathway Action Network (Pathway-Act-Network)
The KEGG database included signal transduction, metabolism, cell cycle, membrane transport pathways and information of their interactions. The genes we selected may be related to two or more signaling pathways. Due to the same genes in different pathways, overlapping between pathways was inevitable. We selected the genes in enriched biological pathways and used Cytoscape for graphical representations of pathways (Shannon et al., 2003). By constructing the pathway action network, the signal transfer relations between the pathways was explored at a macro level. In order to look for the core pathways and regulatory mechanisms affected by the disease from multiple significant pathway terms, P < 0.05 of pathway terms in pathway analysis were selected to construct a pathway action network.
Co-expression Network Analysis
Gene co-expression network analyses (Kim et al., 2001) were carried out to reveal the interrelation between the DEGs, based on their normalized signal intensity in “AMI” and “Control” profiles. The Pearson correlation was calculated for each pair of genes, and the significantly correlated pairs were used to construct the co-expression network (Prieto et al., 2008). To trace the core regulatory genes in the networks, k-core scores were applied to simplify the analysis of graph topology. The k-core of a given gene indicated its core or nodal status linking other genes in the network to “k” (Ravasz et al., 2002; Barabási and Oltvai, 2004). Consequently, we applied k-core scores to identify genes with the highest networking degrees as key regulatory genes.
Weighted Gene Co-expression Network Analysis
WGCNA is a common algorithm for constructing a co-expression network. We utilized the similarity of gene co-expression to define a network. When using Smn to represent the gene co-expression similarity between genes m and n, we can then apply the power adjacency function for correlating adjacency of genes: Smn = . In data processing, the genome-wide gene expression data was preliminarily filtered, followed by measuring the consistency of gene expression profiles by Pearson correlation, and finally using the power adjacent function to Pearson correlation matrix, data was transformed into weighted gene co-expression networks.
Intramodular connectivity (IC) describes the correlation degree of a gene with other genes in a given module, which can be understood as a measure of module membership (MM). Network module (Module) is a cluster of closely interconnected genes. During module detection, the adjacency matrix (which is a measure of topology similarity) is transformed into the topological overlap matrix (TOM), and modules are detected by cluster analysis (Zhao et al., 2014). We then analyzed the importance of genes by the t-test to determine whether the modules were associated with myocardial inflammation. Module eigengene (ME) refers to the first principal component gene of module expression matrix. It is considered the most representative of the module genes, which has important biological significance.
Functional Enrichment Analysis of the Network Module Genes
The online tool PANTHER was used to analyze the functional enrichment of network module genes that are associated with myocardial inflammation (P < 0.05) (Mi et al., 2013).
Identification of Hub Genes
Hub genes are described as the genes most closely associated with disease; they are connected to the highest degree of a series of genes in a module. To a certain extent, they are used to determine the character of modules, the hub genes of modules often have more biological significance compared with the hub genes of global networks (Jeong et al., 2001; Goh et al., 2007; Liang and Li, 2007). A gene is considered to be a hub gene if it has a unique character, such as high GS, high MM and high IC in the network (Zhang and Horvath, 2005). Gene significance (GS) showed the different IC of a gene in various networks, and MM described the significance of genes in the network. The IC of a gene suggested the connectivity with the other genes within network. We were therefore also able to identify the hub genes in modules through the GS, MM, and IC.
Transcription Regulatory Network
JASPAR (http://jaspar.genereg.net) is an open-access database storing curated, non-redundant transcription factor (TF) binding profiles representing transcription factor binding preferences as position frequency matrices for multiple species in six taxonomic groups (Mathelier et al., 2016). Cytoscape is a visualization data integration package for biological networks based on the Java language (Smoot et al., 2011). We imported DEGs into the JASPAR database to obtain the interaction between transcription factors and their target genes (TG). Then, we used Cytoscape software to visualize these relationships and finally obtained the transcription factor-target gene regulatory network.
The research object of this study with a total of 20, all are hospitalized patients with AMI in the department of cardiology of first hospital of Jilin university, and each was invited to join the study patients are required to sign a consent form. Our research protocol was approved by the ethical committee of the First Hospital of Jilin University. Among the 20 patients, 12 males, and 8 females, aged 57 ± 9 years. Inclusion criteria: chest pain or distress within 24 h duration > 30 min, and myocardial enzymes CK-MB and cTNT higher than the normal range online. While excluding myocarditis and other diseases caused by chest pain or distress; exclude the patients with history of renal failure; patients with advanced liver disease; patients with malignant tumors and patients with other inflammatory diseases, such as psoriasis, rheumatoid arthritis and so on.
Quantitative Real-Time RT-qPCR
According to the microarray results, the 8 most dysregulated mRNAs were chosen for further validation by RT-qPCR in AMI patients vs. healthy controls. Blood samples were collected and PBMCs were isolated. Total RNAs were isolated from PBMCs using Trizol reagent (Invitrogen), and complementary DNA was synthesized using the TransScript®Frist-Strand cDNA Synthesis SuperMix(Transgen, China)according to the manufacturer's instructions. Primer sets for selected genes were designed by Sangon Biotech (Shanghai, China); their sequences and reaction conditions are available in Table 1. Each sample was run in triplicate in 96-well plates using LightCycler®96 and FastStart Essential DNA Green Master (Roche Diagnostics GmbH, Germany). Quantification cycles (Cq) were calculated using the fit point method (LightCycler®96 Software, Version 1.1 provided by Roche). The expression data were normalized to the reference glyceraldehyde-3-phosphate dehydrogenase (Gapdh). All experiments (sample collection, preparation and storage, primer design, qPCR normalization) were performed according to the MIQE guidelines.
DEGs Selection and Hierarchical Clustering Analysis
We used R software and corrected batch effect by Bayesian methods, then removed the probes without corresponding annotation information. To determine the expression values of each gene, we used multiple GSE48060 probes corresponding to the median expression value of that gene, and finally obtained the expression profile of 54,675 genes (Supplement Table 1).
By using algorithms provided in the Limma package, we calculated the data and obtained with lists of differentially expressed genes. A total of 551 DEGs were identified between the AMI group and the control group, including 164 upregulated and 387 down-regulated genes (P < 0.05, Fold change > 1.2, Figure 1A, Supplement Table 2). Hierarchical clustering analysis was obtained for the 551 DEGs from the 52 samples of the AMI and control groups. The general gene expression patterns were evidently different in the two groups by TreeView (Figure 1B).
Figure 1. DEGs selection and hierarchical clustering analysis. (A) The Limma algorithm was applied to filter the DEGs; we selected the DEGs according to P < 0.05 and Fold change > 1.2. (B) DEGs can be effectively divided into AMI and control groups. Red indicates that the gene that is upregulated and green represents down-regulated genes.
GO Enrichment Analysis of DEGs and GO-Tree
To reveal AMI–related biological processes, we conducted a functional enrichment analysis. GO enrichment analysis of 551 DEGs was performed. The most GO terms of biological processes were associated with inflammation, including the inflammatory response, immune response, cellular defense response and chronic inflammatory response in the AMI group (Figure 2A). In the cellular component category, enriched GO terms were mainly associated with lipid particles, the external side of the plasma membrane and the extracellular space. In the molecular function category, GO terms enriched for DEGs in AMI included chemokine activity, GTP binding and carbohydrate binding (Table 2). We constructed gene function regulatory networks (GO-Tree) for the significant GO terms (P < 0.01) of the category of biological processes in GO analysis in order to explore the intrinsic link among gene function (Figure 2B). We then found the hierarchical tree relationships between gene function significantly (Supplement Table 3). The analysis showed that the gene function cascade eventually induced inflammation during AMI.
Figure 2. GO enrichment analysis and GO-Tree. (A) The significant GO terms that conformed to a P < 0.05 were screened. (B) GO-Tree: red circles represent the upregulated genes involved in GO terms, and green circles represent the down-regulated genes involved in GO terms. (C) We used the Fisher's exact test to select the significant pathway, identified by a P < 0.05. (D) Pathway-act-network. P < 0.05. Red circles represent the upregulated genes involved in pathway terms, and green circles represent the down-regulated genes involved in pathway terms.
Pathway Enrichment Analysis of DEGs and Pathway-Act-Network Construction
To screen the significant enrichment of DEGs in pathway terms, Fisher's exact test was used to calculate the significance level of the pathway (P < 0.05). According to KEGG databases, we performed pathway annotation of DEGs and obtained DEGs involved in all pathway terms (Figure 2C, Table 3). The DEGs of AMI were enriched mainly in inflammatory bowel disease (IBD), the chemokine signaling pathway, natural killer cell mediated cytotoxicity, toll-like receptor signaling pathway and cytokine-cytokine receptor interactions. As there was cross-talk between the pathway terms, in order to search the transitive relation between signaling pathways, we constructed pathway action networks (pathway-act-network) for the significant pathway terms (P < 0.05) (Figure 2D, Supplement Table 4). There were interactions in multiple pathways, demonstrated by the pathway-act-network, which ultimately impacted activation of multiple pathway terms including the JAK-STAT signaling pathway, toll-like receptor signaling pathways and antigen processing and presentation.
Gene Co-expression Network
We performed a gene co-expression network analysis to investigate the phenotypic change of genes associated with AMI (Kumari et al., 2012; Villa-Vialaneix et al., 2013). Twelve genes from the AMI and control groups were chosen as key regulatory genes (|Dif_Kcore| > 30), and it was shown that there was a significant change in the expression pattern of genes in the AMI group (Supplement Table 5), namely CNN2 (calponin 2), CRYZ (quinone oxidoreductase), SULT1A1 (sulfotransferase 1A1), SULT1A2 (sulfotransferase 1A2), PRMT2 (protein arginine methyltransferase 2), ATP1B1 (ATPase, Na+/K+ transporting, beta 1 polypeptide), CX3CR1 (CX3C chemokine receptor 1), GCH1 (GTP cyclohydrolase 1), INSIG1 (insulin-induced gene protein), CXCL5 (C-X-C motif chemokine 5), GBP3 (guanylate-binding protein 3) and HEG1 (heart development protein with EGF-like domains 1).We hypothesized that the 12 key regulatory genes are likely to be closely related to the occurrence and development of AMI (Figure 3).
Figure 3. Gene co-expression network analyses. The Pearson correlation was calculated for each pair of genes, and the significantly correlated pairs were used to construct the co-expression network (P < 0.05). (A) AMI. (B) Control. Red nodes represent the key regulatory genes with the highest K-Core. The node color represents the K-Core. The node size represents the K-Core power, and the edges between two nodes represent the interactions between the genes.
However, traditional studies of gene co-expression patterns mainly utilize the Pearson coefficient to describe the correlation between genes, and tend to determine the presence of a co-expression network using the Pearson coefficient and FDR threshold (also known as hard threshold) (Butte and Kohane, 2000; Carter et al., 2004; Prieto et al., 2008). The two genes are considered to be connected when the correlation coefficient of two genes is equal to or greater than this threshold (for example, 0.8). However, there is a significant limitation to this method, i.e., we have no evidence to identify whether the coefficient 0.8 or 0.79 of two genes has a significant difference. The above algorithm cannot avoid this situation. WGCNA is different from the traditional gene-gene correlation coefficient matrix, and introduces a soft-threshold method that avoids this limitation. In WGCNA, the correlation coefficient in genes of a gene co-expression matrix is obtained by weight calculation. The standard of weighting is the connections between genes, which should conform to the distribution of scale-free networks, obeying a power-law distribution. In short, the correlation coefficient of each gene pair was the β-th power of the exponent operation, which was the weight calculation and β was known as the soft threshold.
Weighted Gene Co-expression Networks
We selected a suitable weighted parameter of adjacency function, which is the soft-threshold β, before constructing the weighted co-expression network. After the calculation, we selected the correlation coefficient close to 0.8, soft-threshold β = 9 to construct gene modules using the WGCNA package.
After determining the soft threshold, a total of 551 DEGs were used to construct weighted gene co-expression networks. According to the basic idea of WGCNA, we calculated the correlation matrix and adjacency matrix of the gene expression profile of the AMI and control groups, and then transformed them into a topological overlap matrix (TOM), and obtained a system clustering tree of DEGs on the basis of gene-gene non-similarity ( = 1− ωij). Together with the TOM, we performed the hierarchical average linkage clustering method to identify the gene modules of each gene network (deep split = 2, cut height = 0.99). In both the AMI and control groups, a total of five gene modules were recognized by the dynamic tree cut (Figure 4A). Genes that not belong to any modules were housed in the gray module. The gray gene modules were ignored in this study.
Figure 4. (A) Network analysis of gene expression in AMI identifies distinct modules of co-expression genes. (B) Correlation between the gene modules and immune and inflammatory responses. Scatter plot of MM vs. GS in (C) blue and (D) brown modules. Cor represents an absolute correlation coefficient of GS and MM; P-value for significance assessment. It follows that in both modules, GS and MM have a high correlation. In the high correlation of image: upper right node (gene) and immune and inflammatory responses, on the other hand in the module also has an important significance.
GO Enrichment Analysis of the Gene Modules
For a preliminarily test to assess whether the network was acceptable, we derived the corresponding genes of each module, and PANTHER was used for the functional enrichment analysis of GO terms, P < 0.05 (Mi et al., 2013). For the blue module, genes involved in the inflammatory response (GO: 0006954, P = 1.81E-07), immune response (GO: 0006955, P = 1.54E-05), defense response (GO: 0006952, P = 5.12E-05) and immune system process (GO: 0002376, P = 1.35E-03) were significantly enhanced (Table 4). It is suggested that this module is closely related to the occurrence and development of the immune and inflammation responses in AMI. Meanwhile, by analyzing the brown module, it is evident that the module gene is mainly concentrated in the immune response (GO: 0006955, P = 8.83E-06), regulation of lymphocyte chemotaxis (GO: 1901623, P = 1.05E-05), defense response (GO: 0006952, P = 2.03E-05), response to stimulus (GO: 0050896, P = 9.99E-05) and immune system process (GO: 0002376, P = 1.53E-04) (Table 4). These results indicate that the module may be closely related to the immune response during AMI. The gene modules that we obtained have biological significance, so we named the blue and brown modules as immune modules. The functional enrichment analysis of blue and brown modules indicates that the immune response and inflammation would immediately follow the onset of AMI, which is consistent with previous reports (Fang et al., 2015).
Table 4. Gene ontology (GO) enrichment analysis in gene modules (top 8 significantly enriched biology terms).
A major goal of our research was to analyze the degree of correlation between genes and disease, and to ascertain the significance of genes in the corresponding modules. We defined GS as the association of a gene with immune and inflammatory responses. Then we calculated the mean GS of all genes in the module, which is module significance (MS), and described subsequently the relationship between modules and the inflammatory response (Langfelder and Horvath, 2008). Figure 4B showed the correlation between the various modules and immune/inflammatory responses, which shows that the most relevant modules to the immune and inflammatory responses were the blue and brown modules. For a particular gene, to evaluate its involvement in the given module, we analyzed the correlation between its expression profile and ME profile in all samples. We calculated the MM of all the genes in the corresponding module to learn their significance within the given modules. We identified modules (blue and brown) that showed a high correlation with immune and inflammatory reactions, and GS and MM of each gene are shown in Figures 4C,D.
Hub Genes Associated with the Occurrence of AMI
The above results suggested that the occurrence and development of AMI were closely related to the immune and inflammatory responses. To screen genes that were most relevant to immune and inflammatory responses, we constructed the gene co-expression network of immune module genes. There was a positive correlation between MM and IC in the modules, but not GS. Therefore, we selected the hub genes according to the IC and MM values of each gene module (Table 5). Eight hub genes were selected, including NCF4, AQP9, NFIL3, DYSF, GZMA, TBX21, PRF1, and PTGDR. Combined with previous reports, we hypothesized that these hub genes were closely relevant to immune and inflammatory responses when AMI occurs. The co-expression network of module genes was then constructed to obtain the degree of connection between genes in the module (Figures 5A,B).
Figure 5. Construction of a co-expression network of module genes. (A) Blue module; (B) brown module. Red nodes represent the hub genes with the highest K-Core and MM. The node color represents the K-Core. The node size represents the K-Core power, and the edges between two nodes represent the interactions between the genes.
Transcription Regulatory Network
Based on JASPAR database, the transcription regulatory network of DEGs was constructed and included 66 nodes (Figure 6A). There were 30 transcription factors, and the core transcription factors were SATA4, BCL11B, GATA3, and TBX21, which have the regulatory relationship with the most target genes. (TG > 10, Supplement Table 6). Combined with the results of WGCNA analysis, we found that the transcription factor TBX21 was a hub gene. We screened all target genes associated with the transcription factor TBX21 and constructed a transcriptional regulatory network centered around TBX21 (Figure 6B). Significantly, TBX21 was directly related to the transcription factors SATA4, BCL11B, and GATA3.Interestingly, these four transcription factors were also directly related to PRF1, and PRF1 was also a hub gene.
Figure 6. Transcription regulatory network of DEGs. Based on JASPAR database, the transcription regulatory network constructed by DEGs included 66 nodes (A). (B) Construction of transcription regulatory network centered around TBX21. Red V-type represents up-regulated TF, green V-type represents down-regulated TF; red circles represent up-regulated TG, green circles represent the down-regulated TG.
Validation of Microarrays with RT-qPCR
RT-qPCR was used to validate the microarray data. To verify the main conclusion drawn from the microarray results for peripheral blood samples, the expression levels of genes coding for NCF4, AQP9, NFIL3, DYSF, GZMA, TBX21, PRF1, and PTGDR were determined. Overall, the RT-qPCR results were qualitatively consistent with the results of the microarray analysis. However, the RT-qPCR analysis tended to give higher up-regulation levels than those calculated from the microarray data. The highest change was found for TBX21, PRF1, and NFIL3 (Figure 7).
Figure 7. Validation of microarray data with RT-qPCR. Several hub genes identified in microarray data are dysregulation in AMI patients. mRNA expression of hub genes identified in microarray data validated by RT-qPCR is shown. Total RNAs were isolated from PBMCs or AMI patients and healthy donors. Reverse-transcribed to cDNA and used as template for RT-qPCR analysis. Relative Expression of each gene in PBMCs from healthy donors were considered as 1.
Although the appropriate thromboprophylaxis treatments for AMI have reduced successfully the incidence and mortality of AMI to some extent (Lau and Lip, 2014), effective methods for preventing and diagnosing AMI are still controversial. In our study, we use a systematic biology approach to identify 551 DEGs in blood samples from AMI patients compared with control groups. For enrichment analysis of pathways, DEGs were mainly involved in IBD, the chemokine signaling pathway, toll-like receptor signaling pathway and cytokine-cytokine receptor interactions. From the functional enrichment and GO-Tree analysis, we found that the major biological processes are the inflammatory response, chemotaxis, the immune response and the cellular defense response when AMI occurs; these are functional annotations associated with the immune and inflammatory responses.
In the WGCNA, we identified five gene modules based on 551 DEGs. Each module contained 44–279 genes; only 36 genes failed to be assigned to any of the gene modules. By functional enrichment analysis, the modules have obvious biological significance, the genes of blue and brown modules were significantly enriched in immunity and inflammation-related biological processes. The majority of genes within the turquoise module were involved in mRNA processing and transcription, and the yellow module genes were involved mainly in platelet coagulation and activation.
Through the enrichment function analysis of the module genes, we determine that accompanied by the activation of inflammatory response signal pathways and the initiation of immune system when AMI occurs. After the analysis of WGCNA, we selected the blue and brown modules, which were closely related to inflammatory reaction, as the main research objects. Eight potential hub genes related to AMI were screened from blue and brown modules, namely NCF4, AQP9, NFIL3, DYSF, GZMA, TBX21, PRF1, and PTGDR. Subsequently, RT-qPCR was used to verify the expression of the eight hub genes in peripheral blood of AMI patients. Our results suggested that the expression was consistent with the bioinformatics analysis results. Wherein the most obvious change of hub genes was TBX21 and PRF1, which expression were significantly down-regulated in peripheral blood of AMI patients. In combination with the results of transcriptional regulatory network analysis, we hypothesized that transcription factor TBX21 and target gene PRF1 may be the diagnostic biomarkers of AMI and merit further explored.
Gene TBX21 can encode a transcription factor named T-bet, whose main function is to suppress GATA-3 expression and prevent Th1 to Th2 cell differentiation (Szabo et al., 2000). It is well known that patients with AMI can develop subsequently cardiac arrest (Vanbrabant et al., 2006). Upon cardiac arrest, an abnormal stress response occurs and causes the change of transcription factor T-bet/GATA-3, resulting in the imbalance of Th1/Th2. The Th2-type cytokine-induced waterfall-like cascade is then initiated, which results in multiple organ failure, eventually leading to death. This has also been confirmed by a porcine model of cardiac arrest that lowers GATA-3 and T-bet levels, which alters the drifting Th1/Th2 cells and causes the immune imbalance of myocardial tissue after cardiopulmonary resuscitation (CPR) (Gu et al., 2013).
Perforin (PRF1) can be secreted by NK cells, CTL cells, γδ+T and regulatory T cells. PRF1 is a protein in the membrane attack complex/PR (FMACPF) superfamily (Lichtenheld et al., 1988; Shinkai et al., 1988), which is a highly conserved glycoprotein. Due to its important functions in immune surveillance and immune regulation, the malfunction of PRF1 has been found to be involved in many diseases (Baran et al., 2009). Thiery and Lieberman showed that PRF1 can activate clathrin- and dynein-dependent endocytosis, and inhibition of this endocytosis pathway may trigger apoptotic cell death (Thiery and Lieberman, 2014). In a study of 495 patients with cardiomyopathy, the infiltration of perforin-positive cells in the myocardium could serve as a predictor for long-term prognosis of patients; the presence of perforin-positive infiltration in myocardial cells indicates an adverse left ventricular ejection fraction (LVEF) course (Escher et al., 2014). We speculate that the abnormal expression of perforin may be a major cause of progressing LV dysfunction in AMI.
There is no report about the relationship between the TBX21 and PRF1 by far. It may be the focus of follow-up work that the potential regulatory role of TBX21 on PRF1 or other target genes in AMI. The highlight of this study was to identify several hub genes associated with AMI through the network analysis and to validate the expression levels of these hub genes by RT-qPCR experiments, including TBX21 and PRF1. We hypothesized that TBX21 and PRF1may be potential candidates for diagnostic and possible regulatory targets in the diagnosis and treatment of AMI. Our results provide us a direction for subsequent research in exploring thoroughly the gene regulatory mechanisms of AMI and its targeted therapy.
This study was carried out in accordance with the recommendations of ethics committee of First Hospital of Jilin University with written informed consent from all subjects (2015-019-12). All subjects gave written informed consent in accordance with the Declaration of Helsinki.
JC participated in the design and execution of the study, conducted data analysis and interpretation, and drafted the manuscript. LY, SZ, and XC contributed to the design, execution and data interpretation. All authors participated in modifying and editing the manuscript, and the final manuscript has been read and approved by all authors.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The present study was supported by General Program of National Natural Science Foundation of China (No. 81570253) and by a grant from the Jilin Province Development and Reform Commission (No. 2013-088).
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/article/10.3389/fphys.2016.00615/full#supplementary-material
Ashburner, M., Ball, C. A., Blake, J. A., Botstein, D., Butler, H., Cherry, J. M., et al. (2000). Gene ontology: tool for the unification of biology. Gene Ontol. Consortium. Nat. Genet. 25, 25–29. doi: 10.1038/75556
Baran, K., Dunstone, M., Chia, J., Ciccone, A., Browne, K. A., Clarke, C. J., et al. (2009). The molecular basis for perforin oligomerization and transmembrane pore assembly. Immunity 30, 684–695. doi: 10.1016/j.immuni.2009.03.016
Butte, A. J., and Kohane, I. S. (2000). Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. Pac. Symp. Biocomput. 5, 418–429. doi: 10.1142/9789814447331_0040
Carter, S. L., Brechbühler, C. M., Griffin, M., and Bond, A. T. (2004). Gene co-expression network topology provides a framework for molecular characterization of cellular state. Bioinformatics 20, 2242–2250. doi: 10.1093/bioinformatics/bth234
Eapen, Z. J., Tang, W. H., Felker, G. M., Hernandez, A. F., Mahaffey, K. W., Lincoff, A. M., et al. (2012). Defining heart failure end points in ST-segment elevation myocardial infarction trials: integrating past experiences to chart a path forward. Circ. Cardiovasc. Qual. Outcomes 5, 594–600. doi: 10.1161/CIRCOUTCOMES.112.966150
Escher, F., Kuhl, U., Lassner, D., Stroux, A., Westermann, D., Skurk, C., et al. (2014). Presence of perforin in endomyocardial biopsies of patients with inflammatory cardiomyopathy predicts poor outcome. Eur. J. Heart Fail. 16, 1066–1072. doi: 10.1002/ejhf.148
Fang, L., Moore, X. L., Dart, A. M., and Wang, L. M. (2015). Systemic inflammatory response following acute myocardial infarction. J. Geriatr. Cardiol. 12, 305–312. doi: 10.11909/j.issn.1671-5411.2015.03.020
Gu, W., Li, C. S., Yin, W. P., Hou, X. M., Zhang, J., Zhang, D., et al. (2013). Expression imbalance of transcription factors GATA-3 and T-bet in post-resuscitation myocardial immune dysfunction in a porcine model of cardiac arrest. Resuscitation 84, 848–853. doi: 10.1016/j.resuscitation.2012.11.023
Kiliszek, M., Burzynska, B., Michalak, M., Gora, M., Winkler, A., Maciejak, A., et al. (2012). Altered gene expression pattern in peripheral blood mononuclear cells in patients with acute myocardial infarction. PLoS ONE 7:e50054. doi: 10.1371/journal.pone.0050054
Kumari, S., Nie, J., Chen, H. S., Ma, H., Stewart, R., Li, X., et al. (2012). Evaluation of gene association methods for coexpression network construction and biological knowledge discovery. PLoS ONE 7:e50411. doi: 10.1371/journal.pone.0050411
Lau, Y. C., and Lip, G. Y. (2014). Post myocardial infarction and atrial fibrillation: thromboprophylaxis and risk stratification using the CHA2DS2-VASc score. Cardiol. J. 21, 451–453. doi: 10.5603/CJ.2014.0075
Mathelier, A., Fornes, O., Arenillas, D. J., Chen, C. Y., Denay, G., Lee, J., et al. (2016). JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 44, D110–D115. doi: 10.1093/nar/gkv1176
Mozaffarian, D., Benjamin, E. J., Go, A. S., Arnett, D. K., Blaha, M. J., Cushman, M., et al. (2016). Heart disease and stroke statistics-2016 update: a report from the american heart association. Circulation 133, e38–e360. doi: 10.1161/CIR.0000000000000366
Oldham, M. C., Konopka, G., Iwamoto, K., Langfelder, P., Kato, T., Horvath, S., et al. (2008). Functional organization of the transcriptome in human brain. Nat. Neurosci. 11, 1271–1282. doi: 10.1038/nn.2207
Prieto, C., Risueño, A., Fontanillo, C., and De las Rivas, J. (2008). Human gene coexpression landscape: confident network derived from tissue transcriptomic profiles. PLoS ONE 3:e3911. doi: 10.1371/journal.pone.0003911
Ravasz, E., Somera, A. L., Mongru, D. A., Oltvai, Z. N., and Barabási, A. L. (2002). Hierarchical organization of modularity in metabolic networks. Science 297, 1551–1555. doi: 10.1126/science.1073374
Roger, V. L., Go, A. S., Lloyd-Jones, D. M., Benjamin, E. J., Berry, J. D., Borden, W. B., et al. (2012). Heart disease and stroke statistics–2012 update: a report from the American Heart Association. Circulation 125, e2–e220. doi: 10.1161/CIR.0b013e31823ac046
Sanges, R., Cordero, F., and Calogero, R. A. (2007). oneChannelGUI: a graphical interface to Bioconductor tools, designed for life scientists who are not familiar with R language. Bioinformatics 23, 3406–3408. doi: 10.1093/bioinformatics/btm469
Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., et al. (2003). Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504. doi: 10.1101/gr.1239303
Smoot, M. E., Ono, K., Ruscheinski, J., Wang, P. L., and Ideker, T. (2011). Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics 27, 431–432. doi: 10.1093/bioinformatics/btq675
Stuart, R. O., Bush, K. T., and Nigam, S. K. (2003). Changes in gene expression patterns in the ureteric bud and metanephric mesenchyme in models of kidney development. Kidney Int. 64, 1997–2008. doi: 10.1046/j.1523-1755.2003.00383.x
Suresh, R., Li, X., Chiriac, A., Goel, K., Terzic, A., Perez-Terzic, C., et al. (2014). Transcriptome from circulating cells suggests dysregulated pathways associated with long-term recurrent events following first-time myocardial infarction. J. Mol. Cell. Cardiol. 74, 13–21. doi: 10.1016/j.yjmcc.2014.04.017
Szabo, S. J., Kim, S. T., Costa, G. L., Zhang, X., Fathman, C. G., and Glimcher, L. H. (2000). A novel transcription factor, T-bet, directs Th1 lineage commitment. Cell 100, 655–669. doi: 10.1016/S0092-8674(00)80702-3
Vanbrabant, P., Dhondt, E., Billen, P., and Sabbe, M. (2006). Aetiology of unsuccessful prehospital witnessed cardiac arrest of unclear origin. Eur. J. Emerg. Med. 13, 144–147. doi: 10.1097/01.mej.0000209050.40139.23
Villa-Vialaneix, N., Liaubet, L., Laurent, T., Cherel, P., Gamot, A., and SanCristobal, M. (2013). The structure of a gene co-expression network reveals biological functions underlying eQTLs. PLoS ONE 8:e60045. doi: 10.1371/journal.pone.0060045
Voineagu, I., Wang, X., Johnston, P., Lowe, J. K., Tian, Y., Horvath, S., et al. (2011). Transcriptomic analysis of autistic brain reveals convergent molecular pathology. Nature 474, 380–384. doi: 10.1038/nature10110
Zhang, B., Schmoyer, D., Kirov, S., and Snoddy, J. (2004). GOTree Machine (GOTM): a web-based platform for interpreting sets of interesting genes using Gene Ontology hierarchies. BMC Bioinformatics 5:16. doi: 10.1186/1471-2105-5-16
Zhao, H., Cai, W., Su, S., Zhi, D., Lu, J., and Liu, S. (2014). Screening genes crucial for pediatric pilocytic astrocytoma using weighted gene coexpression network analysis combined with methylation data analysis. Cancer Gene Ther. 21, 448–455. doi: 10.1038/cgt.2014.49
Keywords: acute myocardial infarction, biomarkers, inflammation, systems biology, hub genes
Citation: Chen J, Yu L, Zhang S and Chen X (2016) Network Analysis-Based Approach for Exploring the Potential Diagnostic Biomarkers of Acute Myocardial Infarction. Front. Physiol. 7:615. doi: 10.3389/fphys.2016.00615
Received: 12 July 2016; Accepted: 24 November 2016;
Published: 09 December 2016.
Edited by:Natalia Polouliakh, Sony Computer Science Laboratories, Japan
Reviewed by:Preetam Ghosh, Virginia Commonwealth University, USA
Anatoly Sorokin, Institute of Cell Biophysics, Russian Academy of Sciences, Russia
Copyright © 2016 Chen, Yu, Zhang and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xia Chen, firstname.lastname@example.org