ORIGINAL RESEARCH article

Front. Genet., 03 December 2018

Sec. Computational Genomics

Volume 9 - 2018 | https://doi.org/10.3389/fgene.2018.00573

Functional Prediction of Chronic Kidney Disease Susceptibility Gene PRKAG2 by Comprehensively Bioinformatics Analysis

  • 1. Department of Nephrology, The First Affiliated Hospital, Jinzhou Medical University, Jinzhou, China

  • 2. Jinzhou Medical University, Jinzhou, China

Abstract

The genetic predisposition to chronic kidney disease (CKD) has been widely evaluated especially using the genome-wide association studies, which highlighted some novel genetic susceptibility variants in many genes, and estimated glomerular filtration rate to diagnose and stage CKD. Of these variants, rs7805747 in PRKAG2 was identified to be significantly associated with both serum creatinine and CKD with genome wide significance level. Until now, the potential mechanism by which rs7805747 affects CKD risk is still unclear. Here, we performed a functional analysis of rs7805747 variant using multiple bioinformatics software and databases. Using RegulomeDB and HaploReg (version 4.1), rs7805747 was predicated to locate in enhancer histone marks (Liver, Duodenum Mucosa, Fetal Intestine Large, Fetal Intestine Small, and Right Ventricle tissues). Using GWAS analysis in PhenoScanner, we showed that rs7805747 is not only associated with CKD, but also is significantly associated with other diseases or phenotypes. Using metabolite analysis in PhenoScanner, rs7805747 is identified to be significantly associated with not only the serum creatinine, but also with other 16 metabolites. Using eQTL analysis in PhenoScanner, rs7805747 is identified to be significantly associated with gene expression in multiple human tissues and multiple genes including PRKAG2. The gene expression analysis of PRKAG2 using 53 tissues from GTEx RNA-Seq of 8555 samples (570 donors) in GTEx showed that PRKAG2 had the highest median expression in Heart-Atrial Appendage. Using the gene expression profiles in human CKD, we further identified different expression of PRKAG2 gene in CKD cases compared with control samples. In summary, our findings provide new insight into the underlying susceptibility of PRKAG2 gene to CKD.

Introduction

Chronic kidney disease (CKD) is a major global problem caused by the permanent loss of kidney function, and is also associated with an increased risk for cardiovascular disease (Cusumano and Gonzalez Bedat, 2008; Prodjosudjadi et al., 2009; Chambers et al., 2010; Shinohara, 2010; Sherwood and McCullough, 2016; James et al., 2017; Malhotra et al., 2017; Sinha and Bagga, 2018). The overall prevalence of CKD exceeds 10%, and is approximately 14% in the general population and its incidence is increasing (Almirall, 2016; Hursitoglu, 2016; Mills and He, 2016; Wuttke and Kottgen, 2016; Hedayati et al., 2017; James et al., 2017; Clark et al., 2018). It is reported that up to 20% of CKD cases are caused by genetic forms of renal disease (Almirall, 2016; Hursitoglu, 2016; Mills and He, 2016; Wuttke and Kottgen, 2016; Hedayati et al., 2017; James et al., 2017; Clark et al., 2018). Understanding genetic predisposition to CKD and uncovering underlying pathophysiological mechanisms may contribute to the development of targeted therapies. In recent years, the genetic predisposition to CKD has been widely evaluated especially using the genome-wide association studies (GWAS), which highlighted some novel genetic susceptibility variants in many genes, and estimated glomerular filtration rate to diagnose and stage CKD (Pattaro et al., 2016; Wuttke and Kottgen, 2016).

In these CKD risk genes, a genetic variant rs7805747 in PRKAG2 was identified to be significantly associated with both serum creatinine and CKD with genome wide significance level (Chambers et al., 2010). The rs7805747 (chr7:151407801 for hg19) variant is located in intronic of PRKAG2. PRKAG2 is a protein coding gene. Until now, the potential mechanism by which rs7805747 affects CKD risk is still unclear. It is difficult to identify the function of coding and non-coding genes in molecular wet laboratories. However, computational methods including kinds of bioinformatics software and databases may be useful tools to guide and predict function (Zou et al., 2016; Wan and Zou, 2017; Wan et al., 2017; Wei et al., 2017a,b; He et al., 2018a,b; Jia et al., 2018a,b; Jiang et al., 2018; Zeng et al., 2018). Here, we performed a functional analysis of rs7805747 variant using multiple bioinformatics databases including RegulomeDB (Boyle et al., 2012), HaploReg (version 4.1) (Ward and Kellis, 2016), PhenoScanner (version 1.1) (Staley et al., 2016), and UCSC Genome Browser (Rosenbloom et al., 2015; Tyner et al., 2017; Casper et al., 2018), as did in previous studies (Lu et al., 2011; Rhie et al., 2013; Hazelett et al., 2014; Liu et al., 2016, 2017a,b,c,d,e, 2018b; Guo et al., 2017; Hu et al., 2017a,b; Jiang et al., 2017; Zhang et al., 2018). Meanwhile, we analyzed a whole genome case-control expression profiles in human CKD to investigate whether the susceptibility gene PRKAG2 is differently expressed in CKD cases compared with control samples.

Materials and Methods

Regulatory Analysis of rs7805747 Using RegulomeDB

RegulomeDB database could annotate genetic variants with known and predicted regulatory elements in the intergenic regions of the human genome (Boyle et al., 2012). In brief, the known and predicted regulatory DNA elements include regions of DNAase hypersensitivity, binding sites of transcription factors, and promoter regions that have been biochemically characterized to regulation transcription (Boyle et al., 2012). These regulatory element datasets are from Gene Expression Omnibus (GEO), the Encyclopedia of DNA Elements (ENCODE) project, and published literature (Boyle et al., 2012).

Functional Analysis of rs7805747 Using HaploReg

HaploReg is a tool for exploring annotations of the non-coding variants (Ward and Kellis, 2012, 2016). HaploReg v4 included LD information from the 1000 Genomes Project, chromatin state and protein binding annotation from the Roadmap Epigenomics and ENCODE projects, sequence conservation across mammals, the effect of SNPs on regulatory motifs, and the effect of SNPs on expression from eQTL studies (Ward and Kellis, 2012, 2016). More detailed information is provided in the original studies (Ward and Kellis, 2012, 2016).

Functional Analysis of rs7805747 Using PhenoScanner

PhenoScanner included publicly available large-scale GWAS summary results, about 3 billion associations and over 10 million unique single nucleotide polymorphisms (SNPs) and a broad range of phenotypes (Staley et al., 2016). The results are aligned across traits to the same effect and non-effect alleles for each SNP (Staley et al., 2016). Here, we performed three kinds of functional analyses including the GWAS, Metabolites, and eQTL analysis options (Staley et al., 2016). To perform a GWAS analysis, the PhenoScanner included 88 GWAS datasets with 76 kinds of diseases or phenotypes (Staley et al., 2016). To perform a Metabolites analysis, PhenoScanner consisted of two metabolomics datasets (Shin et al., 2014; Kettunen et al., 2016). To perform an eQTL analysis, PhenoScanner included several eQTL datasets from eQTL Browser, Geuvadis, GTEx (version 6), MuTHER and bloodeqtlbrowser. More detailed information is provided on the original study (Staley et al., 2016).

Gene Expression Analysis of PRKAG2 in GTEx

We evaluated the expression of PRKAG2 using the RNA-Seq datasets from the NIH Genotype-Tissue Expression (GTEx) project, which was created to establish a sample and data resource for studies on the relationship between genetic variation and gene expression in multiple human tissues (GTEx Consortium, 2013; Mele et al., 2015). The GTEx project included median gene expression levels in 51 tissues and 2 cell lines (V6, October 2015) (GTEx Consortium, 2013; Mele et al., 2015). This release is based on data from 8555 tissue samples obtained from 570 adult post-mortem individuals (GTEx Consortium, 2013; Carithers et al., 2015; Mele et al., 2015). Here, we used the Genome Browser to evaluate the expression of PRKAG2 in GTEx 53 human tissues (V6, October 2015). The UCSC Genome Browser is a new method to visualize interactions between regions of the genome (Meyer et al., 2013; Karolchik et al., 2014; Rosenbloom et al., 2015; Speir et al., 2016; Tyner et al., 2017; Casper et al., 2018).

Case-Control Gene Expression Analysis of PRKAG2

We analyzed a whole genome case-control expression profiles in human CKD (Nakagawa et al., 2015). In the original study, a microarray analysis with renal biopsy specimens from CKD patients was conducted to identify the responsible genes associated with tubulointerstitial fibrosis and tubular cell injury in CKD (Nakagawa et al., 2015). This study showed microarray profiles in a total of 61 samples including 53 biopsy specimens of CKD patients, and 8 controls (Nakagawa et al., 2015). Here, we selected the web tool GEO2R to evaluate whether PRKAG2 gene is significantly dysregulated in CKD cases compared with control samples, as did in a recent study (Liu et al., 2018a). The significance level is defined to be P < 0.01.

Results

Regulatory Analysis of rs7805747 Using RegulomeDB

Using RegulomeDB, the predicted score is 5, which suggested that rs7805747 is likely to affect binding (TF binding) or DNase peak. The predicted binding protein is HNF4A (chr7:151407767-151408030 by ChIP-seq in Caco2 cell type) (Verzi et al., 2010). The histone modification analysis showed that rs7805747 was predicated to locate in enhancer histone marks (Liver, Fetal Intestine Large, Right Ventricle, Duodenum Mucosa, and Fetal Intestine Small). Here, we provided some key information about regulatory analysis in Table 1. More detailed results are described in Supplementary Table 1.

Table 1

MethodLocationChromatin stateTissue
ChromHMMchr7:151407000..151408400EnhancersFetal intestine large
ChromHMMchr7:151407200..151408000EnhancersRight ventricle
ChromHMMchr7:151407200..151408200EnhancersDuodenum mucosa
ChromHMMchr7:151407400..151411400EnhancersLiver
ChromHMMchr7:151407000..151408600Genetic enhancersFetal intestine small
ChromHMMchr7:151406000..151408400HeterochromatinH9 derived neuron cultured cells
ChromHMMchr7:151407400..151408400Repressed PolyCombForeskin fibroblast primary cells skin01
ChromHMMchr7:151407800..151408400Repressed PolyCombForeskin fibroblast primary cells skin02
ChromHMMchr7:151405400..151410400Strong transcriptionMonocytes-CD14+ RO01746 primary cells
ChromHMMchr7:151405600..151408000Strong transcriptionPrimary monocytes from peripheral blood

Key histone modification analysis of rs7805747 using RegulomeDB.

Functional Analysis of rs7805747 Using HaploReg

Using HaploReg v4, rs7805747 is predicated to locate in enhancer histone marks (Liver, Duodenum Mucosa, Fetal Intestine Large, Fetal Intestine Small, and Right Ventricle tissues), DNase hypersensitivity (Foreskin Melanocyte Primary Cells skin01, Fetal Heart, Fetal Intestine Large, Fetal Intestine Small, Small Intestine), and motifs changed (GATA, HMGN3, Pax-6, and Tgif1) (Ward and Kellis, 2012, 2016).

Functional Analysis of rs7805747 Using PhenoScanner in GWAS

Using PhenoScanner in GWAS option, we identified 43 significant association results with P < 0.01. In addition to the CKD, we found that rs7805747 is also significantly associated with other diseases or phenotypes including Hemoglobin Hb, Hematocrit Hct, Red blood cell count RBC, systolic blood pressure (SBP), Breast cancer, Gout, Hypertension, and Extraversion, as described in Table 2.

Table 2

SNPTraitPMIDP-valueSamples
rs7805747log(eGFR creatinine in non-diabetics)268311991.50E-29116602
rs7805747log(eGFR creatinine)268311998.00E-29131772
rs7805747Chronic kidney disease268311992.10E-14117165
rs7805747Serum creatinine estimated glomerular filtration rate eGFR without diabetes224791914.00E-14130600
rs7805747Serum creatinine203831462.00E-1390075
rs7805747Serum creatinine estimated glomerular filtration rate eGFR224791912.70E-13130600
rs7805747Serum creatinine estimated glomerular filtration rate eGFR no diabetes203831469.20E-1390075
rs7805747Chronic kidney disease203831464.00E-12NA
rs7805747Kidney diseases203831464.00E-12NA
rs7805747Chronic kidney disease203831464.20E-1290075
rs7805747log(eGFR creatinine)203831465.10E-1167093
rs7805747Serum creatinine estimated glomerular filtration rate eGFR203831465.10E-1190075
rs7805747Chronic kidney disease224791916.00E-10130600
rs7805747Serum creatinine estimated glomerular filtration rate eGFR age 65224791917.10E-10130600
rs7805747Hemoglobin Hb232225171.80E-09135367
rs7805747Serum creatinine estimated glomerular filtration rate eGFR with hypertension224791918.00E-09130600
rs7805747Chronic kidney disease203831468.60E-0962237
rs7805747Serum creatinine estimated glomerular filtration rate eGFR males224791911.10E-08130600
rs7805747Hematocrit Hct232225173.92E-08135367
rs7805747Red blood cell count RBC232225179.17E-08135367
rs7805747Serum creatinine estimated glomerular filtration rate eGFR females224791919.20E-08130600
rs7805747Serum creatinine206866512.20E-0768439
rs7805747Serum creatinine estimated glomerular filtration rate eGFR with hypertension203831462.60E-0790075
rs7805747Serum cystatin c estimated glomerular filtration rate eGFR203831468.10E-0790075
rs7805747Serum creatinine estimated glomerular filtration rate eGFR without hypertension224791918.80E-07130600
rs7805747Serum creatinine estimated glomerular filtration rate eGFR age 65224791911.10E-06130600
rs7805747log(eGFR cystatin C)268311993.00E-0632299
rs7805747Serum creatinine estimated glomerular filtration rate eGFR no hypertension203831464.60E-0690075
rs7805747SBP276184476.35E-06192763
rs7805747Cystatin C in serum203831461.80E-0590075
rs7805747Serum cystatin c estimated glomerular filtration rate eGFR224791911.82E-05130600
rs7805747Serum urate232634863.91E-0594744
rs7805747Chronic kidney disease severe224791911.00E-04130600
rs7805747Breast cancer235553150.0004314905
rs7805747log(eGFR cystatin C)203831460.0004520957
rs7805747Pulse pressure276184470.0006478192763
rs7805747Gout232634860.000755869374
rs7805747Transmission distortion223776320.00094174728
rs7805747HbA1C208586830.00155546368
rs7805747Hypertension276184470.003366183273
rs7805747Plasma Beta 2 microglobulin levels234171100.00656728
rs7805747Extraversion263625750.00693363030
rs7805747Height SDS for females aged 10 years234496270.0085126967

Significant association between rs7805747 and kinds of diseases or phenotypes with P ≤ 0.01.

eGFR, estimated glomerular filtration rate; SDS, standard deviation scores.

Functional Analysis of rs7805747 Using PhenoScanner in eQTL

Using PhenoScanner in eQTL option, we identified 23 significant associations between rs7805747 and gene expression with P < 0.01. These findings show that rs7805747 is significantly associated with gene expression in multiple human tissues including brain, cells transformed fibroblasts, colon sigmoid, colon transverse, heart left ventricle, liver, lung, skin, small intestine terminal ileum, stomach, and whole blood, as described in Table 3. These regulated genes include GIMAP1, GIMAP5, AGAP3, KCNH2, TMEM176B, TMEM176A, FASTK, WDR86, NOS3, WDR86-AS1, XRCC2, YBX1P4, SLC4A2, TMUB1, and PRKAG2. Importantly, rs7805747 could significantly regulate PRKAG2 expression in blood with P = 3.81E-03 and 8.15E-04.

Table 3

SNPPMIDSourceTissueGeneSamplesEffect alleleBetaSEP-value
rs780574725954001GTExBrain anterior cingulate cortex ba24GIMAP172A–0.4740.16756.68E-03
rs780574725954001GTExBrain caudate basal gangliaGIMAP5100A  0.32080.079131.18E-04
rs780574725954001GTExBrain cerebellar hemisphereAGAP389A–0.29710.078843.50E-04
rs780574725954001GTExBrain cerebellumKCNH2103A  0.27660.072742.76E-04
rs780574725954001GTExBrain cerebellumTMEM176B103A  0.25030.085454.41E-03
rs780574725954001GTExBrain cortexTMEM176A96A  0.22840.067741.19E-03
rs780574725954001GTExBrain cortexTMEM176B96A  0.1960.061422.08E-03
rs780574725954001GTExBrain nucleus accumbens basal gangliaTMEM176B93A  0.23080.06911.34E-03
rs780574725954001GTExBrain nucleus accumbens basal gangliaFASTK93A–0.18280.066277.37E-03
rs780574725954001GTExBrain putamen basal gangliaGIMAP582A  0.2560.07811.75E-03
rs780574725954001GTExCells transformed fibroblastsGIMAP5272A–0.21520.075214.60E-03
rs780574725954001GTExColon sigmoidWDR86124A  0.18760.068387.19E-03
rs780574725954001GTExColon transverseNOS3169A–0.2020.073997.20E-03
rs780574725954001GTExColon transverseNA169A–0.26240.096777.59E-03
rs780574725954001GTExHeart left ventricleWDR86-AS1190A  0.2770.10398.50E-03
rs780574725954001GTExLiverXRCC297A–0.34580.12979.42E-03
rs780574725954001GTExLungYBX1P4278A  0.28940.086219.19E-04
rs780574725954001GTExSkin not sun exposed suprapubicFASTK196A–0.10020.036687.01E-03
rs780574725954001GTExSmall intestine terminal ileumSLC4A277A  0.32450.081011.87E-04
rs780574725954001GTExStomachTMUB1170A  0.17340.057873.26E-03
rs780574725954001GTExWhole bloodPRKAG2338A–0.1450.049713.81E-03
rs780574722941192MuTHERSkinXRCC2667A  0.097560.033173.30E-03
rs780574724013639Westra HPeripheral bloodPRKAG25311NA  NANA8.15E-04

Significant association between rs7805747 and the gene expression with P ≤ 0.01.

Beta is the regression coefficient based on A allele, which means that A allele regulates increased (Beta > 0) and reduced (Beta < 0) expression of nearby genes.

Functional Analysis of rs7805747 Using PhenoScanner in Metabolites

Using PhenoScanner in metabolites option, we identified significant association between rs7805747 and 17 metabolites with P < 0.01. These metabolites included Creatinine, Indolelactate, Phenol sulfate, Pseudouridine, Propionylcarnitine, C-glycosyltryptophan, Kynurenine, Myo-inositol, 3-carboxy-4-methyl-5-propyl-2-furanpropanoate (CMPF), 1-palmitoylglycerophosphoethanolamine, Phenyllactate (PLA), Erythronate, N-acetylthreonine, Citrulline, 3-methoxytyrosine, Urate, 2-methylbutyroylcarnitine, as described in Table 4.

Table 4

SNPTraitPMIDEffect alleleBetaSEP-valueSamples
rs7805747Creatinine27005778A0.060170.011269.76E-0824806
rs7805747Indolelactate24816252A0.01310.00283.77E-066939
rs7805747Phenol sulfate24816252A0.02020.00563.23E-047361
rs7805747Pseudouridine24816252A0.00650.00184.24E-047336
rs7805747Propionylcarnitine24816252A0.00930.00288.46E-047364
rs7805747C-glycosyltryptophan24816252A0.00570.00181.89E-037338
rs7805747Kynurenine24816252A0.0080.00272.58E-037368
rs7805747Myo-inositol24816252A0.0080.00272.62E-037354
rs78057473-carboxy-4-methyl-5-propyl-2-furanpropanoate (CMPF)24816252A0.02780.00932.71E-037363
rs78057471-palmitoylglycerophosphoethanolamine24816252A0.00810.00284.48E-037317
rs7805747Phenyllactate (PLA)24816252A0.00810.00294.87E-035702
rs7805747Erythronate24816252A0.00540.00195.10E-037307
rs7805747N-acetylthreonine24816252A0.00770.00286.05E-036502
rs7805747Citrulline24816252A0.00510.00196.30E-037325
rs78057473-methoxytyrosine24816252A0.00770.00286.38E-035656
rs7805747Urate24816252A0.0050.00186.56E-037371
rs78057472-methylbutyroylcarnitine24816252A0.00760.00286.76E-037007

Significant association between rs7805747 and metabolites with P ≤ 0.01.

Beta is the regression coefficient based on A allele, which means that A allele regulates increased (Beta > 0) and reduced (Beta < 0) metabolites.

Gene Expression Analysis of PRKAG2 in GTEx

Using the UCSC Genome Browser, the results showed that PRKAG2 had the highest median expression: 34.90 RPKM in Heart – Atrial Appendage (Ensembl gene ID: ENSG00000106617.9, Genomic position: hg38 chr7:151556111-151877125). PRKAG2 had the total median expression 412.37 RPKM in all these 53 tissues. Figure 1 provided more detailed information about the PRKAG2 gene expression in 53 tissues from GTEx RNA-Seq of 8555 samples (570 donors).

FIGURE 1

Case-Control Gene Expression Analysis of PRKAG2

There are three probes to evaluate the expression of PRKAG2 gene in the gene expression profiles in human CKD including A_24_P384779, A_23_P44366, and A_23_P314760. Each of these three probes represents different regions of PRKAG2 gene. These three probes may have the same or different transcript or isoforms, or extrons. The results showed that all these probes about the PRKAG2 gene is significantly dysregulated in CKD cases compared with control samples including A_24_P384779 [P = 1.23E-07 and log2(fold change) = -2.07], and A_23_P44366 [P = 4.39E-03 and log2(fold change) = 0.74], and A_23_P314760 [P = 1.54E-02 and log2(fold change) = 0.62]. In Table 3, there are 15 unique genes regulated by rs7805747. In addition to PRKAG2, we also evaluated the expression of other 14 genes, as provided in Table 3. The results showed that 4 of other 14 genes including TMUB1, AGAP3, XRCC2, and WDR86-AS1 also had different expression in CKD cases with P < 0.01. Table 5 provided the detailed information about 23 probes of 15 genes including PRKAG2. Importantly, the different expression of PRKAG2, TMUB1, AGAP3, and XRCC2 had passed the multiple testing correction threshold 0.01/23 = 0.000435.

Table 5

Probe IDGeneModerated t-statisticlog2(fold change)P-value
A_24_P384779PRKAG2–5.955014–2.07192161.23E-07
A_23_P31195TMUB1   4.8579817  0.92574948.08E-06
A_24_P333364AGAP3   4.7795743  1.17766451.08E-05
A_23_P134584XRCC2   4.0270549  1.54343011.53E-04
A_24_P317622AGAP3–3.5418815–1.52204087.50E-04
A_32_P138396WDR86-AS1  3.5246206  1.35701837.92E-04
A_23_P44366PRKAG2  2.9543448  0.73601694.39E-03
A_32_P77343XRCC2–2.8724894–1.39783825.53E-03
A_23_P314760PRKAG2  2.4911763  0.61880341.54E-02
A_23_P427023GIMAP1  2.4154446  0.62074051.86E-02
A_32_P123479WDR86  2.2539559  0.89316062.77E-02
A_23_P70849NOS3  1.8409091  0.41711767.03E-02
A_23_P253389SLC4A2–1.7570194–0.65062018.37E-02
A_23_P31188GIMAP1  1.6700716  0.35452239.98E-02
A_23_P252082TMEM176A  1.6570106  0.65626791.02E-01
A_23_P502930FASTK  1.6067909  0.31095511.13E-01
A_23_P42588GIMAP5–1.2244375–0.3379372.25E-01
A_23_P377882KCNH2–1.1399471–0.30271312.59E-01
A_24_P354998TMUB1  0.9443127  0.24984723.49E-01
A_23_P168403KCNH2–0.9265106–0.43124053.58E-01
A_23_P157007TMEM176B–0.3719722–0.15212757.11E-01
A_23_P111452AGAP3  0.3314008  0.12135197.41E-01
A_23_P215140FASTK  0.0048177  0.00089369.96E-01

Gene expression analysis of 15 genes regulated by rs7805747 using CKD gene expression dataset.

Discussion

It is reported that PRKAG2 could encode the gamma2-subunit isoform of 5′-AMP-activated protein kinase (AMPK) (Ahmad et al., 2005; Banerjee et al., 2007; Folmes et al., 2009; Kim et al., 2012; Thorn et al., 2013; Zhang et al., 2013; Hinson et al., 2016, 2017). AMPK is a metabolic enzyme, which plays important roles in regulating of energy metabolism in response to cellular stress. AMPK has been identified to be a regulator of metabolism, survival, and fibrosis, by a recent integrative analysis of PRKAG2 cardiomyopathy iPS and microtissue models (Hinson et al., 2016). In addition, mutations in PRKAG2 have been identified to be associated with hypertrophic cardiomyopathy (Xu et al., 2017).

Over the past decade, GWAS have considerably improved our understanding of the genetic basis of kidney function and disease (Wuttke and Kottgen, 2016). A SNP rs7805747 identified by CKD GWAS lies upstream of PRKAG2. Here, we performed a comprehensively functional analysis of this variant using multiple bioinformatics databases including RegulomeDB (Boyle et al., 2012), HaploReg (version 4.1) (Ward and Kellis, 2016), and PhenoScanner (version 1.1) (Staley et al., 2016). Using RegulomeDB, rs7805747 is predicted to affect HNF4A binding or DNase peak. Using RegulomeDB, the predicted score is 5, which suggested that rs7805747 is likely to affect binding (TF binding) or DNase peak. The predicted binding protein is HNF4A (chr7:151407767-151408030 by ChIP-seq in Caco2 cell type). In addition, rs7805747 was predicated to locate in enhancer histone marks (Liver, Fetal Intestine Large, Right Ventricle, Duodenum Mucosa, and Fetal Intestine Small). Using HaploReg (version 4.1), we identified rs7805747 to be associated with enhancer histone marks, DNase hypersensitivity, and motifs changed. In HaploReg (version 4.1), rs7805747 was also predicated to locate in enhancer histone marks (Liver, Duodenum Mucosa, Fetal Intestine Large, Fetal Intestine Small, and Right Ventricle tissues). Hence, the findings in HaploReg (version 4.1) were consistent with RegulomeDB.

Using PhenoScanner in GWAS option, we showed that rs7805747 is not only associated with CKD, but also is significantly associated with other diseases or phenotypes including Hemoglobin Hb, Hematocrit Hct, Red blood cell count RBC, SBP, Breast cancer, Gout, Hypertension, and Extraversion.

Using PhenoScanner in eQTL option, rs7805747 is identified to be significantly associated with gene expression in multiple human tissues and multiple genes including PRKAG2. Previous study has reported rs7805747 to be associated with serum creatinine and CKD (Chambers et al., 2010). Using PhenoScanner in metabolites option, rs7805747 is identified to be significantly associated with not only the serum creatinine, but also with other 16 metabolites, as described in Table 4.

The gene expression analysis of PRKAG2 using 53 tissues from GTEx RNA-Seq of 8555 samples (570 donors) in GTEx showed that PRKAG2 had the highest median expression in Heart – Atrial Appendage. Using the gene expression profiles in human CKD, we further identified different expression of PRKAG2 gene in CKD cases compared with control samples. All these findings indicate that rs7805747 is associated with CKD risk, PRKAG2 gene expression, and 17 metabolites. Meanwhile, gene expression analysis further showed that CKD cases had different expression of PRKAG2 gene. In summary, our findings provide new insight into the underlying susceptibility of PRKAG2 gene to CKD.

Statements

Author contributions

EW conceived and initiated the project and performed the functional analysis. EW, HZ, DZ, LL, and LD wrote the manuscript. All authors reviewed the manuscript and contributed to the final manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2018.00573/full#supplementary-material

References

  • 1

    AhmadF.AradM.MusiN.HeH.WolfC.BrancoD.et al (2005). Increased alpha2 subunit-associated AMPK activity and PRKAG2 cardiomyopathy.Circulation1123140–3148.

  • 2

    AlmirallJ. (2016). Sodium excretion, cardiovascular disease, and chronic kidney disease.JAMA316:1112.

  • 3

    BanerjeeS. K.RamaniR.SabaS.RagerJ.TianR.MathierM. A.et al (2007). A PRKAG2 mutation causes biphasic changes in myocardial AMPK activity and does not protect against ischemia.Biochem. Biophys. Res. Commun.360381–387.

  • 4

    BoyleA. P.HongE. L.HariharanM.ChengY.SchaubM. A.KasowskiM.et al (2012). Annotation of functional variation in personal genomes using RegulomeDB.Genome Res.221790–1797. 10.1101/gr.137323.112

  • 5

    CarithersL. J.ArdlieK.BarcusM.BrantonP. A.BrittonA.BuiaS. A.et al (2015). A novel approach to high-quality postmortem tissue procurement: the GTEx project.Biopreserv. Biobank13311–319. 10.1089/bio.2015.0032

  • 6

    CasperJ.ZweigA. S.VillarrealC.TynerC.SpeirM. L.RosenbloomK. R.et al (2018). The UCSC genome browser database: 2018 update.Nucleic Acids Res.46D762–D769. 10.1093/nar/gkx1020

  • 7

    ChambersJ. C.ZhangW.LordG. M.van der HarstP.LawlorD. A.SehmiJ. S.et al (2010). Genetic loci influencing kidney function and chronic kidney disease.Nat. Genet.42373–375. 10.1038/ng.566

  • 8

    ClarkW. F.SontropJ. M.HuangS. H.GalloK.MoistL.HouseA. A.et al (2018). Effect of coaching to increase water intake on kidney function decline in adults with chronic kidney disease: the CKD WIT randomized clinical trial.JAMA3191870–1879. 10.1001/jama.2018.4930

  • 9

    CusumanoA. M.Gonzalez BedatM. C. (2008). Chronic kidney disease in Latin America: time to improve screening and detection.Clin. J. Am. Soc. Nephrol.3594–600. 10.2215/CJN.03420807

  • 10

    FolmesK. D.ChanA. Y.KoonenD. P.PulinilkunnilT. C.BaczkoI.HunterB. E.et al (2009). Distinct early signaling events resulting from the expression of the PRKAG2 R302Q mutant of AMPK contribute to increased myocardial glycogen.Circ. Cardiovasc. Genet.2457–466. 10.1161/CIRCGENETICS.108.834564

  • 11

    GTEx Consortium (2013). The genotype-tissue expression (GTEx) Project.Nat. Genet.45580–585. 10.1038/ng.2653

  • 12

    GuoL.PengY.SunL.HanX.XuJ.MaoD. (2017). Ovarian cancer variant rs2072590 is associated with HOXD1 and HOXD3 gene expression.Oncotarget8103410–103414. 10.18632/oncotarget.21902

  • 13

    HazelettD. J.RhieS. K.GaddisM.YanC.LakelandD. L.CoetzeeS. G.et al (2014). Comprehensive functional annotation of 77 prostate cancer risk loci.PLoS Genet.10:e1004102. 10.1371/journal.pgen.1004102

  • 14

    HeW.JiaC.DuanY.ZouQ. (2018a). 70ProPred: a predictor for discovering sigma70 promoters based on combining multiple features.BMC Syst. Biol.12(Suppl. 4):44. 10.1186/s12918-018-0570-1

  • 15

    HeW.JiaC.ZouQ. (2018b). 4mCPred: machine learning methods for DNA N4-methylcytosine sites prediction.Bioinformatics10.1093/bioinformatics/bty668 [Epub ahead of print].

  • 16

    HedayatiS. S.GreggL. P.CarmodyT.JainN.ToupsM.RushA. J.et al (2017). Effect of sertraline on depressive symptoms in patients with chronic kidney disease without dialysis dependence: the CAST randomized clinical trial.JAMA3181876–1890. 10.1001/jama.2017.17131

  • 17

    HinsonJ. T.ChopraA.LoweA.ShengC. C.GuptaR. M.KuppusamyR.et al (2016). Integrative analysis of PRKAG2 cardiomyopathy iPS and microtissue models identifies AMPK as a regulator of metabolism, survival, and fibrosis.Cell Rep.173292–3304.

  • 18

    HinsonJ. T.ChopraA.LoweA.ShengC. C.GuptaR. M.KuppusamyR.et al (2017). Integrative analysis of PRKAG2 cardiomyopathy iPS and microtissue models identifies AMPK as a regulator of metabolism, survival, and fibrosis.Cell Rep.19:2410.

  • 19

    HuY.ChengL.ZhangY.BaiW.ZhouW.WangT.et al (2017a). Rs4878104 contributes to Alzheimer’s disease risk and regulates DAPK1 gene expression.Neurol. Sci.381255–1262. 10.1007/s10072-017-2959-9

  • 20

    HuY.JinS.ChengL.LiuG.JiangQ. (2017b). Autoimmune disease variants regulate GSDMB gene expression in human immune cells and whole blood.Proc. Natl. Acad. Sci. U.S.A.114E7860–E7862.

  • 21

    HursitogluM. (2016). Sodium excretion, cardiovascular disease, and chronic kidney disease.JAMA316:1112.

  • 22

    JamesM. T.PannuN.HemmelgarnB. R.AustinP. C.TanZ.McArthurE.et al (2017). Derivation and external validation of prediction models for advanced chronic kidney disease following acute kidney injury.JAMA3181787–1797. 10.1001/jama.2017.16326

  • 23

    JiaC.YangQ.ZouQ. (2018a). NucPosPred: predicting species-specific genomic nucleosome positioning via four different modes of general PseKNC.J. Theor. Biol.45015–21. 10.1016/j.jtbi.2018.04.025

  • 24

    JiaC.ZuoY.ZouQ. (2018b). O-GlcNAcPRED-II: an integrated classification algorithm for identifying O-GlcNAcylation sites based on fuzzy undersampling and a K-means PCA oversampling technique.Bioinformatics342029–2036. 10.1093/bioinformatics/bty039

  • 25

    JiangJ.XingF.ZengX.ZouQ. (2018). RicyerDB: a database for collecting rice yield-related genes with biological analysis.Int. J. Biol. Sci.14965–970. 10.7150/ijbs.23328

  • 26

    JiangQ.JinS.JiangY.LiaoM.FengR.ZhangL.et al (2017). Alzheimer’s disease variants with the genome-wide significance are significantly enriched in immune pathways and active in immune cells.Mol. Neurobiol.54594–600. 10.1007/s12035-015-9670-8

  • 27

    KarolchikD.BarberG. P.CasperJ.ClawsonH.ClineM. S.DiekhansM.et al (2014). The UCSC genome browser database: 2014 update.Nucleic Acids Res.42D764–D770. 10.1093/nar/gkt1168

  • 28

    KettunenJ.DemirkanA.WurtzP.DraismaH. H.HallerT.RawalR.et al (2016). Genome-wide study for circulating metabolites identifies 62 loci and reveals novel systemic effects of LPA.Nat. Commun.7:11122. 10.1038/ncomms11122

  • 29

    KimE.LeeS. H.LeeK. S.CheongH. K.NamkoongK.HongC. H.et al (2012). AMPK gamma2 subunit gene PRKAG2 polymorphism associated with cognitive impairment as well as diabetes in old age.Psychoneuroendocrinology37358–365. 10.1016/j.psyneuen.2011.07.005

  • 30

    LiuG.HuY.JinS.JiangQ. (2017a). Genetic variant rs763361 regulates multiple sclerosis CD226 gene expression.Proc. Natl. Acad. Sci. U.S.A.114E906–E907.

  • 31

    LiuG.SunJ. Y.XuM.YangX. Y.SunB. L. (2017b). SORL1 variants show different association with early-onset and late-onset Alzheimer’s disease risk.J. Alzheimers Dis.581121–1128. 10.3233/JAD-170005

  • 32

    LiuG.XuY.JiangY.ZhangL.FengR.JiangQ. (2017c). PICALM rs3851179 variant confers susceptibility to Alzheimer’s disease in chinese population.Mol. Neurobiol.543131–3136. 10.1007/s12035-016-9886-2

  • 33

    LiuG.ZhangF.HuY.JiangY.GongZ.LiuS.et al (2017d). Genetic variants and multiple sclerosis risk gene SLC9A9 expression in distinct human brain regions.Mol. Neurobiol.546820–6826. 10.1007/s12035-016-0208-5

  • 34

    LiuG.ZhangF.JiangY.HuY.GongZ.LiuS.et al (2017e). Integrating genome-wide association studies and gene expression data highlights dysregulated multiple sclerosis risk pathways.Mult. Scler.23205–212. 10.1177/1352458516649038

  • 35

    LiuG.LiuY.JiangQ.JiangY.FengR.ZhangL.et al (2016). Convergent genetic and expression datasets highlight TREM2 in Parkinson’s disease susceptibility.Mol. Neurobiol.534931–4938. 10.1007/s12035-015-9416-7

  • 36

    LiuG.WangT.TianR.HuY.HanZ.WangP.et al (2018a). Alzheimer’s disease risk variant rs2373115 regulates GAB2 and NARS2 expression in human brain tissues.J. Mol. Neurosci.6637–43. 10.1007/s12031-018-1144-9

  • 37

    LiuG.ZhangY.WangL.XuJ.ChenX.BaoY.et al (2018b). Alzheimer’s disease rs11767557 variant regulates EPHA1 gene expression specifically in human whole blood.J. Alzheimers Dis.611077–1088. 10.3233/JAD-170468

  • 38

    LuY.ZhangZ.YuH.ZhengS. L.IsaacsW. B.XuJ.et al (2011). Functional annotation of risk loci identified through genome-wide association studies for prostate cancer.Prostate71955–963. 10.1002/pros.21311

  • 39

    MalhotraR.NguyenH. A.BenaventeO.MeteM.HowardB. V.MantJ.et al (2017). Association between more intensive vs less intensive blood pressure lowering and risk of mortality in chronic kidney disease stages 3 to 5: a systematic review and meta-analysis.JAMA Intern. Med.1771498–1505. 10.1001/jamainternmed.2017.4377

  • 40

    MeleM.FerreiraP. G.ReverterF.DeLucaD. S.MonlongJ.SammethM.et al (2015). Human genomics. The human transcriptome across tissues and individuals.Science348660–665. 10.1126/science.aaa0355

  • 41

    MeyerL. R.ZweigA. S.HinrichsA. S.KarolchikD.KuhnR. M.WongM.et al (2013). The UCSC genome browser database: extensions and updates 2013.Nucleic Acids Res.41D64–D69. 10.1093/nar/gks1048

  • 42

    MillsK.HeJ. (2016). Sodium excretion, cardiovascular disease, and chronic kidney disease.JAMA3161112–1113.

  • 43

    NakagawaS.NishiharaK.MiyataH.ShinkeH.TomitaE.KajiwaraM.et al (2015). Molecular markers of tubulointerstitial fibrosis and tubular cell damage in patients with chronic kidney disease.PLoS One10:e0136994. 10.1371/journal.pone.0136994

  • 44

    PattaroC.TeumerA.GorskiM.ChuA. Y.LiM.MijatovicV.et al (2016). Genetic associations at 53 loci highlight cell types and biological pathways relevant for kidney function.Nat. Commun.7:10023. 10.1038/ncomms10023

  • 45

    ProdjosudjadiW.Suhardjono SuwitraK.Pranawa WidianaI. G.LoekmanJ. S.et al (2009). Detection and prevention of chronic kidney disease in Indonesia: initial community screening.Nephrology14669–674. 10.1111/j.1440-1797.2009.01137.x

  • 46

    RhieS. K.CoetzeeS. G.NoushmehrH.YanC.KimJ. M.HaimanC. A.et al (2013). Comprehensive functional annotation of seventy-one breast cancer risk Loci.PLoS One8:e63925. 10.1371/journal.pone.0063925

  • 47

    RosenbloomK. R.ArmstrongJ.BarberG. P.CasperJ.ClawsonH.DiekhansM.et al (2015). The UCSC genome browser database: 2015 update.Nucleic Acids Res.43D670–D681. 10.1093/nar/gku1177

  • 48

    SherwoodM.McCulloughP. A. (2016). Chronic kidney disease from screening, detection, and awareness, to prevention.Lancet Glob. Health4e288–e289.

  • 49

    ShinS. Y.FaumanE. B.PetersenA. K.KrumsiekJ.SantosR.HuangJ.et al (2014). An atlas of genetic influences on human blood metabolites.Nat. Genet.46543–550. 10.1038/ng.2982

  • 50

    ShinoharaY. (2010). Significance of serum creatinine measurement in the specific screening test system for detection of chronic kidney disease.Nihon Hinyokika Gakkai Zasshi10125–28.

  • 51

    SinhaA.BaggaA. (2018). Screening urinalysis in detection of chronic kidney disease in children.Indian J. Pediatr.85603–604.

  • 52

    SpeirM. L.ZweigA. S.RosenbloomK. R.RaneyB. J.PatenB.NejadP.et al (2016). The UCSC genome browser database: 2016 update.Nucleic Acids Res.44D717–D725. 10.1093/nar/gkv1275

  • 53

    StaleyJ. R.BlackshawJ.KamatM. A.EllisS.SurendranP.SunB. B.et al (2016). PhenoScanner: a database of human genotype-phenotype associations.Bioinformatics323207–3209.

  • 54

    ThornS. L.GollobM. H.HarperM. E.BeanlandsR. S.DekempR. A.DasilvaJ. N. (2013). Chronic AMPK activity dysregulation produces myocardial insulin resistance in the human Arg302Gln-PRKAG2 glycogen storage disease mouse model.EJNMMI Res.3:48. 10.1186/2191-219X-3-48

  • 55

    TynerC.BarberG. P.CasperJ.ClawsonH.DiekhansM.EisenhartC.et al (2017). The UCSC genome browser database: 2017 update.Nucleic Acids Res.45D626–D634. 10.1093/nar/gkw1134

  • 56

    VerziM. P.ShinH.HeH. H.SulahianR.MeyerC. A.MontgomeryR. K.et al (2010). Differentiation-specific histone modifications reveal dynamic chromatin interactions and partners for the intestinal transcription factor CDX2.Dev. Cell19713–726. 10.1016/j.devcel.2010.10.006

  • 57

    WanS.DuanY.ZouQ. (2017). HPSLPred: an ensemble multi-label classifier for human protein subcellular location prediction with imbalanced source.Proteomics17:1700262. 10.1002/pmic.201700262

  • 58

    WanS.ZouQ. (2017). HAlign-II: efficient ultra-large multiple sequence alignment and phylogenetic tree reconstruction with distributed and parallel computing.Algorithms Mol. Biol.12:25. 10.1186/s13015-017-0116-x

  • 59

    WardL. D.KellisM. (2012). HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants.Nucleic Acids Res.40D930–D934. 10.1093/nar/gkr917

  • 60

    WardL. D.KellisM. (2016). HaploReg v4: systematic mining of putative causal variants, cell types, regulators and target genes for human complex traits and disease.Nucleic Acids Res.44D877–D881. 10.1093/nar/gkv1340

  • 61

    WeiL.TangJ.ZouQ. (2017a). SkipCPP-Pred: an improved and promising sequence-based predictor for predicting cell-penetrating peptides.BMC Genomics18(Suppl. 7):742. 10.1186/s12864-017-4128-1

  • 62

    WeiL.XingP.SuR.ShiG.MaZ. S.ZouQ. (2017b). CPPred-RF: a sequence-based predictor for identifying cell-penetrating peptides and their uptake efficiency.J. Proteome Res.162044–2053. 10.1021/acs.jproteome.7b00019

  • 63

    WuttkeM.KottgenA. (2016). Insights into kidney diseases from genome-wide association studies.Nat. Rev. Nephrol.12549–562. 10.1038/nrneph.2016.107

  • 64

    XuY.GrayA.HardieD. G.UzunA.ShawS.PadburyJ.et al (2017). A novel, de novo mutation in the PRKAG2 gene: infantile-onset phenotype and the signaling pathway involved.Am. J. Physiol. Heart. Circ. Physiol.313H283–H292. 10.1152/ajpheart.00813.2016

  • 65

    ZengX.LiuL.LuL.ZouQ. (2018). Prediction of potential disease-associated microRNAs using structural perturbation method.Bioinformatics342425–2432. 10.1093/bioinformatics/bty112

  • 66

    ZhangB. L.XuR. L.ZhangJ.ZhaoX. X.WuH.MaL. P.et al (2013). Identification and functional analysis of a novel PRKAG2 mutation responsible for Chinese PRKAG2 cardiac syndrome reveal an important role of non-CBS domains in regulating the AMPK pathway.J. Cardiol.62241–248. 10.1016/j.jjcc.2013.04.010

  • 67

    ZhangY.WangL.JiaH.LiaoM.ChenX.XuJ.et al (2018). Genetic variants regulate NR1H3 expression and contribute to multiple sclerosis risk.J. Neurol. Sci.390162–165. 10.1016/j.jns.2018.04.037

  • 68

    ZouQ.WanS.JuY.TangJ.ZengX. (2016). Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy.BMC Syst. Biol.10(Suppl. 4):114. 10.1186/s12918-016-0353-5

Summary

Keywords

chronic kidney disease, PRKAG2, genome-wide association study, eQTL, gene expression

Citation

Wang E, Zhao H, Zhao D, Li L and Du L (2018) Functional Prediction of Chronic Kidney Disease Susceptibility Gene PRKAG2 by Comprehensively Bioinformatics Analysis. Front. Genet. 9:573. doi: 10.3389/fgene.2018.00573

Received

15 September 2018

Accepted

08 November 2018

Published

03 December 2018

Volume

9 - 2018

Edited by

Quan Zou, Tianjin University, China

Reviewed by

Chong Wang, Harvard Medical School, United States; Sicheng Hao, Northeastern University, United States

Updates

Copyright

*Correspondence: Ermin Wang,

This article was submitted to Bioinformatics and Computational Biology, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics