Systematic review of gastric cancer-associated genetic variants, gene-based meta-analysis, and gene-level functional analysis to identify candidate genes for drug development

Lee, Sangjun; Yang, Han-Kwang; Lee, Hyuk-Joon; Park, Do Joong; Kong, Seong-Ho; Park, Sue K.

doi:10.3389/fgene.2022.928783

SYSTEMATIC REVIEW article

Front. Genet., 16 August 2022

Sec. Pharmacogenetics and Pharmacogenomics

Volume 13 - 2022 | https://doi.org/10.3389/fgene.2022.928783

Sangjun Lee ^1,2,3
Han-Kwang Yang ^4
Hyuk-Joon Lee ^4
Do Joong Park ^4
Seong-Ho Kong ^4
Sue K. Park ^1,2,5,*

1. Department of Preventive Medicine, Seoul National University College of Medicine, Seoul, South Korea
2. Cancer Research Institute, Seoul National University College of Medicine, Seoul, South Korea
3. Department of Biomedical Sciences, Seoul National University Graduate School, Seoul, South Korea
4. Department of Surgery and Cancer Research Institute, Seoul National University College of Medicine, Seoul, South Korea
5. Integrated Major in Innovative Medical Science, Seoul National University College of Medicine, Seoul, South Korea

Abstract

Objective: Despite being a powerful tool to identify novel variants, genome-wide association studies (GWAS) are not sufficient to explain the biological function of variants. In this study, we aimed to elucidate at the gene level the biological mechanisms involved in gastric cancer (GC) development and to identify candidate drug target genes.

Materials and methods: We conducted a systematic review for GWAS on GC following the PRISMA guidelines. Single nucleotide polymorphism (SNP)-level meta-analysis and gene-based analysis (GBA) were performed to identify SNPs and genes significantly associated with GC. Expression quantitative trait loci (eQTL), disease network, pathway enrichment, gene ontology, gene-drug, and chemical interaction analyses were conducted to elucidate the function of the genes identified by GBA.

Results: A review of GWAS on GC identified 226 SNPs located in 91 genes. In the comprehensive GBA, 44 genes associated with GC were identified, among which 12 genes (THBS3, GBAP1, KRTCAP2, TRIM46, HCN3, MUC1, DAP3, EFNA1, MTX1, PRKAA1, PSCA, and ABO) were eQTL genes. Using disease network and pathway analyses, we identified that PRKAA1, THBS3, and EFNA1 were significantly associated with the PI3K-Akt-mTOR-signaling pathway, which is involved in various oncogenic processes, and that MUC1 acts as a regulator in both the PI3K-Akt-mTOR and p53 signaling pathways. Furthermore, PRKAA1 had the highest number of interactions with drugs and chemicals.

Conclusion: Our study suggests that PRKAA1, a gene in the PI3K-Akt-mTOR-signaling pathway, could be a potential target gene for future drug development for GC.

Systematic Review Registration: website, identifier registration number.

1 Introduction

Gastric cancer (GC) had the fifth-highest incidence of all cancers worldwide in 2020, with 1,089,103 new cases (Sung et al., 2021). The incidence of GC varies widely by region and culture, with the highest incidence rates in Eastern Asia, Europe, and South America (Sung et al., 2021). In Eastern Asia, the average incidence of GC is 32.5 per 100,000 among males and 13.2 per 100,000 among females. In contrast, in North America, the incidence among males and females is 5.4 and 3.1 per 100,000, respectively. The lowest incidence is in Middle Africa, where only 4.6 per 100,000 males and 3.8 per 100,000 females are diagnosed annually (Sung et al., 2021).

The sequencing and bioinformatic advances in the past decade have permitted genome-wide association studies (GWAS) to become an innovative tool for identifying new single nucleotide polymorphisms (SNPs) or genes for cancer susceptibility (Wang et al., 2005). GWAS explore the associations between a large number of SNPs and traits such as major diseases, thereby investigating the entire genome with an unbiased approach (Manolio, 2010). Previous GWAS and meta-analyses have identified several genetic variants that are associated with GC susceptibility (Mocellin et al., 2015; Jin et al., 2020; Yan et al., 2020). However, no systematic reviews have evaluated the genetic factors associated with GC using gene-based meta-analyses or gene-network analyses.

Despite GWAS being powerful tools for the identification of novel variants associated with a certain trait, they may not capture the entire signal due to a lack of power, and their results may be biased due to population stratification or locus heterogeneity (Luo et al., 2010). In addition, as the identified variants may be non-pathogenic variants in linkage disequilibrium (LD) with the actual causal variants, follow-up studies are necessary to confirm the functional effects of the identified signal (Stadler et al., 2010).

Gene-based analysis (GBA) has recently been suggested as an approach to overcome the limitations of GWAS. By combining single variants obtained from individual GWAS, GBA improves statistical power, which allows it to detect regions that display allelic heterogeneity and to identify modest genetic effects (Liu et al., 2010; Huang et al., 2011). Another approach to overcome the limitations of GWAS is expression quantitative trait loci (eQTL) analysis. This method permits the functional interpretation of GWAS markers by linking them to changes in gene expression (Nica and Dermitzakis, 2013). Furthermore, pathway and Gene Ontology (GO) enrichment analyses can inform about the biological function of the identified variants at the gene level (Gene Ontology Consortium, 2015; Slenter et al., 2018). Finally, as genes associated with a specific disease can be pleiotropic, meaning that they can be associated with other diseases or phenotypes (Solovieff et al., 2013), disease interaction analysis has also been conducted to identify shared pathological pathways (Piñero et al., 2020). Using a combination of these approaches, studying genetic variants and their functions in disease can ultimately be used to identify novel drug targets or biomarkers (Wheeler et al., 2013).

The purpose of this study was to identify potential genes for drug development associated with GC based on a comprehensive understanding of the biological mechanisms of GC-associated genes by systematically reviewing published GWAS for GC and performing gene-level functional analyses, including drug/chemical interactions, through GBA.

2 Materials and methods

2.1 Literature search and selection criteria

We conducted a systematic review according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (Supplementary Table S1) (Page et al., 2021). The inclusion criteria based on the Population, Intervention, Comparison, Outcome, Study design (PICOS) model were as follows (Richardson et al., 1995): 1) Population: Human patients with gastric cancer; 2) Intervention (Exposure): Genetic variants (SNPs); 3) Comparison: Control group with unaffected risk alleles; 4) Outcome: Genotyping profiles of patients with gastric cancer; 5) Study design: case-control, GWAS. We excluded all studies that were not published in English or did not perform a GWAS analysis. In addition, studies in which the GWAS analysis was repeated in the same population were also excluded. The overall identification of eligible studies from the literature search is presented in the PRISMA 2020 flow diagram (Supplementary Figure S1) (Haddaway et al., 2022).

To retrieve potentially eligible studies from PubMed and Embase, combinations of search queries were used (Supplementary Table S2). The PubMed search was conducted using the “RISmed” R package (Kovalchik, 2014), whereas the Embase search was conducted online (https://www.embase.com/search/quick, accessed on 20 March 2021). In addition, a detailed search of several publicly available GWAS databases (registers) was conducted: the Human Genome Epidemiology (HuGE) Navigator (https://phgkb.cdc.gov/PHGKB/hNHome.action), the Genome-Wide Repository of Associations between SNPs and Phenotypes (GRASP) (https://grasp.nhlbi.nih.gov/Search.aspx), the National Human Genome Research Institute (NHGRI) GWAS Catalog (https://www.ebi.ac.uk/gwas/), and GWAS Central (https://www.gwascentral.org/). All GWAS databases (registers) were accessed on 20 March 2021. We only included articles for which the full text was published on or before 31 December 2020. The identification of all studies was performed by two independent researchers, who independently extracted data on the first author’s name, publication year, study design, location of the study, ethnicity of the participants, number of cases/controls, SNPs investigated, chromosome, candidate genes, genotyping platform used, cancer type/location, measure of association with the corresponding 95% confidence interval (CI), and p-value obtained from the combined sample sets.

The Risk Of Bias In Non-randomised Studies of Interventions (ROBINS-I) tool was used to assess the risk of bias (RoB) in non-randomized studies (Sterne et al., 2016). The instrument was developed to assess the internal validity of non-randomized studies by assessing the RoB within seven domains: 1) Confounding bias, 2) Bias in the selection of study participants, 3) Bias in classification of intervention, 4) Bias due to deviation from intended intervention, 5) Bias due to missing data, 6) Bias in measurement of outcome, and 7) Bias in the selection of reported results. The domain conclusion classified the overall body of evidence into “low”, “moderate”, “serious”, and “critical” categories. The results were also visualized using the “robvis” Shiny app (McGuinness and Higgins, 2021).

When the opinions of the two independent researchers differed, the four co-authors who are gastrointestinal surgery clinicians were consulted to resolve the dispute. We followed the principles proposed by the Human Genome Epidemiology Network (HuGeNet) for a systematic review of molecular association studies (Little et al., 2006).

2.2 Meta-analysis

A meta-analysis was performed to synthesize a total of 522 SNPs associated with GC that were included in 12 eligible studies as follows (Gurevitch et al., 2018):

1) The original values of the fixed-effect model were obtained when the reported SNPs in the individual study were validated several times.
2) The values of the fixed-effect model were estimated when external replication was possible because each SNP was reported only once in each different study.
3) The original odds ratios (ORs) (95% CI) or P-values for the single SNPs reported in the eligible studies were also obtained even though those of SNPs were excluded from the gene-based analysis.
4) When the same SNPs were reported from GC and its sub-types, the ORs (95% CI) or P-values reported for GC were used.
5) When the OR (95% CI) or P-value for the same SNP was estimated from multiple subtypes, the one with a lower P-value was used for the meta-analysis.
6) The OR (95% CI) and P-value were calculated based on the random-effect model. However, the values of the random-effect model could not be estimated when the OR was reported only once or when only the OR or the P-value was presented for each SNP.

ORs were calculated for each study and polymorphism assuming an additive genetic model. Heterogeneity in the meta-analysis was evaluated using I² statistics (Higgins and Thompson, 2002). All statistical analyses were performed using the R software (version 4.1.0).
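The pooling and heterogeneity steps described above can be illustrated with a minimal Python sketch. It is not the authors' R code: the function name `pool_odds_ratios` is hypothetical, the standard errors are back-calculated from the reported 95% CIs, and the DerSimonian-Laird estimator for the between-study variance is an assumption, since the text does not state which random-effect estimator was used.

```python
import math

Z_975 = 1.959964  # 97.5th percentile of the standard normal distribution


def pool_odds_ratios(odds_ratios, ci_lowers, ci_uppers):
    """Inverse-variance pooling of per-study ORs for one SNP (hypothetical helper).

    Returns the fixed-effect OR, the random-effect OR (DerSimonian-Laird,
    an assumed estimator) and the I^2 statistic in percent.
    """
    if len(odds_ratios) < 2:
        raise ValueError("at least two estimates are needed to pool")

    # Work on the log-OR scale; recover each SE from the width of the 95% CI.
    y = [math.log(o) for o in odds_ratios]
    se = [(math.log(u) - math.log(l)) / (2 * Z_975)
          for l, u in zip(ci_lowers, ci_uppers)]

    # Fixed-effect estimate: weights are inverse variances.
    w = [1.0 / s ** 2 for s in se]
    fixed = sum(wi * yi for wi, yi in zip(w, y)) / sum(w)

    # Cochran's Q and I^2 = max(0, (Q - df) / Q).
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, y))
    df = len(y) - 1
    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0

    # Between-study variance tau^2 (DerSimonian-Laird), then random-effect weights.
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0
    w_re = [1.0 / (s ** 2 + tau2) for s in se]
    random_effect = sum(wi * yi for wi, yi in zip(w_re, y)) / sum(w_re)

    return {
        "or_fixed": math.exp(fixed),
        "or_random": math.exp(random_effect),
        "i2_percent": i2,
    }
```

The function refuses a single estimate, which mirrors the statement above that random-effect values could not be obtained for SNPs reported only once.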

2.3 Gene-based analysis: Burden test

Gene-level association tests in the random-effects model were performed after weighting by minor-allele frequencies (MAFs) (Morgenthaler and Thilly, 2007). We combined information across several variants in a target region and then performed a burden test based on the single-study or meta-analysis result for each SNP, with the LD structure taken from the 1000 Genomes reference panel (Phase 3, East Asian). The burden test results were converted to gene-level estimates of effect sizes (betas) and their standard errors (Svishcheva et al., 2015). When multiple SNPs were in high LD (R² > 0.9) in the same gene region, the burden test was performed with the SNPs remaining after LD clumping. A Bonferroni correction for multiple testing was applied to account for the total number of genes tested (approximately 20,000 genes). Significant gene-level associations in the burden test were those with a p-value < 2.5 × 10^–6 after correcting for multiple testing.
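The significance threshold follows directly from the Bonferroni correction. The short Python sketch below reproduces it under the assumption of a family-wise error rate of 0.05, which the text does not state explicitly but which is implied by 0.05 / 20,000 = 2.5 × 10^–6. The function name is hypothetical, and the three example p-values are taken from Table 1.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# Assumed family-wise alpha of 0.05 and roughly 20,000 genes tested.
threshold = bonferroni_threshold(0.05, 20_000)

# Gene-level p-values as reported in Table 1 (three illustrative rows).
gene_p_values = {"PRKAA1": 4.83e-26, "GPX3": 1.49e-06, "RPL7P10": 5.76e-04}

# Keep genes whose p-value falls below the corrected threshold.
significant_genes = sorted(
    gene for gene, p in gene_p_values.items() if p < threshold
)
```

With these inputs, PRKAA1 and GPX3 pass the threshold and RPL7P10 does not.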

2.4 Functional annotation analysis

2.4.1 eQTL analysis

Overlapping eQTL analysis was performed to identify SNPs affecting a regulatory element controlling gene expression (Nica and Dermitzakis, 2013). The eQTL were identified based on the eQTLGen consortium, which is a large-scale multi-study effort to identify the downstream effects of trait-related variants via their effects on gene expression in whole blood (Võsa et al., 2018). eQTL analysis was also conducted based on the Genotype-Tissue Expression (GTEx) project, which aims to study tissue-specific gene expression and regulation (Carithers and Moore, 2015). We used individual-level data in stomach tissue from GTEx (v8) to construct the co-expression matrix and further validate the gene sets reported by eQTLGen.

We performed eQTL analysis based on SNP statistics (p-value) from the meta-analysis and burden test using Functional Mapping and Annotation of Genome-Wide Association Studies (FUMA) online software (https://fuma.ctglab.nl/) (Watanabe et al., 2017). Significant eQTL with a False Discovery Rate (FDR) ≤ 0.05 were selected for further analysis. Gene annotation was performed based on the Genome Reference Consortium Human Genome Build 37 or hg19 reference assembly.
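As an illustration of what an FDR cut-off of 0.05 means in practice, the following Python sketch computes Benjamini-Hochberg adjusted p-values. This is an assumption made for illustration only: the text does not specify which FDR procedure FUMA or the underlying eQTL resources applied, and the function name is hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def select_significant(p_values, fdr=0.05):
    """Indices of tests whose adjusted p-value is at or below the FDR level."""
    adjusted = benjamini_hochberg(p_values)
    return [i for i, q in enumerate(adjusted) if q <= fdr]
```

A SNP-gene pair would be retained when its adjusted p-value is at or below 0.05, matching the "FDR ≤ 0.05" criterion used in this section.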

2.4.2 Disease network analysis

Disease network analysis was performed with DisGeNET to identify candidate genes for GC among the genes from the burden test (Piñero et al., 2020). DisGeNET is a discovery platform that contains one of the largest publicly available collections of genes and variants associated with human diseases. An FDR-corrected p-value of <0.05 was used to identify significant disease networks.

2.4.3 Pathway analysis

To identify pathways associated with GC, we used statistical results from the WikiPathways Human Collection (http://wikipathways.org) (Slenter et al., 2018) and the Network Data Exchange (NDEx) (https://www.ndexbio.org/) (Pratt et al., 2015). WikiPathways is a collaborative open database of curated biological pathways. In addition, the NDEx database provides access to not only pathways but also diverse types of network models, offering digital object identifier (DOI) minting for citation. Pathways with an FDR <0.05, including at least one altered gene, were considered significant.

2.4.4 GO analysis

We performed GO analysis to annotate genes to known functional information sources (Gene Ontology Consortium, 2015), including biological process (BP), cellular component (CC), and molecular function (MF), using the “clusterProfiler” R package (Yu et al., 2012). We submitted the genes that were significant in the burden test, and GO terms with an FDR <0.05 were considered significant.

2.4.5 Gene-drug interaction analysis

We studied gene-drug interactions using the DrugBank database (https://go.drugbank.com/) and the DGIdb database (http://www.dgidb.org/) (Wishart et al., 2018; Freshour et al., 2021). DrugBank is a drug-centric online database that provides detailed information about over 500,000 drugs and their target genes. DGIdb comprises drug-gene interaction information of more than 40,000 genes and 10,000 drugs from 15 different resources and allows filtering at different levels. Only gene-drug interactions in which the drug was found in two or more references or databases were selected.

2.4.6 Gene-chemical interaction analysis

The Comparative Toxicogenomics Database (CTD) (http://ctdbase.org/) was employed to construct a gene-chemical interaction network (Davis et al., 2021). The CTD includes toxicological information for over 16,000 chemicals and 50,000 genes. Only gene-chemical interactions with two or more references were selected.

2.4.7 Protein-protein interaction analysis

The Search Tool for the Retrieval of Interacting Genes/Proteins (STRING; http://string.embl.de/) is a biological database designed to construct protein-protein interaction (PPI) networks by analyzing the functional interactions between proteins (Szklarczyk et al., 2021). Using STRING, PPI networks were constructed with a confidence score ≥0.99 (Asadzadeh-Aghdaee et al., 2016). Subsequently, the PPI network was visualized using the Cytoscape software (version 3.8.2) (Shannon et al., 2003) via RCy3 (Gustavsen et al., 2019).

3 Results

3.1 Flow of study selection

We identified 3,251 and 90 eligible studies through PubMed and Embase, respectively (Supplementary Figure S1). In addition, 906, 123, 46, and 14 eligible studies were identified through the HuGE Navigator, GRASP, GWAS Catalog, and GWAS Central, respectively (Supplementary Figure S1). Subsequently, 230 duplicated studies and two studies that were written in other languages were removed. After title and abstract screening, 348 full-text articles were assessed for further eligibility. A total of 333 studies were excluded for not conducting GWAS, and three full-text articles were excluded for repeating the analyses in the same population. The remaining 12 GWAS for GC, comprising 522 SNPs including duplicates, were included in the meta-analysis (Supplementary Figures S1, S2) (Sakamoto et al., 2008; Abnet et al., 2010; Shi et al., 2011; Jin et al., 2012; Tanikawa et al., 2012; Helgason et al., 2015; Hu et al., 2016; Wang et al., 2017; Tanikawa et al., 2018; Park et al., 2019; Du et al., 2020; Rashkin et al., 2020). Among the selected studies, ten were conducted in Asia (China, Japan, Korea, and Singapore), and two were performed in Europe and North America. The studies were published between 2008 and 2020. The present study was approved by the respective institutional ethics review committee, and informed consent was obtained from all participants.

3.2 Study characteristics and risk of bias within the studies

Of the 12 studies, ten focused on Asian populations (Korea, China, Japan, Singapore) (Sakamoto et al., 2008; Abnet et al., 2010; Shi et al., 2011; Jin et al., 2012; Tanikawa et al., 2012; Hu et al., 2016; Wang et al., 2017; Tanikawa et al., 2018; Park et al., 2019; Du et al., 2020), one was conducted in Europe (Helgason et al., 2015), and one in the United States/United Kingdom (Rashkin et al., 2020). All 12 studies adjusted for age and sex or for additional covariates such as principal components. Three studies presented results for the diffuse and intestinal subtypes (Sakamoto et al., 2008; Tanikawa et al., 2012; Tanikawa et al., 2018), while five studies presented results for the cardia or non-cardia subtypes (Abnet et al., 2010; Shi et al., 2011; Jin et al., 2012; Hu et al., 2016; Wang et al., 2017). Furthermore, two studies presented findings associated with adenocarcinoma (Abnet et al., 2010; Helgason et al., 2015). The studies without subtype analysis reported results for GC overall.

The RoB assessment for the observational studies is shown in Supplementary Table S3 and Supplementary Figure S3. Based on the ROBINS-I tool, four studies were rated as “low risk”, six as “moderate risk”, and two as “serious risk”. In one of the two serious-risk studies, the p-values of the SNPs reported prior to data synthesis were not genome-wide significant (5 × 10^–8) (Sakamoto et al., 2008), and in the other, some of the SNP annotations and effect sizes were not presented (Jin et al., 2012). These limitations may reflect the fact that both were early GWAS. GWAS generally adjust for age and sex, and principal components are additionally adjusted for when the population is heterogeneous (McCaw et al., 2022). However, because the validation analyses in our eligible studies were performed within the same ethnicity, bias due to confounding appears to be small. According to our assessment of the certainty of the evidence, the body of evidence supporting an association between SNPs and elevated risk of GC had a “moderate degree of evidence”.

3.3 Major genes associated with GC: Meta-and gene-based analyses

A total of 522 SNPs were identified from the eligible studies based on the literature search (Supplementary Table S4). The 522 SNPs were located in upstream (n = 5), downstream (n = 9), intronic (n = 207), exonic (n = 28), noncoding RNA (ncRNA) intronic (n = 10), ncRNA exonic (n = 8), 5′-UTR (n = 15), 3′-UTR (n = 38), and intergenic (n = 202) regions (Figure 1; Supplementary Table S4). Some of the SNPs were associated specifically with histological subtypes (intestinal; n = 12, diffuse; n = 24), site (cardia; n = 36, non-cardia; n = 98), onset age (early; n = 6, late; n = 6), and pathological subtype (adenocarcinoma; n = 17) (Figure 1; Supplementary Table S4). Of the 522 SNPs, 296 that were reported in multiple studies or that overlapped with subtype results were excluded (Supplementary Table S5). Therefore, a total of 226 SNPs remained in the meta-analysis based on both the fixed- and random-effect models (Supplementary Table S6). I² values of 25%–49%, 50%–74%, and 75% or more are commonly taken to indicate low, intermediate, and high heterogeneity, respectively (Higgins et al., 2003). Among the 226 SNPs, 41 had no heterogeneity, whereas 24, 71, and 53 SNPs had low, intermediate, and high heterogeneity, respectively. The heterogeneity of the remaining 37 SNPs could not be evaluated because they had not been validated (Supplementary Table S6).

FIGURE 1

Since a gene’s effect size is estimated from the effect sizes of several SNPs located in the gene, 59 intergenic SNPs were excluded from the remaining 226 SNPs before the gene-based analysis. Therefore, 167 SNPs located in 91 genes were retained as candidates for gene-based analysis (Supplementary Figure S2). Among the 91 genes, 44 genes were included in the burden test after excluding genes that were specifically associated with a subgroup of GC or with a non-Asian population, genes whose effect size was estimated from a single distinct SNP, and genes without an Entrez ID (Supplementary Figure S2). Of the 44 genes, effect sizes for six genes and 38 genes were estimated by the burden test and the meta-analysis, respectively (Table 1). After correcting for multiple testing, 25 genes had significant gene-level associations with GC (p-value < 2.5 × 10^–6).

TABLE 1

Gene	Chr	OR (95% CI)	p-value	Method
KRTCAP2	1q22	0.65 (0.60–0.71)	8.44E-24	Burden Test
MUC1	1q22	0.76 (0.71–0.81)	1.99E-14	Meta
MTX1	1q22	0.71 (0.63–0.80)	1.70E-08	Burden Test
GBAP1	1q22	0.48 (0.42–0.55)	1.03E-27	Burden Test
EFNA1	1q22	0.82 (0.77–0.87)	1.46E-10	Meta
TRIM46	1q22	0.71 (0.66–0.76)	1.65E-21	Burden Test
THBS3	1q22	0.51 (0.45–0.57)	6.39E-29	Burden Test
HCN3	1q22	0.83 (0.80–0.87)	2.65E-17	Meta
DAP3P1	1q22	0.64 (0.56–0.74)	1.56E-10	Meta
MSTO1	1q22	0.67 (0.60–0.75)	1.60E-12	Meta
DAP3	1q22	0.65 (0.58–0.73)	4.23E-13	Meta
GON4L	1q22	0.63 (0.55–0.72)	4.47E-11	Meta
RPL7P10	1p31.1	1.25 (1.10–1.42)	5.76E-04	Meta
SERINC2	1p35.2	1.01 (0.96–1.06)	8.05E-01	Meta
RMDN2	2p22.2	0.67 (0.58–0.78)	3.24E-07	Meta
LYPD6	2q23.2	0.84 (0.77–0.91)	4.27E-05	Meta
BAZ2B	2q24.2	1.29 (1.14–1.45)	2.59E-05	Meta
BZW1	2q33.1	0.97 (0.93–1.01)	1.56E-01	Meta
UMPS	3q21.2	1.03 (0.99–1.08)	2.00E-01	Meta
ITGB5	3q21.2	0.95 (0.91–0.99)	2.92E-02	Meta
TRIML1	4q35.2	0.73 (0.64–0.84)	1.68E-05	Meta
RAB3C	5q11.2	1.07 (1.03–1.13)	2.46E-03	Meta
PRKAA1	5p13.1	0.80 (0.77–0.83)	4.83E-26	Meta
GPX3	5q33.1	0.92 (0.89–0.95)	1.49E-06	Meta
LINC01411	5q35.2	1.05 (1.01–1.10)	2.65E-02	Meta
UNC5CL	6p21.1	1.14 (1.09–1.20)	7.22E-08	Meta
SAMD5	6q24.3	0.81 (0.75–0.88)	5.62E-07	Meta
PSCA	8q24.3	0.75 (0.72–0.78)	8.20E-56	Meta
ABO	9q34.11	1.15 (1.11–1.19)	2.64E-13	Meta
PLCE1	10q23.33	3.60 (2.46–5.26)	3.51E-11	Burden Test
LOC101928477	11q22.2	0.95 (0.84–1.08)	4.29E-01	Meta
DYNC2H1	11q22.3	0.95 (0.91–1.00)	3.30E-02	Meta
OPCML	11q25	1.19 (1.11–1.28)	1.60E-06	Meta
CCDC63	12q24.11	0.92 (0.89–0.96)	1.80E-05	Meta
CUX2	12q24.12	0.91 (0.88–0.94)	3.20E-08	Meta
DTX1	12q24.13	1.15 (1.09–1.21)	1.20E-07	Meta
GPC5	13q31.3	1.07 (1.02–1.11)	2.31E-03	Meta
UBAC2	13q32.3	1.07 (1.03–1.11)	2.71E-04	Meta
FMN1	15q13.3	1.11 (1.05–1.17)	4.53E-04	Meta
TRPM1	15q13.3	0.61 (0.50–0.74)	5.87E-07	Meta
RORA	15q22.2	1.09 (1.04–1.14)	1.83E-04	Meta
SNX29	16p13.13	0.92 (0.88–0.96)	1.83E-04	Meta
HAO1	20p12.3	0.92 (0.87–0.96)	7.20E-04	Meta
DEFB121	20q11.21	1.11 (1.07–1.15)	8.11E-10	Meta

The results of gene-based analysis for gastric cancer.

OR, odds ratio; CI, confidence interval; BT, burden test.

3.4 Functional annotation analysis

eQTL analysis was performed based on 226 SNPs with statistics (p-value) after the meta-analysis. The eQTL analysis results were represented in SNP-gene pairs since the SNPs have a role in gene expression regulation. Furthermore, since one SNP can affect the expression level of multiple genes, the results of eQTL analysis were calculated in pairs.

Forty-seven of the 226 SNPs in the meta-analysis were identified to regulate the expression of 12 genes (THBS3, GBAP1, KRTCAP2, TRIM46, HCN3, MUC1, DAP3, EFNA1, MTX1, PRKAA1, PSCA, and ABO) out of the 25 genes that were significant in the burden tests, resulting in a total of 175 SNP-gene pairs (Supplementary Table S7). In PRKAA1, PSCA, and ABO, the SNPs located in the corresponding gene regulated the expression of their respective genes (Supplementary Figure S4). Three pairs (three SNPs-one gene) for PRKAA1, 26 pairs (26 SNPs-one gene) for PSCA, and one pair (one SNP-one gene) for ABO were estimated (Supplementary Figure S4). However, the expression levels of nine genes (THBS3, GBAP1, KRTCAP2, TRIM46, HCN3, MUC1, DAP3, EFNA1, and MTX1) on chromosome 1 were regulated by 17 SNPs located nearby (Supplementary Figure S4; Supplementary Table S7). Among the 17 SNPs, 13 (rs1057941, rs12752585, rs2049805, rs28445596, rs2974929, rs2990220, rs4276914, rs4971059, rs4971085, rs4971088, rs4971100, rs4971101, and rs7556304) regulated the expression of all nine genes, yielding a total of 117 pairs. Three SNPs (rs2066981, rs3814316, and rs4971093) regulated the expression of eight genes, not including EFNA1, yielding 24 pairs. Finally, rs4971066 regulated the expression level of only four genes (GBAP1, THBS3, MTX1, and MUC1), establishing four pairs. In total, eQTL analysis yielded 175 SNP-gene pairs between 47 SNPs and 12 genes.
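The pair counts in this paragraph can be checked with simple arithmetic. The Python sketch below tallies them from the numbers given in the text, assuming that the SNP groups are disjoint (no SNP is counted in more than one group); the dictionary labels are descriptive only.

```python
# (number of SNPs, number of genes regulated by each SNP) per group in the text.
groups = {
    "chr1, all nine genes": (13, 9),
    "chr1, eight genes (no EFNA1)": (3, 8),
    "chr1, rs4971066 (four genes)": (1, 4),
    "PRKAA1": (3, 1),
    "PSCA": (26, 1),
    "ABO": (1, 1),
}

# Each SNP contributes one SNP-gene pair per gene it regulates.
pairs_per_group = {name: n_snps * n_genes
                   for name, (n_snps, n_genes) in groups.items()}

total_pairs = sum(pairs_per_group.values())
total_snps = sum(n_snps for n_snps, _ in groups.values())
chr1_snps = sum(n_snps for name, (n_snps, _) in groups.items()
                if name.startswith("chr1"))
```

The chromosome 1 groups contribute 117 + 24 + 4 = 145 pairs from 17 SNPs, and PRKAA1, PSCA, and ABO add 3 + 26 + 1 = 30 pairs, giving the reported 175 pairs across 47 SNPs.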

Cis-eQTL identification based on the eQTLGen database yielded a total of 120 significant (FDR ≤0.05) pairs between 21 SNPs and 11 genes (THBS3, GBAP1, KRTCAP2, TRIM46, HCN3, MUC1, DAP3, EFNA1, MTX1, PRKAA1, and ABO). In contrast, no trans-eQTLs were found. eQTLs are genetic variants involved in regulating gene expression (Võsa et al., 2021) and are divided into cis-eQTLs and trans-eQTLs. SNPs that regulate the expression of a nearby gene (<1 megabase; Mb) with local effects are called cis-eQTLs, whereas SNPs located distally (>5 Mb) from a gene or on a different chromosome, with remote effects, are called trans-eQTLs (Westra and Franke, 2014). Because cis-eQTLs generally have large effect sizes (Huang da et al., 2009), even a moderate sample size enables the detection of cis-eQTLs for thousands of genes (Westra et al., 2013). In addition, cis-eQTLs have a direct effect on gene expression due to their proximity to the transcription start site (TSS) (Stranger et al., 2012). On the other hand, since the effect size of trans-eQTLs is generally small, a larger sample size is required to detect them (Grundberg et al., 2012). Moreover, because these effects are difficult to estimate, validated reports of effect sizes for trans-eQTLs are hard to find (Westra et al., 2013). Nevertheless, since a trans-eQTL can affect multiple genes with small effect sizes and can have a wide range of effects in biological networks, it can be highly associated with cross-phenotype effects (Brynedal et al., 2014; Westra and Franke, 2014).

Based on the GTEx-stomach database, 55 pairs between 42 SNPs and three genes (THBS3, GBAP1, and PSCA) were significant (FDR ≤0.05) (Supplementary Table S7). The results for THBS3 and GBAP1 were validated on both the eQTLGen and GTEx-stomach databases.

Of the 12 eQTL genes, 10 were associated with a total of 28 diseases according to the disease network analysis (Figure 2; Supplementary Figure S5). Among these diseases, 11 were associated with GC (Helicobacter pylori infections, infection caused by Helicobacter pylori, atrophic gastritis, duodenal ulcer, preneoplastic conditions, intestinal metaplasia, precancerous lesions, hereditary diffuse gastric cancer, malignant neoplasm of gastrointestinal tract, gastric adenocarcinoma, and precancerous conditions). In addition, three biomarkers of chronic kidney disease were identified (blood urea nitrogen, glomerular filtration rate, and uric acid). Five diseases associated with uric acid or inflammation (Gaucher disease, tarsal-carpal coalition syndrome, tuberous sclerosis, psoriatic arthritis, and inflammation) and two viral diseases were selected (Rubella and Epstein-Barr virus infection), and hemoglobin and hematocrit were also found in the disease network. Additionally, five other diseases associated with eQTL genes were identified (Supplementary Figure S5).

FIGURE 2

A total of 18 pathways were significantly associated with 12 eQTL genes in WikiPathways (Figure 2; Supplementary Figure S5). Among these pathways, the PI3K-Akt-mTOR-signaling pathway containing three eQTL genes (THBS3, EFNA1, and PRKAA1) was the most significant (FDR = 5.99 × 10^–4). PRKAA1 was associated with 13 pathways and was the gene that showed the strongest association among the 12 eQTL genes (FDR <0.05). According to the NDEx database, MUC1 was identified as a regulator of the PI3K-Akt-mTOR and p53 signaling pathways. Moreover, RAS drives the PI3K-Akt-mTOR-signaling pathway via PRKAA1 (adenosine monophosphate-activated protein kinase, AMPK) through the ERK pathway, and TP53 also interacts with PRKAA1 (AMPK). In addition, PPI network analysis showed that MUC1 interacted with EGFR (HER2-receptor), CTNNB1 (β-catenin), Src, and ICAM-1 (intercellular adhesion molecule-1).

GO annotation analysis revealed the genetic signal was enriched in 9 MF and 24 BP terms (Supplementary Figure S5). The most significant MF and BP terms were glycoprotein-fucosylgalactoside alpha-N-acetylgalactosaminyl-transferase activity and positive regulation of peptidyl-lysine acetylation, respectively. In addition, glycosylation and AMPK-associated functions or processes were identified. Of the 12 eQTL genes, MUC1 and PRKAA1 were the ones that were most annotated to the enriched GO terms.

Gene-drug interactions were identified only for MUC1 and PRKAA1. PRKAA1 interacted with phenformin, metformin, hesperadin, sirolimus, streptozocin, thyroxine, pentostatin, saponarin, fluorinated N,N′-diarylureas, acetylsalicylic acid, and adenosine monophosphate. MUC1 interacted with Huhmfg1 and GO-203-2C (Figure 2).

Based on the CTD database, 66 gene-chemical interactions between 44 chemicals and ten genes were identified (Figure 2; Supplementary Figure S6). Of the ten genes, PRKAA1 and EFNA1 interacted with the largest number of chemicals (17). Oestradiol was reported six times to decrease the expression of EFNA1. AICA ribonucleotide and metformin were reported five times to increase the phosphorylation of PRKAA1. Furthermore, the expression of MUC1 was reported a total of four times to be increased by the action of aflatoxin B1, oxygen, and valproic acid.

4 Discussion

In this review, we described the most reported genetic loci that are associated with the increased risk of GC from the available GWAS and conducted meta-analyses and GBA of the genetic variants with available genotypes. Comprehensive meta-analysis and GBA of genetic variants identified 25 significant genes for GC susceptibility. Among the 25 genes, 12 genes (THBS3, GBAP1, KRTCAP2, TRIM46, HCN3, MUC1, DAP3, EFNA1, MTX1, PRKAA1, PSCA, and ABO) were significant at the gene expression level according to eQTL analysis. To understand the function of these 12 genes, disease network analysis, biological pathway and GO enrichment analysis, and gene-drug and chemical interaction analyses were conducted.

PSCA encodes a glycosylphosphatidylinositol-anchored cell membrane glycoprotein. In addition to being highly expressed in the prostate, it is also expressed in the bladder, placenta, colon, kidney, and stomach. PSCA is the genetic locus most significantly associated with the risk of H. pylori-induced GC in the Japanese population, and the same holds for the European population (Rizzato et al., 2013). Moreover, in H. pylori-infected gastric mucosal tissue, PSCA expression was found to be remarkably suppressed compared to that in normal, non-H. pylori-infected gastric mucosal tissue (Toyoshima et al., 2018). This can lead to a reduced risk of GC or an increased risk of duodenal ulcers (Tanikawa et al., 2012).

PRKAA1 belongs to the serine/threonine-protein kinase family. It is the catalytic subunit of AMPK, a cellular energy sensor conserved in all eukaryotic cells (Krishan et al., 2014). PRKAA1 is mainly involved in the PI3K-Akt-mTOR signaling pathway via AMPK. The PI3K-Akt-mTOR signaling pathway is a transduction hub linked to various biological pathways and mechanisms associated with carcinogenesis (Figure 3). AMPK negatively modulates mTOR, which plays an important role in regulating cellular energy homeostasis by regulating cellular processes such as protein synthesis and autophagy. mTOR signaling positively regulates cell proliferation and tumorigenesis in various cancers and is often aberrantly activated in cancer. In addition, the PI3K-Akt-mTOR signaling pathway can influence glycosylation through nuclear factor kappa B (NF-κB), a protein complex that functions as a signal-induced transcription factor regulating proliferation and apoptosis (Magaway et al., 2019; Cho et al., 2021).

FIGURE 3

The role of PRKAA1 was also confirmed by the GO annotation analysis results (Supplementary Figure S5). PRKAA1 was shown to affect the molecular function of AMPK and to be involved in biological pathways including glycolysis, PI3K-Akt-mTOR signaling, autophagy, cell cycle, and cell differentiation (Supplementary Figure S5). This confirms that PRKAA1 directly affects the molecular function of AMPK and has a role in its related pathways (Figure 3). In addition, PRKAA1 is associated with the glucosylceramide process and mitochondrial regulation among biological processes. MUC1, a mucin linked to the AMPK pathway, seems to be more involved in biological processes through the regulation of protein acetylation, integrin-mediated cell adhesion, and glycosylation (Figure 3; Supplementary Figure S5).

Regarding gene-chemical interactions, 5-aminoimidazole-4-carboxamide (AICA) ribonucleotide is one of the chemicals most frequently reported to phosphorylate PRKAA1 (Supplementary Figure S6) and is widely used as a pharmacological modulator of AMPK activity (Višnjić et al., 2021). In a previous experimental study aimed at developing a chemotherapy sensitizer for GC, AICA ribonucleotide alone was also shown to induce apoptosis in GC cells (Wu et al., 2016).

Metformin, both as a chemical and as a drug, had the highest association with PRKAA1 (Figure 2; Supplementary Figure S5), and gene-chemical interaction results revealed that metformin phosphorylates PRKAA1. Metformin is one of the most widely used anti-hyperglycemic drugs for the management of type 2 diabetes. Experimental studies strongly suggest that metformin also possesses anticancer activity mediated through the modulation of several cellular signaling pathways, including AMPK activation and other mechanisms. In addition, metformin use has been associated with a reduced GC risk, where an increasing metformin dose was correlated with a lower GC risk (Kim et al., 2014; Cheung et al., 2019). Regarding gene-drug interactions, phenformin scored four, coming after metformin, which scored five (the score reflects the number of reports in previous studies). It has been reported that both phenformin and metformin can inhibit cell growth through inhibition of cell proliferation, promotion of apoptosis, and cell cycle disturbances (Wang et al., 2018).

EFNA1 is a growth factor that induces cell proliferation, differentiation, and survival by binding to receptor tyrosine kinase (RTK) in the cell membrane to generate Ras-GTP, which activates the mitogen-activated protein kinase (MAPK) pathway in the cytoplasm (Haglund et al., 2007). When extracellular signal-regulated kinase (ERK), an important factor in the MAPK pathway, is activated, the transcription of several genes is activated, thereby resulting in cell growth. RAS mutations lead to sustained activation of the ERK pathway, which leads to cancer development (Mitra et al., 1993; Seger and Krebs, 1995). In addition, ERK is linked to the PI3K-Akt-mTOR signaling pathway by activating AMPK (Figure 3).

Thrombospondin 3 (THBS3) is an extracellular glycoprotein that mediates cell-to-matrix and cell-to-cell interactions (Mosher and Adams, 2012). THBS3 activates the PI3K-Akt-mTOR signaling pathway via protein kinase B (PKB).

MUC1 is a single-pass type I transmembrane protein with a heavily glycosylated extracellular domain (Hattrup and Gendler, 2008; Nath and Mukherjee, 2014). MUC1 has been reported to act as an anti-inflammatory molecule in gastric mucosal cells. The anti-inflammatory properties of MUC1 have also been observed in gastric mucosal cell responses to H. pylori infection (Guang et al., 2010). In addition, MUC1 inhibits cell proliferation and regulates the PI3K-Akt-mTOR signaling pathway through a β-catenin-dependent mechanism (Lillehoj et al., 2007).

H. pylori induces ICAM-1 and CD11b (integrin) expression, causing degranulation and eosinophil cationic protein (ECP) release (Chmiela et al., 2018). Activation of ICAM-1 by CD11b results in the release of reactive oxygen species, which stimulate NF-κB. In addition, an interaction between the H. pylori virulence factor CagA and the receptor c-Met has been found (Eom et al., 2016). By binding to the c-Met receptor, CagA activates RAS and thereby stimulates the MAPK/ERK pathway and the PI3K-Akt-mTOR signaling pathway (Churin et al., 2003; Suzuki et al., 2009). Moreover, binding between c-Met and hepatocyte growth factor (HGF) also stimulates the MAPK/ERK, PI3K-Akt-mTOR signaling, and JAK/STAT pathways (Jang et al., 2020). These mechanisms converge in inducing cell proliferation, pro-inflammatory response, and cell motility, which are involved in tumor development and progression (Figure 3) (Churin et al., 2003; Suzuki et al., 2009; Bradley et al., 2017).

Our study has some limitations. First, because only published GWAS were selected, we cannot rule out publication bias. In general, GWAS with significant associations are more likely to be published than studies with null associations (Stadler et al., 2010). Second, the SNPs generally reported in GWAS are the lead SNPs, that is, those with the most significant p-value after LD clumping and Bonferroni correction for multiple testing. However, since lead SNPs are not always causal SNPs, fine-mapping analysis is necessary to investigate the region around the lead SNP for other potential causal SNPs (Farh et al., 2015). The GWAS included in our study did not perform such follow-up analyses. Third, heterogeneity among the GWAS in terms of population origin, phenotype definition, genotyping platform, and software used can lead to biased results. Since the original data used in the individual GWAS were not available, accounting for these sources of variability in the analysis was difficult. Furthermore, the results of GWAS for GC in Caucasians were not included in the GBA due to an insufficient number of SNPs or genes. In the future, the genetic burden of GC in Caucasians and the differences among ethnicities need to be further explored. Lastly, although the meta-analysis results based on the fixed- and random-effect models presented similar estimates for most SNPs, some of the SNPs that were reported only twice in previous studies yielded different estimates. Similarly, SNPs whose estimates differed between the fixed- and random-effect models had high heterogeneity. Given that the random-effect model weights studies more evenly than the fixed-effect model (Hedges and Vevea, 1998), it is possible that the estimates of less-reported SNPs are more unstable. In addition, the p-value threshold of GWAS is generally 5 × 10^–8 in discovery and less than 0.05 in validation (Risch and Merikangas, 1996; Oetting et al., 2017). Even if a SNP was validated multiple times in a single study, the estimated heterogeneity can be high because the p-value threshold in validation analysis is lenient. Thus, some SNPs still require a larger number of external validations before a stable effect size for their association with GC can be estimated.
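
The contrast between the two pooling models discussed in this limitation can be illustrated with a minimal sketch. It assumes inverse-variance weighting for the fixed-effect model and the DerSimonian-Laird estimator of between-study variance for the random-effect model; the estimator choice, the function name `pool_effects`, and the two-study log odds ratios and standard errors are illustrative assumptions, not methods or values taken from the included GWAS.

```python
import math


def pool_effects(log_ors, ses):
    """Pool per-study log odds ratios under fixed- and random-effect models.

    Hypothetical helper: the random-effect part uses the DerSimonian-Laird
    estimator, which is an assumption made for this sketch.
    """
    if len(log_ors) != len(ses) or len(log_ors) < 2:
        raise ValueError("need at least two studies with matching lengths")

    # Fixed-effect model: weights are the inverse of each study's variance.
    w_fixed = [1.0 / se ** 2 for se in ses]
    sum_w = sum(w_fixed)
    fixed = sum(w * y for w, y in zip(w_fixed, log_ors)) / sum_w

    # Cochran's Q and the I^2 heterogeneity statistic (in percent).
    q = sum(w * (y - fixed) ** 2 for w, y in zip(w_fixed, log_ors))
    df = len(log_ors) - 1
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0

    # DerSimonian-Laird between-study variance, truncated at zero.
    c = sum_w - sum(w ** 2 for w in w_fixed) / sum_w
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0

    # Random-effect model: tau^2 is added to every study's variance,
    # which pulls the study weights toward each other.
    w_random = [1.0 / (se ** 2 + tau2) for se in ses]
    sum_wr = sum(w_random)
    pooled_random = sum(w * y for w, y in zip(w_random, log_ors)) / sum_wr

    return {
        "fixed": fixed,
        "random": pooled_random,
        "i2": i2,
        "tau2": tau2,
        "fixed_shares": [w / sum_w for w in w_fixed],
        "random_shares": [w / sum_wr for w in w_random],
    }


# Hypothetical SNP reported in only two studies with discordant effects.
result = pool_effects(log_ors=[0.10, 0.40], ses=[0.05, 0.15])
print(f"fixed OR  = {math.exp(result['fixed']):.3f}")
print(f"random OR = {math.exp(result['random']):.3f}")
print(f"I^2       = {result['i2']:.1f}%")
```

With these made-up inputs the more precise study dominates the fixed-effect estimate, whereas the random-effect weights are closer to equal, so the two pooled estimates diverge when heterogeneity is high and only two reports are available.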

Despite these limitations, our study has several strengths. First, we used a comprehensive and systematic approach to identify all possible GWAS in the literature. Second, the statistical power of our analysis was increased by the large number of SNPs included in the meta-analysis. Moreover, compared to SNP-based GWAS, GBA is more robust in terms of statistical significance. By combining the SNPs of individual GWAS into a gene-based score without increasing the sample size or collecting new data, the statistical power is increased because the number of tests falls, resulting in a less stringent significance threshold (Liu et al., 2010). Therefore, our study highlights the possibilities that meta-analysis and GBA offer by reusing published summary statistics. Furthermore, functional annotation using disease network, biological pathway, GO, gene-drug, and chemical interaction analyses permitted a further understanding of the mechanisms of GC development.
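
The difference in significance thresholds between SNP-level and gene-level testing can be sketched with a simple Bonferroni calculation. The sketch assumes a family-wise alpha of 0.05, roughly one million independent SNP tests (the usual rationale for the 5 × 10^–8 level), and a round figure of 20,000 gene-based tests; the gene count and the function name are illustrative assumptions rather than figures from this study.

```python
def bonferroni_threshold(alpha, n_tests):
    """Return the per-test p-value threshold under Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


ALPHA = 0.05
N_INDEPENDENT_SNPS = 1_000_000  # assumption behind the conventional 5e-8 level
N_GENES = 20_000                # hypothetical number of gene-based tests

snp_threshold = bonferroni_threshold(ALPHA, N_INDEPENDENT_SNPS)
gene_threshold = bonferroni_threshold(ALPHA, N_GENES)

print(f"SNP-level threshold:  {snp_threshold:.1e}")
print(f"Gene-level threshold: {gene_threshold:.1e}")
print(f"Gene-level threshold is {gene_threshold / snp_threshold:.0f}x less stringent")
```

Under these assumptions, fewer tests at the gene level translate directly into a larger per-test threshold, which is the sense in which GBA relaxes the multiple-testing burden.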

Based on the comprehensive investigation and multifaceted functional analysis of the reported GC-associated genetic variants, we conclude that PRKAA1 is a key gene for GC development. Based on our results, PRKAA1, which is involved in the PI3K-Akt-mTOR signaling pathway, could be a target gene for future GC drug development.

Statements

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.

Author contributions

SL and SP conceived and designed the study. SL performed the statistical analyses and wrote the manuscript. SL, H-KY, H-JL, DP, S-HK, and SP revised the manuscript. SP supervised the study. All authors reviewed the final version of the manuscript and approved the final submission.

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIP) (No. NRF-2016R1A2B4014552) and the Korean Foundation for Cancer Research (No. CB-2013-01).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.928783/full#supplementary-material

References

1
Abnet, C. C., Freedman, N. D., Hu, N., Wang, Z., Yu, K., Shu, X. O., et al. (2010). A shared susceptibility locus in PLCE1 at 10q23 for gastric adenocarcinoma and esophageal squamous cell carcinoma. Nat. Genet. 42 (9), 764–767. doi:10.1038/ng.649
2
Asadzadeh-Aghdaee, H., Shahrokh, S., Norouzinia, M., Hosseini, M., Keramatinia, A., Jamalan, M., et al. (2016). Introduction of inflammatory bowel disease biomarkers panel using protein-protein interaction (PPI) network analysis. Gastroenterol. Hepatol. Bed Bench 9 (1), S8–S13.
3
Bradley, C. A., Salto-Tellez, M., Laurent-Puig, P., Bardelli, A., Rolfo, C., Tabernero, J., et al. (2017). Targeting c-MET in gastrointestinal tumours: rationale, opportunities and challenges. Nat. Rev. Clin. Oncol. 14 (9), 562–576. doi:10.1038/nrclinonc.2017.40
4
Brynedal, B., Raj, T., Stranger, B. E., Bjornson, R., Neale, B. M., Voight, B. F., et al. (2014). Cross-phenotype meta-analysis reveals large-scale trans-eQTLs mediating patterns of transcriptional co-regulation. arXiv preprint arXiv:1402.1728. doi:10.48550/arXiv.1402.1728
5
Carithers, L. J., and Moore, H. M. (2015). The genotype-tissue expression (GTEx) project. New Rochelle, NY 10801 USA: Mary Ann Liebert, Inc. 140 Huguenot Street, 3rd Floor.
6
Cheung, K. S., Chan, E. W., Wong, A. Y. S., Chen, L., Seto, W. K., Wong, I. C. K., et al. (2019). Metformin use and gastric cancer risk in diabetic patients after Helicobacter pylori eradication. J. Natl. Cancer Inst. 111 (5), 484–489. doi:10.1093/jnci/djy144
7
Chmiela, M., Walczak, N., and Rudnicka, K. (2018). Helicobacter pylori outer membrane vesicles involvement in the infection development and Helicobacter pylori-related diseases. J. Biomed. Sci. 25 (1), 78. doi:10.1186/s12929-018-0480-y
8
Cho, K., Lee, H. G., Piao, J. Y., Kim, S. J., Na, H. K., and Surh, Y. J. (2021). Protective effects of silibinin on Helicobacter pylori-induced gastritis: NF-κB and STAT3 as potential targets. J. Cancer Prev. 26 (2), 118–127. doi:10.15430/jcp.2021.26.2.118
9
Churin, Y., Al-Ghoul, L., Kepp, O., Meyer, T. F., Birchmeier, W., and Naumann, M. (2003). Helicobacter pylori CagA protein targets the c-Met receptor and enhances the motogenic response. J. Cell Biol. 161 (2), 249–255. doi:10.1083/jcb.200208039
10
Davis, A. P., Grondin, C. J., Johnson, R. J., Sciaky, D., Wiegers, J., Wiegers, T. C., et al. (2021). Comparative Toxicogenomics database (CTD): Update 2021. Nucleic Acids Res. 49 (D1), D1138–D1143. doi:10.1093/nar/gkaa891
11
Du, M., Zheng, R., Ma, G., Chu, H., Lu, J., Li, S., et al. (2020). Remote modulation of lncRNA GCLET by risk variant at 16p13 underlying genetic susceptibility to gastric cancer. Sci. Adv. 6 (21), eaay5525. doi:10.1126/sciadv.aay5525
12
Eom, S. Y., Hong, S. M., Yim, D. H., Kwon, H. J., Kim, D. H., Yun, H. Y., et al. (2016). Additive interactions between PRKAA1 polymorphisms and Helicobacter pylori CagA infection associated with gastric cancer risk in Koreans. Cancer Med. 5 (11), 3236–3335. doi:10.1002/cam4.926
13
Farh, K. K.-H., Marson, A., Zhu, J., Kleinewietfeld, M., Housley, W. J., Beik, S., et al. (2015). Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature 518 (7539), 337–343. doi:10.1038/nature13835
14
Freshour, S. L., Kiwala, S., Cotto, K. C., Coffman, A. C., McMichael, J. F., Song, J. J., et al. (2021). Integration of the drug-gene interaction database (DGIdb 4.0) with open crowdsource efforts. Nucleic Acids Res. 49 (D1), D1144–D1151. doi:10.1093/nar/gkaa1084
15
Gene Ontology Consortium (2015). Gene ontology consortium: going forward. Nucleic Acids Res. 43 (D1), D1049–D1056. doi:10.1093/nar/gku1179
16
Grundberg, E., Small, K. S., Hedman, Å. K., Nica, A. C., Buil, A., Keildson, S., et al. (2012). Mapping cis- and trans-regulatory effects across multiple tissues in twins. Nat. Genet. 44 (10), 1084–1089. doi:10.1038/ng.2394
17
Guang, W., Ding, H., Czinn, S. J., Kim, K. C., Blanchard, T. G., and Lillehoj, E. P. (2010). Muc1 cell surface mucin attenuates epithelial inflammation in response to a common mucosal pathogen. J. Biol. Chem. 285 (27), 20547–20557. doi:10.1074/jbc.M110.121319
18
Gurevitch, J., Koricheva, J., Nakagawa, S., and Stewart, G. (2018). Meta-analysis and the science of research synthesis. Nature 555 (7695), 175–182. doi:10.1038/nature25753
19
Gustavsen, J. A., Pai, S., Isserlin, R., Demchak, B., and Pico, A. R. (2019). RCy3: Network biology using Cytoscape from within R. F1000Res. 8, 1774. doi:10.12688/f1000research.20887.3
20
Haddaway, N. R., Page, M. J., Pritchard, C. C., and McGuinness, L. A. (2022). PRISMA2020: an R package and Shiny app for producing PRISMA 2020 compliant flow diagrams, with interactivity for optimised digital transparency and Open Synthesis. Campbell Syst. Rev. 18 (2), e1230. doi:10.1002/cl2.1230
21
Haglund, K., Rusten, T. E., and Stenmark, H. (2007). Aberrant receptor signaling and trafficking as mechanisms in oncogenesis. Crit. Rev. Oncog. 13 (1), 39–74. doi:10.1615/critrevoncog.v13.i1.20
22
Hattrup, C. L., and Gendler, S. J. (2008). Structure and function of the cell surface (tethered) mucins. Annu. Rev. Physiol. 70, 431–457. doi:10.1146/annurev.physiol.70.113006.100659
23
Hedges, L. V., and Vevea, J. L. (1998). Fixed- and random-effects models in meta-analysis. Psychol. Methods 3 (4), 486–504. doi:10.1037/1082-989x.3.4.486
24
Helgason, H., Rafnar, T., Olafsdottir, H. S., Jonasson, J. G., Sigurdsson, A., Stacey, S. N., et al. (2015). Loss-of-function variants in ATM confer risk of gastric cancer. Nat. Genet. 47 (8), 906–910. doi:10.1038/ng.3342
25
Higgins, J. P., and Thompson, S. G. (2002). Quantifying heterogeneity in a meta-analysis. Stat. Med. 21 (11), 1539–1558. doi:10.1002/sim.1186
26
Higgins, J. P., Thompson, S. G., Deeks, J. J., and Altman, D. G. (2003). Measuring inconsistency in meta-analyses. BMJ 327 (7414), 557–560. doi:10.1136/bmj.327.7414.557
27
Hu, N., Wang, Z., Song, X., Wei, L., Kim, B. S., Freedman, N. D., et al. (2016). Genome-wide association study of gastric adenocarcinoma in Asia: a comparison of associations between cardia and non-cardia tumours. Gut 65 (10), 1611–1618. doi:10.1136/gutjnl-2015-309340
28
Huang, D. W., Sherman, B. T., and Lempicki, R. A. (2009). Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4 (1), 44–57. doi:10.1038/nprot.2008.211
29
Huang, H., Chanda, P., Alonso, A., Bader, J. S., and Arking, D. E. (2011). Gene-based tests of association. PLoS Genet. 7 (7), e1002177. doi:10.1371/journal.pgen.1002177
30
Jang, J., Ma, S. H., Ko, K. P., Choi, B. Y., Yoo, K. Y., and Park, S. K. (2020). Hepatocyte growth factor in blood and gastric cancer risk: A nested case-control study. Cancer Epidemiol. Biomarkers Prev. 29 (2), 470–476. doi:10.1158/1055-9965.Epi-19-0436
31
Jin, G., Ma, H., Wu, C., Dai, J., Zhang, R., Shi, Y., et al. (2012). Genetic variants at 6p21.1 and 7p15.3 are associated with risk of multiple cancers in Han Chinese. Am. J. Hum. Genet. 91 (5), 928–934. doi:10.1016/j.ajhg.2012.09.009
32
Jin, G., Lv, J., Yang, M., Wang, M., Zhu, M., Wang, T., et al. (2020). Genetic risk, incident gastric cancer, and healthy lifestyle: a meta-analysis of genome-wide association studies and prospective cohort study. Lancet Oncol. 21 (10), 1378–1386. doi:10.1016/S1470-2045(20)30460-5
33
Kim, Y. I., Kim, S. Y., Cho, S. J., Park, J. H., Choi, I. J., Lee, Y. J., et al. (2014). Long-term metformin use reduces gastric cancer risk in type 2 diabetics without insulin treatment: a nationwide cohort study. Aliment. Pharmacol. Ther. 39 (8), 854–863. doi:10.1111/apt.12660
34
Kovalchik, S. (2014). Download content from NCBI databases. R package version 4.1.0. 2021. Available at: https://cran.r-project.org/package=RISmed (Accessed March 20, 2021).
35
Krishan, S., Richardson, D. R., and Sahni, S. (2014). Gene of the month. AMP kinase (PRKAA1). J. Clin. Pathol. 67 (9), 758–763. doi:10.1136/jclinpath-2014-202422
36
Lillehoj, E. P., Lu, W., Kiser, T., Goldblum, S. E., and Kim, K. C. (2007). MUC1 inhibits cell proliferation by a beta-catenin-dependent mechanism. Biochim. Biophys. Acta 1773 (7), 1028–1038. doi:10.1016/j.bbamcr.2007.04.009
37
Little, J., Higgins, J., Bray, M., Ioannidis, J., Khoury, M., Manolio, T., et al. (2006). The HuGENet™ HuGE review handbook, version 1.0. Ottawa, Ontario, Canada: HuGENet Canada Coordinating Centre.
38
Liu, J. Z., Mcrae, A. F., Nyholt, D. R., Medland, S. E., Wray, N. R., Brown, K. M., et al. (2010). A versatile gene-based test for genome-wide association studies. Am. J. Hum. Genet. 87 (1), 139–145. doi:10.1016/j.ajhg.2010.06.009
39
Luo, L., Peng, G., Zhu, Y., Dong, H., Amos, C. I., and Xiong, M. (2010). Genome-wide gene and pathway analysis. Eur. J. Hum. Genet. 18 (9), 1045–1053. doi:10.1038/ejhg.2010.62
40
Magaway, C., Kim, E., and Jacinto, E. (2019). Targeting mTOR and metabolism in cancer: lessons and innovations. Cells 8 (12), E1584. doi:10.3390/cells8121584
41
Manolio, T. A. (2010). Genomewide association studies and assessment of the risk of disease. N. Engl. J. Med. 363 (2), 166–176. doi:10.1056/NEJMra0905980
42
McCaw, Z. R., Colthurst, T., Yun, T., Furlotte, N. A., Carroll, A., Alipanahi, B., et al. (2022). DeepNull models non-linear covariate effects to improve phenotypic prediction and association power. Nat. Commun. 13 (1), 241. doi:10.1038/s41467-021-27930-0
43
McGuinness, L. A., and Higgins, J. P. T. (2021). Risk-of-bias VISualization (robvis): An R package and Shiny web app for visualizing risk-of-bias assessments. Res. Synth. Methods 12 (1), 55–61. doi:10.1002/jrsm.1411
44
Mitra, G., Weber, M., and Stacey, D. (1993). Multiple pathways for activation of MAP kinases. Cell. Mol. Biol. Res. 39 (5), 517–523.
45
Mocellin, S., Verdi, D., Pooley, K. A., and Nitti, D. (2015). Genetic variation and gastric cancer risk: a field synopsis and meta-analysis. Gut 64 (8), 1209–1219. doi:10.1136/gutjnl-2015-309168
46
Morgenthaler, S., and Thilly, W. G. (2007). A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: a cohort allelic sums test (CAST). Mutat. Res. 615 (1-2), 28–56. doi:10.1016/j.mrfmmm.2006.09.003
47
Mosher, D. F., and Adams, J. C. (2012). Adhesion-modulating/matricellular ECM protein families: a structural, functional and evolutionary appraisal. Matrix Biol. 31 (3), 155–161. doi:10.1016/j.matbio.2012.01.003
48
Nath, S., and Mukherjee, P. (2014). MUC1: a multifaceted oncoprotein with a key role in cancer progression. Trends Mol. Med. 20 (6), 332–342. doi:10.1016/j.molmed.2014.02.007
49
Nica, A. C., and Dermitzakis, E. T. (2013). Expression quantitative trait loci: present and future. Philos. Trans. R. Soc. Lond. B Biol. Sci. 368 (1620), 20120362. doi:10.1098/rstb.2012.0362
50
Oetting, W. S., Jacobson, P. A., and Israni, A. K. (2017). Validation is critical for genome-wide association study-based associations. Am. J. Transplant. 17 (2), 318–319. doi:10.1111/ajt.14051
51
Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., et al. (2021). The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 372, n71. doi:10.1136/bmj.n71
52
Park, B., Yang, S., Lee, J., Woo, H. D., Choi, I. J., Kim, Y. W., et al. (2019). Genome-wide association of genetic variation in the PSCA gene with gastric cancer susceptibility in a Korean population. Cancer Res. Treat. 51 (2), 748–757. doi:10.4143/crt.2018.162
53
Piñero, J., Ramírez-Anguita, J. M., Saüch-Pitarch, J., Ronzano, F., Centeno, E., Sanz, F., et al. (2020). The DisGeNET knowledge platform for disease genomics: 2019 update. Nucleic Acids Res. 48 (D1), D845–D855. doi:10.1093/nar/gkz1021
54
Pratt, D., Chen, J., Welker, D., Rivas, R., Pillich, R., Rynkov, V., et al. (2015). NDEx, the network data Exchange. Cell Syst. 1 (4), 302–305. doi:10.1016/j.cels.2015.10.001
55
Rashkin, S. R., Graff, R. E., Kachuri, L., Thai, K. K., Alexeeff, S. E., Blatchins, M. A., et al. (2020). Pan-cancer study detects genetic risk variants and shared genetic basis in two large cohorts. Nat. Commun. 11 (1), 4423. doi:10.1038/s41467-020-18246-6
56
Richardson, W. S., Wilson, M. C., Nishikawa, J., and Hayward, R. S. (1995). The well-built clinical question: a key to evidence-based decisions. ACP J. Club 123 (3), A12–A13.
57
Risch, N., and Merikangas, K. (1996). The future of genetic studies of complex human diseases. Science 273 (5281), 1516–1517. doi:10.1126/science.273.5281.1516
58
Rizzato, C., Kato, I., Plummer, M., Muñoz, N., and Canzian, F. (2013). Genetic variation in PSCA and risk of gastric advanced preneoplastic lesions and cancer in relation to Helicobacter pylori infection. PLoS One 8 (9), e73100. doi:10.1371/journal.pone.0073100
59
Sakamoto, H., Yoshimura, K., Saeki, N., Katai, H., Shimoda, T., Matsuno, Y., et al. (2008). Genetic variation in PSCA is associated with susceptibility to diffuse-type gastric cancer. Nat. Genet. 40 (6), 730–740. doi:10.1038/ng.152
60
Seger, R., and Krebs, E. G. (1995). The MAPK signaling cascade. FASEB J. 9 (9), 726–735. doi:10.1096/fasebj.9.9.7601337
61
Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., et al. (2003). Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13 (11), 2498–2504. doi:10.1101/gr.1239303
62
Shi, Y., Hu, Z., Wu, C., Dai, J., Li, H., Dong, J., et al. (2011). A genome-wide association study identifies new susceptibility loci for non-cardia gastric cancer at 3q13.31 and 5p13.1. Nat. Genet. 43 (12), 1215–1218. doi:10.1038/ng.978
63
Slenter, D. N., Kutmon, M., Hanspers, K., Riutta, A., Windsor, J., Nunes, N., et al. (2018). WikiPathways: a multifaceted pathway database bridging metabolomics to other omics research. Nucleic Acids Res. 46 (D1), D661–D667. doi:10.1093/nar/gkx1064
64
Solovieff, N., Cotsapas, C., Lee, P. H., Purcell, S. M., and Smoller, J. W. (2013). Pleiotropy in complex traits: challenges and strategies. Nat. Rev. Genet. 14 (7), 483–495. doi:10.1038/nrg3461
65
Stadler, Z. K., Thom, P., Robson, M. E., Weitzel, J. N., Kauff, N. D., Hurley, K. E., et al. (2010). Genome-wide association studies of cancer. J. Clin. Oncol. 28 (27), 4255–4267. doi:10.1200/JCO.2009.25.7816
66
Sterne, J. A., Hernán, M. A., Reeves, B. C., Savović, J., Berkman, N. D., Viswanathan, M., et al. (2016). ROBINS-I: a tool for assessing risk of bias in non-randomised studies of interventions. BMJ 355, i4919. doi:10.1136/bmj.i4919
67
Stranger, B. E., Montgomery, S. B., Dimas, A. S., Parts, L., Stegle, O., Ingle, C. E., et al. (2012). Patterns of cis regulatory variation in diverse human populations. PLoS Genet. 8 (4), e1002639. doi:10.1371/journal.pgen.1002639
68
Sung, H., Ferlay, J., Siegel, R. L., Laversanne, M., Soerjomataram, I., Jemal, A., et al. (2021). Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71 (3), 209–249. doi:10.3322/caac.21660
69
Suzuki, M., Mimuro, H., Kiga, K., Fukumatsu, M., Ishijima, N., Morikawa, H., et al. (2009). Helicobacter pylori CagA phosphorylation-independent function in epithelial proliferation and inflammation. Cell Host Microbe 5 (1), 23–34. doi:10.1016/j.chom.2008.11.010
70
Svishcheva, G. R., Belonogova, N. M., and Axenovich, T. I. (2015). Region-based association test for familial data under functional linear models. PLoS One 10 (6), e0128999. doi:10.1371/journal.pone.0128999
71
Szklarczyk, D., Gable, A. L., Nastou, K. C., Lyon, D., Kirsch, R., Pyysalo, S., et al. (2021). The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 49 (D1), D605–D612. doi:10.1093/nar/gkaa1074
72
Tanikawa, C., Urabe, Y., Matsuo, K., Kubo, M., Takahashi, A., Ito, H., et al. (2012). A genome-wide association study identifies two susceptibility loci for duodenal ulcer in the Japanese population. Nat. Genet. 44 (4), 430–434. doi:10.1038/ng.1109
73
Tanikawa, C., Kamatani, Y., Toyoshima, O., Sakamoto, H., Ito, H., Takahashi, A., et al. (2018). Genome-wide association study identifies gastric cancer susceptibility loci at 12q24.11-12 and 20q11.21. Cancer Sci. 109 (12), 4015–4024. doi:10.1111/cas.13815
74
Toyoshima, O., Tanikawa, C., Yamamoto, R., Watanabe, H., Yamashita, H., Sakitani, K., et al. (2018). Decrease in PSCA expression caused by Helicobacter pylori infection may promote progression to severe gastritis. Oncotarget 9 (3), 3936–3945. doi:10.18632/oncotarget.23278
75
VišnjićD.LalićH.DembitzV.TomićB.SmoljoT. (2021). AICAr, a widely used AMPK activator with important AMPK-independent effects: a systematic review. Cells10 (5), 1095. 10.3390/cells10051095PubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
76
VõsaU.ClaringbouldA.WestraH.-J.BonderM. J.DeelenP.ZengB.et al (2018). Unraveling the polygenic architecture of complex traits using blood eQTL metaanalysis. BioRxiv, 447367. 10.1101/447367CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
77
VõsaU.ClaringbouldA.WestraH. J.BonderM. J.DeelenP.ZengB.et al (2021). Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression. Nat. Genet.53 (9), 1300–1310. 10.1038/s41588-021-00913-zPubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
78
WangW. Y.BarrattB. J.ClaytonD. G.ToddJ. A. (2005). Genome-wide association studies: theoretical and practical concerns. Nat. Rev. Genet.6 (2), 109–118. 10.1038/nrg1522PubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
79
WangZ.DaiJ.HuN.MiaoX.AbnetC. C.YangM.et al (2017). Identification of new susceptibility loci for gastric non-cardia adenocarcinoma: pooled results from two Chinese genome-wide association studies. Gut66 (4), 581–587. 10.1136/gutjnl-2015-310612PubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
80
WangY.MengY.ZhangS.WuH.YangD.NieC.et al (2018). Phenformin and metformin inhibit growth and migration of LN229 glioma cells in vitro and in vivo. Onco. Targets. Ther.11, 6039–6048. 10.2147/ott.S168981PubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
81
WatanabeK.TaskesenE.Van BochovenA.PosthumaD. (2017). Functional mapping and annotation of genetic associations with FUMA. Nat. Commun.8 (1), 1826–1911. 10.1038/s41467-017-01261-5PubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
82
WestraH. J.FrankeL. (2014). From genome to function by studying eQTLs. Biochim. Biophys. Acta1842 (10), 1896–1902. 10.1016/j.bbadis.2014.04.024PubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
83
WestraH. J.PetersM. J.EskoT.YaghootkarH.SchurmannC.KettunenJ.et al (2013). Systematic identification of trans eQTLs as putative drivers of known disease associations. Nat. Genet.45 (10), 1238–1243. 10.1038/ng.2756PubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
84
WheelerH. E.MaitlandM. L.DolanM. E.CoxN. J.RatainM. J. (2013). Cancer pharmacogenomics: strategies and challenges. Nat. Rev. Genet.14 (1), 23–34. 10.1038/nrg3352PubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
85
WishartD. S.FeunangY. D.GuoA. C.LoE. J.MarcuA.GrantJ. R.et al (2018). DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res.46 (D1), D1074–d1082. 10.1093/nar/gkx1037PubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
86
WuY.QiY.LiuH.WangX.ZhuH.WangZ. (2016). AMPK activator AICAR promotes 5-FU-induced apoptosis in gastric cancer cells. Mol. Cell. Biochem.411 (1-2), 299–305. 10.1007/s11010-015-2592-yPubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
87
YanC.ZhuM.DingY.YangM.WangM.LiG.et al (2020). Meta-analysis of genome-wide association studies and functional assays decipher susceptibility genes for gastric cancer in Chinese populations. Gut69 (4), 641–651. 10.1136/gutjnl-2019-318760PubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar
88
YuG.WangL. G.HanY.HeQ. Y. (2012). clusterProfiler: an R package for comparing biological themes among gene clusters. Omics16 (5), 284–287. 10.1089/omi.2011.0118PubMed Abstract | CrossRef Full Text | Google Scholar
- CrossRef
- Google Scholar

Summary

Keywords

stomach neoplasms, gastric cancer, genome-wide association study, gene-based analysis, functional annotations

Citation

Lee S, Yang H-K, Lee H-J, Park DJ, Kong S-H and Park SK (2022) Systematic review of gastric cancer-associated genetic variants, gene-based meta-analysis, and gene-level functional analysis to identify candidate genes for drug development. Front. Genet. 13:928783. doi: 10.3389/fgene.2022.928783

Received: 26 April 2022

Accepted: 25 July 2022

Published: 16 August 2022

Volume: 13 - 2022

Edited by

Monde Ntwasa, University of South Africa, South Africa

Reviewed by

Zhiguo Xie, Central South University, China

Yang Zhang, Peking University, China

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Sue K. Park, suepark@snu.ac.kr

This article was submitted to Pharmacogenetics and Pharmacogenomics, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
