Skip to main content


Front. Genet., 15 March 2017
Sec. Behavioral and Psychiatric Genetics
Volume 8 - 2017 |

Molecular Genetic Influences on Normative and Problematic Alcohol Use in a Population-Based Sample of College Students

Bradley T. Webb1,2,3* Alexis C. Edwards1,2 Aaron R. Wolen4 Jessica E. Salvatore1,5 Fazil Aliev6,7,8 Brien P. Riley1,2,3 Cuie Sun1,2 Vernell S. Williamson1 James N. Kitchens1 Kimberly Pedersen1,2,8 Amy Adkins5,8 Megan E. Cooke1,8 Jeanne E. Savage1,8 Zoe Neale5,8 Seung B. Cho5,8 Danielle M. Dick3,5,6,8 Kenneth S. Kendler1,2
  • 1Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, VA, USA
  • 2Department of Psychiatry, Virginia Commonwealth University, Richmond, VA, USA
  • 3Department of Human and Molecular Genetics, Virginia Commonwealth University, Richmond, VA, USA
  • 4Center for Clinical and Translational Research, Virginia Commonwealth University, Richmond, VA, USA
  • 5Department of Psychology, Virginia Commonwealth University, Richmond, VA, USA
  • 6Department of African-American Studies, Virginia Commonwealth University, Richmond, VA, USA
  • 7Faculty of Business, Karabuk University, Karabuk, Turkey
  • 8College Behavioral and Emotional Health Institute, Virginia Commonwealth University, Richmond, VA, USA

Background: Genetic factors impact alcohol use behaviors and these factors may become increasingly evident during emerging adulthood. Examination of the effects of individual variants as well as aggregate genetic variation can clarify mechanisms underlying risk.

Methods: We conducted genome-wide association studies (GWAS) in an ethnically diverse sample of college students for three quantitative outcomes including typical monthly alcohol consumption, alcohol problems, and maximum number of drinks in 24 h. Heritability based on common genetic variants (h2SNP) was assessed. We also evaluated whether risk variants in aggregate were associated with alcohol use outcomes in an independent sample of young adults.

Results: Two genome-wide significant markers were observed: rs11201929 in GRID1 for maximum drinks in 24 h, with supportive evidence across all ancestry groups; and rs73317305 in SAMD12 (alcohol problems), tested only in the African ancestry group. The h2SNP estimate was 0.19 (SE = 0.11) for consumption, and was non-significant for other outcomes. Genome-wide polygenic scores were significantly associated with alcohol outcomes in an independent sample.

Conclusions: These results robustly identify genetic risk for alcohol use outcomes at the variant level and in aggregate. We confirm prior evidence that genetic variation in GRID1 impacts alcohol use, and identify novel loci of interest for multiple alcohol outcomes in emerging adults. These findings indicate that genetic variation influencing normative and problematic alcohol use is, to some extent, convergent across ancestry groups. Studying college populations represents a promising avenue by which to obtain large, diverse samples for gene identification.


Alcohol use phenotypes are genetically influenced, with heritability for alcohol use disorder typically estimated at around 50% in twin and adoption studies (Heath et al., 1997; McGue, 1999; Prescott and Kendler, 1999; Verhulst et al., 2015). Other alcohol-related phenotypes, such as initiation of use, typical consumption, maximum number of drinks consumed in a day, or initial response to alcohol, are also modestly to moderately heritable (Fowler et al., 2007; Poelen et al., 2009; Agrawal et al., 2011; Kalu et al., 2012). As these measures constitute steps along the trajectory to alcohol problems among some individuals, elucidating their genetic etiology is important.

Many genome-wide association studies (GWAS) have been conducted for diagnostic alcohol outcomes (e.g., alcohol abuse/dependence or symptom count; Treutlein et al., 2009a; Bierut et al., 2010; Edenberg et al., 2010; Kendler et al., 2011b; Wang et al., 2013; Gelernter et al., 2014). Overall, these investigations have not enjoyed the same success in large-scale gene-identification efforts as, for example, schizophrenia (Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2014). Quantitative phenotypes are more statistically powerful than binary outcomes in general population samples and have also been the subject of recent genome-wide studies (e.g., Heath et al., 2011; Schumann et al., 2011; Pan et al., 2013; Kos et al., 2014; Edwards et al., 2015). In some cases, associations surpassing stringent genome-wide significance criteria have been observed (Schumann et al., 2011; Pan et al., 2013), though robust replication has not been reported.

Although, robust and replicated variant level associations have yet to be discovered for alcohol related measures, most GWAS have supported the hypothesis that these phenotypes are classic quantitative genetic traits, influenced by many variants (hundreds to thousands) of individual small effect. Methods that examine the aggregate effects of common variants represent an important complement to analyses based on single markers (e.g., primary GWAS results) and support the results from twin and family studies. Heritabilities based on common markers using unrelated individuals have been reported including an alcohol problems score in Dutch adults (0.33; Mbarek et al., 2015), and maximum drinks in 24 h and alcohol use disorder (AUD) (0.32 and 0.34, respectively; Kos et al., 2014) in a modestly sized sample of Mexican-American adults. However, in a UK sample of emerging adults, a heritability estimate of 0.05 for alcohol problems was non-significant (Edwards et al., 2015).

Most prior GWAS have focused on alcohol outcomes in mature adults, and less is known about the transition period between adolescence and young adulthood. Alcohol use increases across adolescence, with consumption, risky drinking, and alcohol use disorders all peaking in young adulthood (SAMHSA, 2014). Longitudinal studies indicate that heritability estimates for alcohol use and problems also increase across adolescence and generally reach the levels observed in adults during emerging adulthood (Rose et al., 2001; Palmer et al., 2013; Samek et al., 2013), though there is some suggestion from cross-sectional and/or retrospective studies that heritability may continue to increase slightly across adulthood (Bergen et al., 2007; Kendler et al., 2007; Hansell et al., 2008). During emerging adulthood there is also a shift in the nature of genetic influences on substance use outcomes, as it is during this time frame that genetic factors become more substance-specific (Vrieze et al., 2012) and less related to overall externalizing behavior (Kendler et al., 2011a; Edwards and Kendler, 2013; Meyers et al., 2014). Thus, emerging adulthood is a critical time frame for clarifying alcohol use etiology.

Here, we examine problematic and normative alcohol use in a population-based sample of US emerging adults. The Spit for Science (S4S) sample (Dick et al., 2014) was recruited at a large, urban university as part of a study on alcohol use and other health-related behaviors. In addition to emerging adulthood being an important period for clarifying the etiology of alcohol use outcomes, there is evidence that college students drink more than their non-college attending peers (Quinn and Fromme, 2011), and that the college environment actually enhances the degree to which genetic influences are important for alcohol consumption (Timberlake et al., 2007). Accordingly, this population presents an opportunity to readily obtain large samples in order to assess genetic influences on alcohol outcomes. The Spit for Science sample's ethnic diversity further enables us to assess whether genetic effects are consistent across ancestry groups.

Three cross-sectional alcohol outcomes were assessed: typical consumption, maximum drinks in 24 h, and a quantitative measure of alcohol problems, which have been demonstrated to be phenotypically and genetically correlated (Kendler et al., 2010; Dick et al., 2011). A series of analyses investigate the effects of individual variants as well as aggregate measures of genetic risk. These effects are tested for replication in an independent sample.



Spit for Science is an ongoing longitudinal study of college students enrolled in a large, urban university in the Mid-Atlantic, as previously described (Dick et al., 2014). Briefly, incoming students age 18 or older were eligible to complete phenotypic assessments, which covered a wide range of topics but focused on alcohol use. Study data were collected and managed using REDCap electronic data capture tools (Harris et al., 2009) hosted at Virginia Commonwealth University. Follow-up assessments were completed in subsequent spring semesters. Individuals who did not participate in the first wave of data collection (including those who turned 18 after the end of the first wave of data collection) had the opportunity to join the study the following spring; those who participated during their first year were eligible to complete follow-up assessments each spring. Participants who completed the phenotypic assessments were eligible to provide a DNA sample. The current study includes three cohorts, which matriculated in Fall 2011 (N = 2,714), 2012 (N = 2,486), and 2013 (N = 2,403), for a total N = 7,603. Of these, 98% provided a DNA sample. As the current analyses are based on the data capture after the Spring 2014 survey, data were available for 2–4 waves, depending on cohort. At wave 1 (61% female), the average (SD) age was 18.59 (0.61); at wave 2 (65% female), 18.99 (0.44); at wave 3 (66% female), 19.94 (0.60); and at wave 4 (66% female), 20.93 (0.69).

Alcohol Use Variables

For the current study, three alcohol-related variables were derived including a measure of past 30-day alcohol consumption in grams of ethanol (Consumption), the lifetime maximum number of drinks consumed in a 24-h period (Maxdrinks), and an alcohol problems sum score (Symptoms). Abstainers were not administered the alcohol-related items and therefore were not included in any of the current phenotypic and genetic analyses. Participants were given the option of skipping questions therefore number of participants varies across constructs. When a participant had assessments from different waves, the highest score was used.

Typical Consumption (Consumption)

Participants were asked to report on their drinking in the past month with items regarding how frequently they drank and how many drinks they typically consumed on a drinking day. These items were combined to create a single measure of grams of ethanol consumed in the last month, using a method previously described (Dawson, 2000) and reported for the current sample by Salvatore et al. (2016). Briefly, the frequency and quantity variables were multiplied, and that product was multiplied by 14 (corresponding to the number of grams of ethanol in a standard alcoholic drink). Responses were windsorized with a maximum value of 2,000 g to account for unlikely responses. Values were natural log transformed (after adding 1 to each value). Phenotypic data were available for 7,374 participants.

Alcohol Problems Sum Score (Problems)

In waves 1–4, participants who reported having ever consumed alcohol were asked items related to DSM-5 (American Psychiatric Association, 2013) alcohol use disorder criteria (e.g., “Have you ever started drinking and become drunk when you didn't want to?”), with some criteria assessed using multiple items. For all but 2 items, response options were “never,” “1–2 times,” or “3 or more times,” which were scored 1, 2, and 3, respectively. Items addressing craving and tolerance had response options of “no” and “yes,” coded 0 and 1, respectively. Participants missing more than half their data within a wave were coded as missing for that wave. For each wave, sum scores were created and pro-rated to account for the missingness (when at least half the data was present) and data structure. For the first survey of wave 1, only the seven DSM-IV alcohol dependence items were assessed. None of the four alcohol abuse items were measured. Subsequent surveys of wave 1 and all surveys for wave 2–4 included 11 DSM-V items which were modified to make them appropriate for the participants in accordance with IRB guidelines that the language be written at a 10th grade reading level. The highest pro-rated score across all available waves was selected and natural log-transformed (after adding 1), with data available for 6,082 participants.

Maximum Drinks in 24 h (Maxdrinks)

Participants were asked in waves 1–4 to report the highest number of drinks they had ever consumed in a 24-h period, with response options ranging from 1 to 24, plus “more than 24.” They could also opt not to answer or select “I don't know,” both of which were recoded as missing for the current analyses. Data were available for 6,125 participants. We selected the highest reported Maxdrinks for each individual, as well as the participant's age at that report for inclusion as a covariate in genetic analyses.

Genotyping, Pre-imputation QC, and Imputation

There were 6534 samples passing DNA and initial genotyping QC. Genotyping was performed at Rutgers University Cell and DNA Repository (RUCDR) using the Affymetrix BioBank array (653 k variants) which contains both common GWAS framework variants (296 k) for imputation and functional variants (357 k) including rare high impact exome variants (272 k), indels (18 k), eQTLs (16 k), and miscellaneous (51 k). QC excluded Off Target Variants found by SNPolisher, single nucleotide polymorphisms (SNPs) missing >5% of genotypes, samples missing >2% of genotypes, and SNPs missing >2% of genotypes after sample filtering, similar to the Psychiatric Genomics Consortium (PGC; Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2014). This pre-imputation QC removed 209 samples, leaving 6,325 samples and 560,138 variants for imputation. Imputation was conducted using SHAPEIT2 (Delaneau et al., 2013)/IMPUTE2 (Howie et al., 2009) and the 1000 genomes phase 3 reference panel (n = 2,504) (1000 Genomes Project Consortium et al., 2015; Sudmant et al., 2015).

Ancestry Principal Components (PC)

1000 Genomes Project (1KGP) phase 3 variants (2,504 samples, 26 populations) found in common with the post QC filtered S4S genotypes were merged together. Regions with high LD were excluded (Price et al., 2006, 2008) and the common set of variants was then pruned (r2 < 0.1) using PLINK 1.9 (Purcell et al., 2007; Chang et al., 2015) (–indep-pairwise 1,500 150 0.1) to yield 109,259 semi-independent variants for ancestry analyses. EIGENSOFT/SmartPCA (Patterson et al., 2006; Price et al., 2006) was used to perform PCA using only the 1KGP phase 3 reference panel to determine SNP weights for each eigenvector. This solution was then projected onto the S4S data to generate 10 principal components (PCs).

Genetic Based Population Assignment

S4S is ethnically diverse with self-identified census race/ethnicity as follows: American Indian/Alaska Native (N = 35); Asian (N = 1223); Black/African American (N = 1464); Hispanic/Latino (N = 450); More than one race (N = 467); Native Hawaiian/Other Pacific Islander (N = 50); Unknown (N = 30); and White (N = 3763). Participants could also elect not to answer (N = 108). As noted previously, the sample of participants corresponds closely to the overall demographics of the university student population (Dick et al., 2014).

For genetic analyses, S4S subjects were empirically assigned to 1KGP based ancestry super-populations. Briefly, using all 10 ancestry PCs, the Mahalanobis distance (Mahalanobis, 1936) between each S4S sample and each 1KGP population (N = 26) without reference population outliers (>4 SD from population median, N = 61) was calculated. Each subject was then assigned to the 1KGP population with the minimum Mahalanobis distance and then collapsed into their respective super-population assignment. This empirically based ancestry has several advantages to self-identified race/ethnicity including reducing variance of the within group PCs and being able to include “Unknown,” “More than one race,” and small groups in the analysis without an increase in genomic inflation. There were five final ancestry groups: African descent (AFR), American descent (AMR), East Asian descent (EAS), European descent (EUR), and South Asian descent (SAS).

Within Group Quality Control

Due to the diverse nature of S4S, filtering by Hardy-Weinberg Equilibrium (HWE), minor allele frequency (MAF), and relatedness were performed within empirically assigned super-populations. Genome-wide IBD (Π^) was calculated using PLINK 1.9. For each sample, the mean cross-sample Π^ was calculated to find samples showing excessive relatedness, which is where a sample appears to be a cryptic relative to many other samples but those samples do not appear related to one another. One hundred and ninety four samples were excluded (>2.5 standard deviations above the mean) as outliers for average relatedness with all other samples. Clusters of probable relatives were defined using Π^ > 0.1, Z0 >= 0.825, and Z1 < 0.175. The inclusion of Z0/Z1 is important since Π^ > 0.1 can be due to artifacts where Z2 > 0 which is extremely unlikely for cryptic relatives. Then the best performing sample for each relative cluster was retained which resulted in an additional 180 samples being excluded from the GWAS sample.

The choice of ancestry PC covariates to include in each super-population GWAS was determined by stepwise linear regression for the specific phenotype being analyzed. Non-ancestry covariates (sex and age) were forced to be kept in the model while ancestry covariates were kept if they were retained in the best fitting model as measured by AIC. This approach, of only keeping PCs significant to a specific ancestry and phenotype, increases parsimony and minimizes risk of over-fitting. We have shown that there is no evidence of genomic inflation when this parsimonious approach is taken.

SNPTEST (Marchini et al., 2007) v2.5.2 was used to conduct association analyses under an additive model only including markers with a minimum MAF of 0.005 and INFO of 0.5. Post GWAS filtering was performed using ancestry specific HWE (p > 10−6) and sample size based MAFs. Instead of using a fixed MAF threshold for each group, the minimum observed minor allele count (MAC) was used. Previous research has shown a MAC of ~40 is robust for most association analyses performed in GWAS (Bigdeli et al., 2014). Post filtered GWAS results were meta-analyzed using METAL (Marchini et al., 2007; Willer et al., 2010) which implements a fixed effect model and inverse variance weighting based on sample size. Markers available for fewer than 1,000 individuals after meta-analysis were excluded. Estimation of genomic inflation (λ, λ1,000) for within super-population GWAS and meta-analyses was performed in R (R Core Team, 2016). False Discovery Rate (FDR) analysis was performed using the “q-value” package ( using Bioconductor 3.2 (Huber et al., 2015). To define genomic bins for follow-up, we started with all markers with a q < 0.5. Markers were initially collapsed into bins if they were within 10 kb. Post-hoc inspection showed several adjacent bins <75 kb apart which were then collapsed into the reported bins.


Genome-wide Complex Trait Analysis was used (Yang et al., 2011) (GCTA) to estimate the proportion of phenotypic variance attributable to observed (non-imputed) genetic variants [V(G)/Vp, or h2SNP]. Genetic relationship matrices (GRMs) were derived for each ancestry group. Within ancestry group, only unrelated individuals were included in the GRM as in the GWAS, resulting in the following sample sizes: AFR: N = 1339; AMR: N = 582; EAS: N = 557; EUR: N = 3018; SAS: N = 455. An ancestry group-specific MAF cutoff of 0.01 was applied. The same ancestry PCs used in the GWAS analyses were included in the heritability analyses.



We used the Avon Longitudinal Study of Parents and Children (ALSPAC; Boyd et al., 2013; Fraser et al., 2013) to test for replication of individual variants as well as at the aggregate level. ALSPAC participants were born in 1991–1992 and are predominantly of European descent (>96%). The total ALSPAC sample included 15,247 pregnancies from women residing in Avon, UK with expected due dates between April 1991 and December 1992, resulting in 15,458 fetuses. Of this total sample, 14,775 were live births and 14,701 were alive at 1 year of age. The study website contains details of all the data that are available through a fully searchable data dictionary ( Ethical approval for the study was obtained from the ALSPAC Ethics and Law Committee, Bristol University and Virginia Commonwealth University. Genetic data are available for N = 8,237 individuals who meet quality control filters.


ALSPAC participants were administered a self-report questionnaire that included alcohol use items at approximately age 20.75. These items included questions about frequency of alcohol use and average number of drinks per drinking day, from which the Consumption variable is derived; DSM-IV and AUDIT (Babor et al., 2001) items, from which a problems sum score is derived; and maximum number of drinks in 24 h. Thus, they corresponded quite closely to those administered to S4S participants. Sample size varied across outcomes (Consumption N = 3,150, Problems N = 2,906, and Maxdrinks N = 2,670) as a function of attrition and alcohol use initiation.


The ALSPAC sample was genotyped on the Illumina HumanHap550 quad SNP genotyping platform by the Wellcome Trust Sanger Institute (Cambridge, UK) and Laboratory Corporation of America (Burlington, NC, US). Samples were subjected to quality control filters as previously described (Edwards et al., 2015). Data were imputed using a phased version of the 1000 Genomes reference panel (Phase 1, Version 3), using Impute V2.2.2 and all reference haplotypes to maximize imputation quality.

Individual Variant Replication

For markers with q < 0.5 in the three meta-analysis results, we extracted summary statistics from GWAS results of the corresponding phenotype in ALSPAC. Due to differences in allele frequencies across the discovery and replication samples, summary statistics were not available for all markers.

Genome-Wide Polygenic Scores (GPS)

P-value threshold based polygenic scores (GPS) were calculated in ALSPAC using the S4S GWAS results. Prior to GPS derivation, we identified markers present in both the S4S and ALSPAC genetic data with MAF > 0.01 and INFO > 0.5. We pruned this list using the clumping function in PLINK 1.9 (Chang et al., 2015) to identify independent loci. Clumping was conducted based on the European subsample of the 1000 Genomes reference panel, in two stages: the first used an r2 threshold of 0.5 and a range of 250 kb; the second used an r2 threshold of 0.1 and a range of 5 mb. Markers selected were used to derive GPS for ALSPAC participants, employing a series of p-value thresholds as is customary. We derived GPS based on the Z-scores (and associated p-values) from the METAL meta-analyses as well as based on EUR-specific SNP regression coefficients (and associated p-values), given the possibility that the latter might be more predictive of outcome in the European ALSPAC sample. In R (v3.2.3), we tested whether GPS were associated with alcohol outcomes (Consumption, Problems, or Maxdrinks), controlling for sex, using linear regression. Our analytic goal was to test for association rather than to try to predict outcome, given the sample sizes and expected genetic effect sizes (Dudbridge, 2013).


Descriptive Statistics

Mean values for each variable are reported in Table 1. Variables were moderately inter-correlated (r = 0.42–0.58). The sample consists of more women than men (60.8% female), though this distribution is only slightly different than that of the student population. Women reported lower levels of drinking for all outcomes (p < 0.01). Sex was included as a covariate in genetic analyses.


Table 1. Descriptive statistics for untransformed alcohol outcome variables.

GCTA/Heritability Estimates

We used GCTA to calculate h2SNP for each of the three phenotypes, separately by ancestry group. Results are provided in Table 2. We next meta-analyzed these results across ancestries, as several groups (AMR, EAS, and SAS) had limited sample sizes. Meta-analytic results indicated that Consumption is modestly heritable, while estimates for Problems and Maxdrinks were not significantly different from zero at this age.


Table 2. SNP-based heritability estimates.

Primary GWAS Results

After applying filtering and meta-analysis, results were available for 16,511,702, 15,625,945, and 15,724,050 markers for Consumption, Problems, and Maxdrinks, respectively. For each phenotype, results for 33–35% of the markers were only testable (MAC > 40) in the AFR ancestry group due to frequency. The meta-analyses showed no evidence of genomic inflation with λ1,000s of 1.0001 (Problems), 0.9997 (Consumption), and 0.9994 (Maxdrinks). Across the three phenotypes, FDR analysis showed 187 markers with q < 0.50. All markers with FDR q < 0.5 are listed in Supplementary Table 1, with summary statistics and information on nearby genes. These 187 markers map to 53 genomic bins (Supplementary Table 1). Seven bins contained at least one genome-wide significant (GWS; p < 5 × 10−8) marker. However, in all but one of these bins the signal was limited to the AFR group. Using stricter genome-wide significance for samples of African ancestry (p < 1 × 10−8), only one AFR specific bin remained GWS.

The two markers robustly GWS are rs11201929 for Maxdrinks (p = 4.11 × 10−9, q = 0.06) and rs73317305 for Problems (p = 9.02 × 10−9, q = 0.11). rs11201929 is in the third intron of glutamate ionotropic receptor delta type 1 (GRID1) and was of sufficient frequency and quality to be tested in all five ancestry groups. Figure 1 depicts the regional association plot for GRID1. The direction of effect was consistent across ancestries, though the strength of the association varied (rs11201929 p-values by group: 5 × 10−7 AFR, 0.046 AMR, 0.07 EAS, 0.0067 EUR, 0.13 SAS). The other GWS marker, rs73317305 (Problems), is in the second intron of sterile alpha motif domain containing 12 (SAMD12) on chromosome 8 and is rare in non-African populations (MAFs: AFR 0.0898, AMR 0.0175, EAS 0, EUR 0.0021, SAS 0.0008; Figure 2).


Figure 1. Regional association plot for GRID1 and 200 kb flanking regions, implemented using LocusZoom (Pruim et al., 2010). The most significant marker is in purple (rs11201929, p = 4.11e-09, q = 0.06 for Maxdrinks). Linkage disequilibrium information is based on the 1000 Genomes AFR super-population. The size of the points representing plotted SNPs corresponds to the meta-analysis sample size.


Figure 2. Regional association plot for SAMD12 and 200 kb flanking regions. The most significant marker is in purple (rs73317305, p = 9.02 × 10−9, q = 0.11 for Problems). Linkage disequilibrium information is based on the 1000 Genomes AFR super-population, as the minor allele was rare in other subgroups. The size of the points representing plotted SNPs corresponds to the meta-analysis sample size.

For Consumption, the most strongly associated marker was rs76541530 (p = 3.39 × 10−8, q = 0.10), which does not meet our stringent genome-wide significance criterion for variants informative only in the AFR group (p < 1 × 10−8). This marker, along with 12 others that map to the same genomic region on chromosome 11, does not map to any known gene.

Replication in ALSPAC

For replication, we considered samples similar to S4S with respect to age and/or alcohol phenotypes in order to minimize lack of replication due to differences in ascertainment and phenotypes. ALSPAC met these criteria due to being a sample similar in age and the derived variables being nearly identical. Replication was attempted for (1) individual variants with FDR q < 0.50 and (2) p-value threshold based polygene scores. Importantly, only results based on equivalent phenotypes were examined (e.g., markers associated with Problems in S4S were not examined for their association with Maxdrinks or Consumption). While several variants with q < 0.50 exhibited nominal associations (p < 0.05) in ALSPAC, we did not observe robust variant level replication after correction for multiple testing for any phenotype. Results are presented in Supplementary Table 2. In contrast to specific SNP results, each GPS showed some evidence for replication, with support varying across outcomes. Consumption and Problems both showed nominal significance (p < 0.05) for a wide (pthreshold 0.01 to 0.5) and narrow range (pthreshold 0.01) of thresholds, respectively. The Maxdrinks PRS was robustly associated (p < 0.005) across a wide range of thresholds (pthreshold 0.05–0.5) Further details are provided in Table 3. Variance accounted for by the scores was low (<1%) for Consumption and Problems; for Maxdrinks scores accounted for >6% of the variance in some cases. Scores derived from the EUR-specific GWAS were similar though less pronounced (Supplementary Table 3).


Table 3. Associations between GPS derived from S4S meta-analysis results and ALSPAC alcohol outcomes.


Using a population-based study of emerging adults at a diverse mid-Atlantic university, the Spit for Science sample, we present evidence of replicable aggregate genetic influences on three alcohol-related phenotypes: typical monthly consumption (Consumption), maximum drinks in 24 h (MaxDrinks), and an alcohol problems sum score (Problems). We find further support that Consumption is heritable (h2SNP 0.19, SE = 0.11). At the marker level, variation in a previously implicated (see below) gene, GRID1, surpasses stringent genome-wide significance criteria for association with MaxDrinks. Furthermore, polygenic scores derived from S4S for Consumption and Maxdrinks, and Problems to a lesser extent, are significantly associated with the equivalent outcomes in an independent and comparably-aged sample. These results provide empirical support for the influence of aggregate molecular variation on multiple alcohol outcomes.

We observed associations between alcohol phenotypes and markers that localize to within or near genes of biological interest. Most notable is the association between Maxdrinks and the glutamate ionotropic receptor delta type subunit 1 gene (GRID1), which is involved in synaptic plasticity. GRID1 has been implicated in prior genetic studies of alcohol use outcomes: Chen et al. (2015) found that SNPs nominally associated with alcohol cue-elicited brain activation were enriched for markers mapping to genes, including GRID1, that are involved in synaptic long term depression. This gene was further implicated in comorbid alcohol dependence and depressive syndrome (Edwards et al., 2012), and in a study of alcohol problems in a population-based sample (Edwards et al., 2015). Glutamatergic receptor subunit mRNA, including GRID1, has been shown to be altered in the caudate within an alcoholic sample relative to controls (Bhandage et al., 2014). More generally, GRID1 has been associated at varying levels of significance with brain structure (Nenadic et al., 2012) and schizophrenia (Fallin et al., 2005; Treutlein et al., 2009b; Nenadic et al., 2012). Besides GRID1, top markers mapped to within or near NPAS3, which potentially regulates genes involved in neurogenesis and has been previously associated with psychiatric disorders (Pickard et al., 2006; Huang et al., 2010; Nurnberger et al., 2014); and SV2B, a synaptic vesicle protein-encoding gene implicated in cognitive processes in model systems (Detrait et al., 2014; Olson et al., 2015).

There was not support for specific (q < 0.5) variants in another sample assessed for comparable alcohol use phenotypes. Such failures to replicate may be due to population-specific effects (genetic and/or environmental) or false positive results. Many markers with q < 0.5 in the current analyses were assayed only in the AFR ancestry group due to low MAC in other groups. These results underscore the need for additional genetic analyses to be conducted in samples of non-European ancestry, not only to facilitate replication, but also to clarify potential differences in genetic risk factors across ancestries (Dick et al., 2017).

We find that GPS derived from markers associated with ethanol Consumption and MaxDrinks at modest p-value thresholds (p < 0.05 and above) predict those same outcomes in the ALSPAC sample. These results are consistent with the highly polygenic nature of alcohol phenotypes. Although GPS derived from relatively few markers (several hundred to ~25,000 for up to p < 0.01) were not predictive of outcome, those that are more inclusive do capture meaningful genetic liability. Furthermore, this interpretation is consistent with the null results of our replication attempts for markers with q < 0.50. Thus, it is likely that many of the variants implicated at marginal thresholds are incrementally contributing to risk, though to a degree too small to be detected in isolation. We observe similar trends when GPS are derived from the European ancestry group only, though results are less robust (Supplementary Table 3). We likely benefitted from the improved statistical power of the meta-analysis. These findings also provide support for the hypothesis that genetic variants impacting alcohol outcomes are largely similar across different ethnicities, though given variation with respect to allele presence/frequencies there are also likely to be ethnicity-specific genetic factors.

Our estimate of the heritability of consumption was far lower than estimates (Manolio et al., 2009; Zuk et al., 2012) obtained previously for consumption in young adults in previous studies (Rose et al., 2001; Palmer et al., 2013). Current methods used to calculate h2SNP typically result in estimates that are lower than those obtained using traditional biometric modeling. For example, a recent study of a Dutch sample assessed using the AUDIT reported a twin-based h2 estimate of 0.6, with a corresponding SNP-based estimate of h2SNP 0.33 (Mbarek et al., 2015). Such findings (“missing heritability”) have been discussed extensively (Manolio et al., 2009; Lee et al., 2011; Zuk et al., 2012; Brookfield, 2013; Koch, 2014), and may be due to rare variants, poor tagging of common variants, overestimation of heritability in twin studies, epistatic interactions, epigenetic factors, or other genomic phenomena. Accordingly, the low and non-significant h2SNP estimates for Problems and Maxdrinks may reflect the presence of other genetic factors, insufficient statistical power, or both (Supplementary Table 4). There are other potential explanations specific to the study and outcomes. First, recall bias is potentially an issue for Maxdrinks since ~50% of drinkers in our sample report having blacked out when drinking. There is also an overall elevation in misuse and associated problems during this age range which may reflect more environmental factors and mask genetic variation. Additionally, reports of alcohol misuse and associated consequences may reflect a certain degree of bravado at this age, when some youth may have less developmental perspective on alcohol problems. It is likely inappropriate to interpret these results as an indication that genetic factors do not impact alcohol Problems and Maxdrinks in this population, particularly given the positive associations between S4S-derived GPS and alcohol outcomes in an independent sample. Indeed, non-significant h2SNP estimates can mask meaningful genetic variation that simply does not account for a substantial component of phenotypic variance. In the current study, the rs671 variant within ALDH2, which is common in East Asian populations and is known to impact risk for AUD (Edenberg, 2007), was strongly associated with all three outcomes within the EAS-specific results (pConsumption 0.013, pMaxDrinks 0.00014, pSymptoms 0.0039). Nonetheless, this did not translate into significant h2SNP estimates within that ancestry group.


The results presented herein should be considered in light of several limitations. First, the phenotypes examined are based on self-report data, which do not offer the possibility of external verification and are subject to reporting bias. We attempted to mitigate potential issues by imposing cut-offs that allowed for variation in responses while restricting values to a reasonable realm of possibility. Second, several of the ancestry groups (American, East Asian, and South Asian descent) included in these analyses were rather small, resulting in large standard errors around parameter estimates for both individual variant analyses and aggregate tests (e.g., GCTA). While the European and African descent groups were larger and more statistically powerful, substantial sample sizes are necessary to reliably detect low heritability using GCTA, and these results should be interpreted with caution.

Many markers with q < 0.50 were evaluated only in the AFR group due to low MAC in other groups. Given that most alcohol-related GWAS have been conducted on samples of European ancestry, these markers were largely unavailable for replication attempts in other samples. Had we calculated q-values based only on markers available in the EUR group, or only on those available across all groups, the list of markers for follow-up would have differed, potentially impacting the overall outcome of the SNP-based replication assessments.

GPS parameter estimates are quite modest, indicating that scores would account for little of the total variance in ALSPAC outcomes. In addition, we report uncorrected p-values, as the tests are not independent and the appropriate correction approach is not immediately evident.

In spite of the above limitations, these analyses demonstrate that genetic factors are integrally involved in liability to alcohol use in college-aged students, at both individual variant and aggregate levels. Furthermore, the effects of many genetic variants are consistent across ethnicities, suggesting shared biological mechanisms. Critically, replication in an independent UK cohort indicates that the observed genetic effects are generalizable within a similarly aged cohort. These findings validate prior evidence of specific and general genetic effects on alcohol outcomes, and provide nascent support for novel loci that merit additional research.

Ethics Statement

All Spit for Science protocols were approved by the VCU Institutional Review Board (IRB).

Author Contributions

The author contributions to the current research include study design (BW, AE, DD, KK), sample collection (KP, AA, MC, JESavage, ZN), phenotype analyses and QC (JESalvatore, FA, SC), DNA sample processing and QC (CS, BR), genotype QC (BW, VW, JK) statistical and bioinformatic analyses (BW, AE, AW) and manuscript preparation (BW, AE, DD, KK).

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.


The Spit for Science project is supported by P20AA017828, R37AA011408, K02AA018755, and P50AA022537 from the National Institute on Alcohol Abuse and Alcoholism; UL1TR000058, from the National Center for Advancing Translational Studies; and UL1RR031990 from the National Center for Research Resources and National Institutes of Health Roadmap for Medical Research, with additional support for this research through R01AA018333 (DD/KK); K01AA021399 (AE); K01AA024152 (JESalvatore); and F31AA024380 (MC). We would like to thank the Virginia Commonwealth University students for making this study a success, as well as the many VCU faculty, students, and staff who contributed to the design and implementation of the project. With respect to the ALSPAC data, we are extremely grateful to all the families who took part in this study, the midwives for their help in recruiting them, and the whole ALSPAC team, which includes interviewers, computer and laboratory technicians, clerical workers, research scientists, volunteers, managers, receptionists and nurses. The UK Medical Research Council and the Wellcome Trust (Grant ref: 102215/2/13/2) and the University of Bristol provide core support for ALSPAC. This publication is the work of the authors and BW will serve as guarantor for the contents of this paper. This research was additionally specifically funded by support from MRC and ESRC (MR/L022206/1 and ES/L015471/1).

Supplementary Material

The Supplementary Material for this article can be found online at:


1000 Genomes Project Consortium, Auton, A., Brooks, L. D., Durbin, R. M., Garrison, E. P., Kang, H. M., et al. (2015). A global reference for human genetic variation. Nature 526, 68–74. doi: 10.1038/nature15393

PubMed Abstract | CrossRef Full Text | Google Scholar

Agrawal, A., Lynskey, M. T., Heath, A. C., and Chassin, L. (2011). Developing a genetically informative measure of alcohol consumption using past-12-month indices. J. Stud. Alcohol Drugs 72, 444–452. doi: 10.15288/jsad.2011.72.444

PubMed Abstract | CrossRef Full Text | Google Scholar

American Psychiatric Association (2013). Diagnostic and Statistical Manual of Mental Disorders: DSM-5. Washington, DC: American Psychiatric Association.

Babor, T., Higgins-Biddle, J., Saunders, J., and Monteiro, M. (2001). The Alcohol Use Disorders Identification Test. Guidelines for Use in Primary Health Care. 2nd Edn. Geneva: World Health Organization.

Google Scholar

Bergen, S. E., Gardner, C. O., and Kendler, K. S. (2007). Age-related changes in heritability of behavioral phenotypes over adolescence and young adulthood: a meta-analysis. Twin Res. Hum. Genet. 10, 423–433. doi: 10.1375/twin.10.3.423

PubMed Abstract | CrossRef Full Text | Google Scholar

Bhandage, A. K., Jin, Z., Bazov, I., Kononenko, O., Bakalkin, G., Korpi, E. R., et al. (2014). GABA-A and NMDA receptor subunit mRNA expression is altered in the caudate but not the putamen of the postmortem brains of alcoholics. Front. Cell. Neurosci. 8:415. doi: 10.3389/fncel.2014.00415

PubMed Abstract | CrossRef Full Text | Google Scholar

Bierut, L. J., Agrawal, A., Bucholz, K. K., Doheny, K. F., Laurie, C., Pugh, E., et al. (2010). A genome-wide association study of alcohol dependence. Proc. Natl. Acad. Sci. U.S.A. 107, 5082–5087. doi: 10.1073/pnas.0911109107

PubMed Abstract | CrossRef Full Text

Bigdeli, T. B., Neale, B. M., and Neale, M. C. (2014). Statistical properties of single-marker tests for rare variants. Twin Res. Hum. Genet. 17, 143–150. doi: 10.1017/thg.2014.17

PubMed Abstract | CrossRef Full Text | Google Scholar

Boyd, A., Golding, J., Macleod, J., Lawlor, D. A., Fraser, A., Henderson, J., et al. (2013). Cohort Profile: the ‘children of the 90s’–the index offspring of the Avon Longitudinal Study of Parents and Children. Int. J. Epidemiol. 42, 111–127. doi: 10.1093/ije/dys064

PubMed Abstract | CrossRef Full Text | Google Scholar

Brookfield, J. F. (2013). Quantitative genetics: heritability is not always missing. Curr. Biol. 23, R276–R278. doi: 10.1016/j.cub.2013.02.040

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, C. C., Chow, C. C., Tellier, L. C., Vattikuti, S., Purcell, S. M., and Lee, J. J. (2015). Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7. doi: 10.1186/s13742-015-0047-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, J., Hutchison, K. E., Calhoun, V. D., Claus, E. D., Turner, J. A., Sui, J., et al. (2015). CREB-BDNF pathway influences alcohol cue-elicited activation in drinkers. Hum. Brain Mapp. 36, 3007–3019. doi: 10.1002/hbm.22824

PubMed Abstract | CrossRef Full Text | Google Scholar

Dawson, D. A. (2000). US low-risk drinking guidelines: an examination of four alternatives. Alcohol. Clin. Exp. Res. 24, 1820–1829. doi: 10.1111/j.1530-0277.2000.tb01986.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Delaneau, O., Zagury, J. F., and Marchini, J. (2013). Improved whole-chromosome phasing for disease and population genetic studies. Nat. Methods 10, 5–6. doi: 10.1038/nmeth.2307

PubMed Abstract | CrossRef Full Text | Google Scholar

Detrait, E., Maurice, T., Hanon, E., Leclercq, K., and Lamberty, Y. (2014). Lack of synaptic vesicle protein SV2B protects against amyloid-β25−35-induced oxidative stress, cholinergic deficit and cognitive impairment in mice. Behav. Brain Res. 271, 277–285. doi: 10.1016/j.bbr.2014.06.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Dick, D. M., Barr, P., Guy, M., Nasim, A., and Scott, D. (2017). (Invited review) genetic research on alcohol use outcomes in African American populations: a review of the literature, associated challenges, and implications. Am. J. Addict. doi: 10.1111/ajad.12495. [Epub ahead of print].

CrossRef Full Text | Google Scholar

Dick, D. M., Meyers, J. L., Rose, R. J., Kaprio, J., and Kendler, K. S. (2011). Measures of current alcohol consumption and problems: two independent twin studies suggest a complex genetic architecture. Alcohol. Clin. Exp. Res. 35, 2152–2161. doi: 10.1111/j.1530-0277.2011.01564.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Dick, D. M., Nasim, A., Edwards, A. C., Salvatore, J. E., Cho, S. B., Adkins, A., et al. (2014). Spit for Science: launching a longitudinal study of genetic and environmental influences on substance use and emotional health at a large US university. Front. Genet. 5:47. doi: 10.3389/fgene.2014.00047

PubMed Abstract | CrossRef Full Text | Google Scholar

Dudbridge, F. (2013). Power and predictive accuracy of polygenic risk scores. PLoS Genet. 9:e1003348. doi: 10.1371/journal.pgen.1003348

PubMed Abstract | CrossRef Full Text | Google Scholar

Edenberg, H. J. (2007). The genetics of alcohol metabolism. Role of alcohol dehydrogenase and aldehyde dehydrogenase variants. Alcohol Res. Health 30, 5–13. Available online at:

PubMed Abstract

Edenberg, H. J., Koller, D. L., Xuei, X., Wetherill, L., McClintick, J. N., Almasy, L., et al. (2010). Genome-wide association study of alcohol dependence implicates a region on chromosome 11. Alcohol. Clin. Exp. Res. 34, 840–852. doi: 10.1111/j.1530-0277.2010.01156.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Edwards, A. C., Aliev, F., Bierut, L. J., Bucholz, K. K., Edenberg, H., Hesselbrock, V., et al. (2012). Genome-wide association study of comorbid depressive syndrome and alcohol dependence. Psychiatr. Genet. 22, 31–41. doi: 10.1097/YPG.0b013e32834acd07

PubMed Abstract | CrossRef Full Text | Google Scholar

Edwards, A. C., Aliev, F., Wolen, A. R., Salvatore, J. E., Gardner, C. O., McMahon, G., et al. (2015). Genomic influences on alcohol problems in a population-based sample of young adults. Addiction 110, 461–470. doi: 10.1111/add.12822

PubMed Abstract | CrossRef Full Text | Google Scholar

Edwards, A. C., and Kendler, K. S. (2013). Alcohol consumption in men is influenced by qualitatively different genetic factors in adolescence and adulthood. Psychol. Med. 43, 1857–1868. doi: 10.1017/S0033291712002917

PubMed Abstract | CrossRef Full Text | Google Scholar

Fallin, M. D., Lasseter, V. K., Avramopoulos, D., Nicodemus, K. K., Wolyniec, P. S., McGrath, J. A., et al. (2005). Bipolar I disorder and schizophrenia: a 440-single-nucleotide polymorphism screen of 64 candidate genes among Ashkenazi Jewish case-parent trios. Am. J. Hum. Genet. 77, 918–936. doi: 10.1086/497703

PubMed Abstract | CrossRef Full Text | Google Scholar

Fowler, T., Lifford, K., Shelton, K., Rice, F., Thapar, A., Neale, M. C., et al. (2007). Exploring the relationship between genetic and environmental influences on initiation and progression of substance use. Addiction 102, 413–422. doi: 10.1111/j.1360-0443.2006.01694.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Fraser, A., MacDonald-Wallis, C., Tilling, K., Boyd, A., Golding, J., Davey Smith, G., et al. (2013). Cohort profile: the avon longitudinal study of parents and children: ALSPAC mothers cohort. Int. J. Epidemiol. 42, 97–110. doi: 10.1093/ije/dys066

PubMed Abstract | CrossRef Full Text | Google Scholar

Gelernter, J., Kranzler, H. R., Sherva, R., Almasy, L., Koesterer, R., Smith, A. H., et al. (2014). Genome-wide association study of alcohol dependence:significant findings in African- and European-Americans including novel risk loci. Mol. Psychiatry 19, 41–49. doi: 10.1038/mp.2013.145

PubMed Abstract | CrossRef Full Text | Google Scholar

Hansell, N. K., Agrawal, A., Whitfield, J. B., Morley, K. I., Zhu, G., Lind, P. A., et al. (2008). Long-term stability and heritability of telephone interview measures of alcohol consumption and dependence. Twin Res. Hum. Genet. 11, 287–305. doi: 10.1375/twin.11.3.287

PubMed Abstract | CrossRef Full Text | Google Scholar

Harris, P. A., Taylor, R., Thielke, R., Payne, J., Gonzalez, N., and Conde, J. G. (2009). Research electronic data capture (REDCap)–a metadata-driven methodology and workflow process for providing translational research informatics support. J. Biomed. Inform. 42, 377–381. doi: 10.1016/j.jbi.2008.08.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Heath, A. C., Bucholz, K. K., Madden, P. A., Dinwiddie, S. H., Slutske, W. S., Bierut, L. J., et al. (1997). Genetic and environmental contributions to alcohol dependence risk in a national twin sample: consistency of findings in women and men. Psychol. Med. 27, 1381–1396. doi: 10.1017/S0033291797005643

PubMed Abstract | CrossRef Full Text | Google Scholar

Heath, A. C., Whitfield, J. B., Martin, N. G., Pergadia, M. L., Goate, A. M., Lind, P. A., et al. (2011). A quantitative-trait genome-wide association study of alcoholism risk in the community: findings and implications. Biol. Psychiatry. 70, 513–518. doi: 10.1016/j.biopsych.2011.02.028

PubMed Abstract | CrossRef Full Text | Google Scholar

Howie, B. N., Donnelly, P., and Marchini, J. (2009). A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 5:e1000529. doi: 10.1371/journal.pgen.1000529

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, J., Perlis, R. H., Lee, P. H., Rush, A. J., Fava, M., Sachs, G. S., et al. (2010). Cross-disorder genomewide analysis of schizophrenia, bipolar disorder, and depression. Am. J. Psychiatry 167, 1254–1263. doi: 10.1176/appi.ajp.2010.09091335

PubMed Abstract | CrossRef Full Text | Google Scholar

Huber, W., Carey, V. J., Gentleman, R., Anders, S., Carlson, M., Carvalho, B. S., et al. (2015). Orchestrating high-throughput genomic analysis with Bioconductor. Nat. Methods 12, 115–121. doi: 10.1038/nmeth.3252

PubMed Abstract | CrossRef Full Text | Google Scholar

Kalu, N., Ramchandani, V. A., Marshall, V., Scott, D., Ferguson, C., Cain, G., et al. (2012). Heritability of level of response and association with recent drinking history in nonalcohol-dependent drinkers. Alcohol. Clin. Exp. Res. 36, 1034–1041. doi: 10.1111/j.1530-0277.2011.01699.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Kendler, K. S., Gardner, C., and Dick, D. M. (2011a). Predicting alcohol consumption in adolescence from alcohol-specific and general externalizing genetic risk factors, key environmental exposures and their interaction. Psychol. Med. 41, 1507–1516. doi: 10.1017/S003329171000190X

PubMed Abstract | CrossRef Full Text | Google Scholar

Kendler, K. S., Jacobson, K. C., Gardner, C., Gillespie, N. A., Aggen, S. H., and Prescott, C. (2007). Creating a social world: a developmental twin study of peer group deviance. Arch. Gen. Psychiatry 64, 958–965. doi: 10.1001/archpsyc.64.8.958

PubMed Abstract | CrossRef Full Text | Google Scholar

Kendler, K. S., Kalsi, G., Holmans, P. A., Sanders, A. R., Aggen, S. H., Dick, D. M., et al. (2011b). Genomewide association analysis of symptoms of alcohol dependence in the molecular genetics of schizophrenia (MGS2) control sample. Alcohol. Clin. Exp. Res. 35, 963–975. doi: 10.1111/j.1530-0277.2010.01427.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Kendler, K. S., Myers, J., Dick, D., and Prescott, C. A. (2010). The relationship between genetic influences on alcohol dependence and on patterns of alcohol consumption. Alcohol. Clin. Exp. Res. 34, 1058–1065. doi: 10.1111/j.1530-0277.2010.01181.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Koch, L. (2014). Epigenetics: an epigenetic twist on the missing heritability of complex traits. Nat. Rev. Genet. 15, 218. doi: 10.1038/nrg3698

PubMed Abstract | CrossRef Full Text | Google Scholar

Kos, M. Z., Glahn, D. C., Carless, M. A., Olvera, R., McKay, D. R., Quillen, E. E., et al. (2014). Novel QTL at chromosome 6p22 for alcohol consumption: implications for the genetic liability of alcohol use disorders. Am. J. Med. Genet. B Neuropsychiatr. Genet. 165B, 294–302. doi: 10.1002/ajmg.b.32231

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, S. H., Wray, N. R., Goddard, M. E., and Visscher, P. M. (2011). Estimating missing heritability for disease from genome-wide association studies. Am. J. Hum. Genet. 88, 294–305. doi: 10.1016/j.ajhg.2011.02.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Mahalanobis, P. C. (1936). “On the generalised distance in statistics,” in Proceedings of the National Institute of Sciences in India, Vol. 2 (Calcutta), 49–55.

Manolio, T. A., Collins, F. S., Cox, N. J., Goldstein, D. B., Hindorff, L. A., Hunter, D. J., et al. (2009). Finding the missing heritability of complex diseases. Nature 461, 747–753. doi: 10.1038/nature08494

PubMed Abstract | CrossRef Full Text | Google Scholar

Marchini, J., Howie, B., Myers, S., McVean, G., and Donnelly, P. (2007). A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet. 39, 906–913. doi: 10.1038/ng2088

PubMed Abstract | CrossRef Full Text | Google Scholar

Mbarek, H., Milaneschi, Y., Fedko, I. O., Hottenga, J. J., De Moor, M. H., Jansen, R., et al. (2015). The genetics of alcohol dependence: twin and SNP-based heritability, and genome-wide association study based on AUDIT scores. Am. J. Med. Genet. B Neuropsychiatr. Genet. 168, 739–748. doi: 10.1002/ajmg.b.32379

PubMed Abstract | CrossRef Full Text | Google Scholar

McGue, M. (1999). The behavioral genetics of alcoholism. Curr. Dir. Psychol. Sci. 8, 109–115. doi: 10.1111/1467-8721.00026

CrossRef Full Text | Google Scholar

Meyers, J. L., Salvatore, J. E., Vuoksimaa, E., Korhonen, T., Pulkkinen, L., Rose, R. J., et al. (2014). Genetic influences on alcohol use behaviors have diverging developmental trajectories: a prospective study among male and female twins. Alcohol. Clin. Exp. Res. 38, 2869–2877. doi: 10.1111/acer.12560

PubMed Abstract | CrossRef Full Text | Google Scholar

Nenadic, I., Maitra, R., Scherpiet, S., Gaser, C., Schultz, C. C., Schachtzabel, C., et al. (2012). Glutamate receptor delta 1 (GRID1) genetic variation and brain structure in schizophrenia. J. Psychiatr. Res. 46, 1531–1539. doi: 10.1016/j.jpsychires.2012.08.026

PubMed Abstract | CrossRef Full Text | Google Scholar

Nurnberger, J. I. Jr., Koller, D. L., Jung, J., Edenberg, H. J., Foroud, T., Guella, I., et al. (2014). Identification of pathways for bipolar disorder: a meta-analysis. JAMA Psychiatry 71, 657–664. doi: 10.1001/jamapsychiatry.2014.176

PubMed Abstract | CrossRef Full Text | Google Scholar

Olson, C. R., Hodges, L. K., and Mello, C. V. (2015). Dynamic gene expression in the song system of zebra finches during the song learning period. Dev. Neurobiol. 75, 1315–1338. doi: 10.1002/dneu.22286

PubMed Abstract | CrossRef Full Text | Google Scholar

Palmer, R. H., Young, S. E., Corley, R. P., Hopfer, C. J., Stallings, M. C., and Hewitt, J. K. (2013). Stability and change of genetic and environmental effects on the common liability to alcohol, tobacco, and cannabis DSM-IV dependence symptoms. Behav. Genet. 43, 374–385. doi: 10.1007/s10519-013-9599-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Pan, Y., Luo, X., Liu, X., Wu, L. Y., Zhang, Q., Wang, L., et al. (2013). Genome-wide association studies of maximum number of drinks. J. Psychiatr. Res. 47, 1717–1724. doi: 10.1016/j.jpsychires.2013.07.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Patterson, N., Price, A. L., and Reich, D. (2006). Population structure and eigenanalysis. PLoS Genet. 2:e190. doi: 10.1371/journal.pgen.0020190

PubMed Abstract | CrossRef Full Text | Google Scholar

Pickard, B. S., Pieper, A. A., Porteous, D. J., Blackwood, D. H., and Muir, W. J. (2006). The NPAS3 gene–emerging evidence for a role in psychiatric illness. Ann. Med. 38, 439–448. doi: 10.1080/07853890600946500

PubMed Abstract | CrossRef Full Text | Google Scholar

Poelen, E. A., Engels, R. C., Scholte, R. H., Boomsma, D. I., and Willemsen, G. (2009). Similarities in drinking behavior of twin's friends: moderation of heritability of alcohol use. Behav. Genet. 39, 145–153. doi: 10.1007/s10519-008-9250-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Prescott, C. A., and Kendler, K. S. (1999). Genetic and environmental contributions to alcohol abuse and dependence in a population-based sample of male twins. Am. J. Psychiatry 156, 34–40. doi: 10.1176/ajp.156.1.34

PubMed Abstract | CrossRef Full Text | Google Scholar

Price, A. L., Patterson, N. J., Plenge, R. M., Weinblatt, M. E., Shadick, N. A., and Reich, D. (2006). Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909. doi: 10.1038/ng1847

PubMed Abstract | CrossRef Full Text | Google Scholar

Price, A. L., Weale, M. E., Patterson, N., Myers, S. R., Need, A. C., Shianna, K. V., et al. (2008). Long-range LD can confound genome scans in admixed populations. Am. J. Hum. Genet. 83, 132–135. author reply: 135–139. doi: 10.1016/j.ajhg.2008.06.005

PubMed Abstract | CrossRef Full Text

Pruim, R. J., Welch, R. P., Sanna, S., Teslovich, T. M., Chines, P. S., Gliedt, T. P., et al. (2010). LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 26, 2336–2337. doi: 10.1093/bioinformatics/btq419

PubMed Abstract | CrossRef Full Text | Google Scholar

Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A., Bender, D., et al. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575. doi: 10.1086/519795

PubMed Abstract | CrossRef Full Text | Google Scholar

Quinn, P. D., and Fromme, K. (2011). Alcohol use and related problems among college students and their noncollege peers: the competing roles of personality and peer influence. J. Stud. Alcohol Drugs 72, 622–632. doi: 10.15288/jsad.2011.72.622

PubMed Abstract | CrossRef Full Text | Google Scholar

R Core Team (2016). R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing. Available online at:

Rose, R. J., Dick, D. M., Viken, R. J., and Kaprio, J. (2001). Gene-environment interaction in patterns of adolescent drinking: regional residency moderates longitudinal influences on alcohol use. Alcohol. Clin. Exp. Res. 25, 637–643. doi: 10.1111/j.1530-0277.2001.tb02261.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Salvatore, J. E., Thomas, N. S., Cho, S. B., Adkins, A., Kendler, K. S., and Dick, D. M., (2016). The role of romantic relationship status in pathways of risk for emerging adult alcohol use. Psychol. Addict. Behav. 30, 335–344. doi: 10.1037/adb0000145

PubMed Abstract | CrossRef Full Text | Google Scholar

Samek, D. R., Keyes, M. A., Iacono, W. G., and McGue, M. (2013). Peer deviance, alcohol expectancies, and adolescent alcohol use: explaining shared and nonshared environmental effects using an adoptive sibling pair design. Behav. Genet. 43, 286–296. doi: 10.1007/s10519-013-9595-9

PubMed Abstract | CrossRef Full Text | Google Scholar

SAMHSA (2014). Results from the 2013 National Survey on Drug Use and Health: Summary of National Findings. Rockville, MD: Office of Applied Statistics.

Schizophrenia Working Group of the Psychiatric Genomics Consortium (2014). Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427. doi: 10.1038/nature13595

PubMed Abstract | CrossRef Full Text

Schumann, G., Coin, L. J., Lourdusamy, A., Charoen, P., Berger, K. H., Stacey, D., et al. (2011). Genome-wide association and genetic functional studies identify autism susceptibility candidate 2 gene (AUTS2) in the regulation of alcohol consumption. Proc. Natl. Acad. Sci. U.S.A. 108, 7119–7124. doi: 10.1073/pnas.1017288108

PubMed Abstract | CrossRef Full Text | Google Scholar

Sudmant, P. H., Rausch, T., Gardner, E. J., Handsaker, R. E., Abyzov, A., Huddleston, J., et al. (2015). An integrated map of structural variation in 2,504 human genomes. Nature 526, 75–81. doi: 10.1038/nature15394

PubMed Abstract | CrossRef Full Text | Google Scholar

Timberlake, D. S., Hopfer, C. J., Rhee, S. H., Friedman, N. P., Haberstick, B. C., Lessem, J. M., et al. (2007). College attendance and its effect on drinking behaviors in a longitudinal study of adolescents. Alcohol. Clin. Exp. Res. 31, 1020–1030. doi: 10.1111/j.1530-0277.2007.00383.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Treutlein, J., Cichon, S., Ridinger, M., Wodarz, N., Soyka, M., Zill, P., et al. (2009a). Genome-wide association study of alcohol dependence. Arch. Gen. Psychiatry 66, 773–784. doi: 10.1001/archgenpsychiatry.2009.83

PubMed Abstract | CrossRef Full Text | Google Scholar

Treutlein, J., Mühleisen, T. W., Frank, J., Mattheisen, M., Herms, S., Ludwig, K. U., et al. (2009b). Dissection of phenotype reveals possible association between schizophrenia and Glutamate Receptor Delta 1 (GRID1) gene promoter. Schizophr. Res. 111, 123–130. doi: 10.1016/j.schres.2009.03.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Verhulst, B., Neale, M. C., and Kendler, K. S. (2015). The heritability of alcohol use disorders: a meta-analysis of twin and adoption studies. Psychol. Med. 45, 1061–1072. doi: 10.1017/S0033291714002165

PubMed Abstract | CrossRef Full Text | Google Scholar

Vrieze, S. I., Hicks, B. M., Iacono, W. G., and McGue, M. (2012). Decline in genetic influence on the co-occurrence of alcohol, marijuana, and nicotine dependence symptoms from age 14 to 29. Am. J. Psychiatry 169, 1073–1081. doi: 10.1176/appi.ajp.2012.11081268

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J. C., Foroud, T., Hinrichs, A. L., Le, N. X., Bertelsen, S., Budde, J. P., et al. (2013). A genome-wide association study of alcohol-dependence symptom counts in extended pedigrees identifies C15orf53. Mol. Psychiatry 18, 1218–1224. doi: 10.1038/mp.2012.143

PubMed Abstract | CrossRef Full Text | Google Scholar

Willer, C. J., Li, Y., and Abecasis, G. R. (2010). METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191. doi: 10.1093/bioinformatics/btq340

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, J., Lee, S. H., Goddard, M. E., and Visscher, P. M. (2011). GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82. doi: 10.1016/j.ajhg.2010.11.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Zuk, O., Hechter, E., Sunyaev, S. R., and Lander, E. S. (2012). The mystery of missing heritability: genetic interactions create phantom heritability. Proc. Natl. Acad. Sci. U.S.A. 109, 1193–1198. doi: 10.1073/pnas.1119675109

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: alcohol problems, alcohol consumption, GWAS, heritability, genetic ancestry, genome-wide polygenic score

Citation: Webb BT, Edwards AC, Wolen AR, Salvatore JE, Aliev F, Riley BP, Sun C, Williamson VS, Kitchens JN, Pedersen K, Adkins A, Cooke ME, Savage JE, Neale Z, Cho SB, Dick DM and Kendler KS (2017) Molecular Genetic Influences on Normative and Problematic Alcohol Use in a Population-Based Sample of College Students. Front. Genet. 8:30. doi: 10.3389/fgene.2017.00030

Received: 19 July 2016; Accepted: 27 February 2017;
Published: 15 March 2017.

Edited by:

Jenae M. Neiderhiser, Pennsylvania State University, USA

Reviewed by:

Juko Ando, Keio University, Japan
Toni Clarke, University of Edinburgh, UK

Copyright © 2017 Webb, Edwards, Wolen, Salvatore, Aliev, Riley, Sun, Williamson, Kitchens, Pedersen, Adkins, Cooke, Savage, Neale, Cho, Dick and Kendler. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Bradley T. Webb,

These authors have contributed equally to this work.

These authors jointly supervised this work.