Original Research ARTICLE
A Genome-Wide Linkage Study for Chronic Obstructive Pulmonary Disease in a Dutch Genetic Isolate Identifies Novel Rare Candidate Variants
- 1Department of Epidemiology, Erasmus Medical Center, Rotterdam, Netherlands
- 2Department of Respiratory Medicine, Ghent University Hospital, Ghent, Belgium
- 3Department of Epidemiology, University Medical Center Groningen, University of Groningen, Groningen, Netherlands
- 4Groningen Research Institute for Asthma and COPD, University Medical Center Groningen, University of Groningen, Groningen, Netherlands
- 5Pharmaceutical Care Unit, Department of Bioanalysis, Ghent University, Ghent, Belgium
- 6Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, Netherlands
- 7Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, United States
- 8Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Boston, MA, United States
- 9Department of Respiratory Medicine, Erasmus Medical Center, Rotterdam, Netherlands
- 10Department of Pulmonary Medicine and Tuberculosis, University Medical Center Groningen, University of Groningen, Groningen, Netherlands
Chronic obstructive pulmonary disease (COPD) is a complex and heritable disease, associated with multiple genetic variants. Specific familial types of COPD may be explained by rare variants, which have not been widely studied. We aimed to discover rare genetic variants underlying COPD through a genome-wide linkage scan. Affected-only analysis was performed using the 6K Illumina Linkage IV Panel in 142 cases clustered in 27 families from a genetic isolate, the Erasmus Rucphen Family (ERF) study. Potential causal variants were identified by searching for shared rare variants in the exome-sequence data of the affected members of the families contributing most to the linkage peak. The identified rare variants were then tested for association with COPD in a large meta-analysis of several cohorts. Significant evidence for linkage was observed on chromosomes 15q14–15q25 [logarithm of the odds (LOD) score = 5.52], 11p15.4–11q14.1 (LOD = 3.71) and 5q14.3–5q33.2 (LOD = 3.49). In the chromosome 15 peak, that harbors the known COPD locus for nicotinic receptors, and in the chromosome 5 peak we could not identify shared variants. In the chromosome 11 locus, we identified four rare (minor allele frequency (MAF) <0.02), predicted pathogenic, missense variants. These were shared among the affected family members. The identified variants localize to genes including neuroblast differentiation-associated protein (AHNAK), previously associated with blood biomarkers in COPD, phospholipase C Beta 3 (PLCB3), shown to increase airway hyper-responsiveness, solute carrier family 22-A11 (SLC22A11), involved in amino acid metabolism and ion transport, and metallothionein-like protein 5 (MTL5), involved in nicotinate and nicotinamide metabolism. Association of SLC22A11 and MTL5 variants were confirmed in the meta-analysis of 9,888 cases and 27,060 controls. In conclusion, we have identified novel rare variants in plausible genes related to COPD. Further studies utilizing large sample whole-genome sequencing should further confirm the associations at chromosome 11 and investigate the chromosome 15 and 5 linked regions.
Chronic obstructive pulmonary disease is a common and complex disease, and one of the leading causes of death worldwide (Lozano and Naghavi, 2012). Previous studies provided heritability estimates for COPD of 20% to even 60% (Ingebrigtsen et al., 2010; Zhou et al., 2013). Both rare variants with a large impact and common variants with a modest impact on the risk to develop COPD have been identified. The SERPINA1 gene at chromosome 14q32.13, encoding AAT, was in fact the first gene identified to be associated with COPD (Laurell and Eriksson, 1963; Bashir et al., 2016). Rare variants in SERPINA1 are known to contribute to COPD risk in AAT deficiency in homozygous and heterozygous carriers of the low-frequency Z allele (Foreman et al., 2017). In an exome study of severe, early-onset families, Qiao et al. (2016) identified several genes with rare variants segregating in at least two pedigrees. In extended families, genetic linkage studies have found evidence of linkage to chromosomes 2q, 6q, 8p, 12p, and 19q, among others (Silverman et al., 2002; Palmer et al., 2003). However, many initially promising findings from linkage or exome sequencing in candidate-gene studies could not be replicated in subsequent analyses (Hersh et al., 2005).
Common variants in several genes have been identified in multiple GWAS, to be associated with COPD or obstructive lung function impairment. Among consistently replicated loci in GWAS are genes on chromosome 4 – Hedgehog-interacting protein (HHIP) and Family with sequence similarity 13 member A (FAM13A), chromosome 5 – 5-hydroxytryptamine receptor 4 (HTR4), chromosome 15 – Nicotinic cholinergic receptors (CHRNA3/5) and Ion-responsive element binding protein 2 (IREB2) and chromosome 19 – Cytochrome P450 family gene (CYP2A6), member RAS oncogene family gene (RAB4B) and Egl-9 family hypoxic-inducible factor 2 (EGLN2) (Hobbs et al., 2017; Wain et al., 2017). However, only few loci identified in GWAS could be functionally explained.
Despite the undeniable progress in understanding the genetic origins of COPD, a major part of its heritability remains unexplained. A complicating factor in studies on the genetics of COPD is that COPD is considered a complex genetic trait, i.e., multiple, possibly interacting, genetic and environmental factors are involved. Therefore, there is a need for fine mapping techniques that can identify functional, rare variants with large effects explaining specific types of COPD. Rare variant association studies can be carried out in relatively small sample sizes when using family-based settings (Auer and Lettre, 2015). In a genetically isolated population, alleles that are found at low or very-low (rare) frequencies in control samples may reach much higher proportions due to a limited number of founder individuals, genetic drift, minimal immigration, and high inbreeding (Pardo et al., 2005). Therefore, attempting to identify risk genes for COPD in populations that are relatively genetically and environmentally homogeneous could be beneficial (Van Diemen et al., 2010).
This study uses the ERF study, a Dutch genetically isolated population, to localize and identify rare genetic variants and subsequently shows the relevance of these variants in the general population by performing an association analysis in a large sample.
Materials and Methods
The linkage study was performed in 142 related participants from the ERF study. ERF is a family-based cohort study, studied as part of the Genetic Research in Isolated Population (GRIP) program. It is based in a genetically isolated community from the south-west area of the Netherlands, set up to investigate genes underlying different quantitative traits and common diseases (Pardo et al., 2005). The participants of ERF are living descendants of 22 couples from the religious isolate in the 19th century, who had at least six children baptized in the community church. The baseline data collection for over 3,000 people was conducted between June 2002 and February 2005. These individuals are related to each other through multiple lines of descent in a single large pedigree spanning 23 generations and connecting over 23,000 individuals. In 2015 a follow-up data collection for 1,500 participants was performed by reviewing general practitioner’s records, including letters from the specialists and spirometry reports and medication use. In total 192 probable COPD cases were identified in the follow-up. The COPD diagnosis was confirmed by respiratory specialists based on an obstructive lung function, i.e., the ratio of Forced Expiratory Volume in one second over the Forced Vital Capacity (FEV1/FVC) < 0.7, with or without medication use (n = 116). If the information on FVC was missing (n = 14), the following criteria for COPD were used: FEV1 < 80%, use of respiratory medication and a COPD diagnosis in the report of the respiratory specialist to the general practitioner. If no lung function measurement was available (n = 15), COPD diagnosis was based on: medication use with CT-scan of the lungs indicating COPD and/or a history of frequent COPD exacerbations mentioned in the medical documents. Thus, the COPD diagnosis could be confirmed for 145 participants, of which 3 did not have genotyping data, resulting in the final sample size for the linkage study of 142 COPD cases.
The association analysis was performed using data from the RS (1,588 cases and 9,784 controls), the LLS (1,647 cases and 9,530 controls), the VlaVla study (375 cases and 1,019 controls) and the data from the study of Hobbs et al. (2016) (6,161 cases and 6,004 controls), in addition to the ERF study (117 cases and 1,091 controls).
Rotterdam Study is a prospective, population-based study (Ikram et al., 2017), focusing on the diseases in the participants aged 45 or older. The COPD diagnosis in the RS was defined as having pre-bronchodilator obstructive spirometry (FEV1/FVC < 0.7), assessed either by spirometry in the research center or by reviewing medical histories of the participants. Spirometry was performed by trained paramedical personnel, according to the guidelines of the American Thoracic Society/European Respiratory Society (ATS/ERS). In absence of interpretable spirometry measures, all medical information of subjects regularly using respiratory medication was reviewed, including files from specialists and general practitioners, to confirm a diagnosis of COPD. Both ERF and RS have been approved by the Medical Ethics Committee of the Erasmus Medical Center. All participants provided written informed consent to participate in the study and to obtain information from their treating physicians.
Lifelines study is a multi-disciplinary prospective population-based cohort of the Northern provinces of the Netherlands with a three generation design, focusing on the onset of common complex diseases (Scholtens et al., 2015). COPD was defined as having pre-bronchodilator FEV1/FVC < 0.7, assessed by spirometry using a Welch Allyn Version 188.8.131.529, PC-based SpiroPerfect with Ca Workstation software. All subjects provided written informed consent and the study was approved by the Medical Ethics Committee of the University Medical Center Groningen, Groningen, Netherlands.
The VlaVla is a prospective, Dutch population-based cohort including individuals from Vlagtwedde (a rural area) and Vlaardingen (an urban area), aimed to gain insight into the risk factors for chronic airway diseases and lung function (Van Diemen et al., 2005). COPD was defined as having pre-bronchodilator FEV1/FVC < 0.7. Data of the last survey in 1989/1990 were used and spirometry data were collected by performing a slow inspiratory maneuver, using a water-sealed spirometer (Lode instruments, Groningen, Netherlands). The Committee on Human Subjects in Research of the University of Groningen reviewed the study and affirmed the safety of the protocol and study design and all participants gave their written informed consent.
In the study by Hobbs et al. (2016) COPD cases were defined as having FEV1/FVC ≤ 0.7 and FEV1 ≤ 80% of the predicted value. It was multi-ethnic study with Asian, African, and European ancestry individuals. Institutional review board approval and written informed consent were obtained for all these cohorts. For more details please refer to their publication (Hobbs et al., 2016).
For all participants, DNA was extracted from venous blood using the salting out method (Miller et al., 1988).
For the linkage analysis genotyping was performed using the 6K Illumina Linkage IV panel (Illumina, San Diego, CA, United States). Further, QC was performed involving exclusion of the variants with call rate <98%, those diverging from Hardy–Weinberg equilibrium (P < 10-8) and X-chromosome variants and participants with an overall call rate <96%. Mendelian inconsistencies were designated as missing genotypes. The final dataset comprised 5,250 autosomal SNVs in 3,018 participants.
Exome-Sequencing and Genotyping
The sequencing and genotyping in the ERF study have been described elsewhere (Amin et al., 2017). In short, for 1,336 ERF participants whole exome sequencing was performed at a mean depth of 74× (Agilent, v4 capture). After QC, 543,954 SNVs in 1,327 participants were retained. For 1,527 individuals whose exomes were not sequenced, the Illumina Infinium HumanExome BeadChip v1.1 was used for genotyping and variant calling was done using Genome Studio. After QC 70,000 polymorphic SNVs in 1,515 participants were retrieved. Of these, the overlap with COPD status information was available for 636 participants (59 cases and 577 controls) with exome-sequence and 572 participants (58 cases and 514 controls) with exome-chip data. The cases overlap with the sample used in the linkage analysis. The ERF data is available in the EGA public repository1 with ID number: EGAS00001001134.
The RS was genotyped using Illumina 550K and Illumina 610K and 660K arrays, and genotyping QC was done as described elsewhere (Iglesias et al., 2017). HRC imputation panel (McCarthy et al., 2015) was used for imputation. File preparation and imputation was done as described elsewhere (Iglesias et al., 2017). In the final dataset we included 11,372 participants of RS (cases and controls) with HRC imputed genotype data available.
In LLS and VlaVla the genotyping was done using Illumina CytoSNP-12 arrays and QC was done as described elsewhere (De Jong et al., 2015). The GoNL panel was used for imputation of LLS and VlaVla and was done as described elsewhere (Scholtens et al., 2015). The final dataset included 11,177 participants of LLS and 1,394 of VlaVla.
In Hobbs et al. (2016) work all individuals were genotyped using the Illumina HumanExome arrays (v1.1 and v1.2; Illumina, San Diego, CA, United States). For more information please refer to their publication (Hobbs et al., 2016).
Genome-Wide Linkage Analysis
For the genome-wide linkage analysis, 142 related COPD cases from ERF were used. The cases were linked in a single large pedigree of 23 generations. However, due to the linkage software restraints, the cases were clustered into 27 smaller (≤24 bits) families using PEDCUT software (Liu et al., 2008). We used HaploPainter (Thiele and Nürnberg, 2005) to illustrate all 27 pedigrees (Supplementary Figure S1). We then performed affected-only parametric linkage analysis in MERLIN software (Abecasis et al., 2002) using incomplete penetrance and no phenocopies for both dominant (0, 0.5, 0.5) and recessive models (0, 0, 0.5) (Durner et al., 1999). The measure of the likelihood of linkage is the LOD score and we considered LOD ≥ 3.3 to be statistically significant. Further we performed per-family analysis for significant regions to identify the families with COPD cases contributing the most to the LOD score.
Identification of Variants in the Identified Regions
Next, we used exome-sequence data in ERF to identify rare variants that may explain the identified linkage peaks. For this, among all variants in this region we selected only variants with predicted damaging effects on protein (missense and stop-coding) based on the FunctionGVS column of the SeattleSeq Annotation database2 from the National Heart, Lung and Blood Institute (NHLBI) and with MAF < 0.05 in the general population (1000 Genomes). As frequencies in a genetically isolated population may be inflated or deflated due to genetic drift (Pardo et al., 2005), we used the MAF from the general population for filtering. We selected variants shared among most (>50%) of the affected family members as candidate variants.
A formal test of association was performed for the identified candidate variants in each study – ERF, in samples with exome-sequence (N = 636) and in exome-chip (N = 572) data, in three RS cohorts (RS-I, RS-II, and RS-III), using the HRC imputed data (N = 11,372), the LLS (N = 11,177), the VlaVla cohort (N = 1,394) and the Hobbs et al. (2016) results (N = 11,797). For this analysis, in ERF we used “seqMeta” package in R (Voorman et al., 2013) to perform single-variant analysis, adjusted for age, sex, and smoking status (current/past/never smoking). Logistic regression analysis was used to associate the variants in the RS and the VlaVla cohort, using SPSS software (Norušis, 1992) and in LLS, using PLINK (Purcell et al., 2007), applying the same models as used in ERF. Variants were excluded from the analysis if the minor allele count was less than five in either the case or the control category. Summary statistics for identified the variants were extracted from the results of Hobbs et al. (2016). A fixed-effects meta-analysis was performed with the summary statistics from all studies using the “rmeta” package in R (Lumley, 2011).
Functional Look-Up of the Genes
We investigated the Ingenuity Knowledge Base for functional annotation and look up of the genes, harboring the identified variants (IPA, Qiagen bioinformatics) (IPA, 2015). Furthermore, we consulted the Gene network tool (Fehrmann et al., 2015), a bioinformatics database containing co-expression data, functional predictions from gene ontology, Biocarta and the KEGG to investigate our findings.
The general characteristics of the study samples are presented in Table 1. All 27 families included in the linkage analyses in ERF are depicted in Supplementary Figure S1. The affected relatives were mainly smokers: 81.7% of the cases included in the linkage analyses were current or ex-smokers. As shown in Table 2 and Figure 1, we identified significant evidence for linkage of COPD to chromosomes 15q14–15q25 (HLOD = 5.52), 11p15.4–11q14.1 (HLOD = 3.71), and 5q14.3–5q33.2 (HLOD = 3.49).
FIGURE 1. Logarithm of the odds (LOD) score plot for the regions at (A) chromosomes 5, (B) 11 and (C) 15. X-axis shows the chromosomal position in cM and the Y-axis shows the heterogeneity log of odds score (HLOD) score. Red line represents HLOD scores for recessive and green line for dominant model. Dashed red line represents the level of significance (HLOD = 3.3), while dashed black line represents the suggestive level (HLOD = 2).
We next searched for rare, deleterious and shared variants by most (>50%) of the affected family members in the three identified regions mentioned above. In the linked regions of chromosomes 5 and 15 we could not identify any variants that passed mentioned filtering criteria. For the linked region on chromosome 11, we identified two families that were contributing most (LOD > 1) to the linkage score (Figure 2). Exome-sequence data were available for 8 of 17 COPD cases from these two families. We identified four missense variants including rs116243978 (AHNAK), rs35169799 (PLCB3), rs141159367 (SLC22A11), and rs146043252 (MTL5), shared among five of the eight affected family members (Table 3). Each of these variants was predicted to be highly pathogenic (CADD > 15, PolyPhen > 0.98) which suggests their relevance for the disease development. Of these four variants, one (rs141159367 in SLC22A11) showed a significant association (OR = 1.87, P = 0.002) with COPD in the meta-analysis (Table 4). The variant rs146043252 in MTL5 showed a nominal association signal (OR = 1.66, P = 0.04).
FIGURE 2. The two sub-families contributing most to the linkage peak on chromosome 11. Squares represent males and circles females. Cases are denoted in black, known controls are denoted in gray and the family members for which we do not have chronic obstructive pulmonary disease (COPD) information are denoted in white. Family members with dot in the middle are not included in Erasmus Rucphen family (ERF) study and for them only pedigree information was available. Deceased family members are crossed. For cases with exome-sequence data used in the sharing analysis information on 5-year age range (in years) is provided.
TABLE 3. Deleterious variants from chromosome 11q (missense, stop codon or CADD > 15) with a frequency in the 1000 genomes <0.05 that are shared by at least 5 cases.
In this study, we found significant evidence for extensive linkage of COPD to the chromosomes 15q14–15q25 (40.1 Mb), 11p15.4–11q14.1 (73.9 Mb), and 5q14.3–5q33.2 (64.1 Mb). We were able to identify four rare and predicted pathogenic variants under the chromosome 11 peak, in plausible genes (AHNAK, PLCB3, SLC22A11, and MTL5), shared by at least five family members. One of these four variants, i.e., rs141159367 in SLC22A11, was significantly associated with COPD in 9,888 cases and 27,428 controls (P = 0.002) while another variant (rs146043252 in MTL5) showed nominal association with COPD (P = 0.04).
The finding of our family-based linkage analysis aligns with that of large scale GWASs implicating the CHRNA3/5-CHRNB4, and IREB2 region on chromosome 15q25 in COPD development. This region is also associated with lung cancer, peripheral arterial disease, nicotine addiction and smoking quantity (Thorgeirsson et al., 2008). The evidence in the literature on the role of smoking in the genetic risk of COPD thus far is controversial. On one hand, there is evidence to support that the variants in this region, although implicated in both lung disease and smoking behavior, are associated with COPD susceptibility, independently of cigarette smoke exposure (Hardin and Silverman, 2014). On the other hand, in a previous study we show that two variants, previously associated with COPD in the CHRNA3/5 locus, were associated with lung function measurements in ever-smokers, but not in never-smokers (van der Plaat et al., 2017), which is in line with the only longitudinal study on the relation between the nicotine receptor variant and annual lung function decline (Budulac et al., 2012). That study shows that carriers of the nicotinic receptors variants are significantly less able to quit smoking, leading to the lung function decline and, subsequently to COPD. Similarly, for the chromosome 5 linked region, we could not observe any shared rare variant. This region, known for its associations with pulmonary function and airflow obstruction (Hancock et al., 2010; Wilk et al., 2012) was recently associated with COPD by the largest GWAS to date (Hobbs et al., 2017). The HTR4 gene in 5q32 encodes a serotonin receptor involved in depression and is strongly expressed in respiratory complex neurons (Manzke et al., 2003).
However, the functional variants in these regions have still not been confirmed. In our families, we could not identify rare damaging variants shared between the cases in this region. This may be explained if rare intronic regulatory variants play a key role, which we could not investigate using the exome data. It is unlikely that these linkage peaks are attributed to the common variants which have small effects identified in GWASs, given the very strong evidence for linkage of this region to COPD. Future studies using whole-genome sequencing should investigate this region further, ideally in never smokers. This emphasizes the need for integration of available genomic information into more focused, candidate-gene based efforts to disentangle the functional role of the chromosome 5 and 15 regions.
In the identified region of chromosome 11 we were able to pinpoint four strong candidate genes for the association with COPD, i.e., SLC22A11, AHNAK, PLCB3, and MTL5. The most interesting finding is the rare variant in SLC22A11 (solute carrier family 22 member 11), which encodes an integral membrane protein and part of the family of OATs, known to mediate the absorption and elimination of endogenous and exogenous organic anions and as such, are involved in the pharmacokinetic, pharmacodynamic and safety profiles in a wide range of drugs (Bosquillon, 2010). SLC22A11 (OAT4) is mainly expressed in kidney and placenta. However, it is also shown to be expressed in lung tissue, fibroblasts and T-lymphocytes (P < 5 × 10-7), among other tissues/cells reported in the Gene network (Fehrmann et al., 2015). In addition, in vitro SLC22A11 mRNA was absent in normal human bronchial epithelial cells, but highly expressed in other bronchial cells models comprising transformed cells (Endter et al., 2009). SLC22A11 in particular is known to be a drug target for probenecid, a SLC22A11 inhibitor, used in the gout prevention and to increase antibiotic blood levels, yet its direct role in lung disease treatment is still unknown (Bosquillon, 2010).
Our linkage analysis yielded different regions compared with those identified earlier. However, the fact that both SLC22A11 and MTL5 variants were associated with COPD in our meta-analysis confirms their role in COPD and makes them even more interesting candidates. MTL5 (metallothionein-like protein 5) encodes testis expressed metallothionein like proteins (TESMIN). They are highly conserved, low-molecular-weight cysteine-rich proteins induced by and binding to heavy metal ions, and they do not have enzymatic activity. They play a central role in the regulation of cell growth and differentiation, and are involved in spermatogenesis, differentially regulating meiosis in male and female cells (Olesen et al., 2004). MTL5 was shown to be involved in nicotinate and nicotinamide metabolism and is also expressed in fibroblasts and lung tissue (P < 7 × 10-29), based on the Gene network (Fehrmann et al., 2015). Metallothioneins were additionally shown to protect cells against oxidative stress damage and participate in differentiation, proliferation and/or apoptosis of normal and lung cancer cells (Werynska et al., 2015).
The main strength of our study is the genetically isolated family-based population, which can display increased frequencies of some variants found at very low proportions in panmictic populations. This allowed us to perform a genome-wide linkage scan and identify rare coding variants. However, even though we identified linkage of three regions to COPD, a limitation of our study is the low power to explain the peaks at chromosomes 5 and 15, possibly due to the use of exome data. As intronic regulatory variants may play a significant role, in the future, faster and cheaper whole-genome sequencing will allow us to improve identification of rare variants and our understanding of their involvement in COPD. As our sample consists of high percentage of current or ex-smokers, it is possible that we are demonstrating genetic effects on smoking which further affects the development of COPD. Nevertheless, we were able to demonstrate a positive association, independent of smoking, of two variants in the association meta-analysis comprising 9,888 cases and 27,060 controls. Yet, studies with very large sample sizes utilizing mediation or mendelian randomization techniques are needed to disentangle these relationships and confirm our results in the general population.
Using the powerful genome-wide linkage scan in a Dutch genetic isolate, we have confirmed the implication of the 15q25 region in COPD and identified regions at chromosomes 5 and 11. Within the region on chromosome 11 we identified four deleterious rare variants shared between most of the affected family members in AHNAK, PLCB3, SLC22A11 and MTL5. The variants in SLC22A11 and MTL5 were significantly associated with COPD in our meta-analysis. Further studies pooling large sample sizes could confirm the role of the identified rare variants at chromosome 11 in the general population. Similarly, large studies utilizing whole-genome sequencing should further investigate the role of linked regions in chromosomes 5 and 15 in COPD.
IN, NA, NT, LL, JV, DvdP, BH, DQ, and MC were involved in the analysis of the data. IN, JV, DvdP, CCvD, DvdP, HB, CMvD, and NA contributed to the conception and design of this work and were involved in the interpretation of the results. IN, JV, DvdP, LL, GB, and DvdP were involved in data collection/preparation. All authors were involved in writing and critically revising the manuscript, approved the final manuscript, and agreed to be accountable for it.
The ERF study as a part of EUROSPAN (European Special Populations Research Network) was supported by European Commission FP6 STRP grant number 018947 (LSHG-CT-2006-01947) and also received funding from the European Community’s Seventh Framework Program (FP7/2007-2013)/grant agreement HEALTH-F4-2007-201413 by the European Commission under the program “Quality of Life and Management of the Living Resources” of 5th Framework Program (No. QLG2-CT-2002-01254). The ERF study was further supported by ENGAGE Consortium and CMSB. High-throughput analysis of the ERF data was supported by joint grant from Netherlands Organisation for Scientific Research and the Russian Foundation for Basic Research (NWO-RFBR 047.017.043). Exome-sequencing in ERF was supported by the ZonMw grant (project 91111025). DvdP and NA were supported by grant number 4.1.13.007 of Lung Foundation Netherlands (Longfonds). NT was supported by a grant from the Fund for Scientific Research Flanders (FWO) project (G035014N). The Rotterdam Study is funded by Erasmus Medical Center and Erasmus University Rotterdam, The Netherlands Organization for the Health Research and Development (ZonMw), the Research Institute for Diseases in the Elderly (RIDE), the Ministry of Education, Culture and Science, the Ministry of Health, Welfare and Sport, the European Commission (DG XII), and the Municipality of Rotterdam. The generation and management of GWAS genotype data for the Rotterdam Study (RS-I, RS-II, and RS-III) were executed by the Human Genotyping Facility of the Genetic Laboratory of the Department of Internal Medicine, Erasmus MC, Rotterdam, Netherlands. The GWAS datasets are supported by the Netherlands Organization for Scientific Research NWO Investments (nr. 175.010.2005.011, 911-03-012), the Genetic Laboratory of the Department of Internal Medicine, Erasmus MC, the Research Institute for Diseases in the Elderly (014-93-015; RIDE2), the Netherlands Genomics Initiative (NGI)/Netherlands Organization for Scientific Research (NWO) Netherlands Consortium for Healthy Aging (NCHA), project nr. 050-060-810. The LifeLines Biobank initiative has been made possible by funds from FES (Fonds Economische Structuurversterking), SNN (Samenwerkingsverband Noord-Nederland), and REP (Ruimtelijk Economisch Programma). The Vlagtwedde-Vlaardingen cohort study was supported by the Ministry of Health and Environmental Hygiene of The Netherlands and The Netherlands Asthma Fund (grant 187) and The Netherlands Asthma Fund grant no. 3.2.02.51, the Stichting Astma Bestrijding, BBMRI-NL (Complementation project), and the European Respiratory Society COPD research award 2011 (to HB). The Hobbs et al. (2016) work was supported by R01HL113264, R01HL137927, R01HL089897, R01HL089856, K01HL129039, K08HL136928, and the Parker B. Francis Research Opportunity Award. MC has received grant support from GSK. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. The funding body has no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
We are grateful to all ERF study participants and their relatives, general practitioners, and pulmonologists for their contributions, P. Veraart for her help in genealogy, J. Vergeer for the supervision of the laboratory work, Sven van der Lee and Ashley van der Spek for follow-up data collection, and P. Snijders for his help in data collection. We are grateful to all the study participants, the staff, the participating general practitioners, and pharmacists. We thank Pascal Arp, Mila Jhamai, Marijn Verkerk, Lizbeth Herrera, and Marjolein Peters M.Sc., and Carolina Medina-Gomez M.Sc., for their help in creating the RS GWAS database and Linda Broer for creation of HRC imputed data. We are also grateful to Edwin Silverman for critically revising the manuscript.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2018.00133/full#supplementary-material
AAT, Alpha-1-antitrypsin; ATS, American thoracic society; CADD, Combined Annotation Dependent Depletion; COPD, chronic obstructive pulmonary disease; CT, computed tomography scan; ERF, Erasmus Rucphen Family; ERS, European Respiratory Society; FEV1, forced expiratory volume in one second; FVC, forced vital capacity; GoNL, genome of the Netherlands; GWAS, genome-wide association study; HLOD, heterogeneity log of odds score; HRC, haplotype reference consortium; KEGG, Kyoto Encyclopedia of Genes and Genomes; LLS, lifelines study; LOD, logarithm of the odds; MAF, minor allele frequency; OATs, organic anion transporters; QC, quality control; RS, Rotterdam Study; SNVs, single nucleotide variants; VlaVla, Vlagtwedde/Vlaardingen-study.
Amin, N., de Vrij, F. M. S., Baghdadi, M., Brouwer, R. W. W., van Rooij, J. G. J., Jovanova, O., et al. (2017). A rare missense variant in RCL1 segregates with depression in extended families. Mol. Psychiatry. doi: 10.1038/mp.2017.49 [Epub ahead of print].
Bashir, A., Shah, N. N., Hazari, Y. M., Habib, M., Bashir, S., Hilal, N., et al. (2016). Novel variants of SERPIN1A gene: interplay between alpha1-antitrypsin deficiency and chronic obstructive pulmonary disease. Respir. Med. 117, 139–149. doi: 10.1016/j.rmed.2016.06.005
Budulac, S. E., Vonk, J. M., Postma, D. S., Siedlinski, M., Timens, W., and Boezen, M. H. (2012). Nicotinic acetylcholine receptor variants are related to smoking habits, but not directly to COPD. PLoS One 7:e33386. doi: 10.1371/journal.pone.0033386
De Jong, K., Vonk, J. M., Timens, W., Bossé, Y., Sin, D. D., Hao, K., et al. (2015). Genome-wide interaction study of gene-by-occupational exposure and effects on FEV1 levels. J. Allergy Clin. Immunol. 136, 1664–1672.e14. doi: 10.1016/j.jaci.2015.03.042
Durner, M., Vieland, V. J., and Greenberg, D. A. (1999). Further evidence for the increased power of LOD scores compared with nonparametric methods. Am. J. Hum. Genet. 64, 281–289. doi: 10.1086/302181
Endter, S., Francombe, D., Gumbleton, M., and Ehrhardt, C. (2009). RT-PCR analysis of ABC, SLC and SLCO drug transporters in human lung epithelial cell models. J. Pharm. Pharmacol. 61, 583–591. doi: 10.1211/jpp/61.05.0006
Fehrmann, R. S. N., Karjalainen, J. M., Westra, H., Maloney, D., Simeonov, A., Pers, T. H., et al. (2015). Gene expression analysis identifies global gene dosage sensitivity in cancer. Nat. Genet. 47, 115–126. doi: 10.1038/ng.3173
Foreman, M. G., Wilson, C., DeMeo, D. L., Hersh, C. P., Beaty, T. H., Cho, M. H., et al. (2017). Alpha-1 antitrypsin PiMZ genotype is associated with chronic obstructive pulmonary disease in two racial groups. Ann. Am. Thorac. Soc. 14, 1280–1287. doi: 10.1513/AnnalsATS.201611-838OC
Hancock, D. B., Eijgelsheim, M., Wilk, J. B., Gharib, S. A., Loehr, L. R., Marciante, K. D., et al. (2010). Meta-analyses of genome-wide association studies identify multiple loci associated with pulmonary function. Nat. Genet. 42, 45–52. doi: 10.1038/ng.500
Hardin, M., and Silverman, E. K. (2014). Chronic obstructive pulmonary disease genetics: a review of the past and a look into the future. Chronic Obstr. Pulm. Dis. 1, 33–46. doi: 10.15326/jcopdf.1.1.2014.0120
Hersh, C. P., DeMeo, D. L., Lange, C., Litonjua, A. A., Reilly, J. J., Kwiatkowski, D., et al. (2005). Attempted replication of reported chronic obstructive pulmonary disease candidate gene associations. Am. J. Respir. Cell Mol. Biol. 33, 71–78. doi: 10.1165/rcmb.2005-0073OC
Hobbs, B. D., De Jong, K., Lamontagne, M., Bossé, Y., Shrine, N., Artigas, M. S., et al. (2017). Genetic loci associated with chronic obstructive pulmonary disease overlap with loci for lung function and pulmonary fibrosis. Nat. Genet. 49, 426–432. doi: 10.1038/ng.3752
Hobbs, B. D., Parker, M. M., Chen, H., Lao, T., Hardin, M., Qiao, D., et al. (2016). Exome array analysis identifies a common Variant in IL27 associated with chronic obstructive pulmonary disease. Am. J. Respir. Crit. Care Med. 194, 48–57. doi: 10.1164/rccm.201510-2053OC
Iglesias, A. I., van der Lee, S. J., Bonnemaijer, P. W. M., Höhn, R., Nag, A., Gharahkhani, P., et al. (2017). Haplotype reference consortium panel: practical implications of imputations with large reference panels. Hum. Mutat. 38, 1025–1032. doi: 10.1002/humu.23247
Ikram, M. A., Brusselle, G. G. O., Murad, S. D., van Duijn, C. M., Franco, O. H., Goedegebure, A., et al. (2017). The Rotterdam Study: 2018 update on objectives, design and main results. Eur. J. Epidemiol. 32, 807–850. doi: 10.1007/s10654-017-0321-4
Ingebrigtsen, T., Thomsen, S. F., Vestbo, J., van der Sluis, S., Kyvik, K. O., Silverman, E. K., et al. (2010). Genetic influences on chronic obstructive pulmonary disease—a twin study. Respir. Med. 104, 1890–1895. doi: 10.1016/j.rmed.2010.05.004
Laurell, C. B., and Eriksson., S. (1963). The electrophoretic alpha 1-globulin pattern of serum in alpha 1-antitrypsin deficiency. Scand. J. Clin. Lab. Invest. 15, 132–40. doi: 10.1080/00365516309051324
Liu, F., Kirichenko, A., Axenovich, T. I., van Duijn, C. M., and Aulchenko, Y. S. (2008). An approach for cutting large and complex pedigrees for linkage analysis. Eur. J. Hum. Genet. 16, 854–860. doi: 10.1038/ejhg.2008.24
Lozano, R., and Naghavi, M. (2012). Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet 380, 2095–2128. doi: 10.1016/S0140-6736(12)61728-0
Lumley, T. (2011). rmeta: Meta-Analysis. R Package Version 2.16. Available at: https://cran.r-project.org/web/packages/rmeta/rmeta.pdf
Manzke, T., Guenther, U., Ponimaskin, E. G., Haller, M., Dutschmann, M., Schwarzacher, S., et al. (2003). 5-HT4(a) receptors avert opioid-induced breathing depression without loss of analgesia. Science 301, 226–229. doi: 10.1126/science.1084674
Palmer, L. J., Celedón, J. C., Chapman, H. A., Speizer, F. E., Weiss, S. T., and Silverman, E. K. (2003). Genome-wide linkage analysis of bronchodilator responsiveness and post-bronchodilator spirometric phenotypes in chronic obstructive pulmonary disease. Hum. Mol. Genet. 12, 1199–1210. doi: 10.1093/hmg/ddg125
Pardo, L. M., MacKay, I., Oostra, B., van Duijn, C. M., and Aulchenko, Y. S. (2005). The effect of genetic drift in a young genetically isolated population. Ann. Hum. Genet. 69, 288–295. doi: 10.1046/j.1529-8817.2005.00162.x
Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A. R., Bender, D., et al. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575. doi: 10.1086/519795
Qiao, D., Lange, C., Beaty, T. H., Crapo, J. D., Barnes, K. C., and Bamshad, M. (2016). Exome sequencing analysis in severe, early-onset chronic obstructive pulmonary disease. Am. J. Respir. Crit. Care Med. 193, 1–61. doi: 10.1164/rccm.201506-1223OC
Scholtens, S., Smidt, N., Swertz, M. A., Bakker, S. J. L., Dotinga, A., Vonk, J. M., et al. (2015). Cohort profile: LifeLines, a three-generation cohort study and biobank. Int. J. Epidemiol. 44, 1172–1180. doi: 10.1093/ije/dyu229
Silverman, E. K., Mosley, J. D., Palmer, L. J., Barth, M., Senter, J. M., Brown, A., et al. (2002). Genome-wide linkage analysis of severe, early-onset chronic obstructive pulmonary disease: airflow obstruction and chronic bronchitis phenotypes. Hum. Mol. Genet. 11, 623–632. doi: 10.1093/hmg/11.6.623
Thorgeirsson, T. E., Geller, F., Sulem1, P., Rafnar,T., Wiste, A., Magnusson, K. P., et al. (2008). A variant associated with nicotine dependence, lung cancer and peripheral arterial disease. Nature 452, 638–642. doi: 10.1038/nature06846
van der Plaat, D. A., de Jong, K., Lahousse, L., Faiz, A., Vonk, J. M., van Diemen, C. C., et al. (2017). Genome-wide association study on the FEV1/FVC ratio in never-smokers identifies HHIP and FAM13A. J. Allergy Clin. Immunol. 139, 533–540. doi: 10.1016/j.jaci.2016.06.062
Van Diemen, C. C., Postma, D. S., Aulchenko, Y. S., Snijders, P. J. L. M., Oostra, B. A., Van Duijn, C. M., et al. (2010). Novel strategy to identify genetic risk factors for COPD severity: a genetic isolate. Eur. Respir. J. 35, 768–775. doi: 10.1183/09031936.00054408
Van Diemen, C. C., Postma, D. S., Vonk, J. M., Bruinenberg, M., Scheuten, J. P., and Boezen, H. M. (2005). A disintegrin and metalloprotease 33 polymorphisms and lung function decline in the general population. Am. J. Respir. Crit. Care Med. 172, 329–333. doi: 10.1164/rccm.200411-1486OC
Voorman, A., Brody, J., and Lumley, T. (2013). skatMeta: An R Package for Meta Analyzing Region-Based Tests of Rare DNA Variants. Available at: http://cran.r-project.org/web/packages/skatMeta
Wain, L. V., Shrine, N., Artigas, M. S., Erzurumluoglu, A. M., Noyvert, B., Bossini-Castillo, L., et al. (2017). Supplementary: genome-wide association analyses for lung function and chronic obstructive pulmonary disease identify new loci and potential druggable targets. Nat. Genet. 49, 416–425. doi: 10.1038/ng.3787
Wilk, J. B., Shrine, N. R. G., Loehr, L. R., Zhao, J. H., Manichaikul, A., Lopez, L. M., et al. (2012). Genome-wide association studies identify CHRNA5/3 and HTR4 in the development of airflow obstruction. Am. J. Respir. Crit. Care Med. 186, 622–632. doi: 10.1164/rccm.201202-0366OC
Zhou, J. J., Cho, M. H., Castaldi, P. J., Hersh, C. P., Silverman, E. K., and Laird, N. M. (2013). Heritability of chronic obstructive pulmonary disease and related phenotypes in smokers. Am. J. Respir. Crit. Care Med. 188, 941–947. doi: 10.1164/rccm.201302-0263OC
Keywords: COPD, genetic linkage analysis, genetic isolate, rare variants, chromosome 11
Citation: Nedeljkovic I, Terzikhan N, Vonk JM, van der Plaat DA, Lahousse L, van Diemen CC, Hobbs BD, Qiao D, Cho MH, Brusselle GG, Postma DS, Boezen HM, van Duijn CM and Amin N (2018) A Genome-Wide Linkage Study for Chronic Obstructive Pulmonary Disease in a Dutch Genetic Isolate Identifies Novel Rare Candidate Variants. Front. Genet. 9:133. doi: 10.3389/fgene.2018.00133
Received: 21 December 2017; Accepted: 03 April 2018;
Published: 19 April 2018.
Edited by:Brahim Aissani, University of Alabama at Birmingham, United States
Reviewed by:Susana Seixas, i3S, Instituto de Investigação e Inovação em Saúde, Portugal
Mary Kaye Wojczynski, Washington University in St. Louis, United States
Copyright © 2018 Nedeljkovic, Terzikhan, Vonk, van der Plaat, Lahousse, van Diemen, Hobbs, Qiao, Cho, Brusselle, Postma, Boezen, van Duijn and Amin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Najaf Amin, firstname.lastname@example.org