ORIGINAL RESEARCH article

Front. Genet., 11 January 2019

Sec. Applied Genetic Epidemiology

Volume 9 - 2018 | https://doi.org/10.3389/fgene.2018.00729

Role of Segregation for Variant Discovery in Multiplex Families Ascertained by Probands With Left Sided Cardiovascular Malformations

  • 1. Department of Pediatrics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, United States

  • 2. Department of Pediatrics, University of Cincinnati School of Medicine, Cincinnati, OH, United States

  • 3. Department of Pediatrics, Medical College of Wisconsin, Milwaukee, WI, United States

Abstract

Cardiovascular malformations (CVM) are common birth defects (incidence of 2–5/100 live births). Although a genetic basis is established, in most cases the cause remains unknown. Analysis of whole exome sequencing (WES) in left sided CVM case and trio series has identified large numbers of potential variants but evidence of causality has remained elusive except in a small percentage of cases. We sought to determine whether variant segregation in families would aid in novel gene discovery. The objective was to compare conventional and co-segregation approaches for WES in multiplex families. WES was performed on 52 individuals from 4 multiplex families ascertained by probands with hypoplastic left heart syndrome (HLHS). We identified rare variants with informatics support (RVIS, minor allele frequency ≤0.01 and Combined Annotation Dependent Depletion score ≥20) in probands. Non-RVIS variants did not meet these criteria. Family specific two point logarithm of the odds (LOD) scores identified co-segregating variants (C-SV) using a dominant model and 80% penetrance. In families, 702 RVIS in 668 genes were identified, but only 1 RVIS was also a C-SV (LOD ≥ 1). On the other hand, there were 109 non-RVIS variants with LOD ≥ 1. Among 110 C-SV, 97% were common (MAF > 1%). These results suggest that conventional variant identification methods focused on RVIS, miss most C-SV. For diseases such as left sided CVM, which exhibit strong familial transmission, co-segregation can identify novel candidates.

Introduction

Cardiovascular malformations (CVM) are the most common birth defects with an incidence of 2 to 5 per 100 live births (Benson, 2002). CVM occur during cardiogenesis, and phenotypes and clinical impact are varied. HLHS is a severe form of CVM characterized by hypoplasia of the left ventricle and ascending aorta in addition to atresia or hypoplasia of the aortic and mitral valves. While rare (prevalence 0.02%) (Parker et al., 2010), it accounts for 25% of infant deaths due to CVM (Boneva et al., 2001). On the other hand, bicuspid aortic valve (BAV) is the most common CVM, affecting 1–2% of the population. The prevailing view is that BAV and HLHS are developmentally related, and that the two phenotypes represent extremes of a spectrum of CVM involving structures on the left side of the heart; related phenotypes include abnormalities of the aorta, mitral valve and heart chamber septa (Brenner et al., 1989; Loffredo et al., 2004; McBride et al., 2005; Hoang et al., 2018). Although CVM heritability, especially for left sided CVM, provides strong evidence of a genetic basis (Cripe et al., 2004; McBride et al., 2005; Hinton et al., 2007; McBride and Garg, 2011; Hanchard et al., 2016; Kuo et al., 2017), in most cases the cause remains unknown. Exome data for left sided CVM case series and trios has identified large numbers of potential variants but definitive evidence of causality has remained elusive except in a small percentage of cases (Russell et al., 2018).

As originally described by Ng et al. (2010a; 2010b), the conventional approach to variant discovery for exome sequencing prioritizes variants which occur rarely in population databases and are predicted to impact protein structure. Current American College of Medical Genetics (ACMG) clinical guidelines (Richards et al., 2015) recognize these criteria but also provide additional criteria for pathogenicity including biologic rather than in silico support for variant functionality and a role of the gene in disease etiology. Characterizing biologic support can be time consuming, costly, and challenging (Hosseini et al., 2018). Even with this approach, 2–11% of unaffected individuals harbor pathogenic variants in clinically actionable genes (Dorschner et al., 2013; Tabor et al., 2014; Lindor et al., 2017).

Another strategy to identify variants relies on linkage analyses to quantify co-segregation evidence (Martin et al., 2014; Richards et al., 2015). While inheritance within a trio has been used to prioritize variants identified using the conventional approach, formal quantification of segregation evidence through linkage analyses is not typically performed with next generation sequence data. As linkage analyses in multiplex families was used to identify CVM genes including TBX5 (Basson et al., 1995, 1997), NKX2.5 (Schott et al., 1998), and NOTCH1 (Garg et al., 2005), we hypothesized that evaluation of co-segregation of variants and disease in a kindred using linkage analysis will aid in identifying the genetic underpinnings of disease such as left sided CVM that exhibit family clustering.

Materials and Methods

Study Population

We selected 4 multiplex families (n = 52 participants; Table 1) recruited as part of a family based genetic study of hypoplastic left heart syndrome (HLHS) (Hinton et al., 2007, 2009). Families were ascertained by a proband with HLHS. Additional, family members were recruited using a sequential sampling strategy. Briefly, each proband’s first-degree relatives were evaluated (Hinton et al., 2007). When additional affected family members were identified, sampling was extended to include their first-degree relatives. Written informed consent was obtained from each participant or participant’s parent or guardian. Assent was obtained from pediatric participants when appropriate. This study was approved by the Institutional Review Board at Cincinnati Children’s Hospital Medical Center.

Table 1

Family 5Family 9Family 14Family 22All
n181610852
Sex (%M)56%63%50%38%53.8%
Race (% White)100%75%100%100%92.3%
% CVM22%31%50%38%32.7%

Description of participants.

Echocardiographic Analysis

Cardiac phenotype was determined by cross-sectional 2-dimensional and Doppler transthoracic echocardiography on all participants using Hewlett-Packard Sonos 5500, General Electric Vivid 5 or Vivid 7 systems as previously described (Hinton et al., 2007, 2009). A detailed echocardiographic protocol previously described was used to assess cardiovascular structures (Cripe et al., 2004). A single experienced echocardiographer interpreted echocardiograms (Cripe et al., 2004; Hinton et al., 2007).

Whole Exome Sequencing (WES)

Whole exome sequencing was performed at the Genetic Variation and Gene Discovery Core of Cincinnati Children’s Hospital. One ug of dsDNA (blood) was used. Quantity was determined by Invitrogen Qubit (Life Technologies, Grand Island, NY, United States) high sensitivity spectro-fluorometric measurement. DNA was sheared by sonication on a Diagenode Bioruptor (Diagenode Inc., USA North America, Denville, NJ, United States). Library construction was performed using Illumina TruSeq DNA Sample Preparation kit (Illumina Inc, San Diego CA, United States) with a size selection at 350 bp post adapter ligation. One ug of genomic library was recovered for exome enrichment using Nimblegen SeqCap EZ Human Exome v2 kit (Roche Nimblegen, Inc, Madison, WI, United States). Enriched libraries were sequenced on an Illumina HiSeq2000 (Illumina, Inc., San Diego, CA, United States), generating at least 30 million paired end reads of 125 bases each per sample, corresponding to an average coverage of 60X.

Variant Calling

Variant calling was performed with Genome Analysis Toolkit v3.5-0 (McKenna et al., 2010). Samples were individually pre-processed by realigning reads around putative indels using IndelRealigner tool, marking putative PCR duplicate reads with Picard’s MarkDuplicates tool and by recalibrating base quality scores with BaseRecalibrator tool. Following pre-processing, samples were called with HaplotypeCaller and the resulting gVCFs were jointly processed with GenotypeGVCFs to generate initial variant calls such that regions which differed from the reference genome (alternative allele) for at least one sample in the batch had variants called in all individuals. Variants were then filtered using Variant Quality Score Recalibration (McKenna et al., 2010).

Data Analysis

Quality Control

To reduce the chance of false positives, quality control was employed (Martin et al., 2014). Briefly, at an individual call level, each call had to have coverage ≥20X but ≤250X and a quality score of ≥20. Variant calls which did not meet this threshold were blanked. Indels and variants which had call rates less than 80% were removed. To minimize sequencing artifacts, we eliminated variants for which the alternative allele was called in more than 30% of the individuals. Only autosomal variants were considered because x-linked inheritance was not indicated. CNVs were also not assessed.

Bioinformatic Prediction of Deleteriousness

To quantify the bioinformatic evidence of functional impact, we used Combined Annotation Dependent Depletion (CADD) scores (v1.3) (Kircher et al., 2014). This tool was chosen because it allowed evaluation of both missense and loss of function (stop gain/loss, splicing) as well as regulatory regions. Phred scores ≥20 were considered as evidence of a deleterious effect.

Prioritization of Variants

Conventional approach

We sought to identify variants which were rare, i.e., minor allele frequency (MAF) ≤1% in all reported 1000 Genomes (Genomes Project Consortium et al., 2015) and ExAC populations (Lek et al., 2016). Such variants would be consistent with the reported prevalence of CVM (2–5%) (Benson, 2002). We identified those that had informatics support of a deleterious effect by selecting variants with Phred scores ≥20. Variants present in the heterozygous or homozygous state in the proband meeting these criteria were considered as rare variants with informatics support (RVIS). The list of RVIS was further analyzed for the degree of sharing across probands at the variant and gene level, loss of function variants (predicted stop gain), and ClinVar pathogenic variants to further reduce the number of variants under consideration. In addition because HLHS had by hypothesized to occur when 2 copies of a CVM variant are carried, homozygous variants were also reviewed.

Co-segregation approach

To identify co-segregating variants (C-SV) in each family, we performed 2 point linkage analyses of variants, with a dominant model and 80% penetrance and a 1% disease mutant gene frequency using SUPERLINK (Silberstein et al., 2006). We selected 2 point rather than multipoint linkage because 2 point linkage permitted evaluation of all variants present in probands, whereas multipoint linkage would have required removing variants in linkage disequilibrium (Evans and Cardon, 2004). The selection of a dominant model was based on the fact that each pedigree had multiple individuals. Further, prior studies have suggested that BAV, a common phenotype in our families is dominantly inherited (Huntington et al., 1997; Wessels et al., 2005). The penetrance was selected to be 80% because prior studies suggested reduced penetrance (Huntington et al., 1997) and there was incomplete familial transmission of CVM in these families. We also evaluated a model using 50% penetrance. In these models, the LOD scores were lower but the most strongly C-SV was consistent with the model using 80% penetrance (data not shown). We opted to use the same model for all families so that we could sum the LOD across families to obtain a cumulative LOD score. Individuals with CVM were considered affected; individuals without CVM were considered unaffected. Individuals with unknown phenotype (labeled with ? in the pedigrees) were scored as missing. LOD were summed across the families to create a cumulative LOD score. However, similar inference was made when evaluating family specific LODs. Assuming two fully informative markers, the theoretic maximum LOD per family was 2.7, 2.1, 1.2, and 0.9 for Family 5, 9, 14, and 22, respectively. However, as our prior data suggests as single variant is likely not sufficient (e.g., reduced penetrance), the bilineal nature of some of the pedigrees, and the information content of SNPs we would not expect to obtain such high LODs.

Results

Cardiovascular Phenotypes of Participants

Pedigrees of 4 multiplex families are illustrated in Figure 1. CVM phenotype was present in 17 of 52 (33%) (Supplementary Table 1). HLHS was present in 5 participants. Varied CVM phenotypes were observed in other family members, either alone or in combination. BAV was present in 4 family members, but 7 additional family members had abnormal aortic valve that did not meet criteria for BAV. Abnormalities of the aorta were present in 4 family members including coarctation of the aorta in 2 participants and dilated aorta in 2 participants. Ventricular septal defect and atrial septal defect in combination with abnormality of the mitral valve were each present in a single individual. These findings underscore the importance of screening relatives of HLHS patients to identify those at risk for latent disease.

FIGURE 1

Rare Variants With Informatics Support (RVIS) Are Common in HLHS Probands

After quality control, there were 108,048 variants called in the 52 individuals. Among these variants, 31,867 were heterozygous or homozygous for the alternative allele in at least 1 of the 4 probands with no additional filtering. Following the conventional approach, 702 RVIS (668 genes; 159–194 variants per proband) were identified (Figure 2 and Supplementary Table 2). Limiting the results to variants seen in the homozygous state, a single variant (rs139011641, MUC6) was identified in a proband. Of the 702 RVIS, only 9 were seen in at least 2 of the probands and none in all 4 (see Supplementary Table 3). Further, there were 27 genes which harbor RVIS in more than one family (2 families n = 24, 3 families n = 3; Supplementary Table 4). Importantly, many of these genes do not harbor the same variant across families. Within this list of 702 variants, there are 28 stop gain variants (Supplementary Table 5) and 5 variants reported in ClinVar as pathogenic or likely pathogenic (Supplementary Table 6). Limiting the results to variants with MAF ≤ 0.1% reduced the number of variants by ∼50% (n = 358) while limiting the results to MAF ≤ 0.01% resulted in 296 variants. However, as a practical matter, this variant list is still prohibitively long for biological evaluation.

FIGURE 2

RVIS Rarely Co-segregate With CVM in Families

All 702 RVIS were also present in at least one parent (suggesting these were not new mutations). We determined the cumulative LOD scores of the RVIS variants across the 4 families. The maximum LOD was 1.3 for a variant on chromosome 4, rs200183228, a non-synonymous variant in SORCS2. No other variants had cumulative LOD ≥ 1.0 (nor family specific, data not shown) and only 8 other variants had LOD > 0.5 (Supplementary Table 7 and Figure 3). However, nearly 25% of variants had LOD < -0.5 (Figure 3). Notably, 11 of these variants had LOD < -2, which would be sufficient to exclude a locus under the specified genetic model.

FIGURE 3

Characteristics of Variants That Co-segregate With CVM in Families (C-SV)

Given the minimal evidence of linkage among RVIS, we then evaluated linkage across 31,867 variants seen in at least one proband (Supplementary Figure 1). The larger families (Family 5 and 9) exhibited multiple variants with moderate evidence of linkage (LOD ≥ 1.0) while the smaller families (Family 14 and 22) did not (Supplementary Table 8). The highest cumulative LOD score was 2.0 for rs34053053 (chromosome 11, an intronic variant in CTTN). There were 9 additional C-SV with cumulative LOD ≥ 1.5. Interestingly, 2 of these variants were in also in CTTN (rs2298396 and rs2298397, both LOD = 1.7) and 3 variants were in GSTP1 (rs1695, rs1871042, and rs4891, all LOD = 1.6). One variant was present in ARAP1-AS2 (rs12575364), PCLO (rs12668093), SORCS2 (rs28531835), and MS4A5 (rs708229). The phenotypes of the affected individuals carrying these variants are varied with no clear pattern of specific variants being associated with abnormalities in the aortic valve, aorta, mitral valve or other CVM (Supplementary Table 9). There were 110 C-SV with cumulative LOD ≥ 1.0 (Figure 4). These variants were distributed across 14 chromosomes, with 5 chromosomes (4, 16, 11, 20, and 7) having 9 or more C-SV.

FIGURE 4

Based on the minimal overlap between RVIS and C-SV, we then sought to characterize population frequency and informatics support as separate components (Figure 3). Strikingly, 97% of C-SV are common (MAF > 1%) with the rarest variant occurring at a frequency = 0.003. Moreover, 90% of C-SV did not have informatics support of functionality. Among the 10 variants with LOD ≥ 1.5, all variants are common and none are predicted to be functional (Supplementary Table 8).

Discussion

In this study, we used multiplex families to compare 2 methods of variant identification for exome data. A conventional approach that relies on documentation of RVIS was compared to an approach based on variant co-segregation (C-SV) with CVM in families using existing linkage analysis methods. In terms of variants identified by the 2 approaches, we found little overlap. Specifically, we found that while emphasis on RVIS identified a large number of variants in probands, few of these variants co-segregated with disease in our families. Surprisingly, among C-SV, the great majority were non-RVIS. Lastly, we found that C-SV are usually common with multiple variants per family. Taken together, these results support the value of family based studies for CVM discovery as these studies can evaluate both C-SV as well as RVIS. Our findings support the use of linkage analyses in multiplex families for left sided CVM gene discovery with exome sequencing data to evaluate more complex inheritance models.

When following the conventional approach for exome data, we identified a large number of RVIS in probands. This is not surprising since pathogenic variants in actionable genes commonly occur in unaffected individuals (Dorschner et al., 2013; Tabor et al., 2014; Lindor et al., 2017). In addition, finding of a large number of RVIS is consistent with prior case series and trio studies for left sided CVM (Zaidi et al., 2013; Homsy et al., 2015; Priest et al., 2016; Jin et al., 2017). Further complicating the discovery efforts for case series is the marked genetic heterogeneity for CVM as demonstrated from linkage (Benson et al., 1998; Martin et al., 2007; Hinton et al., 2009; McBride et al., 2011), association (Gago-Diaz et al., 2017), and exome studies (Zaidi et al., 2013; Homsy et al., 2015; Jin et al., 2017; Li et al., 2017). To overcome the challenges of heterogeneity, researchers have focused on enrichment, which is often restricted to de novo variants (Homsy et al., 2015), protein truncating variants (Sifrim et al., 2016), or gene lists (Kathiresan et al., 2009; LaHaye et al., 2016; Blue et al., 2017; Szot et al., 2018). Even with these strategies, a large proportion of CVM has unknown etiology (Benson et al., 2016; Jin et al., 2017; Wilsdon et al., 2017; Russell et al., 2018; Szot et al., 2018), suggesting alternative strategies are needed.

Importantly, we found that most (>99%) C-SV were not RVIS and thus would have been missed with conventional exome workflows. Other studies that have included family data, have noted that many RVIS do not co-segregate (Arrington et al., 2012; Preuss et al., 2016; Blue et al., 2017). Unfortunately, non-RVIS were not considered in these prior studies. Further, the failure of informatics to support a role for 90% of the C-SV is not surprising as studies have raised concerns about the accuracy of these tools for protein coding changes (Miosge et al., 2015; Mahmood et al., 2017). The situation for informatic prediction becomes more challenging for non-protein coding variation which is based cellular transcriptomes (Sloan et al., 2016). Indeed, the utility of cell based transcriptomics to understand the dynamics underpinning organogenesis where multiple cell types must interact with each other is unclear. Given the lack of RVIS that segregate, these results challenge the assumption that informatics will be the best initial filter for CVM gene discovery.

Following ACMG guidelines, we sought to determine if variants and the genes identified through linkage analysis were biologically plausible. We found 3 chromosomal regions (chromosome 4, 7, and 11 encompassing 6 genes) with LOD ≥ 1.5, and each of these regions had support in 2 families. The chromosome 11 region spanned 60198328 to 72409189 base pairs with variants attributed to 4 genes: cortactin (CTTN, 3 variants), Glutathione S-Transferase Pi 1 (GSTP1, 3 variants), Membrane Spanning 4-Domains A5 (MS4A5, 1 variant), and ARAP1 Antisense RNA 2 (ARAP1-AS2, 1 variant). The variant with the strongest evidence of co-segregation, rs34053053, is located in an intron of CTTN but alters expression of PTPRF Interacting Protein Alpha 1 (PPFIA1) (GTEx Consortium, 2015). PPFIA1 is part of a family of scaffolding proteins involved in focal adhesion turnover, cell migration, and tissue organization (Asperti et al., 2009; Astro et al., 2014, 2016). PPFIA1 regulates integrin (Asperti et al., 2010), is critical for heart development (Parker and Ingber, 2007), and affects vascular morphogenesis in developing zebrafish (Mana et al., 2016). Another gene on chromosome 11 is GSTP1, an oxidative stress-related detoxification enzyme. The 3 segregating variants were predicted to be eQTLs for GSTP1 using GTex (GTEx Consortium, 2015). Prior work has identified GSTP1 variants associated with CVM (Nembhard et al., 2017). MS4A5 is expressed primarily in testes but is detectable in heart and is speculated to be involved in signal transduction. Also included on chromosome 11 is ARAP1-AS2, a non-coding RNA. Chromosome 4 contains Sortilin Related VPS10 Domain Containing Receptor 2 (SORCS2). Notably, SORCS2 was identified both by segregation variants and RVIS, but the strongest segregating variant was non-RVIS. In mice, Sorcs2 is expressed in mesodermally derived structures of the heart prior to E15.5 (Glerup et al., 2014; Boggild et al., 2016) as well as after myocardial ischemia (Siao et al., 2012). Chromosome 7 contains Piccolo Presynaptic Cytomatrix Protein (PCLO). PCLO is down regulated in the hearts of mice lacking cardiac myosin binding protein C (Eijssen et al., 2008). Thus, a limited number of biologically plausible candidate genes, which deserve further consideration, were identified using linkage analyses.

Beyond the identification of novel candidates, utilization of co-segregation provides evidence that multiple common variants contribute to left sided CVM, aka complex inheritance. The finding that multiple rather than a single variant may be required is consistent with work in humans (McBride et al., 2005; Martin et al., 2007; Hinton et al., 2009; Li et al., 2017) and mice (Liu et al., 2017). Prior studies have reported associations between common variants and CVM (Stevens et al., 2010; Qian et al., 2014; Hanchard et al., 2016); however, these studies did not evaluate the functional nature of these variants. Thus, it is not clear whether the common variants are simply in linkage disequilibrium with nearby rare variants. The role of common variation in CVM seems to be in direct contrast to the recommendations of ACMG which suggest that a variant should occur at a frequency lower than the disease prevalence (Richards et al., 2015). While this recommendation is appropriate for Mendelian inheritance where a single variant is sufficient, when multiple variants are required for disease this may not be the case. For example, considering the dominant example where a single variant is sufficient in the heterozygous state, a variant with an allele frequency of 1% would be expected to be seen in 2% of the population, similar to the population estimates of CVM. However, 2 independent variants each with an allele frequency of 5%, would be expected to co-occur infrequently, i.e., in ∼1% of the population. Further, combinations of common and not so common variants may also result in co-occurrence rates consistent with disease prevalence. The challenge with such a scenario is that the number of possible variants that could contribute to disease could increase exponentially. Moreover, reduced penetrance may result in increased frequency in the general population. Thus, for traits which exhibit complex inheritance, utilization of multiplex families may be essential for disentangling the genetic etiology.

There are several limitations to our study. First, while our data clearly supports a disconnect between C-SV and RVIS, we did not demonstrate that non-RVIS were causal. However, if CVM is a complex trait influenced by multiple variants, then evaluation of causality becomes more challenging, as all causal variants must be jointly evaluated. Second, we do not know the extent to which our results, obtained from families ascertained by probands with left sided heart malformations, are generalizable to other CVM phenotypes. Third, our study was a relatively small sample, with 52 individuals from 4 families. Further, three of the four families have CVM in two distinct lines of descent. While this makes variant discovery challenging, given the not so rare nature of CVM (Benson, 2002; McBride et al., 2005) such occurrences are not unexpected but would be missed without detailed phenotyping in extended families. Clearly, additional studies using multiplex CVM families are warranted. Lastly, we did not evaluate copy number variation (CNV) which have been recognized to contribute to CVM (Costain et al., 2016; Hussein et al., 2018).

Taken together these results suggest that the approach to whole exome/genome variant discovery for left sided CVM needs to be reconsidered. Specifically, the results presented here as well as findings of prior studies (McBride et al., 2005; Martin et al., 2007; Hinton et al., 2009; Liu et al., 2017) suggest that multiple genetic variants contribute to disease development. Given such a scenario, the challenge will be how to narrow the list of possible variants. We found that use of linkage narrowed the variant list to a manageable level with 10 variants exhibiting LOD ≥ 1.5. Results from our linkage analyses suggest that there may be multiple variants within a linkage region as well as multiple independent loci contributing to disease. These combinatorial effects may be due to alterations in chromatin accessibility or looping (Tolhuis et al., 2002; Spilianakis and Flavell, 2004; Vernimmen et al., 2007; Jing et al., 2008; Smemo et al., 2014), long range gene regulation (which can occur on the same or different chromosomes) (Spilianakis et al., 2005; Lomvardas et al., 2006) or genes which individually contribute to disease etiology (Winston et al., 2012). The next logical step would be to evaluate how combinations of variants contribute within these families and if these combinations help explain the phenotypic heterogeneity present. Unfortunately, the sample size of the current study does not provide sufficient power to evaluate combinatorial effects. Once hypotheses have been generated, then generalizability of variants identified in families can be evaluated in the case series and trios for which exome or genome data are available such as the Pediatric Cardiac Genomics Consortium (Pediatric Cardiac Genomics Consortium et al., 2013). Additionally, when considering how to biologically validate variants, the multigenic context must be considered. Fortunately, with the advances in CRISPR/cas9 genome editing, multigenic mouse models can be evaluated (Liu et al., 2017).

In summary, these results suggest that for left sided CVM, conventional identification of candidate variants using allele frequency and predicted informatic functionality may miss a large proportion of variants co-segregating with CVM. This is likely due to the complex genetic basis of CVM for many families. For birth defects such as CVM that exhibit strong familial transmission, yet for which single variants are not sufficient, utilization of linkage analyses could be a powerful tool to identify novel candidates missed by conventional strategies.

Statements

Author contributions

LJM developed the question, performed and oversaw analyses, interpreted the results, and drafted the manuscript. DWB developed the question, critically revised the manuscript. VP performed analyses and aided in interpretation.

Funding

This work was supported in part by the Children’s Heart Foundation.

Acknowledgments

We would like to thank the families for their participation.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2018.00729/full#supplementary-material

FIGURE S1

Distribution of cumulative LOD in RVIS and non-RVIS variants. Distributions are presented as Violin plots.

TABLE S1

Phenotypic characteristics of affected individuals.

TABLE S2

Rare variants with informatic support.

TABLE S3

RVIS which are shared across families.

TABLE S4

Genes harboring RVIS present in more than one family.

TABLE S5

Loss of function RVIS.

TABLE S6

RVIS reported to be pathogenic in ClinVar.

TABLE S7

RVIS with cumulative LOD ≥ 0.5.

TABLE S8

Variants which exhibit evidence of segregation with CVM (LOD ≥ 1.0).

TABLE S9

Variants which exhibit evidence of segregation with CVM (LOD ≥ 1.5) and phenotypes associated.

References

  • 1

    ArringtonC. B.BleylS. B.MatsunamiN.BonnellG. D.OtterudB. E.NielsenD. C.et al (2012). Exome analysis of a family with pleiotropic congenital heart disease.Circ. Cardiovasc. Genet.5175182. 10.1161/CIRCGENETICS.111.961797

  • 2

    AspertiC.AstroV.TotaroA.ParisS.de CurtisI. (2009). Liprin-alpha1 promotes cell spreading on the extracellular matrix by affecting the distribution of activated integrins.J. Cell Sci.12232253232. 10.1242/jcs.054155

  • 3

    AspertiC.PettinatoE.de CurtisI. (2010). Liprin-alpha1 affects the distribution of low-affinity beta1 integrins and stabilizes their permanence at the cell surface.Exp. Cell Res.316915926. 10.1016/j.yexcr.2010.01.017

  • 4

    AstroV.ChiarettiS.MagistratiE.FivazM.de CurtisI. (2014). Liprin-alpha1, ERC1 and LL5 define polarized and dynamic structures that are implicated in cell migration.J. Cell Sci.12738623876. 10.1242/jcs.155663

  • 5

    AstroV.TonoliD.ChiarettiS.BadanaiS.SalaK.ZerialM.et al (2016). Liprin-alpha1 and ERC1 control cell edge dynamics by promoting focal adhesion turnover.Sci. Rep.6:33653. 10.1038/srep33653

  • 6

    BassonC. T.BachinskyD. R.LinR. C.LeviT.ElkinsJ. A.SoultsJ.et al (1997). Mutations in human TBX5 [corrected] cause limb and cardiac malformation in Holt-Oram syndrome.Nat. Genet.153035. 10.1038/ng0197-30

  • 7

    BassonC. T.SolomonS. D.WeissmanB.MacRaeC. A.PoznanskiA. K.PrietoF.et al (1995). Genetic heterogeneity of heart-hand syndromes.Circulation9113261329. 10.1161/01.CIR.91.5.1326

  • 8

    BensonD. W. (2002). The genetics of congenital heart disease: a point in the revolution.Cardiol. Clin.20385394. 10.1016/S0733-8651(02)00009-7

  • 9

    BensonD. W.MartinL. J.LoC. W. (2016). Genetics of hypoplastic left heart syndrome.J. Pediatr.1732531. 10.1016/j.jpeds.2016.02.052

  • 10

    BensonD. W.SharkeyA.FatkinD.LangP.BassonC. T.McDonoughB.et al (1998). Reduced penetrance, variable expressivity, and genetic heterogeneity of familial atrial septal defects.Circulation9720432048. 10.1161/01.CIR.97.20.2043

  • 11

    BlueG. M.HumphreysD.SzotJ.MajorJ.ChapmanG.BosmanA.et al (2017). The promises and challenges of exome sequencing in familial, non-syndromic congenital heart disease.Int. J. Cardiol.230155163. 10.1016/j.ijcard.2016.12.024

  • 12

    BoggildS.MolgaardS.GlerupS.NyengaardJ. R. (2016). Spatiotemporal patterns of sortilin and SorCS2 localization during organ development.BMC Cell Biol.17:8. 10.1186/s12860-016-0085-9

  • 13

    BonevaR. S.BottoL. D.MooreC. A.YangQ.CorreaA.EricksonJ. D. (2001). Mortality associated with congenital heart defects in the United States: trends and racial disparities, 1979-1997.Circulation10323762381. 10.1161/01.CIR.103.19.2376

  • 14

    BrennerJ. I.BergK. A.SchneiderD. S.ClarkE. B.BoughmanJ. A. (1989). Cardiac malformations in relatives of infants with hypoplastic left-heart syndrome.Am. J. Dis. Child14314921494. 10.1001/archpedi.1989.02150240114030

  • 15

    CostainG.SilversidesC. K.BassettA. S. (2016). The importance of copy number variation in congenital heart disease.NPJ Genom. Med.1:16031. 10.1038/npjgenmed.2016.31

  • 16

    CripeL.AndelfingerG.MartinL. J.ShoonerK.BensonD. W. (2004). Bicuspid aortic valve is heritable.J. Am. Coll. Cardiol.44138143. 10.1016/j.jacc.2004.03.050

  • 17

    DorschnerM. O.AmendolaL. M.TurnerE. H.RobertsonP. D.ShirtsB. H.GallegoC. J.et al (2013). Actionable, pathogenic incidental findings in 1,000 participants’ exomes.Am. J. Hum. Genet.93631640. 10.1016/j.ajhg.2013.08.006

  • 18

    EijssenL. M.van den BoschB. J.VignierN.LindseyP. J.van den BurgC. M.CarrierL.et al (2008). Altered myocardial gene expression reveals possible maladaptive processes in heterozygous and homozygous cardiac myosin-binding protein C knockout mice.Genomics915260. 10.1016/j.ygeno.2007.09.005

  • 19

    EvansD. M.CardonL. R. (2004). Guidelines for genotyping in genomewide linkage studies: single-nucleotide-polymorphism maps versus microsatellite maps.Am. J. Hum. Genet.75687692. 10.1086/424696

  • 20

    Gago-DiazM.BrionM.GallegoP.CalvoF.Robledo-CarmonaJ.SauraD.et al (2017). The genetic component of bicuspid aortic valve and aortic dilation. An exome-wide association study.J. Mol. Cell Cardiol.10239. 10.1016/j.yjmcc.2016.11.012

  • 21

    GargV.MuthA. N.RansomJ. F.SchlutermanM. K.BarnesR.KingI. N.et al (2005). Mutations in NOTCH1 cause aortic valve disease.Nature437270274. 10.1038/nature03940

  • 22

    Genomes Project ConsortiumAutonA.BrooksL. D.DurbinR. M.GarrisonE. P.KangH. M.et al (2015). A global reference for human genetic variation.Nature5266874. 10.1038/nature15393

  • 23

    GlerupS.OlsenD.VaegterC. B.GustafsenC.SjoegaardS. S.HermeyG.et al (2014). SorCS2 regulates dopaminergic wiring and is processed into an apoptotic two-chain receptor in peripheral glia.Neuron8210741087. 10.1016/j.neuron.2014.04.022

  • 24

    GTEx Consortium. (2015). Human genomics. the genotype-tissue expression (GTEx) pilot analysis: multitissue gene regulation in humans.Science348648660. 10.1126/science.1262110

  • 25

    HanchardN. A.SwaminathanS.BucasasK.FurthnerD.FernbachS.AzamianM. S.et al (2016). A genome-wide association study of congenital cardiovascular left-sided lesions shows association with a locus on chromosome 20.Hum. Mol. Genet.2523312341. 10.1093/hmg/ddw071

  • 26

    HintonR. B.Jr.MartinL. J.TabanginM. E.MazwiM. L.CripeL. H.BensonD. W. (2007). Hypoplastic left heart syndrome is heritable.J. Am. Coll. Cardiol.5015901595. 10.1016/j.jacc.2007.07.021

  • 27

    HintonR. B.MartinL. J.Rame-GowdaS.TabanginM. E.CripeL. H.BensonD. W. (2009). Hypoplastic left heart syndrome links to chromosomes 10q and 6q and is genetically related to bicuspid aortic valve.J. Am. Coll. Cardiol.5310651071. 10.1016/j.jacc.2008.12.023

  • 28

    HoangT. T.GoldmuntzE.RobertsA. E.ChungW. K.KlineJ. K.DeanfieldJ. E.et al (2018). The congenital heart disease genetic network study: cohort description.PLoS One13:e0191319. 10.1371/journal.pone.0191319

  • 29

    HomsyJ.ZaidiS.ShenY.WareJ. S.SamochaK. E.KarczewskiK. J.et al (2015). De novo mutations in congenital heart disease with neurodevelopmental and other congenital anomalies.Science35012621266. 10.1126/science.aac9396

  • 30

    HosseiniS. M.KimR.UdupaS.CostainG.JoblingR.ListonE.et al (2018). Reappraisal of reported genes for sudden arrhythmic death: an evidence-based evaluation of gene validity for brugada syndrome.Circulation13811951205. 10.1161/CIRCULATIONAHA.118.035070

  • 31

    HuntingtonK.HunterA. G.ChanK. L. (1997). A prospective study to assess the frequency of familial clustering of congenital bicuspid aortic valve.J. Am. Coll. Cardiol.3018091812. 10.1016/S0735-1097(97)00372-0

  • 32

    HusseinI. R.BaderR. S.ChaudharyA. G.BassiouniR.AlquaitiM.AshganF.et al (2018). Identification of de novo and rare inherited copy number variants in children with syndromic congenital heart defects.Pediatr. Cardiol.39924940. 10.1007/s00246-018-1842-7

  • 33

    JinS. C.HomsyJ.ZaidiS.LuQ.MortonS.DePalmaS. R.et al (2017). Contribution of rare inherited and de novo variants in 2,871 congenital heart disease probands.Nat. Genet.4915931601. 10.1038/ng.3970

  • 34

    JingH.VakocC. R.YingL.MandatS.WangH.ZhengX.et al (2008). Exchange of GATA factors mediates transitions in looped chromatin organization at a developmentally regulated gene locus.Mol. Cell29232242. 10.1016/j.molcel.2007.11.020

  • 35

    KathiresanS.VoightB. F.PurcellS.MusunuruK.ArdissinoD.MannucciP. M.et al (2009). Genome-wide association of early-onset myocardial infarction with single nucleotide polymorphisms and copy number variants.Nat. Genet.41334341. 10.1038/ng.327

  • 36

    KircherM.WittenD. M.JainP.O’RoakB. J.CooperG. M.ShendureJ. (2014). A general framework for estimating the relative pathogenicity of human genetic variants.Nat. Genet.46310315. 10.1038/ng.2892

  • 37

    KuoC. F.LinY. S.ChangS. H.ChouI. J.LuoS. F.SeeL. C.et al (2017). Familial aggregation and heritability of congenital heart defects.Circ. J.82232238. 10.1253/circj.CJ-17-0250

  • 38

    LaHayeS.CorsmeierD.BasuM.BowmanJ. L.Fitzgerald-ButtS.ZenderG.et al (2016). Utilization of whole exome sequencing to identify causative mutations in familial congenital heart disease.Circ. Cardiovasc. Genet.9320329. 10.1161/CIRCGENETICS.115.001324

  • 39

    LekM.KarczewskiK. J.MinikelE. V.SamochaK. E.BanksE.FennellT.et al (2016). Analysis of protein-coding genetic variation in 60,706 humans.Nature536285291. 10.1038/nature19057

  • 40

    LiA. H.HanchardN. A.FurthnerD.FernbachS.AzamianM.NicosiaA.et al (2017). Whole exome sequencing in 342 congenital cardiac left sided lesion cases reveals extensive genetic heterogeneity and complex inheritance patterns.Genome Med.9:95. 10.1186/s13073-017-0482-5

  • 41

    LindorN. M.ThibodeauS. N.BurkeW. (2017). Whole-genome sequencing in healthy people.Mayo Clin. Proc.92159172. 10.1016/j.mayocp.2016.10.019

  • 42

    LiuX.YagiH.SaeedS.BaisA. S.GabrielG. C.ChenZ.et al (2017). The complex genetics of hypoplastic left heart syndrome.Nat. Genet.4911521159. 10.1038/ng.3870

  • 43

    LoffredoC. A.ChokkalingamA.SillA. M.BoughmanJ. A.ClarkE. B.ScheelJ.et al (2004). Prevalence of congenital cardiovascular malformations among relatives of infants with hypoplastic left heart, coarctation of the aorta, and d-transposition of the great arteries.Am. J. Med. Genet. Part A 124A, 225230. 10.1002/ajmg.a.20366

  • 44

    LomvardasS.BarneaG.PisapiaD. J.MendelsohnM.KirklandJ.AxelR. (2006). Interchromosomal interactions and olfactory receptor choice.Cell126403413. 10.1016/j.cell.2006.06.035

  • 45

    MahmoodK.JungC. H.PhilipG.GeorgesonP.ChungJ.PopeB. J.et al (2017). Variant effect prediction tools assessed using independent, functional assay-based datasets: implications for discovery and diagnostics.Hum. Genomics11:10. 10.1186/s40246-017-0104-8

  • 46

    ManaG.ClaperoF.PanieriE.PaneroV.BottcherR. T.TsengH. Y.et al (2016). PPFIA1 drives active alpha5beta1 integrin recycling and controls fibronectin fibrillogenesis and vascular morphogenesis.Nat. Commun.7:13546. 10.1038/ncomms13546

  • 47

    MartinL. J.PilipenkoV.KaufmanK. M.CripeL.KottyanL. C.KeddacheM.et al (2014). Whole exome sequencing for familial bicuspid aortic valve identifies putative variants.Circ. Cardiovasc. Genet.7677683. 10.1161/CIRCGENETICS.114.000526

  • 48

    MartinL. J.RamachandranV.CripeL. H.HintonR. B.AndelfingerG.TabanginM.et al (2007). Evidence in favor of linkage to human chromosomal regions 18q, 5q and 13q for bicuspid aortic valve and associated cardiovascular malformations.Hum. Genet.121275284. 10.1007/s00439-006-0316-9

  • 49

    McBrideK. L.GargV. (2011). Heredity of bicuspid aortic valve: is family screening indicated?Heart9711931195. 10.1136/hrt.2011.222489

  • 50

    McBrideK. L.PignatelliR.LewinM.HoT.FernbachS.MenessesA.et al (2005). Inheritance analysis of congenital left ventricular outflow tract obstruction malformations: segregation, multiplex relative risk, and heritability.Am. J. Med. Genet. Part A134180186. 10.1002/ajmg.a.30602

  • 51

    McBrideK. L.ZenderG. A.Fitzgerald-ButtS. M.SeagravesN. J.FernbachS. D.ZapataG.et al (2011). Association of common variants in ERBB4 with congenital left ventricular outflow tract obstruction defects.Birth Defects Res. A Clin. Mol. Teratol.91162168. 10.1002/bdra.20764

  • 52

    McKennaA.HannaM.BanksE.SivachenkoA.CibulskisK.KernytskyA.et al (2010). The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.Genome Res.2012971303. 10.1101/gr.107524.110

  • 53

    MiosgeL. A.FieldM. A.SontaniY.ChoV.JohnsonS.PalkovaA.et al (2015). Comparison of predicted and actual consequences of missense mutations.Proc. Natl. Acad. Sci. U.S.A.112E5189E5198. 10.1073/pnas.1511585112

  • 54

    NembhardW. N.TangX.HuZ.MacLeodS.StoweZ.WebberD.et al (2017). Maternal and infant genetic variants, maternal periconceptional use of selective serotonin reuptake inhibitors, and risk of congenital heart defects in offspring: population based study.BMJ356:j832. 10.1136/bmj.j832

  • 55

    NgS. B.BighamA. W.BuckinghamK. J.HannibalM. C.McMillinM. J.GildersleeveH. I.et al (2010a). Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome.Nat. Genet.42790793. 10.1038/ng.646

  • 56

    NgS. B.BuckinghamK. J.LeeC.BighamA. W.TaborH. K.DentK. M.et al (2010b). Exome sequencing identifies the cause of a mendelian disorder.Nat. Genet.423035. 10.1038/ng.499

  • 57

    ParkerK. K.IngberD. E. (2007). Extracellular matrix, mechanotransduction and structural hierarchies in heart tissue engineering.Philos. Trans. R. Soc. Lond. B Biol. Sci.36212671279. 10.1098/rstb.2007.2114

  • 58

    ParkerS. E.MaiC. T.CanfieldM. A.RickardR.WangY.MeyerR. E.et al (2010). Updated national birth prevalence estimates for selected birth defects in the United States, 2004-2006.Birth Defects Res. A Clin. Mol. Teratol.8810081016. 10.1002/bdra.20735

  • 59

    Pediatric Cardiac Genomics ConsortiumGelbB.BruecknerM.ChungW.GoldmuntzE.KaltmanJ.et al (2013). The congenital heart disease genetic network study: rationale, design, and early results.Circ. Res.112698706. 10.1161/CIRCRESAHA.111.300297

  • 60

    PreussC.CapredonM.WunnemannF.ChetailleP.PrinceA.GodardB.et al (2016). Family based whole exome sequencing reveals the multifaceted role of notch signaling in congenital heart disease.PLoS Genet.12:e1006335. 10.1371/journal.pgen.1006335

  • 61

    PriestJ. R.OsoegawaK.MohammedN.NandaV.KunduR.SchultzK.et al (2016). De novo and rare variants at multiple loci support the oligogenic origins of atrioventricular septal heart defects.PLoS Genet.12:e1005963. 10.1371/journal.pgen.1005963

  • 62

    QianB.MoR.DaM.PengW.HuY.MoX. (2014). Common variations in BMP4 confer genetic susceptibility to sporadic congenital heart disease in a Han Chinese population.Pediatr. Cardiol.3514421447. 10.1007/s00246-014-0951-1

  • 63

    RichardsS.AzizN.BaleS.BickD.DasS.Gastier-FosterJ.et al (2015). Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology.Genet. Med.17405424. 10.1038/gim.2015.30

  • 64

    RussellM. W.ChungW. K.KaltmanJ. R.MillerT. A. (2018). Advances in the understanding of the genetic determinants of congenital heart disease and their impact on clinical outcomes.J. Am. Heart Assoc.7:e006906. 10.1161/JAHA.117.006906

  • 65

    SchottJ. J.BensonD. W.BassonC. T.PeaseW.SilberbachG. M.MoakJ. P.et al (1998). Congenital heart disease caused by mutations in the transcription factor NKX2-5.Science281108111. 10.1126/science.281.5373.108

  • 66

    SiaoC. J.LorentzC. U.KermaniP.MarinicT.CarterJ.McGrathK.et al (2012). ProNGF, a cytokine induced after myocardial infarction in humans, targets pericytes to promote microvascular damage and activation.J. Exp. Med.20922912305. 10.1084/jem.20111749

  • 67

    SifrimA.HitzM. P.WilsdonA.BreckpotJ.TurkiS. H.ThienpontB.et al (2016). Distinct genetic architectures for syndromic and nonsyndromic congenital heart defects identified by exome sequencing.Nat. Genet.4810601065. 10.1038/ng.3627

  • 68

    SilbersteinM.TzemachA.DovgolevskyN.FishelsonM.SchusterA.GeigerD. (2006). Online system for faster multipoint linkage analysis via parallel execution on thousands of personal computers.Am. J. Hum. Genet.78922935. 10.1086/504158

  • 69

    SloanC. A.ChanE. T.DavidsonJ. M.MalladiV. S.StrattanJ. S.HitzB. C.et al (2016). ENCODE data at the ENCODE portal.Nucleic Acids Res.44D726D732. 10.1093/nar/gkv1160

  • 70

    SmemoS.TenaJ. J.KimK. H.GamazonE. R.SakabeN. J.Gomez-MarinC.et al (2014). Obesity-associated variants within FTO form long-range functional connections with IRX3.Nature507371375. 10.1038/nature13138

  • 71

    SpilianakisC. G.FlavellR. A. (2004). Long-range intrachromosomal interactions in the T helper type 2 cytokine locus.Nat. Immunol.510171027. 10.1038/ni1115

  • 72

    SpilianakisC. G.LaliotiM. D.TownT.LeeG. R.FlavellR. A. (2005). Interchromosomal associations between alternatively expressed loci.Nature435637645. 10.1038/nature03574

  • 73

    StevensK. N.HakonarsonH.KimC. E.DoevendansP. A.KoelemanB. P.MitalS.et al (2010). Common variation in ISL1 confers genetic susceptibility for human congenital heart disease.PLoS One5:e10855. 10.1371/journal.pone.0010855

  • 74

    SzotJ. O.CunyH.BlueG. M.HumphreysD. T.IpE.HarrisonK.et al (2018). A screening approach to identify clinically actionable variants causing congenital heart disease in exome data.Circ. Genom. Precis. Med.11:e001978. 10.1161/CIRCGEN.117.001978

  • 75

    TaborH. K.AuerP. L.JamalS. M.ChongJ. X.YuJ. H.GordonA. S.et al (2014). Pathogenic variants for Mendelian and complex traits in exomes of 6,517 European and African Americans: implications for the return of incidental results.Am. J. Hum. Genet.95183193. 10.1016/j.ajhg.2014.07.006

  • 76

    TolhuisB.PalstraR. J.SplinterE.GrosveldF.de LaatW. (2002). Looping and interaction between hypersensitive sites in the active beta-globin locus.Mol. Cell.1014531465. 10.1016/S1097-2765(02)00781-5

  • 77

    VernimmenD.De GobbiM.Sloane-StanleyJ. A.WoodW. G.HiggsD. R. (2007). Long-range chromosomal interactions regulate the timing of the transition between poised and active gene expression.EMBO J.2620412051. 10.1038/sj.emboj.7601654

  • 78

    WesselsM. W.BergerR. M.Frohn-MulderI. M.Roos-HesselinkJ. W.HoogeboomJ. J.ManciniG. S.et al (2005). Autosomal dominant inheritance of left ventricular outflow tract obstruction.Am. J. Med. Genet. Part A134171179. 10.1002/ajmg.a.30601

  • 79

    WilsdonA.SifrimA.HitzM. P.HurlesM.BrookJ. D. (2017). Recent advances in congenital heart disease genomics.F1000Res6:869. 10.12688/f1000research.10113.1

  • 80

    WinstonJ. B.SchulkeyC. E.ChenI. B.RegmiS. D.EfimovaM.ErlichJ. M.et al (2012). Complex trait analysis of ventricular septal defects caused by Nkx2-5 mutation.Circ. Cardiovasc. Genet.5293300. 10.1161/CIRCGENETICS.111.961136

  • 81

    ZaidiS.ChoiM.WakimotoH.MaL.JiangJ.OvertonJ. D.et al (2013). De novo mutations in histone-modifying genes in congenital heart disease.Nature498220223. 10.1038/nature12141

Summary

Keywords

linkage, heart, complex trait, exome, gene

Citation

Martin LJ, Pilipenko V and Benson DW (2019) Role of Segregation for Variant Discovery in Multiplex Families Ascertained by Probands With Left Sided Cardiovascular Malformations. Front. Genet. 9:729. doi: 10.3389/fgene.2018.00729

Received

10 August 2018

Accepted

22 December 2018

Published

11 January 2019

Volume

9 - 2018

Edited by

Daniel Shriner, National Human Genome Research Institute (NHGRI), United States

Reviewed by

Alex Vincent Postma, University of Amsterdam, Netherlands; Elizabeth Hauser, Duke University, United States; Duncan C. Thomas, University of Southern California, United States

Updates

Copyright

*Correspondence: Lisa J. Martin,

This article was submitted to Applied Genetic Epidemiology, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics