ORIGINAL RESEARCH article

Front. Genet., 12 September 2022

Sec. Genetics of Common and Rare Diseases

Volume 13 - 2022 | https://doi.org/10.3389/fgene.2022.990015

Systematic analysis of inheritance pattern determination in genes that cause rare neurodevelopmental diseases

  • 1. Department of Pediatrics, Seoul National University College of Medicine, Seoul National University Children’s Hospital, Seoul, South Korea

  • 2. Department of Genomic Medicine, Seoul National University Hospital, Seoul, South Korea

  • 3. Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, South Korea

  • 4. Rare Disease Center, Seoul National University Hospital, Seoul, South Korea

  • 5. Department of Pediatrics, Seoul National University Bundang Hospital, Seongnam, South Korea

  • 6. Department of Pediatrics, Seoul National University Hospital Child Cancer and Rare Disease Administration, Seoul National University Children’s Hospital, Seoul, South Korea

Abstract

Despite recent advancements in our understanding of genetic etiology and its molecular and physiological consequences, it is not yet clear what genetic features determine the inheritance pattern of a disease. To address this issue, we conducted whole exome sequencing analysis to characterize genetic variants in 1,180 Korean patients with neurological symptoms. The diagnostic yield for definitive pathogenic variant findings was 50.8%, after including 33 cases (5.9%) additionally diagnosed by reanalysis. Of diagnosed patients, 33.4% carried inherited variants. At the genetic level, autosomal recessive-inherited genes were characterized by enrichments in metabolic process, muscle organization and metal ion homeostasis pathways. Transcriptome and interactome profiling analyses revealed less brain-centered expression and fewer protein-protein interactions for recessive genes. The majority of autosomal recessive genes were more tolerant of variation, and functional prediction scores of recessively-inherited variants tended to be lower than those of dominantly-inherited variants. Additionally, we were able to predict the rates of carriers for recessive variants. Our results showed that genes responsible for neurodevelopmental disorders harbor different molecular mechanisms and expression patterns according to their inheritance patterns. Also, calculated frequency rates for recessive variants could be utilized to pre-screen rare neurodevelopmental disorder carriers.

Introduction

Genetic disorders are caused by various alterations in gene function. According to the Online Mendelian Inheritance in Men (OMIM) compendium, 4,617 genes and their variants are associated with human disease as of April 2022 (; ). Broadly, disease-associated alterations can be categorized as resulting in either gain-of-function (GoF) or loss-of-function (LoF) of a gene. GoF is mostly associated with dominant inheritance, whereas LoF can appear in recessive form as well as dominant (as haploinsufficiency) (; ). Mendelian disorders with recessive inheritance patterns are primarily observed in ethnic groups with high rates of consanguineous marriages (; ), whereas those with dominant and recessive inheritance patterns appear in comparable rates in other “outbred” ethnic groups (). Despite recent efforts into patient genome sequencing, diagnosis, and the discovery of novel genes that cause rare Mendelian disorders, it is not yet clear what drives genes to carry variants that are inherited in dominant or recessive patterns. It seems intuitive to postulate that genes that cause diseases in recessive pattern are less critical and more tolerant in development and physiology as one has to carry defective variants in both alleles for a disease to manifest. Nevertheless, there is need to systematically evaluate whether other properties that may represent gene function, expression, and previous disease associations also play a role in the process.

The study of genes that cause recessive diseases is important because knowledge of such genes can be utilized to predict marriages between carriers and avoid the generation of new patients. This strategy has been already proven successful in a number of diseases, such as β-thalassaemia and Tay–Sachs disease (; ). However, those diseases are single gene disorders and display ethnic biases, making the process of variant curation and evaluation for pathogenicity and also the prediction of patients more efficient. Meanwhile, applying such an approach to diseases with heuristic symptoms requires considerably more effort because of the involvement of many genes and variants and the diverse clinical symptoms. Therefore, understanding the features of genetic variants that cause disease in a recessive inheritance pattern will provide a novel approach for avoiding generation of patients.

Complex structure and function of brain involve coordinated expression and function of many genes, and this is also reflected on a diverse array of rare Mendelian neurodevelopmental diseases (NDD) that we observe from patients. Such patients display abnormal brain function and/or structure which may affect motor function, learning ability, development, language, and other brain activities. Diagnosis of such diseases is a challenge because of the extreme genetic heterogeneity and rare occurrence, while whole exome sequencing (WES) has enhanced the yield of NDD diagnoses in clinical practice (; ; ; ; ; ; ).

Here we report analyses of the factors that confer effects on genes that cause diseases in dominant or recessive inheritance pattern. Based on a cohort of 1,180 Korean Neurodevelopmental Disorder (KND) patients and additional patient datasets from Deciphering Developmental Delay (DDD; n = 13,500) and the Simons Foundation Autism Research Initiative (SFARI; n = 34,868), we found that genes which follow a recessive inheritance pattern are more tolerant, harbor variants of lower functionality, interact with fewer other proteins, and are more enriched in metabolic and mitochondrial functional categories when compared to those that follow a dominant inheritance pattern. In addition, the ongoing accumulation of carrier information suggests possible future utility for carrier prediction as more NDD patient data become available.

Materials and methods

Patients and study criteria

Patients with NDD and their parents who visited the Seoul National University Children’s Hospital (SNUCH) pediatric neurology clinic were recruited to this study. Informed consent and blood samples for genomic DNA were obtained under the approval of the Seoul National University Hospital (SNUH) internal review board (#1406-081-588). Patients with confirmed genetic variants identified through candidate gene sequencing, targeted gene sequencing panel, microarray, metabolic work-up, brain/spine MRI, or muscle biopsy were excluded. A total of 1,180 patients with complex neurological symptoms of suspected genetic origin were selected by the pediatric neurologists in SNUCH.

Whole exome sequencing

Genomic DNA was extracted from whole blood using the QIAamp DNA Blood Midi Kit according to the manufacturer’s instructions (Qiagen, Valencia, CA). WES procedures including exome capturing and sequencing were performed at Theragen Etex Bio Institute (Suwon, Korea). The data were analyzed as described previously (). Briefly, Burrows-wheeler Aligner [v.0.7.15 ()] was used to align sequenced reads and the Picard software [v.2.8.0 ()], samtools [v.1.8 ()] and Genome Analysis Toolkit [GATK, v.4.1.4 ()] were used for subsequent data processing steps such as removal of PCR duplicates, base recalibration, and variant quality control. ANNOVAR and SnpEff were used for variant annotation (; ).

Gene expression analysis

The normalized transcript level (TPM) of each gene was extracted from the Genotype-Tissue Expression (GTEx) project [v8 ()]. The median TPM value across all brain regions for each gene was divided by the median value of all other regions to determine relative brain vs. body expression. These relative TPM values were then plotted to visualize the distribution of the gene set. RNA-seq data and exon microarray data from BrainSpan (http://www.brainspan.org) were used to analyze brain spatial and temporal gene expression in the brain (). This dataset contains expression from 16 cortical and subcortical structures along the full course of human brain development.

Protein-protein interaction analysis

Protein correlation profiling of seven mouse tissues was used to explore tissue-specific PPI (). The dataset contained more than 190,000 high-confidence PPIs identified with stable isotope labelling of tissues. We extracted for analysis those pairs that included proteins corresponding to the causal genes identified in the KND, DDD and SFARI cohorts.

Evaluation of pathogenic variants

The pathogenicity of variants in our dataset was assessed with reference to multiple databases as follows. Normal population database such as gnomAD, ExAC, 1000 Genomes, and KOVA were used to evaluate dominant variants that were never seen as heterozygous and recessive variants that were never seen as homozygous or hemizygous when filtered by allele frequency of 0.001 in heterozygous status. In silico prediction scores such as CADD (), SIFT (), and phyloP () were used to gain information regarding whether variants were evolutionarily well conserved at the amino acid level. Disease databases such as OMIM (; ; ), HGMD (), and ClinVar () were used to find causative genes with consideration of genotype-phenotype associations. Then the variants were classified based on inheritance patterns such as de novo, compound heterozygous, homozygous, and hemizygous by comparing to genotypes in the parents. Copy number variation (CNV) analysis through WES was carried out by comparing the mean coverage depth of each captured interval to the mean coverage depth of parental samples as described previously ().

Gene ontology analysis

To analyze disease associations and biological pathways that are enriched in a selected genes, a web-based analysis tool from Metascape (https://metascape.org/gp/index.html) was used. Conventional GO sources were used: biological process (BP), Cellular Component (CC) and Molecular Function (MF). Disease Gene Network (DisGeNET) was used for disease ontology. Results were collected and grouped into clusters for comparative analyses of biological process and disease association between gene groups.

Statistical evaluation

Wilcox test was used to determine the statistical significance of the observed differences in functional scores for genes with different inheritance modes. The statistical significance of the expression level in boxplots was measured by a two-sample t-test. Statistical analyses were performed with R version 3.6.2.

Results

Genetic analysis of 1,180 patients with neurodevelopmental disorders

Participants consisted of pediatric patients (mean age = 11.3 years, range 1–62) displaying one or more neurological symptoms including developmental delay, intellectual disability, intractable seizure, involuntary movements, or muscle weakness who visited SNUCH in 2014–2020 with idiopathic or undiagnosed symptoms. For genetic analysis, DNA from peripheral blood was subjected to WES. Among the 1,180 patients, parents of 707 patients were also sequenced. The resulting genome data were processed and pathogenic variants called with a standard process (Subjects and Methods). The pathogenic variants were identified in the 284 disease-causing genes in 1,180 KND patients (Supplementary Table S1). Overall, 41.9% of the patients carried known variants and 4.7% carried known variants but displayed symptoms different from those previously reported, possibly extending the disease spectra of these variants (Figure 1; Supplementary Table S2). Including the 4.2% of the patients with known CNVs (Supplementary Table S3), 50.8% of patients with pathogenic variants in known causal genes were definitively diagnosed by WES analysis (Figure 1A). This group with known variants was further divided according to variant inheritance pattern (Figure 1B). De novo variants were identified in more than half of diagnosed patients (62.9%) and recessive variants in about a quarter (24.7%). Lastly, 8.7% carried variants on the X chromosome with hemizygous status, making the proportion of inherited variants 33.4% (Figure 1B). This distribution of pathogenic variant inheritance is comparable to those reported in other studies using rare disease patients from outbred populations (; ; ; ).

FIGURE 1

Reanalysis improved diagnostic yield

Re-evaluation of patients previously determined to be without pathogenic variants often allows for the discovery of new variants due to accumulating understanding of gene-disease relationships and improved bioinformatic pipelines (; ; ; ). Previous re-analysis studies have reported 5%–12% increases in diagnostic yield (; ; ; ); here, we observed a 5.9% increase in diagnostic yield, discovering pathogenic variants in 33 patients among the 553 that were previously analyzed in 2020 (KND553; Figure 1A) (). The variants implicated in these 33 cases can be broadly divided into two groups, 1) variants for which new entries in OMIM allowed defining them as pathogenic (n = 8; Table 1) and 2) pathogenic calls previously missed during the bioinformatic process (n = 25; Table 2).

TABLE 1

IndexGeneInheritanceVariant typeVariantVariant statusPhenotypeaOMIM entry number and date of creation
1GEMIN5RecComp hetc.3857A>G; p.Tyr1286CysNovelProgressive cerebellar atrophy with severe developmental arrest#619333; 05/19/2021
c.2510–2A>TNovel
2STAG2DomDe novo hetc.3724C>T; p.Arg1242*NovelHoloprosencephaly#301043; 04/07/2020
3MED12LDomDe novo hetc.1895C>T; p.Ser632LeuNovelGDD with FD#618872; 05/02/2020
4DHX16DomDe novo hetc.2021C>T; p.Thr674MetReportedCongenital myopathy#618733; 01/09/2020
5HK1DomDe novo hetc.1475C>T; p.Thr492MetReportedSevere brain atrophy, deep cortex disruption#618547; 08/20/2019
6ADH5RecComp hetc.678delA; p.Asp227fsNovelGDD, myelodysplastic syndrome#619151; 01/13/2021
c.832G>C; p.Ala278ProReported
7SIAH1DomDe novo hetc.613G>C; p.Gly205ArgReportedGDD and FD#619314; 05/06/2021
8MN1DomDe novo hetc.3850delC; p.His1284fsNovelCHARGE syndrome#618774; 02/11/2020

List of newly diagnosed cases due to new gene entry into OMIM.

a

GDD, global developmental delay; FD, facial dysmorphism; CHARGE, coloboma, heart defects, atresia choanae, growth retardation, genital abnormalities, and ear abnormalities.

TABLE 2

IndexGeneInheritanceVariant typeVariantVariant statusPhenotypebReason
1PMM2RecComp hetc.194A>G; p.Asp65GlyNovelProgressive cerebellar atrophyNot clear
c.713G>C; p.Arg238ProReported
2PDHA1X-linkedDe novo hetc.613G>A; p.Val205MetReportedRett syndrome-likeNot clear
3ZEB2DomDe novo hetc.2083C>T; p.Arg695*ReportedRett syndrome-likeNot clear
4KAT6BDomDe novo hetc.3147G>A; p.Pro1049ProReportedRett syndrome-likeA synonymous variant; called during re-evaluation
5ACTBDomDe novo hetc.547C>T; p.Arg183TrpReportedSevere dystonia, ID, and SNHLNot clear
6CASKX-linkedHemizygousc.1667T>G; p.Leu556ArgNovelAutism spectrum disorder with FDNot clear
7MAGEL2DomDe novo hetc.2873G>A; p.Trp958*ReportedID with FD and multiple anomalyNot clear
8NARS2RecComp hetc.1163C>T; p.Thr388MetNovelEENot clear
c.88G>C; p.Val30LeuNovel
9FOXG1Dompendingc.460dupG; p.Glu154fsReportedEE and microcephalyNot clear
10DYRK1ADomDe novo hetc.520G>T; p.Val174LeuReportedDD with microcephaly and FDNot clear
c.521T>A; p.Val174GluNovel
11SLC16A2X-linkedHemizygousc.1265T>G; p.Leu422ArgNovelNeurodegenerative disorderNot clear
12UGDHRecComp hetc.1183G>A; p.Val395MetNovelEE with family historyNot clear
c.1038–2A>GNovel
13PDHA1X-linkedHemizygousc.761T>C; p.Leu254SerReportedLeigh SyndromeInitially missed due to coverage depth <10
14TBR1DomDe novo hetc.1588_1594dupGGCTGCA; p.Thr532fsReportedRett syndrome-likeInitially missed due to coverage depth <10
15IQSEC2X-linkedDe novo hemic.2139delC; p.Gly714fsNovelRett syndrome-likeInitially missed due to coverage depth <10
16SMC1AX-linkedPossible de novo hetac.2923C>T; p.Arg975*ReportedRett syndrome-likeInitially missed due to coverage depth <10
17AHDC1DomPossible de novo hetac.2389G>T; p.Glu797*NovelRett syndrome-likeInitially missed due to coverage depth <10
18IRF2BPLDomDe novo hetc.562C>T; p.Arg188*ReportedNeurodegenerative diseaseCalled during phenotype re-evaluation
19UBAP1DomDe novo hetc.529dupA; p.Met177fsNovelHereditary spastic paraplegiaCalled during phenotype re-evaluation
20GABRB3DomPossible de novo hetac.554C>T; p.Thr185IleReportedRett syndrome-likeCalled during phenotype re-evaluation
21CDK13DomPossible de novo hetac.2149G>A; p.Gly717ArgReportedRett syndrome-likeCalled during phenotype re-evaluation
22TRAPPC11RecHomc.302A>G; p.Tyr101CysNovelUnknown muscular dystrophy, most likely calpainopathyCalled during phenotype re-evaluation
23SLC35A2X-linkedDe novo hemic.1A>C; p.Met1?NovelUllrich disease or Bethlem myopathy suspectedCalled during phenotype re-evaluation
24DHDDSDomDe novo hetc.110G>A; p.Arg37HisReportedEECalled during phenotype re-evaluation
25KMT2CDomDe novo hetc.5716C>T; p.Arg 1906*NovelFemale ID, microcephalyCalled during phenotype re-evaluation

List of newly diagnosed cases by data re-analysis.

a

Variant is pathogenic in ClinVar, but parental samples were not available.

b

ID, intellectual disability; DD, developmental delay; SNHL, sensorineural hearing loss; FD, facial dysmorphism; EE, epileptic encephalopathy.

Genetic characteristics of genes that follow a dominant or recessive pattern

NDDs display considerable clinical and biological heterogeneity. The innate function and expression pattern of a gene can impact both its inheritance mode and the phenotype when its function or expression is altered. Understanding their genetic properties associated with inheritance modes will help in gaining a more comprehensive view of NDD. To develop this understanding, we first analyzed functional enrichments of those pathogenic genes that follow a dominant or recessive pattern in KND, and compared them with corresponding gene sets from DDD or SFARI (Figure 2A). GO analysis revealed that KND genes are enriched in molecular function and biological process terms relating to brain developmental progression, such as regulation of membrane potential, chromatin organization, head development, and pyrophosphatase activity (Figure 2A). In addition, the published variants from DDD and SFARI shared biological mechanisms involved in brain development (Supplementary Figure S1). Interestingly, differential enrichments were observed between dominant and recessive genes, with remarkably little overlap between the two gene groups (Figure 2B; Supplementary Table S4). In particular, dominant genes were strongly enriched for synaptic functions, while recessive genes were characterized by metabolic process, mitochondrial function, and muscular disease terms. Remarkably, X-linked genes shared more terms with dominant genes than with recessive genes (Figure 2B). This is unexpected because the majority of X-linked genes follow a hemizygous pattern, where disease manifests when the only X-chromosome allele in a male patient is mutated, and may follow a recessive pattern.

FIGURE 2

Expression patterns of genes that follow dominant or recessive inheritance

Next, we used GTEx data to compare the expression profiles of KND genes in brain and other tissues to determine if expression profiles would also differ by inheritance pattern. The results showed that KND genes having brain-specific expression were more enriched in the dominant gene set compared to the recessive gene set (p = 1.4 × 10−7; Figure 2C). Interestingly, the relative brain expression of X-linked genes was more similar to dominant genes than to recessive genes. BrainSpan comprises a comprehensive survey of gene expression in the brain during development, and in the dataset, dominant genes displayed increased expression level relative to both recessive and X-linked genes (between dominant and recessive genes, p = 1.4 × 10−9 for the prenatal period and p = 3.5 × 10−6 for the postnatal period; Figure 2D). There was no substantial difference between prenatal and postnatal expression levels (Figure 2D). Therefore, our findings imply a clear distinction in function and expression level for genes of different inheritance modes.

Tissue-specific PPI networks

PPI information enables us to explore the biological function of a protein though its physical interactions with other proteins. A recent study provided data on protein pairs that interacted in seven mouse tissues, which we used to identify PPIs for KND genes in tissue-specific context (). Focusing on brain tissue, we observed that the fraction of genes with PPIs was greater among dominant genes (59/163 = 36.2%) than for recessive genes (26/117 = 22.2%), and the mean number of interactions was also higher (7.2 for dominant genes vs. 4.5 for recessive genes) (Figure 2E). Among those recessive genes having PPIs at least one interaction in the brain, more than half also interacted with other proteins in all seven tissues (14/26 = 53.8%; Figure 2F), implying these genes to have a more ubiquitous functional pattern. PPIs from DDD and SFARI were also compared for validation (Supplementary Figure S2) and yielded similar patterns of brain-specific PPIs for dominant gene products and broader PPIs for recessive gene products.

Tolerance to pathogenic variants

The tolerance of a gene, indicating the degree to which a critical mutation in it may be detrimental to human development and physiology, is effectively represented by the probability of loss of function intolerance (pLI) and the observed/expected (O/E) constraint ratio scores in gnomAD (). In KND, autosomal dominant genes tended to have pLI values close to 1, representing strong constraint, while autosomal recessive genes showed the opposite trend with pLI close to 0 (e.g., 75.0% of autosomal dominant genes are near 1, and 84.6% of autosomal recessive genes are near 0; Figure 3A). Consistent patterns were also observed for DDD and SFARI. Meanwhile, similar to the GO analysis findings, X-linked recessive genes exhibited patterns akin to dominant genes. These observations were recapitulated when using O/E values (Figure 3A). All told, these findings suggest that genes responsible for NDDs harbor different functions according to their inheritance patterns, and they share little in terms of the molecular pathways leading to disease phenotype.

FIGURE 3

Characteristics of variants that follow dominant or recessive inheritance

We next investigated whether the functionality of genetic variants would differ according to their inheritance pattern using functional prediction scores like CADD, SIFT, and PhyloP. Variants from DDD and SFARI were also compared for validation. This analysis revealed that functional prediction scores for recessive variants tend to be lower than those of dominant variants (between dominant and recessive variants, p = 0.11 for CADD, p = 2.2 × 10−4 for SIFT, p = 9.6 × 10−6 for PhyloP, and p = 6.6 × 10−7 for AA conservation; Figure 3B). This finding indicates that variants under recessive inheritance are less damaging and less critical in function, hence demonstrate little physiological effect on carriers.

Estimating carrier frequencies of variants that cause recessive neurodevelopmental disorders

In our previous study using 553 KND patients, we estimated that one in every 17 healthy individuals is a carrier for at least one pathogenic variant for a recessive genes represented in the KND cohort. This estimate remains unchanged using 1,180 patients (), but with a patient set twice as large, we inferred that the power to predict carriers would substantially increase. We first collected a list of pathogenic LoF and missense variants from ClinVar and KND, and aggregated their population frequencies using the gnomAD East Asian and Korean Variant Archive [KOVA 2; 5,305 healthy Korean individual set ()]. This provided us with an estimation of the number of carriers of recessive neurodevelopmental diseases in the general Korean population (Figure 4A; Supplementary Table S5). Among the 161 genes that carriers were found in KND1180 set, the estimation yields were variable by gene, ranging up to 1.2% of the general population for VPS13B, and no carriers were predicted for 23 genes (Figure 4A). Among the 138 genes that carriers were predicted, only 34 genes were previously found in KND553. On average, the estimation yield for KND1180 variants on the 34 overlapping genes were 1.9-fold higher than those determined for KND553 variants, implying that larger cohort size is critical for increased sensitivity (Figure 4B).

FIGURE 4

Discussion

NDDs show considerable variability at both phenotypic and genetic levels. We conducted WES analysis to reveal genetic etiology for 1,180 undiagnosed patients in the KND cohort. Previously reported diagnostic rates of WES vary substantially among studies, ranging from 25% to 56% (; ; ; ; ; ; ; ). Herein, we report that the diagnostic yield for definitive pathogenic variant findings in KND patients was 50.8%. Among the diagnosed patients, 33.4% carried inherited variants, demonstrating that a large portion of KND patients inherited pathogenic variants from healthy parents.

It is expected that exome reanalysis applying the latest versions of databases and using improved bioinformatic tools would increase diagnostic yield (; ; ; ). We performed reanalysis of 291 patients from the KND553 set who remained without clear pathogenic variants. This reanalysis increased diagnostic yield from 47.5% to 53.4%, which can be attributed to a number of factors: newly discovered and deposited gene-disease associations in OMIM, increased coverage allowing identification of variants that may previously have been missed, filtering out in the initial analysis of synonymous variants affecting gene splicing of KAT6B (p.Pro1049=), and re-evaluation of previously analyzed variants (Figure 1; Tables 1, 2). Therefore, we also suggest that exome sequencing data should be periodically re-evaluated.

Since NDDs may variously be caused by alterations in genes with autosomal dominant, autosomal recessive, or X-linked inheritance modes, genotype-phenotype correlations are often difficult to establish. Also, we believe that studying the fundamental differences in genes that cause NDDs in recessive or dominant mode is crucial in understanding the mechanisms of NDD pathogenicity. As a first step, we analyzed the biological pathways of the KND genes to obtain systematic insights into the molecular mechanisms associated with different inheritance modes. The results revealed that dominant and recessive genes are most strongly associated with synaptic function and metabolic processes, respectively, implying that diseases can be caused through different molecular mechanisms according to their inheritance patterns. Moreover, we observed dominant and recessive gene sets to have opposite trends in pLI and O/E scores, which proved the differences in genetic architecture between these inheritance patterns. In addition, we found gene expression profiles to also reflected this fundamental difference. Profiling of brain expression patterns in GTEx and BrainSpan revealed the dominant gene sets to exhibit specific and increased expression in the brain compared to the recessive gene set, suggesting dominant genes to be more brain-specific. In addition to the GO and expression analyses, we investigated whether PPI data support an association of tissue-specific expression and function with inheritance mode. Tissue-specific PPI networks based on direct interactions have previously demonstrated biological relevance (). Here, we observed that dominant genes to have more interactions specifically in brain tissue than recessive genes. In contrast, recessive genes tended to have interactions ubiquitous across all seven tissues. Therefore, combined biological studies including PPI networks, functional pathways and phenotype data may be effective in expanding our understanding of disease progression in NDD. We also investigated variant functional effects and found that variants with recessive pathogenic alleles were less deleterious than those having dominant alleles. This is well supported by the fact that parental carriers are mostly healthy, although recent large-scale analyses have revealed heterozygous carriers of rare diseases to harbor subtle effects in various aspects of individual health and reproductivity (; ).

The frequency of carriers varies among population groups and specific genetic conditions could be biased toward particular ethnic groups (; ; ; ). Ethnic Koreans are an outbred population, and the culture has prohibited marriages between relatives and among members of family clans for more than 500 years (). As a major tertiary clinical institution, SNUCH covers a large portion of rare NDD patients in the country. Therefore, this study provides an unprecedented opportunity to study the occurrence of recessive diseases in an outbred population. We estimated that 24.7% of patients in the KND1180 cohort were affected by recessive conditions, which allows us to use databases such as gnomAD East Asian and KOVA to calculate carrier frequencies for reported and predicted pathogenic variants in the general population. Our resulting carrier panel will have a sensitivity of 36.1%, which is not too much deviated from previous attempts on Chinese and Israeli populations (38.7% for well-defined recessive conditions and >30% for recessive retinal diseases, respectively) (; ).

As expected, the larger sample size of this cohort relative to the KND553 cohort resulted in a greater number of pathogenic genes and an increase in the reported disease-associated variants enrolled in ClinVar and OMIM. Although calculated carrier frequencies may differ from those observed in clinical practices, the findings from this study will provide genetic evidence for the utility of preconception carrier screening.

Conclusion

Recent efforts into genome-based diagnosis of rare Mendelian disorders have provided with many gene-disease relationships and understanding of disease pathophysiology. However, it has not been clearly elucidated whether there is any genetic feature that determine the modes of inheritance of sucj diseases. We took advantage of in-house as well as public patient genome data and found genetic features of recessive vs. dominant disorders. Furthermore, we demonstrate that we can utilize this understanding of recessive variants to carrier prediction to reduce future patients originated from rare recessive variants.

Statements

Data availability statement

The variant data presented in the study are deposited in the repository (https://www.sysbiolab.org/knd1180).

Ethics statement

The studies involving human participants were reviewed and approved by Seoul National University Hospital. Written informed consent to participate in this study was provided by the participants’ legal guardian/next of kin.

Author contributions

J-HC and MC designed the project. SL, SYK, MJK, JGY, BCL, WJK, KJK, JMK, AC, JSL, and J-HC provided clinical examinations for the participants. SP, SSJ, SL, MK, HS, HJ, SEH, Jean L, Jeongeun L, Jeongha L, and C-RL collected and analyzed the data. SP, MC, and J-HC wrote the article. All authors read and approved the final manuscript.

Funding

The study was supported in part by the National Research Foundation (2020M3E5D7086836 and 2014M3C9A2064686), the Korea Centers for Disease Control and Prevention (2021-ER0701-00 and 2017N-6901-00), and the Ministry of Health and Welfare (HI16C1986).

Acknowledgments

We thank the patients and families that participated in the study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.990015/full#supplementary-material

Glossary

  • OMIM

    the Online Mendelian Inheritance in Men

  • GoF

    gain-of-function

  • LoF

    loss-of-function

  • NDD

    neurodevelopmental diseases

  • WES

    whole exome sequencing

  • KND

    Korean Neurodevelopmental Disorder

  • DDD

    Deciphering Developmental Delay

  • SFARI

    Simons Foundation Autism Research Initiative

  • SNUH

    the Seoul National University Hospital

  • BWA

    Burrows-wheeler Aligner

  • GATK

    Genome Analysis Toolkit

  • TPM

    Transcripts Per Million

  • GTEx

    the Genotype-Tissue Expression

  • PPI

    Protein-protein interaction

  • CNV

    Copy number variation

  • GO

    Gene ontology

  • pLI

    the probability of loss of function intolerance

  • O/E

    the observed/expected

  • KOVA

    Korean Variant Archive

  • SNUCH

    the Seoul National University Children’s Hospital

  • ID

    intellectual disability

  • GDD

    global developmental delay

  • CHARGE

    coloboma, heart defects, atresia choanae, growth retardation, genital abnormalities, and ear abnormalities

  • FD

    facial dysmorphism

  • EE

    epileptic encephalopathy

  • BP

    biological process

  • CC

    cellular component

  • MF

    molecular function

  • DO

    disease ontology

  • AD

    autosomal dominant

  • AR

    autosomal recessive

  • XL

    X-linked

References

  • 1

    AbouelhodaM.SobahyT.El-KaliobyM.PatelN.ShamseldinH.MoniesD.et al (2016). Clinical genomics can facilitate countrywide estimation of autosomal recessive disease burden. Genet. Med.18, 12441249. 10.1038/gim.2016.37

  • 2

    AmbergerJ. S.BocchiniC. A.SchiettecatteF.ScottA. F.HamoshA. (2015). OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders. Nucleic Acids Res.43, D789D798. 10.1093/nar/gku1205

  • 3

    AmbergerJ. S.BocchiniC. A.ScottA. F.HamoshA. (2019). OMIM.org: Leveraging knowledge across phenotype-gene relationships. Nucleic Acids Res.47, D1038D1043. 10.1093/nar/gky1151

  • 4

    AmbergerJ. S.HamoshA. (2017). Searching online mendelian inheritance in man (OMIM): A knowledgebase of human genes and genetic phenotypes. Curr. Protoc. Bioinforma.58, 12. 10.1002/cpbi.27.1-1.2.12

  • 5

    BairdP. A.AndersonT. W.NewcombeH. B.LowryR. B. (1988). Genetic disorders in children and young adults: A population study. Am. J. Hum. Genet.42, 677693.

  • 6

    BartonA. R.HujoelM. L. A.MukamelR. E.ShermanM. A.LohP.-R. (2021). A spectrum of recessiveness among Mendelian disease variants in UK Biobank. MedRxiv.

  • 7

    Broad Institute (2019). Picard toolkit [online]. Broad Institute, GitHub repository: Broad Institute. Available at: https://broadinstitute.github.io/picard/.

  • 8

    CaoA.KanY. W. (2013). The prevention of thalassemia. Cold Spring Harb. Perspect. Med.3, a011775. 10.1101/cshperspect.a011775

  • 9

    ChauJ. F. T.YuM. H. C.ChuiM. M. C.YeungC. C. W.KwokA. W. C.ZhuangX.et al (2022). Comprehensive analysis of recessive carrier status using exome and genome sequencing data in 1543 Southern Chinese. NPJ Genom. Med.7, 23. 10.1038/s41525-022-00287-z

  • 10

    CingolaniP.PatelV. M.CoonM.NguyenT.LandS. J.RudenD. M.et al (2012). Using Drosophila melanogaster as a model for genotoxic chemical mutational studies with a new program, SnpSift. Front. Genet.3, 35. 10.3389/fgene.2012.00035

  • 11

    DeuchlerM. (1992). The confucian transformation of Korea : A study of society and ideology. Cambridge, Massachusetts: Harvard University Asia Center.

  • 12

    Epilepsy Genetics Initiative (2019). The Epilepsy Genetics Initiative: Systematic reanalysis of diagnostic exomes increases yield. Epilepsia60, 797806. 10.1111/epi.14698

  • 13

    EwansL. J.SchofieldD.ShresthaR.ZhuY.GayevskiyV.YingK.et al (2018). Whole-exome sequencing reanalysis at 12 months boosts diagnosis and is cost-effective when applied early in Mendelian disorders. Genet. Med.20, 15641574. 10.1038/gim.2018.39

  • 14

    FungJ. L. F.YuM. H. C.HuangS.ChungC. C. Y.ChanM. C. Y.PajusaluS.et al (2020). A three-year follow-up study evaluating clinical utility of exome sequencing and diagnostic potential of reanalysis. NPJ Genom. Med.5, 37. 10.1038/s41525-020-00144-x

  • 15

    GardnerE. J.NevilleM. D. C.SamochaK. E.BarclayK.KolkM.NiemiM. E. K.et al (2022). Reduced reproductive success is associated with selective constraint on human genes. Nature603, 858863. 10.1038/s41586-022-04549-9

  • 16

    GTEx Consortium (2020). The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science369, 13181330. 10.1126/science.aaz1776

  • 17

    HananyM.AllonG.KimchiA.BlumenfeldA.NewmanH.PrasE.et al (2018). Carrier frequency analysis of mutations causing autosomal-recessive-inherited retinal diseases in the Israeli population. Eur. J. Hum. Genet.26, 11591166. 10.1038/s41431-018-0152-0

  • 18

    IglesiasA.Anyane-YeboaK.WynnJ.WilsonA.Truitt ChoM.GuzmanE.et al (2014). The usefulness of whole-exome sequencing in routine clinical practice. Genet. Med.16, 922931. 10.1038/gim.2014.58

  • 19

    JalkhN.CorbaniS.HaidarZ.HamdanN.FarahE.Abou GhochJ.et al (2019). The added value of WES reanalysis in the field of genetic diagnosis: Lessons learned from 200 exomes in the Lebanese population. BMC Med. Genomics12, 11. 10.1186/s12920-019-0474-y

  • 20

    Jimenez-SanchezG.ChildsB.ValleD. (2001). Human disease genes. Nature409, 853855. 10.1038/35057050

  • 21

    KabackM. M. (2001). Screening and prevention in tay-sachs disease: Origins, update, and impact. Adv. Genet.44, 253265. 10.1016/s0065-2660(01)44084-3

  • 22

    KangH. J.KawasawaY. I.ChengF.ZhuY.XuX.LiM.et al (2011). Spatio-temporal transcriptome of the human brain. Nature478, 483489. 10.1038/nature10523

  • 23

    KuperbergM.LevD.BlumkinL.ZeremA.GinsbergM.LinderI.et al (2016). Utility of whole exome sequencing for genetic diagnosis of previously undiagnosed pediatric neurology patients. J. Child. Neurol.31, 15341539. 10.1177/0883073816664836

  • 24

    LandrumM. J.LeeJ. M.BensonM.BrownG.ChaoC.ChitipirallaS.et al (2016). ClinVar: Public archive of interpretations of clinically relevant variants. Nucleic Acids Res.44, D862D868. 10.1093/nar/gkv1222

  • 25

    LazarinG. A.HaqueI. S.NazarethS.IoriK.PattersonA. S.JacobsonJ. L.et al (2013). An empirical estimate of carrier frequencies for 400+ causal mendelian variants: Results from an ethnically diverse clinical sample of 23, 453 individuals. Genet. Med.15, 178186. 10.1038/gim.2012.114

  • 26

    LeeH.DeignanJ. L.DorraniN.StromS. P.KantarciS.Quintero-RiveraF.et al (2014). Clinical exome sequencing for genetic identification of rare Mendelian disorders. Jama312, 18801887. 10.1001/jama.2014.14604

  • 27

    LeeS.SeoJ.ParkJ.NamJ. Y.ChoiA.IgnatiusJ. S.et al (2017). Korean variant archive (KOVA): A reference database of genetic variations in the Korean population. Sci. Rep.7, 4287. 10.1038/s41598-017-04642-4

  • 28

    LeeY.ParkS.LeeJ. S.KimS. Y.ChoJ.YooY.et al (2020). Genomic profiling of 553 uncharacterized neurodevelopment patients reveals a high proportion of recessive pathogenic variant carriers in an outbred population. Sci. Rep.10, 1413. 10.1038/s41598-020-58101-8

  • 29

    LekM.KarczewskiK. J.MinikelE. V.SamochaK. E.BanksE.FennellT.et al (2016). Analysis of protein-coding genetic variation in 60, 706 humans. Nature536, 285291. 10.1038/nature19057

  • 30

    LiH.DurbinR. (2010). Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics26, 589595. 10.1093/bioinformatics/btp698

  • 31

    LiH.HandsakerB.WysokerA.FennellT.RuanJ.HomerN.et al (2009). The sequence alignment/map format and SAMtools. Bioinformatics25, 20782079. 10.1093/bioinformatics/btp352

  • 32

    LynchH. T.RubinsteinW. S.LockerG. Y. (2004). Cancer in jews: Introduction and overview. Fam. Cancer3, 177192. 10.1007/s10689-004-9538-y

  • 33

    MarinakisN. M.SvingouM.VeltraD.KekouK.SofocleousC.TilemisF. N.et al (2021). Phenotype-driven variant filtration strategy in exome sequencing toward a high diagnostic yield and identification of 85 novel variants in 400 patients with rare Mendelian disorders. Am. J. Med. Genet. A185, 25612571. 10.1002/ajmg.a.62338

  • 34

    MckennaA.HannaM.BanksE.SivachenkoA.CibulskisK.KernytskyA.et al (2010). The genome analysis toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res.20, 12971303. 10.1101/gr.107524.110

  • 35

    NgP. C.HenikoffS. (2003). Sift: Predicting amino acid changes that affect protein function. Nucleic Acids Res.31, 38123814. 10.1093/nar/gkg509

  • 36

    RentzschP.WittenD.CooperG. M.ShendureJ.KircherM. (2019). Cadd: Predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res.47, D886D894. 10.1093/nar/gky1016

  • 37

    RettererK.JuusolaJ.ChoM. T.VitazkaP.MillanF.GibelliniF.et al (2016). Clinical application of whole-exome sequencing across clinical indications. Genet. Med.18, 696704. 10.1038/gim.2015.148

  • 38

    RozenP.ShomratR.StrulH.NaimanT.KarminskyN.LegumC.et al (1999). Prevalence of the I1307K APC gene variant in Israeli Jews of differing ethnic origin and risk for colorectal cancer. Gastroenterology116, 5457. 10.1016/s0016-5085(99)70228-3

  • 39

    Schuster-BöcklerB.BatemanA. (2008). Protein interactions in human genetic diseases. Genome Biol.9, R9. 10.1186/gb-2008-9-1-r9

  • 40

    SiepelA.BejeranoG.PedersenJ. S.HinrichsA. S.HouM.RosenbloomK.et al (2005). Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res.15, 10341050. 10.1101/gr.3715005

  • 41

    SkinniderM. A.ScottN. E.PrudovaA.KerrC. H.StoynovN.StaceyR. G.et al (2021). An atlas of protein-protein interactions across mouse tissues. Cell184, 40734089.e17. e17. 10.1016/j.cell.2021.06.003

  • 42

    SrivastavaS.CohenJ. S.VernonH.BarañanoK.McclellanR.JamalL.NaiduS.FatemiA. (2014). Clinical whole exome sequencing in child neurology practice. Ann. Neurol.76, 473483. 10.1002/ana.24251

  • 43

    StensonP. D.BallE. V.MortM.PhillipsA. D.ShielJ. A.ThomasN. S.et al (2003). Human gene mutation database (HGMD): 2003 update. Hum. Mutat.21, 577581. 10.1002/humu.10212

  • 44

    TrujillanoD.Bertoli-AvellaA. M.Kumar KandaswamyK.WeissM. E.KösterJ.MaraisA.et al (2017). Clinical exome sequencing: Results from 2819 samples reflecting 1000 families. Eur. J. Hum. Genet.25, 176182. 10.1038/ejhg.2016.146

  • 45

    VermaI. C.PuriR. D. (2015). Global burden of genetic disease and the role of genetic screening. Semin. Fetal Neonatal Med.20, 354363. 10.1016/j.siny.2015.07.002

  • 46

    WangK.LiM.HakonarsonH. (2010). Annovar: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res.38, e164. 10.1093/nar/gkq603

  • 47

    WrightC. F.FitzgeraldT. W.JonesW. D.ClaytonS.McraeJ. F.Van KogelenbergM.et al (2015). Genetic diagnosis of developmental disorders in the DDD study: A scalable analysis of genome-wide research data. Lancet385, 13051314. 10.1016/S0140-6736(14)61705-0

  • 48

    YangY.MuznyD. M.ReidJ. G.BainbridgeM. N.WillisA.WardP. A.et al (2013). Clinical whole-exome sequencing for the diagnosis of mendelian disorders. N. Engl. J. Med.369, 15021511. 10.1056/NEJMoa1306555

  • 49

    YangY.MuznyD. M.XiaF.NiuZ.PersonR.DingY.et al (2014). Molecular findings among patients referred for clinical whole-exome sequencing. Jama312, 18701879. 10.1001/jama.2014.14601

Summary

Keywords

neurodevelopmental disorder, inheritance pattern, carrier prediction, whole exome sequencing, recessive disorders

Citation

Park S, Jang SS, Lee S, Kim M, Sim H, Jeon H, Hong SE, Lee J, Lee J, Jeon EY, Lee J, Lee C-R, Kim SY, Kim MJ, Yoon JG, Lim BC, Kim WJ, Kim KJ, Ko JM, Cho A, Lee JS, Choi M and Chae J-H (2022) Systematic analysis of inheritance pattern determination in genes that cause rare neurodevelopmental diseases. Front. Genet. 13:990015. doi: 10.3389/fgene.2022.990015

Received

09 July 2022

Accepted

23 August 2022

Published

12 September 2022

Volume

13 - 2022

Edited by

Ruth Roberts, ApconiX, United Kingdom

Reviewed by

Ngoc-Lan Nguyen,Vietnam Academy of Science and Technology, Vietnam

Fei Yin, Xiangya hospital, Central South University, China

Muhammad Naeem, Hebei Normal University, China

Updates

Copyright

*Correspondence: Murim Choi, ; Jong-Hee Chae,

† These authors share senior authorship

This article was submitted to Genetics of Common and Rare Diseases, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics