<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="brief-report" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">1015017</article-id>
<article-id pub-id-type="doi">10.3389/fgene.2023.1015017</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Perspective</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Systematic errors in annotations of truncations, loss-of-function and synonymous variants</article-title>
<alt-title alt-title-type="left-running-head">Vihinen</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fgene.2023.1015017">10.3389/fgene.2023.1015017</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Vihinen</surname>
<given-names>Mauno</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/55709/overview"/>
</contrib>
</contrib-group>
<aff>
<institution>Department of Experimental Medical Science</institution>, <institution>Lund University</institution>, <addr-line>Lund</addr-line>, <country>Sweden</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1235965/overview">Laura V. Milko</ext-link>, University of North Carolina at Chapel Hill, United States</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/417786/overview">Eleanor Grace Seaby</ext-link>, University of Southampton, United Kingdom</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/690831/overview">Roddy Walsh</ext-link>, Academic Medical Center, Netherlands</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Mauno Vihinen, <email>mauno.vihinen@med.lu.se</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Genetics of Common and Rare Diseases, a section of the journal Frontiers in Genetics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>13</day>
<month>01</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1015017</elocation-id>
<history>
<date date-type="received">
<day>09</day>
<month>08</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>03</day>
<month>01</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2023 Vihinen.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Vihinen</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Description of genetic phenomena and variations requires exact language and concepts. Vast amounts of variation data are produced with next-generation sequencing pipelines. The obtained variations are automatically annotated, e.g., for their functional consequences. These tools and pipelines, along with systematic nomenclature, mainly work well, but there are still some problems in nomenclature, organization of some databases, misuse of concepts and certain practices. Therefore, systematic errors prevent correct annotation and often preclude further analysis of certain variation types. Problems and solutions are described for presumed protein truncations, variants that are claimed to be of loss-of-function based on the type of variation, and synonymous variants that are not synonymous and lead to sequence changes or to missing protein.</p>
</abstract>
<kwd-group>
<kwd>variation annotation errors</kwd>
<kwd>protein truncation</kwd>
<kwd>loss-of-function variation</kwd>
<kwd>synonymous variation</kwd>
<kwd>mutation</kwd>
<kwd>frameshift variation</kwd>
</kwd-group>
<contract-sponsor id="cn001">Vetenskapsr&#xe5;det<named-content content-type="fundref-id">10.13039/501100004359</named-content>
</contract-sponsor>
<contract-sponsor id="cn002">Cancerfonden<named-content content-type="fundref-id">10.13039/501100002794</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Naming and definition of concepts and things forms the basis of scientific communication. Common language is the foundation; however, difficult to obtain without ambiguity. People often use problematic and sometimes even wrong names for many concepts in genetics. Confusion appears also when people mean different concepts with the same term. The reader cannot always interpret the meaning from the context and can misunderstand the message.</p>
<p>Many types of scientific systematics have been developed since the <italic>Systema Naturae</italic> for taxonomy of species was introduced by Carl von Linn&#xe9; between 1735 and 1768. Some problems in genetic nomenclature and practices have been described earlier (<xref ref-type="bibr" rid="B55">Vihinen, 2015a</xref>; <xref ref-type="bibr" rid="B63">Vihinen, 2022a</xref>). Here systematic errors in certain types of annotations are discussed, reasons for the problems are charted and remedies suggested.</p>
<p>Next-generation sequencing (NGS) data generation and analysis pipelines produce ever increasing amounts of variation information. As these pipelines are highly computerized, it is instrumental that the used programs make correct choices and are based on accurate definitions, concepts and ideas. Any errors and mistakes in these steps affect the downstream analyses and largely remain unnoticed. Once the variation calls are obtained, variation annotation is used to describe the types of variations in the data. The annotation step largely dictates the outcome of variant interpretation, since variants are selected for interpretation based on the annotations. Popular annotation tools, nomenclature, some databases and practices in the field generate systematic errors. Therefore, certain variants are not properly characterized, e.g. because they are classified as irrelevant or of low significance and relevance in diseases. Nomenclature, annotation tools and databases mainly do great work, the focus here is in issues that still need attention. The concepts discussed and described here have systematic definitions in Variation Ontology (VariO), which facilitates description of variation types, functions, effects, and mechanisms (<xref ref-type="bibr" rid="B62">Vihinen, 2014</xref>).</p>
<p>These problems are harmful for scientific inquiry, for detailed and correct descriptions of variations, and in variation interpretation for different purposes. Naturally, there is also impact on the genetic diagnosis of individuals with such variants, which may lead to delayed or missing diagnosis, mistreatment and other problems, in the extreme case to the death of a patient even when treatment would be available.</p>
</sec>
<sec id="s2">
<title>2 Variation annotation</title>
<p>The goal of the variant annotation is to assign functional information to DNA variants. The most basic and the most common annotations describe how variants change the coding sequences and affect the gene products. The annotation of variations is a crucial step in the generation and use of variation data as the annotations are essential, e.g., in the analysis of disease-related variants. If causative variants are misclassified, they are likely ignored in variation interpretation. Only computational approaches are amenable to annotation of whole genome sequences. Every human haploid genome contains some 3.5 to 4.3 million single nucleotide variants (SNVs) and numerous other types of variants, when compared to the reference sequence (<xref ref-type="bibr" rid="B2">Auton et al., 2015</xref>).</p>
<p>Tens of variant annotation methods have been developed and can be grouped as theoretically based and empirically based approaches (<xref ref-type="bibr" rid="B42">Samuels et al., 2022</xref>). The most widely used of the tools include ANNOVAR (<xref ref-type="bibr" rid="B64">Wang et al., 2010</xref>), Ensembl Variant Effect Predictor (VEP) (<xref ref-type="bibr" rid="B32">McLaren et al., 2016</xref>), and SnpEff (<xref ref-type="bibr" rid="B5">Cingolani et al., 2012</xref>). An important step in the annotation is mapping of variants to reference sequences. Depending on which tool and which sequence is used, the annotations may differ markedly (<xref ref-type="bibr" rid="B30">McCarthy et al., 2014</xref>). Recent comparison of ANNOVAR and SnpEff indicated concordance of about 85% when using the same reference sequences (<xref ref-type="bibr" rid="B38">Park and Park, 2021</xref>). Thus, variation annotations contain many differences and errors, details depend on the implementation and the used tool. Note that annotation is a general term for addition of new information to data. Thus, e.g. addition of Human Genome Variation Society (HGVS) descriptions (<xref ref-type="bibr" rid="B9">den Dunnen and Antonarakis, 2001</xref>) to variants is also a form of annotation.</p>
</sec>
<sec id="s3">
<title>3 Protein truncations</title>
<p>Proteoforms are protein forms originating from a single gene (<xref ref-type="bibr" rid="B47">Smith and Kelleher, 2013</xref>) by more than 10 different mechanisms (<xref ref-type="bibr" rid="B53">Vihinen, 2021a</xref>). Natural alternative forms appear in some proteins due to N- or C-terminal truncations. These processes are rather common due to alternative translation initiation and termination and other processes (<xref ref-type="bibr" rid="B45">Sharma et al., 2016</xref>; <xref ref-type="bibr" rid="B20">Kaushal and Lee, 2021</xref>). Proteolytic cleavage is one of the common post-translational modifications and involved in activation and tight regulation of activities of proteins as insulin and proteases. Proteoforms originate from DNA, RNA and protein level alterations. Protein truncation means shortening of polypeptide chain. Systematic definition in VariO is &#x201c;shortening of protein sequence from terminus&#x201d; (VariO:0015). Terminological confusion may occur because some scientists consider missing protein as a truncation.</p>
<p>Premature termination codons (PTCs) emerge due to several types of variations and are rather common among disease-causing variants. Some text books, several articles and dictionaries describe PTCs typically as &#x201c;a codon that has been converted to the same sequence as a stop codon by a non-sense mutation. It is different from a stop codon in that it occurs abnormally and causes premature termination of protein translation resulting in the production of truncated proteins which may be non-functional&#x201d; (<ext-link ext-link-type="uri" xlink:href="https://www.xmri.com/resource-center/conditions.html?term=92806">https://www.xmri.com/resource-center/conditions.html?term&#x3d;92806</ext-link>). This kind of definition is problematic, because in many instances no protein is produced at all.</p>
<p>The HGVS nomenclature (<xref ref-type="bibr" rid="B9">den Dunnen and Antonarakis, 2001</xref>) and tools generating annotations based on the nomenclature (<xref ref-type="bibr" rid="B13">Hart et al., 2015</xref>; <xref ref-type="bibr" rid="B11">Freeman et al., 2018</xref>; <xref ref-type="bibr" rid="B24">Lefter et al., 2021</xref>) describe PTC-introducing variants as a protein sequence with a termination signal. These annotations are interpreted as protein truncations. However, most of these variants are not protein truncations, since many PTC-containing mRNA transcripts are degraded and thereby no protein, full length or truncated, is produced. A non-existent protein cannot be truncated! mRNA quality control mechanisms, mainly non-sense-mediated decay (NMD), recognize PTC-containing transcripts and degrade them (<xref ref-type="bibr" rid="B22">Kurosaki et al., 2019</xref>). The outcome of a genetic variant depends in addition to the type of the variation also on its context.</p>
<p>NMD machinery recognizes and degrades PTC-containing mRNA transcripts unless the transcript can escape from degradation. NMD escape happens in certain exons. One such scenario is in the penultimate exon within the last 50 nucleotides, called the 50&#xa0;nt rule (<xref ref-type="bibr" rid="B34">Nagy and Maquat, 1998</xref>). Another rule is for the last exon, where variants do not lead to transcript degradation (<xref ref-type="bibr" rid="B23">Le Hir et al., 2001</xref>). Recently, additional rules were suggested. Stop codons within 150 nucleotides from the start codon may not cause degradation (<xref ref-type="bibr" rid="B26">Lindeboom et al., 2016</xref>). According to the long exon rule, exons longer than 400 bp reduce efficiency of NMD (<xref ref-type="bibr" rid="B26">Lindeboom et al., 2016</xref>; <xref ref-type="bibr" rid="B15">Hoek et al., 2019</xref>). Numerous other factors affect NMD efficiency and NMD escape, including last exon size, re-initiation of transcription, splice site rescue <italic>etc.</italic>, see (<xref ref-type="bibr" rid="B6">Cummings et al., 2020</xref>; <xref ref-type="bibr" rid="B18">Karczewski et al., 2020</xref>).</p>
<p>Analysis of about 10,000 matched tumor exomes with the extended rule set suggested that 51% of PTCs trigger efficient NMD (<xref ref-type="bibr" rid="B27">Lindeboom et al., 2019</xref>). Additional 27% of PTC variants were predicted to have intermediate NMD effect. Thus, in total 78% of possible PTCs lead at least to partial NMD, which could cause severe effects. This means that 22% of PTCs could indeed be translated and lead to protein truncation. In summary, only about one out of four or five of theoretical truncations in proteins coded by multi-exon transcripts likely have correct annotation.</p>
<p>The degradation of transcripts by NMD means that the coded proteins are not produced. This is a knock-out -type variant and called <italic>missing protein</italic> (VariO:0240) in VariO (<xref ref-type="bibr" rid="B62">Vihinen, 2014</xref>). These alterations are among the most common disease-causing variants in many genetic disorders. Missing protein components distort protein complex stoichiometry and can lead to dominant negative phenotype (<xref ref-type="bibr" rid="B52">Veitia and Birchler, 2010</xref>). When complex-forming proteins have different abundances, the assembly of the complex is impaired and causes functional consequences.</p>
<p>Missing RNA (VariO:0245) and missing protein can be outcomes of several types of variations: substitutions, insertions, deletions, and indels (<xref ref-type="bibr" rid="B60">Vihinen, 2021b</xref>). Aberrant splicing that leads to exon skipping or inclusion with consequent reading frame alteration are common PTC-introducing variants (see <xref ref-type="fig" rid="F1">Figure 1</xref>). Aberrant splicing-causing variants can appear either in exon or intron.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Examples for outcomes of premature stop codon-introducing variants, original sequence to the left and the variant to the right. Systematics both with HGVS and VariO annotations is for observed or predicted outcomes of variants. <bold>(A)</bold> Substitution in <italic>DMD</italic> gene introduces a PTC, therefore the transcript is degraded and no protein in produced. HGVS annotation indicates imaginary effects on RNA and protein levels. Therefore, variants like this are considered as protein truncations, although no protein is produced. VariO annotations of missing RNA and missing protein describe the actual changes. On RNA level, annotation for non-sense variation can be added to indicate what is the reason for missing RNA. <bold>(B)</bold> Indel in <italic>BTK</italic> gene introduces out-of-frame indel at RNA with PTC. No protein is produced due to NMD. The correct HGVS annotations in both examples is on RNA level either r.0 or r (0) and on protein level p.0 or p.(0). HGVS annotations were produced with VariantValidator, which follows the HGVS annotation rules. Differences in the original and variant strands are in red and underlined. RNA and protein changes are in brackets as they are theoretical constructions, in reality the molecules do not exist.</p>
</caption>
<graphic xlink:href="fgene-14-1015017-g001.tif"/>
</fig>
<p>The problematic annotations originate from nomenclature, naming tools, annotations and databases. The HGVS nomenclature facilitates the systematic description of variations (<xref ref-type="bibr" rid="B9">den Dunnen and Antonarakis, 2001</xref>). It is widely used, and it makes computational searches and analyses possible, however, it still causes also some problems. An example from HGVS nomenclature recommendation for annotation of PTC effect on protein level highlights the issue (<xref ref-type="fig" rid="F1">Figure 1A</xref>): &#x201c;LRG_199p1:p.Trp24Ter (p.Trp24&#x2a;) amino acid Trp24 is changed to a stop codon (Ter, &#x2a;). NOTE: this change is not described as a deletion of the C-terminal end of the protein (i.e. p. Trp24_Met36853del)&#x201d; (HGVS website <ext-link ext-link-type="uri" xlink:href="http://varnomen.hgvs.org/recommendations/protein/variant/substitution/">http://varnomen.hgvs.org/recommendations/protein/variant/substitution/</ext-link>accesses 2 January 2023).</p>
<p>The isoform one of dystrophin is 3,685 amino acids long and composed of 79 exons. This variant at amino acid 24 is far from the end of the protein, thus the last exon or the 50&#xa0;nt rules do not apply. The exon is also less than 400 bp long. The site is close to the start codon in exon one and the start-proximal rule might therefore apply. This would mean that there is another functional translation initiation site, which indeed has been seen in certain dystrophin variants (<xref ref-type="bibr" rid="B12">Gurvich et al., 2009</xref>). If translation starts from exon 6, the codon for amino acid 24 is not on the translated region at all and cannot thus cause a stop codon. In summary, there is no termination at position 24 as there is no protein expression to be terminated. The HGVS note indicates that it is not annotated as a large C-terminal deletion, however, that is how people interpret the annotation. This protein is either not produced at all due to NMD, or it may be expressed from alternative translation start site; either way, the given protein annotation is incorrect. The correct annotation in the case of NMD is VariO:0240 <italic>missing protein</italic> when using a VariO term and p.(0) in HGVS annotation, in brackets to indicate prediction, or there is another transcript and the N-terminally shorter protein does not contain this amino acid.</p>
<p>The corresponding RNA variation is annotated according to HGVS as LRG_199t1:r.72g&#x3e;a. In the LOVD DMD database at <ext-link ext-link-type="uri" xlink:href="https://databases.lovd.nl/shared/genes/DMD">https://databases.lovd.nl/shared/genes/DMD</ext-link> RNA annotation is r.(?). The transcript is not produced, thus the annotation at RNA should be r.0 or r.(0), or if the alternative transcription initiation site is used based on the new transcript that does not contain this site. The corresponding VariO RNA annotations are VariO:0245 <italic>missing RNA</italic>, or in the new transcript. One could claim there to be an RNA substitution before the RNA is degraded, however, the final outcome is no transcript at all. In the DMD database an unsure RNA annotation r.(?). gives a protein annotation p.(Arg24Ter).</p>
<p>
<xref ref-type="fig" rid="F1">Figure 1B</xref> describes another example of PTC-related variation. Indel c.592_595delinsCTAACTACATA in <italic>BTK</italic> gene for Bruton tyrosine kinase (<xref ref-type="bibr" rid="B28">Lougaris et al., 2020</xref>) is annotated according to the HGVS nomenclature as p. Lys199Thrfs&#x2a;3 in protein level. The variant appears in exon eight among 19 exons. Coding region insertions, deletions and indels are either in-frame or out-of-frame, for extensive discussion of RNA variants see (<xref ref-type="bibr" rid="B60">Vihinen, 2021b</xref>). For variants to have a chance to be translated the introduced sequence change should be divisible by three to retain coding region and to prevent premature stop. Variants not divisible by three (theoretically two out of three cases) change the coding frame and typically introduce a PTC soon after the variation site. In this example, PTC is introduced in the middle of the transcript and detected by NMD. Therefore, both the RNA and protein annotations provided based on the HGVS nomenclature are incorrect. The correct annotations are r.(0) and p.(0). The PTC-related issues in variation nomenclature are distributed further by computational tools for variation naming, including Mutalyzer (<xref ref-type="bibr" rid="B24">Lefter et al., 2021</xref>), Variant Validator (<xref ref-type="bibr" rid="B11">Freeman et al., 2018</xref>) and hgvs package (<xref ref-type="bibr" rid="B13">Hart et al., 2015</xref>). These methods follow the HGVS nomenclature. Thereby, these problems appear also in variant databases, such as those maintained in LOVD (<xref ref-type="bibr" rid="B10">Fokkema et al., 2021</xref>). Variation annotation methods, as ANNOVAR and SnpEff, call such variants as stop gain(ed), but do not describe the outcome of the variants.</p>
<p>If a variant causing missing protein (p.0) is annotated as truncated protein, the users of the annotations may obtain a wrong idea, e.g. about residual activity. Other problems emerge when one wants to understand the causes, mechanisms and effects of these variants and when considering strategies to treat patients. In several neurodegenerative diseases true truncated proteins are mis-localized and cause characteristic disease phenotypes (<xref ref-type="bibr" rid="B16">Jadhav et al., 2013</xref>), whereas missing proteins are harmful due to missing activity. NMD activation has been considered as a therapy, e.g., for amyotrophic lateral sclerosis (ALS) to degrade PTC-containing transcripts and to prevent the production of harmful protein forms (<xref ref-type="bibr" rid="B17">Jaffrey and Wilkinson, 2018</xref>). Knowledge about whether a protein is truncated or missing can thus have fundamental clinical significance. Thereby, accurate annotation of such variants is of utmost importance.</p>
</sec>
<sec id="s4">
<title>4 Loss-of-function as variation type</title>
<p>Functional consequences of some variations can be called as loss-of-function (lof) or gain-of-function (gof), many variations do not have any functional effect being normal genetic heterogeneity (<xref ref-type="bibr" rid="B54">Vihinen, 2022b</xref>). Additional forms of functional effects are antimorphic (antagonistic) and neomorphic (new activity) variants, see (<xref ref-type="bibr" rid="B53">Vihinen, 2021a</xref>).</p>
<p>Lof or potential/predicted lof (plof) have been quite widely used as a variant type, e.g., in <ext-link ext-link-type="uri" xlink:href="https://macarthurlab.org/lof/">https://macarthurlab.org/lof/</ext-link>, although it is a functional effect. The practice may simplify terminology, but it generates also confusion and errors. Such predictions may be useful for guiding and prioritizing experimental studies, however, accurate lof and gof statements should always be based on experimental evidence (<xref ref-type="bibr" rid="B53">Vihinen, 2021a</xref>), because there is not a clear correlation between variation type and functional effect.</p>
<p>Plof variants have been defined as those with &#x201c;premature stop (stop-gained), shift-reported transcriptional frame (frameshift), or alter the two essential splice-site nucleotides immediately to the left and right of each exon (splice) found in protein-coding transcripts&#x201d; (<xref ref-type="bibr" rid="B18">Karczewski et al., 2020</xref>). Many such variants indeed cause lof, but not all. Even when there is a lof effect, it may not cause a phenotype. Large numbers of amino acid substitutions have also lof effect, but they have not been included in the (p)lof category.</p>
<p>Plof annotations (<xref ref-type="bibr" rid="B6">Cummings et al., 2020</xref>; <xref ref-type="bibr" rid="B18">Karczewski et al., 2020</xref>) are produced in an extensive predictive pipeline, which includes numerous rules, analyses and inferences to address different types of variations and various effects. The system has allowed correction of numerous prior errors and generation of a catalogue of plofs, which is very widely used for various purposes, including clinical variant interpretation. The resource is valuable for pinpointing possible lof variants, however, the users should remember that the provided annotations are predictions and are not accurate for all variants. Even when the plof annotation correctly indicates there to be a lof effect, it does not automatically mean that the variant has biological effect that would describe clinical or other consequences of the variant.</p>
<p>As discussed above, more than 20% of PTC variants escape NMD and may thus not be of lof type, for example because of having some residual activity. Similarly, the other types of variants do not have a lof effect in every case. Some splice disruptions, especially those causing exon skipping without RNA frameshift, can code for proteins with some residual activity, which may be sufficient for biological function (<xref ref-type="bibr" rid="B37">Oda et al., 2016</xref>; <xref ref-type="bibr" rid="B35">Nakamura et al., 2017</xref>). Exon skipping or intronic inclusion can be in-frame in mRNA and code for a protein and have some residual activity. Those variants with out-of-frame alterations typically have an early PTC and are detected by NMD and degraded.</p>
<p>Cells, tissues and organisms have numerous mechanisms that modulate and reduce effects of all kinds of perturbations, including genetic changes. Those mechanisms that reduce effects of variations are called TARAR countermeasures after tolerance, avoidance, repair, attenuation and resistance (<xref ref-type="bibr" rid="B53">Vihinen, 2021a</xref>; <xref ref-type="bibr" rid="B56">Vihinen, 2022c</xref>). Some of these mechanisms and processes are in-built, intrinsic, while others are active and mounted when there is a perturbation. These mechanisms return the system to sufficient and relevant extent called lagom (<xref ref-type="bibr" rid="B57">Vihinen, 2020a</xref>). In the case of the so-called lof variants, tolerance, repair, attenuation and even resistance are relevant. The human body is robust and can tolerate various perturbations, disease tolerance is an example (<xref ref-type="bibr" rid="B31">McCarville and Ayres, 2018</xref>). Repair countermeasures, including DNA repair, are highly relevant for phenotypic effects and for correction and rescue of genetic variants (<xref ref-type="bibr" rid="B54">Vihinen, 2022b</xref>). Redundant activities are one form of attenuation. Resistance countermeasures actively oppose effects of variants, DNA damage response (<xref ref-type="bibr" rid="B4">Ciccia and Elledge, 2010</xref>) is an example of DNA level resistance.</p>
<p>Reading frame errors can be corrected by revertant mosaicism, which restores partly or completely the wild type phenotype or activity by having reversion back to the original sequence (<xref ref-type="bibr" rid="B14">Hirschhorn, 2003</xref>). In addition to the actual revertants and other activity-restoring variants within the variant site, variants in other sites may compensate for the alteration, several such variants were seen in T-cell in a single patient (<xref ref-type="bibr" rid="B7">Davis et al., 2010</xref>).</p>
<p>Biallelic knock-outs of many human genes do not have a phenotype. Several studies have indicated different numbers of such genes, between 1,285 and 3,230 (<xref ref-type="bibr" rid="B29">MacArthur et al., 2012</xref>; <xref ref-type="bibr" rid="B49">Sulem et al., 2015</xref>; <xref ref-type="bibr" rid="B25">Lek et al., 2016</xref>; <xref ref-type="bibr" rid="B36">Narasimhan et al., 2016</xref>; <xref ref-type="bibr" rid="B41">Saleheen et al., 2017</xref>). The estimated number of human hemizygous knock-out variants is close to 100 and the number of complete knock-out variants is about 20 in healthy individuals (<xref ref-type="bibr" rid="B29">MacArthur et al., 2012</xref>). The knock-outs in these genes could be tolerated because the genes are not needed, at least in the normal living conditions, or there may be redundant activities in ohnologs, countermeasures reduce the effect, <italic>etc.</italic>
</p>
<p>Human genome, along with that of many other eukaryotes, is an outcome of two complete genome duplications (<xref ref-type="bibr" rid="B8">Dehal and Boore, 2005</xref>), which have produced numerous ohnologs, paralogs, of genes. There are still 7,358 human ohnolog pairs (<xref ref-type="bibr" rid="B46">Singh and Isambert, 2020</xref>) and thus a large number of potential redundant activities, which may complement actions of each other and prevent or reduce effects of variants in some cells or situations. Monogenic disease genes are enriched with duplicates and duplicated disease-related genes have higher similarity to their close paralogous genes than other genes (<xref ref-type="bibr" rid="B3">Chen et al., 2013</xref>). Redundancy likely masks a substantial number of variants that impair function in a paralog. Moonlighting activities and substrate and catalytic promiscuity of proteins further increase the complexity and reduce the effects of variants (<xref ref-type="bibr" rid="B53">Vihinen, 2021a</xref>). Incomplete penetrance is another confounding factor.</p>
<p>When describing lof effects, it is necessary to refer to what function and system and in which level the annotation relates to. A single variant can have different lof effects depending on the level. VariO (<xref ref-type="bibr" rid="B62">Vihinen, 2014</xref>) facilitates descriptions of DNA (<xref ref-type="bibr" rid="B59">Vihinen, 2018</xref>), RNA (<xref ref-type="bibr" rid="B60">Vihinen, 2021b</xref>) and protein (<xref ref-type="bibr" rid="B61">Vihinen, 2015b</xref>) variants, consequences, mechanism and functional effects. There are several functions on each of these levels. The DNA functions comprise catalytic activity, information transfer, DNA repair, DNA replication, regulation, reservoir of genetic material, and transcription. The functional effects at the RNA level are amino acid transfer, catalytic activity, regulation, information transfer, splicing function and translation. At the protein level there are seven functions: catalysis, information transfer, movement, recognition, storage, structural protein, and transport activity. Functional effects thus should be described on all the relevant levels and the affected function(s) indicated instead of just claiming variant to be (p)lof. In case there are several transcripts or proteoforms, the functional descriptions should be provided separately for each of them.</p>
<p>A further complication in functional effect discussion is that functional and biological effect may differ. Biological systems are robust and can tolerate even substantial alterations. Several enzymopathies, enzyme-related diseases, display a phenotype only when there is a substantial loss of activity in comparison to normal activity (<xref ref-type="bibr" rid="B53">Vihinen, 2021a</xref>). For example, in different forms of hemophilia, 90% or more of the normal activity has to be lost for a patient to display a severe phenotype (<xref ref-type="bibr" rid="B39">Peyvandi et al., 2012</xref>; <xref ref-type="bibr" rid="B48">Srivastava et al., 2013</xref>). Thus, a variant annotated as categorical (p)lof and indeed having a substantially reduced function may not cause a disease. There are also examples where much smaller activity loss is disease or phenotype-causing (<xref ref-type="bibr" rid="B53">Vihinen, 2021a</xref>). In haploinsufficiency, loss of one allele, i.e. 50% of activity, causes the disease in the case of biallelic expression (<xref ref-type="bibr" rid="B33">Morrill and Amon, 2019</xref>). It is therefore essential to understand the relevance of the level of activity loss, for extended discussion see (<xref ref-type="bibr" rid="B53">Vihinen, 2021a</xref>). For function discussion, the biological function threshold should be defined, but these data are missing for most proteins, even for enzymes, for which the thresholds are among the easiest to define. Therefore, categorical functional annotations cannot apply to all variants and genes/proteins.</p>
<p>The lof annotation issue originates mainly from annotation tools and databases. For example, ANNOVAR (<xref ref-type="bibr" rid="B64">Wang et al., 2010</xref>), VEP (<xref ref-type="bibr" rid="B32">McLaren et al., 2016</xref>) and Loss-of-Function Transcript Effect Estimator (LOFTEE, <ext-link ext-link-type="uri" xlink:href="https://github.com/konradjk/loftee">https://github.com/konradjk/loftee</ext-link>) assemble variants to the lof category based on the types of variations. Widely used databases Exome Aggregation Consortium (ExAC) database (<xref ref-type="bibr" rid="B19">Karczewski et al., 2017</xref>) and Genome Aggregation Database (gnomAD) (<xref ref-type="bibr" rid="B18">Karczewski et al., 2020</xref>) are built on the plof principle. Variation type is not a straightforward proxy for functional effects. The models of these tools are too simple to consider the processes, mechanisms, pathways and systems that affect functional effects in living organisms (<xref ref-type="bibr" rid="B53">Vihinen, 2021a</xref>).</p>
<p>Similar to lof variants, there are problems also in the use of term gof. It can refer to different types of effects, which are not necessarily clear from the context and the biological impact of which is not clear. Further, the biological relevance of the increased activity is often not known. Many gof and lof effects are without biological consequences and fall within normal biological heterogeneity.</p>
</sec>
<sec id="s5">
<title>5 Synonymous variants</title>
<p>RNA substitutions are typically, and wrongly, divided into three categories: missense, non-sense and synonymous. Note that these terms are relevant only for mRNA sequences, not for DNA or protein variants, see (<xref ref-type="bibr" rid="B55">Vihinen, 2015a</xref>). Use of the three categories completely misses the fourth class where variants may look from surface as synonymous, but which in fact affect protein sequence, typically causing missing protein due to NMD degradation of PTC-containing mRNA transcripts (<xref ref-type="bibr" rid="B63">Vihinen, 2022a</xref>).</p>
<p>When substitutions are looked only from the perspective of genetic code, many harmful variants are misclassified as synonymous. Even when the variant is synonymous, it can affect DNA, RNA or protein and have non-synonymous effects (<xref ref-type="bibr" rid="B63">Vihinen, 2022a</xref>). In DNA, such variants can alter transcription factor binding sites. Even when the coded protein sequence is unaltered, protein activity, structure or regulation may be altered due to a synonymous mRNA change (<xref ref-type="bibr" rid="B43">Sauna and Kimchi-Sarfaty, 2011</xref>; <xref ref-type="bibr" rid="B44">Shabalina et al., 2013</xref>).</p>
<p>Several apparently synonymous variations are in fact non-synonymous. To facilitate annotation of such variants they were recently renamed as unsense variants (<xref ref-type="bibr" rid="B63">Vihinen, 2022a</xref>). The reason why they are not synonymous, although the coding frame looks intact, is that they affect mRNA splicing or splicing regulation or impair regulatory miRNA binding and thus affect protein abundance, often leading to missing protein because of NMD. If looking only at the codon table, these variants look like synonymous. Exonic splice site (<xref ref-type="bibr" rid="B65">Wehr et al., 2018</xref>), ESS and ESE (<xref ref-type="bibr" rid="B51">Tonin et al., 2019</xref>) variants can impair splicing, lead to various effects and alter the sequence, frequently introducing a PTC. miRNA binding site variants also in the coding region can affect regulation or gene expression and even prevent protein production (<xref ref-type="bibr" rid="B50">Tay et al., 2008</xref>). All these variants are systematically ignored by current practice and thus provide wrong annotations.</p>
<p>Unsense variation is described as VariO:0514 &#x201c;Substitution in mRNA coding region that affects gene expression, protein or protein production without introducing a stop codon in the variation site&#x201d;. Since unsense variants are currently not annotated, they are largely ignored in clinical variation interpretation and considered irrelevant or of low importance. In reality, these variants have substantial effects and are related to numerous diseases. Causes for these misclassifications are lack of awareness, missing terminology in systematics, and errors in annotation. Therefore, it is important to bear in mind that apparent synonymous variants may be non-synonymous and have various effects. Variants should be called synonymous only based on experimental evidence or reliable predictions (which are not currently available).</p>
</sec>
<sec id="s6">
<title>6 Getting it right</title>
<p>Three factors are instrumental for correcting the discussed challenges and others not covered in here. First, geneticists and other users of genetic data have to become aware of these issues. Second, systematics such as variation nomenclature has to be updated. Third, computer programs, databases and other information resources have to be improved.</p>
<p>Education and increased awareness are key factors in correcting the errors. Full understanding of variation effects is a demanding task and in addition to genetics, the practitioners have to understand proteins, their functions, interactions, structures, networks <italic>etc.</italic> Currently, the field is genetics dominated, which may partially explain also the problems discussed above. Many of the functional effects of variants manifest at protein level, which may not be that familiar a field for geneticists.</p>
<p>Genetics societies should provide educational material to train new and establishes scientists in use of proper nomenclature including the issues introduced above. Text books should be updated to include the latest developments in variation types and effects, as well that in their naming and annotation. In addition, scientific meetings could cover issues listed above and others to increase awareness and to educate the community on correct and accurate practices.</p>
<p>Many systematic classifications relate to variation and variation description. Variation annotation with HGVS nomenclature has to provide terms for all possible variations and outcomes. Relevant terminology has to be introduced (when missing), users trained in correct use of terms and work towards developing exact naming conventions for all kinds of variants. This is already largely in place, but there are still issues to introduce and develop. Practices and pipelines for predictions of functional effects has to be improved and in the mean time warnings could be added to inform users of possible errors and problems. Again, awareness is a key factor. In addition, other systematics relevant for variation data has to be updated, whenever relevant. For example, VariO has ben updated to be consistent.</p>
<p>Even if perfect systematics existed, it is not valuable if not used. Variant annotation and analysis are largely based on computational tools. Therefore, the annotation tools, programs for variant naming, and databases distributing the generated data have to update their routines and pay special attention for currently neglected, ignored and misnamed types of variations.</p>
<p>Variation interpretation for genetic diagnosis in many countries is based on the American College of Medical Genetics and Genomics and the Association of Molecular Pathology (ACMG/AMP) guidelines and standards (<xref ref-type="bibr" rid="B40">Richards et al., 2015</xref>). These guidelines have been instrumental in systematizing variation interpretation and in providing more reliable interpretation. The guidelines have also some problems, see, e.g., (<xref ref-type="bibr" rid="B21">Kim et al., 2019</xref>; <xref ref-type="bibr" rid="B58">Vihinen, 2020b</xref>). I will discuss here how the issues discussed above should be taken into account in variation interpretation according to ACMG/AMP guidelines.</p>
<p>Computational and predictive data, benign supporting BP7: silent variant with non-predicted splice impact. This category relates to many unsense variants. People should pay attention to synonymous variations in exon/intron boundaries but also inside exons as they may not be silent.</p>
<p>PM4, protein length changing variants are currently considered to have moderate support for pathogenic assessment. As many protein truncations are misclassifications, it is important to be careful with such variants. Missing protein instead of truncation is usually a strong evidence for pathogenicity.</p>
<p>PS3, well established functional studies show deleterious effect. The functional studies have to take into account the biological system, since TARAR countermeasures reduce harmful effects of even some substantial change-causing variants. It is not sufficient e.g. to study protein activity just <italic>in vitro</italic>.</p>
<p>BP6 and PP5 are for reputable source of information in other database(s). These may refer e.g. to ExAC, gnomAD and LOVD databases. It is important to consider the quality of these sources and potential problems in them case by case and for different variation types.</p>
<p>For PVS1 criterion, a detailed decision tree has been elaborated for different types of lofs (<xref ref-type="bibr" rid="B1">Abou Tayoun et al., 2018</xref>). It takes into account the variation type, possible NMD, splicing effects and other factors. The outcome of the evaluation varies widely from PVS to supporting evidence and there are also cases where annotation cannot be made.</p>
<p>It will never be possible to automatically annotate all variants correctly since many factors contribute to the phenotypes of variants. Therefore, experimental research will always be needed. It should be supported by reliable predictions and annotations that could highlight potential cases for further investigation.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s7">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s8">
<title>Author contributions</title>
<p>The author confirms being the sole contributor of this work and has approved it for publication.</p>
</sec>
<sec id="s9">
<title>Funding</title>
<p>This research was funded by Vetenskapsr&#xe5;det (2019-01403) and Swedish Cancer Society (CAN 20 1350).</p>
</sec>
<sec sec-type="COI-statement" id="s10">
<title>Conflict of interest</title>
<p>The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s11">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abou Tayoun</surname>
<given-names>A. N.</given-names>
</name>
<name>
<surname>Pesaran</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>DiStefano</surname>
<given-names>M. T.</given-names>
</name>
<name>
<surname>Oza</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Rehm</surname>
<given-names>H. L.</given-names>
</name>
<name>
<surname>Biesecker</surname>
<given-names>L. G.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Recommendations for interpreting the loss of function PVS1 ACMG/AMP variant criterion</article-title>. <source>Hum. Mutat.</source> <volume>39</volume>, <fpage>1517</fpage>&#x2013;<lpage>1524</lpage>. <pub-id pub-id-type="doi">10.1002/humu.23626</pub-id>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Auton</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Brooks</surname>
<given-names>L. D.</given-names>
</name>
<name>
<surname>Durbin</surname>
<given-names>R. M.</given-names>
</name>
<name>
<surname>Garrison</surname>
<given-names>E. P.</given-names>
</name>
<name>
<surname>Kang</surname>
<given-names>H. M.</given-names>
</name>
<name>
<surname>Korbel</surname>
<given-names>J. O.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>A global reference for human genetic variation</article-title>. <source>Nature</source> <volume>526</volume>, <fpage>68</fpage>&#x2013;<lpage>74</lpage>. <pub-id pub-id-type="doi">10.1038/nature15393</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>W. H.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>X. M.</given-names>
</name>
<name>
<surname>van Noort</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Bork</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Human monogenic disease genes have frequently functionally redundant paralogs</article-title>. <source>PLoS Comput. Biol.</source> <volume>9</volume>, <fpage>e1003073</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1003073</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ciccia</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Elledge</surname>
<given-names>S. J.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>The DNA damage response: Making it safe to play with knives</article-title>. <source>Mol. Cell</source> <volume>40</volume>, <fpage>179</fpage>&#x2013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1016/j.molcel.2010.09.019</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cingolani</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Platts</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>L. L.</given-names>
</name>
<name>
<surname>Coon</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Nguyen</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of <italic>Drosophila melanogaster</italic> strain w1118; iso-2; iso-3</article-title>. <source>Fly. (Austin)</source> <volume>6</volume>, <fpage>80</fpage>&#x2013;<lpage>92</lpage>. <pub-id pub-id-type="doi">10.4161/fly.19695</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cummings</surname>
<given-names>B. B.</given-names>
</name>
<name>
<surname>Karczewski</surname>
<given-names>K. J.</given-names>
</name>
<name>
<surname>Kosmicki</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Seaby</surname>
<given-names>E. G.</given-names>
</name>
<name>
<surname>Watts</surname>
<given-names>N. A.</given-names>
</name>
<name>
<surname>Singer-Berk</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Transcript expression-aware annotation improves rare variant interpretation</article-title>. <source>Nature</source> <volume>581</volume>, <fpage>452</fpage>&#x2013;<lpage>458</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-020-2329-2</pub-id>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Davis</surname>
<given-names>B. R.</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Bui</surname>
<given-names>J. H.</given-names>
</name>
<name>
<surname>Felix</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Moratto</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Muul</surname>
<given-names>L. M.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>Somatic mosaicism in the wiskott-aldrich syndrome: Molecular and functional characterization of genotypic revertants</article-title>. <source>Clin. Immunol.</source> <volume>135</volume>, <fpage>72</fpage>&#x2013;<lpage>83</lpage>. <pub-id pub-id-type="doi">10.1016/j.clim.2009.12.011</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dehal</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Boore</surname>
<given-names>J. L.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Two rounds of whole genome duplication in the ancestral vertebrate</article-title>. <source>PLoS Biol.</source> <volume>3</volume>, <fpage>e314</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pbio.0030314</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>den Dunnen</surname>
<given-names>J. T.</given-names>
</name>
<name>
<surname>Antonarakis</surname>
<given-names>S. E.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Nomenclature for the description of human sequence variations</article-title>. <source>Hum. Genet.</source> <volume>109</volume>, <fpage>121</fpage>&#x2013;<lpage>124</lpage>. <pub-id pub-id-type="doi">10.1007/s004390100505</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fokkema</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Kroon</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Hern&#xe1;ndez</surname>
<given-names>L. J. A.</given-names>
</name>
<name>
<surname>Asscheman</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Lugtenburg</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Hoogenboom</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>The LOVD3 platform: Efficient genome-wide sharing of genetic variants</article-title>. <source>Eur. J. Hum. Genet.</source> <volume>29</volume>, <fpage>1796</fpage>&#x2013;<lpage>1803</lpage>. <pub-id pub-id-type="doi">10.1038/s41431-021-00959-x</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Freeman</surname>
<given-names>P. J.</given-names>
</name>
<name>
<surname>Hart</surname>
<given-names>R. K.</given-names>
</name>
<name>
<surname>Gretton</surname>
<given-names>L. J.</given-names>
</name>
<name>
<surname>Brookes</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Dalgleish</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>VariantValidator: Accurate validation, mapping, and formatting of sequence variation descriptions</article-title>. <source>Hum. Mutat.</source> <volume>39</volume>, <fpage>61</fpage>&#x2013;<lpage>68</lpage>. <pub-id pub-id-type="doi">10.1002/humu.23348</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gurvich</surname>
<given-names>O. L.</given-names>
</name>
<name>
<surname>Maiti</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Weiss</surname>
<given-names>R. B.</given-names>
</name>
<name>
<surname>Aggarwal</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Howard</surname>
<given-names>M. T.</given-names>
</name>
<name>
<surname>Flanigan</surname>
<given-names>K. M.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>DMD exon 1 truncating point mutations: Amelioration of phenotype by alternative translation initiation in exon 6</article-title>. <source>Hum. Mutat.</source> <volume>30</volume>, <fpage>633</fpage>&#x2013;<lpage>640</lpage>. <pub-id pub-id-type="doi">10.1002/humu.20913</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hart</surname>
<given-names>R. K.</given-names>
</name>
<name>
<surname>Rico</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Hare</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Garcia</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Westbrook</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Fusaro</surname>
<given-names>V. A.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>A Python package for parsing, validating, mapping and formatting sequence variants using HGVS nomenclature</article-title>. <source>Bioinformatics</source> <volume>31</volume>, <fpage>268</fpage>&#x2013;<lpage>270</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu630</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hirschhorn</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>
<italic>In vivo</italic> reversion to normal of inherited mutations in humans</article-title>. <source>J. Med. Genet.</source> <volume>40</volume>, <fpage>721</fpage>&#x2013;<lpage>728</lpage>. <pub-id pub-id-type="doi">10.1136/jmg.40.10.721</pub-id>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hoek</surname>
<given-names>T. A.</given-names>
</name>
<name>
<surname>Khuperkar</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Lindeboom</surname>
<given-names>R. G. H.</given-names>
</name>
<name>
<surname>Sonneveld</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Verhagen</surname>
<given-names>B. M. P.</given-names>
</name>
<name>
<surname>Boersma</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Single-molecule imaging uncovers rules governing nonsense-mediated mRNA decay</article-title>. <source>Mol. Cell</source> <volume>75</volume>, <fpage>324</fpage>&#x2013;<lpage>339</lpage>. <pub-id pub-id-type="doi">10.1016/j.molcel.2019.05.008</pub-id>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jadhav</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Zilka</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Novak</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Protein truncation as a common denominator of human neurodegenerative foldopathies</article-title>. <source>Mol. Neurobiol.</source> <volume>48</volume>, <fpage>516</fpage>&#x2013;<lpage>532</lpage>. <pub-id pub-id-type="doi">10.1007/s12035-013-8440-8</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jaffrey</surname>
<given-names>S. R.</given-names>
</name>
<name>
<surname>Wilkinson</surname>
<given-names>M. F.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Nonsense-mediated RNA decay in the brain: Emerging modulator of neural development and disease</article-title>. <source>Nat. Rev. Neurosci.</source> <volume>19</volume>, <fpage>715</fpage>&#x2013;<lpage>728</lpage>. <pub-id pub-id-type="doi">10.1038/s41583-018-0079-z</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Karczewski</surname>
<given-names>K. J.</given-names>
</name>
<name>
<surname>Francioli</surname>
<given-names>L. C.</given-names>
</name>
<name>
<surname>Tiao</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Cummings</surname>
<given-names>B. B.</given-names>
</name>
<name>
<surname>Alf&#xf6;ldi</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Q.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>The mutational constraint spectrum quantified from variation in 141, 456 humans</article-title>. <source>Nature</source> <volume>581</volume>, <fpage>434</fpage>&#x2013;<lpage>443</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-020-2308-7</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Karczewski</surname>
<given-names>K. J.</given-names>
</name>
<name>
<surname>Weisburd</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Thomas</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Solomonson</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ruderfer</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Kavanagh</surname>
<given-names>D.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>The ExAC browser: Displaying reference data information from over 60 000 exomes</article-title>. <source>Nucleic Acids Res.</source> <volume>45</volume>, <fpage>D840</fpage>&#x2013;<lpage>D845</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw971</pub-id>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kaushal</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>N-terminomics - its past and recent advancements</article-title>. <source>J. Proteomics</source> <volume>233</volume>, <fpage>104089</fpage>. <pub-id pub-id-type="doi">10.1016/j.jprot.2020.104089</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kim</surname>
<given-names>Y. E.</given-names>
</name>
<name>
<surname>Ki</surname>
<given-names>C. S.</given-names>
</name>
<name>
<surname>Jang</surname>
<given-names>M. A.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Challenges and considerations in sequence variant interpretation for Mendelian disorders</article-title>. <source>Ann. Lab. Med.</source> <volume>39</volume>, <fpage>421</fpage>&#x2013;<lpage>429</lpage>. <pub-id pub-id-type="doi">10.3343/alm.2019.39.5.421</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kurosaki</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Popp</surname>
<given-names>M. W.</given-names>
</name>
<name>
<surname>Maquat</surname>
<given-names>L. E.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Quality and quantity control of gene expression by nonsense-mediated mRNA decay</article-title>. <source>Nat. Rev. Mol. Cell Biol.</source> <volume>20</volume>, <fpage>406</fpage>&#x2013;<lpage>420</lpage>. <pub-id pub-id-type="doi">10.1038/s41580-019-0126-2</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Le Hir</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Gatfield</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Izaurralde</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Moore</surname>
<given-names>M. J.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>The exon-exon junction complex provides a binding platform for factors involved in mRNA export and nonsense-mediated mRNA decay</article-title>. <source>Embo J.</source> <volume>20</volume>, <fpage>4987</fpage>&#x2013;<lpage>4997</lpage>. <pub-id pub-id-type="doi">10.1093/emboj/20.17.4987</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lefter</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Vis</surname>
<given-names>J. K.</given-names>
</name>
<name>
<surname>Vermaat</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>den Dunnen</surname>
<given-names>J. T.</given-names>
</name>
<name>
<surname>Taschner</surname>
<given-names>P. E. M.</given-names>
</name>
<name>
<surname>Laros</surname>
<given-names>J. F. J.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Next generation HGVS nomenclature checker</article-title>. <source>Bioinformatics</source> <volume>37</volume>, <fpage>2811</fpage>&#x2013;<lpage>2817</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btab051</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lek</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Karczewski</surname>
<given-names>K. J.</given-names>
</name>
<name>
<surname>Minikel</surname>
<given-names>E. V.</given-names>
</name>
<name>
<surname>Samocha</surname>
<given-names>K. E.</given-names>
</name>
<name>
<surname>Banks</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Fennell</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Analysis of protein-coding genetic variation in 60, 706 humans</article-title>. <source>Nature</source> <volume>536</volume>, <fpage>285</fpage>&#x2013;<lpage>291</lpage>. <pub-id pub-id-type="doi">10.1038/nature19057</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lindeboom</surname>
<given-names>R. G.</given-names>
</name>
<name>
<surname>Supek</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Lehner</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>The rules and impact of nonsense-mediated mRNA decay in human cancers</article-title>. <source>Nat. Genet.</source> <volume>48</volume>, <fpage>1112</fpage>&#x2013;<lpage>1118</lpage>. <pub-id pub-id-type="doi">10.1038/ng.3664</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lindeboom</surname>
<given-names>R. G. H.</given-names>
</name>
<name>
<surname>Vermeulen</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Lehner</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Supek</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>The impact of nonsense-mediated mRNA decay on genetic disease, gene editing and cancer immunotherapy</article-title>. <source>Nat. Genet.</source> <volume>51</volume>, <fpage>1645</fpage>&#x2013;<lpage>1651</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-019-0517-5</pub-id>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lougaris</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Soresina</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Baronio</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Montin</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Martino</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Signa</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Long-term follow-up of 168 patients with X-linked agammaglobulinemia reveals increased morbidity and mortality</article-title>. <source>J. Allergy Clin. Immunol.</source> <volume>146</volume>, <fpage>429</fpage>&#x2013;<lpage>437</lpage>. <pub-id pub-id-type="doi">10.1016/j.jaci.2020.03.001</pub-id>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>MacArthur</surname>
<given-names>D. G.</given-names>
</name>
<name>
<surname>Balasubramanian</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Frankish</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Morris</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Walter</surname>
<given-names>K.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>A systematic survey of loss-of-function variants in human protein-coding genes</article-title>. <source>Science</source> <volume>335</volume>, <fpage>823</fpage>&#x2013;<lpage>828</lpage>. <pub-id pub-id-type="doi">10.1126/science.1215040</pub-id>
</citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McCarthy</surname>
<given-names>D. J.</given-names>
</name>
<name>
<surname>Humburg</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Kanapin</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Rivas</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Gaulton</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Cazier</surname>
<given-names>J. B.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Choice of transcripts and software has a large effect on variant annotation</article-title>. <source>Genome Med.</source> <volume>6</volume>, <fpage>26</fpage>. <pub-id pub-id-type="doi">10.1186/gm543</pub-id>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McCarville</surname>
<given-names>J. L.</given-names>
</name>
<name>
<surname>Ayres</surname>
<given-names>J. S.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Disease tolerance: Concept and mechanisms</article-title>. <source>Curr. Opin. Immunol.</source> <volume>50</volume>, <fpage>88</fpage>&#x2013;<lpage>93</lpage>. <pub-id pub-id-type="doi">10.1016/j.coi.2017.12.003</pub-id>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McLaren</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Gil</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Hunt</surname>
<given-names>S. E.</given-names>
</name>
<name>
<surname>Riat</surname>
<given-names>H. S.</given-names>
</name>
<name>
<surname>Ritchie</surname>
<given-names>G. R.</given-names>
</name>
<name>
<surname>Thormann</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>The Ensembl variant effect predictor</article-title>. <source>Genome Biol.</source> <volume>17</volume>, <fpage>122</fpage>. <pub-id pub-id-type="doi">10.1186/s13059-016-0974-4</pub-id>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Morrill</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Amon</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Why haploinsufficiency persists</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>116</volume>, <fpage>11866</fpage>&#x2013;<lpage>11871</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1900437116</pub-id>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nagy</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Maquat</surname>
<given-names>L. E.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>A rule for termination-codon position within intron-containing genes: When nonsense affects RNA abundance</article-title>. <source>Trends Biochem. Sci.</source> <volume>23</volume>, <fpage>198</fpage>&#x2013;<lpage>199</lpage>. <pub-id pub-id-type="doi">10.1016/s0968-0004(98)01208-0</pub-id>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nakamura</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Shiba</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Miyazaki</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Nishizawa</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Inaba</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Fueki</surname>
<given-names>N.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Comparison of the phenotypes of patients harboring in-frame deletions starting at exon 45 in the Duchenne muscular dystrophy gene indicates potential for the development of exon skipping therapy</article-title>. <source>J. Hum. Genet.</source> <volume>62</volume>, <fpage>459</fpage>&#x2013;<lpage>463</lpage>. <pub-id pub-id-type="doi">10.1038/jhg.2016.152</pub-id>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Narasimhan</surname>
<given-names>V. M.</given-names>
</name>
<name>
<surname>Hunt</surname>
<given-names>K. A.</given-names>
</name>
<name>
<surname>Mason</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Baker</surname>
<given-names>C. L.</given-names>
</name>
<name>
<surname>Karczewski</surname>
<given-names>K. J.</given-names>
</name>
<name>
<surname>Barnes</surname>
<given-names>M. R.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Health and population effects of rare gene knockouts in adult humans with related parents</article-title>. <source>Science</source> <volume>352</volume>, <fpage>474</fpage>&#x2013;<lpage>477</lpage>. <pub-id pub-id-type="doi">10.1126/science.aac8624</pub-id>
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Oda</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Sato</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Kunishima</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Nakagawa</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Izawa</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Hiejima</surname>
<given-names>E.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Exon skipping causes atypical phenotypes associated with a loss-of-function mutation in FLNA by restoring its protein function</article-title>. <source>Eur. J. Hum. Genet.</source> <volume>24</volume>, <fpage>408</fpage>&#x2013;<lpage>414</lpage>. <pub-id pub-id-type="doi">10.1038/ejhg.2015.119</pub-id>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Park</surname>
<given-names>K. J.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>J. H.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Variations in nomenclature of clinical variants between annotation tools</article-title>. <source>Lab. Med.</source> <volume>53</volume>, <fpage>242</fpage>&#x2013;<lpage>245</lpage>. <pub-id pub-id-type="doi">10.1093/labmed/lmab074</pub-id>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peyvandi</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Di Michele</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Bolton-Maggs</surname>
<given-names>P. H.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>Tripodi</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Srivastava</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Classification of rare bleeding disorders (RBDs) based on the association between coagulant factor activity and clinical bleeding severity</article-title>. <source>J. Thromb. Haemost.</source> <volume>10</volume>, <fpage>1938</fpage>&#x2013;<lpage>1943</lpage>. <pub-id pub-id-type="doi">10.1111/j.1538-7836.2012.04844.x</pub-id>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Richards</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Aziz</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Bale</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bick</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Das</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Gastier-Foster</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American College of medical genetics and Genomics and the association for molecular Pathology</article-title>. <source>Genet. Med.</source> <volume>17</volume>, <fpage>405</fpage>&#x2013;<lpage>424</lpage>. <pub-id pub-id-type="doi">10.1038/gim.2015.30</pub-id>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saleheen</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Natarajan</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Armean</surname>
<given-names>I. M.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Rasheed</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Khetarpal</surname>
<given-names>S. A.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Human knockouts and phenotypic analysis in a cohort with a high rate of consanguinity</article-title>. <source>Nature</source> <volume>544</volume>, <fpage>235</fpage>&#x2013;<lpage>239</lpage>. <pub-id pub-id-type="doi">10.1038/nature22034</pub-id>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Samuels</surname>
<given-names>D. C.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Is it time to reassess variant annotation?</article-title> <source>Trends Genet.</source> <volume>38</volume>, <fpage>521</fpage>&#x2013;<lpage>523</lpage>. <pub-id pub-id-type="doi">10.1016/j.tig.2022.02.002</pub-id>
</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sauna</surname>
<given-names>Z. E.</given-names>
</name>
<name>
<surname>Kimchi-Sarfaty</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Understanding the contribution of synonymous mutations to human disease</article-title>. <source>Nat. Rev. Genet.</source> <volume>12</volume>, <fpage>683</fpage>&#x2013;<lpage>691</lpage>. <pub-id pub-id-type="doi">10.1038/nrg3051</pub-id>
</citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shabalina</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Spiridonov</surname>
<given-names>N. A.</given-names>
</name>
<name>
<surname>Kashina</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Sounds of silence: Synonymous nucleotides as a key to biological regulation and complexity</article-title>. <source>Nucleic Acids Res.</source> <volume>41</volume>, <fpage>2073</fpage>&#x2013;<lpage>2094</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gks1205</pub-id>
</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sharma</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Toledo</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Hedden</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Lyon</surname>
<given-names>K. F.</given-names>
</name>
<name>
<surname>Brooks</surname>
<given-names>S. B.</given-names>
</name>
<name>
<surname>David</surname>
<given-names>R. P.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>The functional human C-terminome</article-title>. <source>PLoS One</source> <volume>11</volume>, <fpage>e0152731</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0152731</pub-id>
</citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Singh</surname>
<given-names>P. P.</given-names>
</name>
<name>
<surname>Isambert</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>OHNOLOGS v2: A comprehensive resource for the genes retained from whole genome duplication in vertebrates</article-title>. <source>Nucleic Acids Res.</source> <volume>48</volume>, <fpage>D724</fpage>&#x2013;<lpage>D730</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz909</pub-id>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Smith</surname>
<given-names>L. M.</given-names>
</name>
<name>
<surname>Kelleher</surname>
<given-names>N. L.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Proteoform: A single term describing protein complexity</article-title>. <source>Nat. Methods</source> <volume>10</volume>, <fpage>186</fpage>&#x2013;<lpage>187</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.2369</pub-id>
</citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Srivastava</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Brewer</surname>
<given-names>A. K.</given-names>
</name>
<name>
<surname>Mauser-Bunschoten</surname>
<given-names>E. P.</given-names>
</name>
<name>
<surname>Key</surname>
<given-names>N. S.</given-names>
</name>
<name>
<surname>Kitchen</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Llinas</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>Guidelines for the management of hemophilia</article-title>. <source>Haemophilia</source> <volume>19</volume>, <fpage>e1</fpage>&#x2013;<lpage>e47</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-2516.2012.02909.x</pub-id>
</citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sulem</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Helgason</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Oddson</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Stefansson</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Gudjonsson</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Zink</surname>
<given-names>F.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Identification of a large set of rare complete human knockouts</article-title>. <source>Nat. Genet.</source> <volume>47</volume>, <fpage>448</fpage>&#x2013;<lpage>452</lpage>. <pub-id pub-id-type="doi">10.1038/ng.3243</pub-id>
</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tay</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Thomson</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Lim</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Rigoutsos</surname>
<given-names>I.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>MicroRNAs to Nanog, Oct4 and Sox2 coding regions modulate embryonic stem cell differentiation</article-title>. <source>Nature</source> <volume>455</volume>, <fpage>1124</fpage>&#x2013;<lpage>1128</lpage>. <pub-id pub-id-type="doi">10.1038/nature07299</pub-id>
</citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tonin</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Catarzi</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Caciotti</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Procopio</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Marini</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Guerrini</surname>
<given-names>R.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Progressive myoclonus epilepsy in Gaucher Disease due to a new Gly-Gly mutation causing loss of an Exonic Splicing Enhancer</article-title>. <source>J. Neurol.</source> <volume>266</volume>, <fpage>92</fpage>&#x2013;<lpage>101</lpage>. <pub-id pub-id-type="doi">10.1007/s00415-018-9084-4</pub-id>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Veitia</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Birchler</surname>
<given-names>J. A.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Dominance and gene dosage balance in health and disease: Why levels matter</article-title>. <source>J. Pathol.</source> <volume>220</volume>, <fpage>174</fpage>&#x2013;<lpage>185</lpage>. <pub-id pub-id-type="doi">10.1002/path.2623</pub-id>
</citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vihinen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Functional effects of protein variants</article-title>. <source>Biochimie</source> <volume>180</volume>, <fpage>104</fpage>&#x2013;<lpage>120</lpage>. <pub-id pub-id-type="doi">10.1016/j.biochi.2020.10.009</pub-id>
</citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vihinen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Individual genetic heterogeneity</article-title>. <source>Genes (Basel)</source> <volume>13</volume>, <fpage>1626</fpage>. <pub-id pub-id-type="doi">10.3390/genes13091626</pub-id>
</citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vihinen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Muddled genetic terms miss and mess the message</article-title>. <source>Trends Genet.</source> <volume>31</volume>, <fpage>423</fpage>&#x2013;<lpage>425</lpage>. <pub-id pub-id-type="doi">10.1016/j.tig.2015.05.008</pub-id>
</citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vihinen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Generic model for biological regulation</article-title>. <source>F1000Res.</source> <volume>11</volume>, <fpage>419</fpage>. <pub-id pub-id-type="doi">10.12688/f1000research.110944.1</pub-id>
</citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vihinen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Poikilosis &#x2013; pervasive biological variation</article-title>. <source>F1000Res</source>. <volume>9</volume>, <fpage>602</fpage>. <pub-id pub-id-type="doi">10.12688/f1000research.24173.2</pub-id>
</citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vihinen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Problems in variation interpretation guidelines and in their implementation in computational tools</article-title>. <source>Mol. Genet. Genomic Med.</source> <volume>8</volume>, <fpage>e1206</fpage>. <pub-id pub-id-type="doi">10.1002/mgg3.1206</pub-id>
</citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vihinen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Systematics for types and effects of DNA variations</article-title>. <source>BMC Genomics</source> <volume>19</volume>, <fpage>974</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-018-5262-0</pub-id>
</citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vihinen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Systematics for types and effects of RNA variations</article-title>. <source>RNA Biol.</source> <volume>18</volume>, <fpage>481</fpage>&#x2013;<lpage>498</lpage>. <pub-id pub-id-type="doi">10.1080/15476286.2020.1817266</pub-id>
</citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vihinen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Types and effects of protein variations</article-title>. <source>Hum. Genet.</source> <volume>134</volume>, <fpage>405</fpage>&#x2013;<lpage>421</lpage>. <pub-id pub-id-type="doi">10.1007/s00439-015-1529-6</pub-id>
</citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vihinen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Variation Ontology for annotation of variation effects and mechanisms</article-title>. <source>Genome Res.</source> <volume>24</volume>, <fpage>356</fpage>&#x2013;<lpage>364</lpage>. <pub-id pub-id-type="doi">10.1101/gr.157495.113</pub-id>
</citation>
</ref>
<ref id="B63">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Vihinen</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>When a synonymous variant is nonsynonymous</article-title>. <source>Genes (Basel)</source> <volume>13</volume>, <fpage>1485</fpage>. <pub-id pub-id-type="doi">10.3390/genes13081485</pub-id>
</citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Hakonarson</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data</article-title>. <source>Nucleic Acids Res.</source> <volume>38</volume>, <fpage>e164</fpage>. <pub-id pub-id-type="doi">10.1093/nar/gkq603</pub-id>
</citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wehr</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Grotius</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Casadei</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bleckmann</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Bode</surname>
<given-names>S. F. N.</given-names>
</name>
<name>
<surname>Frye</surname>
<given-names>B. C.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>A novel disease-causing synonymous exonic mutation in GATA2 affecting RNA splicing</article-title>. <source>Blood</source> <volume>132</volume>, <fpage>1211</fpage>&#x2013;<lpage>1215</lpage>. <pub-id pub-id-type="doi">10.1182/blood-2018-03-837336</pub-id>
</citation>
</ref>
</ref-list>
</back>
</article>