Abstract
Myosin binding protein-C (MyBP-C) is a sarcomeric protein which regulates the force of contraction in striated muscles. Mutations in the MYBPC family of genes, including slow skeletal (MYBPC1), fast skeletal (MYBPC2) and cardiac (MYBPC3), can result in cardiac and skeletal myopathies. Nonetheless, their evolutionary pattern, pathogenicity and impact on MyBP-C protein structure remain to be elucidated. Therefore, the present study aimed to systematically assess the evolutionarily conserved and epigenetic patterns of MYBPC family mutations. Leveraging a machine learning (ML) approach, the Genome Aggregation Database (gnomAD) provided variants in MYBPC1, MYBPC2, and MYBPC3 genes. This was followed by an analysis with Ensembl’s variant effect predictor (VEP), resulting in the identification of 8,618, 3,871, and 3,071 variants in MYBPC1, MYBPC2, and MYBPC3, respectively. Missense variants comprised 61%–66% of total variants in which the third nucleotide positions in the codons were highly altered. Arginine was the most mutated amino acid, important because most disease-causing mutations in MyBP-C proteins are arginine in origin. Domains C5 and C6 of MyBP-C were found to be hotspots for most mutations in the MyBP-C family of proteins. A high percentage of truncated mutations in cMyBP-C cause cardiomyopathies. Arginine and glutamate were the top hits in fMyBP-C and cMyBP-C, respectively, and tryptophan and tyrosine were the most common among the three paralogs changing to premature stop codons and causing protein truncations at the carboxyl terminus. A heterogeneous epigenetic pattern was identified among the three MYBP-C paralogs. Overall, it was shown that databases using computational approaches can facilitate diagnosis and drug discovery to treat muscle disorders caused by MYBPC mutations.
Introduction
Complex diseases stemming from genetic mutations have become a worldwide concern affecting the quality of life. Detecting such genetic diseases depends, in part, on information from online databases (). Thus, combining such readily accessible data with advanced molecular technologies has helped in identifying various diseases caused by changes in DNA sequences. Indeed, understanding such genetic defects and detecting them at early stages have steadily progressed (; ). No less important is an understanding of conserved elements of the human genome in the context of disease etiology, as well as disease prevention and treatment (). Researchers now use next-generation sequencing (NGS) methods to determine the order of nucleotides in entire genomes or targeted regions of DNA or RNA, leading to the isolation of genetic mutations likely to develop diseases (). Such high-throughput technologies also make it easier to predict the nature of the complex diseases. NGS platforms carry out sequencing of the whole human genome, or a number of small fragments of DNA, at the same time, followed by mapping individual reads to the human reference genome. We can also choose a specific site of interest. Any size of gene can be sequenced to detect the presence of mutations. So far, NGS has successfully identified many disease-causing variants, leading to a better understanding of pathogenic effects and clinical consequences (). Moreover, deep machine learning methods can be developed to predict genotype-phenotype outcomes (; ; ).
We herein focused on a group of myosin binding protein-C (MYBPC) paralogs which, together, constitute an immunoglobulin super-family of intracellular muscle proteins (). MYBPC has three paralogs encoded by unique genes, including slow skeletal (MYBPC1), fast skeletal (MYBPC2), and cardiac (MYBPC3) (). Slow skeletal MyBP-C (sMyBP-C), fast skeletal MyBP-C (fMyBP-C) and cardiac MyBP-C (cMyBP-C) proteins are highly conserved with over 90% homology. They play unique muscle-specific structural and regulatory roles in actomyosin interactions and contractility in striated muscles, including both cardiac and skeletal muscles (). MyBP-C protein is found in the cross-bridge-bearing zone (C region) of A bands in sarcomere of striated muscles (). They provide thick filament stability by interacting with titin and the rod portion of sarcomeric myosin (light meromyosin) through MyBP-C’s C-terminal region (; ). Ablating MYBPC2 () and MYBPC3 () gene expression results in contractile dysfunction, suggesting the key role played by MyBP-C in striated muscles. It is well known that genetic variants in MYBPC genes cause various life-threatening cardiovascular and congenital muscular diseases. For example, mutations in MYBPC3 are linked to hypertrophic cardiomyopathy (HCM) and dilated cardiomyopathy (DCM) (; ; ; ). More than 45% of HCM cases can be attributed to mutations in the MYBPC3 gene (). Strikingly, 70% of genetic variants in MYBPC3 are nonsense mutations, including indels, frameshift, and splice-site mutations, leading to cMyBP-C truncations at the carboxyl terminus (; ). It is, however, unclear why MYBPC3 variants predominantly result in protein truncations and whether any evolutionary reasons behind such preferential variants. In contrast, few variants have been reported in the skeletal paralogs (). However, some recent studies suggest that mutations in MYBPC1 are linked to a congenital disease called distal arthrogryposis (; ; ) and myogenic tremor (). Specifically, infants born with MYBPC1 variants developed with multiple joint contractures congenitally limiting muscular movement and affecting the quality of life (; ). On the other hand, MYBPC2 has also been linked to skeletal muscular disorders like arthrogryposis (). Thus, it is well worth systematically determining the genetic variability among these three genes, such as frequency, a hot spot, differences in codon usage, and degree of pathogenicity.
To date, around 2,000 variants have been reported in the MYBPC3 gene (), but the conserved pattern and biochemical characteristics of these variants have not been systematically reviewed. Therefore, in the present study, we analyzed variants of all three MYBPC gene isoforms for their effects, using the Variant Effect Predictor (VEP) to understand MyBP-C biology and evolutionary pattern (). The Genome Aggregation Database (GnomAD) () was used to collect the up-to-date MYBPC variants. We then performed data mining, queried the database of variants reported in these three gene isoforms, and carried out a comprehensive bioinformatics review of the evolutionary pattern of conserved variants in the MYBPC gene family. Variants and the similarities among them were compared among the three MYBPC paralogs, along with heterogeneous, gene-specific epigenetic patterns.
Materials and Methods
Accessing Variant Database and Data Extraction
Variant data for MYBPC1, MYBPC2, and MYBPC3 genes were directly downloaded from the Genome Aggregation Database (gnomAD) (). This database is open source, and it aggregates and harmonizes exome and genome sequencing data from multiple large-scale sequencing projects. We also used Ensembl’s Variant Effect Predictor (VEP) () to obtain annotations for all gnomAD variants from these three genes with an rsID. The collection of variants was genome-wide, including both coding and noncoding regions. We understand that homopolymeric regions pertaining to mitochondrial DNA (mtDNA) variants have been filtered out of gnomAD data, and we carried out our analyses accordingly.
Variant Identification and Analyses
Analysis was carried out using in-house scripts. The data were first processed by removing any duplicate variant entries. The longest isoforms of MYBPC1 (transcript ID ENST00000361466), MYBPC2 (transcript ID ENST00000357701), and MYBPC3 (transcript ID ENST0000545968) were identified from the resulting files. Variants impacting other genes could then be removed (AC117505.1 residing within MYBPC1; AC020909.1 and SPIB, both downstream of MYBPC2, and FAM71E1 upstream of MYBPC2; MADD downstream of MYBPC3 and SPI1 upstream of MYBPC3) (Supplementary Figure S1). Based on the resultant data, we wanted to discriminate among the variations observed across the three genes. First, we identified the frequency of each variant by category, including synonymous, missense, truncation, frameshift, and non-frameshift, indels and others, such as splice donors and acceptors, loss of start and stop codon, and protein-altering variants. Next, we identified the mutated nucleotides in each variant category, as well as codon position of nucleotides mutated in missense and synonymous variants. We also studied protein variants, classifying the amino acid (aa) variations in protein domains, the information of which was obtained through UniProt. Based on nucleotide data obtained earlier, we also investigated the proneness of certain exons to mutations. Last, with the help of variant consequence predictors, such as SIFT and Polyphen, we identified the distribution of pathogenic variants among the three genes (Supplementary Figure S2). Scripts will be provided upon request to the corresponding author.
Results
Variable Distribution of Genetic Variants in Myosin Binding Protein-C Paralogs
The MyBP-C protein family is a group of thick filament accessory proteins regulating striated muscle structure and function. cMyBP-C differs from sMyBP-C and fMyBP-C proteins by containing a unique C0 domain and 28 aa loops at the C5 domain (Figure 1). sMyBP-C is highly homologous to fMyBP-C, but its expression and functions differ (; ). To determine the conserved pattern in MyBP-C structural biology, we analyzed 8,617, 3,870, and 3,070 variants in MYBPC1, MYBPC2, and MYBPC3 genes, respectively. Variants were collected from GnomAD, and annotations were calculated by VEP. Among coding variants across the three paralogs, VEP analysis revealed missense variants to be the most predominant (61%–66%), followed by synonymous variants (∼30%), frameshift and truncation variants (∼3%), and then in-frame indels and splice-site variants (2%–3%) (Figure 2). While MYBPC3 had the highest number of coding variants (Supplementary Figure S2), MYBPC1 had the highest number of variants with intronic variants, making up 90% of variants in MYBPC1 as compared to 75% in MYBPC2 and 62% in MYBPC3 (data not shown). Next, we analyzed the domain-wise frequency in all MyBP-C proteins (Figure 3). Interestingly, the C5 domain proved to be the most prone to mutations among all three proteins, despite not being the longest domain in sMyBP-C and fMyBP-C (Figures 3A–C).
FIGURE 1
FIGURE 2
FIGURE 3
Paralog-Specific Alterations in Amino Acids
Next, variants were categorized into missense, frameshift, and truncation mutants based on variations in their amino acids. Our analyses revealed that Glu > Lys (E > K) and Ala > Thr (A > T) were the most frequent missense amino acid substitutions across the MyBP-C proteins and that Ile > Val (I > V) was the most frequent in sMyBP-C (Figures 4A–C). Interestingly, among the ten most frequent amino acid substitutions among the three paralogs, there lacks a mutation of the commonly post-translationally modifiable residues (Lys, Ser, Thr, Tyr), except for Arg, however we do commonly observe modifiable amino acids occurring in these proteins as a result of mutations. This feature could be explored as a potential therapeutic target since post-translational modifications are known to frequently activate or de-activate proteins.
FIGURE 4
Among missense variants, Cys, Phe, His, and Trp were the least mutated amino acids, while, again, Arg and Val were the most frequently mutated amino acids across all MYBPC genes (Figures 5A–C). sMyBP-C was shown to have other frequently mutated amino acids, including Ile, Gly, Val, Asp, Ala, and Glu, while fMyBP-C had frequent mutations in Pro, Glu and Val, and cMyBP-C showed no affinity towards mutations in any amino acids other than Val and Arg. This could be attributed to an excess number of codons coding for Arg. However, this pattern dramatically changed in frameshift variants with most mutations impacting Thr (sMyBP-C), Ile (fMyBP-C), and Pro (cMyBP-C), respectively (Figures 6A–C). Arg was largely unaffected by frameshift mutations.
FIGURE 5
FIGURE 6
Another important category of mutation includes truncation variants since they might not include regulatory or functional domains in the translated protein. In the MyBP-C family, not surprisingly, we mostly observe Trp and Tyr mutations leading to the introduction of a premature stop codon (Figures 7A–C). In sMyBP-C, however, Arg variants leading to truncated variants are as common as Trp and Tyr (Figure 7A). Glu mutations leading to truncation could be observed in both fMyBP-C and cMyBP-C (Figure 7).
FIGURE 7
Variant Distribution Across Exons and Domains in the Paralogs
Next, we applied filters to the VEP files in order to categorize variants as “likely pathogenic” and analyzed which domains and exons were the most susceptible to mutation. Again, all three paralogs showed very heterogeneous distribution in the frequency of pathogenic variants (Figures 8D–F). However, while mutations in the C10 domain caused pathogenic variants in fMyBP-C, the C10 domain of cMyBP-C was the least mutated. Instead, C6 was the most mutated domain. Heterogeneous distribution characterized sMyBP-C in which domains comprising the N terminal of the protein were found to be the least mutated. Next, we investigated which exons contained the most pathogenic variants. Here, although MYBPC1 showed very mixed distribution, exons 21 and 29 contained most pathogenic variants. Exons 8, 10, 26, and 27 contained the most pathogenic variants for MYBPC2, whereas exon 25 was clearly the most pathogenic variant-containing exon for MYBPC3, followed by exons 2 and 29. (Figures 8A–C). Distal Arthrogryposis has been attributed to pathogenic variants in MYBPC1, and MYBPC3 is known to cause a series of cardiomyopathies, including HCM, DCM and congenital heart defects. Mutations in MYBPC2 were not very well annotated in terms specific diseases, but a few variants were linked to cognitive dysfunction, according to the VEP files.
FIGURE 8
Discussion
MYBPC paralogs play a major role in striated muscle contraction. Increasing evidence suggests that genetic alterations in MYBPC paralogs are directly linked to myopathies. However, no systematic analyses have been carried out to determine nucleotide pattern, codon, and amino acid changes in existing genetic mutations among these three proteins. We wanted to understand the patterns of genetic variants arising from evolutionarily conserved amino acids in MyBP-C structural biology. To this end, we analyzed around 3,000 variants in each paralog obtained from GnomAD and calculated the annotations using the variant effect predictor (VEP). The collected data were analyzed by comparing all three paralogs. Mapping variant frequency in the domains of all MyBP-C proteins revealed a heterogeneous distribution, indicating that all domains in the MyBP-C protein are equally susceptible to mutation. Very few studies have reported on the conserved sequences among the three MYBPC paralogs (; ; ; ). However, in the present study, the conserved pattern of MYBPC mutations also showed a very heterogeneous distribution in all three paralogs. Missense variants predominated with Ile as the most mutated amino acid in sMyBP-C, Val in fMBP-C and Arg in cMyBP-C.
() reported a high level of conserved sequences. The M-domain, otherwise known as the MyBP-C motif, contains a unique set of 100 amino acids at the N terminus between domains C1 and C2. This region is essential for actomyosin interactions. The M-domain binds myosin S2, as well as actin, to regulate cross-bridge formation during contraction and relaxation. Upon phosphorylation by kinases like PKA, the bond between M-domain and S2, or actin, is broken, allowing cross-bridge formation (; ; ; ). Many regions of the M domain are highly conserved, including 293–300 and 331–353 in humans, which may, or may not, carry functional importance. However, some regions of MyBP-C are not well conserved and are unique to the cardiac paralog. The cardiac cMyBP-C isoforms contain an additional ∼100 amino acid domain at the extreme N terminus called the C0 domain which is absent in the slow and fast skeletal paralogs (Figure 1). In the same evolutionary study by Shaffer and Gillis, the phylogenetic analysis of MyBP-C sequences revealed MyBP-C paralogs to be monophyletic, while the fast and slow skeletal MyBP-C protein paralogs clustered in a group that deviated from that of cMyBP-C. This indicated that cMyBP-C is the ancestral form of MyBP-C. They also predicted that gene duplication events caused changes in the sequence of cMyBP-C, resulting in the differentiation of slow skeletal from the cardiac paralog. Differences in the sequence of cMyBP-C enable it to carry out its specialized cardiac muscle function ().
Previous alignment studies noted a significant degree of conserved sequences across all three paralogs (). A high number of amino acids were found to be conserved in MyBP-C, depending on the species. Altogether, eight residues in mammalian cMyBP-C switch from nonpolar amino acids to charged amino acids in the other two isoforms (). For example, in human cMyBP-C, as well as all other mammalian cardiac isoforms, Gly-354 (or its equivalent) can be observed, while an Arg residue can be found at the equivalent site in nonmammalian cardiac isoforms, as well as all fast and slow skeletal isoforms. The functional importance of these amino acids is unknown (). This could explain why Arg was the most mutated amino acid in cMyBP-C and the other paralogs.
Life-threatening diseases have been attributed to mutations in the MYBPC paralog proteins. For example, mutations in the MYBPC3 gene are also linked to HCM, DCM and sudden cardiac death. Missense mutations cause stable proteins to incorporate into the sarcomere and lead to various functional defects. However, frameshift mutations result in a prematurely terminated codon in the transcribed mRNA, making C-terminal truncated proteins unable to bind myosin or titin and also leading to functional defects. About 70% of genetic variants in MYBPC3 represent C′’-truncations (). Furthermore, C-terminal truncated proteins have never been detected by immunoblots in cardiac tissue of HCM patients (; ). Importantly, cardiomyocytes from cMyBP-C mutants are markedly decreased from those of the wild-type protein (). Altogether, these studies strongly suggest that mutant proteins and/or mRNAs are unstable and degraded accordingly. Therefore, it was proposed that frameshift and nonsense mutations might lead to cMyBP-C haploinsufficiency (; ). A non-functional or mutant protein incorporating into the sarcomere can cause filament disassembly, altered function, and, finally, HCM- or DCM-like phenotype (). While few studies have reported on MYBPC1 and MYBPC2, many studies have linked MYBPC1 mutations to distal arthrogryposis (). For example, one study found that MYBPC1 mutations W236R and Y856H could cause distal arthrogryposis type 1 (). Another study in a Chinese family found that E359K, R318X, and P319L mutations led to distal arthrogryposis type 2 (). Distal arthrogryposis is a skeletal muscle disorder characterized by joint contractures and deformities on distal body parts immobilizing muscle movements (). Few MYBPC2 mutations have been explained clinically. However, a recent study from our laboratory shows that global knockout of MYBPC2 in mice results in reduced contractility and reduced myofilament calcium sensitivity and hypertrophic response to mechanical overload ().
Some disease-causing founder mutations are specific to ethnicity and limited to a geographic region (). In recent years, MYBPC3 has gained significant interest owing to its role in the regulation of contractility in the sarcomere machinery. MYBPC3 is known to regulate contraction upon its phosphorylation by various kinases (). Studies have shown that C′-truncation of MYBPC3 causes cMyBP-C null and DCM in mice at the age of 3 months (), as well as significant epigenetic changes (), indicating the importance of normal cMyBP-C for regular cardiac function (). In comparison, much less is known about the two skeletal isoforms of MyBP-C ().
Across the three genes, we observed the highest prevalence of coding variants and pathogenic coding variants in MYBPC3. While missense variants constitute most of the coding variants, a considerable prevalence of loss-of-function variants can be seen in the three genes represented by frameshift and truncation variants (Figure 2). From an evolutionary perspective, we also noted in all genes that the C5 domain is highly prone to variants, which could, therefore, be a potential therapeutic target in disease conditions. In MYBPC3, the cardiac specific N-terminal domain is also highly prone to variations, thus possibly crucial in the treatment of HCM (Figure 3). A stark change in the polarity of amino acids is observed in missense mutations, potentially altering protein binding and/or key post-translational modifications in MyBP-C proteins, thereby leading to a disease phenotype (Figure 4).
In our analyses, the most dominant variant among the MyBP-C paralogs was missense variants, followed by frameshift variants. For instance, a missense variant, like R403Q or R663H, ablates the binding of myosin with the C0-C7 fragment of cMyBP-C protein and causes hypercontractility (). The N-terminal of cMyBP-C regulates contractility within the sarcomere machinery. Mutations in the domains comprising the N-terminal often lead to cardiac dysfunction. Our analysis revealed the C6 domain of cMyBP-C to have the most pathogenic variants, followed by C0 and C1. Nearly half of these variants localize to the C0-C4, comprising the N terminal of the protein. Mutations in the N -terminal can lead to either reduced or increased binding with the myosin region, depending on the mutation. This explains the loss- or gain-of-function in the case of mutations leading to cardiac dysfunction. As mentioned previously, MyBP-C has many conserved domains, and, over time, mutations can negatively impact contractile function. In our study, Glu > Lys (E/K) was most frequent amino acid substitution found in all MyBP-C paralogs. A switch from a negatively charged amino acid to a positively charged amino acid can lead to loss of binding to the neighboring amino acids in a protein and ultimately to reduced or increased function in the sarcomere. Arg was the most mutated amino acid across all three paralogs of MyBP-C missense variants. However, Arg was also the least susceptible to frameshift mutation, suggesting that the nucleotides coding for Arg do not often change the entire sequence of amino acids in the protein, another avenue for exploration in further studies. Among the pathogenic variants, exon 25 and 29 of MYBPC1 and MYBPC3 were the most likely to be mutated, suggesting the likelihood that these positions can destabilize the structure in a manner sufficient to alter the protein’s functionality. However, for MYBPC2, mutations in the exons 8, 10, and 27 were most likely associate with distal arthrogryposis ().
In conclusion, our study demonstrates the evolutionary pattern of conserved variants in the MyBP-C family of proteins, potentially leading to complex genetic diseases. Overall, the results of our assessment can be used for genetic mapping and identifying genetic variants in individuals with a history of such mutations for the purpose of clinical diagnosis and prognosis.
Limitations of the Study
In this study, only the variants available on the gnomAD database were analyzed. Other variants may be present in other databases for MYBPC genes. Since the data in gnomAD represent aggregate data, phenotype and other individual-level data are not available. Follow-up studies can be undertaken using data from biobanks such as the UK Biobank. Additionally, gnomAD has an over-representation of data from European populations compared to participants from other communities (e.g., Middle Eastern, and Oceanian populations. Despite rigorous quality control, gnomAD may also contain sequencing and annotation artifacts (). We used only one effect predictor, namely VEP, to annotate the variants. Current variant annotation tools, including VEP, annotate each variant independently and do not consider the potential compound effects of combining alternate alleles. In other words, two or more variants affecting the same codon are not considered when annotating. While VEP, or similar predictors, can predict the functional effects of genomic variants, without validation studies, the predicted deleterious variants cannot be claimed as absolutely pathogenic or cause a definite phenotype. With subjects’ samples (control and case datasets) underlying mechanisms of pathogenesis caused by these variants can be deduced.
Statements
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.
Author contributions
Conceptualization, DD, AJ, PD, and SS; methodology, DD, VR, and AJ; investigation, DD, VR, AJ, PD, and SS; data curation, DD, VR, AJ, PD, and SS; writing—original draft preparation, DD; writing—review and editing, DD, VR, AJ, PD, and SS; visualization, SS; supervision, AJ, PD, and SS; funding acquisition, SS. All authors have read and agreed to the published version of the manuscript.
Funding
DD was supported by an American Heart Association Predoctoral Fellowship (20PRE35120272). SS has received support from National Institutes of Health grants (R01 HL130356, R01 HL105826, R01 AR078001, and R01 HL143490), American Heart Association, Institutional Undergraduate Student (19UFEL34380251), Transformation (19TPA34830084) awards, the PLN Foundation (PLN crazy idea) awards, as well as Novo Nordisk, AstraZeneca, MyoKardia, Merck and Amgen. PD is supported by the Wellcome Trust-Indian Alliance (IA/I/16/1/502367), Rajiv Gandhi University of Health Sciences (RGUHS), Scientist Development Grant (15SDG23250005) from the American Heart Association (AHA), Department of Science and Technology (DST/CRG/2019/005401) and inStem core funding. VR is supported by ICMR-SRF (3/1/1 (8)/CVD/2020-NCD-1).
Conflict of interest
SS provided consulting and collaborative research studies to the Leducq Foundation (CURE-PLAN), Red Saree Inc., Greater Cincinnati Tamil Sangam, AavantiBio, Pfizer, Novo Nordisk, AstraZeneca, MyoKardia, Merck and Amgen. AJ serves as a member of Scientific Advisory Board of GEn1E Lifesciences, Palo Alto, CA, United States.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.896117/full#supplementary-material
Glossary
- cMyBP-C
Cardiac myosin binding protein-C protein
- fMyBP-C
Fast skeletal myosin binding protein-C protein
- sMyBP-C
Slow skeletal myosin binding protein-C protein
- MyBP-C
Myosin binding protein-C protein
- MYBPC1
Slow skeletal myosin binding protein-C gene
- MYBPC2
Fast skeletal myosin binding protein-C gene
- MYBPC3
Cardiac myosin binding protein-C gene
- Ala (A)
Alanine
- Arg (R)
Arginine
- Asn (N)
Asparagine
- Asp (D)
Aspartic acid
- Cys (C)
Cysteine
- DCM
Dilated cardiomyopathy
- Gln (Q)
Glutamine
- Glu (E)
Glutamic acid
- Gly (G)
Glycine
- His (H)
Histidine
- HCM
Hypertrophic cardiomyopathy
- Ile (I)
Isoleucine
- Leu (L)
Leucine
- Lys (K)
Lysine
- Met (M)
Methionine
- Phe (F)
Phenylalanine
- Pro (P)
Proline
- Pyl (O)
Pyrrolysine
- Ser (S)
Serine
- Thr (T)
Threonine
- Trp (W)
Tryptophan
- Tyr (Y)
Tyrosine
- Val (V)
Valine
- SNP
Single nucleotide polymorphism
- NGS
Next-generation sequencing
References
1
Al-NumairN. S.LopesL.SyrrisP.MonserratL.ElliottP.MartinA. C. R. (2016). The Structural Effects of Mutations Can Aid in Differential Phenotype Prediction of Beta-Myosin Heavy Chain (Myosin-7) Missense Variants. Bioinformatics32 (19), 2947–2955. 10.1093/bioinformatics/btw362
2
BamshadM.Van HeestA. E.PleasureD. (2009). Arthrogryposis: a Review and Update. J. Bone Jt. Surg. Am.91 (Suppl. 4), 40–46. 10.2106/JBJS.I.00281
3
BarefieldD.SadayappanS. (2010). Phosphorylation and Function of Cardiac Myosin Binding Protein-C in Health and Disease. J. Mol. Cell. Cardiol.48 (5), 866–875. 10.1016/j.yjmcc.2009.11.014
4
BayramY.KaracaE.Coban AkdemirZ.YilmazE. O.TayfunG. A.AydinH.et al (2016). Molecular Etiology of Arthrogryposis in Multiple Families of Mostly Turkish Origin. J. Clin. Investigation126 (2), 762–778. 10.1172/Jci84457
5
BonneG.CarrierL.BercoviciJ.CruaudC.RichardP.HainqueB.et al (1995). Cardiac Myosin Binding Protein-C Gene Splice Acceptor Site Mutation Is Associated with Familial Hypertrophic Cardiomyopathy. Nat. Genet.11 (4), 438–440. 10.1038/ng1295-438
6
de MarvaoA.McGurkK. A.ZhengS. L.ThanajM.BaiW.DuanJ.et al (2021). Phenotypic Expression and Outcomes in Individuals with Rare Genetic Variants of Hypertrophic Cardiomyopathy. J. Am. Coll. Cardiol.78 (11), 1097–1110. 10.1016/j.jacc.2021.07.017
7
DesaiD.StieneD.SongT.SadayappanS. (2020). Distal Arthrogryposis and Lethal Congenital Contracture Syndrome - an Overview. Front. Physiol.11, 689. 10.3389/fphys.2020.00689
8
DhandapanyP. S.SadayappanS.XueY.PowellG. T.RaniD. S.NallariP.et al (2009). A Common MYBPC3 (Cardiac Myosin Binding Protein C) Variant Associated with Cardiomyopathies in South Asia. Nat. Genet.41 (2), 187–191. 10.1038/ng.309
9
FlashmanE.RedwoodC.Moolman-SmookJ.WatkinsH. (2004). Cardiac Myosin Binding Protein-C its Role in Physiology and Disease. Circulation Res.94 (10), 1279–1289. 10.1161/01.RES.0000127175.21818.C2
10
GaganJ.Van AllenE. M. (2015). Next-generation Sequencing to Guide Cancer Therapy. Genome Med.7 (1), 80. 10.1186/s13073-015-0203-x
11
Geist HausermanJ.StavusisJ.JocaH. C.RobinettJ. C.HanftL.VandermeulenJ.et al (2021). Sarcomeric Deficits Underlie MYBPC1-Associated Myopathy with Myogenic Tremor. JCI Insight6 (19). e147612. 10.1172/jci.insight.147612
12
GruenM.PrinzH.GautelM. (1999). cAPK-phosphorylation Controls the Interaction of the Regulatory Domain of Cardiac Myosin Binding Protein C with Myosin-S2 in an On-Off Fashion. FEBS Lett.453 (3), 254–259. 10.1016/s0014-5793(99)00727-9
13
GudmundssonS.Singer‐BerkM.WattsN. A.PhuW.GoodrichJ. K.SolomonsonM.et al (2021). Variant Interpretation Using Population Databases: Lessons from gnomAD. Hum. Mutat.1. 1. 10.1002/humu.24309
14
GurnettC. A.DesruisseauD. M.McCallK.ChoiR.MeyerZ. I.TalericoM.et al (2010). Myosin Binding Protein C1: a Novel Gene for Autosomal Dominant Distal Arthrogryposis Type 1. Hum. Mol. Genet.19 (7), 1165–1173. 10.1093/hmg/ddp587
15
HarrisS. P.BartleyC. R.HackerT. A.McDonaldK. S.DouglasP. S.GreaserM. L.et al (2002). Hypertrophic Cardiomyopathy in Cardiac Myosin Binding Protein-C Knockout Mice. Circulation Res.90 (5), 594–601. 10.1161/01.res.0000012222.70819.64
16
HarrisS. P.LyonsR. G.BezoldK. L. (2011). In the Thick of it: HCM-Causing Mutations in Myosin Binding Proteins of the Thick Filament. Circ. Res.108 (6), 751–764. 10.1161/CIRCRESAHA.110.231670
17
HelmsA. S.ThompsonA. D.GlazierA. A.HafeezN.KabaniS.RodriguezJ.et al (2020). Spatial and Functional Distribution of MYBPC3 Pathogenic Variants and Clinical Outcomes in Patients with Hypertrophic Cardiomyopathy. Circ Genomic Precis. Med.13 (5), 396–405. 10.1161/CIRCGEN.120.002929
18
JiangJ.BurgonP. G.WakimotoH.OnoueK.GorhamJ. M.O’MearaC. C.et al (2015). Cardiac Myosin Binding Protein C Regulates Postnatal Myocyte Cytokinesis. Proc. Natl. Acad. Sci. U.S.A.112 (29), 9046–9051. 10.1073/pnas.1511004112
19
KarczewskiK. J.FrancioliL. C.TiaoG.CummingsB. B.AlföldiJ.WangQ.et al (2020). The Mutational Constraint Spectrum Quantified from Variation in 141,456 Humans. Nature581 (7809), 434–443. 10.1038/s41586-020-2308-7
20
KorteF. S.McDonaldK. S.HarrisS. P.MossR. L. (2003). Loaded Shortening, Power Output, and Rate of Force Redevelopment Are Increased with Knockout of Cardiac Myosin Binding Protein-C. Circulation Res.93 (8), 752–758. 10.1161/01.RES.0000096363.85588.9A
21
LiX.ZhongB.HanW.ZhaoN.LiuW.SuiY.et al (2015). Two Novel Mutations in Myosin Binding Protein C Slow Causing Distal Arthrogryposis Type 2 in Two Large Han Chinese Families May Suggest Important Functional Role of Immunoglobulin Domain C2. Plos One10 (5), e0117158. 10.1371/journal.pone.0117158
22
LinB.GovindanS.LeeK.ZhaoP.HanR.RunteK. E.et al (2013). Cardiac Myosin Binding Protein-C Plays No Regulatory Role in Skeletal Muscle Structure and Function. PloS One8 (7), e69671. 10.1371/journal.pone.0069671
23
LynchT. L.SivaguruM.VelayuthamM.CardounelA. J.MichelsM.BarefieldD.et al (2015). Oxidative Stress in Dilated Cardiomyopathy Caused byMYBPC3Mutation. Oxidative Med. Cell. Longev.2015, 1–14. 10.1155/2015/424751
24
MarkusB.NarkisG.LandauD.BirkR. Z.CohenI.BirkO. S. (2012). Autosomal Recessive Lethal Congenital Contractural Syndrome Type 4 (LCCS4) Caused by a Mutation in MYBPC1. Hum. Mutat.33 (10), 1435–1438. 10.1002/humu.22122
25
MarstonS.CopelandO. N.JacquesA.LiveseyK.TsangV.McKennaW. J.et al (2009). Evidence from Human Myectomy Samples that MYBPC3 Mutations Cause Hypertrophic Cardiomyopathy through Haploinsufficiency. Circulation Res.105 (3), 219–222. 10.1161/Circresaha.109.202440
26
McConnellB. K.JonesK. A.FatkinD.ArroyoL. H.LeeR. T.AristizabalO.et al (1999). Dilated Cardiomyopathy in Homozygous Myosin-Binding Protein-C Mutant Mice. J. Clin. Invest.104 (9), 1235–1244. 10.1172/JCI7377
27
McLarenW.GilL.HuntS. E.RiatH. S.RitchieG. R. S.ThormannA.et al (2016). The Ensembl Variant Effect Predictor. Genome Biol.17 (1), 122. 10.1186/s13059-016-0974-4
28
McNamaraJ. W.SadayappanS. (2018). Skeletal Myosin Binding Protein-C: An Increasingly Important Regulator of Striated Muscle Physiology. Archives Biochem. Biophysics660, 121–128. 10.1016/j.abb.2018.10.007
29
MéndezI.FernándezA. I.EspinosaM. Á.CuencaS.LorcaR.RodríguezJ. F.et al (2021). Founder Mutation in Myosin-Binding Protein C with an Early Onset and a High Penetrance in Males. Open Heart8 (2), e001789. 10.1136/openhrt-2021-001789
30
MooneyS. D.KrishnanV. G.EvaniU. S. (2010). Bioinformatic Tools for Identifying Disease Gene and SNP Candidates. Methods Mol. Biol.628, 307–319. 10.1007/978-1-60327-367-1_17
31
OfferG.MoosC.StarrR. (1973). A New Protein of the Thick Filaments of Vertebrate Skeletal Myofibrils. Extractions, Purification and Characterization. J. Mol. Biol.74 (4), 653–676. 10.1016/0022-2836(73)90055-7
32
OkagakiT.WeberF. E.FischmanD. A.VaughanK. T.MikawaT.ReinachF. C. (1993). The Major Myosin-Binding Domain of Skeletal Muscle MyBP-C (C Protein) Resides in the COOH-Terminal, Immunoglobulin C2 Motif. J. Cell Biol.123 (3), 619–626. 10.1083/jcb.123.3.619
33
RichardP.FressartV.CharronP.HainqueB. (2010). Génétique des cardiomyopathies héréditaires. Pathol. Biol.58 (5), 343–352. 10.1016/j.patbio.2009.10.010
34
SadayappanS.de TombeP. P. (2012). Cardiac Myosin Binding Protein-C: Redefining its Structure and Function. Biophys. Rev.4 (2), 93–106. 10.1007/s12551-012-0067-x
35
SarkarS. S.TrivediD. V.MorckM. M.AdhikariA. S.PashaS. N.RuppelK. M.et al (2020). The Hypertrophic Cardiomyopathy Mutations R403Q and R663H Increase the Number of Myosin Heads Available to Interact with Actin. Sci. Adv.6 (14), eaax0069. 10.1126/sciadv.aax0069
36
SchlossarekS.SchuermannF.GeertzB.MeariniG.EschenhagenT.CarrierL. (2012). Adrenergic Stress Reveals Septal Hypertrophy and Proteasome Impairment in Heterozygous Mybpc3-Targeted Knock-In Mice. J. Muscle Res. Cell Motil.33 (1), 5–15. 10.1007/s10974-011-9273-6
37
SchmittM. W.KennedyS. R.SalkJ. J.FoxE. J.HiattJ. B.LoebL. A. (2012). Detection of Ultra-rare Mutations by Next-Generation Sequencing. Proc. Natl. Acad. Sci. U.S.A.109 (36), 14508–14513. 10.1073/pnas.1208715109
38
ShafferJ. F.GillisT. E. (2010). Evolution of the Regulatory Control of Vertebrate Striated Muscle: the Roles of Troponin I and Myosin Binding Protein-C. Physiol. Genomics42 (3), 406–419. 10.1152/physiolgenomics.00055.2010
39
ShafferJ. F.KenslerR. W.HarrisS. P. (2009). The Myosin-Binding Protein C Motif Binds to F-Actin in a Phosphorylation-Sensitive Manner. J. Biol. Chem.284 (18), 12318–12327. 10.1074/jbc.M808850200
40
SongT.McNamaraJ. W.MaW.Landim-VieiraM.LeeK. H.MartinL. A.et al (2021). Fast Skeletal Myosin-Binding Protein-C Regulates Fast Skeletal Muscle Contraction. Proc. Natl. Acad. Sci. U.S.A.118 (17), e2003596118. 10.1073/pnas.2003596118
41
SpiritoP.SeidmanC. E.McKennaW. J.MaronB. J. (1997). The Management of Hypertrophic Cardiomyopathy. N. Engl. J. Med.336 (11), 775–785. 10.1056/NEJM199703133361107
42
StavusisJ.LaceB.SchäferJ.GeistJ.InashkinaI.KidereD.et al (2019). Novel Mutations in MYBPC1 Are Associated with Myogenic Tremor and Mild Myopathy. Ann. Neurol.86 (1), 129–142. 10.1002/ana.25494
43
StelzerJ. E.PatelJ. R.WalkerJ. W.MossR. L. (2007). Differential Roles of Cardiac Myosin-Binding Protein C and Cardiac Troponin I in the Myofibrillar Force Responses to Protein Kinase A Phosphorylation. Circulation Res.101 (5), 503–511. 10.1161/CIRCRESAHA.107.153650
44
Suay-CorrederaC.PricoloM. R.Herrero-GalánE.Velázquez-CarrerasD.Sánchez-OrtizD.García-GiustinianiD.et al (2021). Protein Haploinsufficiency Drivers Identify MYBPC3 Variants that Cause Hypertrophic Cardiomyopathy. J. Biol. Chem.297 (1), 100854. 10.1016/j.jbc.2021.100854
45
TabishA. M.ArifM.SongT.ElbeckZ.BeckerR. C.KnöllR.et al (2019). Association of Intronic DNA Methylation and Hydroxymethylation Alterations in the Epigenetic Etiology of Dilated Cardiomyopathy. Am. J. Physiology-Heart Circulatory Physiology317 (1), H168–H180. 10.1152/ajpheart.00758.2018
46
van DijkS. J.DooijesD.dos RemediosC.MichelsM.LamersJ. M. J.WinegradS.et al (2009). Cardiac Myosin-Binding Protein C Mutations and Hypertrophic Cardiomyopathy: Haploinsufficiency, Deranged Phosphorylation, and Cardiomyocyte Dysfunction. Circulation119 (11), 1473–1483. 10.1161/CIRCULATIONAHA.108.838672
47
WangT.SunJ.ZhangX.WangW.-J.ZhouQ. (2021). CNV-P: a Machine-Learning Framework for Predicting High Confident Copy Number Variations. PeerJ9, e12564. 10.7717/peerj.12564
48
WatkinsH.ConnerD.ThierfelderL.JarchoJ. A.MacRaeC.McKennaW. J.et al (1995). Mutations in the Cardiac Myosin Binding Protein-C Gene on Chromosome 11 Cause Familial Hypertrophic Cardiomyopathy. Nat. Genet.11 (4), 434–437. 10.1038/ng1295-434
49
WeberF. E.VaughanK. T.ReinachF. C.FischmanD. A. (1993). Complete Sequence of Human Fast-type and Slow-type Muscle Myosin-Binding-Protein C (MyBP-C). Differential Expression, Conserved Domain Structure and Chromosome Assignment. Eur. J. Biochem.216 (2), 661–669. 10.1111/j.1432-1033.1993.tb18186.x
50
YangQ.SanbeA.OsinskaH.HewettT. E.KlevitskyR.RobbinsJ. (1999). In Vivo modeling of Myosin Binding Protein C Familial Hypertrophic Cardiomyopathy. Circulation Res.85 (9), 841–847. 10.1161/01.res.85.9.841
51
YuX.YaoX.WuB.ZhouH.XiaS.SuW.et al (2021). Using Deep Learning Method to Identify Left Ventricular Hypertrophy on Echocardiography. Int. J. Cardiovasc Imaging38, 759–769. Online ahead of print. 10.1007/s10554-021-02461-3
52
DhootG. K.HalesM. C.GrailB. M.PerryS. V.et al (1985). The Isoforms of C Protein and Their Distribution in Mammalian Skeletal Muscle. J. Muscle Res. Cell Motil.6 (4). 487–505. 10.1007/BF00712585
Summary
Keywords
MYBPC1, MYBPC2, MYBPC3, hypertrophic cardiomyopathy, distal arthrogryposis
Citation
Desai DA, Rao VJ, Jegga AG, Dhandapany PS and Sadayappan S (2022) Heterogeneous Distribution of Genetic Mutations in Myosin Binding Protein-C Paralogs. Front. Genet. 13:896117. doi: 10.3389/fgene.2022.896117
Received
14 March 2022
Accepted
07 June 2022
Published
27 June 2022
Volume
13 - 2022
Edited by
Anastacia M. Garcia, University of Colorado Denver, United States
Reviewed by
Hasan Orhan Akman, Columbia University Irving Medical Center, United States
Charles K. Thodeti, University of Toledo, United States
Narasimman Gurusamy, University of Tennessee Health Science Center (UTHSC), United States
Updates
Copyright
© 2022 Desai, Rao, Jegga, Dhandapany and Sadayappan.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sakthivel Sadayappan, sadayasl@ucmail.uc.edu
† These authors have contributed equally to this work
This article was submitted to Genetics of Common and Rare Diseases, a section of the journal Frontiers in Genetics
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.