<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fgene.2021.650897</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Computational Studies of the Structural Basis of Human RPS19 Mutations Associated With Diamond-Blackfan Anemia</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>An</surname> <given-names>Ke</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1192944/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Zhou</surname> <given-names>Jing-Bo</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1333106/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Xiong</surname> <given-names>Yao</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1334759/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Han</surname> <given-names>Wei</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Wang</surname> <given-names>Tao</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Ye</surname> <given-names>Zhi-Qiang</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/29200/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Wu</surname> <given-names>Yun-Dong</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="corresp" rid="c002"><sup>&#x002A;</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School</institution>, <addr-line>Shenzhen</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>Shenzhen Bay Laboratory</institution>, <addr-line>Shenzhen</addr-line>, <country>China</country></aff>
<aff id="aff3"><sup>3</sup><institution>College of Chemistry and Molecular Engineering, Peking University</institution>, <addr-line>Beijing</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Long Gao, University of Pennsylvania, United States</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Jeff Brender, National Institutes of Health (NIH), United States; Feifei Tao, Eisai, United States; Cai Chen, Merck, United States</p></fn>
<corresp id="c001">&#x002A;Correspondence: Zhi-Qiang Ye, <email>yezq@pku.org.cn</email></corresp>
<corresp id="c002">Yun-Dong Wu, <email>ydwu@pku.edu.cn</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>24</day>
<month>05</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>650897</elocation-id>
<history>
<date date-type="received">
<day>12</day>
<month>01</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>28</day>
<month>04</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2021 An, Zhou, Xiong, Han, Wang, Ye and Wu.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>An, Zhou, Xiong, Han, Wang, Ye and Wu</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Diamond-Blackfan Anemia (DBA) is an inherited rare disease characterized with severe pure red cell aplasia, and it is caused by the defective ribosome biogenesis stemming from the impairment of ribosomal proteins. Among all DBA-associated ribosomal proteins, RPS19 affects most patients and carries most DBA mutations. Revealing how these mutations lead to the impairment of RPS19 is highly demanded for understanding the pathogenesis of DBA, but a systematic study is currently lacking. In this work, based on the complex structure of human ribosome, we comprehensively studied the structural basis of DBA mutations of RPS19 by using computational methods. Main structure elements and five conserved surface patches involved in RPS19-18S rRNA interaction were identified. We further revealed that DBA mutations would destabilize RPS19 through disrupting the hydrophobic core or breaking the helix, or perturb the RPS19-18S rRNA interaction through destroying hydrogen bonds, introducing steric hindrance effect, or altering surface electrostatic property at the interface. Moreover, we trained a machine-learning model to predict the pathogenicity of all possible RPS19 mutations. Our work has laid a foundation for revealing the pathogenesis of DBA from the structural perspective.</p>
</abstract>
<kwd-group>
<kwd>Diamond-Blackfan Anemia</kwd>
<kwd>RPS19</kwd>
<kwd>missense mutation</kwd>
<kwd>structure stability</kwd>
<kwd>interaction</kwd>
<kwd>pathogenesis</kwd>
<kwd>ribosomopathy</kwd>
</kwd-group>
<counts>
<fig-count count="8"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="84"/>
<page-count count="14"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1">
<title>Introduction</title>
<p>Diamond-Blackfan Anemia (DBA, OMIM # 105650) is an inherited rare pure red blood cell aplasia (&#x223C;5 to 7 per million birth) (<xref ref-type="bibr" rid="B74">Vlachos et al., 2001</xref>; <xref ref-type="bibr" rid="B21">Da Costa et al., 2018</xref>) characterized by the failure of erythropoiesis but with normal production of leukocytes and platelets in the bone marrow (<xref ref-type="bibr" rid="B23">Diamond, 1938</xref>; <xref ref-type="bibr" rid="B12">Ball, 2011</xref>; <xref ref-type="bibr" rid="B21">Da Costa et al., 2018</xref>; <xref ref-type="bibr" rid="B26">Engidaye et al., 2019</xref>). It usually presents during the first year of life, and affects the follow-up growth and development, resulting in short stature and congenital abnormalities. An elevated risk of cancer with &#x223C;4.8-fold is observed as well (<xref ref-type="bibr" rid="B75">Vlachos et al., 2018</xref>). Currently, steroids and blood transfusions can keep the disease at bay but with considerable side effects, and the only cure for the bone marrow failure phenotype of DBA is hematopoietic stem cell transplantation, but donors are often unavailable (<xref ref-type="bibr" rid="B26">Engidaye et al., 2019</xref>).</p>
<p>In DBA, almost all linked genetic lesions come from genes encoding ribosomal proteins (RP) (<xref ref-type="bibr" rid="B10">Aspesi and Ellis, 2019</xref>; <xref ref-type="bibr" rid="B27">Farley-Barnes et al., 2019</xref>), whose haploinsufficiency is believed to impair ribosome biogenesis. Ribosomes are the protein translation machines, and are intensively implicated in the processes with high requirement of protein synthesis, such as hematopoiesis and embryonic development (<xref ref-type="bibr" rid="B19">Cmejla et al., 2000</xref>). During the process of promoting erythroid lineage commitment from hematopoietic stem and progenitor cells, the quantity of ribosomes plays a key role (<xref ref-type="bibr" rid="B40">Khajuria et al., 2018</xref>). The impaired ribosome biogenesis will lead to decreased ribosome quantity and thus the failure of erythropoiesis (<xref ref-type="bibr" rid="B26">Engidaye et al., 2019</xref>; <xref ref-type="bibr" rid="B73">Ulirsch et al., 2019</xref>).</p>
<p>DBA is mainly inherited in an autosomal dominant manner and caused by loss-of-function mutations (<xref ref-type="bibr" rid="B26">Engidaye et al., 2019</xref>). All DBA-related RP gene mutations identified to date are heterozygous (<xref ref-type="bibr" rid="B21">Da Costa et al., 2018</xref>), indicating that homozygous RP gene mutations are lethal. The homozygous lethality of RP genes has been supported in several animal models including zebrafishes and mice (<xref ref-type="bibr" rid="B4">Amsterdam et al., 2004</xref>; <xref ref-type="bibr" rid="B53">Matsson et al., 2004</xref>).</p>
<p>Among all the RP-coding genes, <italic>RPS19</italic> have affected the most majority of DBA patients (&#x223C;25%) (<xref ref-type="bibr" rid="B73">Ulirsch et al., 2019</xref>). As an indispensable component of the ribosome small subunit (SSU or 40S subunit), RPS19 interacts with the 18S rRNA and other RPs in the mature SSU, contributing to the assembling and stability maintenance of SSU (<xref ref-type="bibr" rid="B30">Gregory et al., 2007</xref>; <xref ref-type="bibr" rid="B3">Ameismeier et al., 2018</xref>). During the biogenesis of SSU, RPS19 also plays essential roles in pre-rRNA processing, exportation of SSU precursors from nucleus to cytoplasm, and conformation maturation of 18S rRNA (<xref ref-type="bibr" rid="B47">Leger-Silvestre et al., 2005</xref>; <xref ref-type="bibr" rid="B28">Flygare et al., 2007</xref>; <xref ref-type="bibr" rid="B34">Juli et al., 2016</xref>; <xref ref-type="bibr" rid="B25">Duss et al., 2019</xref>).</p>
<p>With the continuous efforts that have been put in genetic screening and clinical studies of DBA, a considerable number of pathogenic RPS19 mutations have been cataloged (<xref ref-type="bibr" rid="B14">Boria et al., 2010</xref>; <xref ref-type="bibr" rid="B40">Khajuria et al., 2018</xref>), accounting for the most majority of DBA mutations (<xref ref-type="bibr" rid="B73">Ulirsch et al., 2019</xref>). In order to develop better diagnostics and treatment, it is swiftly becoming necessary to reveal the molecular basis of the pathogenic mutations (<xref ref-type="bibr" rid="B39">Kato et al., 2003</xref>; <xref ref-type="bibr" rid="B2">Alexov and Sternberg, 2013</xref>; <xref ref-type="bibr" rid="B71">Stefl et al., 2013</xref>). While it is usually straightforward to interpret how frameshift and splice mutations result in defected RPS19 and thereby its haploinsufficiency, it is more challenging to decipher missense pathogenic mutations of RPS19.</p>
<p>Over the past two decades, the consequences of a few missense mutations on the <italic>RPS19</italic> gene have been studied experimentally (<xref ref-type="bibr" rid="B20">Cmejla and Cmejlova, 2003</xref>; <xref ref-type="bibr" rid="B22">Da Costa et al., 2003</xref>; <xref ref-type="bibr" rid="B29">Gazda et al., 2004</xref>; <xref ref-type="bibr" rid="B6">Angelini et al., 2007</xref>). <xref ref-type="bibr" rid="B30">Gregory et al. (2007)</xref> solved the crystal structure of an archaea RPS19 (from <italic>P. abyssi</italic>), which shares 36% sequence identity with human RPS19, and then analyzed 16 missense DBA mutations based on it. These mutations were classified according to the impact on protein folding or on surface properties, and this binary classification has been applied to interpret pathogenic mutations in others&#x2019; work (<xref ref-type="bibr" rid="B6">Angelini et al., 2007</xref>; <xref ref-type="bibr" rid="B16">Campagnoli et al., 2008</xref>). However, the indirect conclusion deduced from the archaea RPS19 was not applicable to mutations occurring at the sites that are not conserved between human and <italic>P. abyssi</italic>, and the interaction details between RPS19 and other molecules were yet unavailable due to the lack of complex structures accordingly. Thanks to the recent development of Cryo-electron microscopy (Cryo-EM), several human ribosome structures have been determined and have thus created opportunities of further direct analyses (<xref ref-type="bibr" rid="B7">Anger et al., 2013</xref>; <xref ref-type="bibr" rid="B13">Behrmann et al., 2015</xref>; <xref ref-type="bibr" rid="B41">Khatter et al., 2015</xref>; <xref ref-type="bibr" rid="B63">Quade et al., 2015</xref>; <xref ref-type="bibr" rid="B54">Myasnikov et al., 2016</xref>; <xref ref-type="bibr" rid="B82">Zhang et al., 2016</xref>; <xref ref-type="bibr" rid="B55">Natchiar et al., 2017</xref>; <xref ref-type="bibr" rid="B76">Weisser et al., 2017</xref>; <xref ref-type="bibr" rid="B3">Ameismeier et al., 2018</xref>). A recent study briefly described that DBA missense mutations appear to predominantly disrupt the stability of RPS19 by altering the hydrophobic core or to perturb interactions with rRNA in assembled ribosomes (<xref ref-type="bibr" rid="B73">Ulirsch et al., 2019</xref>), but the specific approaches through which the mutations affect RPS19 remain unclear. Till now, an in-depth and systematic understanding of all available DBA missense mutations is still lacking, and a method for inferring the pathogenicity of newly identified RPS19 mutations is also highly required.</p>
<p>In this work, we conducted a comprehensive study on the structural basis of the DBA mutations from human RPS19 based on its 3D structures. First, starting with the human RPS19 structure extracted from the ribosome complex, we identified its main structure elements: the hydrophobic core with a bundle of five helices, two &#x03B2;-hairpins, and three putative intrinsic disordered regions (IDR). Second, we revealed that RPS19 interacts with 18S rRNA in the mature SSU through five conserved surface patches. Third, we identified several specific approaches through which DBA mutations destabilize the protein structure or affect interactions, and thereby lead to defected RPS19. Last, based on the understandings of the structural basis of DBA mutations, we trained a support vector machine (SVM) model to predict the pathogenicity of all possible missense mutations of RPS19, which will be valuable in future studies when new mutations are identified.</p>
</sec>
<sec id="S2">
<title>Results</title>
<sec id="S2.SS1">
<title>Analyses of Human RPS19 Structure</title>
<p>An in-depth understanding of structure is the basis of studying mutations in a structural context, so we investigated the structure of human RPS19 in detail. Its structures both in free state and in the SSU complex were inspected in order to obtain an accurate description of its secondary structure, hydrophobic core, and interaction with other molecules.</p>
<sec id="S2.SS1.SSS1">
<title>Structure Elements of Human RPS19</title>
<p>The recent determined Cryo-EM structure of human ribosome can be used as a starting point to assign the secondary structure of RPS19 (<xref ref-type="bibr" rid="B3">Ameismeier et al., 2018</xref>). Considering that RPS19 is packed in the ribosome complex enriched with negatively charged rRNAs, its conformation may be skewed from the free state. To obtain its conformation in free state, we have thereafter performed a molecular dynamics (MD) simulation of human RPS19 without the presence of 18S rRNA and other proteins. The follow-up conformation clustering resulted in seven clusters, where the largest cluster represents 70.1% of all conformations sampled in the whole simulation. The representative conformation of the largest cluster was adopted for the assignment of secondary structure. Our assignment confirms the existence of five helices (h1&#x2013;h5) in free state but with boundary shifts of 3&#x2013;4 residues compared with those inferred based on the archaea RPS19 structure previously (<xref ref-type="bibr" rid="B30">Gregory et al., 2007</xref>; <xref ref-type="fig" rid="F1">Figure 1</xref>). Similar to the archaea RPS19, these five helices fold into a bundle and constitute the hydrophobic core (<xref ref-type="supplementary-material" rid="FS1">Supplementary Figure 1A</xref>). The comparison of RPS19 between in free state and packed in ribosome shows that the helix regions (defined according to the free state conformation) are more stretched (<xref ref-type="supplementary-material" rid="FS2">Supplementary Figure 2</xref>) and less regular in the packed state (<xref ref-type="fig" rid="F1">Figure 1</xref>), indicating a higher winded level that may stem from the packing in the ribosome. Two &#x03B2;-hairpins can also be observed (<xref ref-type="fig" rid="F1">Figure 1</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>The plots of RMSF, structure elements, IDRs, and mutations of human RPS19. The putative IDRs (RMSF &#x003E; 2 &#x00C5;) are marked by red lines. The secondary structures are assigned based on the human RPS19 structure in free state, packed state, and inferred from <italic>P. abyssi</italic> RPS19, respectively (red rectangle: helix; yellow arrow: &#x03B2;-strands; gap in &#x201C;inferred:&#x201D; putative IDR inferred from <italic>P. abyssi</italic>). The DBA and neutral mutations are indicated with small upward arrows in red and green, respectively.</p></caption>
<graphic xlink:href="fgene-12-650897-g001.tif"/>
</fig>
<p>Two IDRs have been roughly proposed based on the archaea RPS19 structure previously (<xref ref-type="bibr" rid="B30">Gregory et al., 2007</xref>; <xref ref-type="fig" rid="F1">Figure 1</xref>). Here we superposed the 7 representative conformations of the conformation clusters, and observed that two long regions (between h1 and h2, and between h3 and h4) display much higher flexibility than the helices (<xref ref-type="supplementary-material" rid="FS1">Supplementary Figure 1B</xref>). The RMSF plot also shows large fluctuations in these two regions (<xref ref-type="fig" rid="F1">Figure 1</xref>). These observations are consistent with the existence of the two putative IDRs mentioned above, but they are much larger than previously inferred (<xref ref-type="bibr" rid="B30">Gregory et al., 2007</xref>). A third small IDR was also observed from the RMSF plot which was between h4 and h5. In accordance with our knowledge about the IDR&#x2019;s preference for interaction, these IDRs bind 18S rRNA and other proteins in the assembled ribosome (see below). Notably, the IDR between h3 and h4 and the IDR between h4 and h5 overlap with the &#x03B2;-hairpins identified in the representative conformation, possibly suggesting that these &#x03B2;-hairpins may have much flexibility as a whole.</p>
</sec>
<sec id="S2.SS1.SSS2">
<title>Analyses of Interactions in Packed State</title>
<p>Through analyzing the interaction interface of RPS19 in the complex of ribosome SSU, we found that RPS19 mainly interacts with 18S rRNA, RPS16, and RPS18 (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>). The largest interaction interface is found between RPS19 and 18S rRNA, and it consists of 75 residues of RPS19 and 61 nucleotides of 18S rRNA, accounting for 51.7% of RPS19 and 3.7% of 18S rRNA, respectively (<xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>). According to the definition of the secondary structure of human 18S rRNA (<xref ref-type="bibr" rid="B59">Petrov et al., 2014</xref>), five elements are involved in the interaction interface (<xref ref-type="table" rid="T1">Table 1</xref>). The interface is stabilized by 46 inter-molecule hydrogen bonds, which are enriched at the h41es10 and h41 of 18S rRNA. As for the interaction interfaces formed with RPS16 and RPS18, the areas are much smaller as shown in <xref ref-type="supplementary-material" rid="TS1">Supplementary Table 1</xref>.</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>The detailed interactions between 18S rRNA and RPS19.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Interacting elements of 18S rRNA</td>
<td valign="top" align="left">Interacting residues of RPS19</td>
<td valign="top" align="center">Patch</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">h42 (1,603&#x2013;1,607 and 1,626&#x2013;1,629)</td>
<td valign="top" align="left">V37, K38, L39, K43, E44, L45</td>
<td valign="top" align="center">I</td>
</tr>
<tr>
<td valign="top" align="left">h41 (1,537&#x2013;1,543 and 1,583&#x2013;1,596)</td>
<td valign="top" align="left">P47, W52, T55, R56, S59, R62, H63, R67, Y79, G80</td>
<td valign="top" align="center">II</td>
</tr>
<tr>
<td valign="top" align="left">h42 and h43 (1,653&#x2013;1,656 and 1,664&#x2013;1,666)</td>
<td valign="top" align="left">R84, N85, G86, P89, H91, F92</td>
<td valign="top" align="center">III</td>
</tr>
<tr>
<td valign="top" align="left">h41es10 (1,561&#x2013;1,571)</td>
<td valign="top" align="left">G71, V72, R94, S96, K97, S98, R101, R102, Q105, G120, R121</td>
<td valign="top" align="center">IV</td>
</tr>
<tr>
<td valign="top" align="left">h39es9 (1,414&#x2013;1,430)</td>
<td valign="top" align="left">P2-V9, Y65, R129, D132, A135</td>
<td valign="top" align="center">V</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>After mapping the conservation score to the solvent-excluded molecular surface by specific color scheme (<xref ref-type="bibr" rid="B11">Baker et al., 2001</xref>; <xref ref-type="bibr" rid="B68">Schrodinger, 2015</xref>; <xref ref-type="bibr" rid="B8">Ashkenazy et al., 2016</xref>), we further analyzed the interaction interface of RPS19 (<xref ref-type="fig" rid="F2">Figure 2A</xref>). Five conserved patches (I, II, III, IV, and V) were identified by visual inspection, and they interact with the five secondary structural elements of 18S rRNA accordingly (<xref ref-type="fig" rid="F2">Figure 2A</xref> and <xref ref-type="table" rid="T1">Table 1</xref>). Patch I and III correspond to the first two IDRs, respectively; Patch II is composed of the residues from the hydrophilic side of h2 and their nearby residues; Patch IV comprises the residues from the hydrophilic sides of h3 and h4, and the C-terminal of the third IDR; Patch V consists of the N-terminal coil of RPS19 and the exposed residues of h5. Moreover, all these 5 conserved patches present positive electrostatic potential (<xref ref-type="fig" rid="F2">Figure 2B</xref>), which is well matched with the negatively charged rRNA at the interaction interface.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>The five surface patches of RPS19 interact with the five secondary structural elements of 18S rRNA (orange) accordingly, with different color schemes rendering <bold>(A)</bold> conservation level and <bold>(B)</bold> electrostatic potential.</p></caption>
<graphic xlink:href="fgene-12-650897-g002.tif"/>
</fig>
<p>Previously, <xref ref-type="bibr" rid="B30">Gregory et al. (2007)</xref> had inferred that two highly conserved basic patches on the RPS19 surface might interact with rRNA. Patch I and III in our work have covered them, and three new patches were identified additionally. Moreover, our analyses have extracted the specific interaction residues and nucleotides between human RPS19 and 18S rRNA directly, thereby providing additional structural insights.</p>
</sec>
</sec>
<sec id="S2.SS2">
<title>Overview of DBA Mutations</title>
<p>We have curated 51 DBA missense mutations (<xref ref-type="supplementary-material" rid="TS2">Supplementary Table 2</xref>) and 30 neutral ones (<xref ref-type="supplementary-material" rid="TS3">Supplementary Table 3</xref>). These data presumably constitute a very complete missense mutation dataset of RPS19 until now. We first checked the conservation features of DBA mutations. According to the calculated Consurf scores (<xref ref-type="bibr" rid="B8">Ashkenazy et al., 2016</xref>), the conservation levels of DBA mutation sites are significantly higher than the neutral ones (median: -0.665 vs. 0.0995, <italic>p</italic> = 8.9E-5, one-tailed Mann-Whitney U test, <xref ref-type="fig" rid="F3">Figure 3A</xref>). Nearly one half of DBA mutations are located at the highly conserved sites (Consurf score &#x003C; -1.0). These results are consistent with the general knowledge that disease mutations tend to occur at conservative sites (<xref ref-type="bibr" rid="B44">Kumar et al., 2009</xref>; <xref ref-type="bibr" rid="B45">Lal et al., 2020</xref>), which are indicative of functional or structural importance. Although it shows the potential in discriminating DBA and neutral mutations, the conservation feature cannot provide specific clues concerning the pathogenesis. We then analyzed the residue types and counts of the DBA mutations.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>The conservation properties of RPS19 mutations and the residue type counts of DBA mutations. <bold>(A)</bold> The Consurf score boxplots between DBA and neutral mutations. The lower the Consurf score, the higher the conservation. <bold>(B)</bold> The count of wildtype and mutant residues in DBA mutations.</p></caption>
<graphic xlink:href="fgene-12-650897-g003.tif"/>
</fig>
<p>Overall, DBA mutations are more likely to occur at positively charged, hydrophobic or small residues (<xref ref-type="fig" rid="F3">Figure 3B</xref> and <xref ref-type="supplementary-material" rid="TS2">Supplementary Table 2</xref>). Arginine (Arg) is the most mutated residue, accounting for about 19.6% (10/51) of all DBA mutations. Leucine (Leu) is the second most mutated residue (17.6%, 9/51), and small volume residues such as Glycine (Gly) and Alanine (Ala) are also mutated frequently (<xref ref-type="fig" rid="F3">Figure 3B</xref>). We speculate that these Arg sites may participate in interactions with negatively charged rRNAs, and their mutations may disrupt or weaken these interactions. As for Leu, the speculation is that they often form the hydrophobic core, and mutations to polar residues may disturb the stability of the protein. Mutations from small residues to large ones may also influence the stability due to steric hindrance effect.</p>
<p>One striking feature is that a larger number of residues (29.4%, 15/51) have been mutated into Proline (Pro) (<xref ref-type="fig" rid="F3">Figure 3B</xref>), a well-known helix breaker due to its cyclic side chain (a pyrrolidine ring) (<xref ref-type="bibr" rid="B77">Woolfson and Williams, 1990</xref>). Considering that the hydrophobic core of RPS19 is mainly composed of the bundle of the five helices, these mutations will presumably affect protein structure stability by destroying helices. There were also many other residues mutated to Arg (<xref ref-type="fig" rid="F3">Figure 3B</xref>), most of which were mutated from hydrophobic residues, such as Leu and Tryptophan (Trp) (<xref ref-type="supplementary-material" rid="TS2">Supplementary Table 2</xref>). The hydrophobic effect of these residues would be disrupted by charged Arg, and thus protein folding would be affected.</p>
<p>Based on the analyses concerning the residue types, we can roughly infer that DBA mutations may mainly disrupt RPS19 from two perspectives: protein folding stability and interaction, as other studies have done (<xref ref-type="bibr" rid="B83">Zhang et al., 2010</xref>; <xref ref-type="bibr" rid="B58">Peng et al., 2018</xref>) or reviews have summarized (<xref ref-type="bibr" rid="B71">Stefl et al., 2013</xref>; <xref ref-type="bibr" rid="B80">Yates and Sternberg, 2013</xref>). However, more systematic data mining is required to consolidate these inferences, and more detailed mechanisms need to be explored in order to answer how a DBA missense mutation destabilize RPS19 or perturb its interactions. Compared to sequence, structure is more directly related to function. In the following analyses, more in-depth investigations in structural contexts have thus been conducted.</p>
</sec>
<sec id="S2.SS3">
<title>Structural Basis of DBA Mutations</title>
<sec id="S2.SS3.SSS1">
<title>Substantial DBA Mutations Decrease Structure Stability by Two Approaches</title>
<p>To investigate to what extent the mutations affect stability of the protein structure, we calculated the change in free energy (&#x0394;&#x0394;G) of folding of RPS19 caused by mutations using FoldX, a protein design algorithm that uses an empirical force field (<xref ref-type="bibr" rid="B69">Schymkowitz et al., 2005</xref>). We found that DBA mutations are more capable than neutral ones of decreasing the structural stability (median: 1.55 vs. 0.106, <italic>p</italic> = 2.27E-4, one-tailed Mann-Whitney U test, <xref ref-type="fig" rid="F4">Figure 4A</xref>). There were 30 DBA mutations with &#x0394;&#x0394;G greater than 1 kcal/mol, which was usually used as the cut-off to distinguish destabilizing mutations (<xref ref-type="bibr" rid="B15">Buss et al., 2018</xref>). This result has quantitatively demonstrated that the majority of DBA mutations should have reduced the stability of RPS19.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>The boxplots of &#x0394;&#x0394;G and rSASA. <bold>(A)</bold> The comparison of the &#x0394;&#x0394;G caused by mutations. <bold>(B)</bold> The comparison of the rSASA of mutated sites.</p></caption>
<graphic xlink:href="fgene-12-650897-g004.tif"/>
</fig>
<p>We further investigated the approaches by which DBA mutations decrease the structure stability. The first is the destruction of hydrophobic core. We compared the relative solvent accessible surface area (rSASA) between DBA and neutral mutations (<xref ref-type="fig" rid="F4">Figure 4B</xref>), and found that DBA mutation sites are more buried than the neutral ones (median: 0.13 vs. 0.42, <italic>p</italic> = 2.17E-5, one-tailed Mann-Whitney U test). Further restricting in the 30 DBA mutations with &#x0394;&#x0394;G &#x003E; 1 kcal/mol shows even smaller rSASA values (median: 0.061). We checked the residue types of these 30 mutations, and found two major classes: 13 are mutated from typical hydrophobic residues (Leu, Trp, Val, and Phe) to other types with 22.96% hydrophobicity decreased on average (<xref ref-type="bibr" rid="B66">Rose et al., 1985</xref>), and 7 of them are mutated from small residues (Gly, Ala) to larger ones with 108.36% volume increased on average (<xref ref-type="supplementary-material" rid="TS2">Supplementary Table 2</xref>; <xref ref-type="bibr" rid="B81">Zamyatnin, 1972</xref>).</p>
<p>The decrease of hydrophobicity can weaken the hydrophobic interaction during protein folding, and the increasing of residue volume can result in steric hindrance effect. Either of them could destroy the hydrophobic core and thus lead to reduced stability (<xref ref-type="bibr" rid="B50">Liu et al., 2000</xref>; <xref ref-type="bibr" rid="B51">Loladze et al., 2002</xref>). The best examples are the two mutations occurring at Gly127 (Gly127Arg, Gly127Glu). Gly127 is located at the N-terminal of h5 and is fully buried (rSASA = 0, <xref ref-type="fig" rid="F5">Figure 5A</xref>). Its surrounding residues (Lys111, Met112, Leu123, Thr124, Gln126, Asp130, and Leu131) forms a crowed interior space, which cannot accommodate larger residues. Therefore, the substitutions from Gly, which is the smallest residue (volume = 60.1 &#x00C5;<sup>3</sup>), to larger Arg and Glu (volume = 173.4 and 138.4 &#x00C5;<sup>3</sup>) would pose significant steric hindrance effect with 188.52 and 130.28% volume increase, respectively. Moreover, Arg and Glu have charged side chains with high hydrophilicity, which will also disturb the formation of the hydrophobic core during protein folding. It should be these two effects that cause Gly127Arg and Gly127Glu to result in the largest structural destabilization with &#x0394;&#x0394;G = 18.72 and 17.12 kcal/mol, respectively (<xref ref-type="fig" rid="F4">Figures 4A</xref>, <xref ref-type="fig" rid="F5">5A</xref>).</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>The structural visualization of the Gly127 and the sites mutated to Pro. <bold>(A)</bold> The locations of the Gly127 (red sphere) and its surrounding residues (blue sphere). <bold>(B)</bold> The locations of residues mutated to Pro (red sticks).</p></caption>
<graphic xlink:href="fgene-12-650897-g005.tif"/>
</fig>
<p>Secondly, as a helix-dominant structure, RPS19 can be destabilized by disturbing the folding of helices. Pro is well-known as a helix breaker (<xref ref-type="bibr" rid="B49">Levitt, 1978</xref>; <xref ref-type="bibr" rid="B77">Woolfson and Williams, 1990</xref>), which disrupts two adjacent hydrogen bonds and whose pyrrolidine ring pushes the preceding turn of backbone away by about 1 &#x00C5; (<xref ref-type="bibr" rid="B64">Richardson and Richardson, 1988</xref>). Among the 30 DBA mutations with &#x0394;&#x0394;G &#x003E; 1 kcal/mol, eight are mutated to Pro. In the whole dataset, more than one quarter of disease mutations (29.4%, 15/51) are mutated to Pro (<xref ref-type="supplementary-material" rid="TS2">Supplementary Table 2</xref>), and there exists at least one such mutation in each helix (<xref ref-type="fig" rid="F5">Figure 5B</xref>). These mutations may distort the folding of helices, disturb the formation of the helix bundle, and thus decrease the stability of RPS19. Many of these mutations (Leu64Pro, Ala20Pro, Thr76Pro, Arg102Pro, and Leu131Pro) were also proposed to affect the protein&#x2019;s stability through breaking the folding of helices in other studies (<xref ref-type="bibr" rid="B16">Campagnoli et al., 2008</xref>; <xref ref-type="bibr" rid="B73">Ulirsch et al., 2019</xref>).</p>
</sec>
<sec id="S2.SS3.SSS2">
<title>Numerous DBA Mutations Disrupt Interactions With 18S rRNA by Three Avenues</title>
<p>In section &#x201C;Analyses of Interactions in Packed State,&#x201D; we have identified 5 conserved surface patches that are involved in the interactions with 18S rRNA (<xref ref-type="table" rid="T1">Table 1</xref>). There are 24 of 51 DBA mutations are located in these patches, indicating that interfering with RPS19-18S rRNA interactions serves as another main feature of DBA mutations. When the RPS19-18S rRNA interactions are perturbed, the SSU assembling will be affected and thereby lead to insufficient biogenesis of ribosomes.</p>
<p>Through analyzing the 3D structure, we have identified three avenues of affecting interactions. First, residue substitutions caused by DBA mutations may disrupt the hydrogen bonds formed between RPS19 and 18S rRNA, such as losing hydrogen donor or acceptor, increasing the distance between bonding atoms, and distorting the bond angle to unfavorable situations. The RPS19-18S rRNA interaction is stabilized by 46 hydrogen bonds (<xref ref-type="supplementary-material" rid="TS1">Supplementary Tables 1</xref>, <xref ref-type="supplementary-material" rid="TS4">4</xref>), and about 16 DBA mutations break some of these hydrogen bonds (<xref ref-type="supplementary-material" rid="TS5">Supplementary Table 5</xref>). For example, mutation Lys38Asn in surface Patch I (<xref ref-type="fig" rid="F6">Figure 6A</xref>) and Arg101Cys in surface Patch IV result in shorter side chains that lead to unsuitable distance between the potential bonding atoms; mutation Arg94Leu in Patch IV loses the hydrogen donor; and mutation Arg62Gln in Patch II distorts the bond angle to an unfavorable situation (<xref ref-type="supplementary-material" rid="TS5">Supplementary Table 5</xref>).</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>The three avenues of affecting interactions by DBA mutations. <bold>(A)</bold> Destroy hydrogen bonds by increasing the distance between bonding atoms (Lys38Asn). <bold>(B)</bold> Reverse the surface electrostatic properties (Arg62Trp). <bold>(C)</bold> Introduce steric hindrance at the binding interface (Ser59Phe).</p></caption>
<graphic xlink:href="fgene-12-650897-g006.tif"/>
</fig>
<p>Second, DBA mutations may alter the surface electrostatic properties of the protein. Considering the prevalent electrostatic interactions between negatively charged rRNA and positively charged surface patches of RPS19 (<xref ref-type="fig" rid="F2">Figure 2</xref>), we can speculate that DBA mutations in these patches may perturb them. For example, mutation Arg62Trp in Patch II have reversed the surface electrostatic potential from positive to negative, affecting the interaction between this site and h41 of 18S rRNA (<xref ref-type="fig" rid="F6">Figure 6B</xref>). Although Trp itself is nearly electrostatic neutral, several nearby aromatic residues (Phe14, His63, and Tyr65) forming potential &#x03C0;&#x2013;&#x03C0; stacking and several negatively charged or polar residues (Gln11, Gln12, and Glu13) may have contributed to the negative electrostatic potential at this site. Another example is the mutation Gly71Glu in Patch IV, which changed the surface electrostatic potential from positive to neutral, and thus perturbed the interaction between this site and h41es10 of 18S rRNA (<xref ref-type="supplementary-material" rid="TS5">Supplementary Table 5</xref>).</p>
<p>Last, a set of DBA mutations occurring at the interaction interface do not influence the interaction through hydrogen bonds or other electrostatic perturbations, but they substitute small residues into large ones, which will introduce steric hindrance at the binding interface. One of the examples is Ser59Phe in Patch II, which will perturb the interaction with h41 of 18S rRNA (<xref ref-type="fig" rid="F6">Figure 6C</xref>). Another example in Patch II is the mutation Thr55Met, which will lead to the spatial collision between it and h41 of 18S rRNA. A previous experimental study showed that Thr55Met partially impairs the function of RPS19, but the mechanism was not clear yet (<xref ref-type="bibr" rid="B9">Aspesi et al., 2018</xref>). Here, our study has proposed a possible mechanism to explain this effect.</p>
<p>It is worth noting that a mutation may disrupt the hydrophobic core or perturb the interaction through more than one approach (<xref ref-type="fig" rid="F7">Figures 7A,B</xref>). Taken Ser59Phe as an example, it not only results in the steric hindrance effect, but also breaks a hydrogen bond. Specifically, when substituted to Phe from Ser, the residue volume has increased by 134%. On the other hand, Ser59 is located at the hydrophilic side of h2 and is a component of the surface Patch II; a hydrogen bond was formed between its hydroxyl group and G1541 of 18S rRNA, and the substitution with Phe would lead to the loss of hydrogen donor (<xref ref-type="fig" rid="F6">Figure 6C</xref>). The detailed structural bases of all the DBA mutations are provided in <xref ref-type="supplementary-material" rid="TS5">Supplementary Table 5</xref>. In summary, 15.7% (8/51) of DBA mutations may affect structure stability and RPS19-rRNA interactions at the same time. More specifically, 56.7% (17/30) of destabilizing mutations and 47.6% of interaction-disrupting mutations (10/21) may manifest their deleterious impact through more than one approach (<xref ref-type="fig" rid="F7">Figures 7A,B</xref> and <xref ref-type="supplementary-material" rid="TS5">Supplementary Table 5</xref>).</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption><p>The number of DBA mutations with different structural basis. <bold>(A)</bold> Decrease structure stability. <bold>(B)</bold> Disrupt interaction with 18S rRNA.</p></caption>
<graphic xlink:href="fgene-12-650897-g007.tif"/>
</fig>
</sec>
</sec>
<sec id="S2.SS4">
<title>Predicting Pathogenicity of New RPS19 Mutations</title>
<p>Our efforts of understanding the structural basis of DBA mutations of RPS19 can provide clues to predict the pathogenicity of newly identified RPS19 mutations. General prediction tools, such as PMut (<xref ref-type="bibr" rid="B52">Lopez-Ferrando et al., 2017</xref>), MutPred2 (<xref ref-type="bibr" rid="B57">Pejaver et al., 2017</xref>), PolyPhen2 (<xref ref-type="bibr" rid="B1">Adzhubei et al., 2010</xref>), and SIFT (<xref ref-type="bibr" rid="B70">Sim et al., 2012</xref>), do not perform well with high false positive rate (FPR) on RPS19 mutations (<xref ref-type="fig" rid="F8">Figures 8A,B</xref>), possibly because they internally have the tendency to overestimate the pathogenicity of mutations, as discussed previously (<xref ref-type="bibr" rid="B5">Andersen et al., 2017</xref>; <xref ref-type="bibr" rid="B31">Holland et al., 2017</xref>; <xref ref-type="bibr" rid="B58">Peng et al., 2018</xref>). Another reason may come from that these general tools do not incorporate specific features about RPS19. If we adopt the RPS19 mutations as a specific dataset and incorporate as features the clues in the understanding of DBA mutations, it should be promising to build a better pathogenicity predictor by using machine learning methods, as demonstrated in developing the IDR-specific disease mutation predictor previously (<xref ref-type="bibr" rid="B84">Zhou et al., 2020</xref>).</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption><p>Performance comparison between RPS19-SVM (cross-validation) and other four well-known prediction tools <bold>(A,B)</bold> and the heatmap of RPS19-SVM pathogenicity predictions of all possible mutations of RPS19 <bold>(C)</bold>. The definitions of the performance metrics are described in <xref ref-type="supplementary-material" rid="TS6">Supplementary Table 6</xref>. The larger the pathogenicity score (ranging from 0 to 1) given by RPS19-SVM, the higher the possibility of being pathogenic.</p></caption>
<graphic xlink:href="fgene-12-650897-g008.tif"/>
</fig>
<p>We adopted the support vector machine (SVM) to build the RPS19-specific prediction tool. After further curation, 29 DBA mutations (positive samples) and 30 neutral ones (negative samples) were selected from the mutation dataset as training data. We then extracted the features based on our understandings of the structural basis of DBA mutations. In total, 18 candidate features were extracted for each mutation concerning the interaction with rRNA, structural stability, conservation, etc. After feature selection, 8 features were finally selected (BSA, rBSA, HB_Num, &#x0394;charge, &#x0394;&#x0394;G, &#x0394;helix, BLOSUM62, and &#x0394;disorder) (<xref ref-type="supplementary-material" rid="TS7">Supplementary Table 7</xref>).</p>
<p>Based on fivefold cross-validation, we identified the best hyper-parameters (C = 100, &#x03B3; = 0.01), and 26 of the 29 DBA mutations can be correctly predicted in the cross-validation, similar to other well-known tools (<xref ref-type="fig" rid="F8">Figure 8A</xref>). At the same time, the FPR has been significantly decreased compared with others (<xref ref-type="fig" rid="F8">Figure 8A</xref>). Overall, much better performance has been achieved as measured by ACC, F1 score, and MCC (<xref ref-type="fig" rid="F8">Figure 8B</xref>). We manually inspected the nine false positives of SIFT (Pro2Leu, Val4Phe, Thr5Pro, Lys24Asn, Asp35Gly, Thr60Ala, Tyr79Cys, Gln105Arg, His145Tyr), and found most of them are located at conserved sites according to the Consurf scores (<xref ref-type="supplementary-material" rid="TS3">Supplementary Table 3</xref>). According to its prediction logic, SIFT will tend to predict these variants as deleterious. As for the cross-validation of SVM training here, several features independent of conservation have been adopted, which may be responsible for its lower FPR. For these variants, most of them have small or even negative &#x0394;&#x0394;G values (<xref ref-type="supplementary-material" rid="TS3">Supplementary Table 3</xref>), indicating only trivial perturbation of structural stability, which may serve as the reason why the SVM classifier predicts them as neutral.</p>
<p>Finally, we re-trained the predictor, namely RPS19-SVM, on all the training data with the best hyper-parameters, and utilized it to predict the pathogenicity for all possible missense mutations of RPS19 (<xref ref-type="fig" rid="F8">Figure 8C</xref> and <xref ref-type="supplementary-material" rid="TS8">Supplementary Table 8</xref>). The resulted pathogenicity scores can be valuable in inferring the disease-association of newly identified RPS19 mutations, or the deleteriousness of newly designed mutagenesis mutations.</p>
<p>As ExAC has been upgraded to gnomAD later (<xref ref-type="bibr" rid="B37">Karczewski et al., 2020</xref>), we retrieved 25 additional neutral missense variants of RPS19 from gnomAD v2. They were not utilized in the training of RPS19-SVM, so we can use them as an independent testing dataset. It turns out that RPS19-SVM can accurately predict 23 of them as &#x201C;neutral&#x201D; with a low FPR of 8% (2/25), much better than the other four tools (PMut: 60%, MutPred2: 76%, PolyPhen2: 20%, SIFT: 40%), confirming its superior performance (<xref ref-type="supplementary-material" rid="TS9">Supplementary Table 9</xref>). When new positive data are available in the future, further validation could be conducted as well.</p>
<p>Moreover, the heatmap based on the pathogenicity scores (<xref ref-type="fig" rid="F8">Figure 8C</xref>) also confirms our understanding of their molecular mechanisms and can provide new insights. First, mutations substituted by positive residues (Arg, Lys, and His) or by negative residues (Asp and Glu) have relatively low or high pathogenicity scores, respectively. Second, mutations substituted by Proline are not tolerated by RPS19, especially in helix regions. These results are consistent with its characteristics of interaction with negatively charged 18S rRNA and hydrophobic core with a bundle of helices. Moreover, we also found that the two &#x03B2;-hairpin regions, overlapping with the second and third putative IDRs, had lower mutation tolerance for substitution of hydrophobic residues, though few disease mutations in these regions were reported previously. They participate in the composition of surface Patch III and IV, and substitutions by hydrophobic residues may thus affect the interactions with 18S rRNA.</p>
</sec>
</sec>
<sec id="S3">
<title>Discussion</title>
<p>In this work, we conducted a systematic study aiming at revealing the structural basis of DBA mutations at RPS19. Our study illustrated that DBA mutations would disrupt the hydrophobic core related to structural stability or perturb the interactions between RPS19 and 18S rRNA through two or three mechanisms, respectively. Based on these, we further trained an RPS19-specific predictor and predicted the pathogenicity of all possible RPS19 mutations. Logically, the RPS19 molecules bearing DBA mutations would thus be subject to faster degradation or incapability of assembling into the ribosome SSU, resulting in insufficient ribosome quantities and finally DBA symptoms.</p>
<p>Compared with a previous study (<xref ref-type="bibr" rid="B30">Gregory et al., 2007</xref>), our work is more comprehensive in that a more complete list of DBA mutations were incorporated, and more specific mechanisms were investigated. Moreover, our studies on several mutations were more reliable. For example, Trp52 and Gly120 were considered to be located at the surface of RPS19, and Trp52Arg and Gly120Ser were thus believed to interrupt the interaction with other molecules according to the previous study (<xref ref-type="bibr" rid="B30">Gregory et al., 2007</xref>). However, based on the complex structure of human ribosome, we in this work found that Trp52 is actually almost buried (rSASA = 0.067), and does not participate in the interaction. Moreover, there may exist &#x03C0;&#x2013;&#x03C0; stacking between Trp52 and nearby residues such as Phe53 and Pro47. The mutation Trp52Arg would decrease the hydrophobicity of this site, and thus destabilize the hydrophobic core, as supported by the calculated &#x0394;&#x0394;G (4.35 kcal/mol). As for Gly120, it is actually only partially exposed (rSASA = 0.135), and no interaction can be observed. The mutation Gly120Arg introduces a large and hydrophilic residue, which would decrease the stability of RPS19, as indicated by the &#x0394;&#x0394;G calculation (6.51 kcal/mol) as well. Hence, our work has improved or even corrected previous understandings of some DBA mutations.</p>
<p>It should be admitted that our analyses have some shortcomings yet. First, ribosomopathies, including DBA, often manifest in a tissue-specific way, with which two hypotheses have been proposed: the specialized ribosome hypothesis and the ribosome concentration hypothesis (<xref ref-type="bibr" rid="B27">Farley-Barnes et al., 2019</xref>). Our work was mainly conducted by following the latter. If the specialized ribosome hypothesis is proved also a major reason for DBA in the future, this kind of systematic analyses should be further improved from this perspective.</p>
<p>Second, in addition to being the indispensable component of ribosome SSU, RPS19 also plays essential roles in many other pathways, including pre-rRNA processing, exportation of SSU precursors from nucleus to cytoplasm, and conformation maturation of 18S rRNA (<xref ref-type="bibr" rid="B47">Leger-Silvestre et al., 2005</xref>; <xref ref-type="bibr" rid="B28">Flygare et al., 2007</xref>; <xref ref-type="bibr" rid="B34">Juli et al., 2016</xref>; <xref ref-type="bibr" rid="B25">Duss et al., 2019</xref>). Our work currently only considers the RPS19 mutations&#x2019; effects on the maintenance of mature ribosome, and analyses of their perturbation on other pathways of RPS19 should also be conducted in the future.</p>
<p>Third, we only analyzed the structural stability and interactions of RPS19, but the mutations may act from other aspects, such as subcellular localization. Ribosome precursors are first formed in the nucleoli (<xref ref-type="bibr" rid="B61">Phipps et al., 2011</xref>; <xref ref-type="bibr" rid="B3">Ameismeier et al., 2018</xref>) and then mature in the cytoplasm (<xref ref-type="bibr" rid="B67">Rouquette et al., 2005</xref>; <xref ref-type="bibr" rid="B3">Ameismeier et al., 2018</xref>). According to immunofluorescence experiments and structural analysis of ribosome intermediates, RPS19 participates in ribosome assembly in the nucleoli (<xref ref-type="bibr" rid="B3">Ameismeier et al., 2018</xref>; <xref ref-type="bibr" rid="B42">Klinge and Woolford, 2019</xref>), so the RPS19 synthesized in the cytoplasm first needs to enter into the nucleus and be localized in the nucleoli. Two nucleolar localization signals (NoSs)&#x2014;Met1 to Arg16 and Gly120 to Asn142&#x2014;in RPS19 have been identified previously (<xref ref-type="bibr" rid="B22">Da Costa et al., 2003</xref>). Two DBA mutations&#x2014;Val15Phe and Gly127Gln&#x2014;located in these two NoSs, have been proved to fail to localize RPS19 to the nucleoli (<xref ref-type="bibr" rid="B22">Da Costa et al., 2003</xref>). Moreover, other mutations like Ala57Pro and Ala61Glu, which are not located in the NoSs, also caused mislocalization of RPS19 (<xref ref-type="bibr" rid="B6">Angelini et al., 2007</xref>). Incorporating the effects on subcellular localization of DBA mutations in the future analyses would be beneficial for better understanding of the mechanisms of disease mutations.</p>
<p>It should be noted that the approaches we proposed cannot cover all the DBA mutations. Some residue substitutions are highly likely to disrupt the protein structure, such as Ala17Pro, Ala20Pro, Ala57Pro, Ala58Pro, and Leu64Pro, as they have introduced proline into helical regions. But the free energy changes calculated by FoldX did not support this. Considering that previous studies have suggested that FoldX calculation for mutations containing Pro needs to be optimized (<xref ref-type="bibr" rid="B62">Potapov et al., 2009</xref>; <xref ref-type="bibr" rid="B78">Yang et al., 2020</xref>), the &#x0394;&#x0394;G calculated for these mutations may be far from accurate. Hence, we still treat these mutations as possibly belonging to &#x201C;Break helix (Mutated to Proline)&#x201D; (<xref ref-type="supplementary-material" rid="TS5">Supplementary Table 5</xref>). Another example is Lys23Arg, which may result in an additional intra-molecular hydrogen bond to Leu28, a residue in the IDR located between h1 and h2. Thus, the conformation of this IDR could be restrained, and the interaction between RPS19 and 18S rRNA would be influenced. The summary of the structural bases of those mutations that cannot be covered by our proposed approaches in the section of Results is provided in the column of &#x201C;Notes&#x201D; of <xref ref-type="supplementary-material" rid="TS5">Supplementary Table 5</xref>. It will be promising that the more specific and detailed mechanisms of these mutations are studied by using molecular dynamics simulations in the future.</p>
<p>Except for RPS19, DBA may also stem from defects at other RPs. In our perspective, further studies on the DBA mutations in all related RPs in the future would provide a more comprehensive picture of the pathogenesis mechanisms, which may also shed light on the pathogenesis of other ribosomopathies. Some DBA mutations in other RPs may share similar structural basis as those in RPS19. For example, an N2-Q22 deletion variant of RPS24 is known to cause DBA (<xref ref-type="bibr" rid="B18">Choesmel et al., 2008</xref>). We checked the structure of the ribosome complex, and found that the deleted fragment interacts with the h21es6a of 18S rRNA. Hence, it can be assumed that its structural basis may be disrupting the interactions with 18S rRNA. Moreover, considering that haploinsufficiency have been found in many DBA-related RPs (<xref ref-type="bibr" rid="B26">Engidaye et al., 2019</xref>), destabilizing protein structure may also be a common mechanism in DBA mutations of these RPs. On the other hand, some RPs are located near the active site in the ribosome complex, so DBA mutations in these RPs may have different mechanisms that were not studied in this work. For instance, RPS26 can bind mRNA molecules during the procedure of translation (<xref ref-type="bibr" rid="B32">Hussain et al., 2014</xref>), so some DBA mutations in RPS26 may hold the mechanism of perturbing its binding with mRNA molecules, but not the rRNA.</p>
</sec>
<sec id="S4" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="S4.SS1">
<title>RPS19 Structure Analyses</title>
<p>The RPS19 structure was extracted from the coordinates of human ribosome SSU complex (PDB ID: 6G5H) (<xref ref-type="bibr" rid="B3">Ameismeier et al., 2018</xref>), and was used as initial conformation for performing molecular dynamics (MD) simulation. The simulation was conducted by using the Amber16 package (<xref ref-type="bibr" rid="B17">Case et al., 2016</xref>) with the force field of RSFF2C (<xref ref-type="bibr" rid="B36">Kang et al., 2018</xref>), and the explicit solvent model of TIP3P (<xref ref-type="bibr" rid="B33">Jorgensen et al., 1983</xref>) was adopted with a periodic box whose edges had a minimum distance of 8 &#x00C5; to any atom originally presented in the solute.</p>
<p>Before production simulation, energy minimization was performed to relax possible atom collisions, and the system was equilibrated for 2 ns with the final temperature reaching 300.0 K. The production simulation was conducted in an NPT ensemble and lasted 1,000 ns. A timestep of 2 fs was used in both equilibration and production simulation.</p>
<p>The conformations were saved in the trajectory file with an interval of 1 ps, and were analyzed using CPPTRAJ (<xref ref-type="bibr" rid="B65">Roe and Cheatham, 2013</xref>). Conformation clustering (dbscan) was performed by setting the backbone RMSD cut-off to 2 &#x00C5;. The representative conformation of the resulted largest cluster was used for the assignment of secondary structure by using DSSP (<xref ref-type="bibr" rid="B35">Kabsch and Sander, 1983</xref>). The 8 states of secondary structure definition were then simplified into 3 states: helix (G, H, I), sheet (B, E), and coil (T, S, C). The backbone RMSF (root mean square fluctuation) of RPS19 was calculated by sampling a frame per 1 ps and using the average conformation as reference. The residues in the middle of the protein with RMSF greater than 2 &#x00C5; were considered as intrinsically disordered.</p>
<p>The interaction interface of RPS19 in the ribosome complex was identified by using PDBePISA (version 1.48<sup><xref ref-type="fn" rid="footnote1">1</xref></sup>; <xref ref-type="bibr" rid="B43">Krissinel and Henrick, 2007</xref>).</p>
</sec>
<sec id="S4.SS2">
<title>Mutation Data Collection</title>
<p>The DBA missense mutations were collected from dbagenes (<xref ref-type="bibr" rid="B14">Boria et al., 2010</xref>), ClinVar (<xref ref-type="bibr" rid="B46">Landrum et al., 2016</xref>), and manual literature review. The canonical transcript (ENST00000598742 in Ensembl or NM_001022.3 in RefSeq database) (<xref ref-type="bibr" rid="B56">O&#x2019;Leary et al., 2016</xref>; <xref ref-type="bibr" rid="B79">Yates et al., 2020</xref>), was adopted as the reference to map all the mutations. During the data review, several mutations, incorrectly recorded in their original sources, were corrected in this work, including c.182C &#x003E; A (Ala61Glu), c.358G &#x003E; C (Gly120Arg), and c.380G &#x003E; A (Gly127Arg).</p>
<p>The missense variants of RPS19 in the ExAC (<xref ref-type="bibr" rid="B38">Karczewski et al., 2017</xref>) were also collected. Based on that the prevalence of DBA is 5&#x2013;7 in one million individuals, and 25% of them are caused by RPS19 mutations, and the inheritance pattern is autosomal dominant, the minor allele frequency (MAF) of pathogenic RPS19 mutations can be estimated as 6.25E-7 to 8.75E-7. The collected ExAC variants here have allele frequency values with at least one order of magnitude higher. Moreover, considering that DBA usually presents within the first year of life (<xref ref-type="bibr" rid="B73">Ulirsch et al., 2019</xref>) and that ExAC has attempted to exclude severe pediatric diseases according to its description (<xref ref-type="bibr" rid="B48">Lek et al., 2016</xref>), one can assume that most of these ExAC variants would be neutral. After removing those that has been explicitly recorded as DBA mutations (c.68A &#x003E; G, c.164C &#x003E; T, c.208G &#x003E; A, c.301C &#x003E; T), the remaining ones served as the neutral dataset in this work.</p>
</sec>
<sec id="S4.SS3">
<title>Conservation and Electrostatic Potential Calculation</title>
<p>The Consurf score of each site of RPS19 was calculated by using Consurf 2016 server (<xref ref-type="bibr" rid="B8">Ashkenazy et al., 2016</xref>). The scores were normalized with mean of 0 and standard deviation of 1. The lower the score, the slower the evolution rate, and the higher the conservation level.</p>
<p>The electrostatic potential on the surface of RPS19 was calculated by using the APBS Electrostatics Plugin integrated in PyMOL (version 2.3.0) with default parameters (<xref ref-type="bibr" rid="B11">Baker et al., 2001</xref>; <xref ref-type="bibr" rid="B24">Dolinsky et al., 2004</xref>; <xref ref-type="bibr" rid="B68">Schrodinger, 2015</xref>).</p>
<p>The Consurf scores and electrostatic potential were rendered at the protein&#x2019;s solvent-excluded molecular surface with different color schemes by using UCSF Chimera (<xref ref-type="bibr" rid="B60">Pettersen et al., 2004</xref>).</p>
</sec>
<sec id="S4.SS4">
<title>Calculation of Folding Free Energy Change and rSASA</title>
<p>The effect of a mutation on protein structural stability was measured by the change of folding free energy (&#x0394;&#x0394;G), and was calculated by using the FoldX package (<xref ref-type="bibr" rid="B69">Schymkowitz et al., 2005</xref>). In detail, the RPS19 coordinates from Cryo-EM ribosome structure (PDB ID: 6G5H) was first processed iteratively by using RepairPDB command of FoldX (an energy minimization process) until the decrease of calculated folding energy was lower than 1 kcal/mol in at least 5 iterations. Then we used the PositionScan command of FoldX to mutate all sites to all possible residues, and to calculate the &#x0394;&#x0394;G values for all the mutations.</p>
<p>The solvent accessible surface area (SASA) of each residue was calculated by DSSP (<xref ref-type="bibr" rid="B35">Kabsch and Sander, 1983</xref>). The relative SASA (rSASA) was obtained by dividing SASA with the maximum SASA, which was computed by placing the specific residue between two Gly residues in an extended conformation accordingly (<xref ref-type="bibr" rid="B72">Tien et al., 2013</xref>).</p>
</sec>
<sec id="S4.SS5">
<title>Predictor Training</title>
<p>The neutral mutations collected from ExAC were treated as negative samples. The positive samples were selected from the DBA mutations if they met any one of these criteria: (1) identified in more than one patient, (2) explicitly annotated as &#x201C;Pathogenic&#x201D; or &#x201C;Likely pathogenic&#x201D; in ClinVar, and (3) experimentally confirmed to affect the physiological function of RPS19 (expression, nucleolar localization, ribosomal abundance, etc.).</p>
<p>Based on our analyses of DBA mutations, 18 features in four categories were extracted for describing each mutation. The details of these features are listed in <xref ref-type="supplementary-material" rid="TS7">Supplementary Table 7</xref>.</p>
<p>The feature selection and the SVM hyper-parameter searching were conducted by using the scikit-learn (version: 0.21.3) package in Python. In detail, all the possible feature combinations with at least 5 features were enumerated, and the hyper-parameters C and &#x03B3; were enumerated from 0.001, 0.01, 0.1, 1, and 100. For each combination of features and hyper-parameters, fivefold cross-validation was run on the training dataset containing the positive and negative samples. When the maximum cross-validation MCC was reached, the optimal feature combination and optimal hyper-parameters were obtained accordingly. The resulted optimal hyper-parameters and features were then utilized to train the RPS19-SVM predictor on all the training data. The predicted results of PMut (<xref ref-type="bibr" rid="B52">Lopez-Ferrando et al., 2017</xref>), MutPred2 (<xref ref-type="bibr" rid="B57">Pejaver et al., 2017</xref>), PolyPhen2 (<xref ref-type="bibr" rid="B1">Adzhubei et al., 2010</xref>), and SIFT (<xref ref-type="bibr" rid="B70">Sim et al., 2012</xref>) were obtained by submitting the mutations to their web servers with default settings.</p>
</sec>
</sec>
<sec id="S5">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="supplementary-material" rid="FS1">Supplementary Material</xref>, further inquiries can be directed to the corresponding author/s.</p>
</sec>
<sec id="S6">
<title>Author Contributions</title>
<p>KA, Z-QY, and Y-DW conceived the study. KA, J-BZ, YX, and Z-QY collected the data and performed the computational analysis. KA and Z-QY initially drafted the manuscript. Z-QY, Y-DW, WH, and TW revised and proved the manuscript. Z-QY and Y-DW supervised the whole work. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> This research was funded by the National Natural Science Foundation of China (32070664, 21933004, and 31471243), the Key-Area Research and Development Program of Guangdong Province (2020B0101350001), the Shenzhen Science and Technology Innovation Commission (JCYJ20170818085409785 and JCYJ20170412150507046), and the Shenzhen Municipal Health Commission (SZSM201809085).</p>
</fn>
</fn-group>
<ack>
<p>We would like to thank Drs. Xinhao Zhang, Fan Jiang, Yongjuan Zhao, and Haipeng Bai for helpful suggestions, and thank Dr. Wei Kang and Ms. Jiajie Feng for the help in MD simulations.</p>
</ack>
<sec id="S9" sec-type="supplementary material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fgene.2021.650897/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fgene.2021.650897/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Image_1.PDF" id="FS1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image_2.PDF" id="FS2" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_1.DOCX" id="TS1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_2.XLSX" id="TS2" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_3.XLSX" id="TS3" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_4.XLSX" id="TS4" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_5.XLSX" id="TS5" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_6.DOCX" id="TS6" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_7.DOCX" id="TS7" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_8.XLSX" id="TS8" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_9.xlsx" id="TS9" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Adzhubei</surname> <given-names>I. A.</given-names></name> <name><surname>Schmidt</surname> <given-names>S.</given-names></name> <name><surname>Peshkin</surname> <given-names>L.</given-names></name> <name><surname>Ramensky</surname> <given-names>V. E.</given-names></name> <name><surname>Gerasimova</surname> <given-names>A.</given-names></name> <name><surname>Bork</surname> <given-names>P.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>A method and server for predicting damaging missense mutations.</article-title> <source><italic>Nat. Methods</italic></source> <volume>7</volume> <fpage>248</fpage>&#x2013;<lpage>249</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth0410-248</pub-id> <pub-id pub-id-type="pmid">20354512</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alexov</surname> <given-names>E.</given-names></name> <name><surname>Sternberg</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <article-title>Understanding molecular effects of naturally occurring genetic differences.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>425</volume> <fpage>3911</fpage>&#x2013;<lpage>3913</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2013.08.013</pub-id> <pub-id pub-id-type="pmid">23968859</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ameismeier</surname> <given-names>M.</given-names></name> <name><surname>Cheng</surname> <given-names>J.</given-names></name> <name><surname>Berninghausen</surname> <given-names>O.</given-names></name> <name><surname>Beckmann</surname> <given-names>R.</given-names></name></person-group> (<year>2018</year>). <article-title>Visualizing late states of human 40S ribosomal subunit maturation.</article-title> <source><italic>Nature</italic></source> <volume>558</volume> <fpage>249</fpage>&#x2013;<lpage>253</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-018-0193-0</pub-id> <pub-id pub-id-type="pmid">29875412</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Amsterdam</surname> <given-names>A.</given-names></name> <name><surname>Sadler</surname> <given-names>K. C.</given-names></name> <name><surname>Lai</surname> <given-names>K.</given-names></name> <name><surname>Farrington</surname> <given-names>S.</given-names></name> <name><surname>Bronson</surname> <given-names>R. T.</given-names></name> <name><surname>Lees</surname> <given-names>J. A.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>Many ribosomal protein genes are cancer genes in zebrafish.</article-title> <source><italic>PLoS Biol.</italic></source> <volume>2</volume>:<issue>E139</issue>. <pub-id pub-id-type="doi">10.1371/journal.pbio.0020139</pub-id> <pub-id pub-id-type="pmid">15138505</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Andersen</surname> <given-names>L. L.</given-names></name> <name><surname>Terczynska-Dyla</surname> <given-names>E.</given-names></name> <name><surname>Mork</surname> <given-names>N.</given-names></name> <name><surname>Scavenius</surname> <given-names>C.</given-names></name> <name><surname>Enghild</surname> <given-names>J. J.</given-names></name> <name><surname>Honing</surname> <given-names>K.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Frequently used bioinformatics tools overestimate the damaging effect of allelic variants.</article-title> <source><italic>Genes. Immun.</italic></source> <volume>20</volume> <fpage>10</fpage>&#x2013;<lpage>22</lpage>. <pub-id pub-id-type="doi">10.1038/s41435-017-0002-z</pub-id> <pub-id pub-id-type="pmid">29217828</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Angelini</surname> <given-names>M.</given-names></name> <name><surname>Cannata</surname> <given-names>S.</given-names></name> <name><surname>Mercaldo</surname> <given-names>V.</given-names></name> <name><surname>Gibello</surname> <given-names>L.</given-names></name> <name><surname>Santoro</surname> <given-names>C.</given-names></name> <name><surname>Dianzani</surname> <given-names>I.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Missense mutations associated with Diamond-Blackfan anemia affect the assembly of ribosomal protein S19 into the ribosome.</article-title> <source><italic>Hum. Mol. Genet.</italic></source> <volume>16</volume> <fpage>1720</fpage>&#x2013;<lpage>1727</lpage>. <pub-id pub-id-type="doi">10.1093/hmg/ddm120</pub-id> <pub-id pub-id-type="pmid">17517689</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anger</surname> <given-names>A. M.</given-names></name> <name><surname>Armache</surname> <given-names>J. P.</given-names></name> <name><surname>Berninghausen</surname> <given-names>O.</given-names></name> <name><surname>Habeck</surname> <given-names>M.</given-names></name> <name><surname>Subklewe</surname> <given-names>M.</given-names></name> <name><surname>Wilson</surname> <given-names>D. N.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Structures of the human and <italic>Drosophila</italic> 80S ribosome.</article-title> <source><italic>Nature</italic></source> <volume>497</volume> <fpage>80</fpage>&#x2013;<lpage>85</lpage>. <pub-id pub-id-type="doi">10.1038/nature12104</pub-id> <pub-id pub-id-type="pmid">23636399</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashkenazy</surname> <given-names>H.</given-names></name> <name><surname>Abadi</surname> <given-names>S.</given-names></name> <name><surname>Martz</surname> <given-names>E.</given-names></name> <name><surname>Chay</surname> <given-names>O.</given-names></name> <name><surname>Mayrose</surname> <given-names>I.</given-names></name> <name><surname>Pupko</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>44</volume> <fpage>W344</fpage>&#x2013;<lpage>W350</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw408</pub-id> <pub-id pub-id-type="pmid">27166375</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aspesi</surname> <given-names>A.</given-names></name> <name><surname>Betti</surname> <given-names>M.</given-names></name> <name><surname>Sculco</surname> <given-names>M.</given-names></name> <name><surname>Actis</surname> <given-names>C.</given-names></name> <name><surname>Olgasi</surname> <given-names>C.</given-names></name> <name><surname>Wlodarski</surname> <given-names>M. W.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>A functional assay for the clinical annotation of genetic variants of uncertain significance in diamond-blackfan anemia.</article-title> <source><italic>Hum. Mutat.</italic></source> <volume>39</volume> <fpage>1102</fpage>&#x2013;<lpage>1111</lpage>. <pub-id pub-id-type="doi">10.1002/humu.23551</pub-id> <pub-id pub-id-type="pmid">29766597</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Aspesi</surname> <given-names>A.</given-names></name> <name><surname>Ellis</surname> <given-names>S. R.</given-names></name></person-group> (<year>2019</year>). <article-title>Rare ribosomopathies: insights into mechanisms of cancer.</article-title> <source><italic>Nat. Rev. Cancer</italic></source> <volume>19</volume> <fpage>228</fpage>&#x2013;<lpage>238</lpage>. <pub-id pub-id-type="doi">10.1038/s41568-019-0105-0</pub-id> <pub-id pub-id-type="pmid">30670820</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baker</surname> <given-names>N. A.</given-names></name> <name><surname>Sept</surname> <given-names>D.</given-names></name> <name><surname>Joseph</surname> <given-names>S.</given-names></name> <name><surname>Holst</surname> <given-names>M. J.</given-names></name> <name><surname>McCammon</surname> <given-names>J. A.</given-names></name></person-group> (<year>2001</year>). <article-title>Electrostatics of nanosystems: application to microtubules and the ribosome.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>98</volume> <fpage>10037</fpage>&#x2013;<lpage>10041</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.181342398</pub-id> <pub-id pub-id-type="pmid">11517324</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ball</surname> <given-names>S.</given-names></name></person-group> (<year>2011</year>). <article-title>Diamond Blackfan anemia.</article-title> <source><italic>Hematol. Am. Soc. Hematol. Educ. Program.</italic></source> <volume>2011</volume> <fpage>487</fpage>&#x2013;<lpage>491</lpage>. <pub-id pub-id-type="doi">10.1182/asheducation-2011.1.487</pub-id> <pub-id pub-id-type="pmid">22160079</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Behrmann</surname> <given-names>E.</given-names></name> <name><surname>Loerke</surname> <given-names>J.</given-names></name> <name><surname>Budkevich</surname> <given-names>T. V.</given-names></name> <name><surname>Yamamoto</surname> <given-names>K.</given-names></name> <name><surname>Schmidt</surname> <given-names>A.</given-names></name> <name><surname>Penczek</surname> <given-names>P. A.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Structural snapshots of actively translating human ribosomes.</article-title> <source><italic>Cell</italic></source> <volume>161</volume> <fpage>845</fpage>&#x2013;<lpage>857</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2015.03.052</pub-id> <pub-id pub-id-type="pmid">25957688</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boria</surname> <given-names>I.</given-names></name> <name><surname>Garelli</surname> <given-names>E.</given-names></name> <name><surname>Gazda</surname> <given-names>H. T.</given-names></name> <name><surname>Aspesi</surname> <given-names>A.</given-names></name> <name><surname>Quarello</surname> <given-names>P.</given-names></name> <name><surname>Pavesi</surname> <given-names>E.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>The ribosomal basis of diamond-blackfan anemia: mutation and database update.</article-title> <source><italic>Hum. Mutat.</italic></source> <volume>31</volume> <fpage>1269</fpage>&#x2013;<lpage>1279</lpage>. <pub-id pub-id-type="doi">10.1002/humu.21383</pub-id> <pub-id pub-id-type="pmid">20960466</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Buss</surname> <given-names>O.</given-names></name> <name><surname>Rudat</surname> <given-names>J.</given-names></name> <name><surname>Ochsenreither</surname> <given-names>K.</given-names></name></person-group> (<year>2018</year>). <article-title>FoldX as protein engineering tool: better than random based approaches?</article-title> <source><italic>Comput. Struct. Biotechnol. J.</italic></source> <volume>16</volume> <fpage>25</fpage>&#x2013;<lpage>33</lpage>. <pub-id pub-id-type="doi">10.1016/j.csbj.2018.01.002</pub-id> <pub-id pub-id-type="pmid">30275935</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Campagnoli</surname> <given-names>M. F.</given-names></name> <name><surname>Ramenghi</surname> <given-names>U.</given-names></name> <name><surname>Armiraglio</surname> <given-names>M.</given-names></name> <name><surname>Quarello</surname> <given-names>P.</given-names></name> <name><surname>Garelli</surname> <given-names>E.</given-names></name> <name><surname>Carando</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>RPS19 mutations in patients with Diamond-Blackfan anemia.</article-title> <source><italic>Hum. Mutat.</italic></source> <volume>29</volume> <fpage>911</fpage>&#x2013;<lpage>920</lpage>. <pub-id pub-id-type="doi">10.1002/humu.20752</pub-id> <pub-id pub-id-type="pmid">18412286</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Case</surname> <given-names>D. A.</given-names></name> <name><surname>Betz</surname> <given-names>R.</given-names></name> <name><surname>Cerutti</surname> <given-names>D.</given-names></name> <name><surname>Cheatham</surname> <given-names>T.</given-names></name> <name><surname>Darden</surname> <given-names>T.</given-names></name> <name><surname>Duke</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2016</year>). <source><italic>AMBER 2016.</italic></source> <publisher-loc>San Francisco, CA</publisher-loc>: <publisher-name>University of California</publisher-name>, <fpage>1</fpage>&#x2013;<lpage>923</lpage>.</citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Choesmel</surname> <given-names>V.</given-names></name> <name><surname>Fribourg</surname> <given-names>S.</given-names></name> <name><surname>Aguissa-Toure</surname> <given-names>A. H.</given-names></name> <name><surname>Pinaud</surname> <given-names>N.</given-names></name> <name><surname>Legrand</surname> <given-names>P.</given-names></name> <name><surname>Gazda</surname> <given-names>H. T.</given-names></name><etal/></person-group> (<year>2008</year>). <article-title>Mutation of ribosomal protein RPS24 in Diamond-Blackfan anemia results in a ribosome biogenesis disorder.</article-title> <source><italic>Hum. Mol. Genet.</italic></source> <volume>17</volume>, <fpage>1253</fpage>&#x2013;<lpage>1263</lpage>. <pub-id pub-id-type="doi">10.1093/hmg/ddn015</pub-id> <pub-id pub-id-type="pmid">18230666</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cmejla</surname> <given-names>R.</given-names></name> <name><surname>Blafkova</surname> <given-names>J.</given-names></name> <name><surname>Stopka</surname> <given-names>T.</given-names></name> <name><surname>Zavadil</surname> <given-names>J.</given-names></name> <name><surname>Pospisilova</surname> <given-names>D.</given-names></name> <name><surname>Mihal</surname> <given-names>V.</given-names></name><etal/></person-group> (<year>2000</year>). <article-title>Ribosomal protein S19 gene mutations in patients with diamond-blackfan anemia and identification of ribosomal protein S19 pseudogenes.</article-title> <source><italic>Blood Cells Mol. Dis.</italic></source> <volume>26</volume> <fpage>124</fpage>&#x2013;<lpage>132</lpage>. <pub-id pub-id-type="doi">10.1006/bcmd.2000.0286</pub-id> <pub-id pub-id-type="pmid">10753603</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cmejla</surname> <given-names>R.</given-names></name> <name><surname>Cmejlova</surname> <given-names>J.</given-names></name></person-group> (<year>2003</year>). <article-title>Molecular basis of Diamond-Blackfan anaemia: what have we learnt so far?</article-title> <source><italic>Rev. Article. Sbornik Lekarsky</italic></source> <volume>104</volume> <fpage>171</fpage>&#x2013;<lpage>181</lpage>.</citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Da Costa</surname> <given-names>L.</given-names></name> <name><surname>O&#x2019;Donohue</surname> <given-names>M. F.</given-names></name> <name><surname>van Dooijeweert</surname> <given-names>B.</given-names></name> <name><surname>Albrecht</surname> <given-names>K.</given-names></name> <name><surname>Unal</surname> <given-names>S.</given-names></name> <name><surname>Ramenghi</surname> <given-names>U.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Molecular approaches to diagnose Diamond-Blackfan anemia: the EuroDBA experience.</article-title> <source><italic>Eur. J. Med. Genet.</italic></source> <volume>61</volume> <fpage>664</fpage>&#x2013;<lpage>673</lpage>. <pub-id pub-id-type="doi">10.1016/j.ejmg.2017.10.017</pub-id> <pub-id pub-id-type="pmid">29081386</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Da Costa</surname> <given-names>L.</given-names></name> <name><surname>Tchernia</surname> <given-names>G.</given-names></name> <name><surname>Gascard</surname> <given-names>P.</given-names></name> <name><surname>Lo</surname> <given-names>A.</given-names></name> <name><surname>Meerpohl</surname> <given-names>J.</given-names></name> <name><surname>Niemeyer</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2003</year>). <article-title>Nucleolar localization of RPS19 protein in normal cells and mislocalization due to mutations in the nucleolar localization signals in 2 Diamond-Blackfan anemia patients: potential insights into pathophysiology.</article-title> <source><italic>Blood</italic></source> <volume>101</volume> <fpage>5039</fpage>&#x2013;<lpage>5045</lpage>. <pub-id pub-id-type="doi">10.1182/blood-2002-12-3878</pub-id> <pub-id pub-id-type="pmid">12586610</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Diamond</surname> <given-names>L. K.</given-names></name></person-group> (<year>1938</year>). <article-title>Hypoplastic anemia.</article-title> <source><italic>Am. J. Dis. Child</italic></source> <volume>56</volume> <fpage>464</fpage>&#x2013;<lpage>467</lpage>.</citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dolinsky</surname> <given-names>T. J.</given-names></name> <name><surname>Nielsen</surname> <given-names>J. E.</given-names></name> <name><surname>McCammon</surname> <given-names>J. A.</given-names></name> <name><surname>Baker</surname> <given-names>N. A.</given-names></name></person-group> (<year>2004</year>). <article-title>PDB2PQR: an automated pipeline for the setup of poisson&#x2013;boltzmann electrostatics calculations.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>32(suppl_2)</volume> <fpage>W665</fpage>&#x2013;<lpage>W667</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkh381</pub-id> <pub-id pub-id-type="pmid">15215472</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Duss</surname> <given-names>O.</given-names></name> <name><surname>Stepanyuk</surname> <given-names>G. A.</given-names></name> <name><surname>Puglisi</surname> <given-names>J. D.</given-names></name> <name><surname>Williamson</surname> <given-names>J. R.</given-names></name></person-group> (<year>2019</year>). <article-title>Transient protein-RNA interactions guide nascent ribosomal RNA folding.</article-title> <source><italic>Cell</italic></source> <volume>179</volume> <fpage>1357</fpage>&#x2013;<lpage>1369 e1316</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2019.10.035</pub-id> <pub-id pub-id-type="pmid">31761533</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Engidaye</surname> <given-names>G.</given-names></name> <name><surname>Melku</surname> <given-names>M.</given-names></name> <name><surname>Enawgaw</surname> <given-names>B.</given-names></name></person-group> (<year>2019</year>). <article-title>Diamond Blackfan Anemia: genetics, pathogenesis, diagnosis and treatment.</article-title> <source><italic>EJIFCC</italic></source> <volume>30</volume> <fpage>67</fpage>&#x2013;<lpage>81</lpage>.</citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Farley-Barnes</surname> <given-names>K. I.</given-names></name> <name><surname>Ogawa</surname> <given-names>L. M.</given-names></name> <name><surname>Baserga</surname> <given-names>S. J.</given-names></name></person-group> (<year>2019</year>). <article-title>Ribosomopathies: old concepts.</article-title> <source><italic>New Controversies. Trends Genet.</italic></source> <volume>35</volume> <fpage>754</fpage>&#x2013;<lpage>767</lpage>. <pub-id pub-id-type="doi">10.1016/j.tig.2019.07.004</pub-id> <pub-id pub-id-type="pmid">31376929</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Flygare</surname> <given-names>J.</given-names></name> <name><surname>Aspesi</surname> <given-names>A.</given-names></name> <name><surname>Bailey</surname> <given-names>J. C.</given-names></name> <name><surname>Miyake</surname> <given-names>K.</given-names></name> <name><surname>Caffrey</surname> <given-names>J. M.</given-names></name> <name><surname>Karlsson</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Human RPS19, the gene mutated in Diamond-Blackfan anemia, encodes a ribosomal protein required for the maturation of 40S ribosomal subunits.</article-title> <source><italic>Blood</italic></source> <volume>109</volume> <fpage>980</fpage>&#x2013;<lpage>986</lpage>. <pub-id pub-id-type="doi">10.1182/blood-2006-07-038232</pub-id> <pub-id pub-id-type="pmid">16990592</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gazda</surname> <given-names>H. T.</given-names></name> <name><surname>Zhong</surname> <given-names>R.</given-names></name> <name><surname>Long</surname> <given-names>L.</given-names></name> <name><surname>Niewiadomska</surname> <given-names>E.</given-names></name> <name><surname>Lipton</surname> <given-names>J. M.</given-names></name> <name><surname>Ploszynska</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>RNA and protein evidence for haplo-insufficiency in Diamond-Blackfan anaemia patients with RPS19 mutations.</article-title> <source><italic>Br. J. Haematol.</italic></source> <volume>127</volume> <fpage>105</fpage>&#x2013;<lpage>113</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-2141.2004.05152.x</pub-id> <pub-id pub-id-type="pmid">15384984</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gregory</surname> <given-names>L. A.</given-names></name> <name><surname>Aguissa-Toure</surname> <given-names>A.-H.</given-names></name> <name><surname>Pinaud</surname> <given-names>N.</given-names></name> <name><surname>Legrand</surname> <given-names>P.</given-names></name> <name><surname>Gleizes</surname> <given-names>P.-E.</given-names></name> <name><surname>Fribourg</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <article-title>Molecular basis of Diamond-Blackfan anemia: structure and function analysis of RPS19.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>35</volume> <fpage>5913</fpage>&#x2013;<lpage>5921</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkm626</pub-id> <pub-id pub-id-type="pmid">17726054</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holland</surname> <given-names>K. D.</given-names></name> <name><surname>Bouley</surname> <given-names>T. M.</given-names></name> <name><surname>Horn</surname> <given-names>P. S.</given-names></name></person-group> (<year>2017</year>). <article-title>Comparison and optimization of in silico algorithms for predicting the pathogenicity of sodium channel variants in epilepsy.</article-title> <source><italic>Epilepsia</italic></source> <volume>58</volume> <fpage>1190</fpage>&#x2013;<lpage>1198</lpage>. <pub-id pub-id-type="doi">10.1111/epi.13798</pub-id> <pub-id pub-id-type="pmid">28518218</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hussain</surname> <given-names>T.</given-names></name> <name><surname>Llacer</surname> <given-names>J. L.</given-names></name> <name><surname>Fernandez</surname> <given-names>I. S.</given-names></name> <name><surname>Munoz</surname> <given-names>A.</given-names></name> <name><surname>Martin-Marcos</surname> <given-names>P.</given-names></name> <name><surname>Savva</surname> <given-names>C. G.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Structural changes enable start codon recognition by the eukaryotic translation initiation complex.</article-title> <source><italic>Cell</italic></source> <volume>159</volume>, <fpage>597</fpage>&#x2013;<lpage>607</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2014.10.001</pub-id> <pub-id pub-id-type="pmid">25417110</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jorgensen</surname> <given-names>W. L.</given-names></name> <name><surname>Chandrasekhar</surname> <given-names>J.</given-names></name> <name><surname>Madura</surname> <given-names>J. D.</given-names></name> <name><surname>Impey</surname> <given-names>R. W.</given-names></name> <name><surname>Klein</surname> <given-names>M. L.</given-names></name></person-group> (<year>1983</year>). <article-title>Comparison of simple potential functions for simulating liquid water.</article-title> <source><italic>J. Chem. Phys.</italic></source> <volume>79</volume> <fpage>926</fpage>&#x2013;<lpage>935</lpage>.</citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Juli</surname> <given-names>G.</given-names></name> <name><surname>Gismondi</surname> <given-names>A.</given-names></name> <name><surname>Monteleone</surname> <given-names>V.</given-names></name> <name><surname>Caldarola</surname> <given-names>S.</given-names></name> <name><surname>Iadevaia</surname> <given-names>V.</given-names></name> <name><surname>Aspesi</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Depletion of ribosomal protein S19 causes a reduction of rRNA synthesis.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>6</volume>:<issue>35026</issue>. <pub-id pub-id-type="doi">10.1038/srep35026</pub-id> <pub-id pub-id-type="pmid">27734913</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kabsch</surname> <given-names>W.</given-names></name> <name><surname>Sander</surname> <given-names>C.</given-names></name></person-group> (<year>1983</year>). <article-title>Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features.</article-title> <source><italic>Biopolymers</italic></source> <volume>22</volume> <fpage>2577</fpage>&#x2013;<lpage>2637</lpage>. <pub-id pub-id-type="doi">10.1002/bip.360221211</pub-id> <pub-id pub-id-type="pmid">6667333</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kang</surname> <given-names>W.</given-names></name> <name><surname>Jiang</surname> <given-names>F.</given-names></name> <name><surname>Wu</surname> <given-names>Y. D.</given-names></name></person-group> (<year>2018</year>). <article-title>Universal implementation of a residue-specific force field based on CMAP potentials and free energy decomposition.</article-title> <source><italic>J. Chem. Theory Comput.</italic></source> <volume>14</volume> <fpage>4474</fpage>&#x2013;<lpage>4486</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jctc.8b00285</pub-id> <pub-id pub-id-type="pmid">29906395</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Karczewski</surname> <given-names>K. J.</given-names></name> <name><surname>Francioli</surname> <given-names>L. C.</given-names></name> <name><surname>Tiao</surname> <given-names>G.</given-names></name> <name><surname>Cummings</surname> <given-names>B. B.</given-names></name> <name><surname>Alfoldi</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>Q.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>The mutational constraint spectrum quantified from variation in 141,456 humans.</article-title> <source><italic>Nature</italic></source> <volume>581</volume>, <fpage>434</fpage>&#x2013;<lpage>443</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-020-2308-7</pub-id> <pub-id pub-id-type="pmid">32461654</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Karczewski</surname> <given-names>K. J.</given-names></name> <name><surname>Weisburd</surname> <given-names>B.</given-names></name> <name><surname>Thomas</surname> <given-names>B.</given-names></name> <name><surname>Solomonson</surname> <given-names>M.</given-names></name> <name><surname>Ruderfer</surname> <given-names>D. M.</given-names></name> <name><surname>Kavanagh</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>The ExAC browser: displaying reference data information from over 60 000 exomes.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>45</volume> <fpage>D840</fpage>&#x2013;<lpage>D845</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw971</pub-id> <pub-id pub-id-type="pmid">27899611</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kato</surname> <given-names>S.</given-names></name> <name><surname>Han</surname> <given-names>S. Y.</given-names></name> <name><surname>Liu</surname> <given-names>W.</given-names></name> <name><surname>Otsuka</surname> <given-names>K.</given-names></name> <name><surname>Shibata</surname> <given-names>H.</given-names></name> <name><surname>Kanamaru</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2003</year>). <article-title>Understanding the function-structure and function-mutation relationships of p53 tumor suppressor protein by high-resolution missense mutation analysis.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>100</volume> <fpage>8424</fpage>&#x2013;<lpage>8429</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1431692100</pub-id> <pub-id pub-id-type="pmid">12826609</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khajuria</surname> <given-names>R. K.</given-names></name> <name><surname>Munschauer</surname> <given-names>M.</given-names></name> <name><surname>Ulirsch</surname> <given-names>J. C.</given-names></name> <name><surname>Fiorini</surname> <given-names>C.</given-names></name> <name><surname>Ludwig</surname> <given-names>L. S.</given-names></name> <name><surname>McFarland</surname> <given-names>S. K.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Ribosome levels selectively regulate translation and lineage commitment in human hematopoiesis.</article-title> <source><italic>Cell</italic></source> <volume>173</volume> <fpage>90</fpage>&#x2013;<lpage>103.e19</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2018.02.036</pub-id> <pub-id pub-id-type="pmid">29551269</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khatter</surname> <given-names>H.</given-names></name> <name><surname>Myasnikov</surname> <given-names>A. G.</given-names></name> <name><surname>Natchiar</surname> <given-names>S. K.</given-names></name> <name><surname>Klaholz</surname> <given-names>B. P.</given-names></name></person-group> (<year>2015</year>). <article-title>Structure of the human 80S ribosome.</article-title> <source><italic>Nature</italic></source> <volume>520</volume> <fpage>640</fpage>&#x2013;<lpage>645</lpage>. <pub-id pub-id-type="doi">10.1038/nature14427</pub-id> <pub-id pub-id-type="pmid">25901680</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klinge</surname> <given-names>S.</given-names></name> <name><surname>Woolford</surname> <given-names>J. L.</given-names> <suffix>Jr.</suffix></name></person-group> (<year>2019</year>). <article-title>Ribosome assembly coming into focus.</article-title> <source><italic>Nat. Rev. Mol. Cell Biol.</italic></source> <volume>20</volume> <fpage>116</fpage>&#x2013;<lpage>131</lpage>. <pub-id pub-id-type="doi">10.1038/s41580-018-0078-y</pub-id> <pub-id pub-id-type="pmid">30467428</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krissinel</surname> <given-names>E.</given-names></name> <name><surname>Henrick</surname> <given-names>K.</given-names></name></person-group> (<year>2007</year>). <article-title>Inference of macromolecular assemblies from crystalline state.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>372</volume> <fpage>774</fpage>&#x2013;<lpage>797</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2007.05.022</pub-id> <pub-id pub-id-type="pmid">17681537</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname> <given-names>S.</given-names></name> <name><surname>Suleski</surname> <given-names>M. P.</given-names></name> <name><surname>Markov</surname> <given-names>G. J.</given-names></name> <name><surname>Lawrence</surname> <given-names>S.</given-names></name> <name><surname>Marco</surname> <given-names>A.</given-names></name> <name><surname>Filipski</surname> <given-names>A. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Positional conservation and amino acids shape the correct diagnosis and population frequencies of benign and damaging personal amino acid mutations.</article-title> <source><italic>Genome. Res.</italic></source> <volume>19</volume> <fpage>1562</fpage>&#x2013;<lpage>1569</lpage>. <pub-id pub-id-type="doi">10.1101/gr.091991.109</pub-id> <pub-id pub-id-type="pmid">19546171</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lal</surname> <given-names>D.</given-names></name> <name><surname>May</surname> <given-names>P.</given-names></name> <name><surname>Perez-Palma</surname> <given-names>E.</given-names></name> <name><surname>Samocha</surname> <given-names>K. E.</given-names></name> <name><surname>Kosmicki</surname> <given-names>J. A.</given-names></name> <name><surname>Robinson</surname> <given-names>E. B.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Gene family information facilitates variant interpretation and identification of disease-associated genes in neurodevelopmental disorders.</article-title> <source><italic>Genome. Med.</italic></source> <volume>12</volume>:<issue>28</issue>. <pub-id pub-id-type="doi">10.1186/s13073-020-00725-6</pub-id> <pub-id pub-id-type="pmid">32183904</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Landrum</surname> <given-names>M. J.</given-names></name> <name><surname>Lee</surname> <given-names>J. M.</given-names></name> <name><surname>Benson</surname> <given-names>M.</given-names></name> <name><surname>Brown</surname> <given-names>G.</given-names></name> <name><surname>Chao</surname> <given-names>C.</given-names></name> <name><surname>Chitipiralla</surname> <given-names>S.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>ClinVar: public archive of interpretations of clinically relevant variants.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>44</volume> <fpage>D862</fpage>&#x2013;<lpage>D868</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkv1222</pub-id> <pub-id pub-id-type="pmid">26582918</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leger-Silvestre</surname> <given-names>I.</given-names></name> <name><surname>Caffrey</surname> <given-names>J. M.</given-names></name> <name><surname>Dawaliby</surname> <given-names>R.</given-names></name> <name><surname>Alvarez-Arias</surname> <given-names>D. A.</given-names></name> <name><surname>Gas</surname> <given-names>N.</given-names></name> <name><surname>Bertolone</surname> <given-names>S. J.</given-names></name><etal/></person-group> (<year>2005</year>). <article-title>Specific role for yeast homologs of the diamond blackfan anemia-associated Rps19 Protein in ribosome synthesis.</article-title> <source><italic>J. Biol. Chem.</italic></source> <volume>280</volume> <fpage>38177</fpage>&#x2013;<lpage>38185</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M506916200</pub-id> <pub-id pub-id-type="pmid">16159874</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lek</surname> <given-names>M.</given-names></name> <name><surname>Karczewski</surname> <given-names>K. J.</given-names></name> <name><surname>Minikel</surname> <given-names>E. V.</given-names></name> <name><surname>Samocha</surname> <given-names>K. E.</given-names></name> <name><surname>Banks</surname> <given-names>E.</given-names></name> <name><surname>Fennell</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Analysis of protein-coding genetic variation in 60,706 humans.</article-title> <source><italic>Nature</italic></source> <volume>536</volume>, <fpage>285</fpage>&#x2013;<lpage>291</lpage>. <pub-id pub-id-type="doi">10.1038/nature19057</pub-id> <pub-id pub-id-type="pmid">27535533</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levitt</surname> <given-names>M.</given-names></name></person-group> (<year>1978</year>). <article-title>Conformational preferences of amino acids in globular proteins.</article-title> <source><italic>Biochemistry</italic></source> <volume>17</volume> <fpage>4277</fpage>&#x2013;<lpage>4285</lpage>. <pub-id pub-id-type="doi">10.1021/bi00613a026</pub-id> <pub-id pub-id-type="pmid">708713</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>R.</given-names></name> <name><surname>Baase</surname> <given-names>W. A.</given-names></name> <name><surname>Matthews</surname> <given-names>B. W.</given-names></name></person-group> (<year>2000</year>). <article-title>The introduction of strain and its effects on the structure and stability of T4 lysozyme.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>295</volume> <fpage>127</fpage>&#x2013;<lpage>145</lpage>. <pub-id pub-id-type="doi">10.1006/jmbi.1999.3300</pub-id> <pub-id pub-id-type="pmid">10623513</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Loladze</surname> <given-names>V. V.</given-names></name> <name><surname>Ermolenko</surname> <given-names>D. N.</given-names></name> <name><surname>Makhatadze</surname> <given-names>G. I.</given-names></name></person-group> (<year>2002</year>). <article-title>Thermodynamic consequences of burial of polar and non-polar amino acid residues in the protein interior.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>320</volume> <fpage>343</fpage>&#x2013;<lpage>357</lpage>. <pub-id pub-id-type="doi">10.1016/S0022-2836(02)00465-5</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lopez-Ferrando</surname> <given-names>V.</given-names></name> <name><surname>Gazzo</surname> <given-names>A.</given-names></name> <name><surname>de la Cruz</surname> <given-names>X.</given-names></name> <name><surname>Orozco</surname> <given-names>M.</given-names></name> <name><surname>Gelpi</surname> <given-names>J. L.</given-names></name></person-group> (<year>2017</year>). <article-title>PMut: a web-based tool for the annotation of pathological variants on proteins, 2017 update.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>45</volume> <fpage>W222</fpage>&#x2013;<lpage>W228</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkx313</pub-id> <pub-id pub-id-type="pmid">28453649</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matsson</surname> <given-names>H.</given-names></name> <name><surname>Davey</surname> <given-names>E. J.</given-names></name> <name><surname>Draptchinskaia</surname> <given-names>N.</given-names></name> <name><surname>Hamaguchi</surname> <given-names>I.</given-names></name> <name><surname>Ooka</surname> <given-names>A.</given-names></name> <name><surname>Leveen</surname> <given-names>P.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>Targeted disruption of the ribosomal protein S19 gene is lethal prior to implantation.</article-title> <source><italic>Mol. Cell Biol</italic>.</source> <volume>24</volume>, <fpage>4032</fpage>&#x2013;<lpage>4037</lpage>. <pub-id pub-id-type="doi">10.1128/mcb.24.9.4032-4037.2004</pub-id> <pub-id pub-id-type="pmid">15082795</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Myasnikov</surname> <given-names>A. G.</given-names></name> <name><surname>Kundhavai Natchiar</surname> <given-names>S.</given-names></name> <name><surname>Nebout</surname> <given-names>M.</given-names></name> <name><surname>Hazemann</surname> <given-names>I.</given-names></name> <name><surname>Imbert</surname> <given-names>V.</given-names></name> <name><surname>Khatter</surname> <given-names>H.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Structure-function insights reveal the human ribosome as a cancer target for antibiotics.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>7</volume>:<issue>12856</issue>. <pub-id pub-id-type="doi">10.1038/ncomms12856</pub-id> <pub-id pub-id-type="pmid">27665925</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Natchiar</surname> <given-names>S. K.</given-names></name> <name><surname>Myasnikov</surname> <given-names>A. G.</given-names></name> <name><surname>Kratzat</surname> <given-names>H.</given-names></name> <name><surname>Hazemann</surname> <given-names>I.</given-names></name> <name><surname>Klaholz</surname> <given-names>B. P.</given-names></name></person-group> (<year>2017</year>). <article-title>Visualization of chemical modifications in the human 80S ribosome structure.</article-title> <source><italic>Nature</italic></source> <volume>551</volume> <fpage>472</fpage>&#x2013;<lpage>477</lpage>. <pub-id pub-id-type="doi">10.1038/nature24482</pub-id> <pub-id pub-id-type="pmid">29143818</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>O&#x2019;Leary</surname> <given-names>N. A.</given-names></name> <name><surname>Wright</surname> <given-names>M. W.</given-names></name> <name><surname>Brister</surname> <given-names>J. R.</given-names></name> <name><surname>Ciufo</surname> <given-names>S.</given-names></name> <name><surname>Haddad</surname> <given-names>D.</given-names></name> <name><surname>McVeigh</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>44</volume>, <fpage>D733</fpage>&#x2013;<lpage>D745</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkv1189</pub-id> <pub-id pub-id-type="pmid">26553804</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pejaver</surname> <given-names>V.</given-names></name> <name><surname>Urresti</surname> <given-names>J.</given-names></name> <name><surname>Lugo-Martinez</surname> <given-names>J.</given-names></name> <name><surname>Pagel</surname> <given-names>K. A.</given-names></name> <name><surname>Lin</surname> <given-names>G. N.</given-names></name> <name><surname>Nam</surname> <given-names>H.-J.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>MutPred2: inferring the molecular and phenotypic impact of amino acid variants.</article-title> <source><italic>BioRxiv</italic> [Preprint]</source> <comment>BioRxiv 134981</comment>,</citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peng</surname> <given-names>Y.</given-names></name> <name><surname>Myers</surname> <given-names>R.</given-names></name> <name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>Alexov</surname> <given-names>E.</given-names></name></person-group> (<year>2018</year>). <article-title>Computational investigation of the missense mutations in DHCR7 gene associated with smith-lemli-opitz syndrome.</article-title> <source><italic>Int. J. Mol. Sci.</italic></source> <volume>19</volume>:<issue>141</issue>. <pub-id pub-id-type="doi">10.3390/ijms19010141</pub-id> <pub-id pub-id-type="pmid">29300326</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Petrov</surname> <given-names>A. S.</given-names></name> <name><surname>Bernier</surname> <given-names>C. R.</given-names></name> <name><surname>Gulen</surname> <given-names>B.</given-names></name> <name><surname>Waterbury</surname> <given-names>C. C.</given-names></name> <name><surname>Hershkovits</surname> <given-names>E.</given-names></name> <name><surname>Hsiao</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Secondary structures of rRNAs from all three domains of life.</article-title> <source><italic>PLoS One</italic></source> <volume>9</volume>:<issue>e88222</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0088222</pub-id> <pub-id pub-id-type="pmid">24505437</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pettersen</surname> <given-names>E. F.</given-names></name> <name><surname>Goddard</surname> <given-names>T. D.</given-names></name> <name><surname>Huang</surname> <given-names>C. C.</given-names></name> <name><surname>Couch</surname> <given-names>G. S.</given-names></name> <name><surname>Greenblatt</surname> <given-names>D. M.</given-names></name> <name><surname>Meng</surname> <given-names>E. C.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>UCSF Chimera&#x2013;a visualization system for exploratory research and analysis.</article-title> <source><italic>J. Comput. Chem.</italic></source> <volume>25</volume> <fpage>1605</fpage>&#x2013;<lpage>1612</lpage>. <pub-id pub-id-type="doi">10.1002/jcc.20084</pub-id> <pub-id pub-id-type="pmid">15264254</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Phipps</surname> <given-names>K. R.</given-names></name> <name><surname>Charette</surname> <given-names>J.</given-names></name> <name><surname>Baserga</surname> <given-names>S. J.</given-names></name></person-group> (<year>2011</year>). <article-title>The small subunit processome in ribosome biogenesis-progress and prospects.</article-title> <source><italic>Wiley Interdiscip. Rev. RNA</italic></source> <volume>2</volume> <fpage>1</fpage>&#x2013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1002/wrna.57</pub-id> <pub-id pub-id-type="pmid">21318072</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Potapov</surname> <given-names>V.</given-names></name> <name><surname>Cohen</surname> <given-names>M.</given-names></name> <name><surname>Schreiber</surname> <given-names>G.</given-names></name></person-group> (<year>2009</year>). <article-title>Assessing computational methods for predicting protein stability upon mutation: good on average but not in the details.</article-title> <source><italic>Protein Eng. Des. Sel</italic>.</source> <volume>22</volume>, <fpage>553</fpage>&#x2013;<lpage>560</lpage>. <pub-id pub-id-type="doi">10.1093/protein/gzp030</pub-id> <pub-id pub-id-type="pmid">19561092</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quade</surname> <given-names>N.</given-names></name> <name><surname>Boehringer</surname> <given-names>D.</given-names></name> <name><surname>Leibundgut</surname> <given-names>M.</given-names></name> <name><surname>van den Heuvel</surname> <given-names>J.</given-names></name> <name><surname>Ban</surname> <given-names>N.</given-names></name></person-group> (<year>2015</year>). <article-title>Cryo-EM structure of hepatitis C virus IRES bound to the human ribosome at 3.9-A resolution.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>6</volume>:<issue>7646</issue>. <pub-id pub-id-type="doi">10.1038/ncomms8646</pub-id> <pub-id pub-id-type="pmid">26155016</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Richardson</surname> <given-names>J. S.</given-names></name> <name><surname>Richardson</surname> <given-names>D. C.</given-names></name></person-group> (<year>1988</year>). <article-title>Amino acid preferences for specific locations at the ends of alpha helices.</article-title> <source><italic>Science</italic></source> <volume>240</volume> <fpage>1648</fpage>&#x2013;<lpage>1652</lpage>. <pub-id pub-id-type="doi">10.1126/science.3381086</pub-id> <pub-id pub-id-type="pmid">3381086</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roe</surname> <given-names>D. R.</given-names></name> <name><surname>Cheatham</surname> <given-names>T. E.</given-names> <suffix>III</suffix></name></person-group> (<year>2013</year>). <article-title>PTRAJ and CPPTRAJ: software for processing and analysis of molecular dynamics trajectory data.</article-title> <source><italic>J Chem. Theory Comput.</italic></source> <volume>9</volume> <fpage>3084</fpage>&#x2013;<lpage>3095</lpage>. <pub-id pub-id-type="doi">10.1021/ct400341p</pub-id> <pub-id pub-id-type="pmid">26583988</pub-id></citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rose</surname> <given-names>G. D.</given-names></name> <name><surname>Geselowitz</surname> <given-names>A. R.</given-names></name> <name><surname>Lesser</surname> <given-names>G. J.</given-names></name> <name><surname>Lee</surname> <given-names>R. H.</given-names></name> <name><surname>Zehfus</surname> <given-names>M. H.</given-names></name></person-group> (<year>1985</year>). <article-title>Hydrophobicity of amino acid residues in globular proteins.</article-title> <source><italic>Science</italic></source> <volume>229</volume> <fpage>834</fpage>&#x2013;<lpage>838</lpage>. <pub-id pub-id-type="doi">10.1126/science.4023714</pub-id> <pub-id pub-id-type="pmid">4023714</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rouquette</surname> <given-names>J.</given-names></name> <name><surname>Choesmel</surname> <given-names>V.</given-names></name> <name><surname>Gleizes</surname> <given-names>P. E.</given-names></name></person-group> (<year>2005</year>). <article-title>Nuclear export and cytoplasmic processing of precursors to the 40S ribosomal subunits in mammalian cells.</article-title> <source><italic>EMBO J.</italic></source> <volume>24</volume> <fpage>2862</fpage>&#x2013;<lpage>2872</lpage>. <pub-id pub-id-type="doi">10.1038/sj.emboj.7600752</pub-id> <pub-id pub-id-type="pmid">16037817</pub-id></citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schrodinger</surname> <given-names>L. L. C.</given-names></name></person-group> (<year>2015</year>). <source><italic>The PyMOL Molecular Graphics System, Version 2.3.</italic></source></citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schymkowitz</surname> <given-names>J.</given-names></name> <name><surname>Borg</surname> <given-names>J.</given-names></name> <name><surname>Stricher</surname> <given-names>F.</given-names></name> <name><surname>Nys</surname> <given-names>R.</given-names></name> <name><surname>Rousseau</surname> <given-names>F.</given-names></name> <name><surname>Serrano</surname> <given-names>L.</given-names></name></person-group> (<year>2005</year>). <article-title>The FoldX web server: an online force field.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>33</volume> <fpage>W382</fpage>&#x2013;<lpage>W388</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gki387</pub-id> <pub-id pub-id-type="pmid">15980494</pub-id></citation></ref>
<ref id="B70"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sim</surname> <given-names>N. L.</given-names></name> <name><surname>Kumar</surname> <given-names>P.</given-names></name> <name><surname>Hu</surname> <given-names>J.</given-names></name> <name><surname>Henikoff</surname> <given-names>S.</given-names></name> <name><surname>Schneider</surname> <given-names>G.</given-names></name> <name><surname>Ng</surname> <given-names>P. C.</given-names></name></person-group> (<year>2012</year>). <article-title>SIFT web server: predicting effects of amino acid substitutions on proteins.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>40</volume> <fpage>W452</fpage>&#x2013;<lpage>W457</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gks539</pub-id> <pub-id pub-id-type="pmid">22689647</pub-id></citation></ref>
<ref id="B71"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stefl</surname> <given-names>S.</given-names></name> <name><surname>Nishi</surname> <given-names>H.</given-names></name> <name><surname>Petukh</surname> <given-names>M.</given-names></name> <name><surname>Panchenko</surname> <given-names>A. R.</given-names></name> <name><surname>Alexov</surname> <given-names>E.</given-names></name></person-group> (<year>2013</year>). <article-title>Molecular mechanisms of disease-causing missense mutations.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>425</volume> <fpage>3919</fpage>&#x2013;<lpage>3936</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2013.07.014</pub-id> <pub-id pub-id-type="pmid">23871686</pub-id></citation></ref>
<ref id="B72"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tien</surname> <given-names>M. Z.</given-names></name> <name><surname>Meyer</surname> <given-names>A. G.</given-names></name> <name><surname>Sydykova</surname> <given-names>D. K.</given-names></name> <name><surname>Spielman</surname> <given-names>S. J.</given-names></name> <name><surname>Wilke</surname> <given-names>C. O.</given-names></name></person-group> (<year>2013</year>). <article-title>Maximum allowed solvent accessibilites of residues in proteins.</article-title> <source><italic>PLoS One</italic></source> <volume>8</volume>:<issue>e80635</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0080635</pub-id> <pub-id pub-id-type="pmid">24278298</pub-id></citation></ref>
<ref id="B73"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ulirsch</surname> <given-names>J. C.</given-names></name> <name><surname>Verboon</surname> <given-names>J. M.</given-names></name> <name><surname>Kazerounian</surname> <given-names>S.</given-names></name> <name><surname>Guo</surname> <given-names>M. H.</given-names></name> <name><surname>Yuan</surname> <given-names>D.</given-names></name> <name><surname>Ludwig</surname> <given-names>L. S.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>The genetic landscape of diamond-blackfan anemia.</article-title> <source><italic>Am. J. Hum. Genet.</italic></source> <volume>104</volume>:<issue>356</issue>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2018.12.011</pub-id> <pub-id pub-id-type="pmid">30735661</pub-id></citation></ref>
<ref id="B74"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vlachos</surname> <given-names>A.</given-names></name> <name><surname>Klein</surname> <given-names>G. W.</given-names></name> <name><surname>Lipton</surname> <given-names>J. M.</given-names></name></person-group> (<year>2001</year>). <article-title>The Diamond Blackfan anemia registry: tool for investigating the epidemiology and biology of Diamond-Blackfan anemia.</article-title> <source><italic>J. Pediatr. Hematol. Oncol.</italic></source> <volume>23</volume> <fpage>377</fpage>&#x2013;<lpage>382</lpage>. <pub-id pub-id-type="doi">10.1097/00043426-200108000-00015</pub-id> <pub-id pub-id-type="pmid">11563775</pub-id></citation></ref>
<ref id="B75"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vlachos</surname> <given-names>A.</given-names></name> <name><surname>Rosenberg</surname> <given-names>P. S.</given-names></name> <name><surname>Atsidaftos</surname> <given-names>E.</given-names></name> <name><surname>Kang</surname> <given-names>J.</given-names></name> <name><surname>Onel</surname> <given-names>K.</given-names></name> <name><surname>Sharaf</surname> <given-names>R. N.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Increased risk of colon cancer and osteogenic sarcoma in Diamond-Blackfan anemia.</article-title> <source><italic>Blood</italic></source> <volume>132</volume> <fpage>2205</fpage>&#x2013;<lpage>2208</lpage>. <pub-id pub-id-type="doi">10.1182/blood-2018-05-848937</pub-id> <pub-id pub-id-type="pmid">30266775</pub-id></citation></ref>
<ref id="B76"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weisser</surname> <given-names>M.</given-names></name> <name><surname>Schafer</surname> <given-names>T.</given-names></name> <name><surname>Leibundgut</surname> <given-names>M.</given-names></name> <name><surname>Bohringer</surname> <given-names>D.</given-names></name> <name><surname>Aylett</surname> <given-names>C. H. S.</given-names></name> <name><surname>Ban</surname> <given-names>N.</given-names></name></person-group> (<year>2017</year>). <article-title>Structural and e-functional insights into human rinitiation complexes.</article-title> <source><italic>Mol. Cell</italic></source> <volume>67</volume> <fpage>447</fpage>&#x2013;<lpage>456 e447</lpage>. <pub-id pub-id-type="doi">10.1016/j.molcel.2017.06.032</pub-id> <pub-id pub-id-type="pmid">28732596</pub-id></citation></ref>
<ref id="B77"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Woolfson</surname> <given-names>D. N.</given-names></name> <name><surname>Williams</surname> <given-names>D. H.</given-names></name></person-group> (<year>1990</year>). <article-title>The influence of proline residues on alpha-helical structure.</article-title> <source><italic>FEBS Lett.</italic></source> <volume>277</volume> <fpage>185</fpage>&#x2013;<lpage>188</lpage>. <pub-id pub-id-type="doi">10.1016/0014-5793(90)80839-b</pub-id></citation></ref>
<ref id="B78"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>J.</given-names></name> <name><surname>Naik</surname> <given-names>N.</given-names></name> <name><surname>Patel</surname> <given-names>J. S.</given-names></name> <name><surname>Wylie</surname> <given-names>C. S.</given-names></name> <name><surname>Gu</surname> <given-names>W.</given-names></name> <name><surname>Huang</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Predicting the viability of beta-lactamase: How folding and binding free energies correlate with beta-lactamase fitness.</article-title> <source><italic>PLoS One</italic></source> <volume>15</volume>:<issue>e0233509</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0233509</pub-id> <pub-id pub-id-type="pmid">32470971</pub-id></citation></ref>
<ref id="B79"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yates</surname> <given-names>A. D.</given-names></name> <name><surname>Achuthan</surname> <given-names>P.</given-names></name> <name><surname>Akanni</surname> <given-names>W.</given-names></name> <name><surname>Allen</surname> <given-names>J.</given-names></name> <name><surname>Allen</surname> <given-names>J.</given-names></name> <name><surname>Alvarez-Jarreta</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2020</year>). <article-title>Ensembl 2020.</article-title> <source><italic>Nucleic Acids Res</italic>.</source> <volume>48</volume>, <fpage>D682</fpage>&#x2013;<lpage>D688</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz966</pub-id> <pub-id pub-id-type="pmid">31691826</pub-id></citation></ref>
<ref id="B80"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yates</surname> <given-names>C. M.</given-names></name> <name><surname>Sternberg</surname> <given-names>M. J.</given-names></name></person-group> (<year>2013</year>). <article-title>The effects of non-synonymous single nucleotide polymorphisms (nsSNPs) on protein-protein interactions.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>425</volume> <fpage>3949</fpage>&#x2013;<lpage>3963</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2013.07.012</pub-id> <pub-id pub-id-type="pmid">23867278</pub-id></citation></ref>
<ref id="B81"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zamyatnin</surname> <given-names>A. A.</given-names></name></person-group> (<year>1972</year>). <article-title>Protein volume in solution.</article-title> <source><italic>Prog. Biophys. Mol. Biol.</italic></source> <volume>24</volume> <fpage>107</fpage>&#x2013;<lpage>123</lpage>.</citation></ref>
<ref id="B82"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Lai</surname> <given-names>M.</given-names></name> <name><surname>Chang</surname> <given-names>W.</given-names></name> <name><surname>Yu</surname> <given-names>I.</given-names></name> <name><surname>Ding</surname> <given-names>K.</given-names></name> <name><surname>Mrazek</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>Structures and stabilization of kinetoplastid-specific split rRNAs revealed by comparing leishmanial and human ribosomes.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>7</volume>:<issue>13223</issue>. <pub-id pub-id-type="doi">10.1038/ncomms13223</pub-id> <pub-id pub-id-type="pmid">27752045</pub-id></citation></ref>
<ref id="B83"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Z.</given-names></name> <name><surname>Teng</surname> <given-names>S.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name> <name><surname>Schwartz</surname> <given-names>C. E.</given-names></name> <name><surname>Alexov</surname> <given-names>E.</given-names></name></person-group> (<year>2010</year>). <article-title>Computational analysis of missense mutations causing Snyder-Robinson syndrome.</article-title> <source><italic>Hum. Mutat.</italic></source> <volume>31</volume> <fpage>1043</fpage>&#x2013;<lpage>1049</lpage>. <pub-id pub-id-type="doi">10.1002/humu.21310</pub-id> <pub-id pub-id-type="pmid">20556796</pub-id></citation></ref>
<ref id="B84"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>J. B.</given-names></name> <name><surname>Xiong</surname> <given-names>Y.</given-names></name> <name><surname>An</surname> <given-names>K.</given-names></name> <name><surname>Ye</surname> <given-names>Z. Q.</given-names></name> <name><surname>Wu</surname> <given-names>Y. D.</given-names></name></person-group> (<year>2020</year>). <article-title>IDRMutPred: predicting disease-associated germline nonsynonymous single nucleotide variants (nsSNVs) in intrinsically disordered regions.</article-title> <source><italic>Bioinformatics</italic></source> <volume>36</volume> <fpage>4977</fpage>&#x2013;<lpage>4983</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa618</pub-id> <pub-id pub-id-type="pmid">32756939</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="https://www.ebi.ac.uk/pdbe/prot_int/pistart.html">https://www.ebi.ac.uk/pdbe/prot_int/pistart.html</ext-link></p></fn>
</fn-group>
</back>
</article>
