<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mol. Biosci.</journal-id>
<journal-title>Frontiers in Molecular Biosciences</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mol. Biosci.</abbrev-journal-title>
<issn pub-type="epub">2296-889X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">800819</article-id>
<article-id pub-id-type="doi">10.3389/fmolb.2021.800819</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Molecular Biosciences</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Prediction of Residue-specific Contributions to Binding and Thermal Stability Using Yeast Surface Display</article-title>
<alt-title alt-title-type="left-running-head">Ahmed et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">Predicting Residue Burial, Mutant Stability</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Ahmed</surname>
<given-names>Shahbaz</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Bhasin</surname>
<given-names>Munmun</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Manjunath</surname>
<given-names>Kavyashree</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Varadarajan</surname>
<given-names>Raghavan</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="c001">
<sup>&#x2a;</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/201781/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Molecular Biophysics Unit, Indian Institute of Science</institution>, <addr-line>Bangalore</addr-line>, <country>India</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Institute for Stem Cell Science and Regenerative Medicine</institution>, <addr-line>Bangalore</addr-line>, <country>India</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1034012/overview">Paulo Ricardo Batista</ext-link>, Oswaldo Cruz Foundation (Fiocruz), Brazil</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1088716/overview">Mattia Miotto</ext-link>, Sapienza University of Rome, Italy</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1113399/overview">Alessandro Angelini</ext-link>, Ca&#x2019; Foscari University of Venice, Italy</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Raghavan Varadarajan, <email>varadar@iisc.ac.in</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Biological Modeling and Simulation, a section of the journal Frontiers in Molecular Biosciences</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>21</day>
<month>01</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>8</volume>
<elocation-id>800819</elocation-id>
<history>
<date date-type="received">
<day>24</day>
<month>10</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>14</day>
<month>12</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Ahmed, Bhasin, Manjunath and Varadarajan.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Ahmed, Bhasin, Manjunath and Varadarajan</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>Accurate prediction of residue burial as well as quantitative prediction of residue-specific contributions to protein stability and activity is challenging, especially in the absence of experimental structural information. This is important for prediction and understanding of disease causing mutations, and for protein stabilization and design. Using yeast surface display of a saturation mutagenesis library of the bacterial toxin CcdB, we probe the relationship between ligand binding and expression level of displayed protein, with <italic>in vivo</italic> solubility in <italic>E.&#x20;coli</italic> and <italic>in&#x20;vitro</italic> thermal stability. We find that both the stability and solubility correlate well with the total amount of active protein on the yeast cell surface but not with total amount of expressed protein. We coupled FACS and deep sequencing to reconstruct the binding and expression mean fluorescent intensity of each mutant. The reconstructed mean fluorescence intensity (MFI<sub>seq</sub>) was used to differentiate between buried site, exposed non active-site and exposed active-site positions with high accuracy. The MFI<sub>seq</sub> was also used as a criterion to identify destabilized as well as stabilized mutants in the library, and to predict the melting temperatures of destabilized mutants. These predictions were experimentally validated and were more accurate than those of various computational predictors. The approach was extended to successfully identify buried and active-site residues in the receptor binding domain of the spike protein of SARS-CoV-2, suggesting it has general applicability.</p>
</abstract>
<kwd-group>
<kwd>mutational scanning</kwd>
<kwd>residue burial</kwd>
<kwd>saturation mutagenesis</kwd>
<kwd>free energy</kwd>
<kwd>protein stability</kwd>
</kwd-group>
<contract-sponsor id="cn001">Bill and Melinda Gates Foundation<named-content content-type="fundref-id">10.13039/100000865</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Mutagenesis is often used to generate variants of proteins with improved biophysical properties such as solubility and activity and to understand protein function. The advancement of high-throughput mutagenesis techniques has enabled the generation of a large number of variants of a protein in a short span of time, in a massively parallelizable manner (<xref ref-type="bibr" rid="B77">Zheng et&#x20;al., 2004</xref>; <xref ref-type="bibr" rid="B29">Jain and Varadarajan, 2014</xref>; <xref ref-type="bibr" rid="B73">Wrenbeck et&#x20;al., 2016</xref>). If an appropriate functional assay to score protein activity <italic>in vivo</italic> exist, it is possible to infer the relative activity of each variant in the library, through library screening coupled to next generation sequencing (<xref ref-type="bibr" rid="B25">Fowler et&#x20;al., 2010</xref>; <xref ref-type="bibr" rid="B1">Adkar et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B39">Matreyek et&#x20;al., 2018</xref>). However, there is a dearth of efficient, high-throughput methods to measure the solubility and stability of multiple protein variants in parallel, and to discriminate between buried and active-site residues solely using mutational data (<xref ref-type="bibr" rid="B6">Bhasin and Varadarajan, 2021</xref>).</p>
<p>Yeast surface display (YSD) is commonly used as a tool to identify protein variants with improved biophysical properties (<xref ref-type="bibr" rid="B59">Schweickhardt et&#x20;al., 2003</xref>; <xref ref-type="bibr" rid="B30">Jones et&#x20;al., 2006</xref>). YSD is preferable to bacterial expression for disulfide containing or glycosylated proteins. Agglutinin based Aga2p is the most widely used system to display proteins on the yeast cell surface (<xref ref-type="bibr" rid="B62">Shusta et&#x20;al., 2008</xref>). Aga2p is a small protein (7.5&#xa0;kDa), covalently linked via disulphide linkages to the yeast cell surface protein Aga1p (<xref ref-type="bibr" rid="B7">Boder and Wittrup, 1997</xref>). Previous studies have shown that the amount of protein displayed on the yeast cell surface is directly correlated to the amount of protein secreted by the cells, as well as the thermal stability of the protein (<xref ref-type="bibr" rid="B61">Shusta et&#x20;al., 1999</xref>). However, in other studies where the secretion efficiency (<xref ref-type="bibr" rid="B27">Hagihara and Kim, 2002</xref>) or yeast cell surface expression of proteins was measured, no such correlation was observed (<xref ref-type="bibr" rid="B45">Park et&#x20;al., 2006</xref>; <xref ref-type="bibr" rid="B49">Piatesi et&#x20;al., 2006</xref>). Proteolysis of yeast surface displayed proteins has also been used to differentiate properly folded, stable variants from unstructured variants or molten globules, as a proxy for stabilization (<xref ref-type="bibr" rid="B18">Chevalier et&#x20;al., 2017</xref>; <xref ref-type="bibr" rid="B54">Rocklin et&#x20;al., 2017</xref>; <xref ref-type="bibr" rid="B4">Basanta et&#x20;al., 2020</xref>). However, this has primarily been applied to relatively small proteins (<xref ref-type="bibr" rid="B18">Chevalier et&#x20;al., 2017</xref>; <xref ref-type="bibr" rid="B54">Rocklin et&#x20;al., 2017</xref>; <xref ref-type="bibr" rid="B21">Dou et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B4">Basanta et&#x20;al., 2020</xref>).</p>
<p>A previous study which showed correlation between stability and expression levels was carried out on a limited number of mutants, that were studied individually. In addition, the WT protein itself had a very low T<sub>m</sub> (<xref ref-type="bibr" rid="B61">Shusta et&#x20;al., 1999</xref>). It has also been suggested that if the stability of a protein crosses a certain threshold, its expression does not increase linearly with increase in stability and it is therefore difficult to distinguish stable mutants from less stable ones, using only expression as the criterion (<xref ref-type="bibr" rid="B67">Traxlmayr and Shusta, 2017</xref>). With a very high level of yeast surface expression for unstable variants, the yeast quality control system may not be able to differentiate between properly folded, unfolded or molten globule like proteins. However, once displayed on the yeast cell surface such mutants may unfold or aggregate and hence will not bind to a tertiary structure specific ligand or cognate partner.</p>
<p>To verify the above hypothesis, we used <italic>Escherichia. coli</italic> (<italic>E.coli</italic>) CcdB as a model protein. CcdB is the toxin component of the CcdAB toxin-antitoxin (TA) module which binds both free DNA Gyrase and the DNA Gyrase-DNA complex, these are referred to as inhibition and poisoning respectively. Formation of the poisoned CcdB:DNA Gyrase:DNA ternary complex stalls replication and causes cell death (<xref ref-type="bibr" rid="B5">Bernard and Couturier, 1992</xref>). The other component of this TA module codes for an antitoxin CcdA, which neutralizes the toxicity of the CcdB toxin upon binding to CcdB. A mutation of Arginine to Cysteine in the DNA Gyrase subunit A (GyrA) at residue 462 can abolish the binding of Gyrase to CcdB (<xref ref-type="bibr" rid="B5">Bernard and Couturier, 1992</xref>). The CSH501&#x20;<italic>E.&#x20;coli</italic> strain carries this mutation in the gene of the <italic>gyrA</italic> subunit which makes it insensitive to CcdB (<xref ref-type="bibr" rid="B3">Bajaj et&#x20;al., 2008</xref>). In a previous study, a single-site saturation mutagenesis library of CcdB was generated and the mutants were scored based on their <italic>in vivo</italic> growth phenotype (MS<sub>seq</sub> score) (<xref ref-type="bibr" rid="B1">Adkar et&#x20;al., 2012</xref>). In <italic>E.&#x20;coli</italic>, a good correlation was found between the MS<sub>seq</sub> score of &#x223c;70 mutants with either &#x394;T<sub>m</sub> of purified protein (r &#x3d; 0.65) or <italic>in vivo</italic> solubility in <italic>E.&#x20;coli</italic> (r &#x3d; 0.69) (<xref ref-type="bibr" rid="B68">Tripathi et&#x20;al., 2016</xref>). In contrast to plate based phenotypes, YSD provides greater flexibility and improved quantitation. We therefore wished to explore the correlation between the amount of surface expression or ligand binding seen with YSD, with thermal stability and <italic>E.&#x20;coli in&#x20;vivo</italic> solubility using this large set of characterized mutants, which had a range of <italic>in&#x20;vitro</italic> thermal stability and <italic>in vivo</italic> solubility.</p>
<p>We initially examined 30 different variants of CcdB. Mutants were chosen so as to have varying solubility (when expressed in <italic>E.&#x20;coli</italic>), <italic>in&#x20;vitro</italic> thermal stability, accessibility and residue depth. Fewer mutants were chosen for exposed residues, where most mutants are tolerated. Residue V18 is one of the most highly buried residues in CcdB and several mutants which span a range of thermal stability and <italic>in vivo</italic> solubility were chosen at this position. The <italic>in vivo</italic> solubility of these mutants ranged from completely soluble to insoluble. We did not find a good correlation between total expressed protein amount on the yeast cell surface and either <italic>in vivo</italic> solubility in <italic>E.&#x20;coli,</italic> or <italic>in&#x20;vitro</italic> determined thermal stability. However, a better correlation was observed between the amount of active protein on the yeast cell surface (i.e.,&#x20;the amount of bound ligand) with <italic>in vivo</italic> solubility/thermal stability. In the yeast cell surface display system (<xref ref-type="bibr" rid="B13">Chao et&#x20;al., 2006</xref>), activity was monitored by measuring the extent of binding of yeast cell surface displayed CcdB to a FLAG tagged fragment of GyrA14 as described previously (<xref ref-type="bibr" rid="B57">Sahoo et&#x20;al., 2015</xref>).</p>
<p>Multiple rounds of sorting enrich mutants which have the highest expression and binding on the yeast cell surface. Sorting in such a way may lead to the identification of mutants with better biophysical properties, however, it does not give any information about the relative activity of all the mutants in a library. We coupled FACS and deep sequencing to reconstruct the MFI (MFI<sub>seq</sub>) of each mutant in the Site Saturation Mutagenesis (SSM) library of CcdB, using single round FACS sorting methodology. We use this parameter MFI<sub>seq</sub>, to rank all the mutants based on their activity to generate the mutational landscape or distribution of fitness effects (DFE). We found that the DFE generated using binding was more accurate than the DFE generated using expression. Overall, our MFI<sub>seq</sub> scoring parameter could readily discriminate between stable and destabilized mutants of CcdB in a highly multiplexed manner.</p>
<p>It is well known that mutations that affect activity occur primarily at either surface exposed residues directly involved in binding or catalysis or at buried residues important for folding and stability. It has been difficult to distinguish between these two classes of residues, solely from mutational data (<xref ref-type="bibr" rid="B6">Bhasin and Varadarajan, 2021</xref>). We show here that by examining the effects of charged substitution on surface expression we can discriminate between the two classes of residues. To further validate the approach described above, we analyzed previously published saturation mutagenesis YSD expression and binding data for the receptor binding domain (RBD) of SARS-CoV-2 to its ligand ACE-2 (<xref ref-type="bibr" rid="B64">Starr et&#x20;al., 2020</xref>). We could successfully predict both binding-site and buried residues solely from the mutational data in this system as&#x20;well.</p>
</sec>
<sec sec-type="materials|methods" id="s2">
<title>Materials and Methods</title>
<sec id="s2-1">
<title>Bacterial Strains, Yeast Strains and Plasmids</title>
<p>
<italic>E.coli</italic> CSH501 strain carries a mutation in the gyrA gene which abolishes inhibition and poisoning by CcdB (<xref ref-type="bibr" rid="B3">Bajaj et&#x20;al., 2008</xref>). The EBY100 strain of Saccharomyces cerevisiae has the aga1 gene under the Gal1 promoter for inducible expression and a TRP1 auxotrophic mutation. The strain lacks the aga2 gene, so only Aga2p fused protein expressed from the plasmid, will form a complex with the Aga1p for yeast cell surface display (<xref ref-type="bibr" rid="B8">Boder and Wittrup, 2000</xref>). The ccdB gene was cloned in the pBAD24 plasmid for controllable expression in <italic>E.&#x20;coli</italic>. ccdB mutants were cloned in the pPNLS shuttle vector for yeast cell surface expression (<xref ref-type="bibr" rid="B41">Najar et&#x20;al., 2017</xref>).</p>
</sec>
<sec id="s2-2">
<title>Cloning of WT and Mutant ccdB in <italic>E.coli</italic>
</title>
<p>ccdB mutants in pBAD24 were generated using three fragment Gibson assembly. Briefly, ccdB was amplified in two fragments using two sets of oligos. For each fragment one of the oligos binds to the vector and the other binds to the gene. The primer of both fragments which bind to the gene were completely overlapping and contained the desired mutation. The fragments were gel extracted and Gibson assembled with NdeI and HindIII digested pBAD24 vector. The Gibson assembled product was electroporated in <italic>E.&#x20;coli</italic> CSH501 strain and positive transformants were selected on LB agar media containing ampicillin (100&#xa0;&#x3bc;g/ml). The sequence was confirmed by Sanger sequencing. Sequence confirmed WT or mutant ccdB in pBAD24 vector was used as a template for PCR to amplify the ccdB gene by Vent DNA polymerase. The PCR amplified product was co-transformed with SfiI digested pPNLS vector in the EBY100 strain of S<italic>accharomyces cerevisiae</italic> using LiAc/SS carrier DNA/PEG method for <italic>in vivo</italic> recombination (<xref ref-type="bibr" rid="B26">Gietz and Schiestl, 2007</xref>). Positive transformants were selected on SDCAA Tryptophan dropout media plates and the sequence was confirmed by Sanger sequencing.</p>
</sec>
<sec id="s2-3">
<title>Protein Purification</title>
<p>WT and mutant CcdB was purified as described previously (<xref ref-type="bibr" rid="B14">Chattopadhyay and Varadarajan, 2019</xref>). Briefly, an overnight culture was diluted 100-fold in LB media containing ampicillin (100&#xa0;&#x3bc;g/ml) and induced with L-arabinose (0.2% w/v) at an OD<sub>600</sub> of &#x223c;0.5. Following induction for 3&#xa0;h, cells were harvested and lysed by sonication. The soluble fraction was separated using centrifugation and incubated with CcdA peptide (residues 45&#x2013;72<sup>nd</sup>) coupled to Affigel-15 at 4&#x00B0;C. The unbound fraction was removed and the column was washed with bicarbonate buffer (50&#xa0;mM NaHCO<sub>3</sub>, 500&#xa0;mM NaCl, pH 8.5). The bound protein was eluted with 200&#xa0;mM glycine (pH 2.5) and collected in an equal volume of 400&#xa0;mM HEPES buffer (pH 8) to neutralize the acidity of glycine.</p>
<p>GyrA14 was purified as described previously (<xref ref-type="bibr" rid="B19">Dao-Thi et&#x20;al., 2004</xref>). Briefly, an overnight culture was diluted 100-fold in LB media containing ampicillin (100&#xa0;&#x3bc;g/ml) and induced with IPTG (1&#xa0;mM) at an OD<sub>600</sub> of &#x223c;0.5. Following induction for 3&#xa0;h, cells were harvested and resuspended in TES buffer (0.2&#xa0;M Tris, pH 7.5, 0.5&#xa0;mM EDTA, 0.5&#xa0;M sucrose and 1&#xa0;mM PMSF). Cells were lysed and the soluble fraction was separated using centrifugation. The soluble fraction was incubated with pre-equilibrated Ni-NTA beads for 2&#xa0;h at 4&#x00B0;C. The unbound fraction was removed, and the column was washed with 100 column volumes of wash buffer (50&#xa0;mM imidazole in 0.05&#xa0;M Tris, pH 8, 0.5&#xa0;M NaCl). The protein was eluted with 500&#xa0;mM imidazole in 0.05&#xa0;M Tris, pH 8, 0.5&#xa0;M NaCl and dialysed against 1x&#x20;PBS.</p>
</sec>
<sec id="s2-4">
<title>Estimation of Solubility of WT and Mutant CcdB in <italic>E.coli</italic>
</title>
<p>
<italic>E.coli</italic> CSH501 strain, transformed with pBAD24 plasmid containing WT or mutant ccdB, was grown in media containing ampicillin for 16&#xa0;h at 37&#x00B0;C and 180 RPM. A secondary culture was grown by diluting overnight grown culture 100-fold. Upon reaching an OD<sub>600</sub> of 0.4&#x2013;0.5, CcdB variants were induced with Arabinose at a final concentration of 0.2% (w/v) for 3&#xa0;h. The cells were harvested from 1.5&#xa0;ml culture and lysed in 500&#xa0;&#xb5;l 1X PBS, using sonication. Supernatant and pellet fractions were separated by centrifugation at 13,000 RPM at 4&#x00B0;C. The pellet fraction was resuspended in 500&#xa0;&#xb5;l 1X PBS and equal volumes of pellet and supernatant fractions were loaded on Tricine-SDS-PAGE to measure the relative amounts of protein in each fraction.</p>
</sec>
<sec id="s2-5">
<title>Protein Thermal Stability Measurement Using Thermal Shift Assay</title>
<p>The thermal shift assay was conducted in an iCycle iQ5 Real Time Detection System (Bio-Rad, Hercules, CA). A solution of total volume 20&#xa0;&#x3bc;l containing 10&#xa0;&#x3bc;M of the purified CcdB protein and 2.5X Sypro orange dye in suitable buffer (200&#xa0;mM HEPES, 100&#xa0;mM glycine), pH 7.5 was added to a well of a 96-well iCycler iQ PCR plate. The plate was heated from 15&#x00B0;C to 90&#x00B0;C with a 0.5&#x00B0;C increment every 30&#xa0;s. The normalized fluorescence data was plotted against temperature and T<sub>m</sub> measured as described (<xref ref-type="bibr" rid="B42">Niesen et&#x20;al., 2007</xref>; <xref ref-type="bibr" rid="B68">Tripathi et&#x20;al., 2016</xref>).</p>
</sec>
<sec id="s2-6">
<title>Yeast Surface Expression of WT and Mutant CcdB Proteins in EBY100 Cells and Flow Cytometric Analysis</title>
<p>
<italic>Saccharomyces cerevisiae</italic> EBY100 cells containing WT ccdB or mutant in pPNLS plasmids were grown in 3&#xa0;ml SDCAA media (glucose 20&#xa0;g/L, yeast nitrogen base 6.7&#xa0;g/L, casamino acid 5&#xa0;g/L, citrate 4.3&#xa0;g/L, sodium citrate dihydrate 14.3&#xa0;g/L) for 16&#xa0;hours. Grown cells were diluted to an OD<sub>600</sub> of 0.2 in 3&#xa0;ml SDCAA media and grown till the OD<sub>600</sub> reached two. Thirty million cells were harvested using centrifugation and resuspended in 3&#xa0;ml SGCAA induction media (galactose 20&#xa0;g/L, yeast nitrogen base 6.7&#xa0;g/L, casamino acid 5&#xa0;g/L, citrate 4.3&#xa0;g/L, sodium citrate dihydrate 14.3&#xa0;g/L) for 16&#xa0;hours at 30&#x00B0;C, 250 RPM (<xref ref-type="bibr" rid="B13">Chao et&#x20;al., 2006</xref>). One million cells were used for flow cytometric analysis. The amount of total protein expressed on the yeast cell surface was estimated by incubating the induced cells in 20&#xa0;&#x3bc;l FACS buffer (1X PBS and 0.5% BSA), containing chicken anti-HA antibodies from Bethyl labs (1:600 dilution) for 30&#xa0;min at 4&#x00B0;C. This was followed by washing the cells twice with 100&#xa0;&#x3bc;l FACS buffer at 4&#x00B0;C. Washed cells were incubated with 20&#xa0;&#xb5;L FACS buffer containing goat anti-chicken antibodies conjugated to Alexa Fluor 488 (1:300 dilution), for 20&#xa0;min at 4&#x00B0;C. Fluorescence of yeast cells was measured by flow-cytometric analysis. The total amount of active protein on the yeast cell surface was estimated by incubating the induced cells in 20&#xa0;&#x3bc;l FACS buffer containing 100&#xa0;nM GyrA14 for 45&#xa0;min at 4&#x00B0;C. Cells were washed and incubated with 20&#xa0;&#xb5;l mouse anti-FLAG antibodies (1:300). This was followed by washing the cells twice with FACS buffer, followed by incubating with 20&#xa0;&#xb5;l rabbit anti-mouse antibodies conjugated to Alexa Fluor 633 (1:1,600 dilution). The flow-cytometric analysis was carried out on BD Accuri or BD Aria III instruments.</p>
</sec>
<sec id="s2-7">
<title>Yeast Surface Expression and Sorting of CcdB Single-Site Saturation Mutagenesis Library</title>
<p>Previously, an SSM library of ccdB was generated in the pBAD24 vector (<xref ref-type="bibr" rid="B1">Adkar et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B68">Tripathi et&#x20;al., 2016</xref>). The library was PCR amplified using primers having homology to the pPNLS vector. The PCR amplified library was gel extracted and cloned in pPNLS vector using yeast <italic>in vivo</italic> recombination.</p>
<p>A similar protocol was used for sample preparation of the library for FACS as described above for the single mutants with slight modifications. Briefly, ten million cells were taken for FACS sample preparation and the reagents were used in 10X higher volumes compared to the earlier flowcytometric analysis. Two different concentrations of GyrA14 (100&#xa0;nM, 5&#xa0;nM) were used for sorting CcdB mutants based on the binding in the 1D histogram. The cells were sorted in 11 and 10 different populations (bins) in case of binding with GyrA14 at concentrations of 100 and 5&#xa0;nM respectively. Additionally, 11 different populations (bins) were sorted from the expression histogram. The experiment was repeated in a biological replicate. The sorting of CcdB libraries was performed using a BD Aria III cell sorter.</p>
</sec>
<sec id="s2-8">
<title>Sample Preparation for Deep Sequencing</title>
<p>Sorted populations were grown on SDCAA agar plates for 48&#xa0;h. Colonies were scraped and plasmids were extracted from the cells. The ccdB gene was PCR amplified using primers which bind upstream and downstream of the ccdB sequence and had multiplex identifier (MID) sequence to segregate the reads from different sorted bins. The DNA was amplified for 15 cycles using PCR and the amplified product was gel extracted and purified. Equal amounts of DNA from each sorted population were pooled, and the library was generated using the TruSeq&#x2122; DNA PCR-Free kit from Illumina. The sequencing was done on an Illumina HiSeq 2,500 250&#xa0;PE platform at Macrogen, South Korea after incorporating 20% &#x3d5;X174 DNA in the library.</p>
</sec>
<sec id="s2-9">
<title>Analysis of Deep Sequencing Data</title>
<p>Deep sequencing data for the ccdB mutants obtained from the Hiseq 2,500 platform was processed using a pipeline developed by adopting certain aspects from an already existing in-house protocol (<ext-link ext-link-type="uri" xlink:href="https://github.com/skshrutikhare/cys_library_analysis">https://github.com/skshrutikhare/cys_library_analysis</ext-link>). The latter method involved the alignment with wild type sequence followed by merging of the paired-end reads, while in the modified protocol, the reads are first merged and then aligned with the wild-type sequence. The present methodology consists of the following steps: assembling the paired end reads, quality filtering, binning, alignment and mutant identification. All these steps were incorporated in a pipeline and made executable from a single command using a parameter file unique to a given data-set. In the first step, paired end reads were assembled using the PEAR v0.9.6 (Paired-End Read Merger) tool (<xref ref-type="bibr" rid="B76">Zhang et&#x20;al., 2014</xref>). The &#x201c;quality filtering&#x201d; step involved deletion of terminal &#x201c;NNN&#x201d; residues in the reads, and removal of reads, not containing the relevant MID and/or primers, along with the reads having mismatched MID&#x2019;s. Finally, only those reads having bases with Phred score &#x2265;20 are retained. A binning step involved further filtering, which eliminated all those reads having incorrectly placed primers, truncated MIDs/primers (due to quality filtering) and shorter/longer sequences than the length of the wild type sequences. The remaining reads were binned according to the respective MIDs. In the alignment step, reads were aligned with the wild type ccdB sequence using the Water v6.4.0.0 program (<xref ref-type="bibr" rid="B63">Smith and Waterman, 1981</xref>) and reformatted. The default values of all parameters, except the gap opening penalty, which was changed to 20, was used. In the final step of &#x201c;substitution&#x201d;, reads were classified based on insertions, deletions and substitutions (single, double etc mutants).</p>
</sec>
<sec id="s2-10">
<title>Mean Fluorescence Intensity Reconstruction From Deep Sequencing Data</title>
<p>Reads of each mutant were normalized across different bins individually (<xref ref-type="disp-formula" rid="e1">Equation 1</xref>), and the fraction of each mutant (<italic>Xi</italic>) distributed amongst the different bins was calculated (<xref ref-type="disp-formula" rid="e2">Equation 2</xref>). The reconstructed MFI for an individual mutant was calculated by the summation of the product, obtained upon multiplying the fraction (<italic>Xi</italic>) of the mutant in a particular bin 1) with the MFI of the corresponding bin obtained from the FACS experiment (<italic>Fi</italic>), across the various bins populated by the respective mutant (<xref ref-type="disp-formula" rid="e3">Equation 3</xref>).<disp-formula id="e1">
<mml:math id="m1">
<mml:mrow>
<mml:mi mathvariant="normal">Normalized&#xa0;read&#xa0;of&#xa0;mutant&#xa0;in</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mi mathvariant="normal">bin</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="italic">Ni</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mo>&#x3d;</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="bold">No</mml:mi>
<mml:mo>.</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mi mathvariant="bold">of&#xa0;reads&#xa0;of&#xa0;mutant</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mi mathvariant="bold">in</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mi mathvariant="bold">bin</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mtext>&#xa0;</mml:mtext>
</mml:mrow>
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">reads</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mi mathvariant="bold">in</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mi mathvariant="bold">bin</mml:mi>
<mml:mtext>&#x2009;</mml:mtext>
<mml:mo>&#xa0;</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
<label>Equation 1</label>
</disp-formula>
<disp-formula id="e2">
<mml:math id="m2">
<mml:mrow>
<mml:mi mathvariant="normal">Fraction&#xa0;of&#xa0;mutant&#xa0;in&#xa0;each&#xa0;gate</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="italic">Xi</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="bold-italic">Ni</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mn>1</mml:mn>
<mml:mi mathvariant="bold-italic">n</mml:mi>
</mml:msubsup>
<mml:mi mathvariant="bold-italic">Ni</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
<label>Equation 2</label>
</disp-formula>
<disp-formula id="e3">
<mml:math id="m3">
<mml:mrow>
<mml:mi mathvariant="normal">Reconstructed&#xa0;MFI</mml:mi>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mo>&#x3d;</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:munderover>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mn>1</mml:mn>
<mml:mi>n</mml:mi>
</mml:munderover>
<mml:mi>F</mml:mi>
<mml:mi>i</mml:mi>
<mml:mo>&#x2217;</mml:mo>
<mml:mi>X</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>Equation 3</mml:mi>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
<p>The MFI<sub>seq</sub> of the biological replicates were different so the MFI<sub>seq</sub> of one of the replicates was adjusted using &#x201c;m&#x201d; and &#x201c;c&#x201d; obtained from the correlation between the replicates and then averaged.<disp-formula id="equ1">
<mml:math id="m4">
<mml:mrow>
<mml:mi mathvariant="normal">Average&#xa0;MF</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">I</mml:mi>
<mml:mrow>
<mml:mi mathvariant="bold">seq</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mo>&#x3d;</mml:mo>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="bold">MFIseq</mml:mi>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">replicate</mml:mi>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">m</mml:mi>
<mml:mo>&#x2217;</mml:mo>
<mml:mi mathvariant="bold">MFIseq&#xa0;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="bold">replicate</mml:mi>
<mml:mtext>&#xa0;</mml:mtext>
<mml:mn>2</mml:mn>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>C</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:mfrac>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
</sec>
<sec id="s2-11">
<title>Maximum Likelihood Mean Fluorescence Intensity) Calculation</title>
<p>Reads of each mutant were normalized within and across the bins. The fraction of each mutant (<italic>Xi</italic>), distributed amongst the different bins, was calculated as explained in the above section. The fraction (<italic>Xi</italic>) was multiplied with a scaling factor to convert the data into integers as this is required by the package below. The mlMFI was calculated using a maximum likelihood method using the fitdistrplus R package as explained earlier (<xref ref-type="bibr" rid="B64">Starr et&#x20;al., 2020</xref>). The &#x201c;fitdistcens&#x201d; function in the fitdistplus R package helps in the estimation of fluorescence values for such observations using a maximum likelihood approach, where the values are transformed into a data frame of two columns left and right, describing each observed value as an interval and assuming a normal distribution of values. The left column contains the left bound of the interval and the right column contains the right bound of the interval for interval-censored observations, based on the fluorescence boundaries of each bin. The maximum likelihood approach was used to estimate the MFI of binding and expression for each mutant, based on its distribution of reads across the sorted bins, and the fluorescence boundaries of each sorted&#x20;bin.</p>
</sec>
<sec id="s2-12">
<title>Mean Fluorescence Intensity Calculations After Bins Merging</title>
<p>The bins were merged following which mlMFI amd MFI<sub>seq</sub> were calculated for GyrA14 binding (100&#xa0;nM) for replicate 1. The fraction of each mutant in each bin was calculated as explained in the earlier sections. To merge bins for a given mutant, fractions present in each of the bins to be merged were added arithmetically. For mlMFA calculation, the minimum and maximum fluorescent boundary of the merged bin was set at the lowest and highest value of the fluorescent boundary for that set of bins. The mlMFI of CcdB mutants was calculated as explained above. In the case of MFI<sub>seq</sub>, the mean fluorescent intensity of merged bins was determined by making a new bin spanning the set of merged bins. The MFI<sub>seq</sub> of CcdB mutants was then calculated as explained&#x20;above.</p>
</sec>
<sec id="s2-13">
<title>Depth, Accessibility and RankScore Calculations</title>
<p>Depth was calculated using the server DEPTH (<xref ref-type="bibr" rid="B12">Chakravarty and Varadarajan, 1999</xref>; <xref ref-type="bibr" rid="B65">Tan et&#x20;al., 2011</xref>). Accessibility was calculated using the program NACCESS (<xref ref-type="bibr" rid="B28">Hubbard SJ, 1993</xref>). In both cases, the input co-ordinates were homodimeric CcdB (PDB ID 3VUB). RankScore and MS<sub>seq</sub> are measures of mutational sensitivity in <italic>E.&#x20;coli</italic>. Values were obtained from Adkar et&#x20;al. (<xref ref-type="bibr" rid="B1">Adkar et&#x20;al., 2012</xref>). Buried residues were those with &#x3c;10% accessibility in 3VUB. Active-site residues were those with &#x394;ASA&#x3e;0. &#x394;ASA difference between the solvent accessible surface area of CcdB residues in the free (3VUB) and GyrA14-bound forms (1X75) respectively (<xref ref-type="bibr" rid="B2">Aghera et&#x20;al., 2020</xref>).</p>
</sec>
<sec id="s2-14">
<title>Deep Mutational Scanning of SARS COV-2 Receptor Binding Domain</title>
<p>The deep mutational scanning data was taken from a recent report (<xref ref-type="bibr" rid="B64">Starr et&#x20;al., 2020</xref>) in which two independent libraries of RBD were generated and sorted in four different bins based on expression or binding to ACE-2. In the MFI of binding and expression for individual mutants was reconstructed in that study using a maximum likelihood method using fitdistrplus R package. The expression MFI [Sortseq (expr)] data was shared by the authors in a repository (<ext-link ext-link-type="uri" xlink:href="https://github.com/jbloomlab/SARS-CoV-2-RBD_DMS">https://github.com/jbloomlab/SARS-CoV-2-RBD_DMS</ext-link>). We reconstructed the binding MFI [Sortseq (bind)] at an ACE-2 concentration of 100 pM (TiteSeq_09). For Sortseq (bind) estimation we used the script provided by the authors (<ext-link ext-link-type="uri" xlink:href="https://github.com/jbloomlab/SARS-CoV-2-RBD_DMS/blob/master/results/summary/compute_expression_meanF.md">https://github.com/jbloomlab/SARS-CoV-2-RBD_DMS/blob/master/results/summary/compute_expression_meanF.md</ext-link>). The authors used data from both single and multiple mutants, together with a model to account for epistatic effects to infer the MFI values for individual mutants. We modified the script to change the input data required to calculate Sortseq (bind). For both Sortseq (bind) and Sortseq (expr), we analyzed only single mutant data to avoid any artifacts that might arise from the epistatic model and took the average of delta Sortseq MFI {log (Sortseq (WT))&#x2014;log [Sortseq (mutant)]} of mutants which had multiple barcodes. The Sortseq MFI values of mutants were averaged between the two libraries and the antilog was calculated for delta Sortseq MFI to analyse the ratio of Sortseq (bind) or Sortseq (expr) of mutants with respect to&#x20;WT.</p>
</sec>
<sec id="s2-15">
<title>Statistical Analysis</title>
<p>The correlations and <italic>p</italic> values for its significance were calculated using the GraphPad Prism software 9.0.0 (&#x2a; indicates <italic>p</italic>&#x20;&#x3c; 0.05, &#x2a;&#x2a; indicates <italic>p</italic>&#x20;&#x3c; 0.01, &#x2a;&#x2a;&#x2a;&#x2a; indicates <italic>p</italic>&#x20;&#x3c; 0.0001). The weighted correlations were calculated using the weights function of R. For the computation of weighted correlation, a weight of 1/(&#x3c3;/&#xb5;) was used on the mean values of replicates.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec id="s3-1">
<title>Yeast Surface Display of CcdB Mutants</title>
<p>Yeast surface display (YSD) has become an increasingly popular tool for protein engineering and library screening applications (<xref ref-type="bibr" rid="B47">Pepper et&#x20;al., 2008</xref>). Aga2p mating adhesion receptor of <italic>Saccharomyces cerevisiae</italic> is used as a fusion protein for yeast surface display. For surface expression, we used a vector in which CcdB is fused at the C-terminus of Aga2 (<xref ref-type="bibr" rid="B57">Sahoo et&#x20;al., 2015</xref>). We generated (<xref ref-type="sec" rid="s11">Supplementary Figure S1</xref>) and individually characterized 30 CcdB variants on the yeast cell surface. Most CcdB mutants had similar levels of expression to the WT protein (<xref ref-type="fig" rid="F1">Figure&#x20;1A</xref>). However, the mutants showed different amounts of active protein as assayed by binding to the FLAG tagged GyrA14 compared to the WT protein (<xref ref-type="fig" rid="F1">Figure&#x20;1B</xref>). Previously, we have characterized the <italic>in&#x20;vitro</italic> thermal stability and <italic>in vivo</italic> solubility of several CcdB mutants (<xref ref-type="bibr" rid="B68">Tripathi et&#x20;al., 2016</xref>). The amounts of total and active protein were estimated using antibodies against the HA-tag at the N-terminal of the yeast surface displayed CcdB and the C-terminal FLAG tag of GyrA14 respectively. The correlation coefficient (r) between amount of total protein on the yeast cell surface with <italic>in vivo</italic> solubility or T<sub>m</sub> of the corresponding purified protein were 0.31 and 0.70 respectively (<xref ref-type="fig" rid="F2">Figures 2A,B</xref>). It is unclear why mutants which have very low solubility in <italic>E.&#x20;coli</italic> are highly expressed on the yeast cell surface. It was previously hypothesized that the protein folding quality control system in yeast is not as effective as in mammalian systems, therefore partially folded/molten globule/aggregated protein may exist on the surface of yeast (<xref ref-type="bibr" rid="B45">Park et&#x20;al., 2006</xref>). A correlation of r &#x3d; 0.80 was found between the amount of active protein on the yeast cell surface with its <italic>in vivo</italic> solubility determined in <italic>E.&#x20;coli</italic> (<xref ref-type="fig" rid="F2">Figure&#x20;2C</xref>). We also found a better correlation (r &#x3d; 0.90) between amount of active CcdB protein on the yeast cell surface and its <italic>in&#x20;vitro</italic> thermal stability (<xref ref-type="fig" rid="F2">Figure&#x20;2D</xref>), compared to that between total CcdB protein on the yeast cell surface and thermal stability.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Comparison of the level of expression and binding of CcdB mutants on the yeast cell surface. <bold>(A)</bold> The expression and <bold>(B)</bold> binding to GyrA14 of individual mutants. Errors are calculated from two biological replicates. Most mutants expressed at high levels, however, the amount of active protein varied widely. A few mutants which showed a high level of expression did not show any binding to GyrA14. In both panels, mutants are arranged in order of increasing expression level.</p>
</caption>
<graphic xlink:href="fmolb-08-800819-g001.tif"/>
</fig>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Weighted correlations of E.&#x20;coli <italic>in&#x20;vivo</italic> solubility and <italic>in&#x20;vitro</italic> thermal stability with the amount of total and active protein respectively, on the yeast cell surface. For individual mutants, MFI&#x2019;s of expression and binding were estimated by probing the HA tag on surface expressed protein and the FLAG tag on cell surface bound GyrA14 respectively. For weighted correlation calculation, a weight of 1/(&#x3c3;/&#xb5;) was used. Here &#x3c3; and &#xb5; represent the standard deviation and mean values for each point respectively. Weighted correlation of the total amount of protein (Expression MFI) displayed on the yeast cell surface with <bold>(A)</bold> <italic>in vivo</italic> solubility and <bold>(B)</bold> &#x394;T<sub>m</sub> [T<sub>m</sub> (mutant)-T<sub>m</sub> (WT)] of CcdB mutants. Weighted correlation of the amount of active protein (Binding MFI) on the yeast cell surface with <bold>(C)</bold> <italic>E.&#x20;coli in&#x20;vivo</italic> solubility and <bold>(D)</bold> &#x394;T<sub>m</sub> of CcdB mutants. A better correlation was observed between biophysical parameters with binding MFI rather than expression MFI. In the figure, the &#x394;T<sub>m</sub> of WT was increased by 1&#x00B0;C to remove overlap with another point. Data for <italic>E.&#x20;coli in&#x20;vivo</italic> solubility and thermal stability was taken from Tripathi et&#x20;al. (<xref ref-type="bibr" rid="B68">Tripathi et&#x20;al., 2016</xref>). WT data is shown in open circles. <italic>p</italic> values indicate the significance for non-zero slope values in all the correlations.</p>
</caption>
<graphic xlink:href="fmolb-08-800819-g002.tif"/>
</fig>
</sec>
<sec id="s3-2">
<title>Deep Sequencing Analysis of CcdB Library and Mean Fluorescence Intensity Calculation for CcdB Mutants</title>
<p>To extend these results, an SSM library of ccdB was expressed on the yeast cell surface. Different populations based on extent of binding to gyrase or cell surface expression were sorted. A total of 32 different populations were sorted at two different concentrations of GyrA14 (100&#xa0;nM, 5&#xa0;nM) as a function of either surface expression level or the extent of binding to GyrA14 (<xref ref-type="sec" rid="s11">Supplementary Figure S2</xref>). The lower concentration of GyrA14 was chosen to be around the K<sub>D</sub> of CcdB-GyrA binding (<xref ref-type="sec" rid="s11">Supplementary Figure S3</xref>), the higher concentration was one where WT CcdB approaches saturation in binding with GyrA14 on the yeast cell surface. We hypothesized that at lower concentrations of GyrA14, the binding on the yeast cell surface will be a function of both stability as well as binding affinity. However, at saturating concentration of GyrA14, the binding on the yeast cell surface will largely be a function of amount of correctly folded protein that in turn might be a function of protein stability, rather than the K<sub>d</sub> of the mutant(s). MFI was calculated for each mutant as explained in the Methods section. The MFI was calculated at different stringencies (where the stringency refers to the sum of reads for a given mutant over each gate of the histogram), namely 25, 50, 100, 150, and 200 reads. All mutants with a total read number less than the stringency value were removed from the analysis. As the stringency increased, the pairwise correlation between the biological replicates increased (<xref ref-type="sec" rid="s11">Supplementary Figure S4</xref>, <xref ref-type="sec" rid="s11">Supplementary Table S1</xref>). The data was analysed with a stringency of 50 reads, since at higher stringencies, correlation did not improve significantly, but the number of mutants reduced. Reconstructed Binding and Expression MFI from deep sequencing data are hereafter referred to as MFI<sub>seq</sub> (bind) and MFI<sub>seq</sub> (expr) respectively.</p>
</sec>
<sec id="s3-3">
<title>Mean Fluorescence Intensity Reconstruction and its Correlation With Stability, Solubility and Residue Burial</title>
<p>A few published studies have described estimation of MFI values using deep sequencing of sorted populations and are therefore similar to our experimental strategy. However, the procedure for MFI reconstruction in these reports was relatively complicated compared to that used here (<xref ref-type="bibr" rid="B60">Sharon et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B43">Noderer et&#x20;al., 2014</xref>; <xref ref-type="bibr" rid="B48">Peterman and Levine, 2016</xref>; <xref ref-type="bibr" rid="B10">Cambray et&#x20;al., 2018</xref>). In those studies, the fractions of reads were calculated in each bin for all the mutants and MFI (mlMFI) of mutants were calculated by fitting the data to a maximum likelihood distribution of the histogram. We found that if mutants are present in only one bin (highly destabilized or nonsense mutants) then this method is unable to perform the MFI calculation (<xref ref-type="bibr" rid="B64">Starr et&#x20;al., 2020</xref>). For the remaining mutants we found a good correlation between MFI<sub>seq</sub> and mlMFI for binding at 5 and 100&#xa0;nM GyrA14, and for expression (<xref ref-type="sec" rid="s11">Supplementary Figure S5</xref>). For mutants with over 50 reads, we could calculate the MFI of 11,153 mutants using the maximum likelihood method and 11,436 mutants using our method. We also found that progressively reducing the number of bins from eleven to six, does not significantly affect the estimated MFI values, however a further reduction to four bins results in a noticeable change in the estimated values using either method (<xref ref-type="sec" rid="s11">Supplementary Figure S6</xref>). A good correlation was also found between the MFI of individually analysed mutants and their corresponding MFI<sub>seq</sub> values, validating our approach of MFI reconstruction (<xref ref-type="sec" rid="s11">Supplementary Figure S7A, 7B</xref>). Individually analysed mutants showed a good correlation between the amount of active protein on the cell surface and <italic>in&#x20;vitro</italic> measured thermal stability of the purified protein. Similarly, we also found a good correlation between MFI<sub>seq</sub> (bind) of mutants inferred from deep sequencing, and thermal stability as well as <italic>in vivo</italic> solubility for the selected mutants (<xref ref-type="sec" rid="s11">Supplementary Figure S7C,&#x20;7D</xref>).</p>
<p>For the exposed residues (&#x3e;10% accessibility) (<xref ref-type="sec" rid="s11">Supplementary Figure S8</xref>), mutations did not affect the degree of surface expression and binding to GyrA14 (<xref ref-type="fig" rid="F3">Figures 3A,B</xref>). Expression was also unaffected by mutations in the active-site residues (identified from PDB ID:1X75) (<xref ref-type="fig" rid="F3">Figure&#x20;3C</xref>, <xref ref-type="sec" rid="s11">Supplementary Figure S8</xref>). However, many buried site mutants showed very low expression, possibly because of aggregation and degradation inside cells or during export (<xref ref-type="fig" rid="F3">Figure&#x20;3C</xref>). In the case of binding for buried and active-site residues, a very high mutational sensitivity was found (<xref ref-type="fig" rid="F3">Figure&#x20;3D</xref>) similar to the previous report of CcdB mutants in <italic>E.&#x20;coli</italic> (<xref ref-type="bibr" rid="B68">Tripathi et&#x20;al., 2016</xref>). We also found a very high mutational sensitivity of binding for a few non-interacting residues in the loop connecting beta strands S2 and S3 at both 5 and 100&#xa0;nM GyrA14 concentration (<xref ref-type="sec" rid="s11">Supplementary Figure S9</xref>). The residues I24, I25 and D26 in this loop are directly involved in interacting with Gyrase and mutation at non-interacting residues (22, 23 and 27) in the loop might restrict or alter the conformation of the loop, thus reducing the affinity of CcdB mutants to GyrA14. However, there was no effect on the expression of the mutants in this loop, indicating that the mutant proteins are not destabilized (<xref ref-type="sec" rid="s11">Supplementary Figure S9</xref>). We did not find a high correlation between MFI<sub>seq</sub> (bind) and either accessibility or depth, because many mutations at both buried and active-site residues have high mutational sensitivity (<xref ref-type="sec" rid="s11">Supplementary Table S2</xref>). The previously described parameter RankScore, is a measure of mutant activity in <italic>E.&#x20;coli</italic> (<xref ref-type="bibr" rid="B1">Adkar et&#x20;al., 2012</xref>) with high RankScore denoting lower activity. We found a poor correlation between the MFI<sub>seq</sub> (bind) values of CcdB mutants at both exposed non active-site as well as active-site residues, and RankScore. In <italic>E.&#x20;coli,</italic> most of the exposed non active-site residues do not show any mutational sensitivity, i.e.,&#x20;they have the same RankScore values as WT. However, in the present case many such CcdB mutants show lower binding to GyrA14 compared to WT. The loss of binding could be attributed to the decrease in the affinity between CcdB and Gyrase, or destabilization due to mutation. We defined a new parameter MrMFI (mean residue MFI) which is the mean of the MFI values of all the mutants at a certain position. MrMFI (expr) and MrMFI (bind) at 100&#xa0;nM GyrA14, show a good correlation with RankScore (<xref ref-type="sec" rid="s11">Supplementary Table S2</xref>). MrMFI (expr) also showed good correlation with Depth which is a structural measure of residue burial (<xref ref-type="bibr" rid="B12">Chakravarty and Varadarajan, 1999</xref>). However, in the case of binding at 5&#xa0;nM, a weaker correlation of MrMFI (bind) with the aforementioned parameters was observed (<xref ref-type="sec" rid="s11">Supplementary Table S2</xref>). In previous studies, identification of the active-site residues solely from the deep sequencing data was not very efficient (<xref ref-type="bibr" rid="B1">Adkar et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B6">Bhasin and Varadarajan, 2021</xref>), this is presumably because <italic>in vivo</italic> activity is often governed by threshold effects, and because mutations at buried residues also affect activity. The current methodology removes such drawbacks. We could distinguish between buried and active-site residues by comparing the MFI<sub>seq</sub> (bind) and MFI<sub>seq</sub> (expr). Most buried site residues showed low values of both MFI<sub>seq</sub> (bind) and MFI<sub>seq</sub> (expr) compared to WT. However, the active-site residues showed low MFI<sub>seq</sub> (bind) but similar MFI<sub>seq</sub> (expr) compared to WT. We found that the average MFI<sub>seq</sub> values of charged residues are a good predictor to discriminate between buried and active-site residues. For calculating MrMFI<sub>charged</sub> of charged WT residues, we only consider mutants with opposite charge. For some mutants at buried positions, we found a very low MrMFI<sub>charged</sub> (expr) but the mutants were absent in MrMFI<sub>charged</sub> (bind). We found that such mutants had very high reads, suggesting that the values of MrMFI<sub>charged</sub> (expr) are correct. We anticipated that such mutants lack binding and are therefore present only in the bin which had a background level of binding signal, the presence of mutant in only that gate led to the removal of such mutants due to the stringency set for the analysis. Hence, such mutants were assigned a MrMFI<sub>charged</sub> (bind) similar to other buried positions. MrMFI<sub>charged</sub> had a bimodal distribution (<xref ref-type="sec" rid="s11">Supplementary Figure S10</xref>), so k-means clustering was performed to identify the mean (&#xb5;) and standard deviation (&#x3c3;) of each distribution. The distributions were named D1 (higher MrMFI<sub>charged</sub>) and D2 (lower MrMFI <sub>charged</sub>). Buried site residues were assigned to be those which have MrMFI<sub>charged</sub> (bind) and MFI<sub>seq</sub> (expr) less than the set threshold (&#xb5;&#x2b;0.5&#x2a;&#x3c3;) for distribution D2. Active-site residues were assigned as those which had MrMFI<sub>charged</sub> (bind) less than (&#xb5;&#x2b;&#x3c3;) of the D2 distribution and MFI<sub>seq</sub> (expr) higher than (&#xb5;&#x2212;2&#x2a;&#x3c3;) of distribution D1 (<xref ref-type="fig" rid="F4">Figure&#x20;4</xref>). The accuracy, specificity and sensitivity of prediction of exposed non active-site, buried and exposed active-site residues are mentioned in <xref ref-type="sec" rid="s11">Supplementary Table S3</xref>. We also compared our prediction results derived from saturation mutagenesis phenotypes with those of an <italic>in silico</italic> predictor, PROF (<xref ref-type="bibr" rid="B56">Rost and Sander, 1994</xref>). For a residue to be classified as buried by PROF, the relative solvent accessibility cut-off used is&#x20;&#x3c; 12. We observed a slightly lower specificity and accuracy for CcdB, and lower sensitivity in the case of RBD when predictions were made using PROF (<xref ref-type="sec" rid="s11">Supplementary Table S4</xref>), relative to our predictions. We also examined the performance of PROF with other proteins and found that the specificity of the predictions was higher than 0.8 in all the cases except for CcdB. However, the sensitivity of the predictions was lower than 0.8 in all the cases except for CcdB, Gal4 and Ubiquitin. The accuracy for the PROF prediction was 0.77 and 0.78 for CcdB and RBD respectively, comparable but slightly lower than the corresponding values of 0.92 and 0.8 for CcdB and RBD respectively, from the saturation mutagenesis predictions in this&#x20;work.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Heatmap of normalized MFI<sub>seq</sub> values for CcdB mutants. MFI<sub>seq</sub> value of mutant was divided by the MFI<sub>seq</sub> value of WT to normalize it. <bold>(A)</bold> MFI<sub>seq</sub> (expr) and <bold>(B)</bold> MFI<sub>seq</sub> (bind) at 100&#xa0;nM GyrA14 for exposed non active-site residues. <bold>(C)</bold> MFI<sub>seq</sub> (expr) and <bold>(D)</bold> MFI<sub>seq</sub> (bind) for buried and active-site residues. Exposed, buried (PDB ID:3VUB) and active-site (PDB ID:1X75) residues are segregated based on the crystal structure. Residues which had accessibility greater than 10% were considered exposed, all remaining residues were considered buried, and active-site mutants in contact with GyrA14 were identified as explained the Methods section. Blue to red colour represents increasing normalized MFI<sub>seq</sub> values, black colour shows the WT residue at the corresponding position. White colour indicates that the mutant is not available. The buried site residues have very high mutational sensitivity both in case of expression and binding. The active-site residues show mutational sensitivity only with respect to Gyrase binding. Information about the mutational sensitivity of expression and binding can be used to differentiate exposed, buried and active-site residues.</p>
</caption>
<graphic xlink:href="fmolb-08-800819-g003.tif"/>
</fig>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Identification of buried and active-site residues from MrMFI<sub>charged</sub> (bind) and MrMFI<sub>charged</sub> (expr). Side chain accessibilities in dimeric CcdB (PDB: 3VUB), darker to lighter shade indicate increasing accessibility, accessibility is reported as log accessibility. The mutants were clustered into two bins based on the distribution of MrMFI<sub>charged</sub> and k-means and standard deviations were calculated for both distributions. The distributions were named D1 (higher MrMFI<sub>charged</sub>) and D2 (lower MrMFI<sub>charged</sub>). Residues which had MrMFI<sub>charged</sub> (binding) and MrMFI<sub>charged</sub> (expr) lower than (&#xb5;&#x2b;0.5&#x2a;&#x3c3;) of distribution D2 were characterized as buried. The false negatives were Y6, D19, Q21, S22, S70, V75 and G77, the polar side chains of these residues are pointing towards the surface. Active-site residues were identified as those in contact with GyyrA14 (PDB ID 1X75). Residues which had MrMFI<sub>charged</sub> (binding) less than (&#xb5;&#x2b;&#x3c3;) of D2 distribution and MrMFI<sub>charged</sub> (expr) higher than (&#xb5;-2&#x2a;&#x3c3;) of distribution D1 were predicted as active-site. We obtained a few putative false positives. However, these residues are likely involved in functional aspects of activity that cannot be inferred from the CcdB:GyrA14 crystal structure. The same residues were seen to be important for CcdB activity <italic>in vivo</italic> in <italic>E.&#x20;coli</italic> (<xref ref-type="bibr" rid="B68">Tripathi et&#x20;al., 2016</xref>). Some positions could not be categorized due to lack of reads, such positions are indicated with an &#x2018;X&#x2019;. Positions indicated with &#x2018;&#x2a;&#x2019; are the ones where MrMFI<sub>charged</sub> (expr) was observed and the mutants had high read counts but the mutants were absent in MrMFI<sub>charged</sub> (bind), such positions were assigned MrMFI<sub>charged</sub> (bind) values similar to other buried positions.</p>
</caption>
<graphic xlink:href="fmolb-08-800819-g004.tif"/>
</fig>
</sec>
<sec id="s3-4">
<title>Selection and Characterization of Putative Stabilized Mutants From Deep Sequencing Data</title>
<p>In the previous section, we discussed the correlation between protein biophysical properties such as thermal stability and <italic>in vivo</italic> solubility with either the amount of active protein or the ratio of active protein to total protein on the yeast cell surface for a few (30) mutants. However, most of these mutants were destabilized with respect to the WT protein. To confirm whether this correlation also holds for mutants that have stability similar or greater than WT, we selected a few CcdB mutants based on either the MFI<sub>seq</sub> (bind) or MFI<sub>seq</sub> (ratio) [MFI<sub>seq</sub> (bind)/MFI<sub>seq</sub> (expr)] for <italic>in&#x20;vitro</italic> characterization of thermal stability. We examined the average and standard deviation of expression for all mutants and selected only those mutants based on MFI<sub>seq</sub> (ratio) which cross a minimum cut-off (&#xb5;&#x2b;0.5&#x2a;&#x3c3;) for MFI<sub>seq</sub> (expr) to remove the bias created by mutants which have very low expression. No threshold for expression was set for selection of mutants based on their MFI<sub>seq</sub> (bind). No selection of the mutants was performed based solely on the MFI<sub>seq</sub> (expr).</p>
<p>Six mutants were characterized using the criteria MFI<sub>seq</sub> (bind) at 5&#xa0;nM GyrA14, none of them showed a higher T<sub>m</sub> than WT (<xref ref-type="fig" rid="F5">Figure&#x20;5A</xref>); whereas two of the mutants selected on the basis of MFI<sub>seq</sub> (ratio) showed a significantly higher T<sub>m</sub> than WT (<xref ref-type="fig" rid="F5">Figure&#x20;5B</xref>). A subset of seven mutants was selected based on MFI<sub>seq</sub> (bind) at 100&#xa0;nM GyrA14, none of the mutants showed higher stability than WT CcdB (<xref ref-type="fig" rid="F5">Figure&#x20;5C</xref>). Ten mutants were selected based on MFI<sub>seq</sub> (ratio) and characterized, four showed higher stability, two mutants were similar to WT and the remaining four were less stable than WT CcdB (<xref ref-type="fig" rid="F5">Figure&#x20;5D</xref>). We therefore hypothesize that if the stability of a mutant crosses a threshold then its expression will not increase further. To confirm this hypothesis, we measured the amount of active protein on the yeast cell surface for seven individual mutants which had T<sub>m</sub>&#x2019;s ranging from 60&#x00B0;C to 70&#x00B0;C, and found that the expression and binding for these mutants are similar to each other and to WT (<xref ref-type="sec" rid="s11">Supplementary Figure&#x20;S11</xref>).</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>&#x394;T<sub>m</sub> of putative stabilized CcdB mutants. Mutants were identified from <bold>(A)</bold> MFI<sub>seq</sub> (bind) at 5&#xa0;nM GyrA14, <bold>(B)</bold> MFI<sub>seq</sub> (ratio) at 5&#xa0;nM GyrA14, <bold>(C)</bold> MFI<sub>seq</sub> (bind) at 100&#xa0;nM GyrA14, <bold>(D)</bold> MFI<sub>seq</sub> (ratio) at 100&#xa0;nM GyrA14. The mutants were randomly selected from a subset of forty mutants which showed the highest MFI<sub>seq</sub> (bind) or the highest MFI<sub>seq</sub> (ratio) and had MFI<sub>seq</sub> (expr) &#x3e; 6,672.</p>
</caption>
<graphic xlink:href="fmolb-08-800819-g005.tif"/>
</fig>
</sec>
<sec id="s3-5">
<title>Prediction of Thermal Stabilities of Putative Destabilized Mutants</title>
<p>For destabilized mutants we observed a good correlation between MFI<sub>seq</sub> (bind) and T<sub>m</sub> of individual mutants (<xref ref-type="sec" rid="s11">Supplementary Figure S7D</xref>). Using this correlation, we next predicted the T<sub>m</sub> of each mutant for an additional set of (n &#x3d; 28) previously described CcdB mutants (<xref ref-type="bibr" rid="B68">Tripathi et&#x20;al., 2016</xref>) based on their MFI<sub>seq</sub> (bind). We found a good correlation (r &#x3d; 0.82) between predicted and <italic>in&#x20;vitro</italic> measured T<sub>m</sub> for this set of CcdB mutants as well (<xref ref-type="sec" rid="s11">Supplementary Figure S12A</xref>). This now allows us to identify putative destabilized mutants and accurately predict the extent of destabilization for all such mutants in the CcdB YSD library. We also predicted the thermal stability of CcdB mutants using the <italic>in silico</italic> predictor, HoTMuSiCv1.0 (<xref ref-type="bibr" rid="B53">Pucci et&#x20;al., 2020</xref>), however, we did not find a good correlation between measured and predicted T<sub>m</sub> (<xref ref-type="sec" rid="s11">Supplementary Figure S12B</xref>). It has been shown that <italic>in&#x20;vitro</italic> protein thermal stability and free energy of unfolding are correlated (<xref ref-type="bibr" rid="B16">Chen et&#x20;al., 2000</xref>; <xref ref-type="bibr" rid="B52">Prajapati et&#x20;al., 2007</xref>; <xref ref-type="bibr" rid="B68">Tripathi et&#x20;al., 2016</xref>). We therefore predicted the free energy of unfolding for CcdB mutants using SDM (<xref ref-type="bibr" rid="B44">Pandurangan et&#x20;al., 2017</xref>), mCSM (<xref ref-type="bibr" rid="B51">Pires et&#x20;al., 2014b</xref>), PoPMuSiC (<xref ref-type="bibr" rid="B20">Dehouck et&#x20;al., 2011</xref>), DynaMut (<xref ref-type="bibr" rid="B55">Rodrigues et&#x20;al., 2018</xref>), DUET (<xref ref-type="bibr" rid="B50">Pires et&#x20;al., 2014a</xref>), MAESTROweb (<xref ref-type="bibr" rid="B36">Laimer et&#x20;al., 2016</xref>), DeepDDG (<xref ref-type="bibr" rid="B11">Cao et&#x20;al., 2019</xref>), CUPSAT (<xref ref-type="bibr" rid="B46">Parthiban et&#x20;al., 2006</xref>), PremPS (<xref ref-type="bibr" rid="B17">Chen et&#x20;al., 2020</xref>) and INPS-MD (<xref ref-type="bibr" rid="B58">Savojardo et&#x20;al., 2016</xref>). We found moderate correlations, with DeepDDG performing the best (r &#x3d; 0.59), but still poorer compared to our prediction from YSD data (r &#x3d; 0.82). For a more detailed comparison we analysed the predictions of stability by DeepDDG, since this showed the highest correlation with measured stability of individual mutants at non active-site residues. We excluded residues 21, 22, 23 and 27 as these positions behaved like active-site residues. We found that trends for &#x394;&#x394;G predicted by DeepDDG for exposed non active-site residues are similar to those obtained from MFI<sub>seq</sub> (bind) (<xref ref-type="fig" rid="F6">Figures 6A,B</xref>). However, we observed some mutant specific differences at residues 8, 16, 50, 53 and 96. Mutations at residues 50 and 96 have highly deleterious effects which reduced GyrA14 binding to yeast surface displayed protein, these are only partially predicted by DeepDDG. In the case of charged and polar mutations at residue 8, 16 and 53 we did not observe a reduction in binding, but the software predicted them to be destabilizing. In the case of buried positions, we found mutation specific effects at 35, 52 and 94 where DeepDDG predicted changes were significantly smaller than the experimentally observed ones. We also found that most of the phenylalanine, tryptophan and arginine mutations were highly destabilizing and the mutants did not bind to GyrA14, however the software gave a lower stability penalty for these substitutions (<xref ref-type="fig" rid="F6">Figures 6C,D</xref>). Our MFI based measurements suggested greater destabilization for several mutants relative to DeepDDG prediction. While the overall trends were similar, as discussed above, there are several differences between MFI based and DeepDDG based stability predictions.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Comparison of stabilities estimated by DeepDDG and yeast surface display. Heat maps for <bold>(A,C)</bold> MFI<sub>seq</sub> (bind) normalized to WT and <bold>(B,D)</bold> &#x394;&#x394;G predicted by DeepDDG. Residue positions or specific amino acid mutations showing significantly different predicted stabilities by the two methods are highlighted by a box. Blue to red colour corresponds to increasing stability.</p>
</caption>
<graphic xlink:href="fmolb-08-800819-g006.tif"/>
</fig>
</sec>
<sec id="s3-6">
<title>Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain</title>
<p>To examine the generality of our approach, we also analyzed recently reported deep mutational scanning data of the SARS-CoV-2 receptor binding domain (<xref ref-type="bibr" rid="B64">Starr et&#x20;al., 2020</xref>). In this study two separate libraries were generated and individually sorted based on expression and binding to ACE-2. The binding [Sortseq (bind)] or expression [Sortseq (expr]) MFIs relative to WT for barcoded mutants were calculated from the deposited NGS data as explained in the Methods section. Additionally, we analyzed binding at only one concentration of ACE-2 (100&#xa0;pM, TiteSeq_09) at which the binding started to saturate. Buried residues were those with &#x3c;10% side chain accessibility in chain C of PDB ID 7KMH (<xref ref-type="bibr" rid="B31">Jones B. E. et&#x20;al., 2020</xref>). ACE-2 binding (active-site) residues were assigned as those contacting ACE-2 (<xref ref-type="bibr" rid="B38">Malladi et&#x20;al., 2021</xref>). To identify the active-site and buried residues from Sortseq data, we calculated the MrMFI<sub>charged</sub> for each position. Similar to CcdB, we observed a bimodal distribution for both MrMFI<sub>charged</sub> (bind) and MrMFI<sub>charged</sub> (expr) (<xref ref-type="sec" rid="s11">Supplementary Figure S13</xref>) and k-means and standard deviation were calculated for both the distribution D1 (higher MrMFI<sub>charged</sub>) and D2 (lower MrMFI<sub>charged</sub>). As described above for CcdB, buried residues were identified as those which had MrMFI<sub>charged</sub> (bind) and MrMFI<sub>charged</sub> (expr) less than the set threshold (&#xb5;&#x2b;0.5&#x2a;&#x3c3;) for distribution D2. The active-site positions were identified as those which had MrMFI<sub>charged</sub> (bind) lower than the set threshold (&#xb5;&#x2b;&#x3c3;) for population D2 and MrMFI<sub>charged</sub> (expr) values higher then (&#xb5;-2&#x2a;&#x3c3;) for population D1. We accurately identified most of the buried residues, however there were some false positive and false negative predictions relative to the crystal structure information (<xref ref-type="fig" rid="F7">Figure&#x20;7</xref>). We found 21 positions to be false negative buried positions. We categorized these false negatives into two categories, namely, glycine and the side chains which are pointing towards the surface. The accessibility calculated by DEPTH server for glycine was zero and we therefore expected glycine to fall into the false negative buried category. Thirteen positions out of twenty-one false negative were glycine. Another six positions, 336, 348, 361, 443 and 480 had their side chains pointing towards the protein surface. We also found similar false negative buried residues in CcdB where the side chain hydrophilic group was pointing towards the protein surface. Position 363 and 365 in RBD had accessibility &#x3c;10% and were pointing towards the core of the protein in the PDB (7KMH) used to calculate accessibility. However, we found that these positions have high accessibility (&#x3e;30%) in another structure (PDB ID 7D2Z). All the available RBD structures are in complex with other molecules, this might be responsible for variation in the accessibility of residues in different RBD structures. We found 17 false positive buried residue predictions, seven of them were aromatic, seven are charged or polar, two are prolines and one is an aliphatic residue. These positions have both reduced expression and binding for charged residue substitutions (<xref ref-type="sec" rid="s11">Supplementary Figure S14A, 14D</xref>) similar to the buried residues (<xref ref-type="sec" rid="s11">Supplementary Figure S14B, 14E</xref>). The specificity, sensitivity and accuracy of prediction is mentioned in <xref ref-type="sec" rid="s11">Supplementary Table S3</xref>. Active site residues were identified with very high accuracy (<xref ref-type="sec" rid="s11">Supplementary Table S3</xref>), though there were a few false negative and false positive predictions. Additionally, we found several positions which had Sortseq (expr) like WT, however, they had very low Sortseq (bind) (<xref ref-type="sec" rid="s11">Supplementary Figure S14A, 14D</xref>). We hypothesize that these positions are also assisting in the maintenance of proper RBM conformation and enabling its binding to ACE-2. Residues 447, 448, 473 and 476 which gave false positive results, 447 and 476 are part of the receptor binding motif (RBM) and contain glycine in a conformation which is available only for glycine. Hence mutation to a non-Gly residue will likely disrupt the conformation of the RBM thus decreasing binding to ACE-2. Mutations at positions 446, 453, 493 and 498 gave false negative results. Of these false negative positions, 446 is again glycine. We found that the Arg mutants at N493 and N498 positions have very little effect on expression and binding (<xref ref-type="sec" rid="s11">Supplementary Figure S14C, 14F</xref>). We hypothesized that these positions may not have the most optimal WT residue, or they may show no mutational penalty for binding to ACE-2. A recent report showed that the affinity of Q498R to ACE-2 is higher than WT RBD (<xref ref-type="bibr" rid="B74">Xue et&#x20;al., 2020</xref>) and was enriched as double mutant Q498R/N501Y when selection was performed for RBD mutants having high affinity towards ACE-2 (<xref ref-type="bibr" rid="B75">Zahradn&#xed;k et&#x20;al., 2021</xref>). It has also been reported that when chimeric virus evolved in the presence of neutralizing antibodies C121 and C141, this enriched for the Q493R mutation. The mutant virus grows to high PFU titers similar to WT, and infectivity is also inhibited by a chimeric ACE-2 analog, similar to WT (<xref ref-type="bibr" rid="B70">Weisblum et&#x20;al., 2020</xref>). The specificity, sensitivity and accuracy of prediction is mentioned in <xref ref-type="sec" rid="s11">Supplementary Table&#x20;S3</xref>.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Prediction of buried and active-site positions in SARS-CoV-2 RBD from Sortseq data. Buried residues were identified from chain C of PDB ID 7KMH, residues which had &#x3c;10% side chain accessibility were categorized as buried. The accessibility and depth was calculated using DEPTH server (<xref ref-type="bibr" rid="B65">Tan et&#x20;al., 2011</xref>). Active-site residues were identified from PDB ID 6M0J as explained earlier (<xref ref-type="bibr" rid="B38">Malladi et&#x20;al., 2021</xref>). Criteria used to predict buried and active-site positions from MFI data were identical to those used for CcdB. Positions which did not have MrMFI data or could not be assigned to either buried or active-site categories are highlighted with &#x201c;X&#x201d;. Accessibility calculated by DEPTH server for glycine is zero and these are marked with a &#x201c;&#x2a;&#x201d;.</p>
</caption>
<graphic xlink:href="fmolb-08-800819-g007.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>With the advancement of mutagenesis and directed evolution methodologies, proteins with modified traits and function can be developed in a relatively short duration of time (<xref ref-type="bibr" rid="B15">Chen and Arnold, 1991</xref>; <xref ref-type="bibr" rid="B72">Winter et&#x20;al., 1994</xref>; <xref ref-type="bibr" rid="B9">Bornscheuer et&#x20;al., 2019</xref>). <italic>E.&#x20;coli</italic> remains an expression host of choice for many proteins and high level, soluble <italic>E.&#x20;coli</italic> expression is a desirable attribute. When eukaryotic or unstable prokaryotic proteins are overexpressed in bacteria, they often tend to form insoluble aggregates called inclusion bodies (IB). Formation of IBs often results in low yields of purified soluble protein. Designing improved variants of a protein by increasing half-life, stability and activity is an ongoing requirement of most pharmaceutical and biotechnology industries. However, a reliable, high-throughput, efficient and rapid method is required for solubility and stability analysis of engineered proteins. Previously, several high-throughput methods to select for soluble expression have been developed based on fusion to a reporter protein. These rely on the reporter activity, which is perturbed if an aggregation prone protein is fused (<xref ref-type="bibr" rid="B40">Maxwell et&#x20;al., 1999</xref>; <xref ref-type="bibr" rid="B69">Waldo et&#x20;al., 1999</xref>; <xref ref-type="bibr" rid="B71">Wigley et&#x20;al., 2001</xref>; <xref ref-type="bibr" rid="B24">Fisher, 2006</xref>). These methods can be used to isolate protein variants with enhanced solubility but cannot reveal if the fused protein is properly folded. In some cases, such unstable proteins may also form soluble aggregates (<xref ref-type="bibr" rid="B68">Tripathi et&#x20;al., 2016</xref>). Since many of these reporter screens employ cytoplasmic expression and use bacterial hosts, disulphide rich or glycosylated proteins, or those binding to complex ligands cannot be studied. Yeast surface display coupled to FACS, has been widely used to evolve such targets. Typically, populations are sorted for multiple rounds to enrich for stable binders to a target of interest (<xref ref-type="bibr" rid="B34">Kieke et&#x20;al., 1999</xref>; <xref ref-type="bibr" rid="B22">Esteban and Zhao, 2004</xref>; <xref ref-type="bibr" rid="B35">Kim et&#x20;al., 2006</xref>; <xref ref-type="bibr" rid="B66">Traxlmayr and Obinger, 2012</xref>). While this approach readily selects for high affinity binders, selecting for stable proteins is more difficult. In some cases, this methodology has also been used to isolate stable variants of proteins (<xref ref-type="bibr" rid="B47">Pepper et&#x20;al., 2008</xref>) and a good correlation was observed between surface expression and improved biophysical parameters. However, other studies in different systems did not find such a correlation (<xref ref-type="bibr" rid="B45">Park et&#x20;al., 2006</xref>; <xref ref-type="bibr" rid="B49">Piatesi et&#x20;al., 2006</xref>).</p>
<p>In the present work we utilize YSD to measure the amount of total protein as well as total active protein displayed on the yeast cell surface. A good correlation was found between the amount of active CcdB mutant on the yeast surface and corresponding <italic>in vivo</italic> solubility in <italic>E.&#x20;coli</italic> (r &#x3d; 0.85) or T<sub>m</sub> (r &#x3d; 0.80). A recent report also suggests that the amount of active protein on the yeast cell surface can be used as a criterion to isolate stable mutants (<xref ref-type="bibr" rid="B67">Traxlmayr and Shusta, 2017</xref>). In the present study, no correlation was found between the amount of total protein on the yeast cell surface and the biophysical properties of mutants. A few mutants which have very low solubility in <italic>E.&#x20;coli</italic> showed very high expression, but there was a negligible amount of active protein on the yeast surface. It has been previously suggested that the quality control system in yeast is not able to discriminate these mutants from properly folded ones or alternatively that the folded conformation is maintained by chaperones in the ER (<xref ref-type="bibr" rid="B45">Park et&#x20;al., 2006</xref>). Once these mutants are exported to the cell surface they may start to unfold. This could be one reason why some groups including ours did not find a good correlation of surface expression with the stability or solubility of these proteins. In previous studies (<xref ref-type="bibr" rid="B61">Shusta et&#x20;al., 1999</xref>), a very limited number of proteins were used for surface expression studies, it is possible that in this small number, mutants which had high surface expression or secretion but lower stability than WT were not observed.</p>
<p>Yeast surface display coupled to FACS typically requires multiple rounds of sorting to enrich variants with desired activity and phenotype. Here, we have performed a single round of sorting and developed a rapid, uncomplicated procedure of estimating MFI&#x2019;s of individual mutants of CcdB combining FACS and deep sequencing. This MFI<sub>seq</sub> was shown to correlate well with the corresponding experimentally measured MFIs for several individual mutants. The MFI<sub>seq</sub> was used to generate the mutational landscape of expression and binding of a mutant library. We showed that such data can be used to accurately discriminate between buried, exposed non active-site and exposed active-site residues both for CcdB and an unrelated protein, RBD of the spike protein of SARS-CoV-2. Highly destabilizing charged mutations in the core of the protein decreased both expression and binding, while the active-site residues showed reduction in binding alone for charged mutations. Relative to an earlier study which assayed <italic>in vivo</italic> activity in <italic>E.&#x20;coli</italic> (<xref ref-type="bibr" rid="B1">Adkar et&#x20;al., 2012</xref>), the present methodology is better able to identify and distinguish between the two categories of mutationally sensitive residues, namely buried and exposed, active-site residues. Identification of active-site residues of interacting partners through charged mutation scanning provides a better alternative to alanine and cysteine scanning mutagenesis. In general, mutations that affect total activity <italic>in vivo</italic> can do so by affecting specific activity without changing the amount of folded protein, decrease the amount of folded protein without affecting specific activity or a combination of the above. The present analysis distinguishes between the above possibilities, and is therefore able to distinguish buried from exposed, active-site positions. This is useful for applications that attempt to use saturation mutagenesis data for protein model discrimination and structure prediction (<xref ref-type="bibr" rid="B33">Khare et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B32">Jones E. M. et&#x20;al., 2020</xref>) as well as interpreting clinical data on disease causing mutations (<xref ref-type="bibr" rid="B23">Findlay et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B37">Livesey and Marsh, 2020</xref>).</p>
<p>MFI<sub>seq</sub> (bind) was also used to predict the T<sub>m</sub> of CcdB mutants. We found a good correlation between predicted and measured &#x394;T<sub>m</sub> for a subset of CcdB mutants. We also compared the accuracy of <italic>in silico</italic> approaches used to predict the stability of mutants and found that these predictors had lower accuracy relative to our approach. We used experimental stability measurements for a small number of destabilized mutations, combined with MFI<sub>seq</sub> measurement to predict stabilities of all destabilized mutants in the saturation mutagenesis library. We could readily identify destabilized mutants of CcdB, however, the recovery of mutants more stable than WT was lower, but still significant, considering the rarity of such mutations. This is likely due to the possibility that if the stability of the protein crosses a threshold, additional increments in stability do not result in enhanced expression or binding.</p>
<p>A limitation of the present approach is that it requires an epitope tagged or fluorescently labelled conformation specific binding partner. Another limitation could be differential relative stability of proteins upon yeast cell surface display compared to expression in the native host and/or intracellular expression. For glycosylated proteins, the stability of mutants may also be altered because of hyper glycosylation of protein on the yeast cell surface compared to proteins expressed in mammalian systems or prokaryotic systems where glycosylation is absent. The presence of glycosylation may also affect the binding to a cognate partner which in turn may give rise to false results. This does not appear to be the case for the SARS-CoV-2 RBD which contains two glycans at residues 331 and 343, but may be an issue for proteins with multiple glycosylation sites. We are examining these possibilities in ongoing studies. Despite these caveats, the present study suggests that the proposed methodology can accurately distinguish buried from active-site residues, quantitatively estimate thermal stabilities of destabilized mutants in large libraries, and also be used with moderate accuracy to identify stabilized mutants.</p>
</sec>
</body>
<back>
<sec id="s5">
<title>Data Availability Statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/<xref ref-type="sec" rid="s11">Supplementary Material</xref>.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>RV and SA designed the experiments. SA performed all the experiments, RV and SA analyzed all the data. KM wrote the software and carried out the processing of the deep sequencing data. MB calculated the MFI of CcdB mutants using maximum likelihood method. RV and SA wrote most of the manuscript.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>This work was funded by grants to RV from the Department of Science and Technology, grant number-EMR/2017/004054, DT.December 15, 2018), Government of India, Department of Biotechnology, grant no. BT/COE/34/SP15219/2015 DT. November 20, 2015, Ministry of Science and Technology, Government of India and Bill and Melinda Gates Foundation (United&#x20;States) (INV-005948). We also acknowledge funding for infrastructural support from the following programs of the Government of India: DST FIST, UGC Centre for Advanced study, Ministry of Human Resource Development (MHRD), and the DBT IISc Partnership Program. The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.</p>
</sec>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ack>
<p>SA acknowledges Council of Scientific and Industrial Research for his fellowship (SPM-07/079(0218)/2015-EMR-I). KM is thankful to Department of Science and Technology (DST) Science and Engineering Research Board for financial support, sanction order no: PDF/2017/002641. MB acknowledges Council of Scientific and Industrial Research for her fellowship (SRF-09/079(2766)/2017-EMR-I). RV is a J.&#x20;C. Bose Fellow of DST. Aparna Asok is duly acknowledged for&#x20;FACS.</p>
</ack>
<sec id="s10">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmolb.2021.800819/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmolb.2021.800819/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.PDF" id="SM1" mimetype="application/PDF" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<sec id="s11">
<title>Abbreviations</title>
<p>YSD, yeast surface display; SSM, site saturation mutagenesis; FACS, fluorescence-activated cell sorting; DFE, distribution of fitness effects; RBD, receptor binding domain; SARS-CoV-2, severe acute respiratory syndrome coronavirus 2; ACE-2, angiotensin-converting enzyme&#x20;2.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adkar</surname>
<given-names>B. V.</given-names>
</name>
<name>
<surname>Tripathi</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Sahoo</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Bajaj</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Goswami</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Chakrabarti</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Protein Model Discrimination Using Mutational Sensitivity Derived from Deep Sequencing</article-title>. <source>Structure</source> <volume>20</volume>, <fpage>371</fpage>&#x2013;<lpage>381</lpage>. <pub-id pub-id-type="doi">10.1016/j.str.2011.11.021</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Aghera</surname>
<given-names>N. K.</given-names>
</name>
<name>
<surname>Prabha</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Tandon</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Chattopadhyay</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Vishwanath</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Srinivasan</surname>
<given-names>N.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Mechanism of CcdA-Mediated Rejuvenation of DNA Gyrase</article-title>. <source>Structure</source> <volume>28</volume>, <fpage>562</fpage>&#x2013;<lpage>572.e4</lpage>. <pub-id pub-id-type="doi">10.1016/j.str.2020.03.006</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bajaj</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Dewan</surname>
<given-names>P. C.</given-names>
</name>
<name>
<surname>Chakrabarti</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Goswami</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Barua</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Baliga</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2008</year>). <article-title>Structural Correlates of the Temperature Sensitive Phenotype Derived from Saturation Mutagenesis Studies of CcdB</article-title>. <source>Biochemistry</source> <volume>47</volume>, <fpage>12964</fpage>&#x2013;<lpage>12973</lpage>. <pub-id pub-id-type="doi">10.1021/bi8014345</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Basanta</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Bick</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Bera</surname>
<given-names>A. K.</given-names>
</name>
<name>
<surname>Norn</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Chow</surname>
<given-names>C. M.</given-names>
</name>
<name>
<surname>Carter</surname>
<given-names>L. P.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>An Enumerative Algorithm for De Novo Design of Proteins with Diverse Pocket Structures</article-title>. <source>Proc. Natl. Acad. Sci. USA</source> <volume>117</volume>, <fpage>22135</fpage>&#x2013;<lpage>22145</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.2005412117</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bernard</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Couturier</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>Cell Killing by the F Plasmid CcdB Protein Involves Poisoning of DNA-Topoisomerase II Complexes</article-title>. <source>J.&#x20;Mol. Biol.</source> <volume>226</volume>, <fpage>735</fpage>&#x2013;<lpage>745</lpage>. <pub-id pub-id-type="doi">10.1016/0022-2836(92)90629-X</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bhasin</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Varadarajan</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Prediction of Function Determining and Buried Residues through Analysis of Saturation Mutagenesis Datasets</article-title>. <source>Front. Mol. Biosci.</source> <volume>8</volume>, <fpage>635425</fpage>. <pub-id pub-id-type="doi">10.3389/fmolb.2021.635425</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Boder</surname>
<given-names>E. T.</given-names>
</name>
<name>
<surname>Wittrup</surname>
<given-names>K. D.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Yeast Surface Display for Screening Combinatorial Polypeptide Libraries</article-title>. <source>Nat. Biotechnol.</source> <volume>15</volume>, <fpage>553</fpage>&#x2013;<lpage>557</lpage>. <pub-id pub-id-type="doi">10.1038/nbt0697-553</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Boder</surname>
<given-names>E. T.</given-names>
</name>
<name>
<surname>Wittrup</surname>
<given-names>K. D.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>[25] Yeast Surface Display for Directed Evolution of Protein Expression, Affinity, and Stability</article-title>. <source>Meth. Enzym.</source> <volume>328</volume>, <fpage>430</fpage>&#x2013;<lpage>444</lpage>. <pub-id pub-id-type="doi">10.1016/s0076-6879(00)28410-3</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bornscheuer</surname>
<given-names>U. T.</given-names>
</name>
<name>
<surname>Hauer</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Jaeger</surname>
<given-names>K. E.</given-names>
</name>
<name>
<surname>Schwaneberg</surname>
<given-names>U.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Directed Evolution Empowered Redesign of Natural Proteins for the Sustainable Production of Chemicals and Pharmaceuticals</article-title>. <source>Angew. Chem. Int. Ed.</source> <volume>58</volume>, <fpage>36</fpage>&#x2013;<lpage>40</lpage>. <pub-id pub-id-type="doi">10.1002/anie.201812717</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cambray</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Guimaraes</surname>
<given-names>J.&#x20;C.</given-names>
</name>
<name>
<surname>Arkin</surname>
<given-names>A. P.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Evaluation of 244,000 Synthetic Sequences Reveals Design Principles to Optimize Translation in <italic>Escherichia coli</italic>
</article-title>. <source>Nat. Biotechnol.</source> <volume>36</volume>, <fpage>1005</fpage>&#x2013;<lpage>1015</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.4238</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cao</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Qi</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.&#x20;Z.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>DeepDDG: Predicting the Stability Change of Protein Point Mutations Using Neural Networks</article-title>. <source>J.&#x20;Chem. Inf. Model.</source> <volume>59</volume>, <fpage>1508</fpage>&#x2013;<lpage>1514</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jcim.8b00697</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chakravarty</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Varadarajan</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Residue Depth: a Novel Parameter for the Analysis of Protein Structure and Stability</article-title>. <source>Structure</source> <volume>7</volume>, <fpage>723</fpage>&#x2013;<lpage>732</lpage>. <pub-id pub-id-type="doi">10.1016/s0969-2126(99)80097-5</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chao</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Lau</surname>
<given-names>W. L.</given-names>
</name>
<name>
<surname>Hackel</surname>
<given-names>B. J.</given-names>
</name>
<name>
<surname>Sazinsky</surname>
<given-names>S. L.</given-names>
</name>
<name>
<surname>Lippow</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Wittrup</surname>
<given-names>K. D.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Isolating and Engineering Human Antibodies Using Yeast Surface Display</article-title>. <source>Nat. Protoc.</source> <volume>1</volume>, <fpage>755</fpage>&#x2013;<lpage>768</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2006.94</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chattopadhyay</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Varadarajan</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Facile Measurement of Protein Stability and Folding Kinetics Using a Nano Differential Scanning Fluorimeter</article-title>. <source>Protein Sci.</source> <volume>28</volume>, <fpage>1127</fpage>&#x2013;<lpage>1134</lpage>. <pub-id pub-id-type="doi">10.1002/pro.3622</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Arnold</surname>
<given-names>F. H.</given-names>
</name>
</person-group> (<year>1991</year>). <article-title>Enzyme Engineering for Nonaqueous Solvents: Random Mutagenesis to Enhance Activity of Subtilisin E in Polar Organic Media</article-title>. <source>Nat. Biotechnol.</source> <volume>9</volume>, <fpage>1073</fpage>&#x2013;<lpage>1077</lpage>. <pub-id pub-id-type="doi">10.1038/nbt1191-1073</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Sakon</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Stites</surname>
<given-names>W. E.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Increasing the Thermostability of Staphylococcal Nuclease: Implications for the Origin of Protein Thermostability</article-title>. <source>J.&#x20;Mol. Biol.</source> <volume>303</volume>, <fpage>125</fpage>&#x2013;<lpage>130</lpage>. <pub-id pub-id-type="doi">10.1006/jmbi.2000.4140</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>PremPS: Predicting the Impact of Missense Mutations on Protein Stability</article-title>. <source>PLOS Comput. Biol.</source> <volume>16</volume>, <fpage>e1008543</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1008543</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chevalier</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Silva</surname>
<given-names>D.-A.</given-names>
</name>
<name>
<surname>Rocklin</surname>
<given-names>G. J.</given-names>
</name>
<name>
<surname>Hicks</surname>
<given-names>D. R.</given-names>
</name>
<name>
<surname>Vergara</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Murapa</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Massively Parallel De Novo Protein Design for Targeted Therapeutics</article-title>. <source>Nature</source> <volume>550</volume>, <fpage>74</fpage>&#x2013;<lpage>79</lpage>. <pub-id pub-id-type="doi">10.1038/nature23912</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dao-Thi</surname>
<given-names>M.-H.</given-names>
</name>
<name>
<surname>Van Melderen</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>De Genst</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Buts</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Ranquin</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Wyns</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2004</year>). <article-title>Crystallization of CcdB in Complex with a GyrA Fragment</article-title>. <source>Acta Crystallogr. D Biol. Cryst.</source> <volume>60</volume>, <fpage>1132</fpage>&#x2013;<lpage>1134</lpage>. <pub-id pub-id-type="doi">10.1107/S0907444904007814</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dehouck</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Kwasigroch</surname>
<given-names>J.&#x20;M.</given-names>
</name>
<name>
<surname>Gilis</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Rooman</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>PoPMuSiC 2.1: A Web Server for the Estimation of Protein Stability Changes upon Mutation and Sequence Optimality</article-title>. <source>BMC Bioinformatics</source> <volume>12</volume>, <fpage>151</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-12-151</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dou</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Vorobieva</surname>
<given-names>A. A.</given-names>
</name>
<name>
<surname>Sheffler</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Doyle</surname>
<given-names>L. A.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Bick</surname>
<given-names>M. J.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>De Novo design of a Fluorescence-Activating &#x3b2;-barrel</article-title>. <source>Nature</source> <volume>561</volume>, <fpage>485</fpage>&#x2013;<lpage>491</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-018-0509-0</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Esteban</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Directed Evolution of Soluble Single-Chain Human Class II MHC Molecules</article-title>. <source>J.&#x20;Mol. Biol.</source> <volume>340</volume>, <fpage>81</fpage>&#x2013;<lpage>95</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2004.04.054</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Findlay</surname>
<given-names>G. M.</given-names>
</name>
<name>
<surname>Daza</surname>
<given-names>R. M.</given-names>
</name>
<name>
<surname>Martin</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>M. D.</given-names>
</name>
<name>
<surname>Leith</surname>
<given-names>A. P.</given-names>
</name>
<name>
<surname>Gasperini</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Accurate Classification of BRCA1 Variants with Saturation Genome Editing</article-title>. <source>Nature</source> <volume>562</volume>, <fpage>217</fpage>&#x2013;<lpage>222</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-018-0461-z</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fisher</surname>
<given-names>A. C.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Genetic Selection for Protein Solubility Enabled by the Folding Quality Control Feature of the Twin-Arginine Translocation Pathway</article-title>. <source>Protein Sci.</source> <volume>15</volume>, <fpage>449</fpage>&#x2013;<lpage>458</lpage>. <pub-id pub-id-type="doi">10.1110/ps.051902606</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fowler</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Araya</surname>
<given-names>C. L.</given-names>
</name>
<name>
<surname>Fleishman</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Kellogg</surname>
<given-names>E. H.</given-names>
</name>
<name>
<surname>Stephany</surname>
<given-names>J.&#x20;J.</given-names>
</name>
<name>
<surname>Baker</surname>
<given-names>D.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>High-resolution Mapping of Protein Sequence-Function Relationships</article-title>. <source>Nat. Methods</source> <volume>7</volume>, <fpage>741</fpage>&#x2013;<lpage>746</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.1492</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gietz</surname>
<given-names>R. D.</given-names>
</name>
<name>
<surname>Schiestl</surname>
<given-names>R. H.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>High-efficiency Yeast Transformation Using the LiAc/SS Carrier DNA/PEG Method</article-title>. <source>Nat. Protoc.</source> <volume>2</volume>, <fpage>31</fpage>&#x2013;<lpage>34</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2007.13</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hagihara</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>P. S.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Toward Development of a Screen to Identify Randomly Encoded, Foldable Sequences</article-title>. <source>Proc. Natl. Acad. Sci.</source> <volume>99</volume>, <fpage>6619</fpage>&#x2013;<lpage>6624</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.102172099</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Hubbard SJ</surname>
<given-names>T. J.</given-names>
</name>
</person-group> (<year>1993</year>). <source>NACCESS</source>. <publisher-loc>London</publisher-loc>: <publisher-name>Dep. Biochem. Mol. Biol. Univ. Coll. London</publisher-name>. </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jain</surname>
<given-names>P. C.</given-names>
</name>
<name>
<surname>Varadarajan</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>A Rapid, Efficient, and Economical Inverse Polymerase Chain Reaction-Based Method for Generating a Site Saturation Mutant Library</article-title>. <source>Anal. Biochem.</source> <volume>449</volume>, <fpage>90</fpage>&#x2013;<lpage>98</lpage>. <pub-id pub-id-type="doi">10.1016/j.ab.2013.12.002</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jones</surname>
<given-names>L. L.</given-names>
</name>
<name>
<surname>Brophy</surname>
<given-names>S. E.</given-names>
</name>
<name>
<surname>Bankovich</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Colf</surname>
<given-names>L. A.</given-names>
</name>
<name>
<surname>Hanick</surname>
<given-names>N. A.</given-names>
</name>
<name>
<surname>Garcia</surname>
<given-names>K. C.</given-names>
</name>
<etal/>
</person-group> (<year>2006</year>). <article-title>Engineering and Characterization of a Stabilized &#x3b1;1/&#x3b1;2 Module of the Class I Major Histocompatibility Complex Product Ld</article-title>. <source>J.&#x20;Biol. Chem.</source> <volume>281</volume>, <fpage>25734</fpage>&#x2013;<lpage>25744</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M604343200</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jones</surname>
<given-names>B. E.</given-names>
</name>
<name>
<surname>Brown-Augsburger</surname>
<given-names>P. L.</given-names>
</name>
<name>
<surname>Corbett</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Westendorf</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Davies</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Cujec</surname>
<given-names>T. P.</given-names>
</name>
<etal/>
</person-group> (<year>2020a</year>). <article-title>LY-CoV555, a Rapidly Isolated Potent Neutralizing Antibody, Provides protection in a Non-human Primate Model of SARS-CoV-2 Infection</article-title>. <source>Biorxiv Prepr. Serv. Biol.</source> <pub-id pub-id-type="doi">10.1101/2020.09.30.318972</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jones</surname>
<given-names>E. M.</given-names>
</name>
<name>
<surname>Lubock</surname>
<given-names>N. B.</given-names>
</name>
<name>
<surname>Venkatakrishnan</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Tseng</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Paggi</surname>
<given-names>J.&#x20;M.</given-names>
</name>
<etal/>
</person-group> (<year>2020b</year>). <article-title>Structural and Functional Characterization of G Protein-Coupled Receptors with Deep Mutational Scanning</article-title>. <source>Elife</source> <volume>9</volume>, <fpage>e61312</fpage>. <pub-id pub-id-type="doi">10.7554/eLife.54895</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Khare</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Bhasin</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sahoo</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Varadarajan</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Protein Model Discrimination Attempts Using Mutational Sensitivity, Predicted Secondary Structure, and Model Quality Information</article-title>. <source>Proteins</source> <volume>87</volume>, <fpage>326</fpage>&#x2013;<lpage>336</lpage>. <pub-id pub-id-type="doi">10.1002/prot.25654</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kieke</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>Shusta</surname>
<given-names>E. V.</given-names>
</name>
<name>
<surname>Boder</surname>
<given-names>E. T.</given-names>
</name>
<name>
<surname>Teyton</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Wittrup</surname>
<given-names>K. D.</given-names>
</name>
<name>
<surname>Kranz</surname>
<given-names>D. M.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Selection of Functional T&#x20;Cell Receptor Mutants from a Yeast Surface-Display Library</article-title>. <source>Proc. Natl. Acad. Sci.</source> <volume>96</volume>, <fpage>5651</fpage>&#x2013;<lpage>5656</lpage>. <comment>Available</comment>at: <comment>
<ext-link ext-link-type="uri" xlink:href="http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=21915&amp;tool=pmcentrez&amp;rendertype=abstract">http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid&#x3d;21915&#x26;tool&#x3d;pmcentrez&#x26;rendertype&#x3d;abstract</ext-link>.</comment> <pub-id pub-id-type="doi">10.1073/pnas.96.10.5651</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kim</surname>
<given-names>Y.-S.</given-names>
</name>
<name>
<surname>Bhandari</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Cochran</surname>
<given-names>J.&#x20;R.</given-names>
</name>
<name>
<surname>Kuriyan</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wittrup</surname>
<given-names>K. D.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Directed Evolution of the Epidermal Growth Factor Receptor Extracellular Domain for Expression in Yeast</article-title>. <source>Proteins</source> <volume>62</volume>, <fpage>1026</fpage>&#x2013;<lpage>1035</lpage>. <pub-id pub-id-type="doi">10.1002/prot.20618</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Laimer</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Hiebl-Flach</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lengauer</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Lackner</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>MAESTROweb: a Web Server for Structure-Based Protein Stability Prediction</article-title>. <source>Bioinformatics</source> <volume>32</volume>, <fpage>1414</fpage>&#x2013;<lpage>1416</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btv769</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Livesey</surname>
<given-names>B. J.</given-names>
</name>
<name>
<surname>Marsh</surname>
<given-names>J.&#x20;A.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Using Deep Mutational Scanning to Benchmark Variant Effect Predictors and Identify Disease Mutations</article-title>. <source>Mol. Syst. Biol.</source> <volume>16</volume>, <fpage>e9380</fpage>. <pub-id pub-id-type="doi">10.15252/msb.20199380</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Malladi</surname>
<given-names>S. K.</given-names>
</name>
<name>
<surname>Singh</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Pandey</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Gayathri</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Kanjo</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Ahmed</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Design of a Highly Thermotolerant, Immunogenic SARS-CoV-2 Spike Fragment</article-title>. <source>J.&#x20;Biol. Chem.</source> <volume>296</volume>, <fpage>100025</fpage>. <pub-id pub-id-type="doi">10.1074/jbc.RA120.016284</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Matreyek</surname>
<given-names>K. A.</given-names>
</name>
<name>
<surname>Starita</surname>
<given-names>L. M.</given-names>
</name>
<name>
<surname>Stephany</surname>
<given-names>J.&#x20;J.</given-names>
</name>
<name>
<surname>Martin</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Chiasson</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Gray</surname>
<given-names>V. E.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Multiplex Assessment of Protein Variant Abundance by Massively Parallel Sequencing</article-title>. <source>Nat. Genet.</source> <volume>50</volume>, <fpage>874</fpage>&#x2013;<lpage>882</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-018-0122-z</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maxwell</surname>
<given-names>K. L.</given-names>
</name>
<name>
<surname>Mittermaier</surname>
<given-names>A. K.</given-names>
</name>
<name>
<surname>Forman-Kay</surname>
<given-names>J.&#x20;D.</given-names>
</name>
<name>
<surname>Davidson</surname>
<given-names>A. R.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>A Simple <italic>In Vivo</italic> Assay for Increased Protein Solubility</article-title>. <source>Protein Sci.</source> <volume>8</volume>, <fpage>1908</fpage>&#x2013;<lpage>1911</lpage>. <pub-id pub-id-type="doi">10.1110/ps.8.9.1908</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Najar</surname>
<given-names>T. A.</given-names>
</name>
<name>
<surname>Khare</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Pandey</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Gupta</surname>
<given-names>S. K.</given-names>
</name>
<name>
<surname>Varadarajan</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Mapping Protein Binding Sites and Conformational Epitopes Using Cysteine Labeling and Yeast Surface Display</article-title>. <source>Structure</source> <volume>25</volume>, <fpage>395</fpage>&#x2013;<lpage>406</lpage>. <pub-id pub-id-type="doi">10.1016/j.str.2016.12.016</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Niesen</surname>
<given-names>F. H.</given-names>
</name>
<name>
<surname>Berglund</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Vedadi</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>The Use of Differential Scanning Fluorimetry to Detect Ligand Interactions that Promote Protein Stability</article-title>. <source>Nat. Protoc.</source> <volume>2</volume>, <fpage>2212</fpage>&#x2013;<lpage>2221</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2007.321</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Noderer</surname>
<given-names>W. L.</given-names>
</name>
<name>
<surname>Flockhart</surname>
<given-names>R. J.</given-names>
</name>
<name>
<surname>Bhaduri</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Diaz de Arce</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Khavari</surname>
<given-names>P. A.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Quantitative Analysis of Mammalian Translation Initiation Sites by FACS &#x2010;seq</article-title>. <source>Mol. Syst. Biol.</source> <volume>10</volume>, <fpage>748</fpage>. <pub-id pub-id-type="doi">10.15252/msb.20145136</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pandurangan</surname>
<given-names>A. P.</given-names>
</name>
<name>
<surname>Ochoa-Monta&#xf1;o</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Ascher</surname>
<given-names>D. B.</given-names>
</name>
<name>
<surname>Blundell</surname>
<given-names>T. L.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>SDM: A Server for Predicting Effects of Mutations on Protein Stability</article-title>. <source>Nucleic Acids Res.</source> <volume>45</volume>, <fpage>W229</fpage>&#x2013;<lpage>W235</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkx439</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Park</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Stowell</surname>
<given-names>X. F.</given-names>
</name>
<name>
<surname>Gai</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Saven</surname>
<given-names>J.&#x20;G.</given-names>
</name>
<name>
<surname>Boder</surname>
<given-names>E. T.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Limitations of Yeast Surface Display in Engineering Proteins of High Thermostability</article-title>. <source>Protein Eng. Des. Sel.</source> <volume>19</volume>, <fpage>211</fpage>&#x2013;<lpage>217</lpage>. <pub-id pub-id-type="doi">10.1093/protein/gzl003</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Parthiban</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Gromiha</surname>
<given-names>M. M.</given-names>
</name>
<name>
<surname>Schomburg</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>CUPSAT: Prediction of Protein Stability upon point Mutations</article-title>. <source>Nucleic Acids Res.</source> <volume>34</volume>, <fpage>W239</fpage>&#x2013;<lpage>W242</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkl190</pub-id> </citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pepper</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Cho</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Boder</surname>
<given-names>E. T.</given-names>
</name>
<name>
<surname>Shusta</surname>
<given-names>E.V.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>A Decade of Yeast Surface Display Technology: where Are We Now?</article-title> <source>Cchts</source> <volume>11</volume>, <fpage>127</fpage>&#x2013;<lpage>134</lpage>. <pub-id pub-id-type="doi">10.2174/138620708783744516</pub-id> </citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peterman</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Levine</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Sort-seq under the Hood: Implications of Design Choices on Large-Scale Characterization of Sequence-Function Relations</article-title>. <source>BMC Genomics</source> <volume>17</volume>, <fpage>206</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-016-2533-5</pub-id> </citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Piatesi</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Howland</surname>
<given-names>S. W.</given-names>
</name>
<name>
<surname>Rakestraw</surname>
<given-names>J.&#x20;A.</given-names>
</name>
<name>
<surname>Renner</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Robson</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Cebon</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2006</year>). <article-title>Directed Evolution for Improved Secretion of Cancer-Testis Antigen NY-ESO-1 from Yeast</article-title>. <source>Protein Expr. Purif.</source> <volume>48</volume>, <fpage>232</fpage>&#x2013;<lpage>242</lpage>. <pub-id pub-id-type="doi">10.1016/j.pep.2006.01.026</pub-id> </citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pires</surname>
<given-names>D. E. V.</given-names>
</name>
<name>
<surname>Ascher</surname>
<given-names>D. B.</given-names>
</name>
<name>
<surname>Blundell</surname>
<given-names>T. L.</given-names>
</name>
</person-group> (<year>2014a</year>). <article-title>DUET: A Server for Predicting Effects of Mutations on Protein Stability Using an Integrated Computational Approach</article-title>. <source>Nucleic Acids Res.</source> <volume>42</volume>, <fpage>W314</fpage>&#x2013;<lpage>W319</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gku411</pub-id> </citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pires</surname>
<given-names>D. E. V.</given-names>
</name>
<name>
<surname>Ascher</surname>
<given-names>D. B.</given-names>
</name>
<name>
<surname>Blundell</surname>
<given-names>T. L.</given-names>
</name>
</person-group> (<year>2014b</year>). <article-title>MCSM: Predicting the Effects of Mutations in Proteins Using Graph-Based Signatures</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>335</fpage>&#x2013;<lpage>342</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btt691</pub-id> </citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Prajapati</surname>
<given-names>R. S.</given-names>
</name>
<name>
<surname>Das</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sreeramulu</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Sirajuddin</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Srinivasan</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Krishnamurthy</surname>
<given-names>V.</given-names>
</name>
<etal/>
</person-group> (<year>2007</year>). <article-title>Thermodynamic Effects of Proline Introduction on Protein Stability</article-title>. <source>Proteins</source> <volume>66</volume>, <fpage>480</fpage>&#x2013;<lpage>491</lpage>. <pub-id pub-id-type="doi">10.1002/prot.21215</pub-id> </citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pucci</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Kwasigroch</surname>
<given-names>J.&#x20;M.</given-names>
</name>
<name>
<surname>Rooman</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Protein Thermal Stability Engineering Using HoTMuSiC</article-title>. <source>Methods Mol. Biol.</source> <volume>2112</volume>, <fpage>59</fpage>&#x2013;<lpage>73</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-0716-0270-6_5</pub-id> </citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rocklin</surname>
<given-names>G. J.</given-names>
</name>
<name>
<surname>Chidyausiku</surname>
<given-names>T. M.</given-names>
</name>
<name>
<surname>Goreshnik</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Ford</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Houliston</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Lemak</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Global Analysis of Protein Folding Using Massively Parallel Design, Synthesis, and Testing</article-title>. <source>Science</source> <volume>357</volume>, <fpage>168</fpage>&#x2013;<lpage>175</lpage>. <pub-id pub-id-type="doi">10.1126/science.aan0693</pub-id> </citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rodrigues</surname>
<given-names>C. H.</given-names>
</name>
<name>
<surname>Pires</surname>
<given-names>D. E.</given-names>
</name>
<name>
<surname>Ascher</surname>
<given-names>D. B.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>DynaMut: Predicting the Impact of Mutations on Protein Conformation, Flexibility and Stability</article-title>. <source>Nucleic Acids Res.</source> <volume>46</volume>, <fpage>W350</fpage>&#x2013;<lpage>W355</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky300</pub-id> </citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rost</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Sander</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>Combining Evolutionary Information and Neural Networks to Predict Protein Secondary Structure</article-title>. <source>Proteins</source> <volume>19</volume>, <fpage>55</fpage>&#x2013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1002/prot.340190108</pub-id> </citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sahoo</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Khare</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Devanarayanan</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Jain</surname>
<given-names>P. C.</given-names>
</name>
<name>
<surname>Varadarajan</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Residue Proximity Information and Protein Model Discrimination Using Saturation-Suppressor Mutagenesis</article-title>. <source>Elife</source> <volume>4</volume>, <fpage>e09532</fpage>. <pub-id pub-id-type="doi">10.7554/eLife.09532</pub-id> </citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Savojardo</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Fariselli</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Martelli</surname>
<given-names>P. L.</given-names>
</name>
<name>
<surname>Casadio</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>INPS-MD: a Web Server to Predict Stability of Protein Variants from Sequence and Structure</article-title>. <source>Bioinformatics</source> <volume>32</volume>, <fpage>2542</fpage>&#x2013;<lpage>2544</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btw192</pub-id> </citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schweickhardt</surname>
<given-names>R. L.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Garone</surname>
<given-names>L. M.</given-names>
</name>
<name>
<surname>Brondyk</surname>
<given-names>W. H.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>Structure-expression Relationship of Tumor Necrosis Factor Receptor Mutants that Increase Expression</article-title>. <source>J.&#x20;Biol. Chem.</source> <volume>278</volume>, <fpage>28961</fpage>&#x2013;<lpage>28967</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M212019200</pub-id> </citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sharon</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Kalma</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Sharp</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Raveh-Sadka</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Levo</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Zeevi</surname>
<given-names>D.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Inferring Gene Regulatory Logic from High-Throughput Measurements of Thousands of Systematically Designed Promoters</article-title>. <source>Nat. Biotechnol.</source> <volume>30</volume>, <fpage>521</fpage>&#x2013;<lpage>530</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.2205</pub-id> </citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shusta</surname>
<given-names>E. V.</given-names>
</name>
<name>
<surname>Kieke</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>Parke</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Kranz</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Wittrup</surname>
<given-names>K. D.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Yeast Polypeptide Fusion Surface Display Levels Predict thermal Stability and Soluble Secretion Efficiency 1&#x20;1Edited by J.&#x20;A. Wells</article-title>. <source>J.&#x20;Mol. Biol.</source> <volume>292</volume>, <fpage>949</fpage>&#x2013;<lpage>956</lpage>. <pub-id pub-id-type="doi">10.1006/jmbi.1999.3130</pub-id> </citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shusta</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Pepper</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Cho</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Boder</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>A Decade of Yeast Surface Display Technology: Where Are We Now?</article-title> <source>Cchts</source> <volume>11</volume>, <fpage>127</fpage>&#x2013;<lpage>134</lpage>. <pub-id pub-id-type="doi">10.2174/138620708783744516</pub-id> </citation>
</ref>
<ref id="B63">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Smith</surname>
<given-names>T. F.</given-names>
</name>
<name>
<surname>Waterman</surname>
<given-names>M. S.</given-names>
</name>
</person-group> (<year>1981</year>). <article-title>Identification of Common Molecular Subsequences</article-title>. <source>J.&#x20;Mol. Biol.</source> <volume>147</volume>, <fpage>195</fpage>&#x2013;<lpage>197</lpage>. <pub-id pub-id-type="doi">10.1016/0022-2836(81)90087-5</pub-id> </citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Starr</surname>
<given-names>T. N.</given-names>
</name>
<name>
<surname>Greaney</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Hilton</surname>
<given-names>S. K.</given-names>
</name>
<name>
<surname>Ellis</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Crawford</surname>
<given-names>K. H. D.</given-names>
</name>
<name>
<surname>Dingens</surname>
<given-names>A. S.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding</article-title>. <source>Cell</source> <volume>182</volume>, <fpage>1295</fpage>&#x2013;<lpage>1310.e20</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2020.08.012</pub-id> </citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tan</surname>
<given-names>K. P.</given-names>
</name>
<name>
<surname>Varadarajan</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Madhusudhan</surname>
<given-names>M. S.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>DEPTH: a Web Server to Compute Depth and Predict Small-Molecule Binding Cavities in Proteins</article-title>. <source>Nucleic Acids Res.</source> <volume>39</volume>, <fpage>W242</fpage>&#x2013;<lpage>W248</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr356</pub-id> </citation>
</ref>
<ref id="B66">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Traxlmayr</surname>
<given-names>M. W.</given-names>
</name>
<name>
<surname>Obinger</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Directed Evolution of Proteins for Increased Stability and Expression Using Yeast Display</article-title>. <source>Arch. Biochem. Biophys.</source> <volume>526</volume>, <fpage>174</fpage>&#x2013;<lpage>180</lpage>. <pub-id pub-id-type="doi">10.1016/j.abb.2012.04.022</pub-id> </citation>
</ref>
<ref id="B67">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Traxlmayr</surname>
<given-names>M. W.</given-names>
</name>
<name>
<surname>Shusta</surname>
<given-names>E. V.</given-names>
</name>
</person-group> (<year>2017</year>). &#x201c;<article-title>Directed Evolution of Protein thermal Stability Using Yeast Surface Display</article-title>,&#x201d; in <source>Methods in Molecular Biology</source> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Humana Press</publisher-name>), <fpage>45</fpage>&#x2013;<lpage>65</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-4939-6857-2_4</pub-id> </citation>
</ref>
<ref id="B68">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tripathi</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Gupta</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Khare</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Jain</surname>
<given-names>P. C.</given-names>
</name>
<name>
<surname>Patel</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Molecular Determinants of Mutant Phenotypes, Inferred from Saturation Mutagenesis Data</article-title>. <source>Mol. Biol. Evol.</source> <volume>33</volume>, <fpage>2960</fpage>&#x2013;<lpage>2975</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msw182</pub-id> </citation>
</ref>
<ref id="B69">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Waldo</surname>
<given-names>G. S.</given-names>
</name>
<name>
<surname>Standish</surname>
<given-names>B. M.</given-names>
</name>
<name>
<surname>Berendzen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Terwilliger</surname>
<given-names>T. C.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Rapid Protein-Folding Assay Using green Fluorescent Protein</article-title>. <source>Nat. Biotechnol.</source> <volume>17</volume>, <fpage>691</fpage>&#x2013;<lpage>695</lpage>. <pub-id pub-id-type="doi">10.1038/10904</pub-id> </citation>
</ref>
<ref id="B70">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Weisblum</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Schmidt</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>DaSilva</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Poston</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Lorenzi</surname>
<given-names>J.&#x20;C.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Escape from Neutralizing Antibodies by SARS-CoV-2 Spike Protein Variants</article-title>. <source>Elife</source> <volume>9</volume>, <fpage>e61312</fpage>. <pub-id pub-id-type="doi">10.7554/eLife.61312</pub-id> </citation>
</ref>
<ref id="B71">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wigley</surname>
<given-names>W. C.</given-names>
</name>
<name>
<surname>Stidham</surname>
<given-names>R. D.</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>N. M.</given-names>
</name>
<name>
<surname>Hunt</surname>
<given-names>J.&#x20;F.</given-names>
</name>
<name>
<surname>Thomas</surname>
<given-names>P. J.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Protein Solubility and Folding Monitored <italic>In Vivo</italic> by Structural Complementation of a Genetic Marker Protein</article-title>. <source>Nat. Biotechnol.</source> <volume>19</volume>, <fpage>131</fpage>&#x2013;<lpage>136</lpage>. <pub-id pub-id-type="doi">10.1038/84389</pub-id> </citation>
</ref>
<ref id="B72">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Winter</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Griffiths</surname>
<given-names>A. D.</given-names>
</name>
<name>
<surname>Hawkins</surname>
<given-names>R. E.</given-names>
</name>
<name>
<surname>Hoogenboom</surname>
<given-names>H. R.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>Making Antibodies by Phage Display Technology</article-title>. <source>Annu. Rev. Immunol.</source> <volume>12</volume>, <fpage>433</fpage>&#x2013;<lpage>455</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.iy.12.040194.002245</pub-id> </citation>
</ref>
<ref id="B73">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wrenbeck</surname>
<given-names>E. E.</given-names>
</name>
<name>
<surname>Klesmith</surname>
<given-names>J.&#x20;R.</given-names>
</name>
<name>
<surname>Stapleton</surname>
<given-names>J.&#x20;A.</given-names>
</name>
<name>
<surname>Adeniran</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Tyo</surname>
<given-names>K. E. J.</given-names>
</name>
<name>
<surname>Whitehead</surname>
<given-names>T. A.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Plasmid-based One-Pot Saturation Mutagenesis</article-title>. <source>Nat. Methods</source> <volume>13</volume>, <fpage>928</fpage>&#x2013;<lpage>930</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.4029</pub-id> </citation>
</ref>
<ref id="B74">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xue</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lai</surname>
<given-names>L.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Single point Mutations Can Potentially Enhance Infectivity of SARS-CoV-2 Revealed by In Silico Affinity Maturation and SPR Assay</article-title>. <source>bioRxiv</source>. <pub-id pub-id-type="doi">10.1101/2020.12.24.424245</pub-id> </citation>
</ref>
<ref id="B75">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zahradn&#xed;k</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Marciano</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Shemesh</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Zoler</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Chiaravalli</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Meyer</surname>
<given-names>B.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>SARS-CoV-2 RBD <italic>In Vitro</italic> Evolution Follows Contagious Mutation Spread, yet Generates an Able Infection Inhibitor</article-title>. <source>bioRxiv</source>. <pub-id pub-id-type="doi">10.1101/2021.01.06.425392</pub-id> </citation>
</ref>
<ref id="B76">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Kobert</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Flouri</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Stamatakis</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>PEAR: A Fast and Accurate Illumina Paired-End reAd mergeR</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>614</fpage>&#x2013;<lpage>620</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btt593</pub-id> </citation>
</ref>
<ref id="B77">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zheng</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Baumann</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Reymond</surname>
<given-names>J.-L.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>An Efficient One-step Site-Directed and Site-Saturation Mutagenesis Protocol</article-title>. <source>Nucleic Acids Res.</source> <volume>32</volume>, <fpage>e115</fpage>. <pub-id pub-id-type="doi">10.1093/NAR/GNH110</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>