<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="review-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mol. Biosci.</journal-id>
<journal-title>Frontiers in Molecular Biosciences</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mol. Biosci.</abbrev-journal-title>
<issn pub-type="epub">2296-889X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">787510</article-id>
<article-id pub-id-type="doi">10.3389/fmolb.2021.787510</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Molecular Biosciences</subject>
<subj-group>
<subject>Methods</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>QSalignWeb: A Server to Predict and Analyze Protein Quaternary Structure</article-title>
<alt-title alt-title-type="left-running-head">Dey et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">QSalignWeb: Predicting Protein Quaternary Structure</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Dey</surname>
<given-names>Sucharita</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="fn" rid="fn1">
<sup>&#x2020;</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1535210/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Prilusky</surname>
<given-names>Jaime</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Levy</surname>
<given-names>Emmanuel D.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1134827/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Department of Chemical and Structural Biology, Weizmann Institute of Science</institution>, <addr-line>Rehovot</addr-line>, <country>Israel</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Department of Life Sciences and Core Facilities, Weizmann Institute of Science</institution>, <addr-line>Rehovot</addr-line>, <country>Israel</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1126425/overview">Jessica Andreani</ext-link>, UMR9198 Institut de Biologie Int&#xe9;grative de la Cellule (I2BC), France</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/788178/overview">Ezgi Karaca</ext-link>, Dokuz Eylul University, Turkey</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/164086/overview">Elodie Laine</ext-link>, Sorbonne Universit&#xe9;, France</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Emmanuel D. Levy, <email>emmanuel.levy@weizmann.ac.il</email>
</corresp>
<fn fn-type="present-address" id="fn1">
<label>
<sup>
<bold>&#x2020;</bold>
</sup>
</label>
<p>
<bold>Present Address:</bold> Sucharita Dey, Department of Bioscience and Bioengineering, Indian Institute of Technology, Jodhpur, India</p>
</fn>
<fn fn-type="other">
<p>This article was submitted to Biological Modeling and Simulation, a section of the journal Frontiers in Molecular Biosciences</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>05</day>
<month>01</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>8</volume>
<elocation-id>787510</elocation-id>
<history>
<date date-type="received">
<day>30</day>
<month>09</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>02</day>
<month>12</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Dey, Prilusky and Levy.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Dey, Prilusky and Levy</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>The identification of physiologically relevant quaternary structures (QSs) in crystal lattices is challenging. To predict the physiological relevance of a particular QS, QSalign searches for homologous structures in which subunits interact in the same geometry. This approach proved accurate but was limited to structures already present in the Protein Data Bank (PDB). Here, we introduce a webserver (<ext-link ext-link-type="uri" xlink:href="http://www.qsalign.org/">www.QSalign.org</ext-link>) allowing users to submit homo-oligomeric structures of their choice to the QSalign pipeline. Given a user-uploaded structure, the sequence is extracted and used to search homologs based on sequence similarity and PFAM domain architecture. If structural conservation is detected between a homolog and the user-uploaded QS, physiological relevance is inferred. The web server also generates alternative QSs with PISA and processes them the same way as the query submitted to widen the predictions. The result page also shows representative QSs in the protein family of the query, which is informative if no QS conservation was detected or if the protein appears monomeric. These representative QSs can also serve as a starting point for homology modeling.</p>
</abstract>
<kwd-group>
<kwd>web server</kwd>
<kwd>protein evolution</kwd>
<kwd>protein quaternary structure</kwd>
<kwd>protein structure alignment</kwd>
<kwd>physiological interface</kwd>
<kwd>crystal contact</kwd>
<kwd>protein superposition</kwd>
<kwd>protein interactions</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Protein self-interactions are prevalent, and they drive the formation of homo-oligomeric structures (<xref ref-type="bibr" rid="B21">Goodsell and Olson, 2000</xref>; <xref ref-type="bibr" rid="B29">Levy et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B30">Levy and Teichmann, 2013</xref>; <xref ref-type="bibr" rid="B33">Marsh and Teichmann, 2015</xref>). The spatial organization of the subunits within a protein homo-oligomer defines its quaternary structure (QS). Knowledge of the physiological QS for a protein is not only key to understand its function (<xref ref-type="bibr" rid="B21">Goodsell and Olson, 2000</xref>; <xref ref-type="bibr" rid="B32">Marianayagam et&#x20;al., 2004</xref>; <xref ref-type="bibr" rid="B2">Amoutzias et&#x20;al., 2008</xref>), but also to analyze its evolution (<xref ref-type="bibr" rid="B18">Franzosa and Xia, 2009</xref>; <xref ref-type="bibr" rid="B20">Garcia-Seisdedos et&#x20;al., 2017</xref>) or to predict the impact of polymorphisms in human diseases (<xref ref-type="bibr" rid="B52">Yates and Sternberg, 2013</xref>).</p>
<p>With currently over 150,000 crystallographic structures of proteins in the Protein Data Bank (PDB) (<xref ref-type="bibr" rid="B44">Rose et&#x20;al., 2017</xref>; <xref ref-type="bibr" rid="B4">Armstrong et&#x20;al., 2019</xref>), much of our knowledge on protein QS comes from X-ray crystallography (<xref ref-type="bibr" rid="B36">Perutz et&#x20;al., 1960</xref>). However, one caveat of X-ray crystallography is its requirement for protein molecules to be arranged in a regular array to form a crystal lattice. In this lattice, some protein-protein contacts may be part of a protein&#x2019;s quaternary structure, whereas others only result from the crystal formation and are called crystal contacts.</p>
<p>Much work has been dedicated to distinguishing physiological interfaces from fortuitous crystal contacts (<xref ref-type="bibr" rid="B8">Capitani et&#x20;al., 2016</xref>; <xref ref-type="bibr" rid="B11">Dey and Levy, 2018</xref>; <xref ref-type="bibr" rid="B48">Xu and Dunbrack, 2019</xref>; <xref ref-type="bibr" rid="B16">Elez et&#x20;al., 2020</xref>). Several physicochemical, geometric, and evolutionary properties of the protein-protein binding surface such as amino acid composition (<xref ref-type="bibr" rid="B37">Ponstingl et&#x20;al., 2000</xref>; <xref ref-type="bibr" rid="B39">Bahadur et&#x20;al., 2004</xref>; <xref ref-type="bibr" rid="B53">Zhu et&#x20;al., 2006</xref>), interface size (<xref ref-type="bibr" rid="B25">Janin, 1997</xref>; <xref ref-type="bibr" rid="B23">Henrick and Thornton, 1998</xref>; <xref ref-type="bibr" rid="B28">Krissinel and Henrick, 2007</xref>), shape (<xref ref-type="bibr" rid="B47">Tsuchiya et&#x20;al., 2008</xref>), packing (<xref ref-type="bibr" rid="B39">Bahadur et&#x20;al., 2004</xref>; <xref ref-type="bibr" rid="B53">Zhu et&#x20;al., 2006</xref>; <xref ref-type="bibr" rid="B47">Tsuchiya et&#x20;al., 2008</xref>), sequence conservation (<xref ref-type="bibr" rid="B15">Elcock and McCammon, 2001</xref>; <xref ref-type="bibr" rid="B22">Guharoy and Chakrabarti, 2005</xref>; <xref ref-type="bibr" rid="B6">Baskaran et&#x20;al., 2014</xref>) or structure conservation (<xref ref-type="bibr" rid="B50">Xu and Dunbrack, 2011</xref>; <xref ref-type="bibr" rid="B49">Xu and Dunbrack, 2020</xref>) can discriminate physiological interfaces from crystal contacts. Several methods also integrated multiple features to train a classifier (<xref ref-type="bibr" rid="B53">Zhu et&#x20;al., 2006</xref>; <xref ref-type="bibr" rid="B7">Bernauer et&#x20;al., 2008</xref>; <xref ref-type="bibr" rid="B34">Mitra and Pal, 2011</xref>; <xref ref-type="bibr" rid="B10">Silva et&#x20;al., 2015</xref>; <xref ref-type="bibr" rid="B24">Hu et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B19">Fukasawa and Tomii, 2019</xref>; <xref ref-type="bibr" rid="B26">Jim&#xe9;nez-Garc&#xed;a et&#x20;al., 2019</xref>). Features range from types of atomic contacts at the interface to amino acid interface propensity scores, contact preferences, packing, co-evolution, and&#x20;more.</p>
<p>While these works focused on individual interfaces, PQS (<xref ref-type="bibr" rid="B23">Henrick and Thornton, 1998</xref>) enabled predicting the full QS, which could involve more than two chains and multiple distinct interfaces (e.g., in the case of a tetramer with dihedral symmetry). Today PQS has been succeeded by PISA (<xref ref-type="bibr" rid="B28">Krissinel and Henrick, 2007</xref>), which predicts the stability of QSs compatible with the crystal lattice of a protein. Like PISA, EPPIC (<xref ref-type="bibr" rid="B6">Baskaran et&#x20;al., 2014</xref>) predicts full QSs while placing emphasis on the conservation of amino acids at interfaces to infer their physiological relevance. QSalign (<xref ref-type="bibr" rid="B13">Dey et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B12">Dey and Levy, 2021</xref>) also employs evolutionary conservation, but it does not rely on sequence conservation. Instead, it searches for structural conservation of the structure of a QS across homologs. For example, if the QSs of two homologous homotetramers match, QSalign annotates both as physiologically relevant. An advantage of this approach is its accuracy: While EPPIC and PISA reach accuracies of &#x223c;85% for homo-oligomers, the accuracy of QSalign was 96%. At the same time, a drawback of the methodology is its coverage. Homologous QSs are necessary for the annotation and as a result, only about half of the QSs in the PDB can be annotated with this strategy. Nevertheless, as more structures are solved, coverage will increase. We previously used this strategy to annotate structures from the PDB. The implementation of QSalign relied on the 3DComplex database (<xref ref-type="bibr" rid="B29">Levy et&#x20;al., 2005</xref>), which is a classification of protein complexes of known structure. This dependency placed a barrier to generalizing this approach to any structure of choice. Here, we report the web server version of QSalign where users can upload a query structure of their choice.</p>
</sec>
<sec id="s2">
<title>Design, Use and Performance of the QSalignWeb Server</title>
<sec id="s2-1">
<title>Server Input</title>
<p>The user provides two pieces of information to the server. First, an email address is required to send the results to the user. Second, a structure in PDB format is uploaded. The structure may contain a particular QS with multiple chains or may also include a single chain. The way in which the QSalign strategy works has been described and benchmarked before and we refer the reader to the original paper (<xref ref-type="bibr" rid="B13">Dey et&#x20;al., 2018</xref>) for details.</p>
</sec>
<sec id="s2-2">
<title>Server Processing</title>
<p>Upon submission of the request by the user, three main actions are performed, as described in <xref ref-type="fig" rid="F1">Figure&#x20;1</xref>. First, the server analyzes the query structure to identify the number of subunits. If two or more subunits are found, the query structure is considered an assembly, referred to as A0. Then, we use PISA (<xref ref-type="bibr" rid="B28">Krissinel and Henrick, 2007</xref>) to generate additional possible assemblies, referred to as A1, A2, &#x2026; etc. We also record the number of subunits for each assembly.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Workflow of QSalignWeb. The user submits a query structure. Additional assemblies are identified using PISA. The resulting assemblies are each superposed with candidate QSs. Because structure superposition is computationally expensive, we only superpose QSs that exhibit the same number of subunits and show sequence homology. Homologs are identified based on two searches: A sequence similarity search with FASTA yields list L1, and a PFAM domain architecture similarity search yields list L0. We take the union of these two lists, and discard very close homologs (list L2, sequence identity &#x3e; 80%). The structure superposition and inference of physiological relevance is carried out as described previously (<xref ref-type="bibr" rid="B13">Dey et&#x20;al., 2018</xref>). On top of the results of the QS superposition, we also display a table of non-redundant QSs that share sequence similarity with the query.</p>
</caption>
<graphic xlink:href="fmolb-08-787510-g001.tif"/>
</fig>
<p>Second, the sequence of the structure is extracted, and PFAM domains are predicted based on version 33 (<xref ref-type="bibr" rid="B14">El-Gebali et&#x20;al., 2019</xref>). The PFAM domain architecture of the query sequence is subsequently used to search for homologs with the same domain architecture. This search is executed on a non-redundant set of QSs that we call NR50 (described below). The search by domain architecture yields a list of candidate proteins called L0. Additionally, we execute a FASTA (<xref ref-type="bibr" rid="B35">Pearson, 1990</xref>) search on sequences from the NR50 set and retrieve two lists: L1, the list of homologs showing <italic>less</italic> than 80% sequence identity (this cut-off can be adjusted in the submission form) together with a sequence-coverage &#x3e;70% (this cut-off can be adjusted in the submission form). We also retrieve L2, a list of structures with a sequence similar to that of the query (&#x3e;80% identity). The final list of candidate QSs used in the next step results from the union of L0 and L1 after removing structures from L2. This list contains valid homologs, and we call it H. Each query assembly is then superposed using Kpax (<xref ref-type="bibr" rid="B42">Ritchie, 2016</xref>) onto all the QSs from list H that share the same number of subunits. If a superposition yields a TM score above 0.65 (this cut-off can be adjusted in the submission form), we infer that the query QS is conserved and likely physiological.</p>
<p>Third, we search for non-redundant QSs whose sequence is similar to the query sequence. This search is executed on a dataset we call NR100 (described below). The hits are then ordered by decreasing sequence identity, and only the closest target is kept per NR50 group, yielding a list of N closest distinct&#x20;QS.</p>
<p>The NR50 and NR100 sets consist of non-redundant sets of QSs. Classically, redundancy is removed at the sequence level only. In this case, a homodimer and a monomer sharing 60% sequence identity would be grouped in the same NR50 cluster. In contrast, the non-redundant sets available in 3DComplex compare both graph topology created by connected chains along with sequence identity (<xref ref-type="bibr" rid="B29">Levy et&#x20;al., 2005</xref>). Since a monomer and a dimer yield different graphs, they would end up in two classes. However, two homodimers can be similar at the sequence level while interacting with a different interface. In this case, they would be wrongly grouped based on their graph topology. To avoid this, we created new non-redundant sets following the procedure from 3DComplex, while also imposing that QSs in the same group show a TM-score above&#x20;0.65.</p>
</sec>
<sec id="s2-3">
<title>Server Output</title>
<p>The server consists of both a visual output, and a downloadable archive containing the information computed by the server. The visual output is illustrated in <xref ref-type="fig" rid="F2">Figure&#x20;2</xref>. It includes an inference on the validity of the query based on the results of the QS superposition. Additionally, superposed structures are displayed using Mol&#x2a; (<xref ref-type="bibr" rid="B46">Sehnal et&#x20;al., 2021</xref>). Each QS of the superposed pair can be interactively toggled on or off to facilitate comparing their structure.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Results of QSalignWeb. The result page of QSalignWeb describes the prediction made by QSalign based on the superposition with homologous QSs. The superposition of the two QSs based on which the prediction is made is shown on the right-hand side. A table displays the result of the search of non-redundant QSs with a sequence similar to the query. In this list, the closest homolog with a high-confidence QS is highlighted in green and represents the structure we judge best for homology modeling.</p>
</caption>
<graphic xlink:href="fmolb-08-787510-g002.tif"/>
</fig>
<p>The QSs resulting from the search of the NR100 dataset are shown as a table and provide an overview of all QS types that exist among homologs, while prioritizing closely related sequences (even identical sequences). Those QSs are often annotated by QSalign, and we provide a confidence estimate for these QSs based on their QSalign annotation. The &#x201c;high confidence&#x201d; QS with the closest sequence to the query is highlighted and represents the best candidate for homology modeling. For example, if a user submits a single-chain structure for which no stable assembly is found with PISA, a closely related dimer annotated with high-confidence could serve as a template for modeling the query structure as a&#x20;dimer.</p>
<p>The downloadable archive contains a list of homologs for each assembly as well as their structure in PDB format files. It also provides aligned coordinates for the query and target to enable comparing their structure using a local visualization software.</p>
</sec>
<sec id="s2-4">
<title>Server Implementation</title>
<p>The server is hosted by the Weizmann Institute on a 64-bit machine running Linux CentOs. The backend runs on Perl and MySQL, the frontend uses Perl, JavaScript and the PDBe implementation of Mol&#x2a; (<ext-link ext-link-type="uri" xlink:href="https://github.com/PDBeurope/pdbe-molstar">https://github.com/PDBeurope/pdbe-molstar</ext-link>) to render the 3D representation of the molecule (<xref ref-type="bibr" rid="B46">Sehnal et&#x20;al., 2021</xref>). QSalignWeb is freely available and requires an email address for results to be sent when they become available. For details on the QSalign methodology we refer the reader to the original paper (<xref ref-type="bibr" rid="B13">Dey et&#x20;al., 2018</xref>).</p>
</sec>
<sec id="s2-5">
<title>Server Performance</title>
<p>The performance of the prediction pipeline of QSalignWeb was benchmarked in previous work (<xref ref-type="bibr" rid="B13">Dey et&#x20;al., 2018</xref>). According to this benchmark, the validation of a QS based on the conservation of its geometry yields predictions with an accuracy of 97%. Thus, the user can be confident about an inference of the server when homologs are found with a conserved QS. Also, according to the same benchmark, the correction of a QS by transitivity (i.e.,&#x20;query QS shows the same sequence but a different geometry to a valid QS) is more error-prone, with an accuracy of&#x20;89%.</p>
</sec>
</sec>
<sec sec-type="conclusion" id="s3">
<title>Conclusion</title>
<p>Sequence-based tertiary structure predictions have seen a recent breakthrough, notably with Alphafold2 (<xref ref-type="bibr" rid="B9">Cramer, 2021</xref>; <xref ref-type="bibr" rid="B27">Jumper et&#x20;al., 2021</xref>) and Rosettafold (<xref ref-type="bibr" rid="B5">Baek et&#x20;al., 2021</xref>). At the same time, predicting the QS of a protein remains challenging even when the tertiary structure is known. The use of evolution and coevolution information together with deep learning has also seen recent developments for scoring and predicting protein-protein interactions (<xref ref-type="bibr" rid="B3">Andreani et&#x20;al., 2020</xref>; <xref ref-type="bibr" rid="B40">Quadir et&#x20;al., 2021</xref>; <xref ref-type="bibr" rid="B41">Quignot et&#x20;al., 2021</xref>; <xref ref-type="bibr" rid="B51">Yan and Huang, 2021</xref>). Complementary to such residue-level information, the use of subunit interaction geometry conservation as evidence of a QS being physiological is a powerful approach, which yields accurate predictions (<xref ref-type="bibr" rid="B13">Dey et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B12">Dey and Levy, 2021</xref>). In that respect, we hope that making this prediction pipeline available as a web server will help biologists identify relevant QS of proteins and will help them investigate the QS of specific proteins. Together, these approaches will help bridge the gap between the sequence and structure space by adding a third dimension to proteomes and interactomes (<xref ref-type="bibr" rid="B1">Aloy and Russell, 2006</xref>; <xref ref-type="bibr" rid="B43">Rolland et&#x20;al., 2014</xref>; <xref ref-type="bibr" rid="B17">Elofsson, 2021</xref>; <xref ref-type="bibr" rid="B38">Postic et&#x20;al., 2021</xref>; <xref ref-type="bibr" rid="B45">Sali, 2021</xref>), thus making 3D proteomics or &#x201c;structuromics&#x201d; (<xref ref-type="bibr" rid="B31">Levy and Vogel, 2021</xref>) accessible.</p>
</sec>
</body>
<back>
<sec id="s4">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s5">
<title>Author Contributions</title>
<p>SD and JP developed the server with input from EL. SD and EL wrote the manuscript with help from&#x20;JP.</p>
</sec>
<sec id="s6">
<title>Funding</title>
<p>This work was supported by the European Research Council (ERC) under the European Union&#x2019;s Horizon 2020 research and innovation program (grant agreement No. 819318), by the Israel Science Foundation (Grant No. 1452/18), by a research grant from A.-M. Boucher, by research grants from the Estelle Funk Foundation, the Estate of Fannie Sherr, the Estate of Albert Delighter, the Merle S. Cahn Foundation, Mildred S. Gosden, the Estate of Elizabeth Wachsman, the Arnold Bortman Family Foundation.&#x20;</p>
</sec>
<sec sec-type="COI-statement" id="s7">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s8">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ack>
<p>We thank Eugene Krissinel for making PISA freely available, and Harry Greenblatt for help with the computer infrastructure. SD is acknowledging support from the Ramalingaswami re-entry fellowship (NO. BT/HRD/35/02/2006 dated March 30th 2021) by DBT, GOVT. of India.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Aloy</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Russell</surname>
<given-names>R. B.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Structural Systems Biology: Modelling Protein Interactions</article-title>. <source>Nat. Rev. Mol. Cell Biol.</source> <volume>7</volume>, <fpage>188</fpage>&#x2013;<lpage>197</lpage>. <pub-id pub-id-type="doi">10.1038/nrm1859</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Amoutzias</surname>
<given-names>G. D.</given-names>
</name>
<name>
<surname>Robertson</surname>
<given-names>D. L.</given-names>
</name>
<name>
<surname>Van de Peer</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Oliver</surname>
<given-names>S. G.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Choose Your Partners: Dimerization in Eukaryotic Transcription Factors</article-title>. <source>Trends Biochem. Sci.</source> <volume>33</volume>, <fpage>220</fpage>&#x2013;<lpage>229</lpage>. <pub-id pub-id-type="doi">10.1016/j.tibs.2008.02.002</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Andreani</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Quignot</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Guerois</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Structural Prediction of Protein Interactions and Docking Using Conservation and Coevolution</article-title>. <source>Wires Comput. Mol. Sci.</source> <volume>10</volume>, <fpage>e1470</fpage>. <pub-id pub-id-type="doi">10.1002/wcms.1470</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Armstrong</surname>
<given-names>D. R.</given-names>
</name>
<name>
<surname>Berrisford</surname>
<given-names>J.&#x20;M.</given-names>
</name>
<name>
<surname>Conroy</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Gutmanas</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Anyango</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Choudhary</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>PDBe: Improved Findability of Macromolecular Structure Data in the PDB</article-title>. <source>Nucleic Acids Res.</source> <volume>48</volume>, <fpage>D335</fpage>&#x2013;<lpage>D343</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkz990</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baek</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>DiMaio</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Anishchenko</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Dauparas</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Ovchinnikov</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>G. R.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Accurate Prediction of Protein Structures and Interactions Using a Three-Track Neural Network</article-title>. <source>Science</source> <volume>373</volume>, <fpage>871</fpage>&#x2013;<lpage>876</lpage>. <pub-id pub-id-type="doi">10.1126/science.abj8754</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baskaran</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Duarte</surname>
<given-names>J.&#x20;M.</given-names>
</name>
<name>
<surname>Biyani</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Bliven</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Capitani</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>A PDB-wide, Evolution-Based Assessment of Protein-Protein Interfaces</article-title>. <source>BMC Struct. Biol.</source> <volume>14</volume>, <fpage>22</fpage>. <pub-id pub-id-type="doi">10.1186/s12900-014-0022-0</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bernauer</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Bahadur</surname>
<given-names>R. P.</given-names>
</name>
<name>
<surname>Rodier</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Janin</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Poupon</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>DiMoVo: a Voronoi Tessellation-Based Method for Discriminating Crystallographic and Biological Protein-Protein Interactions</article-title>. <source>Bioinformatics</source> <volume>24</volume>, <fpage>652</fpage>&#x2013;<lpage>658</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btn022</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Capitani</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Duarte</surname>
<given-names>J.&#x20;M.</given-names>
</name>
<name>
<surname>Baskaran</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Bliven</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Somody</surname>
<given-names>J.&#x20;C.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Understanding the Fabric of Protein Crystals: Computational Classification of Biological Interfaces and crystal Contacts</article-title>. <source>Bioinformatics</source> <volume>32</volume>, <fpage>481</fpage>&#x2013;<lpage>489</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btv622</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cramer</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>AlphaFold2 and the Future of Structural Biology</article-title>. <source>Nat. Struct. Mol. Biol.</source> <volume>28</volume>, <fpage>704</fpage>&#x2013;<lpage>705</lpage>. <pub-id pub-id-type="doi">10.1038/s41594-021-00650-1</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Da Silva</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Desaphy</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Bret</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Rognan</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Rognan</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>IChemPIC: A Random Forest Classifier of Biological and Crystallographic Protein-Protein Interfaces</article-title>. <source>J.&#x20;Chem. Inf. Model.</source> <volume>55</volume>, <fpage>2005</fpage>&#x2013;<lpage>2014</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jcim.5b00190</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Dey</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Levy</surname>
<given-names>E. D.</given-names>
</name>
</person-group> (<year>2018</year>). &#x201c;<article-title>Inferring and Using Protein Quaternary Structure Information from Crystallographic Data</article-title>,&#x201d; in <source>Protein Complex&#x20;Assembly: Methods and Protocols</source>. Editor <person-group person-group-type="editor">
<name>
<surname>Marsh</surname>
<given-names>J.&#x20;A.</given-names>
</name>
</person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer New York</publisher-name>), <fpage>357</fpage>&#x2013;<lpage>375</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-4939-7759-8_23</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dey</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Levy</surname>
<given-names>E. D.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>PDB-wide Identification of Physiological Hetero-Oligomeric Assemblies Based on Conserved Quaternary Structure Geometry</article-title>. <source>Structure</source> <volume>29</volume>, <fpage>1303</fpage>&#x2013;<lpage>1311</lpage>. <pub-id pub-id-type="doi">10.1016/j.str.2021.07.012</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dey</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ritchie</surname>
<given-names>D. W.</given-names>
</name>
<name>
<surname>Levy</surname>
<given-names>E. D.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>PDB-wide Identification of Biological Assemblies from Conserved Quaternary Structure Geometry</article-title>. <source>Nat. Methods</source> <volume>15</volume>, <fpage>67</fpage>&#x2013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.4510</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>El-Gebali</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Mistry</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Bateman</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Eddy</surname>
<given-names>S. R.</given-names>
</name>
<name>
<surname>Luciani</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Potter</surname>
<given-names>S. C.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>The Pfam Protein Families Database in 2019</article-title>. <source>Nucleic Acids Res.</source> <volume>47</volume>, <fpage>D427</fpage>&#x2013;<lpage>D432</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky995</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Elcock</surname>
<given-names>A. H.</given-names>
</name>
<name>
<surname>McCammon</surname>
<given-names>J.&#x20;A.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Identification of Protein Oligomerization States by Analysis of Interface Conservation</article-title>. <source>Proc. Natl. Acad. Sci.</source> <volume>98</volume>, <fpage>2990</fpage>&#x2013;<lpage>2994</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.061411798</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Elez</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Bonvin</surname>
<given-names>A. M. J.&#x20;J.</given-names>
</name>
<name>
<surname>Vangone</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Biological vs. Crystallographic Protein Interfaces: An Overview of Computational Approaches for Their Classification</article-title>. <source>Crystals</source> <volume>10</volume>, <fpage>114</fpage>. <pub-id pub-id-type="doi">10.3390/cryst10020114</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Elofsson</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Toward Characterising the Cellular 3D-Proteome</article-title>. <source>Front. Bioinform.</source> <volume>1</volume>, <fpage>2</fpage>. <pub-id pub-id-type="doi">10.3389/fbinf.2021.598878</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Franzosa</surname>
<given-names>E. A.</given-names>
</name>
<name>
<surname>Xia</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Structural Determinants of Protein Evolution Are Context-Sensitive at the Residue Level</article-title>. <source>Mol. Biol. Evol.</source> <volume>26</volume>, <fpage>2387</fpage>&#x2013;<lpage>2395</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/msp146</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fukasawa</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tomii</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Accurate Classification of Biological and Non-biological Interfaces in Protein Crystal Structures Using Subtle&#x20;Covariation Signals</article-title>. <source>Sci. Rep.</source> <volume>9</volume>, <fpage>12603</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-019-48913-8</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Garcia-Seisdedos</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Empereur-Mot</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Elad</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Levy</surname>
<given-names>E. D.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Proteins Evolve on the Edge of Supramolecular Self-Assembly</article-title>. <source>Nature</source> <volume>548</volume>, <fpage>244</fpage>&#x2013;<lpage>247</lpage>. <pub-id pub-id-type="doi">10.1038/nature23320</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Goodsell</surname>
<given-names>D. S.</given-names>
</name>
<name>
<surname>Olson</surname>
<given-names>A. J.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Structural Symmetry and Protein Function</article-title>. <source>Annu. Rev. Biophys. Biomol. Struct.</source> <volume>29</volume>, <fpage>105</fpage>&#x2013;<lpage>153</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.biophys.29.1.105</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guharoy</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Chakrabarti</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Conservation and Relative Importance of Residues across Protein-Protein Interfaces</article-title>. <source>Proc. Natl. Acad. Sci.</source> <volume>102</volume>, <fpage>15447</fpage>&#x2013;<lpage>15452</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0505425102</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Henrick</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Thornton</surname>
<given-names>J.&#x20;M.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>PQS: a Protein Quaternary Structure File Server</article-title>. <source>Trends Biochem. Sci.</source> <volume>23</volume>, <fpage>358</fpage>&#x2013;<lpage>361</lpage>. <pub-id pub-id-type="doi">10.1016/s0968-0004(98)01253-5</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>H.-F.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Integrating Co-evolutionary Signals and Other Properties of Residue Pairs to Distinguish Biological Interfaces from crystal Contacts</article-title>. <source>Protein Sci.</source> <volume>27</volume>, <fpage>1723</fpage>&#x2013;<lpage>1735</lpage>. <pub-id pub-id-type="doi">10.1002/pro.3448</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Janin</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Specific versus Non-specific Contacts in Protein Crystals</article-title>. <source>Nat. Struct. Mol. Biol.</source> <volume>4</volume>, <fpage>973</fpage>&#x2013;<lpage>974</lpage>. <pub-id pub-id-type="doi">10.1038/nsb1297-973</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jim&#xe9;nez-Garc&#xed;a</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Elez</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Koukos</surname>
<given-names>P. I.</given-names>
</name>
<name>
<surname>Bonvin</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Vangone</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>PRODIGY-crystal: a Web-Tool for Classification of Biological Interfaces in Protein Complexes</article-title>. <source>Bioinformatics</source> <volume>35</volume>, <fpage>4821</fpage>&#x2013;<lpage>4823</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz437</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jumper</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Evans</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Pritzel</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Green</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Figurnov</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ronneberger</surname>
<given-names>O.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Highly Accurate Protein Structure Prediction with AlphaFold</article-title>. <source>Nature</source> <volume>596</volume>, <fpage>583</fpage>&#x2013;<lpage>589</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-021-03819-2</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Krissinel</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Henrick</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Inference of Macromolecular Assemblies from Crystalline State</article-title>. <source>J.&#x20;Mol. Biol.</source> <volume>372</volume>, <fpage>774</fpage>&#x2013;<lpage>797</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2007.05.022</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Levy</surname>
<given-names>E. D.</given-names>
</name>
<name>
<surname>Pereira-Leal</surname>
<given-names>J.&#x20;B.</given-names>
</name>
<name>
<surname>Chothia</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Teichmann</surname>
<given-names>S. A.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>3D Complex: a Structural Classification of Protein Complexes</article-title>. <source>Plos Comp. Biol.</source> <volume>2</volume>, <fpage>e155</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.0020155.eor</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Levy</surname>
<given-names>E. D.</given-names>
</name>
<name>
<surname>Teichmann</surname>
<given-names>S. A.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Structural, Evolutionary, and Assembly Principles of Protein Oligomerization</article-title>. <source>Prog. Mol. Biol. Transl. Sci.</source> <volume>117</volume>, <fpage>25</fpage>&#x2013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.1016/B978-0-12-386931-9.00002-7</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Levy</surname>
<given-names>E. D.</given-names>
</name>
<name>
<surname>Vogel</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>&#x201c;Structuromics&#x201d;: Another Step toward a&#x20;Holistic View of the Cell</article-title>. <source>Cell</source> <volume>184</volume>, <fpage>301</fpage>&#x2013;<lpage>303</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2020.12.030</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Marianayagam</surname>
<given-names>N. J.</given-names>
</name>
<name>
<surname>Sunde</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Matthews</surname>
<given-names>J.&#x20;M.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>The Power of Two: Protein Dimerization in Biology</article-title>. <source>Trends Biochem. Sci.</source> <volume>29</volume>, <fpage>618</fpage>&#x2013;<lpage>625</lpage>. <pub-id pub-id-type="doi">10.1016/j.tibs.2004.09.006</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Marsh</surname>
<given-names>J.&#x20;A.</given-names>
</name>
<name>
<surname>Teichmann</surname>
<given-names>S. A.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Structure, Dynamics, Assembly, and Evolution of Protein Complexes</article-title>. <source>Annu. Rev. Biochem.</source> <volume>84</volume>, <fpage>551</fpage>&#x2013;<lpage>575</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-biochem-060614-034142</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mitra</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Pal</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Combining Bayes Classification and Point Group Symmetry under Boolean Framework for Enhanced Protein Quaternary Structure Inference</article-title>. <source>Structure</source> <volume>19</volume>, <fpage>304</fpage>&#x2013;<lpage>312</lpage>. <pub-id pub-id-type="doi">10.1016/j.str.2011.01.009</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Pearson</surname>
<given-names>W. R.</given-names>
</name>
</person-group> (<year>1990</year>). &#x201c;<article-title>[5] Rapid and Sensitive Sequence Comparison with FASTP and FASTA</article-title>,&#x201d; in <source>Methods in Enzymology</source> (<publisher-loc>Cambridge, UK</publisher-loc>: <publisher-name>Academic Press</publisher-name>), <fpage>63</fpage>&#x2013;<lpage>98</lpage>. <pub-id pub-id-type="doi">10.1016/0076-6879(90)83007-V</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Perutz</surname>
<given-names>M. F.</given-names>
</name>
<name>
<surname>Rossmann</surname>
<given-names>M. G.</given-names>
</name>
<name>
<surname>Cullis</surname>
<given-names>A. F.</given-names>
</name>
<name>
<surname>Muirhead</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Will</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>North</surname>
<given-names>A. C. T.</given-names>
</name>
</person-group> (<year>1960</year>). <article-title>Structure of H&#xe6;moglobin: A Three-Dimensional Fourier Synthesis at 5.5-&#xc5;. Resolution, Obtained by X-Ray Analysis</article-title>. <source>Nature</source> <volume>185</volume>, <fpage>416</fpage>&#x2013;<lpage>422</lpage>. <pub-id pub-id-type="doi">10.1038/185416a0</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ponstingl</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Henrick</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Thornton</surname>
<given-names>J.&#x20;M.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Discriminating between Homodimeric and Monomeric Proteins in the Crystalline State</article-title>. <source>Proteins</source> <volume>41</volume>, <fpage>47</fpage>&#x2013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1002/1097-0134(20001001)41:1&#x3c;47:aid-prot80&#x3e;3.0.co;2-8</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Postic</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Andreani</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Marcoux</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Reys</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Guerois</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Rey</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Proteo3Dnet: a Web Server for the Integration of Structural Information with Interactomics Data</article-title>. <source>Nucleic Acids Res.</source> <volume>49</volume>, <fpage>W567</fpage>&#x2013;<lpage>W572</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkab332</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Prasad Bahadur</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Chakrabarti</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Rodier</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Janin</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>A Dissection of Specific and Non-specific Protein-Protein Interfaces</article-title>. <source>J.&#x20;Mol. Biol.</source> <volume>336</volume>, <fpage>943</fpage>&#x2013;<lpage>955</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2003.12.073</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Quadir</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Roy</surname>
<given-names>R. S.</given-names>
</name>
<name>
<surname>Halfmann</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>DNCON2_Inter: Predicting Interchain Contacts for Homodimeric and Homomultimeric Protein Complexes Using Multiple Sequence Alignments of Monomers&#x20;and Deep Learning</article-title>. <source>Sci. Rep.</source> <volume>11</volume>, <fpage>12295</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-021-91827-7</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Quignot</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Granger</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Chac&#xf3;n</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Guerois</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Andreani</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Atomic-level Evolutionary Information Improves Protein-Protein Interface Scoring</article-title>. <source>Bioinformatics</source> <volume>37</volume>, <fpage>3175</fpage>&#x2013;<lpage>3181</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btab254</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ritchie</surname>
<given-names>D. W.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Calculating and Scoring High Quality Multiple Flexible Protein Structure Alignments</article-title>. <source>Bioinformatics</source> <volume>32</volume>, <fpage>2650</fpage>&#x2013;<lpage>2658</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btw300</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rolland</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Ta&#x15f;an</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Charloteaux</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Pevzner</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Zhong</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Sahni</surname>
<given-names>N.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>A Proteome-Scale Map of the Human Interactome Network</article-title>. <source>Cell</source> <volume>159</volume>, <fpage>1212</fpage>&#x2013;<lpage>1226</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2014.10.050</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rose</surname>
<given-names>P. W.</given-names>
</name>
<name>
<surname>Prli&#x107;</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Altunkaya</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Bi</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Bradley</surname>
<given-names>A. R.</given-names>
</name>
<name>
<surname>Christie</surname>
<given-names>C. H.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>The RCSB Protein Data Bank: Integrative View of Protein, Gene and 3D Structural Information</article-title>. <source>Nucleic Acids Res.</source> <volume>45</volume>, <fpage>D271</fpage>&#x2013;<lpage>D281</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkw1000</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sali</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>From Integrative Structural Biology to Cell Biology</article-title>. <source>J.&#x20;Biol. Chem.</source> <volume>296</volume>, <fpage>100743</fpage>. <pub-id pub-id-type="doi">10.1016/j.jbc.2021.100743</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sehnal</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Bittrich</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Deshpande</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Svobodov&#xe1;</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Berka</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Bazgier</surname>
<given-names>V.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Mol&#x2a; Viewer: Modern Web App for 3D Visualization and Analysis of Large Biomolecular Structures</article-title>. <source>Nucleic Acids Res.</source> <volume>49</volume>, <fpage>W431</fpage>&#x2013;<lpage>W437</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkab314</pub-id> </citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tsuchiya</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Nakamura</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Kinoshita</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Discrimination between Biological Interfaces and crystal-packing Contacts</article-title>. <source>Aabc</source> <volume>1</volume>, <fpage>99</fpage>&#x2013;<lpage>113</lpage>. <pub-id pub-id-type="doi">10.2147/aabc.s4255</pub-id> </citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Dunbrack</surname>
<given-names>R. L.</given-names>
<suffix>Jr</suffix>
</name>
</person-group> (<year>2019</year>). <article-title>Principles and Characteristics of Biological Assemblies in Experimentally Determined Protein Structures</article-title>. <source>Curr. Opin. Struct. Biol.</source> <volume>55</volume>, <fpage>34</fpage>&#x2013;<lpage>49</lpage>. <pub-id pub-id-type="doi">10.1016/j.sbi.2019.03.006</pub-id> </citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Dunbrack</surname>
<given-names>R. L.</given-names>
<suffix>Jr</suffix>
</name>
</person-group> (<year>2020</year>). <article-title>ProtCID: a Data Resource for Structural Information on Protein Interactions</article-title>. <source>Nat. Commun.</source> <volume>11</volume>, <fpage>711</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-020-14301-4</pub-id> </citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Dunbrack</surname>
<given-names>R. L.</given-names>
<suffix>Jr</suffix>
</name>
</person-group> (<year>2011</year>). <article-title>The Protein Common Interface Database (ProtCID)-Aa Comprehensive Database of Interactions of Homologous Proteins in Multiple crystal Forms</article-title>. <source>Nucleic Acids Res.</source> <volume>39</volume>, <fpage>D761</fpage>&#x2013;<lpage>D770</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkq1059</pub-id> </citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yan</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>S.-Y.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Accurate Prediction of Inter-protein Residue-Residue Contacts for Homo-Oligomeric Protein Complexes</article-title>. <source>Brief. Bioinform.</source> <volume>22</volume>, <fpage>bbab038</fpage>. <pub-id pub-id-type="doi">10.1093/bib/bbab038</pub-id> </citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yates</surname>
<given-names>C. M.</given-names>
</name>
<name>
<surname>Sternberg</surname>
<given-names>M. J.&#x20;E.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>The Effects of Non-synonymous Single Nucleotide Polymorphisms (nsSNPs) on Protein-Protein Interactions</article-title>. <source>J.&#x20;Mol. Biol.</source> <volume>425</volume>, <fpage>3949</fpage>&#x2013;<lpage>3963</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2013.07.012</pub-id> </citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Domingues</surname>
<given-names>F. S.</given-names>
</name>
<name>
<surname>Sommer</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Lengauer</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>NOXclass: Prediction of Protein-Protein Interaction Types</article-title>. <source>BMC Bioinformatics</source> <volume>7</volume>, <fpage>27</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-7-27</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>