<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2014.00370</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Hypothesis and Theory Article</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>A comprehensive analysis of the Omp85/TpsB protein superfamily structural diversity, taxonomic occurrence, and evolution</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Heinz</surname> <given-names>Eva</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/115435"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Lithgow</surname> <given-names>Trevor</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://community.frontiersin.org/people/u/165948"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Microbiology, Monash University</institution> <country>Melbourne, VIC, Australia</country></aff>
<aff id="aff2"><sup>2</sup><institution>Victorian Bioinformatics Consortium, Monash University</institution> <country>Melbourne, VIC, Australia</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: <italic>Frank T. Robb, University of California, USA</italic></p></fn>
<fn fn-type="edited-by"><p>Reviewed by: <italic>Dirk Linke, Max Planck Society, Germany; David L. Bernick, University of California, Santa Cruz, USA</italic></p></fn>
<fn fn-type="corresp" id="fn002"><p>&#x0002A;Correspondence: <italic>Trevor Lithgow, Department of Microbiology, Monash University, Melbourne, VIC 3800, Australia e-mail: <email>trevor.lithgow@monash.edu</email></italic></p></fn>
<fn fn-type="other" id="fn001"><p>This article was submitted to Evolutionary and Genomic Microbiology, a section of the journal Frontiers in Microbiology.</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>21</day>
<month>07</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="collection">
<year>2014</year>
</pub-date>
<volume>5</volume>
<elocation-id>370</elocation-id>
<history>
<date date-type="received">
<day>06</day>
<month>06</month>
<year>2014</year>
</date>
<date date-type="accepted">
<day>02</day>
<month>07</month>
<year>2014</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2014 Heinz and Lithgow.</copyright-statement>
<copyright-year>2014</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/"><p> This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Members of the Omp85/TpsB protein superfamily are ubiquitously distributed in Gram-negative bacteria, and function in protein translocation (e.g., FhaC) or the assembly of outer membrane proteins (e.g., BamA). Several recent findings are suggestive of a further level of variation in the superfamily, including the identification of the novel membrane protein assembly factor TamA and protein translocase PlpD. To investigate the diversity and the causal evolutionary events, we undertook a comprehensive comparative sequence analysis of the Omp85/TpsB proteins. A total of 10 protein subfamilies were apparent, distinguished in their domain structure and sequence signatures. In addition to the proteins FhaC, BamA, and TamA, for which structural and functional information is available, are families of proteins with so far undescribed domain architectures linked to the Omp85 &#x003B2;-barrel domain. This study brings a classification structure to a dynamic protein superfamily of high interest given its essential function for Gram-negative bacteria as well as its diverse domain architecture, and we discuss several scenarios of putative functions of these so far undescribed proteins.</p>
</abstract>
<kwd-group>
<kwd>outer membrane protein assembly</kwd>
<kwd>Omp85</kwd>
<kwd>Omp85/TpsB superfamily</kwd>
<kwd>two-partner secretion</kwd>
<kwd>BamA</kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="69"/>
<page-count count="13"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec>
<title>INTRODUCTION</title>
<p>The Omp85/TpsB protein superfamily is a unique group of bacterial outer membrane proteins, which can function as protein translocases or as membrane protein assembly factors (<xref ref-type="bibr" rid="B41">Mazar and Cotter, 2007</xref>; <xref ref-type="bibr" rid="B26">Hagan et al., 2011</xref>). A well-studied example has been described for each of these two functions. The TpsB family protein FhaC secretes a partner protein (FHA) through the outer membrane to the extracellular milieu (<xref ref-type="bibr" rid="B41">Mazar and Cotter, 2007</xref>; <xref ref-type="bibr" rid="B30">Jacob-Dubuisson et al., 2013</xref>). The Omp85 family protein BamA functions as a chaperone, receiving nascent &#x003B2;-barrel proteins from periplasmic chaperones and assembling these into the outer membrane (<xref ref-type="bibr" rid="B26">Hagan et al., 2011</xref>; <xref ref-type="bibr" rid="B34">Kim et al., 2012</xref>).</p>
<p>The Omp85/TpsB protein superfamily is characterized through sequence similarity and shared structural characteristics (<xref ref-type="bibr" rid="B68">Yen et al., 2002</xref>; <xref ref-type="bibr" rid="B46">Moslavac et al., 2005</xref>); there is, however, a clear separation between the Omp85 family (e.g., BamA) and TpsB family (e.g., FhaC) at the sequence level. This is reflected in two defining Pfam profiles: PF01103 (&#x0201C;Bac_surface_Ag&#x0201D;) for Omp85 proteins and PF03865 (&#x0201C;ShlB&#x0201D;) for TpsB proteins. Despite this distinction, there is an underlying sequence similarity in the membrane-embedded &#x003B2;-barrel domains (<xref ref-type="bibr" rid="B68">Yen et al., 2002</xref>; <xref ref-type="bibr" rid="B46">Moslavac et al., 2005</xref>), which is also represented on a structural level (<xref ref-type="bibr" rid="B16">Clantin et al., 2007</xref>; <xref ref-type="bibr" rid="B25">Gruss et al., 2013</xref>; <xref ref-type="bibr" rid="B50">Noinaj et al., 2013</xref>). In both of these proteins, a series of &#x0223C;10 kDa globular domains (Polypeptide Transport Domains or POTRAs; <xref ref-type="bibr" rid="B57">Sanchez-Pulido et al., 2003</xref>) stretch out from the N-terminal part of the barrel domain, and are located within the bacterial periplasm.</p>
<p>Differences between the two families are also found in their taxonomic distribution. TpsB proteins function as translocases dedicated to the secretion of a single protein substrate, characteristically haemagglutinin-like partner proteins, and they are therefore found predominantly in pathogenic organisms in a distribution pattern indicative of horizontal gene transfer (HGT). Conversely, the Omp85 protein BamA is essential for the assembly of &#x003B2;-barrel proteins, and Omp85 family proteins have been reported in all Gram-negative phyla (<xref ref-type="bibr" rid="B13">Cavalier-Smith, 2006</xref>; <xref ref-type="bibr" rid="B65">Sutcliffe, 2010</xref>; <xref ref-type="bibr" rid="B21">Errington, 2013</xref>). Mitochondria and plastids, as eukaryotic organelles derived from bacterial endosymbionts, each harbor an Omp85 protein in their outer membranes. These proteins are homologs of BamA, chaperoning the assembly of &#x003B2;-barrel proteins into organellar outer membranes. The mitochondrial Omp85 protein, Sam50, is most similar to &#x003B1;-proteobacterial BamA (<xref ref-type="bibr" rid="B24">Gentle et al., 2004</xref>) and the plastid proteins Toc75-III and Oep80 are most similar to the cyanobacterial Omp85 proteins (<xref ref-type="bibr" rid="B10">Bolter et al., 1998</xref>; <xref ref-type="bibr" rid="B54">Reumann and Keegstra, 1999</xref>; <xref ref-type="bibr" rid="B58">Schleiff and Becker, 2011</xref>). This correlates with our understanding of the ancestry of the organelles.</p>
<p>Two recent findings have highlighted the complexity of this superfamily, and call for a refinement of the existing Omp85/TpsB dichotomy. The translocation and assembly machinery (TAM) consists of the outer membrane protein TamA and the inner membrane protein TamB (<xref ref-type="bibr" rid="B60">Selkrig et al., 2012</xref>), and functions in the assembly of outer membrane proteins. Structurally, TamA is similar to BamA (<xref ref-type="bibr" rid="B25">Gruss et al., 2013</xref>; <xref ref-type="bibr" rid="B50">Noinaj et al., 2013</xref>), but has only three POTRA domains and can be clearly distinguished from BamA based on sequence characteristics. A further Omp85 protein was identified recently in <italic>Pseudomonas aeruginosa</italic>, the patatin-like Omp85 protein PlpD, which carries a single POTRA domain followed by a patatin domain at the N-terminus. The patatin domain is translocated across the outer membrane and released into the environment, potentially acting as a virulence factor for <italic>Pseudomonas</italic> (<xref ref-type="bibr" rid="B56">Salacha et al., 2010</xref>).</p>
<p>To understand the diversity and distribution of this important protein superfamily, we performed a comprehensive analysis, extracting all detectable Omp85/TpsB-like sequences from current databases, followed by manual curation. Clustering analysis was used to group the sequences, and further analyses were used to improve this grouping scheme. We observed 10 domain architectures, several of them so far undescribed, and we have developed a comprehensive classification scheme based around the domain structure and sequence characteristics. This classification scheme provides a framework for functional associations, and yields useful insights into the way this family of proteins has evolved. The dynamic evolutionary history of the Omp85/TpsB superfamily is reminiscent of other molecular chaperones, and the implications of these similarities are discussed.</p>
</sec>
<sec id="s1" sec-type="materials|methods">
<title>MATERIALS AND METHODS</title>
<sec>
<title>DATABASES AND SOFTWARE PACKAGES</title>
<p>All searches were performed against, and sequences and taxonomic information were retrieved from, the UniProt database (<xref ref-type="bibr" rid="B40">Magrane and Consortium, 2011</xref>; release 06032013) unless stated otherwise. Protein domains were retrieved from the Interpro database (<xref ref-type="bibr" rid="B29">Hunter et al., 2012</xref>; version 41.0). Markov Clustering (MCL) was performed using the mclblastline suite (mcl version 12-135; <xref ref-type="bibr" rid="B20">Enright et al., 2002</xref>), with several different inflation parameters, where the optimal settings were chosen after manual inspections of the resulting datasets with respect to known functionally different homologs (BamA, TamA, Sam50, Sam51); all-against-all blast values for mclblastline clustering were obtained by using the blastall -p blastp command (blastall 2.2.24) with the -m8 output option, all other settings as default. For network representations in cytoscape (version 3.1; <xref ref-type="bibr" rid="B61">Shannon et al., 2003</xref>), protein diversity was first reduced by clustering all sequences with the usearch program (<xref ref-type="bibr" rid="B19">Edgar, 2010</xref>; search performed using the &#x02013;cluster_fast algorithm with a cutoff of &#x02013;id 0.80, the &#x02013;centroid command was used to obtain the sequences). The resulting sequences were used as input for an all-against-all blastp run (version 2.2.26+; cutoff <italic>e-</italic>value 1E-5) and self-loops were removed before network analyses. For clustering of the barrel or N-terminal domains only, the same accession numbers as used for the full-length clustering (i.e., the centroids resulting from uclust) were retrieved from the respective barrel-only or N-terminus-only sequence sets; the formation of these datasets is described below. 
Lipoprotein signature signal sequences were identified with the LipoP predictor with default settings (version 1.0, <xref ref-type="bibr" rid="B31">Juncker et al., 2003</xref>), and secondary structure predictions to identify and confirm POTRA and other domains in novel Omp85 subfamilies were performed using Phyre2 (<xref ref-type="bibr" rid="B33">Kelley and Sternberg, 2009</xref>) and Praline (<xref ref-type="bibr" rid="B63">Simossis and Heringa, 2005</xref>). For clusters >100 amino acids, usearch was used as above to reduce the number of sequences at &#x02013;id 0.50 prior to submission to Phyre2. The heatmap was generated with the R software package (The R Project for Statistical Computing)<sup><xref ref-type="fn" rid="fn01">1</xref></sup> using the &#x0201C;heatmap&#x0201D; command with the scale set to &#x0201C;none,&#x0201D; and representation of protein structures was performed using the UCSF Chimera package (<xref ref-type="bibr" rid="B52">Pettersen et al., 2004</xref>).</p>
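<p>The network preparation step described above (an e-value cutoff of 1E-5 followed by removal of self-loops) can be illustrated with a minimal sketch. It assumes the standard 12-column tabular BLAST output, in which columns 1, 2, and 11 hold the query, subject, and e-value. The function name <monospace>network_edges</monospace> is hypothetical, and the collapsing of reciprocal hits into a single undirected edge is an assumption made for illustration rather than a step stated in the text.</p>
<preformat>
```python
def network_edges(blast_lines, max_evalue=1e-5):
    """Turn tabular BLAST hits into undirected network edges.

    Assumes the standard 12-column tabular layout:
    column 1 = query, column 2 = subject, column 11 = e-value.
    Self-hits are dropped, hits above the e-value cutoff are dropped,
    and reciprocal hits are collapsed to one edge (an assumption of
    this sketch) that keeps the lowest e-value observed.
    """
    best = {}
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        query, subject, evalue = fields[0], fields[1], float(fields[10])
        if query == subject:
            continue  # remove self-loops
        if evalue > max_evalue:
            continue  # apply the e-value cutoff
        key = tuple(sorted((query, subject)))
        best[key] = min(evalue, best.get(key, evalue))
    return sorted((a, b, e) for (a, b), e in best.items())
```
</preformat>
<p>The returned list of (node, node, e-value) tuples could then be written out as a simple edge table for import into a network viewer.</p>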
</sec>
<sec>
<title>Omp85/TpsB SUPERFAMILY DATASET GENERATION</title>
<p>The initial HMMER profiles were retrieved from the Pfam website<sup><xref ref-type="fn" rid="fn02">2</xref></sup> (<xref ref-type="bibr" rid="B53">Punta et al., 2012</xref>) as PF01103.18 and PF03865.8, and searched against UniProt. The HMMER search (version 3.1dev; <xref ref-type="bibr" rid="B17">Eddy, 2011</xref>) was performed with hmmsearch using an <italic>e</italic>-value cutoff &#x02013;incE 1 for the PF01103 dataset and &#x02013;incE 0.1 for the PF03865 dataset and both searches were performed by disabling all additional filters (&#x02013;max option). Following manual inspections, we decided to include all hits below the inclusion cutoff for further analyses as well, as several Omp85/TpsB-like proteins were identified below the cutoff values, resulting in a combined dataset of 13,713 protein sequences after removing proteins detected by both profiles. We sought to better distinguish contaminants, which share some underlying sequence similarity with Omp85/TpsB proteins but belong to different protein families, from highly divergent Omp85/TpsB proteins. To this end, sequences were grouped into their UniProt100 groups to decrease the sample size, and clustered using the mclblastline (<italic>e</italic>-value cutoff of 1E-2, inflation value 1.5, scheme 7). These initial clusters were manually investigated to identify contaminants by analysing similarity of the proteins in the nr and UniProt databases, Pfam domain profiles and additional domain and other annotations as given in public databases. In any cluster containing contaminants belonging to different protein families, all proteins grouped in this cluster (including hypothetical and unknown proteins without annotated features) were considered contaminants; whereas in a cluster containing Omp85/TpsB-like proteins, all proteins (including hypothetical and unknown without annotated features) were considered Omp85/TpsB members. 
No contradicting clusters (being a mixture of clear contaminants and true Omp85/TpsB proteins) were encountered. After removal of all contaminants from the original search results (i.e., removal of all sequences belonging to the respective UniProt100 groups judged as contaminants), the final dataset was clustered again using mclblastline (<italic>e</italic>-value cutoff 1E-2, inflation value 1.3, scheme 7). A final curation step included removal of sequences of fewer than 250 aa, and the final dataset consisted of 12,869 proteins in 40 clusters; all accession numbers for the respective clusters are given in <bold>Table <xref ref-type="supplementary-material" rid="ST1">S1</xref></bold>. For analyses of the presence or absence of the respective copies, only proteins and their corresponding taxa flagged as &#x0201C;complete proteome&#x0201D; entries in the UniProt database were considered. The taxonomic tree used to plot different numbers of paralogs and orthologs was obtained from sTOL (<xref ref-type="bibr" rid="B22">Fang et al., 2013)</xref><sup><xref ref-type="fn" rid="fn03">3</xref></sup>, download date 30. 04. 2014. The graphical tree representation was prepared using the iTol web tool (<xref ref-type="bibr" rid="B38">Letunic and Bork, 2011</xref>).</p>
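<p>The cluster-level curation rule described above can be summarized in a short sketch. It assumes that clusters are available as a mapping from a cluster identifier to its member accessions, that manually flagged contaminants are given as a set, and that sequence lengths are known. The function name <monospace>curate</monospace> and all data structures are hypothetical, and the sketch applies the contaminant rule and the 250 aa length filter in a single pass, whereas the study re-clustered the dataset between the two steps.</p>
<preformat>
```python
def curate(clusters, flagged_contaminants, lengths, min_length=250):
    """Apply cluster-level contaminant removal and a length filter.

    clusters: dict mapping cluster id to a list of accessions
    flagged_contaminants: set of accessions judged to be contaminants
    lengths: dict mapping accession to sequence length in amino acids

    A cluster holding any flagged contaminant is discarded as a whole,
    including its unannotated members. Members of the remaining
    clusters are kept only if they reach min_length.
    """
    kept = {}
    for cluster_id, members in clusters.items():
        if any(m in flagged_contaminants for m in members):
            continue  # whole cluster is treated as contaminant
        long_enough = [m for m in members if lengths[m] >= min_length]
        if long_enough:
            kept[cluster_id] = long_enough
    return kept
```
</preformat>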
</sec>
<sec>
<title>DATASET GENERATION TO ANALYZE N-TERMINI, BARREL REGIONS, AND POTRAs</title>
<p>For the barrel-only dataset used in the protein&#x02013;protein similarity network analyses as indicated in the figure legend, all sequences were retrieved using the first position of the alignment (the &#x0201C;envelope start&#x0201D; position) as given in the initial HMMER search result as the N-terminal border of the barrel, and the actual end of the protein sequence as the C-terminal border. For proteins retrieved in both searches, the higher scoring HMMER result was used. The N-terminal dataset for all sequences was retrieved using the actual start position of the sequence as N-terminus and the first position of the HMMER search alignment region (i.e., the start of the barrel domain as described above) as C-terminus; since some subfamilies have only a very short N-terminal region, sequences with less than 20 aa remaining for the N-terminus were removed from the dataset. For the POTRA analyses, the respective main clusters (minimum 30 members) as given in <bold>Table <xref ref-type="supplementary-material" rid="ST1">S1</xref></bold> with predicted POTRA domains (BamA, TamA, BamA-like, Patatin-like, Sam50, FhaC, Hmw1B, Lipo) were reduced to id 0.50 using uclust. These sequences were submitted to the Praline (<xref ref-type="bibr" rid="B63">Simossis and Heringa, 2005</xref>) web server, and the secondary structure prediction was performed with the implemented PsiPred program (<xref ref-type="bibr" rid="B43">McGuffin et al., 2000</xref>). The POTRA domains were subsequently extracted from the aligned id 0.50 datasets, and sequences &#x0003C;25 aa and >125 aa were removed. Only one set of POTRA domains per cluster was defined, removing additionally gained POTRA domains in small numbers of sequences. In addition, we extracted all FtsQ sequences available in the Swissprot database (retrieved on 12. 02. 
2014 online; search term &#x0201C;PF03799&#x0201D;), extracted the POTRA domains as described above, and added them to our dataset, which was then used for clustering in Cytoscape as described above with an <italic>e</italic>-value cutoff of 1E-3.</p>
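<p>The splitting of each sequence into an N-terminal region and a barrel region, and the length filter applied to POTRA domains, can be sketched as follows. The sketch assumes that the envelope start reported by HMMER is a 1-based coordinate. The function names <monospace>split_at_envelope</monospace> and <monospace>keep_potra_lengths</monospace> are hypothetical and serve only to illustrate the thresholds given in the text (20 aa for N-termini, 25 to 125 aa for POTRA domains).</p>
<preformat>
```python
def split_at_envelope(sequence, env_start, min_nterm=20):
    """Split a protein at the 1-based envelope start of the barrel hit.

    Returns (nterm, barrel). The barrel runs from the envelope start to
    the actual end of the sequence. The N-terminal region runs from the
    start of the sequence up to the residue before the envelope start
    and is returned as None when it has fewer than min_nterm residues.
    """
    nterm = sequence[:env_start - 1]
    barrel = sequence[env_start - 1:]
    if min_nterm > len(nterm):
        nterm = None  # too short to be kept in the N-terminal dataset
    return nterm, barrel


def keep_potra_lengths(domains, min_len=25, max_len=125):
    """Keep extracted POTRA sequences of min_len to max_len residues."""
    return [d for d in domains if max_len >= len(d) >= min_len]
```
</preformat>
<p>For example, a sequence of 80 residues with an envelope start at position 31 would yield a 30-residue N-terminal region and a 50-residue barrel region.</p>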
</sec>
<sec>
<title>PHYLOGENETIC TREE INFERENCE</title>
<p>Alignments were generated with MUSCLE (<xref ref-type="bibr" rid="B18">Edgar, 2004</xref>), and sites for tree inference were chosen using trimAl under the &#x0201C;-automated1&#x0201D; setting (<xref ref-type="bibr" rid="B12">Capella-Gutierrez et al., 2009</xref>). Trees were calculated using Phylobayes v3.3d (<xref ref-type="bibr" rid="B37">Lartillot et al., 2009</xref>) under the C20 or C60 model as indicated in the figure legends, with two independent chains for each; chain convergence was analyzed manually using the bpcomp and tracecomp commands as suggested by the authors (<xref ref-type="bibr" rid="B37">Lartillot et al., 2009</xref>), and posterior probabilities are shown as branch support values.</p>
</sec>
</sec>
<sec>
<title>RESULTS</title>
<sec>
<title>THE Omp85/TpsB SUPERFAMILY IS COMPOSED OF 10 DISTINCT SUBFAMILIES</title>
<p>The defining feature of the Omp85/TpsB superfamily is the membrane-embedded barrel domain (<xref ref-type="bibr" rid="B24">Gentle et al., 2004</xref>; <xref ref-type="bibr" rid="B7">Arnold et al., 2010</xref>; <xref ref-type="bibr" rid="B56">Salacha et al., 2010</xref>; <xref ref-type="bibr" rid="B60">Selkrig et al., 2012</xref>). To find the maximal number of Omp85/TpsB proteins from which to start a classification, only the conserved regions of the barrel-domain sequences (see section &#x0201C;Methods&#x0201D;) were used as search input. By this definition, a search against the UniProt database and manual curation identified 12,869 protein sequences in bacteria and eukaryotes as members of the Omp85/TpsB superfamily (<bold>Table <xref ref-type="supplementary-material" rid="ST1">S1</xref></bold>). No Omp85/TpsB proteins were detected in archaea.</p>
<p>Unexpectedly, many proteins were discovered to be distinct from the known domain arrangement based on an absence of POTRA sequences in their domain profiles. The 40 clusters retrieved from our initial sequence clustering could be resolved to represent 10 protein subfamilies in bacteria (<bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>). Most of these have not been recognized previously, including POTRA-containing Omp85 proteins divergent from the cognate BamA and TamA (&#x0201C;BamA-like&#x0201D;), as well as non-POTRA domain architectures described below (<bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>; <bold>Table <xref ref-type="supplementary-material" rid="ST2">S2</xref></bold>). The sequence-based split of the TpsB family into two groups (&#x0201C;FhaC&#x0201D; and &#x0201C;Hmw1B&#x0201D;) was observed as before (<xref ref-type="bibr" rid="B30">Jacob-Dubuisson et al., 2013</xref>), and no further subfamilies or domain profiles could be identified associated with the TpsB-type barrel domain.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p><bold>Structural diversity of the Omp85/TpsB superfamily.</bold> Schematic representation of the domain architectures (detailed in <bold>Table <xref ref-type="supplementary-material" rid="ST2">S2</xref></bold>) of the ten bacterial protein subfamilies that comprise the Omp85/TpsB superfamily, as well as the eukaryotic Sam50. The cyanobacterial BamA is shown as a separate group due to its exceptional domain architecture within the BamA subfamily. Also shown are the crystal structures for the three known exemplars: BamA (PDB 4K3B; <xref ref-type="bibr" rid="B50">Noinaj et al., 2013</xref>), TamA (PDB 4C00; <xref ref-type="bibr" rid="B25">Gruss et al., 2013</xref>) and FhaC (PDB 2QDZ; <xref ref-type="bibr" rid="B16">Clantin et al., 2007</xref>). In each case the POTRA domains can be seen emanating from the N-terminal region of the barrel domain.</p></caption>
<graphic xlink:href="fmicb-05-00370-g001.tif"/>
</fig>
<p>The most conservative hypothesis for the function of the unknown subfamilies with high similarity to Omp85 proteins is a role in some aspect of protein assembly into or across the outer membrane. This is the general function of Omp85 family members, but experimentation will be required to test this hypothesis. The diverse domain architectures identified in the N-terminal region of the Omp85 barrel serve to define the ten protein subfamilies (Figures <xref ref-type="fig" rid="F1">1</xref> and <xref ref-type="fig" rid="F2">2A</xref>).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p><bold>Distinctions between the Omp85/TpsB subgroups in sequence similarities. (A)</bold> Protein&#x02013;protein similarity network representation of full-length sequences, demonstrating the ten bacterial subfamilies; due to their origin from bacterial BamA, the eukaryotic sequences were included in the BamA subfamily. <bold>(C)</bold> The similarity network representation of barrel-domain sequences and <bold>(E)</bold> the similarity network representation of N-terminal domain sequences, where the colors describe the different subfamilies as depicted in <bold>(A)</bold>. The circled area in <bold>(E)</bold> illustrates a connected cluster consisting of proteins encoding one or more POTRA domains, whereas the sequences with alternative (non-POTRA) N-terminal domains segregate into distinct groups. <bold>(B,D,F)</bold> are recolorings of <bold>(A,C,E)</bold>, respectively, according to different bacterial Phyla (eukaryotes in gray). The color corresponding to each phylum is depicted in <bold>(B)</bold>.</p></caption>
<graphic xlink:href="fmicb-05-00370-g002.tif"/>
</fig>
<p>Proteins in the WD40-Omp85 cluster have a beta-propeller-like structure encoded in the N-terminal WD40 domain repeat sequences (<bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>; <bold>Table <xref ref-type="supplementary-material" rid="ST3">S3</xref></bold>). There are two relevant WD40 domain proteins associated with the functions ascribed to the Omp85 family. The first, TolB, is a periplasmic component of the bacterial Tol-Pal system with a WD40 domain structure (<xref ref-type="bibr" rid="B11">Bonsor et al., 2007</xref>); the beta-propeller domain of TolB also shows the highest structural similarity to the Omp85 WD40 domain structure. A function in peptidoglycan recycling, or the covalent linking with lipoproteins, was suggested for TolB (<xref ref-type="bibr" rid="B2">Abergel et al., 1999</xref>) and its partner protein Pal can interact with BamA (<xref ref-type="bibr" rid="B5">Anwari et al., 2010</xref>). BamB is a highly conserved WD40 protein found in most Proteobacteria (<xref ref-type="bibr" rid="B6">Anwari et al., 2012</xref>) that serves as a lipoprotein partner of BamA (<xref ref-type="bibr" rid="B3">Albrecht and Zeth, 2011</xref>; <xref ref-type="bibr" rid="B28">Heuck et al., 2011</xref>; <xref ref-type="bibr" rid="B35">Kim and Paetzel, 2011</xref>; <xref ref-type="bibr" rid="B49">Noinaj et al., 2011</xref>). These Omp85 WD40-like proteins are therefore reminiscent of a fusion between BamA and BamB, which serves as a platform for the attachment of other members of the BAM complex.</p>
<p>Like the TpsB proteins and the Toc75 found in plastids, the patatin-like Omp85 protein PlpD from <italic>Pseudomonas aeruginosa</italic> translocates proteins through the outer membrane. As characterized recently, PlpD delivers a lipolytic enzyme domain onto the bacterial surface by a mechanism that was suggested to be similar to that of FhaC (<xref ref-type="bibr" rid="B56">Salacha et al., 2010</xref>). This is all the more intriguing given the close similarity between PlpD and members of the Omp85 family, rather than TpsB family, of proteins (<bold>Figure <xref ref-type="fig" rid="F2">2C</xref></bold>). Structural investigations into the patatin-like Omp85 proteins will be fascinating, given that the structures of BamA and TamA both show the Omp85-type barrel domain to be fully closed to the extracellular milieu.</p>
<p>Depending on the final topology of the proteins, the Omp85-metalloproteases (&#x0201C;Metallo&#x0201D;) might aid in the proteolytic quality control in the periplasm as do proteases such as Clp and DegP (<xref ref-type="bibr" rid="B44">Merdanovic et al., 2011</xref>) or, by analogy with the action of the patatin-like Omp85 proteins, the metalloprotease domain could function as a virulence factor if translocated across the outer membrane. Theoretical support for the former hypothesis comes from observations that the specific metalloprotease domain (PF00149) found in these Omp85 proteins shows over 400 annotated domain architectures in Pfam, linking it to other domains that would be located in the periplasm/cell wall. These include domain architectures associated with periplasmic/outer envelope locations such as the peptidoglycan-binding LysM domain (PF01476), a cell-wall binding domain (PF04122), a Gram-positive anchor domain (PF00746) and S-layer domains (PF00395), all suggestive of a function in diverse cell envelope environments.</p>
<p>The Omp85 lipoproteins (&#x0201C;Lipo&#x0201D;) have three N-terminal POTRA domains (<bold>Table <xref ref-type="supplementary-material" rid="ST3">S3</xref></bold>), but the presence of a lipid anchor at the N-terminus of the first POTRA domain in 386 out of 513 proteins would attach the domain to the periplasmic surface of either the outer or inner membrane. It is uncertain whether three POTRA domains would be sufficient to span the periplasm in order to allow the lipid to anchor the N-terminus in the inner membrane. Positioning the N-terminal lipid at the periplasmic surface of the outer membrane would fix the POTRA domains, diminishing their flexibility and thereby constraining exposed regions of the POTRAs to assist interaction with other proteins. These Omp85 lipoproteins are detected in species throughout the Bacteroidetes and Chlorobi, with often more than one copy per genome. Besides BamA and TamA, the Omp85 lipoprotein subfamily is the only group of proteins with a taxonomic distribution indicating vertical inheritance rather than HGT (<bold>Figure <xref ref-type="fig" rid="F3">3</xref></bold>).</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p><bold>The uneven distribution of Omp85/TpsB subgroups by taxa. (A)</bold> Sequences of the Omp85/TpsB protein subfamilies are represented by bars plotted to the respective taxa in the guidance tree. Length of the bar indicates numbers of gene copies, bar color indicates the Omp85/TpsB subfamilies as in Figure <xref ref-type="fig" rid="F1">1</xref>, branch color indicates bacterial Phylum as displayed in Figure <xref ref-type="fig" rid="F1">1</xref>. <bold>(B)</bold> A heatmap indicating the percentage of completed genomes of the respective Phylum in which the respective Omp85/TpsB subfamily has been identified; colors are based on a percentage scale ranging from deep blue (100%) to white (0%).</p></caption>
<graphic xlink:href="fmicb-05-00370-g003.tif"/>
</fig>
<p>The Omp85 proteins without any N-terminal extension (&#x0201C;noNterm&#x0201D;; <bold>Figure <xref ref-type="fig" rid="F1">1</xref></bold>) might also function in membrane protein biogenesis, given the experimental observation that the mitochondrial homolog of BamA, Sam50, is functional in the binding and the assembly of &#x003B2;-barrel protein substrates into outer membranes even if the single POTRA domain is removed (<xref ref-type="bibr" rid="B64">Stroud et al., 2011</xref>). The barrel domains of these proteins show some sequence-based similarities to the Omp85 metalloprotease protein, and could be the ancestor of this subfamily, which subsequently gained the metalloprotease domain (Figures <xref ref-type="fig" rid="F2">2A,C</xref>; <bold>Table <xref ref-type="supplementary-material" rid="ST3">S2</xref></bold>).</p>
<p>The BamA-like proteins are another intriguing subfamily that have 1-3 N-terminal POTRA domains (<bold>Table <xref ref-type="supplementary-material" rid="ST3">S3</xref></bold>). They form distinct sequence cluster from the BamA sequences (Figures <xref ref-type="fig" rid="F2">2A,C,E</xref>; <bold>Table <xref ref-type="supplementary-material" rid="ST1">S1</xref></bold>) and are always present in addition to BamA (i.e., each organism with a BamA-like protein also encodes a protein grouped as &#x0201C;BamA&#x0201D; in this study). Based on their barrel+POTRA structure, we hypothesize that these function in a manner similar to BamA and TamA, as membrane protein assembly factors.</p>
<p>The sequence diversity between the subfamilies does not correlate with the taxa in which the sequences are found (<bold>Figure <xref ref-type="fig" rid="F2">2B</xref></bold>), which suggests that the ancestries of the ten protein subfamilies involve HGT as well as vertical descent. Investigating the sequence-based similarities on a large scale through visualization of the protein similarity network supported our manual annotation: this is true when considering full-length sequences (<bold>Figure <xref ref-type="fig" rid="F2">2B</xref></bold>), only the barrel domain sequences (<bold>Figure <xref ref-type="fig" rid="F2">2D</xref></bold>), or the N-terminal parts of the sequences (<bold>Figure <xref ref-type="fig" rid="F2">2F</xref></bold>), each of which shows a consistent clustering of the ten subfamilies.</p>
</sec>
<sec>
<title>THE TWO-PARTNER SECRETION SYSTEMS: FhaC-TYPE AND Hmw1B-TYPE</title>
<p>The network representation also supports previous observations of a split between two sequence groups of the TpsB proteins, the FhaC subgroup and the Hmw1B subgroup (<xref ref-type="bibr" rid="B30">Jacob-Dubuisson et al., 2013</xref>). We observe further differences in the taxonomic diversity of these two TpsB subfamilies: while the FhaC group consists almost exclusively of sequences from <italic>Proteobacteria</italic>, the Hmw1B subgroup comprises sequences from a large number of <italic>Cyanobacteria</italic> but also various <italic>Proteobacteria</italic>; in several cases the same taxa encode proteins of both the FhaC and the Hmw1B subgroups (<bold>Figure <xref ref-type="fig" rid="F3">3</xref></bold>).</p>
<p>Domain profiling identifies the barrel domain of the Hmw1B subfamily as an Omp85-type barrel in the majority of cases, whereas the FhaC group has the ShlB (TpsB)-type barrel (<bold>Table <xref ref-type="supplementary-material" rid="ST2">S2</xref></bold>). However, a structure-based search using Phyre2 indicates that the majority of the Hmw1B proteins are more similar to the FhaC structure than to the BamA structure (data not shown). The higher sequence similarity to the Omp85-type barrel rather than the TpsB type suggests the Hmw1B group could reflect a more ancestral state and possibly the origin of the TpsB family. This is also in accordance with its taxonomic distribution; the Hmw1B subgroup can be found predominantly in early-branching <italic>Cyanobacteria</italic>, whereas the FhaC-type proteins likely reflect a further level of specialization, possibly derived from a gene duplication of an Hmw1B protein and subsequent spread by HGT.</p>
</sec>
<sec>
<title>THE POTRA DOMAINS REVEAL STRIKING SPECIALIZATION</title>
<p>Previous analyses of POTRA sequences showed the sequence relationships between the mitochondrial Sam50 and the plastid Toc75 and Oep80 to proteobacterial and cyanobacterial sequences, respectively (<xref ref-type="bibr" rid="B7">Arnold et al., 2010</xref>). We therefore sought to expand this validated approach to use the POTRA domain sequence signatures for an understanding of evolution within the greater Omp85/TpsB superfamily. POTRA domain sequences from TamA, the BamA-like proteins, the Patatin-like sequences, the lipid-anchored BamA-like proteins (Lipo), as well as FtsQ, the only other protein known to encode POTRA domains (<xref ref-type="bibr" rid="B57">Sanchez-Pulido et al., 2003</xref>) were collected and compared.</p>
<p>The POTRA domains of TpsB proteins are so divergent that they conform to a separate Pfam profile (PF08479 &#x02013; &#x0201C;POTRA_2&#x0201D;). The majority of POTRA sequences from the Omp85 protein subfamilies conform to Pfam profile PF07244 (&#x0201C;Surf_Ag_VNR&#x0201D;), but even so, clear clusters of POTRA sequences are evident (<bold>Figure <xref ref-type="fig" rid="F4">4A</xref></bold>). In the case of the TamA protein subfamily and the Omp85-lipoprotein subfamily, the third POTRA domain shows remarkable similarity to the POTRA domains found in BamA, but the first two POTRA domains form discrete clusters. This indicates that while POTRA three is likely directly inherited from the original BamA duplication event leading to the subfamilies, POTRAs one and two have strongly diverged, either through sequence drift or mixing of the secondary structure elements. This fits well with the hypothesis that the POTRA domain closest to the barrel experiences the strongest selective pressure, arising from structural restrictions due to its proximity to the membrane-embedded barrel. Structurally, this POTRA domain makes important contacts with the barrel domain (<xref ref-type="bibr" rid="B50">Noinaj et al., 2013</xref>). The distinct features of the more N-terminal POTRAs could be explained by their role as the domains that interact with partner proteins, which differ between BamA and TamA (<xref ref-type="bibr" rid="B26">Hagan et al., 2011</xref>; <xref ref-type="bibr" rid="B60">Selkrig et al., 2012</xref>).</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p><bold>Sequence similarity network of POTRA domains highlights diversity based on subfamilies as well as the location of the POTRA respective to the barrel. (A)</bold> Protein-protein similarity network representation of POTRA domain sequences, where the colors depict the different subfamilies. <bold>(B)</bold> Recolouring of <bold>(A)</bold> as indicated; with each POTRA domain of each subfamily highlighted in a distinct color. Only the POTRA domains conserved in the majority of the respective sequences (e.g., five for BamA) are shown; for proteins with additional POTRA domains (<xref ref-type="bibr" rid="B7">Arnold et al., 2010</xref>), the regions of the five most conserved based on a multiple sequence alignment are depicted, as described in the Section &#x0201C;Methods.&#x0201D;</p></caption>
<graphic xlink:href="fmicb-05-00370-g004.tif"/>
</fig>
<p>In modular protein complexes, the capacity of binding sites to interact with substrates is often modified by adding or duplicating domains (<xref ref-type="bibr" rid="B8">Bjorklund et al., 2006</xref>). The internal POTRA domains (P2-P4) in BamA show highest sequence similarity to each other, consistent with a pattern of domain duplications (<bold>Figure <xref ref-type="fig" rid="F4">4B</xref></bold>), and the trend in BamA to duplicate the internal POTRAs is in accordance with observations on larger scales (<xref ref-type="bibr" rid="B8">Bjorklund et al., 2006</xref>). The dynamic potential of POTRA domains is further emphasized by some organisms having BamA sequences with more than five POTRA domains, as observed previously (<xref ref-type="bibr" rid="B7">Arnold et al., 2010</xref>); only the five conserved POTRAs present in the majority of sequences were included in the analysis (<bold>Figure <xref ref-type="fig" rid="F4">4</xref></bold>) to avoid generating too much complexity in the network. The seemingly contrary trend in the TamA and Omp85-lipoprotein subfamilies can be explained by assuming that BamA is the original Omp85, which already carried several POTRA domains, and that later functional adaptations led to a divergence of the POTRA domains P1 and P2 in these two subfamilies.</p>
<p>As previously observed, there is complexity within the cyanobacterial BamA cluster, including the plastid Oep80 and Toc75 sequences (<xref ref-type="bibr" rid="B7">Arnold et al., 2010</xref>; <xref ref-type="bibr" rid="B36">Koenig et al., 2010</xref>). Predominantly, these contain only three POTRA domains, differentiating these sequences from the majority of all other BamA proteins, and some of these POTRA domains conform to the sequence characteristics of TpsB-type POTRAs (<bold>Table <xref ref-type="supplementary-material" rid="ST2">S2</xref></bold>, <xref ref-type="bibr" rid="B36">Koenig et al., 2010</xref>). For the purpose of the analysis depicted in <bold>Figure <xref ref-type="fig" rid="F4">4</xref></bold>, therefore, the entire cluster is colored separately and denoted &#x0201C;BamA 4&#x0201D; (for the fourth largest BamA cluster as given in <bold>Table <xref ref-type="supplementary-material" rid="ST1">S1</xref></bold>; <bold>Figure <xref ref-type="fig" rid="F4">4B</xref></bold>), consistent with the nomenclature used in <bold>Table <xref ref-type="supplementary-material" rid="ST1">S1</xref></bold>. The second POTRA domain (P2) is often recognized by the TpsB-specific POTRA domain motif (PF08479), consistent with previous observations (<xref ref-type="bibr" rid="B7">Arnold et al., 2010</xref>). Also of note, BamA sequences from the <italic>Deinococcus-Thermus</italic> phylum, which also cluster in the predominantly cyanobacterial group (BamA4 in <bold>Table <xref ref-type="supplementary-material" rid="ST1">S1</xref></bold>), have POTRA P1 domains with strong similarity to the sequence features of the POTRA P2 domain from the FhaC protein subfamily (<bold>Figure <xref ref-type="fig" rid="F4">4</xref></bold>). These distinguishing features indicate an adapted function of the BamA of this Phylum, perhaps in response to unique features of their cell envelope (<xref ref-type="bibr" rid="B23">Farci et al., 2014</xref>).</p>
<p>The single POTRA domain for Sam50 is highlighted in gray (<bold>Figure <xref ref-type="fig" rid="F4">4B</xref></bold>) and is highly divergent from all bacterial POTRA sequences. This divergence might be a reflection of the simpler substrate repertoire and/or the reduced function of the POTRA domain in the mitochondrial outer membrane, and it is consistent with the observation that Sam50 is functional even if the POTRA domain is deleted (<xref ref-type="bibr" rid="B64">Stroud et al., 2011</xref>).</p>
</sec>
<sec>
<title>THE TAXONOMIC DISTRIBUTION OF THE SUBFAMILIES HIGHLIGHTS VERTICAL VERSUS HORIZONTAL INHERITANCE</title>
<p>BamA is essential for outer membrane biogenesis through its catalysis of &#x003B2;-barrel protein assembly. Given the clearly defined &#x0201C;BamA family,&#x0201D; the question of whether a BamA is found ubiquitously in organisms with an outer membrane could be addressed with confidence (<bold>Figure <xref ref-type="fig" rid="F3">3A</xref></bold>; <bold>Table <xref ref-type="supplementary-material" rid="ST4">S4</xref></bold>). There is no evidence of BamA in genomes from the taxa known to lack a Gram-negative type cell envelope, nor in the proteobacterial obligate intracellular endosymbionts that lack the capacity for outer membrane biogenesis: <italic>Candidatus</italic> Tremblaya princeps, <italic>Candidatus</italic> Hodgkinia cicadicola, <italic>Candidatus</italic> Carsonella ruddii, and <italic>Candidatus</italic> Zinderia insecticola (<xref ref-type="bibr" rid="B42">McCutcheon and Moran, 2012</xref>) all lack a gene encoding BamA (<bold>Table <xref ref-type="supplementary-material" rid="ST4">S4</xref></bold>, green font). Consistent with this, in the fifth member of the &#x0201C;tiny genome&#x0201D; organisms, <italic>Candidatus</italic> Sulcia muelleri, in which several genes for cell envelope biosynthesis remain (<xref ref-type="bibr" rid="B42">McCutcheon and Moran, 2012</xref>), each of the strains present in our dataset has a BamA sequence (<bold>Table <xref ref-type="supplementary-material" rid="ST1">S1</xref></bold>).</p>
<p>We could not identify any BamA proteins for the curious bacterium <italic>Caldisericum exile</italic> (DSM 21853). Electron microscopy shows that <italic>C. exile</italic> has an outer membrane-like envelope, but further experiments failed to clarify whether it is Gram-positive or Gram-negative (<xref ref-type="bibr" rid="B45">Mori et al., 2009</xref>); our observation of the lack of BamA or any other proteins annotated as outer membrane-localized (PsortB; <xref ref-type="bibr" rid="B69">Yu et al., 2010</xref>) points to <italic>C. exile</italic> having a Gram-positive-type cell envelope.</p>
<p>The distribution of the additional subfamilies is more scattered. As noted, the Omp85 Lipo in <italic>Bacteroidetes</italic> and <italic>Chlorobi</italic> and TamA in <italic>Proteobacteria</italic> are found in phylogenetic subgroups at the phylum level, suggesting their origin from a single BamA duplication followed by vertical inheritance (<bold>Figure <xref ref-type="fig" rid="F3">3</xref></bold>; <bold>Table <xref ref-type="supplementary-material" rid="ST3">S3</xref></bold>). However, the other Omp85 families indicate a later evolutionary origin in the respective taxa, as they can only be found conserved at the genus level (<bold>Figure <xref ref-type="fig" rid="F3">3</xref></bold>; <bold>Table <xref ref-type="supplementary-material" rid="ST3">S3</xref></bold>; e.g., Metallo). The latter subfamilies, which include FhaC and Hmw1B, show a distribution across a variety of different groups, strongly suggesting inheritance through HGT. This mode of inheritance is common for other membrane proteins associated with virulence (<xref ref-type="bibr" rid="B51">Pallen and Wren, 2007</xref>), including oligomeric molecular machines such as the protein secretion systems (for example, see <xref ref-type="bibr" rid="B15">Cianciotto, 2005</xref>; <xref ref-type="bibr" rid="B4">Alvarez-Martinez and Christie, 2009</xref>; <xref ref-type="bibr" rid="B1">Abby and Rocha, 2012</xref>). Considerable expansion in diversity has taken place in the <italic>Bacteroidetes</italic>/<italic>Chlorobi</italic> as well as some of the Phyla so far only poorly represented in the sequence databases (<italic>Ignavibacteria, Chrysiogenetes, Verrucomicrobia</italic>), whereas the Phyla considered to be among the early branching ones often encode a single copy of BamA and no other Omp85/TpsB family members (<bold>Figure <xref ref-type="fig" rid="F3">3</xref></bold>; <italic>Thermotogae</italic>, <italic>Deinococcus</italic>-<italic>Thermus</italic>).</p>
</sec>
<sec>
<title>A HIGH LEVEL OF DIVERSITY IN BamA, THE Omp85 BLUEPRINT</title>
<p>Given the proposed evolution of Omp85 protein subfamilies from gene duplication events involving BamA, we investigated what appeared to be recent gene duplication events; many organisms were found to have two or more genes encoding BamA paralogs (<bold>Figure <xref ref-type="fig" rid="F3">3A</xref></bold>), and phylogenetic analysis of the BamA sequences was used to investigate their evolutionary history. Attempts at aligning the barrel region for all BamA sequences resulted in very few informative sites which could be used for tree calculations. We therefore chose to focus our attention on BamA diversity at a smaller scale, restricted to sequences with higher conservation.</p>
<p>Several <italic>Pseudomonas</italic> spp. encode two BamA paralogs, and initial sequence alignments showed very high similarity between these BamA sequences and their closest relatives. Phylogenetic analysis of full-length sequences suggested a very recent duplication event resulting in a highly similar duplicate; BamA paralogs are present in the non-pathogenic species <italic>P. brassicacearum</italic>, <italic>P. fluorescens</italic> and <italic>P. putida</italic>, which are known for their role in promoting plant growth and bioremediation (<bold>Figure <xref ref-type="fig" rid="F5">5A</xref></bold>), and a few of the numerous sequenced <italic>P. syringae</italic> strains also contain two BamA sequences (<bold>Table <xref ref-type="supplementary-material" rid="ST1">S1</xref></bold>). Some species, however, have a single gene encoding BamA; such is the case for strains of the human pathogens <italic>P. aeruginosa</italic> and <italic>P. mendocina</italic> (<bold>Figure <xref ref-type="fig" rid="F5">5</xref></bold>; <bold>Table <xref ref-type="supplementary-material" rid="ST1">S1</xref></bold>). Analysis of the gene synteny (<bold>Figure <xref ref-type="fig" rid="F5">5B</xref></bold>) shows a conserved genomic neighborhood for the original <italic>bamA</italic> sequences, whereas the duplicated genes (&#x0201C;<italic>bamA2</italic>&#x0201D;) are at a different location in the genome and share similar downstream genes but differ in their upstream genes. This observation confirms our assignment of original versus additional BamA, and also reflects the extremely high genome plasticity in <italic>Pseudomonas</italic> spp. (<xref ref-type="bibr" rid="B62">Silby et al., 2011</xref>).</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p><bold>Highly similar BamA paralogs in <italic>Pseudomonas</italic> spp. (A)</bold> Phylogenetic tree of the <italic>Pseudomonas</italic> spp. BamA sequences and their closest taxonomic relatives. Different colors indicate organisms with more than one BamA copy with dark blue displaying the original (most conserved) sequence, whereas additional copies are displayed in light blue. Tree calculations were performed using phylobayes under the C20 model, posterior probabilities are shown as branch support values. The interrupted branch was shortened for display purposes. <bold>(B)</bold> Synteny view of the <italic>Pseudomonas</italic> spp. bamA and their surrounding genes, the underlying data were retrieved from the NCBI database. The genes upstream of the additional bamA are not conserved, and indicated as &#x0201C;orf1&#x0201D; and &#x0201C;orf2&#x0201D; in the overview and depicted in gray shades in the comparative view.</p></caption>
<graphic xlink:href="fmicb-05-00370-g005.tif"/>
</fig>
<p>A more complicated scenario is evident in the <italic>Myxobacteria</italic>, which are members of the <italic>Deltaproteobacteria</italic> and are best known for their unusual characteristics such as gliding motility and social behavior (<xref ref-type="bibr" rid="B32">Kaiser, 2003</xref>; <xref ref-type="bibr" rid="B47">Nan and Zusman, 2011</xref>). BamA paralogs from these species are diverse in copy number (<bold>Figure <xref ref-type="fig" rid="F3">3</xref></bold>). Initial sequence alignments indicated that while all belong to the BamA subfamily, three distinct subgroups could be seen with varying numbers of POTRA domains, with some showing similarity to sequences outside the <italic>Deltaproteobacteria</italic>. We therefore used only the sequence corresponding to the barrel domain (see Methods) for the tree inference. To probe for potential HGT events, sequences displaying high similarity to the additional BamA copies were included in the tree calculation alongside BamA sequences from the closest taxonomic relatives. Three distinct monophyletic groupings were evident, each group resulting from one acquisition or duplication event in the <italic>Myxobacteria</italic> and a few close relatives (<bold>Figure <xref ref-type="fig" rid="F6">6</xref></bold>). While Group 1 branches according to vertical inheritance, and Group 2 indicates a single duplication within the <italic>Deltaproteobacteria</italic> followed by strong sequence divergence but no HGT, Group 3 seems to have been acquired from one of the early branching phyla (<italic>Firmicutes</italic>, <italic>Thermotogae</italic>, <italic>Deinococcus-Thermus</italic>, <italic>Cyanobacteria</italic>) through HGT. 
However, given the low sequence coverage of this area of the bacterial tree, as well as the low support for a monophyletic origin with the <italic>Deinococcus-Thermus</italic> and <italic>Cyanobacteria</italic> (branch support 0.5), the exact origin within these phyla should be interpreted with caution. Tree calculations using the C20 model in phylobayes (data not shown) consistently resulted in similar topologies for the monophyly of the <italic>Myxobacteria</italic> Group 1 with the <italic>Deltaproteobacteria</italic> as well as the <italic>Alphaproteobacteria</italic> monophyly, and support a non-proteobacterial origin of the <italic>Myxobacteria</italic> sequence Group 3, indicating an acquisition through HGT. Group 2 branches off as a monophyletic branch between the <italic>Proteobacteria</italic> and all others, possibly reflecting long-branch attraction due to the high divergence of the sequences.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p><bold>Independent sets of BamA proteins in the <italic>Myxococcales</italic>.</bold> Phylogenetic tree of BamA sequences identified in <italic>Myxobacteria</italic> and their closest taxonomic relatives. Tree calculations were performed using phylobayes under the C60 model, posterior probabilities are shown as branch support values. The <italic>Myxococcales</italic> Group 1, Group 2, and Group 3 sequences are indicated.</p></caption>
<graphic xlink:href="fmicb-05-00370-g006.tif"/>
</fig>
<p>These examples demonstrate the variability of BamA not only in copy number, but also in sequence origin and level of similarity. They lend plausibility to a scenario in which duplication of BamA genes is followed by selection for diversification of function. We suggest two reasons why this selection could be advantageous: (i) the highly similar BamA paralogs (e.g., <bold>Figure <xref ref-type="fig" rid="F5">5</xref></bold>) could provide alternatives for control of gene expression, allowing for regulation in response to specific environmental conditions, and (ii) paralogs could specialize in activity toward a certain subset of outer membrane protein substrates, ultimately becoming modules like TamA that assist the function of the cognate BamA in the assembly of diverse membrane protein substrates (<xref ref-type="bibr" rid="B59">Selkrig et al., 2014</xref>).</p>
</sec>
<sec>
<title>POTENTIAL IMPLICATIONS OF DIFFERENCES IN Omp85 PROTEINS</title>
<p>The diversity observed in the Omp85 family could reflect adaptations to different substrate (&#x0201C;client&#x0201D;) proteins, as has been observed in molecular chaperone protein families. Detailed studies on molecular chaperones found in the cytoplasm show high levels of variation in their copy numbers, which allows them to cope with the assembly of their evolving range of substrate proteins as well as to acquire novel (sub)functions themselves (<xref ref-type="bibr" rid="B27">Henderson et al., 2013</xref>; <xref ref-type="bibr" rid="B55">Ruiz-Gonzalez and Fares, 2013</xref>).</p>
<p>Gene duplications for cytoplasmic chaperones such as GroEL (Hsp60), Hsp70 or Hsp90 are very common amongst eukaryotes, where the formation of distinct subgroups is well-described (<xref ref-type="bibr" rid="B9">Bogumil et al., 2014</xref>), and multiple paralogs of these cytoplasmic chaperones are also observed in prokaryotes (<xref ref-type="bibr" rid="B48">Nimura et al., 2001</xref>; <xref ref-type="bibr" rid="B14">Chen et al., 2006</xref>; <xref ref-type="bibr" rid="B39">Lund, 2009</xref>). For the GroEL-like chaperones, it has been proposed that the initial transfer of specific chaperones between unrelated organisms living in the same environment paves the way for subsequent transfer of other functions important in the respective niche (<xref ref-type="bibr" rid="B66">Williams et al., 2010</xref>). The presence of multiple BamA or BamA-like proteins detected through our study might likewise enable the respective organisms to acquire or evolve a more diverse outer membrane proteome, just as the diversity of cytoplasmic chaperones controls the mutation rate of proteins, enabling organisms to generate a more diverse cytoplasmic proteome (<xref ref-type="bibr" rid="B67">Williams and Fares, 2010</xref>). This fits with the observations in this study showing that the expansion of paralogs is often specific for certain subgroups or species with a distinct lifestyle, and with the enrichment of Omp85 proteins in organisms thriving in less stable environments, such as marine or soil bacteria, as opposed to pathogens. As the first point of contact, outer membrane proteins play a crucial role in an organism&#x02019;s interactions with its surroundings; the gain of specific Omp85 subfamilies could mediate adaptation on a rapid scale.</p>
</sec>
</sec>
<sec>
<title>SUMMARY</title>
<p>The protein architecture and sequence signatures identified within the Omp85/TpsB superfamily enable a structured classification of this highly diverse group of proteins. They suggest that the complex process of assembling proteins into bacterial outer membranes selects for diversity in the genes encoding BamA paralogs and BamA-related functions. Beyond the established and ancient BamA protein subfamily, other Omp85 protein subfamilies are present and have been acquired through HGT to become established in diverse bacterial taxa. We suggest that proteins with a barrel+POTRA domain architecture or the barrel-only Omp85 proteins serve as accessory modules in the &#x003B2;-barrel assembly machinery, assisting BamA to assemble subsets of outer membrane proteins and thereby enabling the acquisition of a range of new genes for outer membrane proteins. This diversity in Omp85 proteins thus provides the potential for the organism to thrive in a new or changing environment.</p>
</sec>
<sec>
<title>AUTHOR CONTRIBUTIONS</title>
<p>Eva Heinz and Trevor Lithgow conceived the study. Eva Heinz designed and performed the experiments and analyzed and interpreted the data. Eva Heinz and Trevor Lithgow wrote the manuscript.</p>
</sec>
<sec>
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ack>
<p>The authors thank Dieter Bulach and Victoria Hewitt for critically reading the manuscript. This work was supported by the Australian Research Council (DP120101878 and FL130100038). Eva Heinz is an ARC FL Postdoctoral Research Fellow, and Trevor Lithgow is an ARC Australian Laureate Research Fellow.</p>
</ack>
<sec sec-type="supplementary material" id="s2">
<title>SUPPLEMENTARY MATERIAL</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="http://www.frontiersin.org/journal/10.3389/fmicb.2014.00370/abstract">http://www.frontiersin.org/journal/10.3389/fmicb.2014.00370/abstract</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.XLSX" id="ST1" mimetype="application/xlsx" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Table S1</label>
<caption><p><bold>List of all UniProt accession numbers of Omp85 proteins in their respective clusters</bold>.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Data_Sheet_2.XLSX" id="ST2" mimetype="application/xlsx" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Table S2</label>
<caption><p><bold>List of the domain profiles identified for the main clusters based on the annotation of Pfam domains in Interpro</bold>.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Data_Sheet_3.XLSX" id="ST3" mimetype="application/xlsx" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Table S3</label>
<caption><p><bold>Summary of the prediction results using Phyre2 of sequences with novel domain profiles</bold>.</p></caption>
</supplementary-material>
<supplementary-material xlink:href="Data_Sheet_4.XLSX" id="ST4" mimetype="application/xlsx" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Table S4</label>
<caption><p><bold>List of all bacterial species with a completed proteome according to the UniProt database at the time of analysis, which lack a protein similar to BamA.</bold> Organisms that represent exceptions (highly reduced obligate intracellular bacteria, organisms with indications for Gram-positive or Gram-negative cell envelope) are highlighted in green, organisms where a BamA would be expected due to its presence in all other strains of the respective species are highlighted in red. All taxa underlined in gray are described to display a Gram-positive cell envelope.</p></caption>
</supplementary-material>
</sec>
<ref-list>
<title>REFERENCES</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abby</surname> <given-names>S. S.</given-names></name> <name><surname>Rocha</surname> <given-names>E. P.</given-names></name></person-group> (<year>2012</year>). <article-title>The non-flagellar type III secretion system evolved from the bacterial flagellum and diversified into host-cell adapted systems.</article-title> <source><italic>PLoS Genet.</italic></source> <volume>8</volume>:<issue>e1002983</issue>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1002983</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abergel</surname> <given-names>C.</given-names></name> <name><surname>Bouveret</surname> <given-names>E.</given-names></name> <name><surname>Claverie</surname> <given-names>J. M.</given-names></name> <name><surname>Brown</surname> <given-names>K.</given-names></name> <name><surname>Rigal</surname> <given-names>A.</given-names></name> <name><surname>Lazdunski</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>1999</year>). <article-title>Structure of the <italic>Escherichia coli</italic> TolB protein determined by MAD methods at 1.95 &#x000C5; resolution.</article-title> <source><italic>Structure</italic></source> <volume>7</volume> <fpage>1291</fpage>&#x02013;<lpage>1300</lpage>. <pub-id pub-id-type="doi">10.1016/S0969-2126(00)80062-3</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Albrecht</surname> <given-names>R.</given-names></name> <name><surname>Zeth</surname> <given-names>K.</given-names></name></person-group> (<year>2011</year>). <article-title>Structural basis of outer membrane protein biogenesis in bacteria.</article-title> <source><italic>J. Biol. Chem.</italic></source> <volume>286</volume> <fpage>27792</fpage>&#x02013;<lpage>27803</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M111.238931</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alvarez-Martinez</surname> <given-names>C. E.</given-names></name> <name><surname>Christie</surname> <given-names>P. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Biological diversity of prokaryotic type IV secretion systems.</article-title> <source><italic>Microbiol. Mol. Biol. Rev.</italic></source> <volume>73</volume> <fpage>775</fpage>&#x02013;<lpage>808</lpage>. <pub-id pub-id-type="doi">10.1128/MMBR.00023-09</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anwari</surname> <given-names>K.</given-names></name> <name><surname>Poggio</surname> <given-names>S.</given-names></name> <name><surname>Perry</surname> <given-names>A.</given-names></name> <name><surname>Gatsos</surname> <given-names>X.</given-names></name> <name><surname>Ramarathinam</surname> <given-names>S. H.</given-names></name> <name><surname>Williamson</surname> <given-names>N. A.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>A modular BAM complex in the outer membrane of the alpha-proteobacterium <italic>Caulobacter crescentus</italic>.</article-title> <source><italic>PLoS ONE</italic></source> <volume>5</volume>:<issue>e8619</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0008619</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anwari</surname> <given-names>K.</given-names></name> <name><surname>Webb</surname> <given-names>C. T.</given-names></name> <name><surname>Poggio</surname> <given-names>S.</given-names></name> <name><surname>Perry</surname> <given-names>A. J.</given-names></name> <name><surname>Belousoff</surname> <given-names>M.</given-names></name> <name><surname>Celik</surname> <given-names>N.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>The evolution of new lipoprotein subunits of the bacterial outer membrane BAM complex.</article-title> <source><italic>Mol. Microbiol.</italic></source> <volume>84</volume> <fpage>832</fpage>&#x02013;<lpage>844</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-2958.2012.08059.x</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arnold</surname> <given-names>T.</given-names></name> <name><surname>Zeth</surname> <given-names>K.</given-names></name> <name><surname>Linke</surname> <given-names>D.</given-names></name></person-group> (<year>2010</year>). <article-title>Omp85 from the thermophilic cyanobacterium <italic>Thermosynechococcus elongatus</italic> differs from proteobacterial Omp85 in structure and domain composition.</article-title> <source><italic>J. Biol. Chem.</italic></source> <volume>285</volume> <fpage>18003</fpage>&#x02013;<lpage>18015</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M110.112516</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bjorklund</surname> <given-names>A. K.</given-names></name> <name><surname>Ekman</surname> <given-names>D.</given-names></name> <name><surname>Elofsson</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>Expansion of protein domain repeats.</article-title> <source><italic>PLoS Comput. Biol.</italic></source> <volume>2</volume>:<issue>e114</issue>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.0020114</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bogumil</surname> <given-names>D.</given-names></name> <name><surname>Alvarez-Ponce</surname> <given-names>D.</given-names></name> <name><surname>Landan</surname> <given-names>G.</given-names></name> <name><surname>McInerney</surname> <given-names>J. O.</given-names></name> <name><surname>Dagan</surname> <given-names>T.</given-names></name></person-group> (<year>2014</year>). <article-title>Integration of two ancestral chaperone systems into one: the evolution of eukaryotic molecular chaperones in light of eukaryogenesis.</article-title> <source><italic>Mol. Biol. Evol.</italic></source> <volume>31</volume> <fpage>410</fpage>&#x02013;<lpage>418</lpage>. <pub-id pub-id-type="doi">10.1093/molbev/mst212</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bolter</surname> <given-names>B.</given-names></name> <name><surname>Soll</surname> <given-names>J.</given-names></name> <name><surname>Schulz</surname> <given-names>A.</given-names></name> <name><surname>Hinnah</surname> <given-names>S.</given-names></name> <name><surname>Wagner</surname> <given-names>R.</given-names></name></person-group> (<year>1998</year>). <article-title>Origin of a chloroplast protein importer.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>95</volume> <fpage>15831</fpage>&#x02013;<lpage>15836</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.95.26.15831</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bonsor</surname> <given-names>D. A.</given-names></name> <name><surname>Grishkovskaya</surname> <given-names>I.</given-names></name> <name><surname>Dodson</surname> <given-names>E. J.</given-names></name> <name><surname>Kleanthous</surname> <given-names>C.</given-names></name></person-group> (<year>2007</year>). <article-title>Molecular mimicry enables competitive recruitment by a natively disordered protein.</article-title> <source><italic>J. Am. Chem. Soc.</italic></source> <volume>129</volume> <fpage>4800</fpage>&#x02013;<lpage>4807</lpage>. <pub-id pub-id-type="doi">10.1021/ja070153n</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Capella-Gutierrez</surname> <given-names>S.</given-names></name> <name><surname>Silla-Martinez</surname> <given-names>J. M.</given-names></name> <name><surname>Gabaldon</surname> <given-names>T.</given-names></name></person-group> (<year>2009</year>). <article-title>trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>1972</fpage>&#x02013;<lpage>1973</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp348</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cavalier-Smith</surname> <given-names>T.</given-names></name></person-group> (<year>2006</year>). <article-title>Rooting the tree of life by transition analyses.</article-title> <source><italic>Biol. Direct</italic></source> <volume>1</volume>:<issue>19</issue>. <pub-id pub-id-type="doi">10.1186/1745-6150-1-19</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>B.</given-names></name> <name><surname>Zhong</surname> <given-names>D.</given-names></name> <name><surname>Monteiro</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>Comparative genomics and evolution of the HSP90 family of genes across all kingdoms of organisms.</article-title> <source><italic>BMC Genomics</italic></source> <volume>7</volume>:<issue>156</issue>. <pub-id pub-id-type="doi">10.1186/1471-2164-7-156</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cianciotto</surname> <given-names>N. P.</given-names></name></person-group> (<year>2005</year>). <article-title>Type II secretion: a protein secretion system for all seasons.</article-title> <source><italic>Trends Microbiol.</italic></source> <volume>13</volume> <fpage>581</fpage>&#x02013;<lpage>588</lpage>. <pub-id pub-id-type="doi">10.1016/j.tim.2005.09.005</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clantin</surname> <given-names>B.</given-names></name> <name><surname>Delattre</surname> <given-names>A. S.</given-names></name> <name><surname>Rucktooa</surname> <given-names>P.</given-names></name> <name><surname>Saint</surname> <given-names>N.</given-names></name> <name><surname>Meli</surname> <given-names>A. C.</given-names></name> <name><surname>Locht</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2007</year>). <article-title>Structure of the membrane protein FhaC: a member of the Omp85-TpsB transporter superfamily.</article-title> <source><italic>Science</italic></source> <volume>317</volume> <fpage>957</fpage>&#x02013;<lpage>961</lpage>. <pub-id pub-id-type="doi">10.1126/science.1143860</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eddy</surname> <given-names>S. R.</given-names></name></person-group> (<year>2011</year>). <article-title>Accelerated profile HMM searches.</article-title> <source><italic>PLoS Comput. Biol.</italic></source> <volume>7</volume>:<issue>e1002195</issue>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1002195</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edgar</surname> <given-names>R. C.</given-names></name></person-group> (<year>2004</year>). <article-title>MUSCLE: multiple sequence alignment with high accuracy and high throughput.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>32</volume> <fpage>1792</fpage>&#x02013;<lpage>1797</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkh340</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edgar</surname> <given-names>R. C.</given-names></name></person-group> (<year>2010</year>). <article-title>Search and clustering orders of magnitude faster than BLAST.</article-title> <source><italic>Bioinformatics</italic></source> <volume>26</volume> <fpage>2460</fpage>&#x02013;<lpage>2461</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btq461</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Enright</surname> <given-names>A. J.</given-names></name> <name><surname>Van Dongen</surname> <given-names>S.</given-names></name> <name><surname>Ouzounis</surname> <given-names>C. A.</given-names></name></person-group> (<year>2002</year>). <article-title>An efficient algorithm for large-scale detection of protein families.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>30</volume> <fpage>1575</fpage>&#x02013;<lpage>1584</lpage>. <pub-id pub-id-type="doi">10.1093/nar/30.7.1575</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Errington</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>L-form bacteria, cell walls and the origins of life.</article-title> <source><italic>Open Biol.</italic></source> <volume>3</volume>:<issue>120143</issue>. <pub-id pub-id-type="doi">10.1098/rsob.120143</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fang</surname> <given-names>H.</given-names></name> <name><surname>Oates</surname> <given-names>M. E.</given-names></name> <name><surname>Pethica</surname> <given-names>R. B.</given-names></name> <name><surname>Greenwood</surname> <given-names>J. M.</given-names></name> <name><surname>Sardar</surname> <given-names>A. J.</given-names></name> <name><surname>Rackham</surname> <given-names>O. J.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>A daily-updated tree of (sequenced) life as a reference for genome research.</article-title> <source><italic>Sci. Rep.</italic></source> <volume>3</volume>:<issue>2015</issue>. <pub-id pub-id-type="doi">10.1038/srep02015</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Farci</surname> <given-names>D.</given-names></name> <name><surname>Bowler</surname> <given-names>M. W.</given-names></name> <name><surname>Kirkpatrick</surname> <given-names>J.</given-names></name> <name><surname>McSweeney</surname> <given-names>S.</given-names></name> <name><surname>Tramontano</surname> <given-names>E.</given-names></name> <name><surname>Piano</surname> <given-names>D.</given-names></name></person-group> (<year>2014</year>). <article-title>New features of the cell wall of the radio-resistant bacterium <italic>Deinococcus radiodurans</italic>.</article-title> <source><italic>Biochim. Biophys. Acta</italic></source> <volume>1838</volume> <fpage>1978</fpage>&#x02013;<lpage>1984</lpage>. <pub-id pub-id-type="doi">10.1016/j.bbamem.2014.02.014</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gentle</surname> <given-names>I.</given-names></name> <name><surname>Gabriel</surname> <given-names>K.</given-names></name> <name><surname>Beech</surname> <given-names>P.</given-names></name> <name><surname>Waller</surname> <given-names>R.</given-names></name> <name><surname>Lithgow</surname> <given-names>T.</given-names></name></person-group> (<year>2004</year>). <article-title>The Omp85 family of proteins is essential for outer membrane biogenesis in mitochondria and bacteria.</article-title> <source><italic>J. Cell Biol.</italic></source> <volume>164</volume> <fpage>19</fpage>&#x02013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.1083/jcb.200310092</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gruss</surname> <given-names>F.</given-names></name> <name><surname>Zahringer</surname> <given-names>F.</given-names></name> <name><surname>Jakob</surname> <given-names>R. P.</given-names></name> <name><surname>Burmann</surname> <given-names>B. M.</given-names></name> <name><surname>Hiller</surname> <given-names>S.</given-names></name> <name><surname>Maier</surname> <given-names>T.</given-names></name></person-group> (<year>2013</year>). <article-title>The structural basis of autotransporter translocation by TamA.</article-title> <source><italic>Nat. Struct. Mol. Biol.</italic></source> <volume>20</volume> <fpage>1318</fpage>&#x02013;<lpage>1320</lpage>. <pub-id pub-id-type="doi">10.1038/nsmb.2689</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hagan</surname> <given-names>C. L.</given-names></name> <name><surname>Silhavy</surname> <given-names>T. J.</given-names></name> <name><surname>Kahne</surname> <given-names>D.</given-names></name></person-group> (<year>2011</year>). <article-title>beta-Barrel membrane protein assembly by the Bam complex.</article-title> <source><italic>Annu. Rev. Biochem.</italic></source> <volume>80</volume> <fpage>189</fpage>&#x02013;<lpage>210</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-biochem-061408-144611</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Henderson</surname> <given-names>B.</given-names></name> <name><surname>Fares</surname> <given-names>M. A.</given-names></name> <name><surname>Lund</surname> <given-names>P. A.</given-names></name></person-group> (<year>2013</year>). <article-title>Chaperonin 60: a paradoxical, evolutionarily conserved protein family with multiple moonlighting functions.</article-title> <source><italic>Biol. Rev. Camb. Philos. Soc.</italic></source> <volume>88</volume> <fpage>955</fpage>&#x02013;<lpage>987</lpage>. <pub-id pub-id-type="doi">10.1111/brv.12037</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heuck</surname> <given-names>A.</given-names></name> <name><surname>Schleiffer</surname> <given-names>A.</given-names></name> <name><surname>Clausen</surname> <given-names>T.</given-names></name></person-group> (<year>2011</year>). <article-title>Augmenting beta-augmentation: structural basis of how BamB binds BamA and may support folding of outer membrane proteins.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>406</volume> <fpage>659</fpage>&#x02013;<lpage>666</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2011.01.002</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hunter</surname> <given-names>S.</given-names></name> <name><surname>Jones</surname> <given-names>P.</given-names></name> <name><surname>Mitchell</surname> <given-names>A.</given-names></name> <name><surname>Apweiler</surname> <given-names>R.</given-names></name> <name><surname>Attwood</surname> <given-names>T. K.</given-names></name> <name><surname>Bateman</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>InterPro in 2011: new developments in the family and domain prediction database.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>40</volume>:<fpage>D306</fpage>&#x02013;<lpage>D312</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr948</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jacob-Dubuisson</surname> <given-names>F.</given-names></name> <name><surname>Guerin</surname> <given-names>J.</given-names></name> <name><surname>Baelen</surname> <given-names>S.</given-names></name> <name><surname>Clantin</surname> <given-names>B.</given-names></name></person-group> (<year>2013</year>). <article-title>Two-partner secretion: as simple as it sounds?</article-title> <source><italic>Res. Microbiol.</italic></source> <volume>164</volume> <fpage>583</fpage>&#x02013;<lpage>595</lpage>. <pub-id pub-id-type="doi">10.1016/j.resmic.2013.03.009</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Juncker</surname> <given-names>A. S.</given-names></name> <name><surname>Willenbrock</surname> <given-names>H.</given-names></name> <name><surname>Von Heijne</surname> <given-names>G.</given-names></name> <name><surname>Brunak</surname> <given-names>S.</given-names></name> <name><surname>Nielsen</surname> <given-names>H.</given-names></name> <name><surname>Krogh</surname> <given-names>A.</given-names></name></person-group> (<year>2003</year>). <article-title>Prediction of lipoprotein signal peptides in Gram-negative bacteria.</article-title> <source><italic>Protein Sci.</italic></source> <volume>12</volume> <fpage>1652</fpage>&#x02013;<lpage>1662</lpage>. <pub-id pub-id-type="doi">10.1110/ps.0303703</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kaiser</surname> <given-names>D.</given-names></name></person-group> (<year>2003</year>). <article-title>Coupling cell movement to multicellular development in myxobacteria.</article-title> <source><italic>Nat. Rev. Microbiol.</italic></source> <volume>1</volume> <fpage>45</fpage>&#x02013;<lpage>54</lpage>. <pub-id pub-id-type="doi">10.1038/nrmicro733</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kelley</surname> <given-names>L. A.</given-names></name> <name><surname>Sternberg</surname> <given-names>M. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Protein structure prediction on the Web: a case study using the Phyre server.</article-title> <source><italic>Nat. Protoc.</italic></source> <volume>4</volume> <fpage>363</fpage>&#x02013;<lpage>371</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2009.2</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>K. H.</given-names></name> <name><surname>Aulakh</surname> <given-names>S.</given-names></name> <name><surname>Paetzel</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>The bacterial outer membrane beta-barrel assembly machinery.</article-title> <source><italic>Protein Sci.</italic></source> <volume>21</volume> <fpage>751</fpage>&#x02013;<lpage>768</lpage>. <pub-id pub-id-type="doi">10.1002/pro.2069</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>K. H.</given-names></name> <name><surname>Paetzel</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Crystal structure of <italic>Escherichia coli</italic> BamB, a lipoprotein component of the beta-barrel assembly machinery complex.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>406</volume> <fpage>667</fpage>&#x02013;<lpage>678</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2010.12.020</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Koenig</surname> <given-names>P.</given-names></name> <name><surname>Mirus</surname> <given-names>O.</given-names></name> <name><surname>Haarmann</surname> <given-names>R.</given-names></name> <name><surname>Sommer</surname> <given-names>M. S.</given-names></name> <name><surname>Sinning</surname> <given-names>I.</given-names></name> <name><surname>Schleiff</surname> <given-names>E.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>Conserved properties of polypeptide transport-associated (POTRA) domains derived from cyanobacterial Omp85.</article-title> <source><italic>J. Biol. Chem.</italic></source> <volume>285</volume> <fpage>18016</fpage>&#x02013;<lpage>18024</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.M110.112649</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lartillot</surname> <given-names>N.</given-names></name> <name><surname>Lepage</surname> <given-names>T.</given-names></name> <name><surname>Blanquart</surname> <given-names>S.</given-names></name></person-group> (<year>2009</year>). <article-title>PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating.</article-title> <source><italic>Bioinformatics</italic></source> <volume>25</volume> <fpage>2286</fpage>&#x02013;<lpage>2288</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btp368</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Letunic</surname> <given-names>I.</given-names></name> <name><surname>Bork</surname> <given-names>P.</given-names></name></person-group> (<year>2011</year>). <article-title>Interactive Tree Of Life v2: online annotation and display of phylogenetic trees made easy.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>39</volume>:<fpage>W475</fpage>&#x02013;<lpage>W478</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr201</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lund</surname> <given-names>P. A.</given-names></name></person-group> (<year>2009</year>). <article-title>Multiple chaperonins in bacteria&#x02013;why so many?</article-title> <source><italic>FEMS Microbiol. Rev.</italic></source> <volume>33</volume> <fpage>785</fpage>&#x02013;<lpage>800</lpage>. <pub-id pub-id-type="doi">10.1111/j.1574-6976.2009.00178.x</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Magrane</surname> <given-names>M.</given-names></name> <collab>UniProt Consortium</collab></person-group> (<year>2011</year>). <article-title>UniProt knowledgebase: a hub of integrated protein data.</article-title> <source><italic>Database (Oxford)</italic></source> <volume>2011</volume>:<issue>bar009</issue>. <pub-id pub-id-type="doi">10.1093/database/bar009</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mazar</surname> <given-names>J.</given-names></name> <name><surname>Cotter</surname> <given-names>P. A.</given-names></name></person-group> (<year>2007</year>). <article-title>New insight into the molecular mechanisms of two-partner secretion.</article-title> <source><italic>Trends Microbiol.</italic></source> <volume>15</volume> <fpage>508</fpage>&#x02013;<lpage>515</lpage>. <pub-id pub-id-type="doi">10.1016/j.tim.2007.10.005</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McCutcheon</surname> <given-names>J. P.</given-names></name> <name><surname>Moran</surname> <given-names>N. A.</given-names></name></person-group> (<year>2012</year>). <article-title>Extreme genome reduction in symbiotic bacteria.</article-title> <source><italic>Nat. Rev. Microbiol.</italic></source> <volume>10</volume> <fpage>13</fpage>&#x02013;<lpage>26</lpage>. <pub-id pub-id-type="doi">10.1038/nrmicro2670</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McGuffin</surname> <given-names>L. J.</given-names></name> <name><surname>Bryson</surname> <given-names>K.</given-names></name> <name><surname>Jones</surname> <given-names>D. T.</given-names></name></person-group> (<year>2000</year>). <article-title>The PSIPRED protein structure prediction server.</article-title> <source><italic>Bioinformatics</italic></source> <volume>16</volume> <fpage>404</fpage>&#x02013;<lpage>405</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/16.4.404</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Merdanovic</surname> <given-names>M.</given-names></name> <name><surname>Clausen</surname> <given-names>T.</given-names></name> <name><surname>Kaiser</surname> <given-names>M.</given-names></name> <name><surname>Huber</surname> <given-names>R.</given-names></name> <name><surname>Ehrmann</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Protein quality control in the bacterial periplasm.</article-title> <source><italic>Annu. Rev. Microbiol.</italic></source> <volume>65</volume> <fpage>149</fpage>&#x02013;<lpage>168</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-micro-090110-102925</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mori</surname> <given-names>K.</given-names></name> <name><surname>Yamaguchi</surname> <given-names>K.</given-names></name> <name><surname>Sakiyama</surname> <given-names>Y.</given-names></name> <name><surname>Urabe</surname> <given-names>T.</given-names></name> <name><surname>Suzuki</surname> <given-names>K.</given-names></name></person-group> (<year>2009</year>). <article-title>Caldisericum exile gen. nov., sp. nov., an anaerobic, thermophilic, filamentous bacterium of a novel bacterial phylum, Caldiserica phyl. nov., originally called the candidate phylum OP5, and description of Caldisericaceae fam. nov., Caldisericales ord. nov. and Caldisericia classis nov.</article-title> <source><italic>Int. J. Syst. Evol. Microbiol.</italic></source> <volume>59</volume>(<issue>Pt 11</issue>) <fpage>2894</fpage>&#x02013;<lpage>2898</lpage>. <pub-id pub-id-type="doi">10.1099/ijs.0.010033-0</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moslavac</surname> <given-names>S.</given-names></name> <name><surname>Mirus</surname> <given-names>O.</given-names></name> <name><surname>Bredemeier</surname> <given-names>R.</given-names></name> <name><surname>Soll</surname> <given-names>J.</given-names></name> <name><surname>von Haeseler</surname> <given-names>A.</given-names></name> <name><surname>Schleiff</surname> <given-names>E.</given-names></name></person-group> (<year>2005</year>). <article-title>Conserved pore-forming regions in polypeptide-transporting proteins.</article-title> <source><italic>FEBS J.</italic></source> <volume>272</volume> <fpage>1367</fpage>&#x02013;<lpage>1378</lpage>. <pub-id pub-id-type="doi">10.1111/j.1742-4658.2005.04569.x</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nan</surname> <given-names>B.</given-names></name> <name><surname>Zusman</surname> <given-names>D. R.</given-names></name></person-group> (<year>2011</year>). <article-title>Uncovering the mystery of gliding motility in the myxobacteria.</article-title> <source><italic>Annu. Rev. Genet.</italic></source> <volume>45</volume> <fpage>21</fpage>&#x02013;<lpage>39</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-genet-110410-132547</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nimura</surname> <given-names>K.</given-names></name> <name><surname>Takahashi</surname> <given-names>H.</given-names></name> <name><surname>Yoshikawa</surname> <given-names>H.</given-names></name></person-group> (<year>2001</year>). <article-title>Characterization of the <italic>dnaK</italic> multigene family in the cyanobacterium <italic>Synechococcus</italic> sp. strain PCC7942.</article-title> <source><italic>J. Bacteriol.</italic></source> <volume>183</volume> <fpage>1320</fpage>&#x02013;<lpage>1328</lpage>. <pub-id pub-id-type="doi">10.1128/JB.183.4.1320-1328.2001</pub-id></citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Noinaj</surname> <given-names>N.</given-names></name> <name><surname>Fairman</surname> <given-names>J. W.</given-names></name> <name><surname>Buchanan</surname> <given-names>S. K.</given-names></name></person-group> (<year>2011</year>). <article-title>The crystal structure of BamB suggests interactions with BamA and its role within the BAM complex.</article-title> <source><italic>J. Mol. Biol.</italic></source> <volume>407</volume> <fpage>248</fpage>&#x02013;<lpage>260</lpage>. <pub-id pub-id-type="doi">10.1016/j.jmb.2011.01.042</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Noinaj</surname> <given-names>N.</given-names></name> <name><surname>Kuszak</surname> <given-names>A. J.</given-names></name> <name><surname>Gumbart</surname> <given-names>J. C.</given-names></name> <name><surname>Lukacik</surname> <given-names>P.</given-names></name> <name><surname>Chang</surname> <given-names>H.</given-names></name> <name><surname>Easley</surname> <given-names>N. C.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>Structural insight into the biogenesis of beta-barrel membrane proteins.</article-title> <source><italic>Nature</italic></source> <volume>501</volume> <fpage>385</fpage>&#x02013;<lpage>390</lpage>. <pub-id pub-id-type="doi">10.1038/nature12521</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pallen</surname> <given-names>M. J.</given-names></name> <name><surname>Wren</surname> <given-names>B. W.</given-names></name></person-group> (<year>2007</year>). <article-title>Bacterial pathogenomics.</article-title> <source><italic>Nature</italic></source> <volume>449</volume> <fpage>835</fpage>&#x02013;<lpage>842</lpage>. <pub-id pub-id-type="doi">10.1038/nature06248</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pettersen</surname> <given-names>E. F.</given-names></name> <name><surname>Goddard</surname> <given-names>T. D.</given-names></name> <name><surname>Huang</surname> <given-names>C. C.</given-names></name> <name><surname>Couch</surname> <given-names>G. S.</given-names></name> <name><surname>Greenblatt</surname> <given-names>D. M.</given-names></name> <name><surname>Meng</surname> <given-names>E. C.</given-names></name><etal/></person-group> (<year>2004</year>). <article-title>UCSF Chimera &#x02013; a visualization system for exploratory research and analysis.</article-title> <source><italic>J. Comput. Chem.</italic></source> <volume>25</volume> <fpage>1605</fpage>&#x02013;<lpage>1612</lpage>. <pub-id pub-id-type="doi">10.1002/jcc.20084</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Punta</surname> <given-names>M.</given-names></name> <name><surname>Coggill</surname> <given-names>P. C.</given-names></name> <name><surname>Eberhardt</surname> <given-names>R. Y.</given-names></name> <name><surname>Mistry</surname> <given-names>J.</given-names></name> <name><surname>Tate</surname> <given-names>J.</given-names></name> <name><surname>Boursnell</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>The Pfam protein families database.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>40</volume>:<fpage>D290</fpage>&#x02013;<lpage>D301</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkr1065</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reumann</surname> <given-names>S.</given-names></name> <name><surname>Keegstra</surname> <given-names>K.</given-names></name></person-group> (<year>1999</year>). <article-title>The endosymbiotic origin of the protein import machinery of chloroplastic envelope membranes.</article-title> <source><italic>Trends Plant Sci.</italic></source> <volume>4</volume> <fpage>302</fpage>&#x02013;<lpage>307</lpage>. <pub-id pub-id-type="doi">10.1016/S1360-1385(99)01449-1</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ruiz-Gonzalez</surname> <given-names>M. X.</given-names></name> <name><surname>Fares</surname> <given-names>M. A.</given-names></name></person-group> (<year>2013</year>). <article-title>Coevolution analyses illuminate the dependencies between amino acid sites in the chaperonin system GroES-L.</article-title> <source><italic>BMC Evol. Biol.</italic></source> <volume>13</volume>:<fpage>156</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2148-13-156</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Salacha</surname> <given-names>R.</given-names></name> <name><surname>Kovacic</surname> <given-names>F.</given-names></name> <name><surname>Brochier-Armanet</surname> <given-names>C.</given-names></name> <name><surname>Wilhelm</surname> <given-names>S.</given-names></name> <name><surname>Tommassen</surname> <given-names>J.</given-names></name> <name><surname>Filloux</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>The <italic>Pseudomonas aeruginosa</italic> patatin-like protein PlpD is the archetype of a novel Type V secretion system.</article-title> <source><italic>Environ. Microbiol.</italic></source> <volume>12</volume> <fpage>1498</fpage>&#x02013;<lpage>1512</lpage>. <pub-id pub-id-type="doi">10.1111/j.1462-2920.2010.02174.x</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sanchez-Pulido</surname> <given-names>L.</given-names></name> <name><surname>Devos</surname> <given-names>D.</given-names></name> <name><surname>Genevrois</surname> <given-names>S.</given-names></name> <name><surname>Vicente</surname> <given-names>M.</given-names></name> <name><surname>Valencia</surname> <given-names>A.</given-names></name></person-group> (<year>2003</year>). <article-title>POTRA: a conserved domain in the FtsQ family and a class of beta-barrel outer membrane proteins.</article-title> <source><italic>Trends Biochem. Sci.</italic></source> <volume>28</volume> <fpage>523</fpage>&#x02013;<lpage>526</lpage>. <pub-id pub-id-type="doi">10.1016/j.tibs.2003.08.003</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schleiff</surname> <given-names>E.</given-names></name> <name><surname>Becker</surname> <given-names>T.</given-names></name></person-group> (<year>2011</year>). <article-title>Common ground for protein translocation: access control for mitochondria and chloroplasts.</article-title> <source><italic>Nat. Rev. Mol. Cell Biol.</italic></source> <volume>12</volume> <fpage>48</fpage>&#x02013;<lpage>59</lpage>. <pub-id pub-id-type="doi">10.1038/nrm3027</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Selkrig</surname> <given-names>J.</given-names></name> <name><surname>Leyton</surname> <given-names>D. L.</given-names></name> <name><surname>Webb</surname> <given-names>C. T.</given-names></name> <name><surname>Lithgow</surname> <given-names>T.</given-names></name></person-group> (<year>2014</year>). <article-title>Assembly of &#x003B2;-barrel proteins into bacterial outer membranes.</article-title> <source><italic>Biochim. Biophys. Acta</italic></source> <volume>1843</volume> <fpage>1542</fpage>&#x02013;<lpage>1550</lpage>. <pub-id pub-id-type="doi">10.1016/j.bbamcr.2013.10.009</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Selkrig</surname> <given-names>J.</given-names></name> <name><surname>Mosbahi</surname> <given-names>K.</given-names></name> <name><surname>Webb</surname> <given-names>C. T.</given-names></name> <name><surname>Belousoff</surname> <given-names>M. J.</given-names></name> <name><surname>Perry</surname> <given-names>A. J.</given-names></name> <name><surname>Wells</surname> <given-names>T. J.</given-names></name><etal/></person-group> (<year>2012</year>). <article-title>Discovery of an archetypal protein transport system in bacterial outer membranes.</article-title> <source><italic>Nat. Struct. Mol. Biol.</italic></source> <volume>19</volume> <fpage>506</fpage>&#x02013;<lpage>510</lpage>. <pub-id pub-id-type="doi">10.1038/nsmb.2261</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shannon</surname> <given-names>P.</given-names></name> <name><surname>Markiel</surname> <given-names>A.</given-names></name> <name><surname>Ozier</surname> <given-names>O.</given-names></name> <name><surname>Baliga</surname> <given-names>N. S.</given-names></name> <name><surname>Wang</surname> <given-names>J. T.</given-names></name> <name><surname>Ramage</surname> <given-names>D.</given-names></name><etal/></person-group> (<year>2003</year>). <article-title>Cytoscape: a software environment for integrated models of biomolecular interaction networks.</article-title> <source><italic>Genome Res.</italic></source> <volume>13</volume> <fpage>2498</fpage>&#x02013;<lpage>2504</lpage>. <pub-id pub-id-type="doi">10.1101/gr.1239303</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Silby</surname> <given-names>M. W.</given-names></name> <name><surname>Winstanley</surname> <given-names>C.</given-names></name> <name><surname>Godfrey</surname> <given-names>S. A.</given-names></name> <name><surname>Levy</surname> <given-names>S. B.</given-names></name> <name><surname>Jackson</surname> <given-names>R. W.</given-names></name></person-group> (<year>2011</year>). <article-title><italic>Pseudomonas</italic> genomes: diverse and adaptable.</article-title> <source><italic>FEMS Microbiol. Rev.</italic></source> <volume>35</volume> <fpage>652</fpage>&#x02013;<lpage>680</lpage>. <pub-id pub-id-type="doi">10.1111/j.1574-6976.2011.00269.x</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Simossis</surname> <given-names>V. A.</given-names></name> <name><surname>Heringa</surname> <given-names>J.</given-names></name></person-group> (<year>2005</year>). <article-title>PRALINE: a multiple sequence alignment toolbox that integrates homology-extended and secondary structure information.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>33</volume>:<fpage>W289</fpage>&#x02013;<lpage>W294</lpage>.</citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stroud</surname> <given-names>D. A.</given-names></name> <name><surname>Becker</surname> <given-names>T.</given-names></name> <name><surname>Qiu</surname> <given-names>J.</given-names></name> <name><surname>Stojanovski</surname> <given-names>D.</given-names></name> <name><surname>Pfannschmidt</surname> <given-names>S.</given-names></name> <name><surname>Wirth</surname> <given-names>C.</given-names></name><etal/></person-group> (<year>2011</year>). <article-title>Biogenesis of mitochondrial beta-barrel proteins: the POTRA domain is involved in precursor release from the SAM complex.</article-title> <source><italic>Mol. Biol. Cell</italic></source> <volume>22</volume> <fpage>2823</fpage>&#x02013;<lpage>2833</lpage>. <pub-id pub-id-type="doi">10.1091/mbc.E11-02-0148</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sutcliffe</surname> <given-names>I. C.</given-names></name></person-group> (<year>2010</year>). <article-title>A phylum level perspective on bacterial cell envelope architecture.</article-title> <source><italic>Trends Microbiol.</italic></source> <volume>18</volume> <fpage>464</fpage>&#x02013;<lpage>470</lpage>. <pub-id pub-id-type="doi">10.1016/j.tim.2010.06.005</pub-id></citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Williams</surname> <given-names>T. A.</given-names></name> <name><surname>Codoner</surname> <given-names>F. M.</given-names></name> <name><surname>Toft</surname> <given-names>C.</given-names></name> <name><surname>Fares</surname> <given-names>M. A.</given-names></name></person-group> (<year>2010</year>). <article-title>Two chaperonin systems in bacterial genomes with distinct ecological roles.</article-title> <source><italic>Trends Genet.</italic></source> <volume>26</volume> <fpage>47</fpage>&#x02013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.1016/j.tig.2009.11.009</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Williams</surname> <given-names>T. A.</given-names></name> <name><surname>Fares</surname> <given-names>M. A.</given-names></name></person-group> (<year>2010</year>). <article-title>The effect of chaperonin buffering on protein evolution.</article-title> <source><italic>Genome Biol. Evol.</italic></source> <volume>2</volume> <fpage>609</fpage>&#x02013;<lpage>619</lpage>. <pub-id pub-id-type="doi">10.1093/gbe/evq045</pub-id></citation></ref>
<ref id="B68"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yen</surname> <given-names>M. R.</given-names></name> <name><surname>Peabody</surname> <given-names>C. R.</given-names></name> <name><surname>Partovi</surname> <given-names>S. M.</given-names></name> <name><surname>Zhai</surname> <given-names>Y.</given-names></name> <name><surname>Tseng</surname> <given-names>Y. H.</given-names></name> <name><surname>Saier</surname> <given-names>M. H.</given-names></name></person-group> (<year>2002</year>). <article-title>Protein-translocating outer membrane porins of Gram-negative bacteria.</article-title> <source><italic>Biochim. Biophys. Acta</italic></source> <volume>1562</volume> <fpage>6</fpage>&#x02013;<lpage>31</lpage>. <pub-id pub-id-type="doi">10.1016/S0005-2736(02)00359-0</pub-id></citation></ref>
<ref id="B69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>N. Y.</given-names></name> <name><surname>Wagner</surname> <given-names>J. R.</given-names></name> <name><surname>Laird</surname> <given-names>M. R.</given-names></name> <name><surname>Melli</surname> <given-names>G.</given-names></name> <name><surname>Rey</surname> <given-names>S.</given-names></name> <name><surname>Lo</surname> <given-names>R.</given-names></name><etal/></person-group> (<year>2010</year>). <article-title>PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes.</article-title> <source><italic>Bioinformatics</italic></source> <volume>26</volume> <fpage>1608</fpage>&#x02013;<lpage>1615</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btq249</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn01">
<label>1</label>
<p><ext-link ext-link-type="uri" xlink:href="http://www.r-project.org/">http://www.r-project.org/</ext-link></p>
</fn>
<fn id="fn02">
<label>2</label>
<p><ext-link ext-link-type="uri" xlink:href="http://pfam.sanger.ac.uk">http://pfam.sanger.ac.uk</ext-link>; version 27</p>
</fn>
<fn id="fn03">
<label>3</label>
<p><ext-link ext-link-type="uri" xlink:href="http://supfam.org/SUPERFAMILY/cgi-bin/genome_names.cgi">http://supfam.org/SUPERFAMILY/cgi-bin/genome_names.cgi</ext-link></p>
</fn>
</fn-group>
</back>
</article>