ORIGINAL RESEARCH article

Front. Microbiol., 21 January 2014

Sec. Antimicrobials, Resistance and Chemotherapy

Volume 5 - 2014 | https://doi.org/10.3389/fmicb.2014.00004

Potential of known and short prokaryotic protein motifs as a basis for novel peptide-based antibacterial therapeutics: a computational survey

    HR

    Heini Ruhanen 1,2,3

    DH

    Daniel Hurley 1,2,3

    AG

    Ambarnil Ghosh 4

    KT

    Kevin T. O'Brien 1,2,3

    CR

    Catrióna R. Johnston 5

    DC

    Denis C. Shields 1,2,3*

  • 1. Complex and Adaptive Systems Laboratory, University College Dublin Dublin, Ireland

  • 2. Conway Institute of Biomolecular and Biomedical Science, University College Dublin Dublin, Ireland

  • 3. School of Medicine and Medical Science, University College Dublin Dublin, Ireland

  • 4. Crystallography and Molecular Biology Department, Saha Institute of Nuclear Physics Kolkata, India

  • 5. Translational Research Institute, Woolloongabba QLD, Australia

Article metrics

View details

16

Citations

7k

Views

2,1k

Downloads

Abstract

Short linear motifs (SLiMs) are functional stretches of protein sequence that are of crucial importance for numerous biological processes by mediating protein–protein interactions. These motifs often comprise peptides of less than 10 amino acids that modulate protein–protein interactions. While well-characterized in eukaryotic intracellular signaling, their role in prokaryotic signaling is less well-understood. We surveyed the distribution of known motifs in prokaryotic extracellular and virulence proteins across a range of bacterial species and conducted searches for novel motifs in virulence proteins. Many known motifs in virulence effector proteins mimic eukaryotic motifs and enable the pathogen to control the intracellular processes of their hosts. Novel motifs were detected by finding those that had evolved independently in three or more unrelated virulence proteins. The search returned several significantly over-represented linear motifs of which some were known motifs and others are novel candidates with potential roles in bacterial pathogenesis. A putative C-terminal G[AG].$ motif found in type IV secretion system proteins was among the most significant detected. A KK$ motif that has been previously identified in a plasminogen-binding protein, was demonstrated to be enriched across a number of adhesion and lipoproteins. While there is some potential to develop peptide drugs against bacterial infection based on bacterial peptides that mimic host components, this could have unwanted effects on host signaling. Thus, novel SLiMs in virulence factors that do not mimic host components but are crucial for bacterial pathogenesis, such as the type IV secretion system, may be more useful to develop as leads for anti-microbial peptides or drugs.

Introduction

Short linear motifs (SLiMs) are functional microdomains in proteins that play a critical role in many distinct biological processes such as cell signaling and regulation, post-translational modifications, proteolytic cleavage, and protein trafficking (Davey et al., 2011b; Mooney et al., 2012). These motifs are typically found in eukaryotic disordered protein regions and vary in size from 3 to 12 amino acids (Fuxreiter et al., 2007). In general, SLiMs have less than five defined amino acid positions and frequently these positions have some degree of flexibility in amino acid composition. Their shortness makes them evolutionarily plastic, allowing them to evolve convergently in unrelated proteins. This can allow proteins to rapidly acquire new protein interaction functions (Neduva and Russell, 2005; Diella et al., 2008; Davey et al., 2010, 2012b). Their short length also presents a challenge for SLiM discovery both experimentally and computationally, since there may be many false positive findings using both methods.

The presence of SLiMs in eukaryotes and viruses has been well-established. Several pioneering viral studies were crucial for the original characterization of SLiMs (Davey et al., 2011b). Viruses use SLiMs as a principal mechanism of hijacking cells by binding to host proteins and recruiting them to process viral proteins. A viral genome can contain various short motifs, many of which are necessary for the viral life cycle, providing a plethora of ways for the virus to take over the molecular machinery of the host cell (Kadaveru et al., 2008; Davey et al., 2011b). Like viruses, pathogenic bacteria are extremely proficient in intercepting host cell functions and in many cases it is still poorly understood how bacteria carry out the manipulation of the host cells. SLiMs have been documented in a number of cases to play a role in bacterial pathogenicity. However, bacterial linear motifs are not as well-characterized as in eukaryotes.

Most of the known instances of bacterial motifs are involved in pathogenicity including signals in effector proteins or host motif mimicry (Cornelis and Van Gijsegem, 2000; Alto et al., 2006). The tripeptide RGD motif is a known host extracellular matrix adhesion factor that is also used by bacteria to attach onto host cells (Tegtmeyer et al., 2010; Zimmermann et al., 2010; Zhang et al., 2012). RGD based anticancer and antithrombotic drugs are currently being developed but their direct impact on limiting bacterial adhesion and infectivity has not been investigated. A second example of a bacterial motif is the EPIYA motif found in several bacterial type III or IV secretion system effector proteins, which mimics SH2 binding peptides of the host (Hayashi et al., 2013). A third example of a bacterial motif has evolved to antagonize host proteins, but does this using a motif for which there is no eukaryotic equivalent. This W… E motif (where “.” indicates any amino acid) in bacterial effector proteins has been proposed to mimic host G-proteins (Alto et al., 2006; Jackson et al., 2008; Ham et al., 2009). Other motifs found in prokaryotes which are not simply mimicking known eukaryotic motifs play roles in transport, modification and proteolysis of the bacterial proteins (Table 1).

Table 1

Virulence factorMotifFunctionReferences
ADHERENCE
CagL, MceRGD*, NGRAn integrin binding cell adhesion motifConradi et al., 2012a,b; Zhang et al., 2012
CagLFEANEParticipates in integrin bindingConradi et al., 2012b
YadA[SG][VI][AS][IVT]G..SRepeated collagen binding motifTahir et al., 2000
InlJ, InlA, nanA, CspALP.TGCell wall anchor motifHarris et al., 2003; Sabet et al., 2005; Banerjee et al., 2010
Eno[LF]Y[DNK]… [KG][KV]Y[VD]*Plasminogen binding motifBergmann et al., 2003; Nogueira et al., 2012
CBPA-G, LytA-CW[WFY][FY]….G.MRepeated cholin (cell wall) binding motifGarau et al., 2005
SpsAYRNYPTHost secretory immunglobulin A (SlgA) and secretory component (SC) binding motifHammerschmidt et al., 2000
EFFECTOR
SipA, SopADEVD*, [DSTE][P][DEWHFYC]D[GSAN]Caspase 3 cleavage site motifSrikanth et al., 2010; Dinkel et al., 2012
AvrPto, HopF2, AvrB, AvrRpm1MG..C*,G… [STC], M{0, 1}(G)[EDRKHPFYW]..[STAGCN][P]N-myristoylation/S-palmitoylation motifShan et al., 2000; Robert-Seilaniantz et al., 2006; Dinkel et al., 2012; Hicks and Galan, 2013
WtsE, AvrE1, IpgB2, IpgB1, Map, EspM, EspT, SifA, SifBW… E*Host Rho GTPase activation/modulation motifAlto et al., 2006; Ham et al., 2009
WtsE,[LR][KQVS][KQLR][EST][GQR][FLKS][EGPK] [MLVAS][KNAL][SGIE]*Putative endoplasmic reticulum membrane retention/retrieval motifHam et al., 2006, 2009
SifA, AnkBCLCCFL*, (C)[DENQ][LIVM].$CAAX box, putative prenylation motif (addition of farnesyl or geranylgeranyl group)Boucrot et al., 2003; Hicks et al., 2011; Dinkel et al., 2012
YopE, SptP, ExoSG.LR… T(YopE*)Arginine finger motif, essential for Rho GAP function of virulence factorsBlack and Bliska, 2000; Wurtele et al., 2001
PopB, PopP2, AvrBs3[DE]((K[RK])|(RK))[KRP][KR][DE], [KR][KR].{7,15}[DE]((K[KR])|(RK))(([DE][KR]) |(KR][DE]))[DE], [DE]((K[RK])|(RK))(([DE][KR])|([KR][DE]))(([PKR]) |([DE][DE])), (([PKR].{0,1}[DE])|([PKR]))((K[RK])|(RK)) (([DE][KR])|([KR][DE]))[DE]Nuclear localization signal (NLS) motifsSzurek et al., 2002; Deslandes et al., 2003; Dean, 2011; Dinkel et al., 2012
CagA, Tarp, AnkA, LspAE[PNS][IV]Y[AEG]Membrane targeting/phosphorylation motifHigashi et al., 2005; Suzuki et al., 2009; Hayashi et al., 2013
SspH2, SseI….GSGC….., G(C)M[GS][CL][KP]C, M{0,1}G(C)..S[AKS]S-palmitoylation motifHicks et al., 2011; Dinkel et al., 2012
ExoS, SopE[FIV]..[FIV].[FIV]..[NC].[FIV]Membrane localization motif (targets ExoS to the Golgi-endoplasmic reticulum)Zhang and Barbieri, 2005
SopD2, SifA, SseJ, SspH2WEK[IM]..FFTranslocation/late endocytic compartments targeting motifBrown et al., 2006
ExoUKAWRNPlasma membrane localization/ubiquitinylation motifRabin and Hauser, 2005; Stirling et al., 2006
SopE, BopEGAG[AT]Catalytic loop motif essential for guanine nucleotide exchangeSchlumberger et al., 2003
AvrPphBGDKAutoproteolytic cleavage motifDowen et al., 2009
SopA, IpaH, SspH1L….TC, C.DE3 ubiquitin ligase motifZhang et al., 2006; Rohde et al., 2007
VirFLP… … ….LF-box domain motif, mediates protein–protein interactionsTzfira et al., 2004
VopL, VopF[R]..((.[ILMVF])|([ILMVF].))[P][P][ILVM]. {4,7}L(([KR].)|(NK))[VATIGS], [R]..[ILVMF][ILMVF][P][ILVM].{4,7}L(([KR].) |(NK))[VATI]WH2-domain motifLiverman et al., 2007; Dinkel et al., 2012
SpvC, OspF, VirA[KR]{0,2}[KR].{0,2}[KR].{2,4}[ILVM].[ILVF]D motif, Docking motif required for specific binding to MAPKsZhu et al., 2007; Dinkel et al., 2012
IpaAL..AA..VA..V..LI..A.Vinculin binding domain motifHamiaux et al., 2006
ExoS, ExoTLLDALDLAFAS (14-3-3 protein) binding motif, mediates activation of the ADPRT domainSun et al., 2004; Dean, 2011
EspF[RKY]..P..P, P..P.[KR],…[PV]..P, KP..[QK]…SH3 binding motifAlto et al., 2007; Dinkel et al., 2012
Map, NleH1, EspI (NleA)…[ST].[ACVILF]$, … [VLIFY].[ACVILF]$,…[DE].[ACVILF]$ (EspI*)C-terminal PDZ1 binding motifLee et al., 2008; Martinez et al., 2010; Dinkel et al., 2012
TOXIN
Listeriolysin O (LLO)PPASP*PEST-motif, involved in phagosomal escape of bacteria in infected cellsLety et al., 2001
OTHER
VirD4, VirB11, VirB4, SecAG….GK[TS]*Walker A motif, nucleotide-binding motifSato et al., 1996; Atmakuri et al., 2004
SecA[RK]….G….L[VILFWYMC]{4,4}DWalker B, nucleotide binding motifSato et al., 1996
MsbA, PiaA, PiuALSGGQ (PiaA*, PiuA*)ABC-motif, ATP binding cassette transporter motifGarmory and Titball, 2004; Buchaklian and Klug, 2006
EsxA, EsxB, esat6W.GW.G motif helps to create a shallow cleft structure and may represent a peptide recognition feature by which cargo proteins are acquired for transportBurts et al., 2005; Sundaramoorthy et al., 2008

Examples of known instances of Short Linear Motifs in bacterial virulence factors.

*

Proven role in virulence. Bold, a non-eukaryotic motif.

“∧”

start of the protein or if in the middle of the motif sequence states which amino acids are excluded in the position,

“$” end of the protein, “.” any amino acid, {} defines the length of a range in the motif sequence, [] defines which amino acids can occur at a given motif position, () marks positions of specific interest e.g., covalent modification or is used to group parts of the expression. Motif table modified from Dean (2011), additional motifs added from the literature.

Since SLiMs are used in a plethora of cellular processes in eukaryotes and are utilized by both pathogenic bacteria and viruses, discovering and characterizing new linear motifs is of great importance. As well as shedding light on the mechanisms of fundamental cellular processes they also hold promise as future therapeutic targets. There is an urgent need for new classes of antimicrobial therapeutics that are effective against multidrug resistant bacteria. Conventional antibiotics are becoming increasingly ineffective against pathogenic bacteria, such as methicillin resistant Staphylococcus aureus (MRSA) which presents a severe threat to public health.

We were interested in whether SLiMs may be valuable when developing new antimicrobial peptides or drugs. Compared with recombinant proteins, the smaller size of peptides makes them easier to manufacture and deliver. The use of chemically synthesized peptides in pharmacological and clinical applications is relatively limited by their low systemic stability and high clearance, poor membrane permeability, negligible activity when administered orally and their high cost of manufacture in comparison to small chemical compounds. However, to date more than 100 peptide-based drugs have already reached the market and of these, the majority are at the smaller end of the size spectrum at 8–10 amino acids (Craik et al., 2013).

Here, we conducted a study to discover SLiMs computationally in bacterial virulence factor datasets. We surveyed the distribution of these novel motifs, and compared their distribution with that of known motifs observed in prokaryotic proteins. The list of motifs given here represents a useful resource for experimental scientists interested in targeting SLiMs that may be important for the pathogenicity of bacteria.

Materials and methods

We utilized data from a virulence factor database MvirDB (Lawrence Livermore National Laboratory), which integrates DNA and protein sequence information from Tox-Prot, SCORPION, the PRINTS database of virulence factors, VFDB, TVFac, Islander, ARGO, CONUS, KNOTTIN, a subset of VIDA and sequences derived by means of literature searches (Zhou et al., 2007). MvirDB can be accessed at http://mvirdb.llnl.gov. The MvirDB browser tool was used to search the database to retrieve virulence factors by functional categories (Table 2) and to download sequences of interest. Protein sequence identifiers for the downloaded sequences for each functional category are available in Table S1.

Table 2

Virulence protein groupNumber of sequencesNumber of unrelated proteins
Adherence749181
Capsule33257
Chemotaxis19218
Effector11127
Endotoxin6627
Enzyme647121
Exotoxin9212
Lipoprotein46370
Motility8623
Siderophore15043
Type III secretion system57175
Type IV secretion system18138

Functional search terms used to retrieve and download protein sequences from virulence factor database MvirDBbrowser tool.

The recovered protein sequences in each functional category thought to be associated with pathogenicity were searched for SLiMs using SLiMFinder (Davey et al., 2010) both locally, and on a webserver that is available at http://bioware.ucd.ie. The default settings provided in SLiMFinder without any extra masking were used in the analysis. This method finds sets of three or more unrelated proteins in a dataset of proteins that share a motif. Chemotaxis and enzyme protein sequence datasets were filtered to contain only sequences longer than 20 amino acids and lipoprotein and Exotoxin datasets sequences longer than 40 amino acids prior to the analysis.

The motifs identified by the SLiMFinder analysis were further examined for similarity to known SLiMs from literature motifs using CompariMotif, which takes two lists of protein motifs and compares them to each other, identifying and scoring similarities between short motifs in the sets (Edwards et al., 2008).

Motifs were visualized using the MEME Suite (Bailey et al., 2009), by taking a stretch of 10 amino acid residues containing the motif of interest from each protein sequence where the motif was found. MEME represents motifs as position dependent letter probability matrices which describe the probability of each possible letter at each position in the pattern. These are displayed as “sequence LOGOS,” containing stacks of letters at each position in the motif. The total height of the stack is the “information content” of that position in the motif in bits. The height of the individual letters in a stack is the probability of the letter at that position multiplied by the total information content of the stack.

Datasets comprised of protein sequences obtained from UniProtKB that are predicted to be effector proteins from a selection of 60 organisms represented in the MvirDB were used to assess the distribution of prokaryotic protein motifs. The presence of both known and novel motifs in these datasets was investigated using the predictive computational tool SLiMSearch which can be used to determine the occurrences of predefined motifs in protein sequences (Davey et al., 2011a). Heat maps were generated to visualize the incidences of motifs in the protein datasets where the frequency of the heat map represents the logarithm of the normalized N_UPC (Number of incidences of a motif in an Unrelated Protein Cluster) value returned in the SLiMSearch results. The N_UPC for an individual motif in a specific organism was normalized by dividing the value by the total amount of UPCs (Unrelated Protein Clusters) in the specific organism and the average N_UPCs of a motif across all 60 organisms. For motifs where there were no incidences in a specific organism the frequency was set to an arbitrary value lower than the minimum actual observed value.

The organisms in Figures 2, 3 which cover the motif sequences were presented in a phylogenetic tree (Figure 4). The Taxonomic IDs for all the organisms are used as input in NCBI's Taxonomy Common Tree tool (http://www.ncbi.nlm.nih.gov/Taxonomy/CommonTree/wwwcmt.cgi). The “phenogram” taxonomic tree (*.phy format) obtained from the NCBI server was fed into Drawgram tree drawing program of Phylip package (version 3.695). Branches were colored according to the following scheme: Purple, High GC Gram+ bacteria; Blue, Firmicutes; Yellow, a-proteobacteria; Light Brown, b-proteobacteria; Dark Brown, e-proteobacteria; Green, g-proteobacteria (non-enterobacteria); Red, g-proteobacteria (enterobacteria); Black, others (CFB).

Results

Our objective was to discover novel SLiMs in non-homologous bacterial proteins with similar roles in virulence that may have functional importance in pathogenesis, and thus have potential to be developed into antimicrobial peptides or drugs. Our analysis returned both previously characterized and novel motifs in several different functional categories indicating the suitability of SLiMFinder for the analysis of bacterial sequence data as well as eukaryotic data. We focused on 12 groups of bacterial proteins with predefined roles in pathogenicity (Table 2). SLiMFinder identified numerous motifs among these proteins. Table 3 lists those with a p-value (Sig) less than 0.05. Bonferroni correction for significance with 12 search datasets would suggest that motifs with a Sig value of less than 0.004 are significant. Since pathogenesis proteins from bacteria often interact with host protein components, we examined whether any of the identified motifs showed similarity to known eukaryotic linear motifs, using the Comparimotif tool. However, we did not find any convincing similarities, in spite of the known occurrence of eukaryotic motifs in bacterial effector proteins. We also investigated if any of the motifs were known prokaryotic motifs identified in the literature.

Table 3

Virulence protein groupNumber of sequencesMotif patternMotif nameInformation contentOccurrencesUnrelated proteinsProbability (Sig value)References
(A) PREVIOUSLY DESCRIBED MOTIFS
Adherence749.K.{0,2}KSignal peptide motif3151470.00E+00Juncker et al., 2003; Bagos et al., 2008
.{1,2}K.{1,2}LSignal peptide motif3102351.15E–10Juncker et al., 2003; Bagos et al., 2008
.{1,2}K.{0,2}ISignal peptide motif391341.41E–08Juncker et al., 2003; Bagos et al., 2008
.K.[IL]Signal peptide motif2.7758215.80E–07Juncker et al., 2003; Bagos et al., 2008
.K..SSignal peptide motif324120.002Juncker et al., 2003; Bagos et al., 2008
KK$C-terminal KK323110.034Bergmann et al., 2003; Itzek et al., 2010
Capsule332.{1,2}K.{0,1}ISignal peptide motif355186.96E–06Juncker et al., 2003; Bagos et al., 2008
.{1,2}K.{0,1}I..VSignal peptide motif41670.002Juncker et al., 2003; Bagos et al., 2008
.{1,2}K.{0,2}ISignal peptide motif359180.003Juncker et al., 2003; Bagos et al., 2008
.K.[ILV]Signal peptide motif2.6339130.003Juncker et al., 2003; Bagos et al., 2008
.{1,2}K.{0,2}KSignal peptide motif352170.006Juncker et al., 2003; Bagos et al., 2008
Chemotaxis192
Effector111
Endotoxin66
Enzyme647.{1,2}KKSignal peptide motif327149.52E–05Juncker et al., 2003; Bagos et al., 2008
..K.{0,2}ISignal peptide motif330130.042Juncker et al., 2003; Bagos et al., 2008
Exotoxin92
Lipoprotein463L.[AG]C[AGS]Lipobox3.478300.00E+00Braun and Rehn, 1969; Babu et al., 2006
[FLV].L.[AG]CLipobox3.4136240.00E+00Braun and Rehn, 1969; Babu et al., 2006
[ILV].[AGS]CLipobox2.27370530.00E+00Braun and Rehn, 1969; Babu et al., 2006
[AGS]C[AGS]Lipobox2.27285507.22E–15Braun and Rehn, 1969; Babu et al., 2006
.{1,2}K.{0,2}KSignal peptide motif3109271.27E–09Juncker et al., 2003; Bagos et al., 2008
L.{1,2}GC.{0,1}ALipobox441151.65E–09Braun and Rehn, 1969; Babu et al., 2006
A.{0,2}L..C.{0,2}SLipobox468193.02E–09Braun and Rehn, 1969; Babu et al., 2006
.K..[ILV]Signal peptide motif2.6366194.36E–09Juncker et al., 2003; Bagos et al., 2008
..K..[FLV]Signal peptide motif2.6365171.07E–07Juncker et al., 2003; Bagos et al., 2008
[ILV]..C.[AGS]Lipobox2.27217364.11E–06Braun and Rehn, 1969; Babu et al., 2006
KK$C-terminal KK32290.016Bergmann et al., 2003; Itzek et al., 2010
Motility86
Siderophore150.[KR]ISignal peptide motif2.771270.024Juncker et al., 2003; Bagos et al., 2008
Type III secretion571
Type IV secretion181.K[KR]Signal peptide motif2.772990.003Juncker et al., 2003; Bagos et al., 2008
.K..[FIL]Signal peptide motif2.6327100.025Juncker et al., 2003; Bagos et al., 2008
(B) NOVEL MOTIFS
Adherence749LP.G.Y437120.012
Capsule332G.S..M.L41570.029
Chemotaxis192E..Q.I[AG].I4.772250.004
E..Q.[IV]..I3.772470.02
Effector111..I.{0,1}N32160.012
[LV].PY2.7746110.042
..I[ST]2.771760.049
Endotoxin66
Enzyme647A.I.P.VL51470.019
VSIL.S51170.049
Exotoxin92
Lipoprotein463ML..C31470.017
Motility86
Siderophore150I.K..G328170.044
GYP..TP5540.052
Type III secretion571
Type IV secretion181G[AG].$2.771992.64E–04

Significant motifs returned by SLiMFinder in each dataset (where probability <0.05).

Where very similar motifs are returned for a protein group, only a representative motif is shown.

Information content (Edwards et al., 2007),

start of the protein, “$” end of the protein, “.” any amino acid, {} defines the range of a repeat in the motif sequence, [] defines which amino acids can occur at a given motif position.

Italic font is used when Probability (Sig-value) is higher than the 0.05 confidence level.

Known motifs

Three of the motifs highlighted by SLiMFinder were previously known bacterial motifs. The most significant of these was the well-characterized prokaryotic N-terminal lipid modification [LVI][ASTVI][GAS]C motif that has been previously shown to be essential for the anchoring of bacterial proteins to the membrane surface (Braun and Rehn, 1969; Babu et al., 2006). The square brackets enclose alternative amino acids which are possible at that position in the motif. This motif is present in a wide range of proteins across Gram-positive and Gram-negative bacteria and is a clear example of a motif that has convergently evolved in many unrelated proteins. It was found in numerous configurations in the lipoprotein dataset of which seven are listed in Table 3. This “lipobox” motif sequence is located at the C-terminal end of the signal peptide and the lipid-modifiable cysteine (+1 position) is invariant (Juncker et al., 2003). Lipid modification of this cysteine residue (N-acyl-S-diacylglyceryl-Cys) has been found to be an essential, ubiquitous, and unique bacterial post-translational modification. Such a modification allows anchoring of even highly hydrophilic proteins to the membrane surface leaving the rest of the protein to carry out a variety of relevant functions in the aqueous or aqueous-membrane interface (Juncker et al., 2003; Babu et al., 2006). Bacterial lipoproteins affect a wide range of mechanisms in virulence. They have been shown to play key roles in adhesion to host cells and in translocation of virulence factors into host cells (Kovacs-Simon et al., 2011). Furthermore, they are potent inducers of host inflammatory responses.

The second known motif identified was an N-terminal MK.{0,2}K motif present in several search categories in varying configurations (Table 3, Adherence, Capsule, Enzyme, Lipoprotein, Siderophore, and Type IV secretion system). This motif representation indicates that the second K (lysine) may lay 0, 1, or 2 residues after the K that follows the initiator methionine. The “” symbol indicates the start of the protein, which is treated as a distinct character in motif discovery. SLiMFinder omits the M from the returned motif resulting in .K.{0,2}K representation, since initiator methionines were deliberately masked out to avoid returning motifs reliant simply on the strong enrichment of M at the start of proteins. The MK.{0,2}K motif is commonly found in bacterial signal peptides both in proteins that are targeted to the membrane and in secreted proteins (Juncker et al., 2003; Bagos et al., 2008). Both of the known motifs are presented as regular expressions in Figure 1, which provides some information on additional contextual preferences beyond the simple motif description. Signal peptides in bacteria are mainly divided into the secretory signal peptides that are cleaved by Signal Peptidase I and those cleaved by Signal Peptidase II which characterize the membrane-bound lipoproteins (Juncker et al., 2003; Bagos et al., 2008). The signal peptides in both classes of proteins in Gram-positive and Gram-negative bacteria are quite similar, sharing the N-terminal region which is characterized by presence of the positive amino acids at the start of the protein, as well as the preference for hydrophobic residues further along the signal peptide.

Figure 1

Figure 1

MEME suite motif logos of the novel and known motifs returned in the SLiMFinder analysis. Each position in the motif is represented as a stack of letters. The total height of the stack is the “information content” of that position in the motif in bits. The height of the individual letters in a stack is the probability of the letter at that position multiplied by the total information content of the stack. Black box: the most significant novel motif G[AG].$, Yellow box: KK$ motifs found in Adherence and Lipoprotein datasets, Red box: Known bacterial motifs .K.{0,2}K and [ILV].[AGS]C.

The third previously characterized bacterial motif returned in our analysis is the C-terminal KK$ motif (where $ indicates the end of the protein, and is treated as a distinct character in motif discovery) found in adherence and lipoprotein datasets (Table 3; Figure 1). This motif has been shown to play a role in plasminogen binding in S. pyogenes and S. pneumoniae α-enolase (Bergmann et al., 2003; Derbise et al., 2004; Itzek et al., 2010). Binding of plasminogen by α-enolase and its subsequent activation has been demonstrated to promote invasion of pathogenic bacteria and therefore represents an important determinant of virulence in invasive infection (Bergmann et al., 2003). Moreover, KK motifs close to the C-terminus are present in a family of Shigella flexneri glucosyl transferases (Gtr) that are integral membrane proteins embedded within the cytoplasmic membrane. These glucosyl transferases contribute to the altering of the structure of the bacterial surface lipopolysaccharide (LPS) O-antigen along with O-acetyltransferase (Lehane et al., 2005; Ramiscal et al., 2010). The KK motif has been shown to be essential for the activity of Gtrs. However, Ramiscal et al. showed that the KK motif in a recently identified GtrIc is not critical for its activity (Ramiscal et al., 2010). We hypothesize that the KK$ motif instances identified here in diverse proteins may play an adhesive role similar to the plasminogen binding instances in α-enolase. We note that plants have a KK$ variant (Gidda et al., 2009) of a known eukaryotic cytoplasmically exposed endoplasmic reticulum (ER) localization motif KKxx$ found in mammals, yeast and plants (Nilsson et al., 1989; Jackson et al., 1990; Contreras et al., 2004). It is therefore conceivable that the bacterial KK$ motif could in some proteins direct invading proteins to certain parts of the eukaryotic host cell. However, we do not think this is very plausible, since the enrichment of KK$ motifs spans many known bacterial lipoproteins (Table 3) which seem unlikely to migrate to this host cell location.

Novel motifs

The most significant novel motif (p-value 0.0003) discovered is a C-terminal G[AG].$ motif in the type IV secretion system dataset. The full list of unrelated proteins containing the G[AG].$ motif is represented in Table 4. The MEME regular expression pattern of the motif in these proteins is described in Figure 1. Four of the nine unrelated proteins containing this motif appear to be identified equivalents of the type IV secretion system components in the well-studied Agrobacterium tumefaciens: VirB4, VirB8, VirB11, and VirB7 [TrwH has 59% identity with VirB7 family (Patey et al., 2006)]. VirB4 and VirB11 are known energetic components of the type IV secretion system in A. tumefaciens. Both of these proteins are membrane associated NTPases on the inner membrane (Tegtmeyer et al., 2011). VirB8 on the other hand, is an essential inner membrane component of type IV secretion systems that is believed to form a homodimer and has been shown to be of importance for complex stability in A. tumefaciens (Sivanesan and Baron, 2011). The VirB7 is an outer membrane lipoprotein that localizes exocellularly and associates with the type IV secretion system pilus. Both VirB7 lipid modification and disulfide cross-linking have been shown to be important for pilus assembly (Sagulenko et al., 2001). The Helicobacter pylori protein Cag7 that is among the proteins containing the C-terminal G[AG].$ motif has previously been proposed to be a transmembrane protein that is associated with the pilus (Rohde et al., 2003; Tegtmeyer et al., 2011). At least five of the nine unrelated proteins containing the G[AG].$ motif seem to be associated with the bacterial membranes and it is thus possible that this motif would be involved in the targeting and/or attachment of these proteins into the bacterial membranes. However, since the motif has been specifically identified within type IV secretion proteins, it is more likely that the motif facilitates interaction with a component of the type IV secretion system itself. We inspected the distribution of the motif across effector proteins (Figure 2) and noted that there are typically one or none per species, suggesting that the motif is not itself enriched strongly among effector proteins themselves.

Table 4

PatternProtein groupP-value (Sig)MatchNo. unrelated proteinsDescription
G[AG].$Type IV secretion2.64E–04GGN9virB11 protein homolog|9992|YP_034060|49476019|8040| [Bartonella henselae str. Houston-1]
GAKvirB4|10558|NP_863348|32469876|8343|VirB4 [Campylobacter jejuni subsp. jejuni 81-176]
GAKvirB8|10560|NP_863298|32469826|8344|VirB8 [Campylobacter jejuni subsp. jejuni 81-176]
GAEcag pathogenicity island protein (cag11)|10866|NP_207327|15645157|8497| [Helicobacter pylori 26695]
GGKtrwF protein|9938|YP_034270|49476229|8013|[Bartonella henselae str. Houston-1]
GAItrwH2 hypothetical protein BH15720|9944|YP_034268|49476227|8016 [Bartonella henselae str. Houston-1]
GAScag pathogenicity island protein (cag25)|10894|NP_207342|15645172|8511| [Helicobacter pylori 26695]
GGNPutative type IV secretion system protein|41299|NP_790379|NP_790379.1|18355| [Pseudomonas syringae pv. tomato str. DC3000]
GAItrwH1 hypothetical protein BH15690|9942|YP_034265|49476224|8015| [Bartonella henselae str. Houston-1]
GGNcag pathogenicity island protein (cag7)|10904|NP_207323|15645153|8516| [Helicobacter pylori 26695]
KK$Adherence0.034KK11hmw2C putative accessory processing protein [Haemophilus influenzae]|2847|AAA20526|482843|2554|
KKkpsC polysaccharide modification protein [Campylobacter jejuni subsp. jejuni NCTC 11168]|10454|NP_282555|15792732|8291|
KKica operon transcriptional regulator [Staphylococcus aureus subsp. aureus MW2]|3137|NP_647402|21284314|2699|
KKpavA adherence and virulence protein A [Streptococcus agalactiae 2603V/R]|9719|NP_688199|22537348|7896|
KKType 4 fimbrial biogenesis protein PilO [Pseudomonas aeruginosa PAO1]|8738|NP_253729|15600235|7426|
KKType 4 fimbrial biogenesis protein PilN [Pseudomonas aeruginosa PAO1]|8740|NP_253730|15600236|7427|
KKPutative collagen binding protein [Streptococcus pyogenes MGAS315] SpyM3_0098|9709|NP_663902|21909634|7891|
KKoapA opacity associated protein [Haemophilus influenzae Rd KW20]|2865|NP_438494|16272282|2563|
KKneuC1 putative N-acetylglucosamine-6-phosphate 2-epimerase/N-acetylglucosamine-6-phosphatase [Campylobacter jejuni subsp. jejuni NCTC 11168]|10349|NP_282290|15792467|8218|
KKwaaE D,D-heptose 1-phosphate adenosyltransferase/7-phosphate kinase [Campylobacter jejuni subsp. jejuni NCTC 11168]|10396|NP_282297|15792474|8262|
KKhmw1C putative accessory processing protein [Haemophilus influenzae]|2841|AAA20529|475773|2551|
KKcytotoxin [Escherichia coli O157:H7]|11176|AAC70163|3822209|8652|
KK$Lipoprotein0.016KK9Multidrug resistance outer membrane efflux protein mdtP; Flags: Precursor|58083|Q8CVH8|24068| Escherichia coli
KKylpB/yscJ needle complex inner membrane lipoprotein [Yersinia pestis CO92]|20866|NP_395193|16082747|13354|
KKYop proteins translocation lipoprotein J OS = Yersinia enterocolitica GN = yscJ PE = 2 SV = 1|18600| AltName: Full = Lipoprotein ylpB; Flags: Precursor; |Q01251|11895|
KKLipoprotein [Salmonella enterica subsp. enterica serovar Typhi str. CT18]|44580|NP_455466|NP_455466.1|19996|
KKLPP20 lipoprotein OS = Helicobacter pylori GN = lpp20 PE = 1 SV = 1|17433|P0A0V0|10728|
KKOuter membrane factor of efflux pump [Escherichia coli str. K-12 substr. MG1655]58055|NP_418504|NP_418504|24053|
KKLipoprotein, putative [Enterococcus faecalis V583]9889|NP_816134|29376980|7982|
KKIron transport lipoprotein SirF [Staphylococcus aureus subsp. aureus Mu50]|8986|NP_371657|15924123|7550|
KKExport protein prsA cytoplasmic membrane protein, protein folding|3480|GBAA2336 prsA-3 |GBAA2336|2870|
KKLPP20 lipoprotein OS = Helicobacter pylori J99 GN = lpp20 PE = 3 SV = 1|17432|P0A0V1|
KKYop proteins translocation lipoprotein J OS = Yersinia pseudotuberculosis GN = yscJ PE = 2 SV = 1|18863| AltName: Lipoprotein ylpB |P69973|12158|
KKYaeC family lipoprotein [Enterococcus faecalis V583] [Enterococcus faecalis V583]|39473|NP_815743|NP_815743.1|17442|
KKMajor outer membrane lipoprotein OS = Yersinia pestis GN = lpp PE = 3 SV = 1|24369| |Q8ZDZ6|15171|
KKPutative lipoprotein [Salmonella enterica subsp. enterica serovar Typhi str. Ty2]|44956|NP_805720|NP_805720.1|20184|

List of proteins containing G[AG].$ and KK$ motifs.

Figure 2

Figure 2

Heat map visualization of the distribution of novel SliMFinder identified motifs amongst effector proteins from a selection of 60 organisms represented in the MvirDB. Columns: The bacterial species name with the total number of UPCs indicated in brackets at the start of the description. Purple, High GC Gram+ bacteria; Blue, Firmicutes; Yellow, a-proteobacteria; Light Brown, b-proteobacteria; Dark Brown, e-proteobacteria; Green, g-proteobacteria (non-enterobacteria); Red, g-proteobacteria (enterobacteria); Black, others (CFB). Rows: The motif regular expression with the total number of incidences in UPCs across all 60 organisms indicated in brackets at the start of the description. Color scale: The logarithm of the normalized N_UPC returned from the SLiMSearch results.

Other novel motifs discovered are summarized in Table 3 and in Figure 1. Their significance is in the range between that for the nominal significance level (p < 0.05) and the Bonferroni adjusted significance level (p < 0.004). While it is likely that a number of these motifs are genuine, a few may be false positives. The LP.G.Y motif found in the adherence dataset superficially resembles a Gram-positive bacteria cell wall anchoring LP.TG motif. Cleavage between the Thr and Gly by sortase or a related enzyme leads to covalent anchoring of the new C-terminal Thr to the cell wall (Navarre and Schneewind, 1994; Gaspar et al., 2005). Cell wall-anchored surface proteins of Gram-positive pathogens play important roles during the establishment of many infectious diseases. While it could be hypothesized that the LP.G.Y motif is similarly involved in the anchoring of bacterial proteins to the cell surface, there are two lines of evidence that argue against this. Firstly, there is no enrichment for T or similar amino acids between P and G in the instances of the motif returned (Figure 1). Secondly, this motif is present both in Gram-positive and Gram-negative bacterial proteins in our study. Accordingly, we consider LP.G.Y a potential novel motif involved in bacterial adhesion through an unidentified mechanism.

Repeated motifs

While SLiMFinder looks for motifs which recur one or more times in a number of independent proteins, it is of biological interest when those motifs are themselves repeated within the proteins, for example, representing multiple adhesion sites. Accordingly, we investigated the frequency of repeats of the identified motifs. Duplicated motifs were found with between two and four copies in proteins. The lipoprotein lipid anchoring motif was found repeated three times in the protein HrpB3 of Xanthomonas euvesicatoria (instances LAGC, LALC, and LSAC). Among these motif instances, LAGC and LSAC are known lipid anchoring motifs (Klein et al., 2005; Konkel et al., 2010). The third instance may represent a true positive anchoring motif, a degenerate motif that is no longer functional or a false positive sequence that fulfills some other functional role in the protein. However, it is clear that the repetition of this well-known motif is in some cases biologically important for function. Thus, for novel motifs, repetition within as well as between proteins may be a potential further indication of important function. An example would be the threefold repetition of the “LP.G.Y” motif in the surface-anchored fimbrial subunit protein SpaG of Corynebacterium diptherae. This motif has a known structure in the collagen binding domain of Staphylococcus aureus (PDB entry 1D2P) (Deivanayagam et al., 2000). Collagen is itself a repetitive structure, occurring in many dense repeats in the host extracellular matrix. The repetition of this bacterial motif in this particular protein may indicate its potential role in making multiple contacts with collagen. However, other instances of the motif detected by SLiMFinder only occurred once in each protein, suggesting that a single copy may be sufficient.

Distribution of short linear motifs across effector proteins of different species

We visualized the cross-species distribution of the SLiMFinder identified novel motifs (see Table 3B) among the annotated effector proteins of other species. The species were chosen to include those present in the MvirDB database that contributed motifs to the discovery, in order to display a varied set of species that could be visualized with ease. It is likely that they also exist in other organisms, although distinguishing true and false positives is not possible computationally. The visualization is normalized to correct for the fact that some species have very few proteins and that some motifs have very few instances. The total number of UPCs are indicated in brackets before each bacterial species as well as the total incidences of a motif in UPCs across all bacterial species indicated before each motif regular expression. The novel SLiMFinder identified effector protein motifs ..I.{0,1}N, [LV].PY and ..I[ST] are found among the effector proteins of many species, but are absent in those of many other species, including those with a reasonable number of annotated effector proteins (Figure 2).

We also looked at the distribution of known motifs (see Tables 1, 3A) across species (Figure 3). While some effector motifs (see second section of Table 1) show a wide phylogenetic distribution, others are restricted to only a few species, such as the G.LR… T motif involved in Rho GAP function. The nuclear localization signals (at the bottom of Figure 3) show a relatively restricted distribution. The WEK[IM]..FF late endocytic compartment localization motif is restricted to the genus Salmonella. While the ubiquitin ligase motifs L….TC and C.D are found in more than 71 and 199 instances respectively across the dataset whereas a number of species lack one or both of these motifs. The two SH3 binding motifs [RKY]..P..P and P..P.[KR] also show a restricted distribution. Similarly, the two PDZ binding motifs,…[ST].[ACVILF]$ and … [VLIFY].[ACVILF]$ show a restricted distribution. Bacterial effector proteins may under certain circumstances be under negative selection to avoid motifs that bind to common domains in the host such as PDZ and SH3 domains.

Figure 3

Figure 3

Heat map visualization of the distribution of known virulence motifs amongst effector proteins from a selection of 60 organisms represented in the MvirDB. Columns: The bacterial species name with the total number of UPCs indicated in brackets at the start of the description. Purple, High GC Gram+ bacteria; Blue, Firmicutes; Yellow, a-proteobacteria; Light Brown, b-proteobacteria; Dark Brown, e-proteobacteria; Green, g-proteobacteria (non-enterobacteria); Red, g-proteobacteria (enterobacteria); Black, others (CFB). Rows: The motif regular expression with the total number of incidences in UPCs across all 60 organisms indicated in brackets at the start of the description. Color scale: The logarithm of the normalized N_UPC returned from the SLiMSearch results.

It can be seen that strains of a species often have very similar motif distributions (Figures 24). There is a weak but not convincing trend (Figure 3) for the known motif distribution among effector proteins of the Firmicutes (Blue) to group together, relative the gamma-proteobacteria (Red and Green). While the Group 2 Bacillus species, anthracis and cereus, cluster together (Figure 3), many sets of closely related species (Figure 4) do not show particularly close relationships in terms of motif distribution. This may result from two factors: firstly, motifs are highly dynamic during evolution, and secondly, factors that play a role in pathogenicity also evolve very fast. It is also difficult to compare rare vs. common motifs, since rare ones may be missed simply because of variation among proteins in the definition of effector proteins, while common motifs may be dominated by false positives that obscure the biologically relevant signals.

Figure 4

Figure 4

Phylogenetic tree of the 60 organisms used to assess the distribution of prokaryotic protein motifs in Figures 2, 3. Purple, High GC Gram+ bacteria; Blue, Firmicutes; Yellow, a-proteobacteria; Light Brown, b-proteobacteria; Dark Brown, e-proteobacteria; Green, g-proteobacteria (non-enterobacteria); Red, g-proteobacteria (enterobacteria); Black, others (CFB).

Discussion

We believe that SLiMs are one potential class of new antimicrobial substances for the development of antimicrobial peptides and drugs. While they may lack the potency of antimicrobial peptides that damage the bacterial membrane, they may have other benefits. In particular, those that mimic peptide components of uniquely prokaryotic motifs are likely to have less off-target effects. The value of developing such therapeutic approaches depends on the range of species likely to be affected by the peptide therapeutic. While targeting eukaryotic peptides mimicked by prokaryote effector proteins provides a potential line of attack, the evolutionary plasticity of such motifs in both bacteria (Figure 3) and in hosts (Neduva and Russell, 2005) suggest that bacteria can rapidly evolve alternative effector strategies to replace one targeted host component with another. Nevertheless, where such drugs are developed for other indications in treating non-infectious disease, they may also have an impact on bacterial pathogenesis and would certainly be worth investigating. This problem of evolutionary evasion by pathogens is also relevant, however, to many adhesion motifs. In order for peptide therapeutics to be more robust in the face of rapid evolution of pathogen resistance, they may need to target fundamental components of bacterial biology. Targeting aspects of the central machinery of bacterial Type IV secretion systems may be a good compromise between targeting a component that is central to pathogenicity, while not affecting the biology of advantageous bacteria in the host. In this respect, the G[AG].$ motif identified in this study is a potential candidate worthy of further investigation. Some clues as to the function of this motif may be provided by the pattern of evolution. Presumably this motif has evolved in multiple components of the Type IV secretion system because of a selection pressure for these proteins to interact with some common factor. Identifying the common interaction partners of these proteins may help in pinpointing its potential functional role. In targeting such pathogenicity systems, the benefit of focusing on recurrent motifs is that they may be small enough interaction surfaces to be feasibly targeted by peptidomimetics, and important enough that it is difficult for the bacterial system to evolve resistance (Baron and Coombes, 2007; Paschos et al., 2011).

The shortlist of predicted motifs that we have generated provides a resource for researchers interested in the mechanisms of action of virulence factor proteins across a diverse range of bacterial species. The limitations of the list are well-illustrated by the fact that the motif discovery failed to rediscover the many mimicked eukaryotic motifs. This reflects not only the fact that some motifs have not evolved multiple times in unrelated proteins, but also the limitations in the datasets provided to the SLiMFinder approach. Ideally, datasets should have less than a 100 proteins which have clearly identified similar functions. The challenge is to group proteins according to function efficiently, since the annotation of protein function is highly variable, and frequently relies on computational predictions arising from homology rather than from direct experimentation. The bigger challenge is how to test and manipulate these motifs to provide insights into the mechanisms of action and to determine potential means of interrupting pathogenic processes. While mutagenesis studies can identify the key features of motif function, targeting of a motif may also be progressed by experimental use of bioactive peptides. However, identification of more potent peptidomimetic compounds that resemble such motifs will ideally need 3D models of the peptide regions in complex with their target interactors.

What, then, is the contribution that computational screening of novel motifs may play in the discovery of novel antimicrobial peptides? Firstly, it clearly will not identify all known motifs, since patterns of recurrent evolution or of strong sequence conservation are not seen for all antimicrobial peptides. Computational screens will also have some “false positives” in two senses: firstly, statistical false positives where the motif arose simply by chance; and secondly, biological false positives where the motif that functions effectively within its biological context of a larger protein and that protein's complexes, but it will not function as a stand-alone synthetic peptide. This could reflect a lack of strong affinity for its targets or it could reflect an inability to be delivered to the appropriate context in the first place. Nevertheless, computational screens have the advantage that they can be performed on high throughput sequencing of organisms about which little else is known and for which biological screening by mutagenesis is painstaking or impossible. The advantage of computational prioritization is that it identifies a subset of peptides which are enriched for biologically active peptides. Clearly, the strategy we adopted here is only detecting a small fraction of known motifs, in part because of the stringent correction for statistical mismatches that could be false positives, but also because many motifs do not recur in known unrelated proteins that fall into the same functional class. Discovery for bioactive peptides could follow other strategies, including searches for evolutionary conservation (Davey et al., 2012a). However, pathogenicity factors frequently evolve rapidly, and so conservation may not be an effective signal. Bioactivity predictors based on biophysical properties within the peptide sequences are an alternative strategy (Dosztanyi et al., 2009; Thomas et al., 2010; Mooney et al., 2012, 2013). These have the disadvantage that there is no straightforward statistical approach available to determine likely false discovery rates, but are very valuable in prioritizing a list of peptides for further experimental characterization. Other computational approaches focus more on particular classes of antimicrobial peptides with a strong therapeutic potential, including ribosomal and non-ribosomal cyclic peptides (Prieto et al., 2012; Kedarisetti et al., 2014). While their computational screening methods have the benefit that they focus more strongly on peptides in classes of known therapeutic benefit, we believe that the computational screening approach we identified here complements their approaches, and widens the diversity of peptides for experimental investigation and validation.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Statements

Acknowledgments

This work was funded by the Wellcome Trust Computational Infection Biology PhD Programme (supporting Heini Ruhanen and Daniel Hurley), by Science Foundation Ireland (grant no. 08-IN.1-B1864) and by the Irish Research Council Graduate Education Research Programme in Bioinformatics and Computational Biology (supporting Kevin T. O'Brien).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: http://www.frontiersin.org/journal/10.3389/fmicb.2014.00004/abstract

Table S1

Protein sequence identifiers for each functional category downloaded from the MvirDB.

References

  • 1

    AltoN. M.ShaoF.LazarC. S.BrostR. L.ChuaG.MattooS.et al. (2006). Identification of a bacterial type III effector family with G protein mimicry functions. Cell124, 133145. 10.1016/j.cell.2005.10.031

  • 2

    AltoN. M.WeflenA. W.RardinM. J.YararD.LazarC. S.TonikianR.et al. (2007). The type III effector EspF coordinates membrane trafficking by the spatiotemporal activation of two eukaryotic signaling pathways. J. Cell Biol. 178, 12651278. 10.1083/jcb.200705021

  • 3

    AtmakuriK.CascalesE.ChristieP. J. (2004). Energetic components VirD4, VirB11 and VirB4 mediate early DNA transfer reactions required for bacterial type IV secretion. Mol. Microbiol. 54, 11991211. 10.1111/j.1365-2958.2004.04345.x

  • 4

    BabuM. M.PriyaM. L.SelvanA. T.MaderaM.GoughJ.AravindL.et al. (2006). A database of bacterial lipoproteins (DOLOP) with functional assignments to predicted lipoproteins. J. Bacteriol. 188, 27612773. 10.1128/JB.188.8.2761-2773.2006

  • 5

    BagosP. G.TsirigosK. D.LiakopoulosT. D.HamodrakasS. J. (2008). Prediction of lipoprotein signal peptides in Gram-positive bacteria with a Hidden Markov Model. J. Proteome Res. 7, 50825093. 10.1021/pr800162c

  • 6

    BaileyT. L.BodenM.BuskeF. A.FrithM.GrantC. E.ClementiL.et al. (2009). MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res. 37, W202W208. 10.1093/nar/gkp335

  • 7

    BanerjeeA.Van SorgeN. M.SheenT. R.UchiyamaS.MitchellT. J.DoranK. S. (2010). Activation of brain endothelium by pneumococcal neuraminidase NanA promotes bacterial internalization. Cell. Microbiol. 12, 15761588. 10.1111/j.1462-5822.2010.01490.x

  • 8

    BaronC.CoombesB. (2007). Targeting bacterial secretion systems: benefits of disarmament in the microcosm. Infect. Disord. Drug Targets7, 1927. 10.2174/187152607780090685

  • 9

    BergmannS.WildD.DiekmannO.FrankR.BrachtD.ChhatwalG. S.et al. (2003). Identification of a novel plasmin(ogen)-binding motif in surface displayed alpha-enolase of Streptococcus pneumoniae. Mol. Microbiol. 49, 411423. 10.1046/j.1365-2958.2003.03557.x

  • 10

    BlackD. S.BliskaJ. B. (2000). The RhoGAP activity of the Yersinia pseudotuberculosis cytotoxin YopE is required for antiphagocytic function and virulence. Mol. Microbiol. 37, 515527. 10.1046/j.1365-2958.2000.02021.x

  • 11

    BoucrotE.BeuzonC. R.HoldenD. W.GorvelJ. P.MeresseS. (2003). Salmonella typhimurium SifA effector protein requires its membrane-anchoring C-terminal hexapeptide for its biological function. J. Biol. Chem. 278, 1419614202. 10.1074/jbc.M207901200

  • 12

    BraunV.RehnK. (1969). Chemical characterization, spatial distribution and function of a lipoprotein (murein-lipoprotein) of the E. coli cell wall. The specific effect of trypsin on the membrane structure. Eur. J. Biochem. 10, 426438. 10.1111/j.1432-1033.1969.tb00707.x

  • 13

    BrownN. F.SzetoJ.JiangX.CoombesB. K.FinlayB. B.BrumellJ. H. (2006). Mutational analysis of Salmonella translocated effector members SifA and SopD2 reveals domains implicated in translocation, subcellular localization and function. Microbiology152, 23232343. 10.1099/mic.0.28995-0

  • 14

    BuchaklianA. H.KlugC. S. (2006). Characterization of the LSGGQ and H motifs from the Escherichia coli lipid A transporter MsbA. Biochemistry45, 1253912546. 10.1021/bi060830a

  • 15

    BurtsM. L.WilliamsW. A.DebordK.MissiakasD. M. (2005). EsxA and EsxB are secreted by an ESAT-6-like system that is required for the pathogenesis of Staphylococcus aureus infections. Proc. Natl. Acad. Sci. U.S.A. 102, 11691174. 10.1073/pnas.0405620102

  • 16

    ConradiJ.HuberS.GausK.MertinkF.Royo GraciaS.StrijowskiU.et al. (2012a). Cyclic RGD peptides interfere with binding of the Helicobacter pylori protein CagL to integrins alphaVbeta3 and alpha5beta1. Amino Acids43, 219232. 10.1007/s00726-011-1066-0

  • 17

    ConradiJ.TegtmeyerN.WoznaM.WissbrockM.MichalekC.GagellC.et al. (2012b). An RGD helper sequence in CagL of Helicobacter pylori assists in interactions with integrins and injection of CagA. Front. Cell. Infect. Microbiol. 2:70. 10.3389/fcimb.2012.00070

  • 18

    ContrerasI.Ortiz-ZapaterE.AnientoF. (2004). Sorting signals in the cytosolic tail of membrane proteins involved in the interaction with plant ARF1 and coatomer. Plant J. 38, 685698. 10.1111/j.1365-313X.2004.02075.x

  • 19

    CornelisG. R.Van GijsegemF. (2000). Assembly and function of type III secretory systems. Annu. Rev. Microbiol. 54, 735774. 10.1146/annurev.micro.54.1.735

  • 20

    CraikD. J.FairlieD. P.LirasS.PriceD. (2013). The future of peptide-based drugs. Chem. Biol. Drug Des. 81, 136147. 10.1111/cbdd.12055

  • 21

    DaveyN. E.CowanJ. L.ShieldsD. C.GibsonT. J.ColdwellM. J.EdwardsR. J. (2012a). SLiMPrints: conservation-based discovery of functional motif fingerprints in intrinsically disordered protein regions. Nucleic Acids Res. 40, 1062810641. 10.1093/nar/gks854

  • 22

    DaveyN. E.Van RoeyK.WeatherittR. J.ToedtG.UyarB.AltenbergB.et al. (2012b). Attributes of short linear motifs. Mol. Biosyst. 8, 268281. 10.1039/c1mb05231d

  • 23

    DaveyN. E.HaslamN. J.ShieldsD. C.EdwardsR. J. (2010). SLiMFinder: a web server to find novel, significantly over-represented, short protein motifs. Nucleic Acids Res. 38, W534W539. 10.1093/nar/gkq440

  • 24

    DaveyN. E.HaslamN. J.ShieldsD. C.EdwardsR. J. (2011a). SLiMSearch 2.0: biological context for short linear motifs in proteins. Nucleic Acids Res. 39, W56W60. 10.1093/nar/gkr402

  • 25

    DaveyN. E.TraveG.GibsonT. J. (2011b). How viruses hijack cell regulation. Trends Biochem. Sci. 36, 159169. 10.1016/j.tibs.2010.10.002

  • 26

    DeanP. (2011). Functional domains and motifs of bacterial type III effector proteins and their roles in infection. FEMS Microbiol. Rev. 35, 11001125. 10.1111/j.1574-6976.2011.00271.x

  • 27

    DeivanayagamC. C.RichR. L.CarsonM.OwensR. T.DanthuluriS.BiceT.et al. (2000). Novel fold and assembly of the repetitive B region of the Staphylococcus aureus collagen-binding surface protein. Structure8, 6778. 10.1016/S0969-2126(00)00081-2

  • 28

    DerbiseA.SongY. P.ParikhS.FischettiV. A.PancholiV. (2004). Role of the C-terminal lysine residues of streptococcal surface enolase in Glu- and Lys-plasminogen-binding activities of group A streptococci. Infect. Immun. 72, 94105. 10.1128/IAI.72.1.94-105.2004

  • 29

    DeslandesL.OlivierJ.PeetersN.FengD. X.KhounlothamM.BoucherC.et al. (2003). Physical interaction between RRS1-R, a protein conferring resistance to bacterial wilt, and PopP2, a type III effector targeted to the plant nucleus. Proc. Natl. Acad. Sci. U.S.A. 100, 80248029. 10.1073/pnas.1230660100

  • 30

    DiellaF.HaslamN.ChicaC.BuddA.MichaelS.BrownN. P.et al. (2008). Understanding eukaryotic linear motifs and their role in cell signaling and regulation. Front. Biosci. 13, 65806603. 10.2741/3175

  • 31

    DinkelH.MichaelS.WeatherittR. J.DaveyN. E.Van RoeyK.AltenbergB.et al. (2012). ELM–the database of eukaryotic linear motifs. Nucleic Acids Res. 40, D242D251. 10.1093/nar/gkr1064

  • 32

    DosztanyiZ.MeszarosB.SimonI. (2009). ANCHOR: web server for predicting protein binding regions in disordered proteins. Bioinformatics25, 27452746. 10.1093/bioinformatics/btp518

  • 33

    DowenR. H.EngelJ. L.ShaoF.EckerJ. R.DixonJ. E. (2009). A family of bacterial cysteine protease type III effectors utilizes acylation-dependent and -independent strategies to localize to plasma membranes. J. Biol. Chem. 284, 1586715879. 10.1074/jbc.M900519200

  • 34

    EdwardsR. J.DaveyN. E.ShieldsD. C. (2007). SLiMFinder: a probabilistic method for identifying over-represented, convergently evolved, short linear motifs in proteins. PLoS ONE2:e967. 10.1371/journal.pone.0000967

  • 35

    EdwardsR. J.DaveyN. E.ShieldsD. C. (2008). CompariMotif: quick and easy comparisons of sequence motifs. Bioinformatics24, 13071309. 10.1093/bioinformatics/btn105

  • 36

    FuxreiterM.TompaP.SimonI. (2007). Local structural disorder imparts plasticity on linear motifs. Bioinformatics23, 950956. 10.1093/bioinformatics/btm035

  • 37

    GarauG.LemaireD.VernetT.DidebergO.Di GuilmiA. M. (2005). Crystal structure of phosphorylcholine esterase domain of the virulence factor choline-binding protein e from Streptococcus pneumoniae: new structural features among the metallo-beta-lactamase superfamily. J. Biol. Chem. 280, 2859128600. 10.1074/jbc.M502744200

  • 38

    GarmoryH. S.TitballR. W. (2004). ATP-binding cassette transporters are targets for the development of antibacterial vaccines and therapies. Infect. Immun. 72, 67576763. 10.1128/IAI.72.12.6757-6763.2004

  • 39

    GasparA. H.MarraffiniL. A.GlassE. M.DebordK. L.Ton-ThatH.SchneewindO. (2005). Bacillus anthracis sortase A (SrtA) anchors LPXTG motif-containing surface proteins to the cell wall envelope. J. Bacteriol. 187, 46464655. 10.1128/JB.187.13.4646-4655.2005

  • 40

    GiddaS. K.ShockeyJ. M.RothsteinS. J.DyerJ. M.MullenR. T. (2009). Arabidopsis thaliana GPAT8 and GPAT9 are localized to the ER and possess distinct ER retrieval signals: functional divergence of the dilysine ER retrieval motif in plant cells. Plant Physiol. Biochem. 47, 867879. 10.1016/j.plaphy.2009.05.008

  • 41

    HamJ. H.MajerczakD. R.Arroyo-RodriguezA. S.MackeyD. M.CoplinD. L. (2006). WtsE, an AvrE-family effector protein from Pantoea stewartii subsp. stewartii, causes disease-associated cell death in corn and requires a chaperone protein for stability. Mol. Plant Microbe Interact. 19, 10921102. 10.1094/MPMI-19-1092

  • 42

    HamJ. H.MajerczakD. R.NomuraK.MeceyC.UribeF.HeS. Y.et al. (2009). Multiple activities of the plant pathogen type III effector proteins WtsE and AvrE require WxxxE motifs. Mol. Plant Microbe Interact. 22, 703712. 10.1094/MPMI-22-6-0703

  • 43

    HamiauxC.Van EerdeA.ParsotC.BroosJ.DijkstraB. W. (2006). Structural mimicry for vinculin activation by IpaA, a virulence factor of Shigella flexneri. EMBO Rep. 7, 794799. 10.1038/sj.embor.7400753

  • 44

    HammerschmidtS.TilligM. P.WolffS.VaermanJ. P.ChhatwalG. S. (2000). Species-specific binding of human secretory component to SpsA protein of Streptococcus pneumoniae via a hexapeptide motif. Mol. Microbiol. 36, 726736. 10.1046/j.1365-2958.2000.01897.x

  • 45

    HarrisT. O.ShelverD. W.BohnsackJ. F.RubensC. E. (2003). A novel streptococcal surface protease promotes virulence, resistance to opsonophagocytosis, and cleavage of human fibrinogen. J. Clin. Invest. 111, 6170. 10.1172/JCI200316270

  • 46

    HayashiT.MorohashiH.HatakeyamaM. (2013). Bacterial EPIYA effectors–where do they come from? What are they? Where are they going?Cell. Microbiol. 15, 377385. 10.1111/cmi.12040

  • 47

    HicksS. W.CharronG.HangH. C.GalanJ. E. (2011). Subcellular targeting of Salmonella virulence proteins by host-mediated S-palmitoylation. Cell Host Microbe10, 920. 10.1016/j.chom.2011.06.003

  • 48

    HicksS. W.GalanJ. E. (2013). Exploitation of eukaryotic subcellular targeting mechanisms by bacterial effectors. Nat. Rev. Microbiol. 11, 316326. 10.1038/nrmicro3009

  • 49

    HigashiH.YokoyamaK.FujiiY.RenS.YuasaH.SaadatI.et al. (2005). EPIYA motif is a membrane-targeting signal of Helicobacter pylori virulence factor CagA in mammalian cells. J. Biol. Chem. 280, 2313023137. 10.1074/jbc.M503583200

  • 50

    ItzekA.GillenC. M.FuldeM.FriedrichsC.RodloffA. C.ChhatwalG. S.et al. (2010). Contribution of plasminogen activation towards the pathogenic potential of oral streptococci. PLoS ONE5:e13826. 10.1371/journal.pone.0013826

  • 51

    JacksonL. K.NawabiP.HenteaC.RoarkE. A.HaldarK. (2008). The Salmonella virulence protein SifA is a G protein antagonist. Proc. Natl. Acad. Sci. U.S.A. 105, 1414114146. 10.1073/pnas.0801872105

  • 52

    JacksonM. R.NilssonT.PetersonP. A. (1990). Identification of a consensus motif for retention of transmembrane proteins in the endoplasmic reticulum. EMBO J. 9, 31533162.

  • 53

    JunckerA. S.WillenbrockH.Von HeijneG.BrunakS.NielsenH.KroghA. (2003). Prediction of lipoprotein signal peptides in Gram-negative bacteria. Protein Sci. 12, 16521662. 10.1110/ps.0303703

  • 54

    KadaveruK.VyasJ.SchillerM. R. (2008). Viral infection and human disease–insights from minimotifs. Front. Biosci. 13, 64556471. 10.2741/3166

  • 55

    KedarisettiP.MiziantyM. J.KaasQ.CraikD. J.KurganL. (2014). Prediction and characterization of cyclic proteins from sequences in three domains of life. Biochim. Biophys. Acta. 1844, 181190. 10.1016/j.bbapap.2013.05.002

  • 56

    KleinC.Garcia-RizoC.BisleB.SchefferB.ZischkaH.PfeifferF.et al. (2005). The membrane proteome of Halobacterium salinarum. Proteomics5, 180197. 10.1002/pmic.200400943

  • 57

    KonkelM. E.LarsonC. L.FlanaganR. C. (2010). Campylobacter jejuni FlpA binds fibronectin and is required for maximal host cell adherence. J. Bacteriol. 192, 6876. 10.1128/JB.00969-09

  • 58

    Kovacs-SimonA.TitballR. W.MichellS. L. (2011). Lipoproteins of bacterial pathogens. Infect. Immun. 79, 548561. 10.1128/IAI.00682-10

  • 59

    LeeS. F.KellyM.McAlisterA.LuckS. N.GarciaE. L.HallR. A.et al. (2008). A C-terminal class I PDZ binding motif of EspI/NleA modulates the virulence of attaching and effacing Escherichia coli and Citrobacter rodentium. Cell. Microbiol. 10, 499513. 10.1111/j.1462-5822.2007.01065.x

  • 60

    LehaneA. M.KorresH.VermaN. K. (2005). Bacteriophage-encoded glucosyltransferase GtrII of Shigella flexneri: membrane topology and identification of critical residues. Biochem. J. 389, 137143. 10.1042/BJ20050102

  • 61

    LetyM. A.FrehelC.DubailI.BerettiJ. L.KayalS.BercheP.et al. (2001). Identification of a PEST-like motif in listeriolysin O required for phagosomal escape and for virulence in Listeria monocytogenes. Mol. Microbiol. 39, 11241139. 10.1111/j.1365-2958.2001.02281.x

  • 62

    LivermanA. D.ChengH. C.TroskyJ. E.LeungD. W.YarbroughM. L.BurdetteD. L.et al. (2007). Arp2/3-independent assembly of actin by Vibrio type III effector VopL. Proc. Natl. Acad. Sci. U.S.A. 104, 1711717122. 10.1073/pnas.0703196104

  • 63

    MartinezE.SchroederG. N.BergerC. N.LeeS. F.RobinsonK. S.BadeaL.et al. (2010). Binding to Na(+) /H(+) exchanger regulatory factor 2 (NHERF2) affects trafficking and function of the enteropathogenic Escherichia coli type III secretion system effectors Map, EspI and NleH. Cell. Microbiol. 12, 17181731. 10.1111/j.1462-5822.2010.01503.x

  • 64

    MooneyC.HaslamN. J.HoltonT. A.PollastriG.ShieldsD. C. (2013). PeptideLocator: prediction of bioactive peptides in protein sequences. Bioinformatics29, 11201126. 10.1093/bioinformatics/btt103

  • 65

    MooneyC.PollastriG.ShieldsD. C.HaslamN. J. (2012). Prediction of short linear protein binding regions. J. Mol. Biol. 415, 193204. 10.1016/j.jmb.2011.10.025

  • 66

    NavarreW. W.SchneewindO. (1994). Proteolytic cleavage and cell wall anchoring at the LPXTG motif of surface proteins in gram-positive bacteria. Mol. Microbiol. 14, 115121. 10.1111/j.1365-2958.1994.tb01271.x

  • 67

    NeduvaV.RussellR. B. (2005). Linear motifs: evolutionary interaction switches. FEBS Lett. 579, 33423345. 10.1016/j.febslet.2005.04.005

  • 68

    NilssonT.JacksonM.PetersonP. A. (1989). Short cytoplasmic sequences serve as retention signals for transmembrane proteins in the endoplasmic reticulum. Cell58, 707718. 10.1016/0092-8674(89)90105-0

  • 69

    NogueiraS. V.SmithA. A.QinJ. H.PalU. (2012). A surface enolase participates in Borrelia burgdorferi-plasminogen interaction and contributes to pathogen survival within feeding ticks. Infect. Immun. 80, 8290. 10.1128/IAI.05671-11

  • 70

    PaschosA.Den HartighA.SmithM. A.AtluriV. L.SivanesanD.TsolisR. M.et al. (2011). An in vivo high-throughput screening approach targeting the type IV secretion system component VirB8 identified inhibitors of Brucella abortus 2308 proliferation. Infect. Immun. 79, 10331043. 10.1128/IAI.00993-10

  • 71

    PateyG.QiZ.BourgG.BaronC.O'CallaghanD. (2006). Swapping of periplasmic domains between Brucella suis VirB8 and a pSB102 VirB8 homologue allows heterologous complementation. Infect. Immun. 74, 49454949. 10.1128/IAI.00584-06

  • 72

    PrietoC.Garcia-EstradaC.LorenzanaD.MartinJ. F. (2012). NRPSsp: non-ribosomal peptide synthase substrate predictor. Bioinformatics28, 426427. 10.1093/bioinformatics/btr659

  • 73

    RabinS. D.HauserA. R. (2005). Functional regions of the Pseudomonas aeruginosa cytotoxin ExoU. Infect. Immun. 73, 573582. 10.1128/IAI.73.1.573-582.2005

  • 74

    RamiscalR. R.TangS. S.KorresH.VermaN. K. (2010). Structural and functional divergence of the newly identified GtrIc from its Gtr family of conserved Shigella flexneri serotype-converting glucosyltransferases. Mol. Membr. Biol. 27, 114122. 10.3109/09687680903552250

  • 75

    Robert-SeilaniantzA.ShanL.ZhouJ. M.TangX. (2006). The Pseudomonas syringae pv. tomato DC3000 type III effector HopF2 has a putative myristoylation site required for its avirulence and virulence functions. Mol. Plant Microbe Interact. 19, 130138. 10.1094/MPMI-19-0130

  • 76

    RohdeJ. R.BreitkreutzA.ChenalA.SansonettiP. J.ParsotC. (2007). Type III secretion effectors of the IpaH family are E3 ubiquitin ligases. Cell Host Microbe1, 7783. 10.1016/j.chom.2007.02.002

  • 77

    RohdeM.PulsJ.BuhrdorfR.FischerW.HaasR. (2003). A novel sheathed surface organelle of the Helicobacter pylori cag type IV secretion system. Mol. Microbiol. 49, 219234. 10.1046/j.1365-2958.2003.03549.x

  • 78

    SabetC.LecuitM.CabanesD.CossartP.BierneH. (2005). LPXTG protein InlJ, a newly identified internalin involved in Listeria monocytogenes virulence. Infect. Immun. 73, 69126922. 10.1128/IAI.73.10.6912-6922.2005

  • 79

    SagulenkoV.SagulenkoE.JakubowskiS.SpudichE.ChristieP. J. (2001). VirB7 lipoprotein is exocellular and associates with the Agrobacterium tumefaciens T pilus. J. Bacteriol. 183, 36423651. 10.1128/JB.183.12.3642-3651.2001

  • 80

    SatoK.MoriH.YoshidaM.MizushimaS. (1996). Characterization of a potential catalytic residue, Asp-133, in the high affinity ATP-binding site of Escherichia coli SecA, translocation ATPase. J. Biol. Chem. 271, 1743917444. 10.1074/jbc.271.29.17439

  • 81

    SchlumbergerM. C.FriebelA.BuchwaldG.ScheffzekK.WittinghoferA.HardtW. D. (2003). Amino acids of the bacterial toxin SopE involved in G nucleotide exchange on Cdc42. J. Biol. Chem. 278, 2714927159. 10.1074/jbc.M302475200

  • 82

    ShanL.TharaV. K.MartinG. B.ZhouJ. M.TangX. (2000). The pseudomonas AvrPto protein is differentially recognized by tomato and tobacco and is localized to the plant plasma membrane. Plant Cell12, 23232338. 10.1105/tpc.12.12.2323

  • 83

    SivanesanD.BaronC. (2011). The dimer interface of Agrobacterium tumefaciens VirB8 is important for type IV secretion system function, stability, and association of VirB2 with the core complex. J. Bacteriol. 193, 20972106. 10.1128/JB.00907-10

  • 84

    SrikanthC. V.WallD. M.Maldonado-ContrerasA.ShiH. N.ZhouD.DemmaZ.et al. (2010). Salmonella pathogenesis and processing of secreted effectors by caspase-3. Science330, 390393. 10.1126/science.1194598

  • 85

    StirlingF. R.CuzickA.KellyS. M.OxleyD.EvansT. J. (2006). Eukaryotic localization, activation and ubiquitinylation of a bacterial type III secreted toxin. Cell. Microbiol. 8, 12941309. 10.1111/j.1462-5822.2006.00710.x

  • 86

    SunJ.MaressoA. W.KimJ. J.BarbieriJ. T. (2004). How bacterial ADP-ribosylating toxins recognize substrates. Nat. Struct. Mol. Biol. 11, 868876. 10.1038/nsmb818

  • 87

    SundaramoorthyR.FyfeP. K.HunterW. N. (2008). Structure of Staphylococcus aureus EsxA suggests a contribution to virulence by action as a transport chaperone and/or adaptor protein. J. Mol. Biol. 383, 603614. 10.1016/j.jmb.2008.08.047

  • 88

    SuzukiM.MimuroH.KigaK.FukumatsuM.IshijimaN.MorikawaH.et al. (2009). Helicobacter pylori CagA phosphorylation-independent function in epithelial proliferation and inflammation. Cell Host Microbe5, 2334. 10.1016/j.chom.2008.11.010

  • 89

    SzurekB.RossierO.HauseG.BonasU. (2002). Type III-dependent translocation of the Xanthomonas AvrBs3 protein into the plant cell. Mol. Microbiol. 46, 1323. 10.1046/j.1365-2958.2002.03139.x

  • 90

    TahirY. E.KuuselaP.SkurnikM. (2000). Functional mapping of the Yersinia enterocolitica adhesin YadA. Identification Of eight NSVAIG - S motifs in the amino-terminal half of the protein involved in collagen binding. Mol. Microbiol. 37, 192206. 10.1046/j.1365-2958.2000.01992.x

  • 91

    TegtmeyerN.HartigR.DelahayR. M.RohdeM.BrandtS.ConradiJ.et al. (2010). A small fibronectin-mimicking protein from bacteria induces cell spreading and focal adhesion formation. J. Biol. Chem. 285, 2351523526. 10.1074/jbc.M109.096214

  • 92

    TegtmeyerN.WesslerS.BackertS. (2011). Role of the cag-pathogenicity island encoded type IV secretion system in Helicobacter pylori pathogenesis. FEBS J. 278, 11901202. 10.1111/j.1742-4658.2011.08035.x

  • 93

    ThomasS.KarnikS.BaraiR. S.JayaramanV. K.Idicula-ThomasS. (2010). CAMP: a useful resource for research on antimicrobial peptides. Nucleic Acids Res. 38, D774D780. 10.1093/nar/gkp1021

  • 94

    TzfiraT.VaidyaM.CitovskyV. (2004). Involvement of targeted proteolysis in plant genetic transformation by Agrobacterium. Nature431, 8792. 10.1038/nature02857

  • 95

    WurteleM.WolfE.PedersonK. J.BuchwaldG.AhmadianM. R.BarbieriJ. T.et al. (2001). How the Pseudomonas aeruginosa ExoS toxin downregulates Rac. Nat. Struct. Biol. 8, 2326. 10.1038/83007

  • 96

    ZhangL.ZhangC.OjciusD. M.SunD.ZhaoJ.LinX.et al. (2012). The mammalian cell entry (Mce) protein of pathogenic Leptospira species is responsible for RGD motif-dependent infection of cells and animals. Mol. Microbiol. 83, 10061023. 10.1111/j.1365-2958.2012.07985.x

  • 97

    ZhangY.BarbieriJ. T. (2005). A leucine-rich motif targets Pseudomonas aeruginosa ExoS within mammalian cells. Infect. Immun. 73, 79387945. 10.1128/IAI.73.12.7938-7945.2005

  • 98

    ZhangY.HigashideW. M.MccormickB. A.ChenJ.ZhouD. (2006). The inflammation-associated Salmonella SopA is a HECT-like E3 ubiquitin ligase. Mol. Microbiol. 62, 786793. 10.1111/j.1365-2958.2006.05407.x

  • 99

    ZhouC. E.SmithJ.LamM.ZemlaA.DyerM. D.SlezakT. (2007). MvirDB–a microbial database of protein toxins, virulence factors and antibiotic resistance genes for bio-defence applications. Nucleic Acids Res. 35, D391D394. 10.1093/nar/gkl791

  • 100

    ZhuY.LiH.LongC.HuL.XuH.LiuL.et al. (2007). Structural insights into the enzymatic mechanism of the pathogenic MAPK phosphothreonine lyase. Mol. Cell28, 899913. 10.1016/j.molcel.2007.11.011

  • 101

    ZimmermannL.PeterhansE.FreyJ. (2010). RGD motif of lipoprotein T, involved in adhesion of Mycoplasma conjunctivae to lamb synovial tissue cells. J. Bacteriol. 192, 37733779. 10.1128/JB.00253-10

Summary

Keywords

short linear motifs (SLiMs), virulence factor, motif mimicry, antibacterial, bioinformatics, pathogen

Citation

Ruhanen H, Hurley D, Ghosh A, O'Brien KT, Johnston CR and Shields DC (2014) Potential of known and short prokaryotic protein motifs as a basis for novel peptide-based antibacterial therapeutics: a computational survey. Front. Microbiol. 5:4. doi: 10.3389/fmicb.2014.00004

Received

16 October 2013

Accepted

05 January 2014

Published

21 January 2014

Volume

5 - 2014

Edited by

Nádia S. Parachin, Universidade de Brasília, Brazil

Reviewed by

M. Pilar Francino, Center for Public Health Research, Spain; Luis C. N. Da Silva, Universidade Federal de Pernambuco, Brazil; Viji Sarojini, University of Auckland, New Zealand; Mrinal Bhave, Swinburne University of Technology, Australia

Copyright

*Correspondence: Denis C. Shields, Complex and Adaptive Systems Laboratory, University College Dublin, Belfield Office Park-8, Dublin 4, Ireland e-mail:

This article was submitted to Antimicrobials, Resistance and Chemotherapy, a section of the journal Frontiers in Microbiology.

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics