ORIGINAL RESEARCH article

Front. Microbiol., 13 August 2014

Sec. Microbial Physiology and Metabolism

Volume 5 - 2014 | https://doi.org/10.3389/fmicb.2014.00421

Ends of the line for tmRNA-SmpB

  • Sandia National Laboratories, Department of Systems Biology, Livermore, CA, USA

Abstract

Genes for the RNA tmRNA and protein SmpB, partners in the trans-translation process that rescues stalled ribosomes, have previously been found in all bacteria and some organelles. During a major update of The tmRNA Website (relocated to http://bioinformatics.sandia.gov/tmrna), including addition of an SmpB sequence database, we found some bacteria that lack functionally significant regions of SmpB. Three groups with reduced genomes have lost the central loop of SmpB, which is thought to improve alanylation and EF-Tu activation: Carsonella, Hodgkinia, and the hemoplasmas (hemotropic Mycoplasma). Carsonella has also lost the SmpB C-terminal tail, thought to stimulate the decoding center of the ribosome. We validate recent identification of tmRNA homologs in oomycete mitochondria by finding partner genes from oomycete nuclei that target SmpB to the mitochondrion. We have moreover identified through exhaustive search a small number of complete, but often highly derived, bacterial genomes that appear to lack a functional copy of either the tmRNA or SmpB gene (but not both). One Carsonella isolate exhibits complete degradation of the tmRNA gene sequence yet its smpB shows no evidence for relaxed selective constraint, relative to other genes in the genome. After loss of the SmpB central loop in the hemoplasmas, one subclade apparently lost tmRNA. Carsonella also exhibits gene overlap such that tmRNA maturation should produce a non-stop smpB mRNA. At least some of the tmRNA/SmpB-deficient strains appear to further lack the ArfA and ArfB backup systems for ribosome rescue. The most frequent neighbors of smpB are the tmRNA gene, a ratA/rnfH unit, and the gene for RNaseR, a known physical and functional partner of tmRNA-SmpB.

Introduction

The trans-translation process resolves issues arising when the translating bacterial ribosome reaches the end of an mRNA with no stop codon, chiefly releasing the stalled ribosome but also eliminating both the non-stop mRNA and the encoded incomplete protein. The main agents of trans-translation are the RNA tmRNA (whose gene is named ssrA) and its protein ligand SmpB. tmRNA has a tRNA-like domain (TLD) that lacks an anticodon stem-loop; a bound SmpB occupies this corresponding space, and the complex fills the A site in the stalled ribosome, mimicking tRNA (Bessho et al., 2007; Neubauer et al., 2012). After peptidyl transfer to the alanyl moiety of charged tmRNA, the ribosome switches from the non-stop mRNA to the resume codon on tmRNA and translation continues, adding a short hydrophobic tag peptide to the nascent protein that is the signal for proteolysis (Karzai and Sauer, 2001). Canonical release at the tag reading frame stop codon frees the ribosome. Two back-up systems for trans-translation, ArfA/RF-2 and ArfB, have been described that can allow ribosome release from non-stop mRNA even when ssrA or smpB is inactive; both require the peptidyl-tRNA hydrolase activity of a release factor family member, but not the stop codon recognition usually associated with release factors (Chadani et al., 2010, 2011, 2012; Handa et al., 2011).

The tmRNA-SmpB system is found in bacteria and some organelles; it has not yet been identified in archaea, nor in eukaryotes as a system targeted to the cytoplasm. Aside from one report of a bacterium with a frameshift mutation in smpB, it has generally been considered that all bacteria have the system. Here we present 22 examples of complete bacterial genomes where either ssrA cannot be found, or smpB has an apparently inactivating mutation. A particularly strong case for loss of the system in a bacterial genome comes from a strain of the insect endosymbiont Carsonella ruddii, which, as far as current knowledge allows us to judge, further appears to lack trans-translation back-up systems. In the course of the exposition we survey bioinformatics tools for tmRNA and SmpB gene searches, and describe a major update of The tmRNA Website (http://bioinformatics.sandia.gov/tmrna).

Materials and methods

Search databases

Genomic data were downloaded from four directories (archaea, bacteria, plasmid, and viruses) of RefSeq in November 2012. This dataset consisted of 2031 bacterial and 137 archaeal complete genomes, and 1711 additional bacterial plasmids and 543 bacterial viruses (and 44 additional archaeal plasmids and 38 archaeal viruses) that were not part of chromosomal genome projects. BLAST databases were downloaded on 5 August 2013.

tmRNA sequence search

Three primary tmRNA sequence identification tools have been described: the sister programs BRUCE (Laslett et al., 2002) and ARAGORN (Laslett and Canback, 2004) and the Rfam/Infernal system (Griffiths-Jones et al., 2005) that parallels Pfam/HMMER. Rfam has four covariance models for different tmRNA forms. We applied these tools in a combined search for tmRNA and tRNA genes, because the most common false positive tmRNA hits are to legitimate tRNA genes. Our first-pass wrapper tFind.pl (available at bioinformatics.sandia.gov/software) combines tmRNA and tRNA search by running the programs tRNAscan-SE (Lowe and Eddy, 1997), ARAGORN (which also searches for tRNA genes) and BRUCE. It then resolves overlapping calls, divides the tRNAs into the two categories “valid” (those with tRNAscan-SE Cove score above 50 not labeled Pseudo or Undetermined, and also called by ARAGORN) and “questionable” (the remaining tRNA calls), and aims for accurate terminus determination (except with two-piece tmRNAs). tmRNA calls in archaea or in bacteria with more than one call were scrutinized manually, rejecting some due to overlaps with better-called tRNAs, poor conservation of alanyl-tRNA synthetase discrimination features or other problems with the TLD. Other rejected bacterial tmRNA duplicate calls were tmRNA pseudogenes (missing one gene end) or tmRNA gene fragments formed by genomic island integration. Rfam/Infernal was not applied in this first pass because of a high false-positive rate (Table 1), but was instead applied when detection failed in a bacterial genome, along with a fourth tmRNA detection system, rFind.pl. This latter script uses our tmRNA full- and terminus-sequence databases with BLASTN to find additional tmRNAs and more accurately determine the termini of two-piece tmRNAs. Attention to the RNA gene termini is important for one method of identifying genomic islands, which favor ssrA and tRNA genes as integration sites (Mantri and Williams, 2004).
When the above approaches failed to locate ssrA in a bacterial genome, we searched manually in the vicinity of smpB.
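The "valid" versus "questionable" split of tRNA calls described above can be sketched as follows. This is a minimal illustration of the stated rule only, not the code of tFind.pl; the `TrnaCall` record and its field names are hypothetical, and we assume "above 50" means strictly greater than 50.

```python
from dataclasses import dataclass

COVE_SCORE_CUTOFF = 50.0  # cutoff stated in the text


@dataclass(frozen=True)
class TrnaCall:
    """Hypothetical record for one tRNAscan-SE call (field names are illustrative)."""
    cove_score: float
    label: str               # isotype, or "Pseudo" / "Undetermined"
    called_by_aragorn: bool  # True if ARAGORN reported the same gene


def classify_trna(call: TrnaCall) -> str:
    """Return "valid" or "questionable" for a tRNA call."""
    is_valid = (
        call.cove_score > COVE_SCORE_CUTOFF  # assumption: strict inequality
        and call.label not in {"Pseudo", "Undetermined"}
        and call.called_by_aragorn
    )
    return "valid" if is_valid else "questionable"
```

A call must pass all three conditions to count as valid; failing any one of them places it in the questionable set.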

Table 1

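Domain | Raw | tmRNA | Valid tRNA | Quest. tRNA | Pfam | Unhit
BRUCE/ARAGORN
Bacteria | 2033 | 1983 | 0 | 15 | 14 | 21
Archaea | 10 | 0 | 0 | 7 | 3 | 0
Rfam ABOVE-THRESHOLD
Bacteria | 13094 | 2037 | 10283 | 235 | 52 | 487
Archaea | 1248 | 0 | 849 | 365 | 3 | 31
Rfam BELOW-THRESHOLD
Bacteria | 21337 | 5 | 15170 | 390 | 1138 | 4634
Archaea | 808 | 0 | 402 | 159 | 45 | 202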

Evaluation of primary tmRNA sequence-finding programs.

See Materials and Methods.

We evaluated raw output of primary tmRNA-finding software by whether hits overlapped our final sets of tmRNA and other gene types (Table 1). The BRUCE and ARAGORN results were assessed together merging overlapping calls using BEDTools (Quinlan and Hall, 2010), likewise for the results of the four covariance models of Rfam; above-threshold Rfam hits were evaluated separately from intervals unique to the below-threshold hits. These three raw hits datasets were tested for overlap with various gene sets sequentially: our final tmRNAs, the valid tRNAs, the questionable tRNAs, and a set of conserved protein-coding regions. The latter came from six-frame translation of DNAs followed by Pfam-A/HMMER (with cut-TC thresholds) treatment, reporting only the genome segments coding for Pfam-positive portions of proteins. True positive rates for tmRNA discovery were 97.5% for BRUCE/ARAGORN and 15.6% for above-threshold Rfam/Infernal.
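The sequential overlap test described above can be sketched in a few lines. This is an illustrative reimplementation under stated assumptions, not the BEDTools-based procedure itself: intervals are half-open `(replicon, start, end)` tuples, any shared base counts as an overlap, and the function names are hypothetical.

```python
from typing import Dict, Sequence, Tuple

Interval = Tuple[str, int, int]  # (replicon, start, end), half-open coordinates


def overlaps(a: Interval, b: Interval) -> bool:
    """True when two intervals share a replicon and at least one base."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]


def assign_hits(
    raw_hits: Sequence[Interval],
    gene_sets: Sequence[Tuple[str, Sequence[Interval]]],
) -> Dict[str, int]:
    """Count each raw hit under the first gene set it overlaps, else 'Unhit'."""
    counts = {name: 0 for name, _ in gene_sets}
    counts["Unhit"] = 0
    for hit in raw_hits:
        for name, intervals in gene_sets:
            if any(overlaps(hit, iv) for iv in intervals):
                counts[name] += 1
                break
        else:  # no gene set overlapped this hit
            counts["Unhit"] += 1
    return counts


def true_positive_rate(tmrna_hits: int, raw_hits: int) -> float:
    """Percentage of raw hits that overlap a final tmRNA."""
    return 100.0 * tmrna_hits / raw_hits
```

Because the gene sets are tested in order, a hit that overlaps both a tmRNA and a tRNA is counted only as tmRNA. With the bacterial counts of Table 1, `true_positive_rate(1983, 2033)` rounds to 97.5 and `true_positive_rate(2037, 13094)` rounds to 15.6, matching the rates quoted above.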

smpB search

The SmpB HMM of Pfam was used with HMMER and its default threshold, and five SmpB profiles (TIGR00086, cd09294, PRK0544, COG0691 and pfam01668) from Conserved Domain Database were used with RPS-TBLASTN and lower thresholds than the default that were nonetheless conservative, set at 1.4-fold above the highest score for a non-SmpB. Sub-threshold hits were examined in cases where a bacterial genome yielded no above-threshold hit. When this approach failed to locate smpB in a bacterial genome, we applied TBLASTN searches, and manual search in the vicinity of ssrA. In the final case of failure (Hodgkinia) we examined newer genomes of the same genus and were able to comparatively identify the gene.
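The threshold rule for the Conserved Domain Database profiles can be illustrated with a short sketch. We assume here that "1.4-fold above the highest score for a non-SmpB" means 1.4 times that score; the function names and the score values used in any call are hypothetical.

```python
from typing import Iterable, List, Tuple


def conservative_threshold(non_smpb_scores: Iterable[float], fold: float = 1.4) -> float:
    """Threshold placed `fold` times the best score seen for a non-SmpB protein."""
    return fold * max(non_smpb_scores)


def split_hits(
    hits: Iterable[Tuple[str, float]], threshold: float
) -> Tuple[List[str], List[str]]:
    """Separate (name, score) hits into above-threshold and sub-threshold names."""
    above, below = [], []
    for name, score in hits:
        (above if score >= threshold else below).append(name)
    return above, below
```

Keeping the sub-threshold list, rather than discarding it, mirrors the step in which weaker hits are examined only for genomes that yield no above-threshold hit.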

tmRNA/SmpB sequence identifiers

For some sequences mentioned here we give the “tmID,” the identifier at The tmRNA Website (http://bioinformatics.sandia.gov/tmrna). Also, the webpage http://bioinformatics.sandia.gov/tmrna/ends.html is devoted to links to all sequences mentioned in this article, comparable to Tables 2, 3.

Table 2

Strain | tmID
BACTERIAL STRAINS MISSING ssrA
Carsonella ruddii PC* | 19165
secondary endosymbiont of Ctenarytaina eucalypti | 19166
Mycoplasma haemolamae str. Purdue* | 19167
Mycoplasma suis str. Illinois* | 19168
Mycoplasma wenyonii str. Massachusetts* | 19169
Mycoplasma suis KI3806* | 19170
PHAGES WITH ssrA
Bacillus phage G | 14561
Mycobacterium phage DS6A (TLD only) | 11587
Mycobacterium phage Bxz1 | 10675
Mycobacterium phage Cali | 13258
Mycobacterium phage Catera | 15205
Mycobacterium phage ET08 | 14080
Mycobacterium phage Rizal | 14900
Mycobacterium phage ScottMcG | 10349
Mycobacterium phage Spud | 11713
Mycobacterium phage Wildcat | 11059

Genomes with unusual ssrA content.

Includes links to tmRNA webpages for bacterial strains missing tmRNA and phages with tmRNA. tmID is the tmRNA Website (http://bioinformatics.sandia.gov/tmrna) identifier.

* Highly reduced genome (<10^6 bp).

Table 3

smpB | Strain | tmID
BACTERIAL STRAINS WITH PSEUDOGENIZED, FRAMESHIFTED OR TRUNCATED smpB
Pseudogene | Hodgkinia cicadicola TETUND1* | 19190
Truncation | Tremblaya princeps PCIT* | 12215
Truncation | Tremblaya princeps PCVAL* | 12077
Frameshift | Corynebacterium pseudotuberculosis 31 | 11952
Frameshift | Mycobacterium intracellulare MOTT-2 | 19171
Frameshift | Clostridium difficile CF5 | 10063
Frameshift | Clostridium difficile M120 | 15031
Frameshift | Buchnera aphidicola BCc* | 15428
Frameshift | Buchnera aphidicola str. TLW03* | 12194
Frameshift | Pectobacterium carotovorum PCC21 | 16329
Frameshift | Aggregatibacter actinomycetemcomitans ANH9381 | 19118
Frameshift | Pseudomonas putida DOT-T1E | 10352
Frameshift | Simiduia agarivorans SA1 | 19172
Frameshift | Mycoplasma pneumoniae FH | 16792
Frameshift | Thermotoga maritima MSB8 | 12964
Frameshift | Petrotoga mobilis SJ95 | 13623
smpBs IN BACTERIAL PLASMIDS
 | Flavobacterium sp. KI723T1 plasmid pOAD2 (2 copies) | 19173
smpBs IN EUKARYOTIC GENOME PROJECTS
Contaminant | Cucumis sativus | 19176
Contaminant | Ceratitis capitata | 19177
Endosymbiont | Trichoplax adhaerens | 19178
Chromatophore | Paulinella chromatophora | 19174
Oomycete mito.-targeted | Albugo laibachii Nc14gi | 19187
Oomycete mito.-targeted | Phytophthora infestans T304 | 19188
Oomycete mito.-targeted | Phytophthora sojae | 19189
Algal plastid-targeted | Nannochloropsis gaditana CCMP526 | 19175
Algal plastid-targeted | Guillardia theta CCMP2712 | 19179
Algal plastid-targeted | Phaeodactylum tricornutum CCAP 1055/1 | 19180
Algal plastid-targeted | Thalassiosira pseudonana CCMP1335 | 19181
Algal plastid-targeted | Aureococcus anophagefferens | 19182
Algal plastid-targeted | Callosobruchus chinensis | 19183
Algal plastid-targeted | Cyanidioschyzon merolae | 19184
Algal plastid-targeted | Ectocarpus siliculosus | 19185
Algal plastid-targeted | Thalassiosira oceanica | 19186

Genomes with unusual smpB content.

Includes links to webpages for bacterial strains with defective smpBs, bacterial plasmids with smpBs, and smpBs in eukaryotic genome projects (some of which are organelle-targeted). The Hodgkinia genome pseudogene has accumulated two premature stop codons in smpB. The two “truncation” cases have lost material reaching into the β-barrel at each end. We also note that SmpB lacks the central loop in the hemoplasmas, Carsonella and Hodgkinia, and lacks the C-terminal α helix in Carsonella, but these SmpBs retain all β strand segments and may therefore retain weak function. tmID is the tmRNA Website (http://bioinformatics.sandia.gov/tmrna) identifier.

* Highly reduced genome (<10^6 bp).

The description of the Buchnera aphidicola BCc genome (Pérez-Brocal et al., 2006) noted and discussed its smpB frameshift, suggesting confidence in the gene sequence; any of the other frameshifts could instead be sequencing errors.

Results

Exhaustive search for ssrA

We applied our tFind.pl search method for ssrA to 2031 bacterial and 137 archaeal complete genomes, and additional RefSeq bacterial and archaeal plasmids and viruses not part of chromosomal genome projects. All ten raw tmRNA hits in Archaea were rejected by criteria noted above, while most bacterial genomes had a single ssrA located on the largest chromosome. Some genomes had a second or third ssrA allele, sometimes on a plasmid. Among plasmid and viral non-chromosomal projects, ssrA was identified only in eight mycobacteriophages (Bxz1, Cali, Catera, ET08, Rizal, ScottMcG, Spud and Wildcat); however, we can name additional phage tmRNA sequences in genomes that were not in our RefSeq dataset: Bacillus phage G (tmID: 14561) and mycobacteriophage DS6A (tmID: 11587). The DS6A sequence consists of little more than the tmRNA TLD; a similar molecule, whether or not chargeable with alanine, has been shown to strongly inhibit tmRNA, perhaps acting by titrating SmpB (Mao et al., 2009). For six genomes no tmRNA sequence could be identified: Carsonella ruddii PC, the four hemoplasmas of the Mycoplasma suis clade, and the secondary endosymbiont of Ctenarytaina eucalypti (Table 2). For C. ruddii PC, we further examine ssrA pseudogenization below.

Exhaustive search for smpB

Upon characterization of SmpB as a 7-stranded β barrel, an oligonucleotide-binding (OB) fold was recognized for the region from β3-β7, hinting at possible ancient evolutionary relationships (Dong et al., 2002). However, based on comparisons of backbone coordinates, no other structures at PDB were found to be structurally similar (Dong et al., 2002). Likewise sequence based profiles, specifically the SmpB HMM from Pfam (a standalone family not part of a clan) and a set of 5 SmpB profiles available at the Conserved Domain Database (NCBI) show no interference with other family profiles; the SmpB family is bioinformatically well-behaved. It is a single-domain protein, except that four multi-domain architectures for five (of 4542) SmpBs are reported at Pfam. However, two of these can be explained as an artifactual double-SmpB call due to a 14-aa insert and an artifactual fusion arising from splicing a bacterial gene present in a eukaryotic genome project, while the other three may be explained by sequencing errors not found in related strains, that shifted the smpB frame to that of its upstream neighbor or fused it to the downstream CDS by converting the smpB stop codon to a sense codon.

The above genomes were searched using the SmpB profiles, and for the small number (n = 14) of bacterial genomes for which the profiles failed even below threshold, BLASTX was applied with our SmpB database; for Hodgkinia, comparative analysis with two newer genomes (below) was required to identify smpB (also identifying two new tmRNA sequences). All instances of smpB were on bacterial chromosomes, except for two copies found in Flavobacterium sp. KI723T1 plasmid pOAD2. Some genomes are deficient for smpB (Table 3). Tremblaya has truncations at both ends of smpB, so severe that they may inactivate the protein. Study of newer Hodgkinia genomes as described below identified an isolate that has accumulated two TAA stop codons in smpB. In 13 other strains single frameshifts would inactivate the genes, unless these are sequencing errors; in one case, however, the authors discuss the pseudogene, suggesting confidence in its sequencing (Pérez-Brocal et al., 2006).

Some SmpBs show loss of important features, yet may retain some function, given that the β-barrel framework appears intact. The central loop region, which contacts the tmRNA tRNA-like domain and is thought to play roles in alanylation (Dong et al., 2002) and in activating EF-Tu (Miller and Buskirk, 2014), is missing in Carsonella and the hemoplasmas (hemotropic Mycoplasma). The C-terminal tail, of demonstrated importance for SmpB function (Mantri and Williams, 2004; Jacob et al., 2005; Garza-Sánchez et al., 2011), is lost or truncated in Carsonella. In the model Thermus SmpB, this tail is unstructured in solution, but helical when in place in the ribosomal A site with alanine-charged tmRNA (Neubauer et al., 2012). In this location it contacts the 16S rRNA decoding center and continues to follow the path normally occupied by downstream mRNA, yet must undergo major conformational change to make way for the resume codon in later trans-translation steps. Many SmpBs extend variably beyond the helical tail segment of Thermus, raising the question of accommodating this extension in the ribosome. Tropheryma (tmID: 14758) has the longest C-terminal extension, 44 extra residues; when we constrained Tropheryma SmpB to the corresponding Thermus portion (Kelley and Sternberg, 2009), its extension showed continued helical structure with some breaks.

We found 16 smpB instances in eukaryotic genome projects. Four of these can be described as bacterial: two appear to be from enterobacterial microbiome contaminants of the medfly and cucumber genomes, another is from the endosymbiont associated with the placozoan Trichoplax genome (Driscoll et al., 2013), and the fourth is from the quasi-organellar chromatophore of Paulinella that is a recently-captured cyanobacterium. The remaining eukaryotic SmpBs appear to be nuclear-encoded and organelle-targeted. Three are from oomycete genomes and score for the mitochondrial signal peptide, supporting the recent discovery of tmRNA genes in oomycete mitochondria (Hafez et al., 2013). Nine are from algal genomes whose plastids are known to encode tmRNA; for some of these the N-terminal plastid transit peptide sequences have been noted (Jacob et al., 2005), while in others transit peptide identification may require further search for 5′ exons.

smpB gene neighborhood

We examined the neighborhood of smpB, and found 11 frequent neighbor gene families (Figure 1A). ssrA is the most frequent neighbor of smpB, yet accounts for fewer than half the cases. The clustering of these neighbors was also examined (Figure 1B). The association with the ubiquitin homolog RnfH and RatA toxin unit genes has been previously noted (Iyer et al., 2006). Several of these common neighbors also interact with the ribosome (RF-2, SecG, and RatA). Furthermore, RNase R is known to be a physical and functional partner with tmRNA-SmpB (Karzai et al., 2000; Liang and Deutscher, 2010; Venkataraman et al., 2014). Transcript analysis has confirmed operon structure for some of these clusters (Mantri and Williams, 2004; Garza-Sánchez et al., 2011).

Figure 1

The tmRNA website

The tmRNA Website (De Novoa and Williams, 2004) (http://bioinformatics.sandia.gov/tmrna) provides several research tools. Foremost is the sequence database. The previous instance of the database was updated with the above search results, and with the recently-described oomycete sequences, yielding 1631 unique sequences (1384 encoding one-piece tmRNA and 247 two-piece tmRNA); most are bacterial except for 41 mitochondrial and 22 plastid unique tmRNA sequences. These tmRNAs encode 710 unique proteolysis tag sequences. Each sequence was then used as BLAST query against NCBI est, gss, htgs, nt, other_genomic, patnt, refseq_genomic, tsa_nt and wgs databases, yielding 9167 instances of perfect though occasionally incomplete matches, counting each RefSeq/GenBank cross-reference pair as a single instance. The tmRNA Website provides all these sequences for download or for query by BLAST. These were also provided to RNAcentral (Bateman et al., 2011) and as third-party annotation to the International Nucleotide Sequence Database Archives (GenBank/ENA/DDBJ). Related resources that should be consulted are tmRDB (Andersen et al., 2006), Rfam (Burge et al., 2013), and RNAcentral (Bateman et al., 2011).

The tmRNA Website includes a new SmpB database with 2258 distinct amino acid sequences. These are available for BLAST search and download, as an alignment, as raw sequence and as a database. SmpB sequence is presented together with tmRNA sequences found in the same genome.

Anomalies in Carsonella

Carsonella ruddii is an insect endosymbiont, with extremely small (157–174 kbp) and AT-rich (14–18% GC) genomes, yet virtually no rearrangement of gene order (Sloan and Moran, 2012). The loss of the central loop and C-terminal tail of C. ruddii SmpB was noted above. When only one Carsonella tmRNA sequence was available, it was difficult to identify its tag reading frame. With several new sequences from additional strains, the tag reading frame has now been identified, standing out as the most conserved reading frame among the strains (Figure 2). C. ruddii is the only species encoding a tag ending in a charged residue (lysine), which hindered previous tag identification; however, some strains do have the usual hydrophobic terminal tag residue.

Figure 2

It was previously noted that smpB overlaps ssrA in Carsonella (Mao et al., 2009). This sets up an interesting feedback situation where the smpB mRNA would be cleaved by tmRNA maturation, and thereby become a non-stop substrate for the action of its own gene product. However, this situation is not widespread; we found it nowhere else but in Carsonella, and in only half of the Carsonella strains.

All tmRNAs in our database and indeed all bacterial tRNA-Ala at the Genomic tRNA Database (Chan and Lowe, 2009) have a terminal G:C base pair closing the acceptor stem, except for the tmRNAs of the C. ruddii HC/C. ruddii HT lineage. This anomaly is apparently due to a small deletion causing a 2-nt overlap between the 3′ termini of ssrA and the oppositely oriented tRNA-Phe gene, that changed the terminal residue of the tmRNA acceptor stem from the usual C to U (Figure 2). A base substitution mutation reverting this U back to C would have altered the discriminator base of tRNA-Phe; instead the deletion apparently drove the fixation of a compensatory mutation at the far end of ssrA producing the unique A:U closing base pair, which may allow better recognition by alanyl-tRNA synthetase than the post-deletion G:U pair would.

Although there were six complete bacterial genomes in which we failed to find tmRNA sequences, the genome of C. ruddii PC presents an especially clear case of pseudogenization. Because C. ruddii genomes show no rearrangement of gene order (Sloan and Moran, 2012), the site of any ssrA remnant could be predicted. An anchored segment (thin purple line in Figure 2) of the closely related C. ruddii PV genome is 216 bp (within which the tmRNA sequence occupies 202 bp); the corresponding segment in PC is 178 bp. This pseudogenization thus appears to have occurred largely in place and not by major deletion. The thoroughness of obliteration is remarkable; none of the most conserved regions of ssrA have been retained, neither for the 5′ tRNA-like domain, the resume codon region, nor the 3′ tRNA-like domain. Nucleotide bias has increased with this pseudogenization: GC content of the anchored region drops from 17.6% in PV to 13.5% in PC. We expected that without tmRNA, selective constraint on smpB would relax in PC, but there is no evidence for this. The 181 orthologous protein-coding gene pairs shared between the close relatives C. ruddii PV (which encodes tmRNA) and C. ruddii PC (which does not) have already been evaluated for selective regime, revealing that they are generally under a purifying selection regime with low dN/dS ratios (Sloan and Moran, 2012). For smpB, the dN/dS value is 0.14 (D. Sloan, pers. comm.), in the middle of the peak of the dN/dS distribution for all genes. This indicates that relative to other genes, purifying selection is not relaxed in PC for smpB, even after the loss of its partner ssrA. Perhaps ssrA loss was too recent to detect follow-on relaxation at smpB.
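The GC-content comparison of the anchored segments is a simple calculation, sketched below. The function name is hypothetical, and we assume that ambiguous bases such as N are excluded from the denominator, which the text does not specify.

```python
def gc_percent(seq: str) -> float:
    """GC content as a percentage of unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    unambiguous = sum(seq.count(base) for base in "ACGT")
    if unambiguous == 0:
        raise ValueError("sequence contains no unambiguous bases")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / unambiguous
```

Applying the same function to the two orthologous segments, each extracted between the same flanking anchors, keeps the comparison independent of the length difference between them.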

Neither ribosome rescue backup system seems available to compensate for ssrA loss; C. ruddii PC had no detectable ArfA while its two matches to ArfB gave much stronger matches to the better conserved proteins RF-1 and RF-2.

Hodgkinia

Hodgkinia cicadicola is an insect endosymbiont with an extremely reduced (134–144 kbp) genome of balanced nucleotide composition (46–58% GC), and it uses UGA as a Trp codon rather than Stop (McCutcheon and Moran, 2011). Despite applying the profiles and BLAST at highest sensitivity, considering its unusual genetic code, and specifically searching in the ssrA vicinity, we could not find smpB when only the H. cicadicola Dsem genome was available. With the recent arrival of two new genomes, one of them, H. cicadicola TETUND2, gave low but consistent signals with the profiles, identifying smpB and leading to its identification in the other two genomes. All three SmpBs lack the central loop. H. cicadicola Dsem may also have lost the C-terminal tail. The H. cicadicola TETUND1 smpB has further accumulated two TAA stop codons and we therefore classify it as a pseudogene.
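Calling a gene a pseudogene on the basis of in-frame stop codons depends on the genetic code in use. The sketch below counts premature stops in a coding sequence while letting the caller declare which standard stop codon, if any, has been reassigned to a sense codon in the organism. It is an illustration only; the function name and the toy sequence in the usage note are hypothetical, and the reading frame is assumed to start at the first base.

```python
from typing import FrozenSet, Iterable, List, Tuple

STANDARD_STOPS: FrozenSet[str] = frozenset({"TAA", "TAG", "TGA"})


def premature_stops(
    cds: str, reassigned: Iterable[str] = ()
) -> List[Tuple[int, str]]:
    """Return (codon_index, codon) for in-frame stops before the final codon.

    Codons listed in `reassigned` are treated as sense codons.
    """
    stops = STANDARD_STOPS - set(reassigned)
    cds = cds.upper().replace("U", "T")
    n_codons = len(cds) // 3
    found = []
    # The final codon is skipped because it is the expected terminator.
    for i in range(n_codons - 1):
        codon = cds[3 * i : 3 * i + 3]
        if codon in stops:
            found.append((i, codon))
    return found
```

For the toy sequence `ATGTGGTAAGCTTGAAAATAA`, the standard code reports premature stops at codon indices 2 (TAA) and 4 (TGA); declaring TGA as reassigned leaves only the TAA at index 2.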

Anomalies in Mycoplasma

The third group we find lacking the SmpB central loop is the hemoplasmas (hemotropic Mycoplasma), which also have reduced genomes. We prepared a genome-based phylogenetic tree for Mycoplasma (Figure 3) that included 7 hemoplasmas, which formed a clade in the tree with two main subclades, in agreement with Guimaraes et al. (2014), who named the two subclades haemofelis and suis. We were unable to identify either the tmRNA gene or any trace of it in any of the four genomes of the suis clade. The haemofelis clade did not help locate it because the haemofelis ssrA region (greA/ssrA/Hyp/rplQ/rpoA) is rearranged in the suis clade as greA/X/trmD/rpoA (where X is an 18 kbp insert of 26 hypothetical genes in M. wenyonii).

Figure 3

Non-stop mRNAs due to t(m)RNA gene overlap

The observation of smpB overlap with ssrA in Carsonella led us to ask how many mRNAs might become non-stop due to maturation of CDS-overlapping tmRNA or tRNA genes (Table 4). Others have found high-frequency non-stop mRNA caused by an RNase III site in arfA (Garza-Sánchez et al., 2011). We considered only the proteins positive for Pfam-A families, which account for 75.0% of the bacterial proteins studied, and for comparison included “questionable” tRNAs (probably mostly false positives) and oppositely oriented CDS/RNA gene pairs. We consider the 379 same-orientation overlaps of valid t(m)RNA genes as candidates for producing high-frequency non-stop mRNAs, although those with the CDS downstream of the RNA gene are suspicious; they may result from calling the start codon too far upstream. This represents an exceedingly small fraction of mRNAs tested (~1 in 15000). The top Pfam families among these candidates represent few evolutionary events, mostly affecting the same tRNA gene in a closely related group of genomes.
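The categories used for CDS/RNA gene overlaps (same or opposite orientation; CDS upstream, downstream, internal or spanning) can be made concrete with a small classifier. The text does not define these positions formally, so the definitions below are our assumptions: positions are judged relative to the RNA gene's own 5′ to 3′ direction, "spanning" means the CDS covers the whole RNA gene, and "internal" means the CDS lies wholly within it. The `Gene` record and function name are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Gene:
    start: int   # 0-based, inclusive
    end: int     # exclusive
    strand: str  # "+" or "-"


def classify_overlap(cds: Gene, rna: Gene) -> Optional[Tuple[str, str]]:
    """Return (orientation, position) for an overlapping pair, else None."""
    if not (cds.start < rna.end and rna.start < cds.end):
        return None  # no shared base
    orientation = "same" if cds.strand == rna.strand else "opposite"
    if cds.start <= rna.start and cds.end >= rna.end:
        position = "spanning"   # CDS covers the entire RNA gene
    elif cds.start >= rna.start and cds.end <= rna.end:
        position = "internal"   # CDS lies wholly inside the RNA gene
    else:
        # Partial overlap: the CDS extends past exactly one end of the RNA gene.
        cds_on_left = cds.start < rna.start
        rna_forward = rna.strand == "+"
        # Left of a forward gene, or right of a reverse gene, is its 5' side.
        position = "upstream" if cds_on_left == rna_forward else "downstream"
    return orientation, position
```

Under these definitions, a same-orientation CDS that runs into the 5′ end of a t(m)RNA gene is the configuration in which RNA maturation would be expected to cut the mRNA before its stop codon.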

Table 4

Category | Valid t(m)RNA | Questionable tRNA | Top Pfam domain of CDSs overlapping valid t(m)RNA | No. top Pfam | Settings for top Pfam
No. t(m)RNA | 115660 | 4809 | | |
Overlapping Pfam CDS | 828 | 1364 | | |
  Same orientation | 379 | 735 | | |
    CDS upstream | 250 | 244 | FTSW_RODA_SPOVE | 44 | All 44 are tRNAIle-CAT in Helicobacter
    CDS downstream | 106 | 186 | Aminotran_3 | 9 | 8 are tRNALeu-CAA in Prochlorococcus
    CDS internal | 0 | 92 | | |
    CDS spanning | 23 | 213 | GTP_EFTU | 6 | All 6 are tRNASec in Rhizobiales
  Opposite orientation | 449 | 629 | | |
    CDS upstream | 23 | 187 | RNB (RNase R) | 4 | All 4 are tRNALeu-CAG in Burkholderiaceae
    CDS downstream | 381 | 186 | Resolvase | 72 | Diverse settings
    CDS internal | 0 | 83 | | |
    CDS spanning | 45 | 173 | Resolvase | 16 | Diverse settings

Functional protein CDSs that overlap t(m)RNA genes.

Of the 6,489,445 original NCBI protein calls in the 2031 bacterial genome projects, 5,805,765 were positive for functionality with the Pfam/HMMER system (testing Pfam-A and Pfam-B) or with the CDD/RPSBLAST system, and were tested for overlap with either tmRNA genes from the tmRNA Website or tRNA genes found with a combination of tRNAscan-SE and Aragorn (see Materials and Methods for distinction between “valid” and “questionable” tRNAs).

Discussion

It is generally thought that neither tmRNA nor SmpB can function without the other (Sundermeier and Karzai, 2007; Felden and Gillet, 2011), although there are some counter-examples; e.g., smpB but not ssrA can be knocked out in Mycobacterium tuberculosis (Personne and Parish, 2014). Among the six bacteria that appear to lack tmRNA and 16 that appear to lack SmpB, none lack both; cofunction would predict eventual concomitant loss. In one case of tmRNA loss that we examined, selective constraint did not appear to relax for the remaining smpB. Both for tmRNA and SmpB, there may be more independent function than has been recognized.

The tmRNA literature cautions against reporting failure to find genes, and it is of course possible that our detection methods were inadequate or that genome sequences have errors, but we may be starting to identify bacteria that truly lack tmRNA or SmpB. These bacteria tend to have highly reduced genomes that have lost many genes otherwise widely conserved. It can moreover be noted that tmRNA-SmpB is lacking in most mitochondria and plastids, which likewise have highly reduced genomes derived from bacteria. Thus, tmRNA-SmpB is not always required in bacteria or their descendants. Those organelles where we can detect the system fit this pattern: the RNA gene is retained in the organelle and can be traced to the organelle's ancestral bacterial group, while the partner protein gene resides in the nucleus, encoding the appropriate organellar import peptide. Intracellular but non-organellar bacteria do not have this luxury of passing genes to the nucleus for safekeeping. However, nucleus-stored organellar proteins need not always derive from the organelle's ancestor. In our preliminary phylogenetic tree of SmpB (not shown), the plastid SmpBs did cluster with Cyanobacteria, but the mitochondrial SmpBs clustered apart from the Alphaproteobacteria.

The ArfA and ArfB backup systems for ribosome rescue are not of wide enough phylogenetic distribution to explain all the tmRNA or SmpB losses noted here, although a mitochondrial ArfB homolog has been reported (Richter et al., 2010), and additional analogs, homologs or backup systems may yet be discovered. The current data suggest that neither the primary nor the backup ribosome rescue systems are required in all bacteria.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We thank Daniel Sloan (Yale U.) for detailed data on Carsonella dN/dS values. This research was fully supported by the Laboratory Directed Research and Development program at Sandia National Laboratories. Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. Department of Energy's National Nuclear Security Administration under contract DE-AC04-94AL85000.

References

  • 1

    Andersen, E. S., Rosenblad, M. A., Larsen, N., Westergaard, J. C., Burks, J., Wower, I. K., et al. (2006). The tmRDB and SRPDB resources. Nucleic Acids Res. 34, D163–D168. 10.1093/nar/gkj142

  • 2

    Angiuoli, S. V., and Salzberg, S. L. (2011). Mugsy: fast multiple alignment of closely related whole genomes. Bioinformatics 27, 334–342. 10.1093/bioinformatics/btq665

  • 3

    Bateman, A., Agrawal, S., Birney, E., Bruford, E. A., Bujnicki, J. M., Cochrane, G., et al. (2011). RNAcentral: a vision for an international database of RNA sequences. RNA 17, 1941–1946. 10.1261/rna.2750811

  • 4

    Bessho, Y., Shibata, R., Sekine, S.-I., Murayama, K., Higashijima, K., Hori-Takemoto, C., et al. (2007). Structural basis for functional mimicry of long-variable-arm tRNA by transfer-messenger RNA. Proc. Natl. Acad. Sci. U.S.A. 104, 8293–8298. 10.1073/pnas.0700402104

  • 5

    Burge, S. W., Daub, J., Eberhardt, R., Tate, J., Barquist, L., Nawrocki, E. P., et al. (2013). Rfam 11.0: 10 years of RNA families. Nucleic Acids Res. 41, D226–D232. 10.1093/nar/gks1005

  • 6

    Castresana, J. (2000). Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol. Biol. Evol. 17, 540–552. 10.1093/oxfordjournals.molbev.a026334

  • 7

    Chadani, Y., Ito, K., Kutsukake, K., and Abo, T. (2012). ArfA recruits release factor 2 to rescue stalled ribosomes by peptidyl-tRNA hydrolysis in Escherichia coli. Mol. Microbiol. 86, 37–50. 10.1111/j.1365-2958.2012.08190.x

  • 8

    Chadani, Y., Ono, K., Kutsukake, K., and Abo, T. (2011). Escherichia coli YaeJ protein mediates a novel ribosome-rescue pathway distinct from SsrA- and ArfA-mediated pathways. Mol. Microbiol. 80, 772–785. 10.1111/j.1365-2958.2011.07607.x

  • 9

    Chadani, Y., Ono, K., Ozawa, S. I., Takahashi, Y., Takai, K., Nanamiya, H., et al. (2010). Ribosome rescue by Escherichia coli ArfA (YhdL) in the absence of trans-translation system. Mol. Microbiol. 78, 796–808. 10.1111/j.1365-2958.2010.07375.x

  • 10

    Chan, P. P., and Lowe, T. M. (2009). GtRNAdb: a database of transfer RNA genes detected in genomic sequence. Nucleic Acids Res. 37, D93–D97. 10.1093/nar/gkn787

  • 11

    De Novoa, P. G., and Williams, K. P. (2004). The tmRNA website: reductive evolution of tmRNA in plastids and other endosymbionts. Nucleic Acids Res. 32, D104–D108. 10.1093/nar/gkh102

  • 12

    Dong, G., Nowakowski, J., and Hoffman, D. W. (2002). Structure of small protein B: the protein component of the tmRNA–SmpB system for ribosome rescue. EMBO J. 21, 1845–1854. 10.1093/emboj/21.7.1845

  • 13

    Driscoll, T., Gillespie, J. J., Nordberg, E. K., Azad, A. F., and Sobral, B. W. (2013). Bacterial DNA sifted from the Trichoplax adhaerens (Animalia: Placozoa) genome project reveals a putative rickettsial endosymbiont. Genome Biol. Evol. 5, 621–645. 10.1093/gbe/evt036

  • 14

    Felden, B., and Gillet, R. (2011). SmpB as the handyman of tmRNA during trans-translation. RNA Biol. 8, 440–449. 10.4161/rna.8.3.15387

  • 15

    Garza-Sánchez, F., Schaub, R. E., Janssen, B. D., and Hayes, C. S. (2011). tmRNA regulates synthesis of the ArfA ribosome rescue factor. Mol. Microbiol. 80, 1204–1219. 10.1111/j.1365-2958.2011.07638.x

  • 16

    Griffiths-Jones, S., Moxon, S., Marshall, M., Khanna, A., Eddy, S. R., and Bateman, A. (2005). Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 33, D121–D124. 10.1093/nar/gki081

  • 17

    Guimaraes, A. M. S., Santos, A. P., Do Nascimento, N. C., Timenetsky, J., and Messick, J. B. (2014). Comparative genomics and phylogenomics of hemotrophic Mycoplasmas. PLoS ONE 9:e91445. 10.1371/journal.pone.0091445

  • 18

    Hafez, M., Burger, G., Steinberg, S. V., and Lang, F. (2013). A second eukaryotic group with mitochondrion-encoded tmRNA: in silico identification and experimental confirmation. RNA Biol. 10, 1117–1124. 10.4161/rna.25376

  • 19

    Handa, Y., Inaho, N., and Nameki, N. (2011). YaeJ is a novel ribosome-associated protein in Escherichia coli that can hydrolyze peptidyl–tRNA on stalled ribosomes. Nucleic Acids Res. 39, 1739–1748. 10.1093/nar/gkq1097

  • 20

    Iyer, L. M., Burroughs, A. M., and Aravind, L. (2006). The prokaryotic antecedents of the ubiquitin-signaling system and the early evolution of ubiquitin-like β-grasp domains. Genome Biol. 7, R60. 10.1186/gb-2006-7-7-r60

  • 21

    Jacob, Y., Sharkady, S. M., Bhardwaj, K., Sanda, A., and Williams, K. P. (2005). Function of the SmpB tail in transfer-messenger RNA translation revealed by a nucleus-encoded form. J. Biol. Chem. 280, 5503–5509. 10.1074/jbc.M409277200

  • 22

    Karzai, A. W., Roche, E. D., and Sauer, R. T. (2000). The SsrA–SmpB system for protein tagging, directed degradation and ribosome rescue. Nat. Struct. Mol. Biol. 7, 449–455. 10.1038/75843

  • 23

    Karzai, A. W., and Sauer, R. T. (2001). Protein factors associated with the SsrA·SmpB tagging and ribosome rescue complex. Proc. Natl. Acad. Sci. U.S.A. 98, 3040–3044. 10.1073/pnas.051628298

  • 24

    Kelley, L. A., and Sternberg, M. J. E. (2009). Protein structure prediction on the Web: a case study using the Phyre server. Nat. Protoc. 4, 363–371. 10.1038/nprot.2009.2

  • 25

    Laslett, D., and Canback, B. (2004). ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 32, 11–16. 10.1093/nar/gkh152

  • 26

    Laslett, D., Canback, B., and Andersson, S. (2002). BRUCE: a program for the detection of transfer-messenger RNA genes in nucleotide sequences. Nucleic Acids Res. 30, 3449–3453. 10.1093/nar/gkf459

  • 27

    Liang, W., and Deutscher, M. P. (2010). A novel mechanism for ribonuclease regulation: transfer-messenger RNA (tmRNA) and its associated protein SmpB regulate the stability of RNase R. J. Biol. Chem. 285, 29054–29058. 10.1074/jbc.C110.168641

  • 28

    Lowe, T. M., and Eddy, S. R. (1997). tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964. 10.1093/nar/25.5.0955

  • 29

    Mantri, Y., and Williams, K. P. (2004). Islander: a database of integrative islands in prokaryotic genomes, the associated integrases and their DNA site specificities. Nucleic Acids Res. 32, D55–D58. 10.1093/nar/gkh059

  • 30

    Mao, C., Bhardwaj, K., Sharkady, S. M., Fish, R. I., Driscoll, T., Wower, J., et al. (2009). Variations on the tmRNA gene. RNA Biol. 6, 355–361. 10.4161/rna.6.4.9172

  • 31

    McCutcheon, J. P., and Moran, N. A. (2011). Extreme genome reduction in symbiotic bacteria. Nat. Rev. Microbiol. 10, 13–26. 10.1038/nrmicro2670

  • 32

    Miller, M. R., and Buskirk, A. R. (2014). An unusual mechanism for EF-Tu activation during tmRNA-mediated ribosome rescue. RNA 20, 228–235. 10.1261/rna.042226.113

  • 33

    Neubauer, C., Gillet, R., Kelley, A. C., and Ramakrishnan, V. (2012). Decoding in the absence of a codon by tmRNA and SmpB in the ribosome. Science 335, 1366–1369. 10.1126/science.1217039

  • 34

    Pérez-Brocal, V., Gil, R., Ramos, S., Lamelas, A., Postigo, M., Michelena, J. M., et al. (2006). A small microbial genome: the end of a long symbiotic relationship? Science 314, 312–313. 10.1126/science.1130441

  • 35

    Personne, Y., and Parish, T. (2014). Mycobacterium tuberculosis possesses an unusual tmRNA rescue system. Tuberculosis 94, 34–42. 10.1016/j.tube.2013.09.007

  • 36

    Quinlan, A. R., and Hall, I. M. (2010). BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842. 10.1093/bioinformatics/btq033

  • 37

    Richter, R., Rorbach, J., Pajak, A., Smith, P. M., Wessels, H. J., Huynen, M. A., et al. (2010). A functional peptidyl-tRNA hydrolase, ICT1, has been recruited into the human mitochondrial ribosome. EMBO J. 29, 1116–1125. 10.1038/emboj.2010.14

  • 38

    Sloan, D. B., and Moran, N. A. (2012). Genome reduction and co-evolution between the primary and secondary bacterial symbionts of psyllids. Mol. Biol. Evol. 29, 3781–3792. 10.1093/molbev/mss180

  • 39

    Stamatakis, A. (2006). RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22, 2688–2690. 10.1093/bioinformatics/btl446

  • 40

    Sundermeier, T. R., and Karzai, A. W. (2007). Functional SmpB-ribosome interactions require tmRNA. J. Biol. Chem. 282, 34779–34786. 10.1074/jbc.M707256200

  • 41

    Venkataraman, K., Guja, K. E., Garcia-Diaz, M., and Karzai, A. W. (2014). Non-stop mRNA decay: a special attribute of trans-translation mediated ribosome rescue. Front. Microbiol. 5:93. 10.3389/fmicb.2014.00093

Summary

Keywords

tmRNA, SmpB, trans-translation, Carsonella, Mycoplasma

Citation

Hudson CM, Lau BY and Williams KP (2014) Ends of the line for tmRNA-SmpB. Front. Microbiol. 5:421. doi: 10.3389/fmicb.2014.00421

Received

01 June 2014

Accepted

24 July 2014

Published

13 August 2014

Volume

5 - 2014

Edited by

Kenneth C. Keiler, Pennsylvania State University, USA

Reviewed by

Pavel V. Baranov, University College Cork, Ireland; Torsten Hain, University of Giessen, Germany

Copyright

*Correspondence: Kelly P. Williams, Sandia National Laboratories, Department of Systems Biology, 7011 East Ave., Livermore, CA 94550, USA e-mail:

This article was submitted to Microbial Physiology and Metabolism, a section of the journal Frontiers in Microbiology.
