Nucleotide sequence of conjugative prophage Φ1207.3 (formerly Tn1207.3) carrying the mef(A)/msr(D) genes for eﬄux resistance to macrolides in Streptococcus pyogenes

Genetic element Φ1207.3 (formerly Tn1207.3) is a prophage of Streptococcus pyogenes which carries the macrolide eﬄux resistance genes mef(A)/msr(D) and is capable of conjugal transfer among streptococci. Complete nucleotide sequence showed that Φ1207.3 is 52,491 bp in length and contained 58 open reading frames (ORFs). A manual homology-based annotation with functional prediction of the hypothetical gene product was possible only for 34 out of 58 ORFs. Φ1207.3 codes for two different C-methylation systems, several phage structural genes, a lysis cassette (composed by a holin and a peptidoglycan hydrolase), and three site-specific resolvases of the serine recombinase family.


INTRODUCTION
In Streptococcus pyogenes, the mef (A)/msr(D) pair of genes encoding efflux resistance to 14-and 15-membered macrolides is carried by a mobile genetic element originally described as a conjugative transposon which was called Tn1207.3 (Santagati et al., 2003;Pozzi et al., 2004). This element was found to be 52.5 kb in size, and to contain a complete copy of Tn1207.1, a 7,244-bp defective transposon previously found to carry mef(A)/msr (D) in S. pneumoniae (Santagati et al., 2000). Integration of the element into the S. pyogenes chromosome occurred at a specific GA dinucleotide target site located into the comEC coding sequence, with integration producing a duplication of the GA site. Upon conjugal transfer to S. pneumoniae, chromosomal integration occurred in celB, the pneumococcal homolog of comEC, at the same insertion site of Tn1207.1 (Santagati et al., 2003;Pozzi et al., 2004). A copy of Tn1207.1 was also found integrated in the 58,761-bp genetic element 10394.4, described as a prophage integrated at the same GA site within the comEC coding sequence of an erythromycin resistant clinical strain of S. pyogenes (Banks et al., 2003(Banks et al., , 2004.
Here we report the manually annotated DNA sequence of Tn1207.3 which indicates that the element is in fact a prophage, identical to the right end of 10394.4. For this reason the element was renamed 1207.3.

Streptococcus pyogenes STRAINS AND GROWTH CONDITIONS
2812A is an erythromycin-resistant italian clinical isolate containing 1207.3 (Santagati et al., 2003). Bacteria were routinely grown in tryptic soy broth or tryptic soy agar (Difco) supplemented with 3% horse blood and, where appropriate, in presence of 1 μg/ml erythromycin.

PCR AND SEQUENCING
Long PCR fragments were obtained with Takara LA Taq (Takara) following essentially the protocol suggested by the manufacturer. Briefly, the 25-μl reaction mixture was in 1X LA PCR Buffer II Taq buffer and contained: (i) 2.5 mM MgCl 2 , (ii) 200 μM dNTPs, (iii) 10 pmol of each primer, (iv) 0.25 units of Takara LA Taq, (v) 1 μl of liquid bacterial culture. Thermal cycling profile was as follows: 1 cycle at 92 • C for 2 min, then 30 cycles at 50 • C for 10 s, 68 • C for 15 min, 92 • C for 10 s, and 1 cycle at 50 • C for 1 min and 68 • C for 20 min. A primer walking approach (Santoro et al., 2010) was used to sequence the PCR products. The Expand High Fidelity www.frontiersin.org PCR System (Roche) was used to produce PCR fragments of about 1,000 bp in size which were used as sequencing starting template to confirm sequence on the other strand. Four primer pairs were used to amplify the four fragments, ranging from 10,358 to 16,223 bp in size, containing 1207.3 and its chromosomal junction fragments.

DNA SEQUENCE ANALYSIS
DNA sequence analysis was performed with the software Artemis version 11 1 . Manual gene annotation was carried out conducting BLAST homology searches of the databases available at the National Center for Biotechnology Information 2 . Protein domains were identified searching the protein family database Pfam available at the Wellcome Trust Sanger Institute 3 . The nucleotide sequence of 1207.3 is assigned GenBank accession no. AY657002.

NUCLEOTIDE SEQUENCE
The complete nucleotide sequence of 1207.3 was obtained by primer walking on four long PCR fragments spanning the whole element. Inverse PCR on genomic DNA of S. pyogenes strain 2812A was carried out with divergent primers matching the already known 7,244-bp sequence of 1207.3 and its chromosomal junction sequences (Santagati et al., 2003). Amplicons obtained were used as sequencing template to determine internal sequences of 1207.3 on which primers were designed for the amplification of the four long overlapping fragments. The nucleotide sequence of 1207.3 was confirmed on the other strand using short PCR fragments as sequencing templates. 1207.3 was found to be 52,491 bp in length and DNA sequence analysis showed the presence of 58 open reading frames (ORFs), 44 of which have the same direction of transcription (Figure 1). Regions with different GC content can be could be identified in the DNA sequence of 1207.3: (i) the left end, containing ORFs 1-11, with 36.4% GC; (ii) ORFs 12-16 with 31.6% GC; (iii) ORFs 17-54 with 41,4% GC; (iv) the right end, corresponding to ORFs 55-58, with 37,8% GC (Figure 1). Comparison of 1207.3 with sequences present in public databases showed that: (i) the first 7,244 bp of the 1207.3 sequence are identical to the 7,244-bp element Tn1207.1 of S. pneumoniae (Santagati et al., 2000(Santagati et al., , 2003; (ii) the whole 1207.3 is identical to the 52,491-bp right end sequence of S. pyogenes 10394.4 (Banks et al., 2003;Pozzi et al., 2004); (iii) Seven sequence fragments, ranging in size from 0.2 to 9.5 kb, show 76-97% nucleotide identity to the S. pyogenes prophage m46.1 (Brenciani et al., 2010); (iv) homologous sequence fragments are present in the genetic elements from the genomes of S. dysgalactiae AC-2713 (GenBank no. HE858529), S. suis JS14 (GenBank no. CP002465), and S. agalactiae A909 (GenBank no. CP000114).

ORFs OF 1207.3
For 34 out of 58 ORFs it was possible a manual homology-based annotation with functional prediction of the hypothetical gene product ( Table 1). Public protein databases and the Pfam protein family database were used for Blast searches of predicted gene products, taking into account significant homologies with functionally characterized proteins or good matches with Pfam domains. A putative ribosome binding sequence preceded all ORFs except orf36 and orf37. The alternative start codon TTG was present in orf34, orf36, orf43, and orf56, whereas orf37, orf38, and orf51 started with GTG. In the left arm of the element, the gene product of orf2 was predicted to be a site-specific resolvase of the serine recombinase family, mef(A) (orf4) and msr(D) (orf5) encoded respectively the transmembrane domains and the ATP-binding domains of the ABC transporter responsible for the M-type resistance to macrolides (Iannelli et al., 2014, Abstract C1-1188 Interscience Conference Antimicrobial Agents Chemotherapy, 2004), orf8 and orf11 were homologous to the Tn5253 umuC/umuD operon conferring UV resistance by activation of the SOS repair system (Munoz-Najar and Vijayakumar, 1999). SpyIM, encoded by orf12, is a C-5 cytosine-specific DNA methylase belonging to the SpyI type II restriction-modification Frontiers in Microbiology | Antimicrobials, Resistance and Chemotherapy www.frontiersin.org cassette, and responsible for inhibition of restriction by SmaI, whereas orf13 and orf14 encoded the two subunits of SpyI restriction endonuclease (Euler et al., 2007). Other genes coding for putative restriction-modification proteins include orf24 (restriction enzyme), orf30 (endonuclease), orf33 (cytosine-specific DNA methylase), while orf29 gene product presented an Sadenosylmethionine synthetase domain which may act as a methyl group donor for Orf33. In the central region, orf17, orf23, and orf25 encoded a DNA polymerase, a DNA primase, and a DNA helicase, possibly involved in phage DNA replication, whereas orf38 and orf39 encoded for the small and large subunit of the phage terminase, which, together with the portal protein encoded by orf40, could be involved in phage DNA packaging. orf42, orf44, orf45, and orf50 code for putative structural phage proteins. A putative lysis cassette, which is typically composed by a holin and an endolysin (Young et al., 2000) was encoded by hol (orf53) and skl (orf54). At the right end of the element, orf56 and orf57 gene products were homologous to resolvases of the serine recombinase family, possibly involved in excision, circularization, and site specific integration of 1207.3, whereas orf58 encoded a putative ADP-ribosyltransferase toxin.

DISCUSSION
The complete and annotated DNA sequence of the mobile genetic element previously called Tn1207.3 clearly shows that the element is a prophage which we renamed 1207.3. At the sequence level, 1207.3 (52,491 bp) shows homology to two S. pyogenes prophages: (i) 10394.4 (58,761 bp), integrated at the same chromosomal site of 1207.3 (Banks et al., 2003;Pozzi et al., 2004); (ii) m46.1 (55,172 bp), integrated in the rum gene encoding an RNA uracil methyltransferase (Brenciani et al., 2010). The whole 1207.3 is identical to the right end of 10394.4, whereas high homology (>70%) to m46.1 is limited to 57% of the 1207.3 genome. Prophages similar to 1207.3 were also described by Giovanetti et al. (2005).
The recombination machinery of 1207.3 consists of three resolvases of the serine recombinase family. Serine recombinases are less common than tyrosine recombinases in prophage genomes and are usually present as a single large recombinase gene (Smith and Thorpe, 2002). Most of the S. pyogenes prophages present in sequenced genomes have a tyrosine recombinase (integrase) as the recombination module (Beres and Musser, 2007). The two resolvase genes at the right end of 1207.3 are probably transcribed as a single unit, as their coding sequences overlap by one nucleotide. It is likely that their gene products cooperate in mediating excision and integration of the 1207.3 DNA. This arrangement in tandem of two resolvase genes is also found in SSCmec elements (types I to IV) of Staphylococcus aureus (Wang and Archer, 2010), and in streptococcal prophage m46.1 (Giovanetti et al., 2005;Brenciani et al., 2010).
Since 1207.3 can move among streptococcal species (S. pyogenes, S. pneumoniae, S. gordonii) by a mechanism which fits the operational definition of conjugation (Santagati et al., 2003), this conjugative prophage may represent a novel class of genetic elements with a molecular mechanism of transfer that still needs to be elucidated. It is entirely possible that assembly of a complete phage particle may not be essential for the observed interspecific DNA transfer, and we hypothesize that the lysis cassette of 1207.3, rather than producing the bacterial cell burst after intracellular phage expansion (Young et al., 2000), could contribute to the formation of cytoplasmic bridges between donor and recipient cell.