<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2023.1212149</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Slipped-strand mispairing within a polycytidine tract in transcriptional regulator <italic>mga</italic> leads to M protein phase variation and Mga length polymorphism in Group A Streptococcus</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Lei</surname>
<given-names>Benfang</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<xref rid="c001" ref-type="corresp"><sup>&#x002A;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/201274/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Hanks</surname>
<given-names>Tracey S.</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/2297504/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Bao</surname>
<given-names>Yunjuan</given-names>
</name>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Liu</surname>
<given-names>Mengyao</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Microbiology and Cell Biology, Montana State University</institution>, <addr-line>Bozeman, MT</addr-line>, <country>United States</country></aff>
<aff id="aff2"><sup>2</sup><institution>State Key Laboratory of Biocatalysis and Enzyme Engineering, School of Life Sciences, Hubei University</institution>, <addr-line>Wuhan, Hubei</addr-line>, <country>China</country></aff>
<author-notes>
<fn id="fn0001" fn-type="edited-by"><p>Edited by: Axel Cloeckaert, Institut National de recherche pour l&#x2019;agriculture, l&#x2019;alimentation et l&#x2019;environnement (INRAE), France</p></fn>
<fn id="fn0002" fn-type="edited-by"><p>Reviewed by: Andrea L. Herrera, University of South Dakota, United States; Victor C. Huber, University of South Dakota, United States; Samuel A. Shelburne, University of Texas MD Anderson Cancer Center, United States; Stephen B. Beres, Houston Methodist Research Institute, United States</p></fn>
<corresp id="c001">&#x002A;Correspondence: Benfang Lei, <email>blei@montana.edu</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>26</day>
<month>06</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1212149</elocation-id>
<history>
<date date-type="received">
<day>26</day>
<month>04</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>06</day>
<month>06</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2023 Lei, Hanks, Bao and Liu.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Lei, Hanks, Bao and Liu</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>The M protein, a major virulence factor of Group A Streptococcus (GAS), is regulated by the multigene regulator Mga. An unexplained phenomena frequently occurring with <italic>in vitro</italic> genetic manipulation or culturing of M1T1 GAS strains is the loss of M protein production. This study was aimed at elucidating the basis for the loss of M protein production. The majority of M protein-negative (M<sup>&#x2212;</sup>) variants had one C deletion at a tract of 8 cytidines starting at base 1,571 of the M1 <italic>mga</italic> gene, which is designated as c.1571C[8]. The C deletion led to a c.1571C[7] <italic>mga</italic> variant that has an open reading frame shift and encodes a Mga-M protein fusion protein. Transformation with a plasmid containing wild-type <italic>mga</italic> restored the production of the M protein in the c.1571C[7] <italic>mga</italic> variant. Isolates producing M protein (M<sup>+</sup>) were recovered following growth of the c.1571C[7] M protein-negative variant subcutaneously in mice. The majority of the recovered isolates with reestablished M protein production had reverted back from c.1571C[7] to c.1571C[8] tract and some M<sup>+</sup> isolates lost another C in the c.1571C[7] tract, leading to a c.1571C[6] variant that encodes a functional Mga with 13 extra amino acid residues at the C-terminus compared with wild-type Mga. The nonfunctional c.1571C[7] and functional c.1571C[6] variants are present in M1, M12, M14, and M23 strains in NCBI genome databases, and a G-to-A nonsense mutation at base 1,657 of M12 c.1574C[7] <italic>mga</italic> leads to a functional c.1574C[7]/1657A <italic>mga</italic> variant and is common in clinical M12 isolates. The numbers of the C repeats in this polycytidine tract and the polymorphism at base 1,657 lead to polymorphism in the size of Mga among clinical isolates. These findings demonstrate the slipped-strand mispairing within the c.1574C[8] tract of <italic>mga</italic> as a reversible switch controlling M protein production phase variation in multiple GAS common M types.</p>
</abstract>
<kwd-group>
<kwd>Group A Streptococcus</kwd>
<kwd><italic>Streptococcus pyogenes</italic></kwd>
<kwd>M protein</kwd>
<kwd>Mga</kwd>
<kwd>phase variation</kwd>
<kwd>slipped-strand mispairing</kwd>
</kwd-group>
<contract-num rid="cn1">AI153755</contract-num>
<contract-sponsor id="cn1">National Institutes of Health and the Montana State Agricultural Experimental Station</contract-sponsor>
<counts>
<fig-count count="5"/>
<table-count count="4"/>
<equation-count count="0"/>
<ref-count count="47"/>
<page-count count="12"/>
<word-count count="9583"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Infectious Agents and Disease</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="sec1" sec-type="intro">
<title>Introduction</title>
<p><italic>Streptococcus pyogenes</italic> or Group A Streptococcus (GAS) is a major human pathogen that commonly causes pharyngitis and skin infection (<xref ref-type="bibr" rid="ref5">Carapetis et al., 2005</xref>). GAS can occasionally cause severe invasive infections, such as necrotizing fasciitis, pneumonia, and streptococcal toxic shock syndrome (<xref ref-type="bibr" rid="ref26">Nelson et al., 2016</xref>). GAS produces many extracellular virulence factors to mediate its pathogenesis (<xref ref-type="bibr" rid="ref7">Cunningham, 2008</xref>). The M protein is a coiled-coil &#x03B1;-helical surface protein that is covalently linked to the peptidoglycan of GAS (<xref ref-type="bibr" rid="ref23">McNamara et al., 2008</xref>). The M protein is a major virulence factor and specifically acquires host proteins, such as fibrinogen and complement factor H, to block deposition of antibody and complement C3b and C3bi opsonins, preventing phagocytosis of GAS by neutrophils (<xref ref-type="bibr" rid="ref11">Fischetti et al., 1988</xref>; <xref ref-type="bibr" rid="ref32">Sandin et al., 2006</xref>).</p>
<p>The M protein gene, <italic>emm</italic>, is one of several genes with the highest expression in GAS at the exponential growth phage in nutrient-rich medium. The transcription of <italic>emm</italic> is regulated by the transcription activator Mga (for multigene regulator in GAS and formerly known as Mry) (<xref ref-type="bibr" rid="ref4">Caparon and Scott, 1987</xref>; <xref ref-type="bibr" rid="ref31">Ribardo and McIver, 2006</xref>). Mga directly activates <italic>emm</italic> and several other virulence genes (<xref ref-type="bibr" rid="ref22">McIver et al., 1995</xref>). Expression of M protein in M12 type GAS exhibits phase variation, transiting from M protein-positive (M<sup>+</sup>) to M protein-negative (M<sup>&#x2212;</sup>) phenotype during culture <italic>in vitro</italic> (<xref ref-type="bibr" rid="ref35">Simpson and Cleary, 1987</xref>). Similarly, the loss of M protein production and down-regulation of <italic>emm</italic> transcription in some M1 type GAS strains was observed fairly frequently in conjunction with genetic manipulations for the generation of gene deletion mutants unrelated to the <italic>emm</italic> gene (<xref ref-type="bibr" rid="ref47">Zhou et al., 2013</xref>). In as much as the loss of M protein production and down-regulation of <italic>emm</italic> transcription occurred independent of the gene being deleted, presumably it was caused by spontaneous secondary mutation(s) affecting M protein production but unrelated to the gene being deleted (<xref ref-type="bibr" rid="ref47">Zhou et al., 2013</xref>). In both the M12 and M1 genetic backgrounds, the specific genetic change(s) underlying the loss of M protein production observed in these studies was not determined.</p>
<p>Phase variation in general refers to a reversible on/off switch for control of expression of one or more proteins between individual cells of a clonal population (<xref ref-type="bibr" rid="ref44">van der Woude, 2011</xref>). One mechanism of phase variation is slipped-strand mispairing that classically affects DNA regions with direct repeats of 1&#x2013;4 bases, with reversible genetic changes stemming from the misalignment of the repeat sequences of the mother and daughter strands during replication (<xref ref-type="bibr" rid="ref15">Levinson and Gutman, 1987</xref>; <xref ref-type="bibr" rid="ref12">Henderson et al., 1999</xref>; <xref ref-type="bibr" rid="ref45">Viguera et al., 2001</xref>). Slipped-strand mispairing results in decrease or increase of the number of repeats in the newly synthesized DNA and can lead to altered gene expression at either the transcriptional or translational level (<xref ref-type="bibr" rid="ref37">Stern et al., 1986</xref>; <xref ref-type="bibr" rid="ref39">Stibitz et al., 1989</xref>; <xref ref-type="bibr" rid="ref21">Lukomski et al., 2001</xref>; <xref ref-type="bibr" rid="ref30">Rasmussen and Bj&#x00F6;rck, 2001</xref>; <xref ref-type="bibr" rid="ref42">Torres-Cruz and van der Woude, 2003</xref>). The <italic>mga</italic> sequences fall into two major clusters that display 25 to 30% divergence (<xref ref-type="bibr" rid="ref2">Bessen et al., 2005</xref>). The <italic>mga</italic> genes in cluster 1 (<italic>mga</italic>-1), but not <italic>mga</italic>-2 cluster, have a tract of 8 continuous cytidine bases (polycytidine tract) starting at base 1,574, which is referred as c.1574C[8] according to the standard mutation nomenclature (<xref ref-type="bibr" rid="ref28">Ogino et al., 2007</xref>). This report presents evidence that slipped-strand mispairing within the c.1571C[8] tract of <italic>mga</italic> functions as a switch for the phase variation of M protein and occurs in patients.</p>
</sec>
<sec id="sec2" sec-type="materials|methods">
<title>Materials and methods</title>
<sec id="sec3">
<title>Bacterial strains and media</title>
<p>Sequenced M1 strains SF370 (<xref ref-type="bibr" rid="ref10">Ferretti et al., 2001</xref>), MGAS2221 (<xref ref-type="bibr" rid="ref40">Sumby et al., 2006</xref>), MGAS5005 (<xref ref-type="bibr" rid="ref40">Sumby et al., 2006</xref>), and 5448 (<xref ref-type="bibr" rid="ref46">Walker et al., 2007</xref>), and M3 strain MGAS315 (<xref ref-type="bibr" rid="ref1">Beres et al., 2002</xref>) were used in this study. These GAS strains and their derivatives were grown in Todd-Hewitt broth supplemented with 0.2% yeast extract (THY) at 37&#x00B0;C in 5% CO<sub>2</sub>. Tryptose agar with 5% sheep blood and THY agar were used as solid media.</p>
</sec>
<sec id="sec4">
<title><italic>In vitro</italic> culture condition that led to GAS variants with loss of M protein production</title>
<p>MGAS2221 was cultured in THY starting with an optical density at 600&#x2009;nm (OD<sub>600</sub>) of 0.05 at 37&#x00B0;C in 5% CO<sub>2</sub>. The continued culture was done by adding 1&#x2009;mL of the prior culture to 11&#x2009;mL of THY and grown for 12&#x2009;h. This serial passage continuous culturing process was repeated for 15 cycels. Following the 15th passage bacteria were harvested at an OD<sub>600</sub> of 0.3, diluted and plated on THY agar. Randomly picked colonies were streaked on plates, grown to mid-exponential growth, and stored frozen until analysis for detection of the M protein by Western blotting analysis. Gene deletion mutants of SF370, MGAS2221, MGAS5005, and MGAS315 were obtained in a two-step in-frame gene deletion procedure as described (<xref ref-type="bibr" rid="ref48">Zhu et al., 2009</xref>). The step in which mutants might lose M protein production was the growth of isolates from the first crossover in THY for &#x003E;8 passages (one passage&#x2009;=&#x2009;0.05 to 0.7 in OD<sub>600</sub>) that allowed the second crossover event to occur for the generation of gene deletion mutants.</p>
</sec>
<sec id="sec5">
<title>Cell surface M protein detection by Western blotting</title>
<p>The cell surface M protein of M1 GAS isolates was detected by Western blotting, as previously reported (<xref ref-type="bibr" rid="ref47">Zhou et al., 2013</xref>). Briefly, bacteria in 10&#x2009;mL culture with an OD<sub>600</sub> of 0.2 were harvested by centrifugation, suspended in 0.2&#x2009;mL phosphate-buffered saline (PBS), and digested with 10&#x2009;&#x03BC;g PlyC (<xref ref-type="bibr" rid="ref27">Nelson et al., 2006</xref>) at room temperature for 1&#x2009;h. The supernatants containing the released cell wall proteins were diluted with 1:60 1x SDS-PAGE loading buffer, boiled, and 10&#x2009;&#x03BC;L of each sample was resolved by SDS-PAGE. Proteins were transferred from SDS-PAGE gel to nitrocellulose membrane (Immobilon-NC, Millipore Corporation) with Towbin transfer buffer using a Trans-Blot Semi-Dry Transfer Cell (Bio-Rad Laboratories) at 15&#x2009;V for 40&#x2009;min. The membrane was blocked with 3% bovine serum albumin&#x2013;0.1% Tween 20 in 20&#x2009;mM Tris&#x2013;HCl, pH 8.0, for 1&#x2009;h and incubated for 1&#x2009;h with 1:2000 anti-M1 antisera that was kindly provided by Dr. James Dale at University of Tennessee Health Science Center. The membrane was then rinsed twice and washed three times for 15&#x2009;min with 0.1% Tween 20 in PBS. The membrane was incubated with goat anti-rabbit IgG (heavy + light chain) horseradish peroxidase (HRP) conjugate secondary antibodies for 1&#x2009;h and rinsed and washed as described above. Antigen&#x2013;antibody reactivity was visualized by enhanced chemiluminescence. Western blots usually shows two bands for M protein and the higher MW band is M protein with attached peptidoglycan fragments.</p>
</sec>
<sec id="sec6">
<title>Quantitative RT-PCR analysis</title>
<p>Data for <italic>emm</italic> mRNA levels in <xref rid="SM1" ref-type="supplementary-material">Supplementary Table S1</xref> were associated with our previous publications (<xref ref-type="bibr" rid="ref48">Zhu et al., 2009</xref>; <xref ref-type="bibr" rid="ref20">Liu et al., 2012</xref>; <xref ref-type="bibr" rid="ref17">Li et al., 2013</xref>; <xref ref-type="bibr" rid="ref47">Zhou et al., 2013</xref>; <xref ref-type="bibr" rid="ref18">Liu et al., 2015</xref>; <xref ref-type="bibr" rid="ref38">Stetzner et al., 2015</xref>; <xref ref-type="bibr" rid="ref9">Feng et al., 2017</xref>). Levels of <italic>emm</italic> and <italic>gyrA</italic> (control) mRNA were measured by using TaqMan quantitative RT-PCR assays with specific probes and primers as reported previously (<xref ref-type="bibr" rid="ref47">Zhou et al., 2013</xref>) or using the All-in-One SYBR qPCR mix from GeneCopoeia (<xref ref-type="bibr" rid="ref38">Stetzner et al., 2015</xref>). The transcription data for MGAS2221, GAS1806, M405, M406, and M497 were measured using the TaqMan quantitative RT-PCR as described by <xref ref-type="bibr" rid="ref47">Zhou et al. (2013)</xref>. The data were for early exponential growth phase at optical density 0.2 of GAS in THY. Control reactions that did not contain reverse transcriptase revealed no contamination of genomic DNA in any RNA sample. All RNA samples were assayed in triplicate, and the levels of <italic>emm</italic> mRNA were compared using the &#x0394;&#x0394;CT method with normalization to the mRNA levels of the <italic>gyrA</italic> gene and presented with normalization to that of corresponding wild-type or control strain.</p>
</sec>
<sec id="sec7">
<title>Complementation of M<sup>-</sup> variants with <italic>mga</italic></title>
<p>The <italic>mga</italic> gene of MGAS2221 was cloned into pDCBB-RFA (<xref ref-type="bibr" rid="ref19">Liu et al., 2013</xref>) with the Gateway Cloning Technology according to the manufacturer&#x2019;s manual. pDCBB (<xref ref-type="bibr" rid="ref43">Trevi&#x00F1;o et al., 2009</xref>) was modified by inserting the blunt-ended reading frame cassette A (RFA) into pDCBB at the EcoRV site, resulting in pDCBB-RFA (<xref ref-type="bibr" rid="ref19">Liu et al., 2013</xref>). The <italic>mga</italic> gene was PCR amplified using Phusion DNA polymerase (New England BioLabs, Ipswich, MA) and the primers 5&#x2032;- GGGGACAAGTTTGTACAAAAAAGCAGGCTagaagggtatacaaggtaatg-3&#x2032; and 5&#x2019;-GGGGACCACTTTGTACAAGAAAGCTGGGTgtttttgagttgctacagtta-3&#x2032;. The sequences in the capital letters were <italic>attB</italic> sequences for the BP clonase reaction. The PCR product was cloned into the donor vector pDONR221 using the BP clonase, yielding pDONR221-<italic>mga</italic>. The <italic>mga gene</italic> in pDONR221 was transferred into pDCBB-RFA by the LR clonase, yielding pDCBB-<italic>mga</italic>. pDCBB-<italic>mga</italic> or vector control pDCBB was introduced into M<sup>-</sup> variants <italic>via</italic> electroporation.</p>
</sec>
<sec id="sec8">
<title>DNA sequencing</title>
<p>DNA sequencing of amplified PCR products was performed using the BigDye Terminator v3.1&#x2009;Cycle Sequencing Kit and an Applied Biosystems 3130 genetic analyzer. Sequence data were analyzed using the software Sequencer 5.1 from the Gene Codes Corporation. To sequence the whole <italic>mga</italic> gene, a 2,454-bp fragment containing <italic>mga</italic> was amplified using Phusion DNA polymerase and the paired primers 5&#x2032;-GTTGTACCATAACAGTCAAAC-3&#x2032;/5&#x2032;-TTTCAAGTTCTTCAGCTCTC-3&#x2032;. The primers used for sequencing were the two PCR primers and additional primers, 5&#x2032;-AACGAATCAAGTTAACTGAGC-3&#x2032;, 5&#x2032;-TCCTAAACTTAAAGAACTGTG-3&#x2032;, 5&#x2032;-TGTCACGATCACATCATACTG-3&#x2032;, and 5&#x2032;-TTTAACAGTGTTGGTAATTTC-3&#x2032;. To sequence the c.1574C[8] region of M1 and M1T1 GAS isolates, a 469-bp fragment was amplified and sequenced using the primers 5&#x2032;-TTTAAACATCAGCTTTGCAGA-3&#x2032; and 5&#x2032;-TCTTCTATAACTTCCCTAGGA-3&#x2032;. To sequence the c.1592C[8] region of M3 GAS isolates, a 457-bp fragment was amplified and sequenced using the primers 5&#x2032;&#x2013;TTTTTAAACATCAGCTCTGCAGA&#x2013;3&#x2032; and 5&#x2032;&#x2013;CATTAACACTCCTAGCATCTG&#x2013;3&#x2032;.</p>
</sec>
<sec id="sec9">
<title>M<sup>+</sup> isolates from M<sup>-</sup> variants in subcutaneous infection of mice</title>
<p>M<sup>-</sup> variants were grown in THY, harvested at the mid-exponential growth phase (OD<sub>600</sub> of 0.4) and washed three times with and resuspended in pyrogen-free Dulbecco&#x2019;s phosphate-buffered saline (DPBS). Five 5-weeks-old female C57BL/6&#x2009;J mice were injected subcutaneously with 0.2&#x2009;mL of GAS suspension in DPBS with an OD<sub>600</sub> of 0.8. The mice were euthanized with CO<sub>2</sub> at day 4 after inoculation. Skin infection sites were collected and homogenized in DPBS using a Kontes pestle, and plated at appropriate dilutions. Randomly picked 70 colonies were grown and analyzed for detection of M protein production using the western blotting analysis as described above. The mouse experimental procedures were carried out in strict accordance with the recommendations in the <italic>Guide for the Care and Use of Laboratory Animals</italic> of the <xref ref-type="bibr" rid="ref25">National Research Council (2011)</xref>.</p>
</sec>
<sec id="sec10">
<title>Analyses of <italic>mga</italic> gene and Mga protein sequences from genome databases</title>
<p>The DNA sequences of the <italic>mga</italic> gene and its protein sequences were analyzed for M1, M3, M12, M14, M23, and M49. Complete and incomplete GAS genomes as of October 10, 2019 were downloaded from the NCBI GenBank database.<xref rid="fn0003" ref-type="fn"><sup>1</sup></xref> M type of GAS strains was determined by aligning their sequences with the <italic>emm</italic> sequences in an <italic>emm</italic> genotype database.<xref rid="fn0004" ref-type="fn"><sup>2</sup></xref> Because there were no polymorphisms in the length of Mga-2 of M49 GAS, we just listed all of 158 M1, 106 M3, 106&#x2009;M12, 6&#x2009;M14, and 2&#x2009;M23 strains that were available at the time for analysis (<xref rid="SM1" ref-type="supplementary-material">Supplementary Tables S2&#x2013;S4</xref> and <xref rid="tab1" ref-type="table">Table 1</xref>). The nucleotides of <italic>mga</italic> and <italic>emm</italic> in each genome were extracted by mapping to the <italic>mga</italic> and <italic>emm</italic> locus using blast (<xref ref-type="bibr" rid="ref3">Camacho et al., 2009</xref>). Multiple sequence alignments of <italic>mga</italic> DNA sequences and Mga protein sequences were performed using ClustalO (<xref ref-type="bibr" rid="ref34">Sievers et al., 2011</xref>).</p>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption><p>Polymorphism in polycytidine tract of <italic>mga</italic> and Mga length of M14 and M23 GAS.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Strain</th>
<th align="center" valign="top">M type</th>
<th align="center" valign="top">Polycytidine tract</th>
<th align="center" valign="top">AA No. of Mga</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">NS501</td>
<td align="center" valign="top">M14</td>
<td align="center" valign="top">c.1574C[8]</td>
<td align="center" valign="top">530</td>
</tr>
<tr>
<td align="left" valign="top">NS506</td>
<td align="center" valign="top">M14</td>
<td align="center" valign="top">c.1574C[8]</td>
<td align="center" valign="top">530</td>
</tr>
<tr>
<td align="left" valign="top">33042V1T1</td>
<td align="center" valign="top">M14</td>
<td align="center" valign="top">c.1574C[8]</td>
<td align="center" valign="top">530</td>
</tr>
<tr>
<td align="left" valign="top">NS4985</td>
<td align="center" valign="top">M14</td>
<td align="center" valign="top">c.1574C[8]</td>
<td align="center" valign="top">530</td>
</tr>
<tr>
<td align="left" valign="top">NCTC8199</td>
<td align="center" valign="top">M14</td>
<td align="center" valign="top">c.1574C[6]</td>
<td align="center" valign="top">543</td>
</tr>
<tr>
<td align="left" valign="top">HSC5</td>
<td align="center" valign="top">M14</td>
<td align="center" valign="top">c.1574C[6]</td>
<td align="center" valign="top">543</td>
</tr>
<tr>
<td align="left" valign="top">N23ND</td>
<td align="center" valign="top">M23</td>
<td align="center" valign="top">c.1555C[5]/c.1573[9]</td>
<td align="center" valign="top">530</td>
</tr>
<tr>
<td align="left" valign="top">NCTC4001</td>
<td align="center" valign="top">M23</td>
<td align="center" valign="top">c.1555C[6]/c.1574C[6]</td>
<td align="center" valign="top">543</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec id="sec11" sec-type="results">
<title>Results</title>
<sec id="sec12">
<title>M protein production is frequently lost during genetic manipulation for generation of GAS gene deletion mutants</title>
<p>Reverse genetic analysis is a commonly used approach to define the function and phenotype of target genes. We use a two-step strategy to delete genes in-frame in GAS (<xref ref-type="bibr" rid="ref48">Zhu et al., 2009</xref>). A suicide plasmid is constructed to contain the upstream and downstream flanking fragments of target gene that are joined to each other. The first step involves the first homologous crossover between one flanking fragment in the suicide plasmid and GAS chromosome, yielding chloramphenicol-resistant merodiploid transformants. The second step passes one transformant in liquid THY medium or on THY agar plate without chloramphenicol selection for more than 8 times to allow the second crossover between the other flanking fragment of the integrated suicide plasmid and its homologous sequence of GAS chromosome, resulting in gene deletion mutants. During genetic manipulation or any process that entails bacterial DNA replication, spontaneous random mutations can occur. To rule out the presence of a potential spurious mutation grossly altering the expression of known major virulence factors, western blotting analysis and/or qRT-PCR analyses were performed to check M protein production and the expression of M protein gene, <italic>hasA</italic>, <italic>spyCEP</italic>, and <italic>sse</italic> genes of gene deletion mutants. We have previously shown that genetic manipulation of GAS to construct gene deletion mutants can independently of the gene being targeted lead to the spontaneous loss of <italic>emm</italic> gene expression and M protein production (<xref ref-type="bibr" rid="ref47">Zhou et al., 2013</xref>). To evaluate if loss of M protein production during such processes is a general phenomenon common to many GAS strains of M1 and other M types, we analyzed gene deletion mutants of M1 strains SF370, MGAS2221, MGAS5005, and 5,448 by the western blotting analysis and gene deletion mutants of M3 strain MGAS315. As shown in <xref rid="fig1" ref-type="fig">Figure 1</xref>, independent of the gene targeted for deletion, gene deletion mutants of M1 GAS strains MGAS2221, MGAS5005, and SF370 could lose M protein production. In total, 43 of 92 gene deletion mutants of M1 GAS strains MGAS5005, MGAS2221, 5448, and SF370 lost M protein production (<xref rid="SM1" ref-type="supplementary-material">Supplementary Table S1</xref>). As listed in <xref rid="SM1" ref-type="supplementary-material">Supplementary Table S1</xref>, 13&#x2009;M<sup>+</sup> and 11&#x2009;M<sup>-</sup> gene deletion mutants of them had relative mRNA levels of the <italic>emm</italic> gene that were obtained for our previous publications (<xref ref-type="bibr" rid="ref48">Zhu et al., 2009</xref>; <xref ref-type="bibr" rid="ref20">Liu et al., 2012</xref>; <xref ref-type="bibr" rid="ref17">Li et al., 2013</xref>; <xref ref-type="bibr" rid="ref47">Zhou et al., 2013</xref>; <xref ref-type="bibr" rid="ref18">Liu et al., 2015</xref>; <xref ref-type="bibr" rid="ref38">Stetzner et al., 2015</xref>; <xref ref-type="bibr" rid="ref9">Feng et al., 2017</xref>). The <italic>emm</italic> mRNA levels in 9 of the 10&#x2009;M<sup>-</sup> gene deletion mutants were 1 to 5.5% of those in their parent strains whereas 1&#x2009;M<sup>&#x2212;</sup>, GAS1778 (5448&#x0394;sda1), and all the 11&#x2009;M<sup>+</sup> deletion mutants has similar levels of <italic>emm</italic> mRNA with those in their parent strain. Two of 9 gene deletion mutants of M3 strains MGAS315, GAS1191 and GAS1028, down-regulated <italic>emm3</italic> transcription by about 99%, and western blotting analysis was not performed for the M3 gene deletion mutants because anti-M3 protein antibody was not available (<xref rid="SM1" ref-type="supplementary-material">Supplementary Table S1</xref>).</p>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption><p>Anti-M protein western blots show GAS variants with loss of M protein production. <bold>(A)</bold> Anti-M protein blot of MGAS2221, MGAS5005 and SF370 and their gene deletion mutants. Lanes: 1, MGAS2221; 2, GAS1213 (MGAS2221&#x0394;<italic>spyCEP</italic>); 3, GAS1473 (MGAS2221&#x0394;sclA); 4, GAS1498 (MGAS2221&#x0394;<italic>spy1118</italic>); 5, MGAS5005; 6, GAS1133 (MGAS5005&#x0394;<italic>sse</italic>); 7, SF370; 8, GAS827 (SF370&#x0394;<italic>covS</italic>); and 9, GAS1458 (SF370&#x0394;<italic>sclA</italic>). <bold>(B)</bold> Anti-M protein blot of MGAS2221 (lane 1) and its derivative isolates from <italic>in vitro</italic> culture after 15 passages (lanes 2 to 14).</p></caption>
<graphic xlink:href="fmicb-14-1212149-g001.tif"/>
</fig>
<p>The second crossover to achieve gene deletion occurred during growing first-crossover tranconjugants in THY medium or on THY agar plates. The 44&#x2009;M<sup>-</sup> gene deletion mutants were all among 70 mutants that were obtained by using THY medium for the second crossover whereas all 31 gene deletion mutants that were obtained by using THY agar were all positive in M protein production (<xref rid="SM1" ref-type="supplementary-material">Supplementary Table S1</xref>). These data suggest that M<sup>-</sup> variants have an advantage over M<sup>+</sup> GAS to survive in THY.</p>
</sec>
<sec id="sec13">
<title>M<sup>-</sup> variants of MGAS2221 arise during culturing</title>
<p>To determine whether M<sup>-</sup> variants of M1 GAS arise during culture <italic>ex-vivo</italic> in THY, strain MGAS2221 was passed in THY twice a day for 7&#x2009;days, randomly chosen colonies after the last passage were analyzed for M protein production using western blotting analysis. Fifty colonies from the MGAS2221 culture after 14 passes were tested for M protein production, and 30 of them had no detectable M protein by Western blotting analysis. <xref rid="fig1" ref-type="fig">Figure 1B</xref> shows a representative Western blot in the analysis. Thus, M<sup>&#x2212;</sup> M1 GAS variants arise during <italic>in vitro</italic> growth, a phenomenon similar to that described for M12 GAS strains (<xref ref-type="bibr" rid="ref35">Simpson and Cleary, 1987</xref>).</p>
</sec>
<sec id="sec14">
<title>Slipped-strand mispairing within the c.1571C[8] tract of <italic>mga</italic> causes loss of M protein in M1 GAS</title>
<p>Most of 10 M<sup>-</sup> variants with <italic>emm</italic> transcript data had dramatic downregulation in transcription of the <italic>emm</italic> gene (<xref rid="SM1" ref-type="supplementary-material">Supplementary Table S1</xref>). Since Mga is the primary regulator of <italic>emm</italic> expression any polymorphism in the <italic>mga</italic> gene or the promoter region of <italic>mga</italic> and/or <italic>emm</italic> that reduces the capacity of Mga to activate <italic>emm</italic> transcription could contribute to loss of the M protein expression. To look for such polymorphisms, a PCR amplicon containing the <italic>mga</italic> gene and its 379-bp upstream and 470-bp downstream sequences from one M<sup>&#x2212;</sup> variant and its parent strain MGAS2221 were sequenced by the Sanger DNA sequencing. A single C deletion was found in the polycytidine tract that starts at base 1,571 and has 8 repeats of C near the end of the <italic>mga</italic> gene of M1 GAS (<xref rid="fig2" ref-type="fig">Figure 2A</xref>). This polycytidine tract of wild-type <italic>mga</italic> is designated as c.1574C[8] in which 1,574 and 8 represent the starting base of the polycytidine tract and the number of the C repeats, respectively. The 1C deletion in the c.1571C[8] tract leads to the c.1571C[7] <italic>mga</italic> variant. The c.1571C[7] <italic>mga</italic> variant gene in M1 GAS has an altered reading frame of the <italic>mga</italic> gene, and the mutated <italic>mga</italic> gene is read through the intergenic region between the <italic>mga</italic> and <italic>emm</italic>1 genes and the <italic>emm</italic>1 gene, leading to a protein variant that contains the amino acid sequence of Mga, 62 amino acid residues encoded by the intergenic sequence, and M1 Protein. Transformation of the c.1571C[7] <italic>mga</italic> variant with plasmid pDCBB-<italic>mga</italic>, but not the plasmid vector control, restored the M protein production (<xref rid="fig2" ref-type="fig">Figure 2B</xref>).</p>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption><p>Slipped-strand mispairing within the c.1571C[8] polycytidine tract of <italic>mga</italic> turns off M protein production. <bold>(A)</bold> DNA sequencing chromatogram shows 8\u00B0C repeats of the polycytidine tract near the 3&#x2032; end of <italic>mga</italic> in MGAS2221 but 7\u00B0C repeats of the tract in its M<sup>-</sup> variant (GAS1285) that led to a Mga-M1 protein fusion protein. <bold>(B)</bold> In trans complementation of the M<sup>-</sup> variant with pDCBB-mga but not pDCBB (vector control) restored M protein production.</p></caption>
<graphic xlink:href="fmicb-14-1212149-g002.tif"/>
</fig>
<p>To determine whether the c.1571C[7] <italic>mga</italic> variant was prevalent among our M<sup>&#x2212;</sup> strains, a 820-bp DNA fragment covering 349-bp of 3&#x2032; part of <italic>mga</italic>, 184-bp <italic>mga</italic>/<italic>emm</italic> intergenic region, and 287-bp of the 5&#x2032; part of <italic>emm</italic> was sequenced for 37 gene deletion mutants with diminished M protein production. The results are presented in <xref rid="SM1" ref-type="supplementary-material">Supplementary Table S1</xref>. 37 of the 43&#x2009;M<sup>&#x2212;</sup> mutants of M1 strains MGAS2221, MGAS5005, 5,448, or SF370 had the c.1571C[7] <italic>mga</italic> variant whereas 6 mutants had the c.1574C[8] tract of <italic>mga</italic>. One of the 5&#x2009;M<sup>-</sup> mutants with the c.1571C[8] <italic>mga</italic>, GAS1099, had A-to-G mutation at base 74 upstream of <italic>emm</italic> that is referred as -74A&#x2009;&#x003E;&#x2009;G of <italic>emm</italic>. This mutation changed the-10 box TACAAT of the <italic>emm</italic> promoter to TGCAAT and had reduction of <italic>emm</italic> mRNA by 97.9%. The whole <italic>mga</italic> gene of the 5 remaining M<sup>-</sup> variants with c.1571C[8] of <italic>mga</italic> was sequenced. GAS874 had missense mutation of A to C at base 292, c.292A&#x2009;&#x003E;&#x2009;C, leading to the threonine-to-proline mutation at residue 98 of the Mga protein, Mga T98P, and had reduction of <italic>emm</italic> mRNA levels by 94.5%. The other 4&#x2009;M<sup>-</sup> variants, GAS1499, GAS1081, GAS1185, and GAS1778, had the wild-type Mga and were not further characterized. The <italic>mga</italic> gene of 10&#x2009;M<sup>-</sup> variants from <italic>in vitro</italic> MGAS2221 culture were also sequenced. Eight of them had c.1571C[7]; isolate 499 had <italic>mga</italic> c.866A&#x2009;&#x003E;&#x2009;C missense mutation that led to Mga T289P; and isolate 521 had wt c.1571C[8] <italic>mga</italic> (<xref rid="tab2" ref-type="table">Table 2</xref>). Thus, the conversion of <italic>wt</italic> c.1571C[8] <italic>mga</italic> into the c.1571C[7] variant was a common cause for M<sup>-</sup> variants arisen during genetic manipulation and simple <italic>in vitro</italic> culturing of M1 GAS strains.</p>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption><p>Polymorphism of <italic>mga</italic> in M protein-negative isolates of cultured MGAS2221.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Isolate No.</th>
<th align="center" valign="top"><italic>mga</italic> polycytidine tract</th>
<th align="center" valign="top">Other mutation</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="bottom">GAS1806</td>
<td align="center" valign="bottom">c.1571C[7]</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">499</td>
<td align="center" valign="top">c.1571C[8]</td>
<td align="center" valign="bottom"><italic>mga</italic> c.866A&#x2009;&#x003E;&#x2009;C Mga T289P</td>
</tr>
<tr>
<td align="left" valign="top">503</td>
<td align="center" valign="top">c.1571C[7]</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">506</td>
<td align="center" valign="top">c.1571C[7]</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">520</td>
<td align="center" valign="bottom">c.1571C[7]</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">521</td>
<td align="center" valign="bottom">c.1571C[8]</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">523</td>
<td align="center" valign="bottom">c.1571C[7]</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">525</td>
<td align="center" valign="bottom">c.1571C[7]</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">535</td>
<td align="center" valign="bottom">c.1571C[7]</td>
<td/>
</tr>
<tr>
<td align="left" valign="top">540</td>
<td align="center" valign="bottom">c.1571C[7]</td>
<td/>
</tr>
</tbody>
</table>
</table-wrap>
<p>The <italic>mga</italic> gene of M3 GAS strains also has the polycytidine tract starting at base 1,592 that is referred as c.1592C[8]. Two gene deletion mutants of <italic>emm</italic>3 GAS strain MGAS315, GAS1191 and GAS1028, had reduction of <italic>emm</italic> mRNA levels by 99% and 1C deletion in the c.1592C[8] tract of <italic>mga</italic> (<xref rid="SM1" ref-type="supplementary-material">Supplementary Table S1</xref>). This 1-C deletion in the c.1592C[8] tract did not lead to the Mga-M3 fusion protein but led to an Mga variant with 52 extra amino residues in comparison with the wild-type Mga protein of M3 GAS. Unlike the M1 c1571C[7] <italic>mga</italic> variant, the c.1592C[7] <italic>mga</italic> variant of M3 GAS does not encode Mga-M protein fusion. This difference is due to the deletion of G at base 148 of the intergenic sequence between the <italic>mga</italic> and <italic>emm</italic> genes in M3 GAS in comparison with M1 GAS (<xref rid="SM1" ref-type="supplementary-material">Supplementary Figure S1</xref>).</p>
</sec>
<sec id="sec15">
<title>M<sup>+</sup> isolates from M<sup>-</sup> strains with c.1571C[7] <italic>mga</italic> variant in mice</title>
<p>A question of interest is whether the conversion of c.1571C[8] <italic>mga</italic> to c.1571C[7] <italic>mga</italic> is reversible. Since the M protein contributes to protection of GAS against the host innate immune response, <italic>in vivo</italic> growth would provide an environment to allow the conversion of c.1571C[7] <italic>mga</italic> to c.1571C[8] <italic>mga</italic>. To test this idea, two mice were subcutaneously infected with GAS1285, an M<sup>&#x2212;</sup> MGAS2221 &#x0394;<italic>sagA</italic> mutant with the c.1571C[7] <italic>mga</italic> variant. GAS isolates were recovered from skin infection sites 4&#x2009;days after inoculation, and 13 GAS colonies randomly chosen from each of mouse 1 and 2 for detecting M protein production by western blotting analysis. Three and four isolates among the analyzed isolates from mouse 1 (<xref rid="fig3" ref-type="fig">Figure 3A</xref>) and mouse 2 (western blot not shown) restored M protein production. DNA sequencing found that all 7 M<sup>+</sup> isolates had c.1571C[8] <italic>mga</italic> (<xref rid="tab3" ref-type="table">Table 3</xref>) whereas one M<sup>&#x2212;</sup> isolate from the infection was sequenced and still had the c.1571C[7] <italic>mga</italic> variant.</p>
<fig position="float" id="fig3">
<label>Figure 3</label>
<caption><p>Selection of M<sup>+</sup> GAS isolates from M<sup>&#x2212;</sup> GAS variants in infection sites of mice. Mice were inoculated subcutaneously with M<sup>&#x2212;</sup> c.1574C[7] variants of MGAS2221 or MGAS2221 &#x0394;<italic>sagA</italic>, and GAS isolates from the infection site of the skin were tested for M protein production by Western blotting analyses. <bold>(A)</bold> Anti-M protein western blot of GAS isolates. Lanes: 1, MGAS2221; 2&#x2013;14, isolates from skin infection site of M<sup>&#x2212;</sup> MGAS2221 &#x0394;sagA. <bold>(B)</bold> The C-terminus Amino sequences starting from residue 501 for wild-type M1 Mga (c.1574C[8]) and its c.1574C[7] and c.1574C[6] variants. The sequences in red are different from that of the wild-type Mga, and [62AA] and [M protein] for c.1574C[7] variant represent amino acid sequence of the 66 amino acid residues encoded by the sequence between <italic>mga</italic> and <italic>emm</italic> and the sequence of the M protein, respectively. The numbers 529, 1,075 and 542 are the total number of amino acid residues in wild-type Mga and its c.1574C[7] and c.1574C[6] variants, respectively. <bold>(C)</bold> Relative emm mRNA levels of MGAS2221 (control), M<sup>&#x2212;</sup> GAS1806, and M<sup>+</sup> M497 and M406 and M<sup>&#x2212;</sup> M405 isolated from GAS1806 infection site in the skin of mice. The polymorphism of the <italic>mga</italic> polycytidine tract is indicated.</p></caption>
<graphic xlink:href="fmicb-14-1212149-g003.tif"/>
</fig>
<table-wrap position="float" id="tab3">
<label>Table 3</label>
<caption><p>Summary of selection of M<sup>+</sup> isolates from skin infection site of mice with M<sup>&#x2212;</sup> GAS variants.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Mouse</th>
<th align="center" valign="top">Infection strain</th>
<th align="center" valign="top">Number of M<sup>+</sup> isolates/total tested isolates from mouse</th>
<th align="center" valign="top">No. of M<sup>+</sup> isolates with c.1571C[8] <italic>mga</italic></th>
<th align="center" valign="top">No. of M<sup>+</sup> isolates with c.1571C[6] <italic>mga</italic></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">1</td>
<td align="center" valign="top"><xref rid="tfn1" ref-type="table-fn"><sup>a</sup></xref>GAS1285</td>
<td align="char" valign="top" char="/">3/13</td>
<td align="center" valign="top">3</td>
<td align="center" valign="top">0</td>
</tr>
<tr>
<td align="left" valign="top">2</td>
<td align="center" valign="top">GAS1285</td>
<td align="char" valign="top" char="/">4/13</td>
<td align="center" valign="top">4</td>
<td align="center" valign="top">0</td>
</tr>
<tr>
<td align="left" valign="top">3</td>
<td align="center" valign="top"><xref rid="tfn2" ref-type="table-fn"><sup>b</sup></xref>GAS1806</td>
<td align="char" valign="top" char="/">4/8</td>
<td align="center" valign="top">3</td>
<td align="center" valign="top">1</td>
</tr>
<tr>
<td align="left" valign="top">4</td>
<td align="center" valign="top">GAS1806</td>
<td align="char" valign="top" char="/">1/8</td>
<td align="center" valign="top">1</td>
<td align="center" valign="top">0</td>
</tr>
<tr>
<td align="left" valign="top">5</td>
<td align="center" valign="top">GAS1806</td>
<td align="char" valign="top" char="/">2/8</td>
<td align="center" valign="top">0</td>
<td align="center" valign="top">2</td>
</tr>
<tr>
<td align="left" valign="top">6</td>
<td align="center" valign="top">GAS1806</td>
<td align="char" valign="top" char="/">0/10</td>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">7</td>
<td align="center" valign="top">GAS1806</td>
<td align="char" valign="top" char="/">0/10</td>
<td/>
<td/>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn1"><label>a</label><p>GAS1285 was a M<sup>-</sup> variant of MGAS2221&#x0394;sagA with c.1574C[7] <italic>mga</italic> variant.</p></fn>
<fn id="tfn2"><label>b</label><p>GAS1806 was a M<sup>-</sup> variant of MGAS2221 with c.1574C[7] <italic>mga</italic> variant.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>We also tested GAS1806, an M<sup>&#x2212;</sup> c.1571C[7] variant of MGAS2221, in subcutaneous infection of 5 mice that are numbered from 3 to 7 in <xref rid="tab3" ref-type="table">Table 3</xref>. Four of 20 isolates from mouse 3, 1 of 8 isolates from mouse 4, and 2 of 8 isolates from mouse 5 restored M protein production whereas 10 isolates from mouse 6 or mouse 7 did not have detectable M protein production. Three of the 4&#x2009;M<sup>+</sup> isolates from mouse 3 restored the c.1571C[8] tract of <italic>mga</italic>, and the other M<sup>+</sup> revertant had an additional C deletion in the c.1571C[7] tract, resulting in 6 C at the polycytidine tract of <italic>mga</italic> that is designated as c.1571C[6]. The M<sup>+</sup> isolate in mouse 4 restored the c.1571C[8] tract of <italic>mga</italic> whereas both M<sup>+</sup> isolates in mouse 5 had the c.1571C[6] <italic>mga</italic> variant (<xref rid="tab3" ref-type="table">Table 3</xref>). The additional 1C deletion of the c.1571C[7] tract of <italic>mga</italic> in the M<sup>&#x2212;</sup> variant shifted the reading frame, converting the Mga-62 AA-M1 protein fusion protein into an Mga variant of 542 aa residues, which was just 13 amino acid residues longer than the wild-type Mga protein of M1 GAS (<xref rid="fig3" ref-type="fig">Figure 3B</xref>). This c.1571C[6] <italic>mga</italic> variant apparently restored Mga function and thus the M protein production.</p>
<p>To test whether the c.1571C[7] and c.1571C[6] <italic>mga</italic> variants lost and regained function, <italic>emm</italic> mRNA levels were measured by real-time RT PCR for MGAS2221 (wt control), GAS1806 (c.1571C[7] <italic>mga</italic> variant of MGAS2221), and isolates M497, M405, and M406 from mouse infection with GAS1806. M497, M405, and M406 had c.1571C[6] (M497), c.1571C[7], and c.1571C[8] <italic>mga</italic> variants. As shown in <xref rid="fig3" ref-type="fig">Figure 3C</xref>, GAS1806 and M405 lost <italic>emm</italic> transcription by 98% whereas M497 and M406 restored <italic>emm</italic> transcription to 90 and 100% of that in MGAS2221. Thus, the c.1571C[7] <italic>mga</italic> variant is nonfunctional and can be reversed into functional c.1571C[8] <italic>mga</italic> by gaining 1\u00B0C or converted into functional c.1571C[6] <italic>mga</italic> variant by losing another C during mouse infection. The results indicate that the polycytidine tract at the 3&#x2032; end of the <italic>mga</italic> gene in M1 GAS can switch between c.1571C[8] or c.1574C[6] and c.1574C[7] to turn on and off M protein production, respectively.</p>
</sec>
<sec id="sec16">
<title>Polymorphisms of the polycytidine tract of <italic>mga</italic>-1 in sequenced GAS strains in genome databases</title>
<p>If the polycytidine tract-based switch of <italic>mga</italic> for phase variation in M protein production functions during infection, there should be polymorphism at the polycytidine tract of the <italic>mga</italic> gene in clinical isolates. M12 GAS frequently shows M protein phase variation <italic>in vitro</italic> (<xref ref-type="bibr" rid="ref35">Simpson and Cleary, 1987</xref>). So we first examined the <italic>mga</italic> sequences of M12 GAS genomes in the NCBI genome database. The <italic>mga</italic> gene and protein sequences of 106 available M12 GAS genomes were aligned. As shown in <xref rid="fig4" ref-type="fig">Figure 4A</xref>, there are four different types of polymorphisms related to the number of the C repeats starting at base 1574 and polymorphism at base 1657: (1) functional c.1574C[8] <italic>mga</italic> and base G at position 1657 (1657G) (wt), (2) nonfunctional c.1574C[7] <italic>mga</italic> and 1657G (Variant 1), (3) functional c.1574C[6] <italic>mga</italic> and 1657G (Variant 2), and (4) functional <italic>mga</italic> variant with c.1574C[7] and 1657A (Variant 3). Wild-type Mga of M12 GAS has 530 amino acid residues. The nonfunctional c.1574C[7]<italic>/</italic>1657G Variant 1 encodes a nonfunctional Mga-M protein fusion protein with a 62-aa bridge encoded by the intergenic sequence between the <italic>mga</italic> and <italic>emm</italic> genes, the functional c.1574C[6]/1657G variant has 543 amino acid residues and is similar with the c.1571C[6] <italic>mga</italic> variants of MGAS2221 that were from the c.1574C[7] <italic>mga</italic> variant of MGAS2221 in mouse infection; and the G-to-A nonsense mutation at base 1657 of the c.1574C[7] <italic>mga</italic> variant converts Variant 1 into c.1574C[7]/1657A Mga variant that is 21 aa-residue longer than the wt Mga protein (<xref rid="fig4" ref-type="fig">Figure 4B</xref>) (<xref rid="tab4" ref-type="table">Table 4</xref>). Among 106&#x2009;M12 genomes in the NCBI genome database, there are 25 <italic>wt mga</italic>; 3 Variant 1, 38 Variant 2; and 40 Variant 3. These data are summarized in <xref rid="tab4" ref-type="table">Table 4</xref>, and the <italic>mga</italic> polymorphisms at the polycytidine tract and base 1657 of 106&#x2009;M12 genomes are listed in <xref rid="SM1" ref-type="supplementary-material">Supplementary Table S2</xref>. It is interesting that two of the three strains with nonfunctional c.1574C[7]/1657G <italic>mga</italic> variant 1, ATCC 11434 (<xref ref-type="bibr" rid="ref41">Tanaka et al., 2020</xref>) and MGAS2096 (<xref ref-type="bibr" rid="ref33">Sarkar et al., 2018</xref>), were isolated from patients with acute poststreptococcal glomerulonephritis, and the infection nature of the third strain, NCTC8300, is not known. The data indicate that the length polymorphism of Mga is caused through arising the nonfunctional c.1574C[7]/1657G <italic>mga</italic> variant and restoring the function of the nonfunctional c.1574C[7]/1657G <italic>mga</italic> variant by the additional C deletion at the c.1574C[7] tract or the G-to-A mutation at base 1657. These <italic>mga</italic> polymorphisms among clinical M12 isolates also indicate that M12 GAS undergoes phase variation in M protein production through the slipped-strand mispairing of the <italic>mga</italic> polycytidine tract.</p>
<fig position="float" id="fig4">
<label>Figure 4</label>
<caption><p>Polymorphism at the c.1574C[8] tract and base 1,657 of mga of sequenced M12 GAS genomes. <bold>(A)</bold> For different polymorphisms of the c.1574C[8] and base 1,657 in M12 <italic>mga</italic> genes. <bold>(B)</bold> The difference in amino acid sequence of Mga caused by the polymorphisms of mga in panel A. The sequences in red are those that are different from the wild-type Mga.</p></caption>
<graphic xlink:href="fmicb-14-1212149-g004.tif"/>
</fig>
<table-wrap position="float" id="tab4">
<label>Table 4</label>
<caption><p>Polymorphism at the polycytidine tract and base 1,657 and Functionality of M1 and M12 <italic>mga</italic> among GAS genomes in the GenBank database.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top" colspan="3">M protein type genotype</th>
<th align="center" valign="top">M1</th>
<th align="center" valign="top">M12</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="12">Polymorphism of <italic>mga</italic> at the polycytidine and base 1,654 in M1 and base 1,657 in M12</td>
<td align="left" valign="top" rowspan="3"><xref rid="tfn3" ref-type="table-fn"><sup>a</sup></xref>c.1571C[8]/1654G or c.1574C[8]/1657G</td>
<td align="left" valign="top">Frequency</td>
<td align="center" valign="top">150/158</td>
<td align="center" valign="top">25/106</td>
</tr>
<tr>
<td align="left" valign="top">AA No. of Mga</td>
<td align="center" valign="top">529</td>
<td align="center" valign="top">530</td>
</tr>
<tr>
<td align="left" valign="top">Functionality</td>
<td align="center" valign="top">Yes</td>
<td align="center" valign="top">Yes</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">c.1571C[7]/1654G or c.1574C[7]/1657G</td>
<td align="left" valign="top">Frequency</td>
<td align="center" valign="top">3/158</td>
<td align="center" valign="top">3/106</td>
</tr>
<tr>
<td align="left" valign="top">AA No. of MGA</td>
<td align="center" valign="top">1,075</td>
<td align="center" valign="top">1,157</td>
</tr>
<tr>
<td align="left" valign="top">Functionality</td>
<td align="center" valign="top">No</td>
<td align="center" valign="top">No</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">c.1571C[6]/1654G or c.1574C[6]/1657G</td>
<td align="left" valign="top">Frequency</td>
<td align="center" valign="top">5/158</td>
<td align="center" valign="top">38/106</td>
</tr>
<tr>
<td align="left" valign="top">AA No. of Mga</td>
<td align="center" valign="top">542</td>
<td align="center" valign="top">543</td>
</tr>
<tr>
<td align="left" valign="top">Functionality</td>
<td align="center" valign="top">Yes</td>
<td align="center" valign="top">Yes</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">c.1571C[7]/1654A or c.1574C[7]/1657A</td>
<td align="left" valign="top">Frequency</td>
<td align="center" valign="top">0/158</td>
<td align="center" valign="top">40/106</td>
</tr>
<tr>
<td align="left" valign="top">AA No of Mga</td>
<td/>
<td align="center" valign="top">543</td>
</tr>
<tr>
<td align="left" valign="top">Functionality</td>
<td/>
<td align="center" valign="top">Yes</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="tfn3"><label>a</label><p>The polycytidine tract with polymorphism starts at base 1,571 and 1,574 in M1 and M12 <italic>mga</italic>, respectively.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>Among 158 M1 GAS strains (<xref rid="SM1" ref-type="supplementary-material">Supplementary Table S3</xref>), 5 M1 GAS strains, FDAARGOS, CCUG-4207, CCUG-4207-W1, NCTC8198, and AP1, had the functional c.1571C[6] <italic>mga</italic> variant, and three M1 strains, CCUG-47803, SPY8157, and GA41345, had the nonfunctional c.1571C[7] <italic>mga</italic> variant (<xref rid="tab4" ref-type="table">Table 4</xref> and <xref rid="SM1" ref-type="supplementary-material">Supplementary Table S3</xref>). Other M1 strains in the NCBI genome database have the wt c.1571C[8] <italic>mga</italic>. The frequency of the c.1571C[6] <italic>mga</italic> variant among the sequenced M1 GAS isolates, 5/158, is less than that among the sequenced M12 GAS isolates, 38/106 (<xref rid="tab4" ref-type="table">Table 4</xref>). Four of 6&#x2009;M14 GAS strains in the GenBank database have the wt c.1574C[8] <italic>mga</italic> encoding Mga with 530 amino acid residues, and two other strains have the c.1574C[6] <italic>mga</italic> variant for Mga with 543 amino acid residues (<xref rid="tab1" ref-type="table">Table 1</xref>). One of two M23 strain has the c.1574C[6] <italic>mga</italic> variant for Mga with 543 amino acid residue, and another has 1C addition at the c.1574C[8] tract and 1C deletion at the c.1555C[6] tract, leading to c.1555C[5]/c.1573C[9] <italic>mga</italic> variant that encode Mga with 530 amino acid residues (<xref rid="tab1" ref-type="table">Table 1</xref>). The c.1574C[6] <italic>mga</italic> variant in clinical M1, M14, and M23 GAS isolates indicates that the nonfunctional c1574C[7] <italic>mga</italic> variants of M1, M14, and M23 GAS arise during infection.</p>
<p>M3 GAS strains in the GenBank database have four polymorphisms of M3 <italic>mga</italic> that are related to the polycytidine tract near the 3&#x2032; end of <italic>mga</italic> (<xref rid="SM1" ref-type="supplementary-material">Supplementary Table S4</xref>). 106 strains have the wt c.1592C[8] <italic>mga</italic>; 5 strains have a missense mutation at base 1599 of the c.1592C[8] tract from C to T or c.1598C&#x2009;&#x003E;&#x2009;T; 1 strain has an addition of 1C to the tract to result in c.1592C[9] <italic>mga</italic> variant; and two strains, A842 and A843, had 1C deletion at the c.1592C[8] tract and 1C addition at the c.1573C[6] tract to result in c.1592C[7]/c.1573[7] <italic>mga</italic> variant. It is unlikely that the two events in strains A842 and A843 occurred at the same time. These polymorphisms of M3 <italic>mga</italic> indicates the slipped-strand mispairing within the c.1592C[8] tract of M3 <italic>mga</italic> also occurs in hosts.</p>
</sec>
<sec id="sec17">
<title>Lack of polymorphism in Mga length for serotypes of GAS that do not have the <italic>mga</italic> polycytidine tract</title>
<p>The M49 <italic>mga</italic> gene belongs to the <italic>mga</italic>-2 cluster that does not have the polycytidine tract. Except for a three amino acid residue insertion near the C terminus of Mga in M49 strain K36294, all the Mga sequences of the other 29&#x2009;M49 genomes do not show length polymorphism. The Mga sequences of the other <italic>mga</italic>-2, such as M89 <italic>mga</italic>, do not have the polycytidine tract, which is shown in the sequence alignment of <italic>mga</italic> of M1 strain MGAS5005 and M89 strain KUN-0012590 (<xref ref-type="bibr" rid="ref24">Murase et al., 2020</xref>) (<xref rid="SM1" ref-type="supplementary-material">Supplementary Figure S2</xref>). M89 strains do not have polymorphism in the length of the Mga protein. These results indicate that the polycytidine tract of <italic>mga</italic>-1, but not <italic>mga</italic>-2, causes polymorphism in the length of Mga proteins.</p>
</sec>
</sec>
<sec id="sec18" sec-type="discussions">
<title>Discussion</title>
<p>One of our findings is that the slipped-strand mispairing within the c.1571C[8] tract of the <italic>mga</italic> gene of M1 GAS mediates the phase variation of the M protein. <italic>In vitro</italic> genetic manipulation or culturing of M1 GAS strains frequently led to variants with the loss of M protein production. The majority of M<sup>-</sup> variants had a single C base deletion in the c.1574C[8] tract of <italic>mga</italic>, leading to an open reading frame shift that combine the <italic>mga</italic> and downstream <italic>emm</italic>1 gene to code for a predicted Mga-M protein fusion. Complementation in trans of the c.1571C[7] <italic>mga</italic> variant with an <italic>mga</italic>-containing plasmid restored the production of the M protein. Isolates with restored M protein production were recovered at skin infection sites in mice with the c.1571C[7] <italic>mga</italic> variant, and the majority of these M<sup>+</sup> revertants restored the c1571C[8] tract of <italic>mga</italic>. In addition, some M<sup>+</sup> revertants lost another C, leading to a c.1571C[6] <italic>mga</italic> variant that codes for a Mga with additional 13 amino acid residues at the C-terminus compared with the wt Mga. Transcriptional data of <italic>emm</italic> indicates that the c.1571C[7] <italic>mga</italic> variant is nonfunctional whereas the c.1571C[6] <italic>mga</italic> variant is functional as an activator of <italic>emm</italic> transcription. In addition to arising of the c.1571C[7] <italic>mga</italic> variant, M protein production could be lost by the Mga T98P and T289P missense mutations, A-to-G mutation in the-10 box of the <italic>emm</italic> promoter, and unknown mutation(s) that affects post-transcriptional translation. The M protein of M12 GAS has been shown to have phase variation <italic>in vitro</italic> (<xref ref-type="bibr" rid="ref35">Simpson and Cleary, 1987</xref>). Approximately 50-base pair deletions within or adjacent to the M protein coding sequence had been detected in two M<sup>-</sup> variants but the detail of the deletions and whether the deletions were responsible for the loss of the M protein production were not known (<xref ref-type="bibr" rid="ref36">Spanier et al., 1984</xref>). Our findings reveal that the slipped-strand mispairing within the c.1574C[8] tract in the <italic>mga</italic> gene of cluster 1 is one mechanism to reversibly turn the M protein production on and off (<xref rid="fig5" ref-type="fig">Figure 5A</xref>).</p>
<fig position="float" id="fig5">
<label>Figure 5</label>
<caption><p>The cytidine repeat of <italic>mga</italic> as a switch for M protein production and the role of the switch in the circulation among human hosts. <bold>(A)</bold> Slipped-strand mispairing of the polycytidine tract of mga functions as a switch for M protein production. The c.1574C[8] and c.1574C[6] repeats of <italic>mga</italic> lead to functional Mga, which turns on M protein expression whereas c.1574C[7] leads to nonfunctional Mga, which turns off M protein expression. <bold>(B)</bold> Proposed role of the switch of M protein production in GAS circulation among human hosts. It is proposed that humoral immunity in patients turns off the switch, resulting in M<sup>-</sup> variants that are transmitted to na&#x00EF;ve host where innate immunity turns on the switch to produce M protein.</p></caption>
<graphic xlink:href="fmicb-14-1212149-g005.tif"/>
</fig>
<p>The M protein and several other virulence factors regulated by Mga are among genes that have the highest expressions according to transcriptional profiling with RNAseq analysis (<xref ref-type="bibr" rid="ref14">Horstmann et al., 2018</xref>). The M protein, ScpA, and SclA are not essential for M1 GAS viability since their genes can be deleted (<xref ref-type="bibr" rid="ref17">Li et al., 2013</xref>; <xref ref-type="bibr" rid="ref18">Liu et al., 2015</xref>; <xref rid="SM1" ref-type="supplementary-material">Supplementary Table S1</xref>), and <italic>emm</italic> deletion mutant can survive in skin infection site of mice for several days (<xref ref-type="bibr" rid="ref18">Liu et al., 2015</xref>). One possible reason for the arising of M<sup>-</sup> variants in serial passage in THY is a metabolic burden driven evolutionary adaptation. In this case, the production of M protein constitutes a substantial anabolic burden that reduces fitness relative to strains that do not produce the M protein, and M<sup>&#x2212;</sup> mutants likely overgrow M protein-producing strains. We will examine this possibility in future. Loss of M protein production in construction of GAS gene deletion in our method using THY for passing significantly caused problems in our effort to generate gene deletion mutants without loss of M protein production in our previous studies. Fortunately, passing GAS with the first crossover on THY agar prevented the arising of M<sup>-</sup> variants (<xref ref-type="bibr" rid="ref47">Zhou et al., 2013</xref>; <xref rid="SM1" ref-type="supplementary-material">Supplementary Table S1</xref>). These results suggest that M<sup>+</sup> GAS bacteria under limited nutrient condition may have defects in the envelope structure that compromise GAS survival in liquid. M<sup>-</sup> variants may devote more nutrient to the synthesis of the cell wall to survive better in liquid under limited nutrient conditions. Passing on THY agar is an important improvement in the two step gene deletion approach to generate gene deletion mutants without loss of M protein production. Allelic exchange approach may have higher chance to obtain mutants with normal M protein production but introduces antibiotic selection marker. The <italic>sagA</italic> gene was shown to downregulate <italic>emm</italic> (<xref ref-type="bibr" rid="ref16">Li et al., 2019</xref>). The <italic>sda</italic>1 gene was proposed to be selection pressure for <italic>covRS</italic> mutants (<xref ref-type="bibr" rid="ref46">Walker et al., 2007</xref>). However, these results could not repeated, and loss of M protein production might be responsible for these observations (<xref ref-type="bibr" rid="ref47">Zhou et al., 2013</xref>; <xref ref-type="bibr" rid="ref18">Liu et al., 2015</xref>). No matter what approach is used to generate gene deletion mutants of GAS with <italic>mga</italic>-1, it is important to check <italic>emm</italic> transcription and M protein production of gene deletion mutants.</p>
<p>The M protein is required for resistance to phagocytosis by neutrophils (<xref ref-type="bibr" rid="ref29">Perez-Casal et al., 1992</xref>). The M protein is important if not essential for growth <italic>in vivo</italic> in order to protect against the immune response, evolution has landed upon a reversible switch, so that when a mutant that has lost M protein production is back in an <italic>in vivo</italic> environment a portion of them will revert to producing the M protein helping to ensure the survival of the species. This explains why lack of M production enhances fitness <italic>ex vivo</italic>, but M protein production enhances fitness <italic>in vivo</italic>.</p>
<p>The polymorphism in the length of Mga among clinical GAS isolates indicates that the M protein phase variation through the slipped-strand mispairing within the c.1574C[8] tract occurs <italic>in vivo</italic>. Several clinical isolates of M1 and M12 GAS had the nonfunctional c.1574C[7] <italic>mga</italic> variant. It is possible that the c1574C[7] variant in these isolates might be acquired due to <italic>in vitro</italic> processing. However, the presence of the functional c.1574C[6] and c.1574C[7]/1657A <italic>mga</italic> variants indicate that the c.1574C[7] variant occurs <italic>in vivo</italic>. It is interesting that the 1657A allele is absent in the c.1574C[8] and c.1574C[6] <italic>mga</italic> variants. 37.7% of 106&#x2009;M12 genomes in the GenBank database contain the c.1574C[7]/1657A <italic>mga</italic> variant. The majority of M12 strains with this <italic>mga</italic> variant should transcribe <italic>emm</italic> and produce M protein. Thus, it is reasonable to assume that the c.1574C[7]/1657A <italic>mga</italic> variant of M12 is functional. If this is true, the c.1657G&#x2009;&#x003E;&#x2009;A nonsense mutation is somehow selected for functional <italic>mga</italic> variants from the nonfunctional c.1574C[7] <italic>mga</italic> variant. The M protein is required for resistance to phagocytosis, which is likely one of selection pressures for functional Mga from c.1574C[7] <italic>mga</italic> variant. Apparently, there is no selection pressure to select the c.1574G&#x2009;&#x003E;&#x2009;A mutation in c.1574C[8] and c.1574C[6] because the c.1574G&#x2009;&#x003E;&#x2009;A mutation does not alter the functionality of the functional <italic>mga</italic> variants.</p>
<p>The C-terminal region of Mga is important for oligomerization of Mga and transcriptional activation (<xref ref-type="bibr" rid="ref13">Hondorp et al., 2012</xref>). Fusion of an extra 546 amino acid residues to the C-terminus of Mga in the c.1571C[7] <italic>mga</italic> variant of M1 and M12 GAS is expected to interfere with Mga oligomerization and transcriptional activation. The M12 c.1574C[7]/1657A <italic>mga</italic> variant encodes the Mga variant of 551 amino acid residues, 20 amino acid residues longer than the wt M12 <italic>mga</italic>, suggesting that addition of short peptides to the C-terminus of Mga does not affect the function of Mga dramatically.</p>
<p>It is not known why the nonfunctional c.1574C[7] <italic>mga</italic> variant arises and functional c.1574C[7]/1657A and c.1574C[6] <italic>mga</italic> variants are selected in host. We speculate that the acquired immunity may play a role in this process. Anti-M protein antibodies enhance GAS killing by neutrophils and may confer an advantage to M<sup>-</sup> variants. We do not have experimental evidence for this possibility. It is also possible that the c.1574C[7] <italic>mga</italic> variant arises in a nutrient-limited niche in the host. If the humoral immunity indeed plays a role for the c.1574C[7] <italic>mga</italic> variant, the selection of the functional c.1574C[8], c.1574C[6] and c.1574C[7]/1657A <italic>mga</italic> variants from c.1574C[7] <italic>mga</italic> variant would not occur in the same host in which the c.1574C[7] variant is. It is more likely that the c.1574C[7] <italic>mga</italic> variant is transmitted into na&#x00EF;ve hosts where the functional <italic>mga</italic> variants arise in response to the innate immune responses like in the mouse infection. It is proposed that the reversible switch of the M protein phase variation through the slipped-strand mispairing within the c.1574C[8] tract of the <italic>mga</italic> gene plays a role in the continued circulation of GAS among human hosts (<xref rid="fig5" ref-type="fig">Figure 5B</xref>). In this proposal, GAS with the c.1574C[8] <italic>mga</italic> is transmitted from patients to na&#x00EF;ve hosts to cause acute infections, and the humoral immunity then selects the c.1574C[7] <italic>mga</italic> variant; and the c.1574C[7] <italic>mga</italic> variant is transmitted to new na&#x00EF;ve hosts and converted into c.1574C[8] to cause infection. It would be interesting to test this proposal in future work.</p>
<p>Slipped-strand mispairing within a repetitive sequence is commonly used for adaption of bacteria to particular environment or niche. For Group A Streptococcus, slipped-strand mispairing in AACAA repeats in coding region controls production of streptococcal collagen-like protein B (<xref ref-type="bibr" rid="ref21">Lukomski et al., 2001</xref>; <xref ref-type="bibr" rid="ref30">Rasmussen and Bj&#x00F6;rck, 2001</xref>). The well-characterized hypervirulent M1T1 GAS strain MGAS5005 had a T deletion in the 7&#x2009;T tract starting at base 77 of the <italic>covS</italic> gene (<xref ref-type="bibr" rid="ref17">Li et al., 2013</xref>), and the T deletion variants of <italic>covS</italic> were found at GAS infection site of mice (<xref ref-type="bibr" rid="ref8">Engleberg et al., 2001</xref>). Slipped-strand mispairing within repetitive sequences appears to be a common mechanism for selecting variants that have advantage for survival under particular stress. Our findings represent an unusual mechanism that the slipped-strand mispairing in <italic>mga</italic> occurs near the end of the <italic>mga</italic> gene. This arrangement apparently can interfere with the role of the C-terminus of Mga in oligomerization and transcriptional activation but at the same time confers reversibility of the process.</p>
</sec>
<sec id="sec19" sec-type="conclusions">
<title>Conclusion</title>
<p><italic>In vitro</italic> manipulation of GAS frequently leads to the loss of M protein production. Deletion of base C within the polycytidine tract c.1571C[8] of <italic>mga</italic> result in the nonfunctional c.1571C[7] <italic>mga</italic> variant, turning off M protein production. M<sup>+</sup> isolates were obtained from infection sites with M<sup>&#x2212;</sup> GAS variants in mice and either restored the wild-type c.1574C[8] tract of <italic>mga</italic> or had a functional c.1574C[6] <italic>mga</italic> variant due to an additional C deletion at the c.1574C[7] tract. It is concluded that slipped-strand mispairing within the c.1571C[8] tract of M1 <italic>mga</italic> functions as a switch for the M protein phase variation. The nonfunctional c.1574C[7] and functional c.1574C[6] variants are present in M1, M12, M14, and M23 strains in the GenBank databases, and a G-to-A nonsense mutation at base 1,657 of c.1574C[7] <italic>mga</italic> leads to a functional c.1574C[7]/1657A <italic>mga</italic> variant and is common in clinical M12 isolates. These polymorphisms of the polycytidine tract and base 1,657 of <italic>mga</italic> indicate that M protein phase variation through the slipped-strand mispairing within the c.1574C[8] tract occurs <italic>in vivo</italic>.</p>
</sec>
<sec id="sec20" sec-type="data-availability">
<title>Data availability statement</title>
<p>Publicly available datasets were analyzed in this study. This data can be found at: Genomes of Group A Streptococcus in NCBI GenBank database were used in analyses.</p>
</sec>
<sec id="sec21">
<title>Ethics statement</title>
<p>The animal study was reviewed and approved by Montana State University, Bozeman IACUC.</p>
</sec>
<sec id="sec22">
<title>Author contributions</title>
<p>Experiments were designed by BL and performed by TH, ML, and BL. Bioinformatic analyses were done by YB. The manuscript was written by BL and TH. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="sec23" sec-type="funding-information">
<title>Funding</title>
<p>This work was supported in part by grant AI153755 from the National Institutes of Health and the Montana State Agricultural Experimental Station.</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was performed in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="sec100" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<sec id="sec25" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary material for this article can be found online at: <ext-link xlink:href="https://www.frontiersin.org/articles/10.3389/fmicb.2023.1212149/full#supplementary-material" ext-link-type="uri">https://www.frontiersin.org/articles/10.3389/fmicb.2023.1212149/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.docx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="ref1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beres</surname> <given-names>S. B.</given-names></name> <name><surname>Sylva</surname> <given-names>G. L.</given-names></name> <name><surname>Barbian</surname> <given-names>K. D.</given-names></name> <name><surname>Lei</surname> <given-names>B.</given-names></name> <name><surname>Hoff</surname> <given-names>J. S.</given-names></name> <name><surname>Mammarella</surname> <given-names>N. D.</given-names></name> <etal/></person-group>. (<year>2002</year>). <article-title>Genome sequence of a serotype M3 strain of group a Streptococcus: phage-encoded toxins, the high-virulence phenotype, and clone emergence</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>99</volume>, <fpage>10078</fpage>&#x2013;<lpage>10083</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.152298499</pub-id>, PMID: <pub-id pub-id-type="pmid">12122206</pub-id></citation></ref>
<ref id="ref2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bessen</surname> <given-names>D. E.</given-names></name> <name><surname>Manoharan</surname> <given-names>A.</given-names></name> <name><surname>Luo</surname> <given-names>F.</given-names></name> <name><surname>Wertz</surname> <given-names>J. E.</given-names></name> <name><surname>Robinson</surname> <given-names>D. A.</given-names></name></person-group> (<year>2005</year>). <article-title>Evolution of transcription regulatory genes is linked to niche specialization in the bacterial pathogen <italic>Streptococcus pyogenes</italic></article-title>. <source>J. Bacteriol.</source> <volume>187</volume>, <fpage>4163</fpage>&#x2013;<lpage>4172</lpage>. doi: <pub-id pub-id-type="doi">10.1128/jb.187.12.4163-4172.2005</pub-id>, PMID: <pub-id pub-id-type="pmid">15937178</pub-id></citation></ref>
<ref id="ref3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Camacho</surname> <given-names>C.</given-names></name> <name><surname>Coulouris</surname> <given-names>G.</given-names></name> <name><surname>Avagyan</surname> <given-names>V.</given-names></name> <name><surname>Ma</surname> <given-names>N.</given-names></name> <name><surname>Papadopoulos</surname> <given-names>J.</given-names></name> <name><surname>Bealer</surname> <given-names>K.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>BLAST+: architecture and applications</article-title>. <source>BMC Bioinformatics</source> <volume>10</volume>:<fpage>421</fpage>. doi: <pub-id pub-id-type="doi">10.1186/1471-2105-10-421</pub-id>, PMID: <pub-id pub-id-type="pmid">20003500</pub-id></citation></ref>
<ref id="ref4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caparon</surname> <given-names>M. G.</given-names></name> <name><surname>Scott</surname> <given-names>J. R.</given-names></name></person-group> (<year>1987</year>). <article-title>Identification of a gene that regulates expression of M protein, the major virulence determinant of group a streptococci</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>84</volume>, <fpage>8677</fpage>&#x2013;<lpage>8681</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.84.23.8677</pub-id>, PMID: <pub-id pub-id-type="pmid">2446327</pub-id></citation></ref>
<ref id="ref5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carapetis</surname> <given-names>J. R.</given-names></name> <name><surname>Steer</surname> <given-names>A. C.</given-names></name> <name><surname>Mulholland</surname> <given-names>E. K.</given-names></name> <name><surname>Weber</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>The global burden of group a streptococcal diseases</article-title>. <source>Lancet Infect. Dis.</source> <volume>5</volume>, <fpage>685</fpage>&#x2013;<lpage>694</lpage>. doi: <pub-id pub-id-type="doi">10.1016/S1473-3099(05)70267-X</pub-id></citation></ref>
<ref id="ref7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cunningham</surname> <given-names>M. W.</given-names></name></person-group> (<year>2008</year>). <article-title>Pathogenesis of group a streptococcal infections and their sequelae</article-title>. <source>Adv. Exp. Med. Biol.</source> <volume>609</volume>, <fpage>29</fpage>&#x2013;<lpage>42</lpage>. doi: <pub-id pub-id-type="doi">10.1007/978-0-387-73960-1_3</pub-id></citation></ref>
<ref id="ref8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Engleberg</surname> <given-names>N. C.</given-names></name> <name><surname>Heath</surname> <given-names>A.</given-names></name> <name><surname>Miller</surname> <given-names>A.</given-names></name> <name><surname>Rivera</surname> <given-names>C.</given-names></name> <name><surname>DiRita</surname> <given-names>V. J.</given-names></name></person-group> (<year>2001</year>). <article-title>Spontaneous mutations in the CsrRS two-component regulatory system of <italic>Streptococcus pyogenes</italic> result in enhanced virulence in a murine model of skin and soft tissue infection</article-title>. <source>J. Infect. Dis.</source> <volume>183</volume>, <fpage>1043</fpage>&#x2013;<lpage>1054</lpage>. doi: <pub-id pub-id-type="doi">10.1086/319291</pub-id>, PMID: <pub-id pub-id-type="pmid">11237829</pub-id></citation></ref>
<ref id="ref9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Feng</surname> <given-names>W.</given-names></name> <name><surname>Minor</surname> <given-names>D.</given-names></name> <name><surname>Liu</surname> <given-names>M.</given-names></name> <name><surname>Lei</surname> <given-names>B.</given-names></name></person-group> (<year>2017</year>). <article-title>Requirement and synergistic contribution of platelet-activating factor Acetylhydrolase Sse and Streptolysin S to inhibition of neutrophil recruitment and systemic infection by Hypervirulent emm3 group a Streptococcus in subcutaneous infection of mice</article-title>. <source>Infect. Immun.</source> <volume>85</volume>, <fpage>e00530</fpage>&#x2013;<lpage>e00517</lpage>. doi: <pub-id pub-id-type="doi">10.1128/IAI.00530-17</pub-id>, PMID: <pub-id pub-id-type="pmid">28947648</pub-id></citation></ref>
<ref id="ref10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferretti</surname> <given-names>J. J.</given-names></name> <name><surname>McShan</surname> <given-names>W. M.</given-names></name> <name><surname>Ajdic</surname> <given-names>D.</given-names></name> <name><surname>Savic</surname> <given-names>D. J.</given-names></name> <name><surname>Savic</surname> <given-names>G.</given-names></name> <name><surname>Lyon</surname> <given-names>K.</given-names></name> <etal/></person-group>. (<year>2001</year>). <article-title>Complete genome sequence of an M1 strain of <italic>Streptococcus pyogenes</italic></article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>98</volume>, <fpage>4658</fpage>&#x2013;<lpage>4663</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.071559398</pub-id>, PMID: <pub-id pub-id-type="pmid">11296296</pub-id></citation></ref>
<ref id="ref11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fischetti</surname> <given-names>V. A.</given-names></name> <name><surname>Jones</surname> <given-names>K. F.</given-names></name> <name><surname>Hollingshead</surname> <given-names>S. K.</given-names></name> <name><surname>Scott</surname> <given-names>J. R.</given-names></name></person-group> (<year>1988</year>). <article-title>Structure, function, and genetics of streptococcal M protein</article-title>. <source>Rev. Infect. Dis.</source> <volume>10</volume>, <fpage>S356</fpage>&#x2013;<lpage>S359</lpage>. doi: <pub-id pub-id-type="doi">10.1093/cid/10.supplement_2.s356</pub-id></citation></ref>
<ref id="ref12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Henderson</surname> <given-names>I. R.</given-names></name> <name><surname>Owen</surname> <given-names>P.</given-names></name> <name><surname>Nataro</surname> <given-names>J. P.</given-names></name></person-group> (<year>1999</year>). <article-title>Molecular switches--the ON and OFF of bacterial phase variation</article-title>. <source>Mol. Microbiol.</source> <volume>33</volume>, <fpage>919</fpage>&#x2013;<lpage>932</lpage>. doi: <pub-id pub-id-type="doi">10.1046/j.1365-2958.1999.01555.x</pub-id></citation></ref>
<ref id="ref13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hondorp</surname> <given-names>E. R.</given-names></name> <name><surname>Hou</surname> <given-names>S. C.</given-names></name> <name><surname>Hempstead</surname> <given-names>A. D.</given-names></name> <name><surname>Hause</surname> <given-names>L. L.</given-names></name> <name><surname>Beckett</surname> <given-names>D. M.</given-names></name> <name><surname>Mciver</surname> <given-names>K. S.</given-names></name></person-group> (<year>2012</year>). <article-title>Characterization of the group a Streptococcus Mga virulence regulator reveals a role for the C-terminal region in Oligomerization and transcriptional activation</article-title>. <source>Mol. Microbiol.</source> <volume>83</volume>, <fpage>953</fpage>&#x2013;<lpage>967</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1365-2958.2012.07980.x</pub-id></citation></ref>
<ref id="ref14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Horstmann</surname> <given-names>N.</given-names></name> <name><surname>Tran</surname> <given-names>C. N.</given-names></name> <name><surname>Brumlow</surname> <given-names>C.</given-names></name> <name><surname>DebRoy</surname> <given-names>S.</given-names></name> <name><surname>Yao</surname> <given-names>H.</given-names></name> <name><surname>Nogueras Gonzalez</surname> <given-names>G.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Phosphatase activity of the control of virulence sensor kinase CovS is critical for the pathogenesis of group a streptococcus</article-title>. <source>PLoS Pathog.</source> <volume>14</volume>:<fpage>e1007354</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.ppat.1007354</pub-id>, PMID: <pub-id pub-id-type="pmid">30379939</pub-id></citation></ref>
<ref id="ref15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levinson</surname> <given-names>G.</given-names></name> <name><surname>Gutman</surname> <given-names>G. A.</given-names></name></person-group> (<year>1987</year>). <article-title>Slipped-strand mispairing: a major mechanism for DNA sequence evolution</article-title>. <source>Mol. Biol. Evol.</source> <volume>4</volume>, <fpage>203</fpage>&#x2013;<lpage>221</lpage>. doi: <pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a040442</pub-id></citation></ref>
<ref id="ref16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Sledjeski</surname> <given-names>D. D.</given-names></name> <name><surname>Kreikemeyer</surname> <given-names>B.</given-names></name> <name><surname>Podbielski</surname> <given-names>A.</given-names></name> <name><surname>Boyle</surname> <given-names>M. D.</given-names></name></person-group> (<year>2019</year>). <article-title>Identification of pel, a <italic>Streptococcus pyogenes</italic> locus that affects both surface and secreted proteins</article-title>. <source>J. Bacteriol.</source> <volume>181</volume>, <fpage>6019</fpage>&#x2013;<lpage>6027</lpage>. doi: <pub-id pub-id-type="doi">10.1128/JB.181.19.6019-6027</pub-id></citation></ref>
<ref id="ref17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Zhu</surname> <given-names>H.</given-names></name> <name><surname>Feng</surname> <given-names>W.</given-names></name> <name><surname>Liu</surname> <given-names>M.</given-names></name> <name><surname>Song</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Regulation of inhibition of neutrophil infiltration by the two-component regulatory system CovRS in subcutaneous murine infection with group a streptococcus</article-title>. <source>Infect. Immun.</source> <volume>81</volume>, <fpage>974</fpage>&#x2013;<lpage>983</lpage>. doi: <pub-id pub-id-type="doi">10.1128/iai.01218-12</pub-id>, PMID: <pub-id pub-id-type="pmid">23319556</pub-id></citation></ref>
<ref id="ref18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>G.</given-names></name> <name><surname>Feng</surname> <given-names>W.</given-names></name> <name><surname>Li</surname> <given-names>D.</given-names></name> <name><surname>Liu</surname> <given-names>M.</given-names></name> <name><surname>Nelson</surname> <given-names>D. C.</given-names></name> <name><surname>Lei</surname> <given-names>B.</given-names></name></person-group> (<year>2015</year>). <article-title>The Mga Regulon but not Deoxyribonuclease Sda1 of invasive M1T1 group a Streptococcus contributes to in vivo selection of CovRS mutations and resistance to innate immune killing mechanisms</article-title>. <source>Infect. Immun.</source> <volume>83</volume>, <fpage>4293</fpage>&#x2013;<lpage>4303</lpage>. doi: <pub-id pub-id-type="doi">10.1128/IAI.00857-15</pub-id>, PMID: <pub-id pub-id-type="pmid">26283338</pub-id></citation></ref>
<ref id="ref19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>G.</given-names></name> <name><surname>Liu</surname> <given-names>M.</given-names></name> <name><surname>Xie</surname> <given-names>G.</given-names></name> <name><surname>Lei</surname> <given-names>B.</given-names></name></person-group> (<year>2013</year>). <article-title>Characterization of streptococcal platelet-activating factor acetylhydrolase variants that are involved in innate immune evasion</article-title>. <source>Infect. Immun.</source> <volume>81</volume>, <fpage>3128</fpage>&#x2013;<lpage>3138</lpage>. doi: <pub-id pub-id-type="doi">10.1128/IAI.00398-13</pub-id>, PMID: <pub-id pub-id-type="pmid">23774595</pub-id></citation></ref>
<ref id="ref20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>M.</given-names></name> <name><surname>Zhu</surname> <given-names>H.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Garcia</surname> <given-names>C. C.</given-names></name> <name><surname>Feng</surname> <given-names>W.</given-names></name> <name><surname>Kirpotina</surname> <given-names>L. N.</given-names></name></person-group> (<year>2012</year>). <article-title>Group a Streptococcus secreted esterase hydrolyzes platelet-activating factor to impede neutrophil recruitment and facilitate innate immune evasion</article-title>. <source>PLoS Pathog.</source> <volume>8</volume>:<fpage>e1002624</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.ppat.1002624</pub-id>, PMID: <pub-id pub-id-type="pmid">22496650</pub-id></citation></ref>
<ref id="ref21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lukomski</surname> <given-names>S.</given-names></name> <name><surname>Nakashima</surname> <given-names>K.</given-names></name> <name><surname>Abdi</surname> <given-names>I.</given-names></name> <name><surname>Cipriano</surname> <given-names>V. J.</given-names></name> <name><surname>Shelvin</surname> <given-names>B. J.</given-names></name> <name><surname>Graviss</surname> <given-names>E. A.</given-names></name> <etal/></person-group>. (<year>2001</year>). <article-title>Identification and characterization of a second extracellular collagen-like protein made by group a Streptococcus: control of production at the level of translation</article-title>. <source>Infect. Immun.</source> <volume>69</volume>, <fpage>1729</fpage>&#x2013;<lpage>1738</lpage>. doi: <pub-id pub-id-type="doi">10.1128/iai.69.3.1729-1738.2001</pub-id>, PMID: <pub-id pub-id-type="pmid">11179350</pub-id></citation></ref>
<ref id="ref22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McIver</surname> <given-names>K. S.</given-names></name> <name><surname>Heath</surname> <given-names>A. S.</given-names></name> <name><surname>Green</surname> <given-names>B. D.</given-names></name> <name><surname>Scott</surname> <given-names>J. R.</given-names></name></person-group> (<year>1995</year>). <article-title>Specific binding of the activator Mga to promoter sequences of the emm and scpA genes in the group a Streptococcus</article-title>. <source>J. Bacteriol.</source> <volume>177</volume>, <fpage>6619</fpage>&#x2013;<lpage>6624</lpage>. doi: <pub-id pub-id-type="doi">10.1128/jb.177.22.6619-6624.1995</pub-id>, PMID: <pub-id pub-id-type="pmid">7592441</pub-id></citation></ref>
<ref id="ref23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McNamara</surname> <given-names>C.</given-names></name> <name><surname>Zinkernagel</surname> <given-names>A. S.</given-names></name> <name><surname>Macheboeuf</surname> <given-names>P.</given-names></name> <name><surname>Cunningham</surname> <given-names>M. W.</given-names></name> <name><surname>Nizet</surname> <given-names>V.</given-names></name> <name><surname>Ghosh</surname> <given-names>P.</given-names></name></person-group> (<year>2008</year>). <article-title>Coiled-coil irregularities and instabilities in group a Streptococcus M1 are required for virulence</article-title>. <source>Science</source> <volume>319</volume>, <fpage>1405</fpage>&#x2013;<lpage>1408</lpage>. doi: <pub-id pub-id-type="doi">10.1126/science.1154470</pub-id>, PMID: <pub-id pub-id-type="pmid">18323455</pub-id></citation></ref>
<ref id="ref24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murase</surname> <given-names>K.</given-names></name> <name><surname>Nozawa</surname> <given-names>T.</given-names></name> <name><surname>Aikawa</surname> <given-names>C.</given-names></name> <name><surname>Nagao</surname> <given-names>M.</given-names></name> <name><surname>Ikebe</surname> <given-names>T.</given-names></name> <name><surname>Yoshida</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Complete genome sequences of <italic>Streptococcus pyogenes</italic> serotype M3, M28, and M89 strains isolated from human patients in Japan, 1994 to 2009</article-title>. <source>Microbiol. Resour. Announc.</source> <volume>9</volume>, <fpage>e01047</fpage>&#x2013;<lpage>e01020</lpage>. doi: <pub-id pub-id-type="doi">10.1128/MRA.01047-20</pub-id>, PMID: <pub-id pub-id-type="pmid">33060279</pub-id></citation></ref>
<ref id="ref25"><citation citation-type="book"><person-group person-group-type="author"><collab id="coll1">National Research Council</collab></person-group>. (<year>2011</year>). <source>Guide for the Care and Use of Laboratory Animals</source>, <edition>8th</edition> <publisher-name>National Academies Press</publisher-name>, <publisher-loc>Washington, DC</publisher-loc>.</citation></ref>
<ref id="ref26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nelson</surname> <given-names>G. E.</given-names></name> <name><surname>Pondo</surname> <given-names>T.</given-names></name> <name><surname>Toews</surname> <given-names>K. A.</given-names></name> <name><surname>Farley</surname> <given-names>M. M.</given-names></name> <name><surname>Lindegren</surname> <given-names>M. L.</given-names></name> <name><surname>Lynfield</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Epidemiology of invasive group a streptococcal infections in the United States, 2005-2012</article-title>. <source>Clin. Infect. Dis.</source> <volume>63</volume>, <fpage>478</fpage>&#x2013;<lpage>486</lpage>. doi: <pub-id pub-id-type="doi">10.1093/cid/ciw248</pub-id>, PMID: <pub-id pub-id-type="pmid">27105747</pub-id></citation></ref>
<ref id="ref27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nelson</surname> <given-names>D.</given-names></name> <name><surname>Schuch</surname> <given-names>R.</given-names></name> <name><surname>Chahales</surname> <given-names>P.</given-names></name> <name><surname>Zhu</surname> <given-names>S.</given-names></name> <name><surname>Fischetti</surname> <given-names>V. A.</given-names></name></person-group> (<year>2006</year>). <article-title>PlyC: a multimeric bacteriophage lysin</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>103</volume>, <fpage>10765</fpage>&#x2013;<lpage>10770</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.0604521103</pub-id>, PMID: <pub-id pub-id-type="pmid">16818874</pub-id></citation></ref>
<ref id="ref28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ogino</surname> <given-names>S.</given-names></name> <name><surname>Gulley</surname> <given-names>M. L.</given-names></name> <name><surname>den Dunnen</surname> <given-names>J. T.</given-names></name> <name><surname>Wilson</surname> <given-names>R. B.</given-names></name></person-group> (<year>2007</year>). <article-title>Association for Molecular Patholpogy Training and Education Committtee. Standard mutation nomenclature in molecular diagnostics: practical and educational challenges</article-title>. <source>J. Mol. Diagn.</source> <volume>9</volume>, <fpage>1</fpage>&#x2013;<lpage>6</lpage>. doi: <pub-id pub-id-type="doi">10.2353/jmoldx.2007.060081</pub-id>, PMID: <pub-id pub-id-type="pmid">17251329</pub-id></citation></ref>
<ref id="ref29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Perez-Casal</surname> <given-names>J.</given-names></name> <name><surname>Caparon</surname> <given-names>M. G.</given-names></name> <name><surname>Scott</surname> <given-names>J. R.</given-names></name></person-group> (<year>1992</year>). <article-title>Introduction of the emm 6 gene into an emm-deleted strain of <italic>Streptococcus pyogenes</italic> restores its ability to resist phagocytosis</article-title>. <source>Res. Microbiol.</source> <volume>143</volume>, <fpage>549</fpage>&#x2013;<lpage>558</lpage>. doi: <pub-id pub-id-type="doi">10.1016/0923-2508(92)90112-2</pub-id>, PMID: <pub-id pub-id-type="pmid">1475516</pub-id></citation></ref>
<ref id="ref30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rasmussen</surname> <given-names>M.</given-names></name> <name><surname>Bj&#x00F6;rck</surname> <given-names>L.</given-names></name></person-group> (<year>2001</year>). <article-title>Unique regulation of SclB - a novel collagen-like surface protein of <italic>Streptococcus pyogenes</italic></article-title>. <source>Mol. Microbiol.</source> <volume>40</volume>, <fpage>1427</fpage>&#x2013;<lpage>1438</lpage>. doi: <pub-id pub-id-type="doi">10.1046/j.1365-2958.2001.02493.x</pub-id>, PMID: <pub-id pub-id-type="pmid">11442840</pub-id></citation></ref>
<ref id="ref31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ribardo</surname> <given-names>D. A.</given-names></name> <name><surname>McIver</surname> <given-names>K. S.</given-names></name></person-group> (<year>2006</year>). <article-title>Defining the Mga regulon: comparative transcriptome analysis reveals both direct and indirect regulation by Mga in the group a streptococcus</article-title>. <source>Mol. Microbiol.</source> <volume>62</volume>, <fpage>491</fpage>&#x2013;<lpage>508</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1365-2958.2006.05381.x</pub-id></citation></ref>
<ref id="ref32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sandin</surname> <given-names>C.</given-names></name> <name><surname>Carlsson</surname> <given-names>F.</given-names></name> <name><surname>Lindahl</surname> <given-names>G.</given-names></name></person-group> (<year>2006</year>). <article-title>Binding of human plasma proteins to <italic>Streptococcus pyogenes</italic> M protein determines the location of opsonic and non-opsonic epitopes</article-title>. <source>Mol. Microbiol.</source> <volume>59</volume>, <fpage>20</fpage>&#x2013;<lpage>30</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1365-2958.2005.04913.x</pub-id>, PMID: <pub-id pub-id-type="pmid">16359315</pub-id></citation></ref>
<ref id="ref33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sarkar</surname> <given-names>P.</given-names></name> <name><surname>Danger</surname> <given-names>J. L.</given-names></name> <name><surname>Jain</surname> <given-names>I.</given-names></name> <name><surname>Meadows</surname> <given-names>L. A.</given-names></name> <name><surname>Beam</surname> <given-names>C.</given-names></name> <name><surname>Medicielo</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Phenotypic variation in the group a Streptococcus due to natural mutation of the accessory protein-encoding gene rocA</article-title>. <source>mSphere</source> <volume>3</volume>:<fpage>e00519-18</fpage>. doi: <pub-id pub-id-type="doi">10.1128/mSphere.00519-18</pub-id>, PMID: <pub-id pub-id-type="pmid">30333182</pub-id></citation></ref>
<ref id="ref34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sievers</surname> <given-names>F.</given-names></name> <name><surname>Wilm</surname> <given-names>A.</given-names></name> <name><surname>Dineen</surname> <given-names>D.</given-names></name> <name><surname>Gibson</surname> <given-names>T. J.</given-names></name> <name><surname>Karplus</surname> <given-names>K.</given-names></name> <name><surname>Li</surname> <given-names>W.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal omega</article-title>. <source>Mol. Syst. Biol.</source> <volume>7</volume>:<fpage>539</fpage>. doi: <pub-id pub-id-type="doi">10.1038/msb.2011.75</pub-id>, PMID: <pub-id pub-id-type="pmid">21988835</pub-id></citation></ref>
<ref id="ref35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Simpson</surname> <given-names>W. J.</given-names></name> <name><surname>Cleary</surname> <given-names>P. P.</given-names></name></person-group> (<year>1987</year>). <article-title>Expression of M type 12 protein by a group a streptococcus exhibits phase-like variation: evidence for coregulation of colony opacity determinants and M protein</article-title>. <source>Infect. Immun.</source> <volume>55</volume>, <fpage>2448</fpage>&#x2013;<lpage>2455</lpage>. doi: <pub-id pub-id-type="doi">10.1128/iai.55.10.2448-2455.1987</pub-id>, PMID: <pub-id pub-id-type="pmid">2443452</pub-id></citation></ref>
<ref id="ref36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spanier</surname> <given-names>J. G.</given-names></name> <name><surname>Jones</surname> <given-names>S. J.</given-names></name> <name><surname>Cleary</surname> <given-names>P.</given-names></name></person-group> (<year>1984</year>). <article-title>Small DNA deletions creating avirulence in <italic>Streptococcus pyogenes</italic></article-title>. <source>Science</source> <volume>225</volume>, <fpage>935</fpage>&#x2013;<lpage>938</lpage>. doi: <pub-id pub-id-type="doi">10.1126/science.6089334</pub-id>, PMID: <pub-id pub-id-type="pmid">6089334</pub-id></citation></ref>
<ref id="ref37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stern</surname> <given-names>A.</given-names></name> <name><surname>Brown</surname> <given-names>M.</given-names></name> <name><surname>Nickel</surname> <given-names>P.</given-names></name> <name><surname>Meyer</surname> <given-names>T. F.</given-names></name></person-group> (<year>1986</year>). <article-title>Opacity genes in <italic>Neisseria gonorrhoeae</italic>: control of phase and antigenic variation</article-title>. <source>Cells</source> <volume>47</volume>, <fpage>61</fpage>&#x2013;<lpage>71</lpage>. doi: <pub-id pub-id-type="doi">10.1016/0092-8674(86)90366-1</pub-id>, PMID: <pub-id pub-id-type="pmid">3093085</pub-id></citation></ref>
<ref id="ref38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stetzner</surname> <given-names>Z. W.</given-names></name> <name><surname>Li</surname> <given-names>D.</given-names></name> <name><surname>Feng</surname> <given-names>W.</given-names></name> <name><surname>Liu</surname> <given-names>M.</given-names></name> <name><surname>Liu</surname> <given-names>G.</given-names></name> <name><surname>Wiley</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Serotype M3 and M28 group a streptococci have distinct capacities to evade neutrophil and TNF-&#x03B1; responses and to invade soft tissues</article-title>. <source>PLoS One</source> <volume>10</volume>:<fpage>e0129417</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0129417</pub-id>, PMID: <pub-id pub-id-type="pmid">26047469</pub-id></citation></ref>
<ref id="ref39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stibitz</surname> <given-names>S.</given-names></name> <name><surname>Aaronson</surname> <given-names>W.</given-names></name> <name><surname>Monack</surname> <given-names>D.</given-names></name> <name><surname>Falkow</surname> <given-names>S.</given-names></name></person-group> (<year>1989</year>). <article-title>Phase variation in <italic>Bordetella pertussis</italic> by frameshift mutation in a gene for a novel two-component system</article-title>. <source>Nature</source> <volume>338</volume>, <fpage>266</fpage>&#x2013;<lpage>269</lpage>. doi: <pub-id pub-id-type="doi">10.1038/338266a0</pub-id></citation></ref>
<ref id="ref40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sumby</surname> <given-names>P.</given-names></name> <name><surname>Whitney</surname> <given-names>A. R.</given-names></name> <name><surname>Graviss</surname> <given-names>E. A.</given-names></name> <name><surname>DeLeo</surname> <given-names>F. R.</given-names></name> <name><surname>Musser</surname> <given-names>J. M.</given-names></name></person-group> (<year>2006</year>). <article-title>Genome-wide analysis of group a streptococci reveals a mutation that modulates global phenotype and disease specificity</article-title>. <source>PLoS Pathog.</source> <volume>2</volume>:<fpage>e5</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.ppat.0020005</pub-id>, PMID: <pub-id pub-id-type="pmid">16446783</pub-id></citation></ref>
<ref id="ref41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tanaka</surname> <given-names>M.</given-names></name> <name><surname>Kinoshita-Daitoku</surname> <given-names>R.</given-names></name> <name><surname>Kiga</surname> <given-names>K.</given-names></name> <name><surname>Sanada</surname> <given-names>T.</given-names></name> <name><surname>Zhu</surname> <given-names>B.</given-names></name> <name><surname>Okano</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Group a Streptococcus establishes pharynx infection by degrading the deoxyribonucleic acid of neutrophil extracellular traps</article-title>. <source>Sci. Rep.</source> <volume>10</volume>:<fpage>3251</fpage>. doi: <pub-id pub-id-type="doi">10.1038/s41598-020-60306-w</pub-id>, PMID: <pub-id pub-id-type="pmid">32094510</pub-id></citation></ref>
<ref id="ref42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Torres-Cruz</surname> <given-names>J.</given-names></name> <name><surname>van der Woude</surname> <given-names>M. W.</given-names></name></person-group> (<year>2003</year>). <article-title>Slipped-strand mispairing can function as a phase variation mechanism in <italic>Escherichia coli</italic></article-title>. <source>J. Bacteriol.</source> <volume>185</volume>, <fpage>6990</fpage>&#x2013;<lpage>6994</lpage>. doi: <pub-id pub-id-type="doi">10.1128/jb.185.23.6990-6994.2003</pub-id>, PMID: <pub-id pub-id-type="pmid">14617664</pub-id></citation></ref>
<ref id="ref43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Trevi&#x00F1;o</surname> <given-names>J.</given-names></name> <name><surname>Perez</surname> <given-names>N.</given-names></name> <name><surname>Ramirez-Pe&#x00F1;a</surname> <given-names>E.</given-names></name> <name><surname>Liu</surname> <given-names>Z.</given-names></name> <name><surname>Shelburne</surname> <given-names>S. A.</given-names></name> <name><surname>Musser</surname> <given-names>J. M.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>CovS simultaneously activates and inhibits the CovR-mediated repression of distinct subsets of group a Streptococcus virulence factor-encoding genes</article-title>. <source>Infect. Immun.</source> <volume>77</volume>, <fpage>3141</fpage>&#x2013;<lpage>3149</lpage>. doi: <pub-id pub-id-type="doi">10.1128/IAI.01560-08</pub-id>, PMID: <pub-id pub-id-type="pmid">19451242</pub-id></citation></ref>
<ref id="ref44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>van der Woude</surname> <given-names>M. W.</given-names></name></person-group> (<year>2011</year>). <article-title>Phase variation: how to create and coordinate population diversity</article-title>. <source>Curr. Opin. Microbiol.</source> <volume>14</volume>, <fpage>205</fpage>&#x2013;<lpage>211</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.mib.2011.01.002</pub-id>, PMID: <pub-id pub-id-type="pmid">21292543</pub-id></citation></ref>
<ref id="ref45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Viguera</surname> <given-names>E.</given-names></name> <name><surname>Canceill</surname> <given-names>D.</given-names></name> <name><surname>Ehrlich</surname> <given-names>S. D.</given-names></name></person-group> (<year>2001</year>). <article-title>Replication slippage involves DNA polymerase pausing and dissociation</article-title>. <source>EMBO J.</source> <volume>20</volume>, <fpage>2587</fpage>&#x2013;<lpage>2595</lpage>. doi: <pub-id pub-id-type="doi">10.1093/emboj/20.10.2587</pub-id>, PMID: <pub-id pub-id-type="pmid">11350948</pub-id></citation></ref>
<ref id="ref46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walker</surname> <given-names>M. J.</given-names></name> <name><surname>Hollands</surname> <given-names>A.</given-names></name> <name><surname>Sanderson-Smith</surname> <given-names>M. L.</given-names></name> <name><surname>Cole</surname> <given-names>J. N.</given-names></name> <name><surname>Kirk</surname> <given-names>J. K.</given-names></name> <name><surname>Henningham</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>DNase Sda1 provides selection pressure for a switch to invasive group a streptococcal infection</article-title>. <source>Nat. Med.</source> <volume>13</volume>, <fpage>981</fpage>&#x2013;<lpage>985</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nm1612</pub-id>, PMID: <pub-id pub-id-type="pmid">17632528</pub-id></citation></ref>
<ref id="ref47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>Y.</given-names></name> <name><surname>Hanks</surname> <given-names>T. S.</given-names></name> <name><surname>Feng</surname> <given-names>W.</given-names></name> <name><surname>Li</surname> <given-names>J.</given-names></name> <name><surname>Liu</surname> <given-names>G.</given-names></name> <name><surname>Liu</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>The sagA/pel locus does not regulate the expression of the M protein of the M1T1 lineage of group a Streptococcus</article-title>. <source>Virulence</source> <volume>4</volume>, <fpage>698</fpage>&#x2013;<lpage>706</lpage>. doi: <pub-id pub-id-type="doi">10.4161/viru.26413</pub-id>, PMID: <pub-id pub-id-type="pmid">24121654</pub-id></citation></ref>
<ref id="ref48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname> <given-names>H.</given-names></name> <name><surname>Liu</surname> <given-names>M.</given-names></name> <name><surname>Sumby</surname> <given-names>P.</given-names></name> <name><surname>Lei</surname> <given-names>B.</given-names></name></person-group> (<year>2009</year>). <article-title>The secreted esterase of group a streptococcus is important for invasive skin infection and dissemination in mice</article-title>. <source>Infect. Immun.</source> <volume>77</volume>, <fpage>5225</fpage>&#x2013;<lpage>5232</lpage>. doi: <pub-id pub-id-type="doi">10.1128/iai.00636-09</pub-id>, PMID: <pub-id pub-id-type="pmid">19805529</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="fn0003"><p><sup>1</sup><ext-link xlink:href="https://ftp.ncbi.nlm.nih.gov/genomes/genbank/bacteria/Streptococcus_pyogenes" ext-link-type="uri">https://ftp.ncbi.nlm.nih.gov/genomes/genbank/bacteria/Streptococcus_pyogenes</ext-link></p></fn>
<fn id="fn0004"><p><sup>2</sup><ext-link xlink:href="https://ftp.cdc.gov/pub/infectious_diseases/biotech" ext-link-type="uri">https://ftp.cdc.gov/pub/infectious_diseases/biotech</ext-link></p></fn>
</fn-group>
</back>
</article>
