<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Genet.</journal-id>
<journal-title>Frontiers in Genetics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Genet.</abbrev-journal-title>
<issn pub-type="epub">1664-8021</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">997341</article-id>
<article-id pub-id-type="doi">10.3389/fgene.2022.997341</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Genetics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Investigating CRISPR spacer targets and their impact on genomic diversification of <italic>Streptococcus mutans</italic>
</article-title>
<alt-title alt-title-type="left-running-head">Walker and Shields</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fgene.2022.997341">10.3389/fgene.2022.997341</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Walker</surname>
<given-names>Alejandro R.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="fn" rid="fn1">
<sup>&#x2020;</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/734896/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Shields</surname>
<given-names>Robert C.</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<xref ref-type="fn" rid="fn1">
<sup>&#x2020;</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/344129/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Department of Oral Biology</institution>, <institution>University of Florida</institution>, <addr-line>Gainesville</addr-line>, <addr-line>FL</addr-line>, <country>United States</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Department of Biological Sciences</institution>, <institution>Arkansas State University</institution>, <addr-line>Jonesboro</addr-line>, <addr-line>AR</addr-line>, <country>United States</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/106537/overview">David W. Ussery</ext-link>, University of Arkansas for Medical Sciences, United States</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1607304/overview">Kesavan Babu</ext-link>, UT Southwestern Medical Center, United States</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/405245/overview">Kurt Selle</ext-link>, Locus Biosciences Inc., United States</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Robert C. Shields, <email>rshields@astate.edu</email>
</corresp>
<fn fn-type="equal" id="fn1">
<label>
<sup>&#x2020;</sup>
</label>
<p>ORCID: Alejandro R. Walker, <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0002-5839-6584">orcid.org/0000-0002-5839-6584</ext-link>; Robert C. Shields, <ext-link ext-link-type="uri" xlink:href="http://orcid.org/0000-0002-8823-9504">orcid.org/0000-0002-8823-9504</ext-link>
</p>
</fn>
<fn fn-type="other">
<p>This article was submitted to Computational Genomics, a section of the journal Frontiers in Genetics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>15</day>
<month>09</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>997341</elocation-id>
<history>
<date date-type="received">
<day>18</day>
<month>07</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>17</day>
<month>08</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Walker and Shields.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Walker and Shields</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>CRISPR-Cas is a bacterial immune system that restricts the acquisition of mobile DNA elements. These systems provide immunity against foreign DNA by encoding CRISPR spacers that help target DNA if it re-enters the cell. In this way, CRISPR spacers are a type of molecular tape recorder of foreign DNA encountered by the host microorganism. Here, we extracted &#x223c;8,000 CRISPR spacers from a collection of over three hundred <italic>Streptococcus mutans</italic> genomes. Phage DNA is a major target of <italic>S. mutans</italic> spacers. S. mutans strains have also generated immunity against mobile DNA elements such as plasmids and integrative and conjugative elements. There may also be considerable immunity generated against bacterial DNA, although the relative contribution of self-targeting versus bona fide intra- or inter-species targeting needs to be investigated further. While there was clear evidence that these systems have acquired immunity against foreign DNA, there appeared to be minimal impact on horizontal gene transfer (HGT) constraints on a species-level. There was little or no impact on genome size, GC content and &#x2018;openness&#x2019; of the pangenome when comparing between <italic>S. mutans</italic> strains with low or high CRISPR spacer loads. In summary, while there is evidence of CRISPR spacer acquisition against self and foreign DNA, CRISPR-Cas does not act as a barrier on the expansion of the <italic>S. mutans</italic> accessory genome.</p>
</abstract>
<kwd-group>
<kwd>
<italic>Streptococcus mutans</italic>
</kwd>
<kwd>CRISPR</kwd>
<kwd>horizontal gene transfer</kwd>
<kwd>integrative and conjugative element</kwd>
<kwd>pangenome</kwd>
<kwd>CRISPR-Cas</kwd>
</kwd-group>
<contract-sponsor id="cn001">National Institute of Dental and Craniofacial Research<named-content content-type="fundref-id">10.13039/100000072</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>
<italic>Streptococcus mutans</italic> is a Gram positive bacteria that inhabits the oral cavity and has a strong association with causing dental caries (tooth decay) (<xref ref-type="bibr" rid="B24">Lemos et al., 2019</xref>). It has a genome size of approximately 2&#xa0;Mbp and an &#x201c;open&#x201d; pangenome, meaning that as more <italic>S. mutans</italic> genomes are sequenced new genes previously not associated with <italic>S. mutans</italic> are discovered (<xref ref-type="bibr" rid="B39">Song et al., 2013</xref>; <xref ref-type="bibr" rid="B27">Meng et al., 2017</xref>). These observations suggest that the organism is adaptable and able to acquire new traits over time. More broadly, it has been estimated that as a group 60% &#x2b; of the pangenome of streptococci is acquired <italic>via</italic> horizontal gene transfer (HGT) (<xref ref-type="bibr" rid="B35">Richards et al., 2014</xref>). The ability of <italic>S. mutans</italic> to horizontally acquire DNA is likely greatly facilitated by competence development, a trait that is conserved (<italic>comRS</italic>, <italic>comX</italic> and late-competence genes) across the species (<xref ref-type="bibr" rid="B22">Khan et al., 2016</xref>). There are other important contributors to the genetic variability observed in <italic>S. mutans</italic> including residing in the oral microbiome which is likely a hotspot for HGT and significant pressure to adapt to new eco-niches. Regarding the last point, <italic>S. mutans</italic> strains have acquired traits, such as collagen-binding proteins, that have expanded its niche into cardiovascular diseases (<xref ref-type="bibr" rid="B1">Abranches et al., 2011</xref>).</p>
<p>In a recent publication we detected the presence of at least one CRISPR (clustered regularly interspaced short palindromic repeats)-Cas(CRISPR-associated protein) system in 95% of <italic>S. mutans</italic> strains from a collection of 477 genomes (<xref ref-type="bibr" rid="B38">Shields et al., 2020</xref>). CRISPR-Cas systems function as an adaptive immune system, incorporating DNA fragments from invading DNA (CRISPR spacers) that in concert with Cas endonucleases prevent future &#x2018;re-infection&#x2019; of the invading DNA. CRISPR-Cas systems are organized into types and subtypes depending on the function of the Cas nuclease and the organization of the CRISPR-Cas loci. Type I (C and E subtypes) and Type II (A subtype) are most commonly identified in <italic>S. mutans</italic> and these systems recognize and cleave DNA. There are rare instances where Type III-A systems, that have specificity towards cleaving RNA (<xref ref-type="bibr" rid="B23">Kolesnik et al., 2021</xref>), are found in <italic>S. mutans</italic> (<xref ref-type="bibr" rid="B38">Shields et al., 2020</xref>). A substantial number of <italic>S. mutans</italic> strains encode both a Type I-C and a Type II-A CRISPR-Cas system, perhaps to help with viral escape from either system. The model <italic>S. mutans</italic> strain, UA159, carries Type II-A and Type I-C CRISPR-Cas loci. The Type II-A system is functional, is capable of interfering with plasmid transformation and has been repurposed for CRISPR interference and genome editing (<xref ref-type="bibr" rid="B37">Serbanescu et al., 2015</xref>; <xref ref-type="bibr" rid="B14">Gong et al., 2018</xref>; <xref ref-type="bibr" rid="B38">Shields et al., 2020</xref>). It is likely that the Type I-C system is not functional because of a truncated Cas1 protein (<xref ref-type="bibr" rid="B41">van der Ploeg, 2009</xref>; <xref ref-type="bibr" rid="B37">Serbanescu et al., 2015</xref>). Interference of both plasmid and phage DNA has also been shown in <italic>S. mutans</italic> P42S, with the Cas9 protein recognizing a different protospacer adjacent motif (PAM) domain compared with the UA159 Cas9 nuclease (<xref ref-type="bibr" rid="B28">Mosterd and Moineau, 2020</xref>). Some progress has been made exploring the functionality of these systems but there has been limited analysis of CRISPR spacers acquired by <italic>S. mutans</italic>. The most comprehensive study of <italic>S. mutans</italic> CRISPR spacers was conducted in 2009 from 29 individual strains (<xref ref-type="bibr" rid="B41">van der Ploeg, 2009</xref>). With our large collection of <italic>S. mutans</italic> genomes, and changes in CRISPR-Cas analysis over the last decade, we believe an updated analysis of CRISPR spacers is warranted.</p>
<p>The observation that <italic>S. mutans</italic> has a large pan-genome is in conflict with the high frequency of CRISPR-Cas systems across the species. These systems have primarily been considered as a phage defense mechanism. However, there is experimental evidence that CRISPR-Cas systems can limit other types of HGT (e.g., plasmid acquisition) in laboratory settings including in <italic>S. mutans</italic> (<xref ref-type="bibr" rid="B37">Serbanescu et al., 2015</xref>). Despite evidence that CRISPR-Cas systems protect bacteria from phage and other mobile genetic element infections in simplified systems, it remains unclear how much they contribute to the ecology and evolution of bacteria living in complex natural environments. When large genome datasets are studied there has been conflicting evidence of the impact of CRISPR-Cas on HGT in bacterial populations. In <italic>Pseudomonas aeruginosa</italic> CRISPR-Cas systems constrain HGT (<xref ref-type="bibr" rid="B43">Wheatley and MacLean, 2021</xref>) and there is evidence of immunity against antibiotic resistance genes in <italic>Enterococcus faecalis</italic> strains carrying CRISPR-Cas (<xref ref-type="bibr" rid="B31">Palmer and Gilmore, 2010</xref>). However, there is also evidence of no inhibition of HGT by CRISPR-Cas in other species (<xref ref-type="bibr" rid="B15">Gophna et al., 2015</xref>). The oral microbiome, specifically dental plaque, has a high CRISPR spacer load compared with other body sites (<xref ref-type="bibr" rid="B29">M&#xfc;nch et al., 2021</xref>), it is a dynamic environment and it is a large community of interacting microbes and phages. It therefore provides an excellent model for understanding the contribution of CRISPR-Cas in limiting HGT in complex microbial settings. Here, using <italic>S. mutans</italic> as a starting point, we will begin to address how impactful CRISPR-Cas is on genomic diversification in this complex ecosystem.</p>
</sec>
<sec sec-type="methods" id="s2">
<title>Methods</title>
<sec id="s2-1">
<title>Genomic data</title>
<p>We have a collection of 477 <italic>S. mutans</italic> genomes that was curated by Vince Richards at Clemson University; sequencing files were downloaded from GenBank/NCBI and assembled. Genomes are available in. gbk format on the Github platform (<ext-link ext-link-type="uri" xlink:href="https://github.com/theshieldslab/Streptococcus-mutans-CRISPR-Spacers-Analysis">https://github.com/theshieldslab/Streptococcus-mutans-CRISPR-Spacers-Analysis</ext-link>). For this study CRISPR spacers were isolated from 335 <italic>S. mutans</italic> genomes. Incomplete assemblies were evident in the other <italic>S. mutans</italic> genomes, likely because of Illumina based whole genome sequencing, leading to difficulty in identifying repeat regions (i.e., CRISPR spacer regions).</p>
</sec>
<sec id="s2-2">
<title>CRISPR spacer identification</title>
<p>Identification and data curation of the CRISPR systems found in our clinical isolates was conducted at the University of Florida HiPerGator cluster computer with the CRISPRCasFinder (<xref ref-type="bibr" rid="B8">Couvin et al., 2018</xref>) version 4.2.17 pipeline. The resulting output files, containing the predicted spacers, were then processed and summarized in several files containing CRISPR system types and subtype, list of genes, repeat sequences, count of repeat sequences, and spacers sequences. The spacer sequences were then labeled with their respective isolate name, contig ID where the spacers was predicted, and an additional counter number. All CRISPR spacer sequences were then parsed into a FASTA format file, which is available on the Github platform.</p>
</sec>
<sec id="s2-3">
<title>CRISPR spacer target identification</title>
<p>Blastn was used to identify spacer targets from phage, integrative and conjugative elements (ICEs), plasmids, and <italic>S. mutans</italic>. Phage DNA sequences were downloaded from NCBI (<ext-link ext-link-type="uri" xlink:href="ftp://ftp.ncbi.nih.gov/refseq/release/viral/">ftp://ftp.ncbi.nih.gov/refseq/release/viral/</ext-link>). ICE sequences were downloaded from the ICE database ICEberg 2.0 (<ext-link ext-link-type="uri" xlink:href="https://bioinfo-mml.sjtu.edu.cn/ICEberg2/download.html">https://bioinfo-mml.sjtu.edu.cn/ICEberg2/download.html</ext-link>) (<xref ref-type="bibr" rid="B26">Liu et al., 2019</xref>). Plasmid sequences were downloaded from a database containing 10,892 complete and annotated plasmids (<xref ref-type="bibr" rid="B4">Brooks et al., 2019</xref>). In addition, known <italic>S. mutans</italic> plasmid sequences were compiled (available on the Github platform) (<xref ref-type="bibr" rid="B47">Zhou et al., 2001</xref>; <xref ref-type="bibr" rid="B6">Caufield et al., 2007</xref>; <xref ref-type="bibr" rid="B34">Rheinberg et al., 2013</xref>). Parameters for Blastn were set to 95% sequence coverage (qcovs) and 85% sequence identity (pident). These parameters are a reasonable threshold for identifying CRISPR targets but it is possible that CRISPR spacers meeting these criteria are not capable of DNA interference because of mismatches in the seed region, or because the foreign DNA has mismatches in the proto-spacer adjacent motif (PAM). Other parameters generated with the sequence results included the number of mismatches, the evalue and the subject strand (sstrand). All raw output data is available as. xlsx files on the Github platform.</p>
</sec>
<sec id="s2-4">
<title>Analysis of genome size, GC content and the pangenome</title>
<p>The data mining for gene clusters was conducted in a series of steps, starting with Prokka annotations for each clinical isolate genome (<xref ref-type="bibr" rid="B36">Seemann, 2014</xref>). Since all isolates belong to the <italic>Streptococcus</italic> genus, annotations with Prokka were defined within Bacteria and Streptococci for better gene predictions. Multilocus sequence typing (MLST) was conducted with Roary (<xref ref-type="bibr" rid="B30">Page et al., 2015</xref>), which is ideal for close non-divergent genomes. The final step was to generate the phylogenomic tree from the genome alignment file created by Roary. A maximum likelihood tree was drawn with the Phyml (<xref ref-type="bibr" rid="B16">Guindon and Gascuel, 2003</xref>; <xref ref-type="bibr" rid="B18">Hordijk and Gascuel, 2005</xref>) algorithm and plotted in R (R: a language and environment for statistical computing) with the GGtree (<xref ref-type="bibr" rid="B44">Yu et al., 2017</xref>) package. This work also included custom bash command lines in order to generate accurate measurements of GC content and genome size for all the isolates in our bank.</p>
</sec>
</sec>
<sec sec-type="results|discussion" id="s3">
<title>Results and discussion</title>
<sec id="s3-1">
<title>CRISPR arrays and spacers are frequently detected in <italic>S. mutans</italic> isolates</title>
<p>Using CRISPRCasFinder (<xref ref-type="bibr" rid="B8">Couvin et al., 2018</xref>) across our genome database we identified a total of 8,078 CRISPR spacers of which 5,875 are unique sequences. Isolates with CRISPR arrays present carry an average of 23.9 spacers each, with strain smu174 carrying 129 spacers in total (on one array). By comparison, <italic>Streptococcus thermophilus</italic> strains carry an average of 33 spacers per cassette (<xref ref-type="bibr" rid="B19">Horvath et al., 2008</xref>). The majority of <italic>S. mutans</italic> strains carry one or two CRISPR arrays but there are rare instances of strains carrying 4 &#x2b; (<xref ref-type="fig" rid="F1">Figure 1A</xref>). <xref ref-type="fig" rid="F1">Figure 1B</xref> shows a frequency distribution of CRISPR spacer load across all the isolates. CRISPR spacer acquisition is a balance between obtaining immunity against harmful sequences without diluting spacers (some which may provide inadequate immunity because of mutation) across the available Cas machinery (<xref ref-type="bibr" rid="B13">Garrett, 2021</xref>). Most CRISPR spacers were either 30-nt or 32&#x2013;35-nt in length (<xref ref-type="fig" rid="F1">Figure 1C</xref>), with these two groups likely arising from differences in Type I and Type II CRISPR spacer lengths. The average GC content of CRISPR spacers was 38.16%, which is above the average GC content of <italic>S. mutans</italic> strains (<xref ref-type="fig" rid="F1">Figure 1D</xref>). This may indicate that foreign DNA encountered by <italic>S. mutans</italic> is on average of a higher GC content than the average <italic>S. mutans</italic> genome. By way of comparison, M102AD, an <italic>S. mutans</italic> infecting phage has GC content of 39.6%.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Analysis of <italic>S. mutans</italic> CRISPR spacers <bold>(A)</bold> A phylogenetic tree was generated using iTOL (<xref ref-type="bibr" rid="B25">Letunic and Bork, 2021</xref>) and strains were colored according to how many CRISPR spacer arrays were present on each strain genome <bold>(B)</bold> A frequency distribution of CRISPR spacer load across all the isolates <bold>(C)</bold> CRISPR spacers were grouped according to the length (nt) of the CRISPR spacers <bold>(D)</bold> A frequency distribution of CRISPR spacer GC content (percentage GC) was generated.</p>
</caption>
<graphic xlink:href="fgene-13-997341-g001.tif"/>
</fig>
</sec>
<sec id="s3-2">
<title>Targets of <italic>S. mutans</italic> CRISPR spacers</title>
<p>Having identified 8,078 spacers across our <italic>S. mutans</italic> genomes we next wanted to identify targets for these spacers. First, we characterized how many of these spacers target foreign DNA. This is the traditional target of CRISPR-Cas systems, and includes interference with phage, plasmid, integrative and conjugative elements (ICEs), and other bacterial DNA. In total, 1913 spacers (23.7%) mapped to these targets (<xref ref-type="fig" rid="F2">Figure 2</xref>). Next, we will describe these targets in more detail for each sub-group of foreign DNA.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Predicted targets of CRISPR spacers. A series of pie charts shows the targets of spacers after searching across viral, plasmid, integrative and conjugative element, and bacterial DNA databases. The types of bacterial and phage DNA being targeted were further classified into genus (for bacteria) and strain (phage). These targets are not unique; for example, many of the CRISPR spacers targeting phage will target multiple phages if the phage gene is conserved.</p>
</caption>
<graphic xlink:href="fgene-13-997341-g002.tif"/>
</fig>
<p>
<italic>Phage</italic>
<bold>.</bold> Adaptive immunity against bacteriophage is a notable feature of CRISPR-Cas systems across bacteria. Consistent with this concept <italic>S. mutans</italic> CRISPR spacers mapped to 1,137 phage sequences, of which 531 were unique matches. A total of sixteen distinct phage species are targeted by <italic>S. mutans</italic> CRISPR spacers (<xref ref-type="sec" rid="s10">Supplementary Table S1</xref>). For four of these phages <italic>S. mutans</italic> is the known host, whereas for the other phages the hosts are other Firmicutes bacteria; <italic>Staphylococci</italic>, <italic>Enterococci</italic>, <italic>Lactobacilli</italic> and <italic>Streptococci</italic>. For these phages, typically only one spacer aligned against a conserved target. We anticipate that rather than infecting <italic>S. mutans</italic>, these phage genes are present either as prophage elements or as orphan genes in <italic>S. mutans</italic> genomes. The four <italic>S. mutans</italic> infecting phage that <italic>S. mutans</italic> has acquired spacer immunity against are known as M102, M102AD, &#x278;APCM01 and smHBZ8 (<xref ref-type="bibr" rid="B12">Delisle and Rostkowski, 1993</xref>; <xref ref-type="bibr" rid="B11">Delisle et al., 2012</xref>; <xref ref-type="bibr" rid="B10">Dalmasso et al., 2015</xref>; <xref ref-type="bibr" rid="B45">Zaken et al., 2021</xref>). These are the only <italic>S. mutans</italic> phage that have been genome sequenced and all belong to the <italic>Siphoviridae</italic> family of double-stranded DNA viruses; they are lytic phage. At least two other <italic>S. mutans</italic> phage, e10 and f1, are known to infect <italic>S. mutans</italic> but these strains have not been sequenced (<xref ref-type="bibr" rid="B12">Delisle and Rostkowski, 1993</xref>). The four sequenced <italic>S. mutans</italic> phage are highly related, particularly the first 20 Kbp (<xref ref-type="fig" rid="F3">Figure 3</xref>). However, they each contain a variable region consisting of small open reading frames with unknown functions at the 3&#x2019; end of the genome (<xref ref-type="fig" rid="F3">Figure 3</xref>). Spacers were mapped and viewed with a genome browser to determine if there is any bias in spacer acquisition to specific phage genes. Overall, it appears that immunity to the variable region is less commonly acquired, and this was most evident for M102 and M102AD.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>CRISPR spacer acquisition against <italic>S. mutans</italic> bacteriophage. CRISPR spacers were blasted against viral sequences downloaded from NCBI. CRISPR spacers mapped to four <italic>S. mutans</italic> infecting phage, M102 (371 spacers), M102AD (388 spacers), &#x278;APCM01 (354 spacers) and smHBZ8 (362 spacers). These are the only <italic>S. mutans</italic> phage that have been genome sequenced and all belong to the <italic>Siphoviridae</italic> family of double-stranded DNA viruses. CRISPR spacers that mapped to phage sequences are shown as black lines, with the height of these lines indicating the total number of spacers that mapped to that specific region. Open reading frames for which no CRISPR spacers have been acquired against are colored red, and are more frequently located in a variable region of the phage genomes.</p>
</caption>
<graphic xlink:href="fgene-13-997341-g003.tif"/>
</fig>
<p>Isolation of <italic>S. mutans</italic> phage is rare, For example, &#x278;APCM01 was the only <italic>S. mutans</italic> phage isolated from 85 saliva samples (<xref ref-type="bibr" rid="B10">Dalmasso et al., 2015</xref>) and smHBZ8 was the single <italic>S. mutans</italic> phage isolated from 254 samples (diverse sources; saliva, dental sewage, cariogenic dentin, extracted teeth, and dental plaque) (<xref ref-type="bibr" rid="B45">Zaken et al., 2021</xref>). To understand how important CRISPR-Cas is in explaining the low frequency of infective phage isolation we calculated the frequency of phage immunity spacers across our genome database. In total, 64% of <italic>S. mutans</italic> strains that carry CRISPR-Cas systems have phage targeting spacers (against M102AD or M102 or &#x278;APCM01). The high frequency of CRISPR-Cas systems and anti-phage spacers would likely limit the efficacy of phage therapy in <italic>S. mutans</italic>.</p>
<p>
<italic>Plasmids</italic>
<bold>.</bold> CRISPR spacers are also naturally acquired against plasmid DNA (<xref ref-type="bibr" rid="B31">Palmer and Gilmore, 2010</xref>; <xref ref-type="bibr" rid="B40">Touchon et al., 2011</xref>). Acquisition of spacers against plasmids was very infrequently identified for <italic>S. mutans</italic>. In a database containing &#x223c;12,000 sequenced plasmids (<xref ref-type="bibr" rid="B4">Brooks et al., 2019</xref>) only two spacers mapped. A spacer from smu333 mapped to pLF25067 carried by <italic>Lactobacillus fermentum</italic> (<xref ref-type="bibr" rid="B3">Aryantini et al., 2017</xref>) and a spacer from smu408 mapped to pSZ4 from <italic>Staphylococcus warneri</italic> (<xref ref-type="bibr" rid="B7">Cheng et al., 2013</xref>, 1). One strain, smu441, carries a spacer that maps (88% coverage) against a small 5.6 kbp &#x2018;cryptic&#x2019; plasmid that is carried by &#x223c;5% of <italic>S. mutans</italic> strains (<xref ref-type="bibr" rid="B47">Zhou et al., 2001</xref>, 140; <xref ref-type="bibr" rid="B34">Rheinberg et al., 2013</xref>).</p>
<sec id="s3-2-1">
<title>Integrative conjugative plasmids</title>
<p>Integrative and conjugative elements (ICEs; also known as conjugative transposons) are another type of mobile genetic element that can transfer between bacterial species and strains (<xref ref-type="bibr" rid="B20">Johnson and Grossman, 2015</xref>). Sequence data for known ICEs was downloaded from ICEberg and used to check for CRISPR spacer acquisition against ICEs. In total, <italic>S. mutans</italic> has acquired fifty-eight spacers against ICEs. All but one of these spacers matches against TnSmu1, an uncharacterized putative ICE which is related to Tn916 but with significant modifications (<xref ref-type="bibr" rid="B2">Ajdi&#x107; et al., 2002</xref>). In the model <italic>S. mutans</italic> strain UA159 TnSmu1 is a large 20 kbp region comprising of over twenty predicted genes (<xref ref-type="bibr" rid="B2">Ajdi&#x107; et al., 2002</xref>). Most of the genes in this region are hypothetical but some are predicted to have integrative and conjugative functions (e.g<italic>.</italic> type IV secretion system components). When aligned to this region the CRISPR spacers qualitatively appear to have been generated to target the integrase and secretion system components instead of hypothetical regions (<xref ref-type="fig" rid="F4">Figure 4</xref>). The functionality of TnSmu1 and whether it behaves like an ICE is unknown. However, acquisition of CRISPR spacers by strains of <italic>S. mutans</italic> could indicate that it may be horizontally acquired and strains are actively trying to avoid acquisition of the element. Certain strains of <italic>S. mutans</italic> may also have been repeatedly attacked by TnSmu1 as the isolate Smu179 has acquired five spacers that align to TnSmu1. In <italic>Pseudomonas aeruginosa</italic> CRISPR spacers often target ICE or conjugative transfer genes and this is associated with a lower prevalence of these systems in strains harboring CRISPR-Cas systems (<xref ref-type="bibr" rid="B43">Wheatley and MacLean, 2021</xref>). Prevention of the acquisition of these elements likely arises because of fitness costs associated with them. Our analysis of CRISPR-mediated immunity of ICEs is likely conservative as description and characterization of these elements is limited in <italic>S. mutans</italic>.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>CRISPR spacer acquisition against the putative integrative and conjugative element TnSmu1. CRISPR spacers were blasted against ICE sequences downloaded from ICEberg. Multiple CRISPR spacers mapped to a putative <italic>S. mutans</italic> ICE known as TnSmu1. This element is approximately 20&#xa0;Kbp and consists of twenty-eight genes. Most of the genes are hypothetical (grey) but some are annotated as putative T4SS components (blue) and one gene is annotated as an integrase (green). CRISPR spacers that mapped to this region are shown as small arrows, with a row for each <italic>S. mutans</italic> strain that has acquired spacers against the putative ICE.</p>
</caption>
<graphic xlink:href="fgene-13-997341-g004.tif"/>
</fig>
</sec>
</sec>
<sec id="s3-3">
<title>
<italic>S. mutans</italic> self- and strain-targeting CRISPR spacers</title>
<p>With the observation that <italic>S. mutans</italic> has gained immunity against inter-species transfer of mobile genetic elements we next wanted to explore if CRISPR spacers have been acquired more generally against <italic>S. mutans</italic> strain DNA. To do this, we searched for CRISPR spacer hits within genes encoded by <italic>S. mutans</italic> across all the genomes included in this study. A total of sixty-three annotated genes are the target of CRISPR spacers (<xref ref-type="sec" rid="s10">Supplementary Table S2</xref>). In addition, six other genes with functional predictions (<xref ref-type="sec" rid="s10">Supplementary Table S2</xref>) are also the target of CRISPR spacers. Lastly, <italic>S. mutans</italic> hypothetical proteins were also matched to <italic>S. mutans</italic> CRISPR spacer sequences. From this set of genes (excluding hypotheticals), 22 reside in the core genome (&#x3e;99% of genomes), 4 are soft core genes (95&#x2013;99% of genomes), 17 are shell genes (15&#x2013;95% of genomes), and 20 are cloud genes (&#x3c;15% of genomes) (<xref ref-type="sec" rid="s10">Supplementary Table S2</xref>). CRISPR targets against <italic>S. mutans</italic> core genes are over-represented given that core genes only account for 5% of the <italic>S. mutans</italic> pangenome. This is compared with cloud genes which make up 85% of the pangenome. Notable <italic>S. mutans</italic> gene CRISPR targets include the <italic>clp</italic> system (<xref ref-type="bibr" rid="B21">Kajfasz et al., 2009</xref>), the <italic>gtfC</italic> glucosyltransferase (<xref ref-type="bibr" rid="B17">Hanada and Kuramitsu, 1988</xref>), the cell surface antigen <italic>spaP</italic> (<xref ref-type="bibr" rid="B9">Crowley et al., 1999</xref>), carbohydrate utilization systems (i.e., <italic>fruA</italic> and <italic>lacF</italic>) (<xref ref-type="bibr" rid="B5">Burne et al., 1999</xref>; <xref ref-type="bibr" rid="B46">Zeng et al., 2010</xref>) and several genes involved in DNA replication, repair or modification (i.e., <italic>dpnM</italic>, <italic>parE</italic>, <italic>radD</italic>, <italic>recF</italic>, <italic>repA</italic>, <italic>smc</italic>, <italic>ssb</italic> and <italic>topB</italic>). Several of the CRISPR spacers that target the <italic>S. mutans</italic> pangenome are likely targeting mobile DNA elements. For example, spacers against tyrosine recombinases (i.e., <italic>xerC</italic> and <italic>xerD</italic>), conjugal transfer protein <italic>traG</italic>, and the Int-Tn transposase from the ICE Tn916. To visualize the extent of CRISPR spacer acquisition against <italic>S. mutans</italic> genomes, spacers were mapped to two strains, UA159 and NN2025 (<xref ref-type="fig" rid="F5">Figures 5A,B</xref>). The number of spacers that aligned within 10&#xa0;kb regions varied, with notable hotspots against the genomes of both UA159 and NG8. These hotspot regions were TnSmu1 and repeat regions within the promoters of <italic>clpP</italic> (protein homeostasis), <italic>rexB</italic> (putative exonuclease) and a hypothetical protein.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Acquisition of CRISPR spacers against <italic>S. mutans</italic> genes. To visualize CRISPR spacer acquisition against <italic>S. mutans</italic> genomes, spacers were mapped to two strains, UA159 <bold>(A)</bold> and NN2025 <bold>(B)</bold>. Each graph shows the number of spacers that aligned within each 10&#xa0;Kbp region of a <italic>S. mutans</italic> genome. There are notable hotspots that are highlighted with red, green and blue bars. A network diagram was constructed using Gephi (<ext-link ext-link-type="uri" xlink:href="http://www.gephi.org/">http://www.gephi.org</ext-link>) to visualize <italic>S. mutans</italic> strains that interact with other <italic>S. mutans</italic> strains on the basis of CRISPR spacer acquisition <bold>(C)</bold>. Each node (circle) is a single <italic>S. mutans</italic> strain and a line between strains indicates a connection based on a CRISPR target being present on one of those strains. Strains with increased connections (targeting more strains with CRISPR spacers) are colored on a scale, with red nodes depicting strains with the most connections (CRISPR spacers targeting other <italic>S. mutans</italic> strains).</p>
</caption>
<graphic xlink:href="fgene-13-997341-g005.tif"/>
</fig>
<p>Inter-species CRISPR targeting was also visualized using network analysis (<xref ref-type="fig" rid="F5">Figure 5C</xref>). This analysis showed that a minority of <italic>S. mutans</italic> strains (ca. 60 strains; red nodes in <xref ref-type="fig" rid="F5">Figure 5C</xref>) were accounting for the majority of CRISPR spacer connections between <italic>S. mutans</italic> strains. These strains carry CRISPR spacers against highly conserved (core) <italic>S. mutans</italic> genes (<xref ref-type="sec" rid="s10">Supplementary Table S2</xref>). We reasoned that this group of highly connected strains could actually be carrying self-targeting CRISPR spacers against highly conserved genes. Analysis of self-targeting revealed that ninety-four <italic>S. mutans</italic> strains carry self-targeting CRISPR spacers (<xref ref-type="sec" rid="s10">Supplementary Table S3</xref>) and most of the highly connected nodes in the network analysis are strains that self-target. All of these strains contain the Cas genes required for functional CRISPR-Cas systems but it is unknown if these systems are inactivated through point mutations. Acquisition of self-targeting spacers would likely be detrimental and create pressure for inactivation of the CRISPR-Cas systems. Six out of the ninety-four self-targeting strains contain anti-CRISPR (Acr) proteins that could also account for a loss of functionality (<xref ref-type="sec" rid="s10">Supplementary Table S3</xref>). In summary, <italic>S. mutans</italic> strains carry spacers that target <italic>S. mutans</italic> genes although the relative amount of intra-species-targeting versus only self-targeting is currently unknown.</p>
</sec>
<sec id="s3-4">
<title>Limited impact of CRISPR-Cas systems on <italic>S. mutans</italic> genome size and GC content</title>
<p>Analysis of <italic>S. mutans</italic> CRISPR spacers has shown potential immunity acquisition against phage, plasmid, ICE, and <italic>S. mutans</italic> DNA. Acquisition of spacers against these types of DNA should act to limit the rate of HGT in <italic>S. mutans</italic>. The association between CRISPR-Cas and its impact on HGT in bacteria is not clear. For example, <italic>P. aeruginosa</italic> genomes with CRISPR-Cas are significantly smaller than those lacking CRISPR-Cas which suggests that CRISPR-Cas impedes HGT (<xref ref-type="bibr" rid="B43">Wheatley and MacLean, 2021</xref>). However, a similar trend was not found for <italic>Staphylococcus aureus</italic> or <italic>Acinetobacter baumannii</italic> (<xref ref-type="bibr" rid="B32">Pursey et al., 2022</xref>). Here, we are using CRISPR spacer load as a measure of CRISPR-Cas activity. It has limitations because we cannot determine if the CRISPR arrays/systems are functional or defective. However, it is the most appropriate measure because it is unfeasible to test for genomic differences between strains with and without CRISPR-Cas systems because the majority of <italic>S. mutans</italic> strains have at least one CRISPR-Cas system. CRISPR spacer load, the number of CRISPR spacers per genome, has been used as a measure of CRISPR activity by others (<xref ref-type="bibr" rid="B15">Gophna et al., 2015</xref>). For our analysis, <italic>S. mutans</italic> strains were grouped by the total number of spacers they have acquired and for each group the median size of the genome was computed (<xref ref-type="fig" rid="F6">Figure 6</xref>). We found that increased CRISPR spacers loads were associated with marginally increased genome sizes. Strains with 1&#x2013;20 CRISPR spacers had a median genome size of 2.01&#xa0;Mbp and strains with 40&#x2013;60 CRISPR spacers had a median genome size of 2.05&#xa0;Mbp (two-tailed <italic>t</italic>-test <italic>p</italic> &#x3c; 0.01). Our analysis shows that increased CRISPR spacer acquisition does not lead to reduced genome size in <italic>S. mutans</italic>.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>CRISPR activity and genome size. Genomes were grouped according to the number of CRISPR spacers present (1&#x2013;20, 21&#x2013;40, 41&#x2013;60, 61&#x2013;80 and 81&#x2b;) and a violin plot was generated for each group. Median genome size (Mbp) is shown next to each violin plot; each dot is a single <italic>S. mutans</italic> strain/genome. Statistical significance between groups was calculated using a two-tailed <italic>t</italic>-test (&#x2a;&#x2a; &#x3d; <italic>p</italic> &#x3c; 0.01; &#x2a;&#x2a;&#x2a; &#x3d; <italic>p</italic> &#x3c; 0.001).</p>
</caption>
<graphic xlink:href="fgene-13-997341-g006.tif"/>
</fig>
<p>By limiting HGT CRISPR-Cas could also affect the GC content of host genomes by reducing the acquisition of higher or lower GC foreign DNA. The average GC content across all <italic>S. mutans</italic> CRISPR spacers was 38.2% (<xref ref-type="fig" rid="F1">Figure 1D</xref>) which may indicate that on average foreign DNA elements have higher GC content than the average <italic>S. mutans</italic> genome (36.8%). Phages that infect <italic>S. mutans</italic>, M102AD, M102, and SMHBZ8 have GC content of 39.6%, 39.21%, and 38.8% respectively (<xref ref-type="bibr" rid="B42">van der Ploeg, 2007</xref>; <xref ref-type="bibr" rid="B11">Delisle et al., 2012</xref>; <xref ref-type="bibr" rid="B45">Zaken et al., 2021</xref>). The GC content of <italic>S. mutans</italic> genomes was consistent for strains with 1&#x2013;20 (36.80%), 20&#x2013;40 (36.82%) and 40&#x2013;60 (36.79%) CRISPR spacers (<xref ref-type="fig" rid="F7">Figure 7</xref>). A significant difference in GC content was observed for strains with 60&#x2013;80 (36.68%) CRISPR spacers compared with the 1&#x2013;20 and 20&#x2013;40 groupings (two-tailed <italic>t</italic>-test <italic>p</italic> &#x3c;0.05). Although only nominally lower than the other groups, this might indicate a reduction in the acquisition of higher GC foreign DNA among strains with 60&#x2013;80 CRISPR spacers.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>CRISPR activity and GC content. Genomes were grouped according to the number of CRISPR spacers present (1&#x2013;20, 21&#x2013;40, 41&#x2013;60, 61&#x2013;80 and 81&#x2b;) and a violin plot was generated for each group. Median GC content (percentage GC) is shown next to each violin plot; each dot is a single <italic>S. mutans</italic> strain/genome. Statistical significance between groups was calculated using a two-tailed <italic>t</italic>-test (&#x2a; &#x3d; <italic>p</italic> &#x3c;0.05; &#x2a;&#x2a; &#x3d; <italic>p</italic> &#x3c; 0.01).</p>
</caption>
<graphic xlink:href="fgene-13-997341-g007.tif"/>
</fig>
</sec>
<sec id="s3-5">
<title>CRISPR-Cas systems do not restrict the size of the <italic>S. mutans</italic> pangenome</title>
<p>To gain a greater insight into the impact of CRISPR-Cas systems on HGT we wanted to examine how CRISPR spacer load alters the size and distribution of the <italic>S. mutans</italic> pangenome. In our analysis including 335 <italic>S. mutans</italic> genomes, the pangenome size was calculated to include 15,062 genes. There were 822 genes in the core genome (&#x3e;99% of genomes), 496 soft core genes (95&#x2013;99% of genomes), 982 shell genes (15&#x2013;95% of genomes), and 12,762 cloud genes (&#x3c;15% of genomes). Next, pangenome analysis was conducted on two groups, one with a low CRISPR spacer load (1&#x2013;20 spacers, average 9.36; 163 genomes) and another with a high CRISPR spacer load (21&#x2013;129 spacers, average 37.93; 169 genomes). For both groups we generated pie charts which show the percentage of genes located in the different gene clusters (core; soft core; shell; cloud) (<xref ref-type="fig" rid="F8">Figure 8</xref>). The low and high CRISPR spacer load groups have a similar percentage of genes located within the cloud genes gene cluster. Cloud genes are those that are most likely acquired <italic>via</italic> HGT. By this measure there is no limiting of HGT by CRISPR-Cas systems for <italic>S. mutans</italic> strains with either low or high CRISPR spacer loads.</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption>
<p>Pangenome analysis comparing low and high CRISPR spacer load <italic>S. mutans</italic> strains. Pangenome analysis was conducted on three groups. 1) all <italic>S. mutans</italic> genomes, 2) a group with low CRISPR spacer load (1&#x2013;20 spacers, average 9.36; 163 genomes) and 3) a group with a high CRISPR spacer load (21&#x2013;129 spacers, average 37.93; 169 genomes). For the three groups pie charts were generated which show the percentage of genes located in the different gene clusters (core, red; soft core, orange; shell, green; cloud, blue). Cloud genes (blue) are most likely acquired <italic>via</italic> HGT.</p>
</caption>
<graphic xlink:href="fgene-13-997341-g008.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="conclusion" id="s4">
<title>Conclusion</title>
<p>Here, we have generated a comprehensive &#x2018;memory&#x2019; of DNA that has attacked a diverse panel of <italic>S. mutans</italic> isolates. Although there is considerable &#x2018;dark matter&#x2019; (undetected CRISPR spacer sequences), phage are the main target of <italic>S. mutans</italic> spacers. Isolates have also generated immunity against mobile DNA elements such as plasmids and ICEs. There may also be considerable immunity generated against bacterial DNA, although the relative contribution of self-targeting versus bona fide intra- or inter-species targeting needs to be investigated further. Collectively, CRISPR-Cas systems and acquired spacers are abundant across all <italic>S. mutans</italic> strains. While there is clear evidence that these systems have acquired immunity against foreign DNA, there appears to be minimal impact on HGT constraints on a species-level. There was little or minimal impact on genome size, GC content and &#x2018;openness&#x2019; of the pangenome. There are several possible explanations for why there is a minimal impact on HGT and these include 1) the systems have specificity for phage infection, 2) self-targeting has led to a loss of activity, 3) the systems are highly regulated to only acquire immunity under certain conditions, 4) competence acquired DNA is single stranded and the CRISPR-Cas systems are less active against ssDNA, 5) spacer load may be an inaccurate measure of CRISPR-Cas activity and 6) target or spacer DNA has changed so that immunity no longer functions. A deeper understanding of the impact of these systems on the biology and evolution of <italic>S. mutans</italic> will likely require functional studies and/or model systems for interrogating spacer acquisition.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s5">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/<xref ref-type="sec" rid="s10">Supplementary Material</xref>.</p>
</sec>
<sec id="s6">
<title>Author contributions</title>
<p>AR and RS were involved in the investigation, data curation and revised the manuscript; AR performed the formal analysis and software development (implementation of code and supporting algorithms); RS was involved with funding acquisition, conceptualization of the study and drafted the manuscript.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>This work was supported in part by the National Institute for Dental and Craniofacial Research (NIDCR) grant DE029882 awarded to RS.</p>
</sec>
<ack>
<p>We would like to thank Vince Richards (Clemson University) for providing us with the <italic>S. mutans</italic> genome files associated with this study.</p>
</ack>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s10">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fgene.2022.997341/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fgene.2022.997341/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.PDF" id="SM1" mimetype="application/PDF" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abranches</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Miller</surname>
<given-names>J. H.</given-names>
</name>
<name>
<surname>Martinez</surname>
<given-names>A. R.</given-names>
</name>
<name>
<surname>Simpson-Haidaris</surname>
<given-names>P. J.</given-names>
</name>
<name>
<surname>Burne</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Lemos</surname>
<given-names>J. A.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>The collagen-binding protein Cnm is required for <italic>Streptococcus mutans</italic> adherence to and intracellular invasion of human coronary artery endothelial cells</article-title>. <source>Infect. Immun.</source> <volume>79</volume>, <fpage>2277</fpage>&#x2013;<lpage>2284</lpage>. <pub-id pub-id-type="doi">10.1128/IAI.00767-10</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ajdi&#x107;</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>McShan</surname>
<given-names>W. M.</given-names>
</name>
<name>
<surname>McLaughlin</surname>
<given-names>R. E.</given-names>
</name>
<name>
<surname>Savi&#x107;</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Carson</surname>
<given-names>M. B.</given-names>
</name>
<etal/>
</person-group> (<year>2002</year>). <article-title>Genome sequence of <italic>Streptococcus mutans</italic> UA159, a cariogenic dental pathogen</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>99</volume>, <fpage>14434</fpage>&#x2013;<lpage>14439</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.172501299</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Aryantini</surname>
<given-names>N. P. D.</given-names>
</name>
<name>
<surname>Prajapati</surname>
<given-names>J. B.</given-names>
</name>
<name>
<surname>Urashima</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Fukuda</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Complete genome sequence of Lactobacillus fermentum MTCC 25067 (formerly TDS030603), a viscous exopolysaccharide-producing strain isolated from Indian fermented milk</article-title>. <source>Genome Announc.</source> <volume>5</volume>, <fpage>e00091</fpage>&#x2013;<lpage>17</lpage>. <pub-id pub-id-type="doi">10.1128/genomeA.00091-17</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brooks</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Kaze</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sistrom</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>A curated, comprehensive database of plasmid sequences</article-title>. <source>Microbiol. Resour. Announc.</source> <volume>8</volume>, <fpage>e01325</fpage>&#x2013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1128/MRA.01325-18</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Burne</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Wen</surname>
<given-names>Z. T.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Y.-Y. M.</given-names>
</name>
<name>
<surname>Penders</surname>
<given-names>J. E. C.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Regulation of expression of the fructan hydrolase gene of Streptococcus mutans GS-5 by induction and carbon catabolite repression</article-title>. <source>J. Bacteriol.</source> <volume>181</volume>, <fpage>2863</fpage>&#x2013;<lpage>2871</lpage>. <pub-id pub-id-type="doi">10.1128/JB.181.9.2863-2871.1999</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Caufield</surname>
<given-names>P. W.</given-names>
</name>
<name>
<surname>Saxena</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Fitch</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Population structure of plasmid-containing strains of Streptococcus mutans, a member of the human indigenous biota</article-title>. <source>J. Bacteriol.</source> <volume>189</volume>, <fpage>1238</fpage>&#x2013;<lpage>1243</lpage>. <pub-id pub-id-type="doi">10.1128/JB.01183-06</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cheng</surname>
<given-names>V. W. T.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Oyedotun</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Ridgway</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Ellison</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Weiner</surname>
<given-names>J. H.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Complete genome of the solvent-tolerant Staphylococcus warneri strain SG1</article-title>. <source>Genome Announc.</source> <volume>1</volume>, <fpage>e0003813</fpage>. <pub-id pub-id-type="doi">10.1128/genomeA.00038-13</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Couvin</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Bernheim</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Toffano-Nioche</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Touchon</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Michalik</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>N&#xe9;ron</surname>
<given-names>B.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins</article-title>. <source>Nucleic Acids Res.</source> <volume>46</volume>, <fpage>W246</fpage>&#x2013;<lpage>W251</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky425</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Crowley</surname>
<given-names>P. J.</given-names>
</name>
<name>
<surname>Brady</surname>
<given-names>L. J.</given-names>
</name>
<name>
<surname>Michalek</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Bleiweis</surname>
<given-names>A. S.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Virulence of a spaP mutant ofStreptococcus mutans in a gnotobiotic rat model</article-title>. <source>Infect. Immun.</source> <volume>67</volume>, <fpage>1201</fpage>&#x2013;<lpage>1206</lpage>. <pub-id pub-id-type="doi">10.1128/IAI.67.3.1201-1206.1999</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dalmasso</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Haas</surname>
<given-names>E. de</given-names>
</name>
<name>
<surname>Neve</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Strain</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Cousin</surname>
<given-names>F. J.</given-names>
</name>
<name>
<surname>Stockdale</surname>
<given-names>S. R.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Isolation of a novel phage with activity against Streptococcus mutans biofilms</article-title>. <source>PLOS ONE</source> <volume>10</volume>, <fpage>e0138651</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0138651</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Delisle</surname>
<given-names>A. L.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Chalmers</surname>
<given-names>N. I.</given-names>
</name>
<name>
<surname>Barcak</surname>
<given-names>G. J.</given-names>
</name>
<name>
<surname>Rousseau</surname>
<given-names>G. M.</given-names>
</name>
<name>
<surname>Moineau</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Biology and genome sequence of Streptococcus mutans phage M102AD</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>78</volume>, <fpage>2264</fpage>&#x2013;<lpage>2271</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.07726-11</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Delisle</surname>
<given-names>A. L.</given-names>
</name>
<name>
<surname>Rostkowski</surname>
<given-names>C. A.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>Lytic bacteriophages of <italic>Streptococcus mutans</italic>
</article-title>. <source>Curr. Microbiol.</source> <volume>27</volume>, <fpage>163</fpage>&#x2013;<lpage>167</lpage>. <pub-id pub-id-type="doi">10.1007/BF01576015</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Garrett</surname>
<given-names>S. C.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Pruning and tending immune memories: Spacer dynamics in the CRISPR array</article-title>. <source>Front. Microbiol.</source> <volume>12</volume>, <fpage>664299</fpage>. <pub-id pub-id-type="doi">10.3389/fmicb.2021.664299</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gong</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zeng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>X.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Genome editing in Streptococcus mutans through self-targeting CRISPR arrays</article-title>. <source>Mol. Oral Microbiol.</source> <volume>33</volume>, <fpage>440</fpage>&#x2013;<lpage>449</lpage>. <pub-id pub-id-type="doi">10.1111/omi.12247</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gophna</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Kristensen</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>Wolf</surname>
<given-names>Y. I.</given-names>
</name>
<name>
<surname>Popa</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Drevet</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Koonin</surname>
<given-names>E. V.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>No evidence of inhibition of horizontal gene transfer by CRISPR&#x2013;Cas on evolutionary timescales</article-title>. <source>ISME J.</source> <volume>9</volume>, <fpage>2021</fpage>&#x2013;<lpage>2027</lpage>. <pub-id pub-id-type="doi">10.1038/ismej.2015.20</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guindon</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Gascuel</surname>
<given-names>O.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood</article-title>. <source>Syst. Biol.</source> <volume>52</volume>, <fpage>696</fpage>&#x2013;<lpage>704</lpage>. <pub-id pub-id-type="doi">10.1080/10635150390235520</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hanada</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Kuramitsu</surname>
<given-names>H. K.</given-names>
</name>
</person-group> (<year>1988</year>). <article-title>Isolation and characterization of the Streptococcus mutans gtfC gene, coding for synthesis of both soluble and insoluble glucans</article-title>. <source>Infect. Immun.</source> <volume>56</volume>, <fpage>1999</fpage>&#x2013;<lpage>2005</lpage>. <pub-id pub-id-type="doi">10.1128/iai.56.8.1999-2005.1988</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hordijk</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Gascuel</surname>
<given-names>O.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Improving the efficiency of SPR moves in phylogenetic tree search methods based on maximum likelihood</article-title>. <source>Bioinformatics</source> <volume>21</volume>, <fpage>4338</fpage>&#x2013;<lpage>4347</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bti713</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Horvath</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Romero</surname>
<given-names>D. A.</given-names>
</name>
<name>
<surname>Co&#xfb;t&#xe9;-Monvoisin</surname>
<given-names>A.-C.</given-names>
</name>
<name>
<surname>Richards</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Deveau</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Moineau</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2008</year>). <article-title>Diversity, activity, and evolution of CRISPR loci in Streptococcus thermophilus</article-title>. <source>J. Bacteriol.</source> <volume>190</volume>, <fpage>1401</fpage>&#x2013;<lpage>1412</lpage>. <pub-id pub-id-type="doi">10.1128/JB.01415-07</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Johnson</surname>
<given-names>C. M.</given-names>
</name>
<name>
<surname>Grossman</surname>
<given-names>A. D.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Integrative and conjugative elements (ICEs): What they do and how they work</article-title>. <source>Annu. Rev. Genet.</source> <volume>49</volume>, <fpage>577</fpage>&#x2013;<lpage>601</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-genet-112414-055018</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kajfasz</surname>
<given-names>J. K.</given-names>
</name>
<name>
<surname>Martinez</surname>
<given-names>A. R.</given-names>
</name>
<name>
<surname>Rivera-Ramos</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Abranches</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Koo</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Quivey</surname>
<given-names>R. G.</given-names>
</name>
<etal/>
</person-group> (<year>2009</year>). <article-title>Role of clp proteins in expression of virulence properties of Streptococcus mutans</article-title>. <source>J. Bacteriol.</source> <volume>191</volume>, <fpage>2060</fpage>&#x2013;<lpage>2068</lpage>. <pub-id pub-id-type="doi">10.1128/JB.01609-08</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Khan</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Rukke</surname>
<given-names>H. V.</given-names>
</name>
<name>
<surname>H&#xf8;vik</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>&#xc5;mdal</surname>
<given-names>H. A.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Morrison</surname>
<given-names>D. A.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Comprehensive transcriptome profiles of Streptococcus mutans UA159 map core streptococcal competence genes</article-title>. <source>mSystems</source> <volume>1</volume>, <fpage>e00038</fpage>&#x2013;<lpage>15</lpage>. <pub-id pub-id-type="doi">10.1128/mSystems.00038-15</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kolesnik</surname>
<given-names>M. V.</given-names>
</name>
<name>
<surname>Fedorova</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Karneyeva</surname>
<given-names>K. A.</given-names>
</name>
<name>
<surname>Artamonova</surname>
<given-names>D. N.</given-names>
</name>
<name>
<surname>Severinov</surname>
<given-names>K. V.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Type III CRISPR-cas systems: Deciphering the most complex prokaryotic immune system</article-title>. <source>Biochemistry.</source> <volume>86</volume>, <fpage>1301</fpage>&#x2013;<lpage>1314</lpage>. <pub-id pub-id-type="doi">10.1134/S0006297921100114</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lemos</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Palmer</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Zeng</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Wen</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Kajfasz</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Freires</surname>
<given-names>I.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>The Biology of Streptococcus mutans</article-title>. <source>Microbiol. Spectr.</source> <volume>7</volume>. <pub-id pub-id-type="doi">10.1128/microbiolspec.GPP3-0051&#x2013;201810.1128/microbiolspec.GPP3-0051-2018</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Letunic</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Bork</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Interactive tree of life (iTOL) v5: an online tool for phylogenetic tree display and annotation</article-title>. <source>Nucleic Acids Res.</source> <volume>49</volume>, <fpage>W293</fpage>&#x2013;<lpage>W296</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkab301</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Xie</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Bi</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>ICEberg 2.0: an updated database of bacterial integrative and conjugative elements</article-title>. <source>Nucleic Acids Res.</source> <volume>47</volume>, <fpage>D660</fpage>&#x2013;<lpage>D665</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gky1123</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meng</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Exploring the genomic diversity and cariogenic differences of Streptococcus mutans strains through pan-genome and comparative genome analysis</article-title>. <source>Curr. Microbiol.</source> <volume>74</volume>, <fpage>1200</fpage>&#x2013;<lpage>1209</lpage>. <pub-id pub-id-type="doi">10.1007/s00284-017-1305-z</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mosterd</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Moineau</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Characterization of a type II-A CRISPR-cas system in Streptococcus mutans</article-title>. <source>mSphere</source> <volume>5</volume>, <fpage>e00235</fpage>&#x2013;<lpage>20</lpage>. <pub-id pub-id-type="doi">10.1128/mSphere.00235-20</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>M&#xfc;nch</surname>
<given-names>P. C.</given-names>
</name>
<name>
<surname>Franzosa</surname>
<given-names>E. A.</given-names>
</name>
<name>
<surname>Stecher</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>McHardy</surname>
<given-names>A. C.</given-names>
</name>
<name>
<surname>Huttenhower</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Identification of natural CRISPR systems and targets in the human microbiome</article-title>. <source>Cell Host Microbe</source> <volume>29</volume>, <fpage>94</fpage>&#x2013;<lpage>106</lpage>. <pub-id pub-id-type="doi">10.1016/j.chom.2020.10.010</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Page</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Cummins</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>Hunt</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Wong</surname>
<given-names>V. K.</given-names>
</name>
<name>
<surname>Reuter</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Holden</surname>
<given-names>M. T. G.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Roary: rapid large-scale prokaryote pan genome analysis</article-title>. <source>Bioinformatics</source> <volume>31</volume>, <fpage>3691</fpage>&#x2013;<lpage>3693</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btv421</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Palmer</surname>
<given-names>K. L.</given-names>
</name>
<name>
<surname>Gilmore</surname>
<given-names>M. S.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Multidrug-resistant Enterococci lack CRISPR-cas</article-title>. <source>mBio</source> <volume>1</volume>, <fpage>e00227</fpage>&#x2013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1128/mBio.00227-10</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pursey</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Dimitriu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Paganelli</surname>
<given-names>F. L.</given-names>
</name>
<name>
<surname>Westra</surname>
<given-names>E. R.</given-names>
</name>
<name>
<surname>van Houte</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>CRISPR-Cas is associated with fewer antibiotic resistance genes in bacterial pathogens</article-title>. <source>Philos. Trans. R. Soc. Lond. B Biol. Sci.</source> <volume>377</volume>, <fpage>20200464</fpage>. <pub-id pub-id-type="doi">10.1098/rstb.2020.0464</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="web">
<collab>R Core Team</collab> (<year>2015</year>). <article-title>R: a language and environment for statistical computing</article-title> <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://www.gbif.org/tool/81287/r-a-language-and-environment-for-statistical-computing">https://www.gbif.org/tool/81287/r-a-language-and-environment-for-statistical-computing</ext-link>
</comment> [<comment>Accessed April 12, 2022</comment>]. </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rheinberg</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Swierzy</surname>
<given-names>I. J.</given-names>
</name>
<name>
<surname>Nguyen</surname>
<given-names>T. D.</given-names>
</name>
<name>
<surname>Horz</surname>
<given-names>H.-P.</given-names>
</name>
<name>
<surname>Conrads</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Cryptic Streptococcus mutans 5.6-kb plasmids encode a toxin&#x2013;antitoxin system for plasmid stabilization</article-title>. <source>J. Oral Microbiol.</source> <volume>5</volume>, <fpage>19729</fpage>. <pub-id pub-id-type="doi">10.3402/jom.v5i0.19729</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Richards</surname>
<given-names>V. P.</given-names>
</name>
<name>
<surname>Palmer</surname>
<given-names>S. R.</given-names>
</name>
<name>
<surname>Pavinski Bitar</surname>
<given-names>P. D.</given-names>
</name>
<name>
<surname>Qin</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Weinstock</surname>
<given-names>G. M.</given-names>
</name>
<name>
<surname>Highlander</surname>
<given-names>S. K.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Phylogenomics and the dynamic genome evolution of the genus Streptococcus</article-title>. <source>Genome Biol. Evol.</source> <volume>6</volume>, <fpage>741</fpage>&#x2013;<lpage>753</lpage>. <pub-id pub-id-type="doi">10.1093/gbe/evu048</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Seemann</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Prokka: rapid prokaryotic genome annotation</article-title>. <source>Bioinformatics</source> <volume>30</volume>, <fpage>2068</fpage>&#x2013;<lpage>2069</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu153</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Serbanescu</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Cordova</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Krastel</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Flick</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Beloglazova</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Latos</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Role of the <italic>Streptococcus mutans</italic> CRISPR-cas systems in immunity and cell physiology</article-title>. <source>J. Bacteriol.</source> <volume>197</volume>, <fpage>749</fpage>&#x2013;<lpage>761</lpage>. <pub-id pub-id-type="doi">10.1128/JB.02333-14</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shields</surname>
<given-names>R. C.</given-names>
</name>
<name>
<surname>Walker</surname>
<given-names>A. R.</given-names>
</name>
<name>
<surname>Maricic</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Chakraborty</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Underhill</surname>
<given-names>S. A. M.</given-names>
</name>
<name>
<surname>Burne</surname>
<given-names>R. A.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Repurposing the <italic>Streptococcus mutans</italic> CRISPR-cas9 system to understand essential gene function</article-title>. <source>PLoS Pathog.</source> <volume>16</volume>, <fpage>e1008344</fpage>. <pub-id pub-id-type="doi">10.1371/journal.ppat.1008344</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Song</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Conrads</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Rheinberg</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Sztajer</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Reck</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>Genetic variability of mutans streptococci revealed by wide whole-genome sequencing</article-title>. <source>BMC Genomics</source> <volume>14</volume>, <fpage>430</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-14-430</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Touchon</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Charpentier</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Clermont</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Rocha</surname>
<given-names>E. P. C.</given-names>
</name>
<name>
<surname>Denamur</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Branger</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>CRISPR distribution within the <italic>Escherichia coli</italic> species is not suggestive of immunity-associated diversifying selection</article-title>. <source>J. Bacteriol.</source> <volume>193</volume>, <fpage>2460</fpage>&#x2013;<lpage>2467</lpage>. <pub-id pub-id-type="doi">10.1128/JB.01307-10</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>van der Ploeg</surname>
<given-names>J. R.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Analysis of CRISPR in <italic>Streptococcus mutans</italic> suggests frequent occurrence of acquired immunity against infection by M102-like bacteriophages</article-title>. <source>Microbiol. Read. Engl.)</source> <volume>155</volume>, <fpage>1966</fpage>&#x2013;<lpage>1976</lpage>. <pub-id pub-id-type="doi">10.1099/mic.0.027508-0</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>van der Ploeg</surname>
<given-names>J. R.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Genome sequence of Streptococcus mutans bacteriophage M102</article-title>. <source>FEMS Microbiol. Lett.</source> <volume>275</volume>, <fpage>130</fpage>&#x2013;<lpage>138</lpage>. <pub-id pub-id-type="doi">10.1111/j.1574-6968.2007.00873.x</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wheatley</surname>
<given-names>R. M.</given-names>
</name>
<name>
<surname>MacLean</surname>
<given-names>R. C.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>CRISPR-Cas systems restrict horizontal gene transfer in <italic>Pseudomonas aeruginosa</italic>
</article-title>. <source>ISME J.</source> <volume>15</volume>, <fpage>1420</fpage>&#x2013;<lpage>1433</lpage>. <pub-id pub-id-type="doi">10.1038/s41396-020-00860-3</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yu</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>D. K.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Guan</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lam</surname>
<given-names>T. T.-Y.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>ggtree: an r package for visualization and annotation of phylogenetic trees with their covariates and other associated data</article-title>. <source>Methods Ecol. Evol.</source> <volume>8</volume>, <fpage>28</fpage>&#x2013;<lpage>36</lpage>. <pub-id pub-id-type="doi">10.1111/2041-210X.12628</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zaken</surname>
<given-names>H. B.</given-names>
</name>
<name>
<surname>Kraitman</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Coppenhagen-Glazer</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Khalifa</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Alkalay-Oren</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ben-Gal</surname>
<given-names>G.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Isolation and characterization of anti-Streptococcus mutans phage as a possible treatment agent for caries</article-title>. <pub-id pub-id-type="doi">10.20944/preprints202104.0045.v1</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zeng</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Das</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Burne</surname>
<given-names>R. A.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Utilization of lactose and galactose by Streptococcus mutans: Transport, toxicity, and carbon catabolite repression</article-title>. <source>J. Bacteriol.</source> <volume>192</volume>, <fpage>2434</fpage>&#x2013;<lpage>2444</lpage>. <pub-id pub-id-type="doi">10.1128/JB.01624-09</pub-id> </citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Caufield</surname>
<given-names>P. W.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Qi</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Complete nucleotide sequence and characterization of pUA140, a cryptic plasmid from Streptococcus mutans</article-title>. <source>Plasmid</source> <volume>46</volume>, <fpage>77</fpage>&#x2013;<lpage>85</lpage>. <pub-id pub-id-type="doi">10.1006/plas.2001.1539</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>