Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Cell. Infect. Microbiol., 28 August 2025

Sec. Clinical and Diagnostic Microbiology and Immunology

Volume 15 - 2025 | https://doi.org/10.3389/fcimb.2025.1636071

This article is part of the Research TopicAdvances in Vaginal Microbiome and Metabolite Research: Genetics, Evolution, and Clinical PerspectivesView all 12 articles

CRISPR typing and phage content of colonizing Group B Streptococci from healthy Egyptian women

Sarah ShabayekSarah Shabayek1Dorina HaiderDorina Haider2Verena VogelVerena Vogel2Uzma Basit Khan,Uzma Basit Khan3,4Dorota JamrozyDorota Jamrozy3Elita JauneikaiteElita Jauneikaite5Kirsty LeDoareKirsty LeDoare6Stephen BentleyStephen Bentley3Barbara Spellerberg*Barbara Spellerberg2*
  • 1Department of Microbiology and Immunology, Faculty of Pharmacy, Suez Canal University, Ismailia, Egypt
  • 2Institute of Medical Microbiology and Hygiene, University Hospital Ulm, Ulm, Germany
  • 3Parasites and Microbes, Wellcome Sanger Institute, Hinxton, United Kingdom
  • 4Oxford Centre for Microbiome Studies, Kennedy Institute of Rheumatology, University of Oxford, Oxford, United Kingdom
  • 5Department of Metabolism, Digestion and Reproduction, Institute of Reproduction and Developmental Biology, Imperial College London, London, United Kingdom
  • 6Centre for Neonatal and Paediatric Infection, Maternal and Neonatal Vaccinology Group, City St. Georges University of London, London, United Kingdom

Background: Streptococcus agalactiae or Group B Streptococcus (GBS) causes serious infections in neonates with a particularly high burden of disease in Africa. Maternal vaginal colonization is the primary source of neonatal transmission. Molecular surveillance of the maternal GBS population is crucial for informing maternal vaccine development and monitoring of the global circulation of GBS clones.

Methods: The current study analyzes the structure and diversity of the clustered regularly interspaced palindromic repeat (CRISPR)-associated (Cas) system and phage content in colonizing GBS isolates collected from healthy pregnant women from Ismailia, Egypt. The isolates were characterized by whole-genome sequencing within the global JUNO project.

Results: CRISPR arrays and phages were detected in a vast majority of GBS isolates. A strong congruence was observed between multilocus sequence typing (MLST), CRISPR profile, and phage content. Region-specific sequence types (STs) observed only in Africa were distinguishable from other lineages.

Conclusions: CRISPR typing is a promising low-cost tool for investigating the population structure of GBS clones, particularly in middle- and low-income countries.

Introduction

The clustered regularly interspaced palindromic repeat (CRISPR)-associated (Cas) system is an adaptive immune system of prokaryotes providing protection against invading nucleic acids associated with mobile genetic elements (MGEs), such as phages and plasmids (Barrangou et al., 2007; Lopez-Sanchez et al., 2012). The CRISPR-Cas locus typically consists of highly conserved direct repeats (DR) separated by non-repetitive invader-targeting spacer sequences of a similar length and genes encoding CRISPR-associated proteins (Cas proteins) (Kunin et al., 2007; Lopez-Sanchez et al., 2012). Spacer cleavage derived from invading DNA is mediated by the direct hybridization of small CRISPR RNAs (crRNAs), which act in conjunction with Cas proteins (Deltcheva et al., 2011). Many of the repeat-spacer arrays are flanked by a leader sequence on one side and a trailer sequence on the other. CRISPR arrays represent a chronological archive of past invaders as recently acquired spacers are integrated at the leader proximal and sorted in a linear, time-oriented manner. Polarized acquisition of new spacers is concomitant to the duplication of a DR (Barrangou and Dudley, 2016; Beauruelle et al., 2017).

Streptococcus agalactiae or group B streptococcus (GBS) is a pathobiont that colonizes the gastrointestinal and genitourinary tracts of healthy humans (Verani et al., 2010; Brokaw et al., 2021). GBS is a leading cause of sepsis, meningitis, and pneumonia in neonates (Verani et al., 2010; Brokaw et al., 2021). Maternal carriage is the principal route of GBS vertical transmission to newborns (Verani et al., 2010; Brokaw et al., 2021). It can also cause severe infections in the elderly and non-pregnant adults (Verani et al., 2010; Brokaw et al., 2021). Two CRISPR-Cas systems have been described in GBS, a type II-A system, named CRISPR1 and a type I-C system named CRISPR2. While the CRISPR1 locus is ubiquitous and functional, the CRISPR2 locus is present in a few strains and is less active (Lopez-Sanchez et al., 2012). The CRISPR1 locus is extremely diverse and highly dynamic with continuous spacer acquisitions and duplications as well as spacer deletions. It is typically composed of highly conserved DRs of 36 bp, which are interspaced with spacers of 30 bp. Conversely, the CRISPR2 locus is less diverse and less dynamic with DRs of 32 bp (Lopez-Sanchez et al., 2012).

Prophages, integrated into bacterial chromosomes, play crucial roles in virulence, pathogenicity, and the regulation of bacterial ecology and colonization (Penadés et al., 2015; Davies et al., 2016). CRISPR-Cas systems control phage acquisition (Barrangou et al., 2007), whereas prophages are associated with horizontal gene transfer in bacteria, enhancing genomic diversity and accounting for up to 10% of all strain-specific genes in GBS (Tettelin et al., 2005; Juhas, 2015). Lysogeny is common among a variety of bacterial species; in GBS, it was first described in 1969 in strains of bovine origin (Khan et al., 2024). While the majority of GBS isolates contain one or more prophages, differences in prophage distribution among GBS serotypes and clonal complexes have been reported reflecting GBS population heterogeneity (Domelier et al., 2009; Salloum et al., 2011; van der Mee-Marquet et al., 2018).

The population structure of GBS has been investigated using different typing methods including serotyping and multilocus sequence typing (MLST) (Jones et al., 2003). Although serotyping is relatively less expensive and easy to perform in the laboratory, its discriminating power is insufficient for a detailed comparison of isolates (Sabat et al., 2013; Beauruelle et al., 2021) and has in many laboratories been replaced by molecular genetic typing methods. MLST is currently the standard for GBS typing (Beauruelle et al., 2021). However, a major disadvantage of MLST is that it is expensive, time-consuming, and labor-intensive (Radtke et al., 2010; Sabat et al., 2013; Beauruelle et al., 2021). The vast majority of GBS isolates harbor a CRISPR type II-A system (Lopez-Sanchez et al., 2012). A typing method based on CRISPR array analysis with high discriminatory power has been shown to be efficient in separating GBS isolates (Lopez-Sanchez et al., 2012). This genotyping approach discriminates isolates according to the CRISPR1 spacer content of the type II-A system based on similarities between spacers. Unlike MLST, CRISPR typing is an easy and low-cost method (Beauruelle et al., 2021). Recent studies suggested CRISPR typing as a promising alternative to MLST (Beauruelle et al., 2021) especially in low-resource settings.

While there is a scarcity of GBS epidemiological data for many African countries, this region is particularly indicated with a high burden of GBS vaginal carriage and disease (Shabayek et al., 2022). One of the main reasons for the scarcity of data is the high cost of GBS typing methods (Capan et al., 2012; Russell et al., 2017). The low-cost CRISPR approach has been proposed as a useful tool with high discriminatory power for GBS typing in middle- and low-income countries (Beauruelle et al., 2021) and may be especially useful in view of the introduction of GBS vaccines in the near future.

Our work builds on a previous study (Shabayek et al., 2022) where the MLST types of 90 colonizing Egyptian GBS isolates from pregnant and non-pregnant women have been described. The current study focuses on the analysis of the structure and diversity of CRISPR-Cas profiles and their use for epidemiological typing in these 90 Egyptian GBS isolates using whole genome sequencing (WGS) within the JUNO project (Project JUNO, 2021). Furthermore, the diversity of GBS prophages found in the Egyptian GBS isolates, including their distribution across different STs and correlation with CRISPR1 array clusters, was analyzed.

Materials and methods

Bacterial genomic DNA was extracted using the GenElute Bacterial Genomic DNA Kit (Sigma-Aldrich, St. Louis, MO, USA) according to the manufacturer’s instructions. Whole-genome sequencing (WGS) was performed at the Wellcome Sanger Institute as part of the global GBS surveillance study JUNO [https://www.gbsgen.net/ (accessed on 2 August 2024)] (Project JUNO, 2021). Briefly, WGS libraries were prepared using the NEB Ultra II library kit on an Agilent Bravo liquid handling platform, and the sequencing was performed on the Illumina HiSeq platform with 150-base-pair read length. Short-read data are available from the European Nucleotide Archive (see Supplementary Table S1 for sample accession numbers). Genomes were assembled using the SPAdes genome assembler (version 3.14.0) (Bankevich et al., 2012). Subsequently, multilocus sequence typing (MLST) was performed with the assembled genomes by utilizing the PubMLST database (Jolley et al., 2018; Shabayek et al., 2022). WGS data were analyzed for CRISPR loci using CRISPRCasFinder (Couvin et al., 2018; Makarova et al., 2020). Only CRISPR arrays with evidence level 3 or 4 were retained for further analysis. The rest were discarded as CRISPR arrays having evidence levels 3 and 4 are considered highly likely candidates, whereas evidence levels 1 and 2 indicate potentially invalid CRISPR arrays (Couvin et al., 2018; Makarova et al., 2020). The evidence levels 1-4 are a rating system to discriminate fake CRISPR-like elements from true CRISPRs. The lowest evidence level (rated 1) is given to short candidate arrays made of one to three spacers as these often do not correspond to CRISPRs. Putative CRISPR arrays with at least four spacers are assigned to levels 2–4 based on an EBcons (entropy-based conservation) index, which is produced through an algorithm to measure CRISPR repeat conservation and degree of similarity between spacers. To assess the conservation of CRISPR-Cas type II-A systems in all African countries, the presence of the csn2 gene, a signature gene of CRISPR-Cas type II-A systems (Makarova et al., 2020), was analyzed in publicly available genomes of GBS strains isolated in nine other African countries (study references: PRJEB11000, PRJEB18603, PRJEB8986, PRJEB41294, PRJEB43245, PRJEB44246, PRJEB46377, PRJNA63204, PRJNA556442, PRJNA986888, PRJNA315969, PRJNA407943, and PRJNA479604; for ENA accession numbers, see Supplementary Table S2). In silico PCR was conducted using the following primers: 5'ATGATCAAGATTAATTTTCC3' and 5'TTATACCATATTTTCGCCTA 3'. Graphic presentation, visualization, and alignment of CRISPR loci were performed using CRISPRViZ (Nethery and Barrangou, 2019). A phylogenetic tree of CRISPR1 loci was built using the Interactive Tree Of Life (iTOL) (Letunic and Bork, 2024) online tool (https://itol.embl.de). A Newick file of the initial tree was generated by the Molecular Evolutionary Genetics Analysis (MEGA) software version 11 (Tamura et al., 2021) through multiple-sequence alignment of CRISPR1 sequences using ClustalW and the neighbor-joining algorithm. This was then uploaded to the online iTOL to build a circular phylogenetic tree. The assembled genomes were analyzed for prophage sequences using PHASTEST (Wishart et al., 2023). Only predicted prophage regions with the completeness level “intact” were considered for further analysis. Prophage regions with completeness level “incomplete” and “questionable” were discarded. The Basic Local Alignment Search Tool (BLAST) (Altschul et al., 1990) was used to identify homologous regions. Prophage sequences were aligned using Qiagen CLC Main Workbench 7.7.3 with default parameters (gap open cost: 10; gap extension cost: 1,0). Qiagen CLC Main Workbench 7.7.3. was used for bootstrap analysis (100 replicates) and construction of the phylogenetic tree (tree construction method: neighbor joining algorithm; nucleotide distance measure: Jukes-Cantor). Prophage sequences were assigned into clusters A-P according to the phylogenetic tree, and clusters were aligned separately using the Clustal Omega tool (Madeira et al., 2024). Clustering was retained, when the percent identity of the aligned sequences within each cluster was at least 75% (Table 1; Supplementary Table S3).

Table 1
www.frontiersin.org

Table 1. Prophage clusters and similarity to previously characterized phages.

Results

We investigated the CRISPR1 and CRISPR2 loci distribution in a collection of 90 whole-genome sequenced colonizing GBS isolates from healthy Egyptian women. Using CRISPRCasFinder, the CRISPR1 sequence was detected in 88 isolates, confirming its ubiquitous nature and conserved structure in GBS. The complete sequence was generated for each isolate, enabling the analysis of spacers, direct repeats, leaders, and trailers. The analyzed sequences carried a CRISPR1-Cas type II-A system with four cas genes (cas9, cas1, cas2, and csn2). The number of spacers ranged from three to 32 per isolate, corresponding to a CRISPR1 array size of 434 to 2348 bp. In addition to CRISPR1, seven isolates (ST12, n = 5; ST10, n = 1; ST24, n = 1) concurrently harbored a CRISPR2 system, confirming its rare presence. The number of CRISPR2 spacers ranged from 4 to 10 per isolate, with array sizes ranging from 497 to 896 bp. All sequenced CRISPR2-Cas type I-C systems contained seven cas genes (cas3, cas5c, cas8c, cas7c, cas4, cas1, and cas2).

In line with these results, 2,835 publicly available genomes from other African countries (Angola, Botswana, Ethiopia, Gambia, Kenya, Malawi, Mozambique, Nigeria, South Africa) were analyzed for the presence of the csn2 gene, which is specific to type II-A systems and indicative of CRISPR1 (Makarova et al., 2020). The csn2 gene is very conserved and was identified by in silico PCR in over 99% (2,825 of 2,835) of the analyzed African isolates.

A total of 1,441 spacers were identified, of which 287 spacers were unique (275 in CRISPR1 and 12 in CRISPR2) appearing only once among the analyzed genomes. Unique spacer sizes ranged from 27 to 36 bp (Supplementary Table S4). The remaining 1,154 spacers were shared and duplicated within or across isolate (1125 in CRISPR1 and 29 in CRISPR2), corresponding to 258 distinct spacers after removing duplicates (Supplementary Table S5) ranging from 28 to 35 bp. Frequently repeated spacer sequences are listed in Table 2. Internal spacer duplication(s) within the same isolate was observed only in the CRISPR1 arrays but not in CRISPR2. Spacer duplication(s) in the CRISPR1 arrays within the same genome were observed in 37 isolates, including ST1 (n = 13), ST19 (n = 8), ST12 (n = 4), ST23 (n = 3), and ST932 (n = 2), and single isolates of STs 3, 6, 14, 25, 196, 556, and 934.

Table 2
www.frontiersin.org

Table 2. The topmost repeated spacers within the same isolate and/or among more than one isolate of colonizing GBS strains in Egypt.

Using CRISPRViZ isolates were aligned and grouped based on CRISPR1 array homology (Figures 13). Strong congruence was observed between CRISPR1 array clustering and MLST typing. A total of 81 isolates were grouped into 16 CRISPR1 clusters, whereas seven isolates demonstrated distinct arrays and were not grouped into any cluster. These were then regrouped according to MLST type. For CC1 (Figures 1A–E), STs 1, 3, and 196 were grouped into four clusters, with one isolate showing a distinct array (CC1 subgroup-1 to 5). Although eBURST assigned ST3 and ST196 to different CCs, their CRISPR1 homology grouped them under ST1. All except one of the ST14 isolates were clustered together (CC1 subgroup-6 to 7) (Figures 2A, B). For CC4 (Figures 2C–E), rare ST4s and African-specific ST932 (Shabayek et al., 2022) formed two clusters, whereas one ST4 isolate had a distinct array (CC4 subgroups 1 to 3). CC12 (Figures 2F, G) comprised ST12 and ST10 isolates in two clusters (CC12 subgroups 1 to 2). For CC17 (Figure 3A), the CRISPR1 arrays of the two novel ST1954/HvgA positive/serotype III isolates (Shabayek et al., 2022), were identical, demonstrating a perfect match with 100% similarity, and were gathered into a single cluster. For CC19 (Figures 3B–D), all except one of the ST19 isolates were clustered together and ST28 isolates were grouped with a single ST1 isolate showing a CRISPR1 array homologous to the ST28 cluster (CC19 subgroups 1 to 3). CC23 (Figures 3E, F) grouped the STs 23, 25, and 556 in two clusters (CC23 subgroup-1 to 2). ST6 and the African-specific ST569 isolates (Shabayek et al., 2022) formed separate singleton clusters (Figures 2H, 3G). ST24 and African-specific STs 486 and 934 (Shabayek et al., 2022) had distinct CRISPR1 arrays and were not grouped into any cluster (Figures 3H–J). Notably, isolates with unique CRISPR1 arrays had 66% to 100% unique spacers as shown by CRISPRCasFinder. A phylogenetic tree of the analyzed CRISPR1 loci is shown in Figure 4.

Figure 1
Color-coded matrix chart labeled with sections A through E on the left and numbered items on the top and left sides. Each cell contains differently colored symbols, representing various data points, arranged in patterned rows. The layout is divided by bold horizontal lines separating the sections.

Figure 1. The CRISPR1 arrays of ST1, ST3, and ST196 isolates showing CC1 subgroups from 1 to 5. (A) CRISPR1 array of CC1 subgroup 1. (B) CC1 subgroup 2. (C) CC1 subgroup 3. (D) CC1 subgroup 4. (E) CC1 subgroup 5. According to eBURST, ST3 and ST196 belonged to different CCs. However, these were grouped into CC1 based on their homologous CRISPR1 array to ST1. The CRISPR1 arrays are represented using CRISPRViZ. A black line separates CRISPR1 subgroups. Spacers are represented as colored boxes. Gaps (=missing spacers) are shown with a boxed cross symbol (⊠) after alignment of identical spacers between strains of the same group. Arrays are oriented with respect to the leader sequence located on the left.

Figure 2
Grid displaying various colored blocks and symbols, organized into eight labeled sections from A to H. Each row contains different multicolored blocks, some marked with symbols like diamonds and squares. The sections are numbered and named, showing distinctive patterns and arrangements. The image doesn't immediately convey specific data but seems to classify or compare groups visually.

Figure 2. The CRISPR1 arrays of ST14/CC1, CC4 (ST4 and ST932), CC12 (ST12 and ST10) isolates, and the singleton ST569. A-B. The CRISPR1 arrays of ST14/CC1 isolates showing CC1 subgroups 6 and 7. (A) CRISPR1 array of CC1 subgroup 6. (B) CRISPR1 array of CC1 subgroup 7. All except one of the ST14 isolates were gathered into a single cluster. C-E. The CRISPR1 arrays of CC4 (ST4 and ST932) isolates showing CC4 subgroups from 1 to 3. (C) CRISPR1 array of CC4 subgroup 1. (D) CRISPR1 array of CC4 subgroup 2. (E) CRISPR1 array of CC4 subgroup 3. STs 4 and 932 were grouped into two clusters, with one ST4 isolate having a distinct array. (F-H) The CRISPR1 arrays of CC12 (ST12 and ST10) isolates and the singleton ST569. (F) CRISPR1 array of CC12 subgroup-1. (G) CRISPR1 array of CC12 subgroup-2. (H) CRISPR1 array of ST569 isolates. The CRISPR1 arrays are represented using CRISPRViZ. A black line separates CRISPR1 subgroups. Spacers are represented as colored boxes. Gaps (=missing spacers) are shown with a boxed cross symbol (⊠) after alignment of identical spacers between strains of the same group. Arrays are oriented with respect to the leader sequence located on the left.

Figure 3
Grid chart displaying colored symbols within rows labeled A to J, each associated with specific codes like “1035-ST1545”. Rows show varying patterns of colored diamonds and circles, indicating data distribution or relationships.

Figure 3. The CRISPR1 arrays of the novel ST1954/CC17, STs 19, 28, and 1 isolates, CC23 (ST23, ST25, ST556), and the singletons ST6, ST24, ST486, and ST934 isolates. (A) The CRISPR1 arrays of the novel ST1954/CC17 isolates. B-D. The CRISPR1 arrays of STs 19, 28, and 1 isolates showing CC19 subgroups from 1 to 3. (B) CRISPR1 array of CC19 subgroup 1. (C) CRISPR1 array of CC19 subgroup 2. (D) CRISPR1 array of CC19 subgroup 3. All except one of the ST19 isolates were gathered into a single cluster. The ST28 isolates were grouped with a single ST1 isolate showing a CRISPR1 array homologous to the ST28 cluster. E-F. The CRISPR1 arrays of CC23 (ST23, ST25, ST556) isolates showing CC23 subgroups 1 and 2. (E) CRISPR1 array of CC23 subgroup 1. (F) CRISPR1 array of CC23 subgroup 2. The CC23 isolates were grouped into two clusters. G-J. The CRISPR1 arrays of the singletons ST6, ST24, ST486, and ST934 isolates. (G) CRISPR1 array of ST6 (n = 2). (H) CRISPR1 array of ST24 (n = 1). (I) CRISPR1 array of ST486 (n = 1). (J) CRISPR1 array of ST934 (n = 1). The CRISPR1 arrays are represented using CRISPRViZ. A black line separates CRISPR1 subgroups. Spacers are represented as colored boxes. Gaps (=missing spacers) are shown with a boxed cross symbol (⊠) after alignment of identical spacers between strains of the same group. Arrays are oriented with respect to the leader sequence located on the left.

Figure 4
Phylogenetic tree diagram displaying colored branches representing various sequences clustered by sequence type (ST) and clonal complexes. Labels include ST1, ST19, ST74, etc. A key indicates color codes for different clonal complexes such as CC4, CC17, and CC19.

Figure 4. Phylogenetic circular tree of the CRISPR1 sequences of colonizing GBS strains from Egypt. The phylogenetic tree of CRISPR1 loci was built using the Interactive Tree Of Life (iTOL).

A comparison of the spacers comprising the CRISPR1 loci revealed extreme diversity (Figures 13). A total of 87 different spacer profiles were identified, confirming that the GBS CRISPR1 locus is highly dynamic. Spacers near the trailer end were more conserved, whereas those near the leader end were mostly unique or rare. However, some isolates showed spacer deletions at the trailer end and some isolates lacked unique spacers in the leader region. These spacers were in the middle part of the CRISPR array of related strains. Events involving several spacer deletions in the middle part of the CRISPR1 array without the acquisition of unique spacers or duplication of conserved spacers were also observed, as well as internal spacer switches.

Finally, using CRISPRCasFinder, we analyzed the flanking regions of CRISPR1 and CRISPR2 (leader and trailer ends), as well as the DR, as these sequences sometimes show variation between groups of isolates. A total of four DR sequences were identified in the CRISPR1 loci (Supplementary Table S6); the two most prevalent, 36 bp each, appeared in 64% (n = 58) and 29% (n = 26) of isolates. The other two, 37 bp long, were found in ST19 (n=2) and the novel ST1954/CC17 (Shabayek et al., 2022) (n = 2). For the seven CRISPR2 loci, two 32-bp DR sequences were identified. Concerning leader sequences, a total of nine sequences directly adjacent to the first CRISPR1 repeat were identified (Supplementary Table S7). The three most common CRISPR1 leader sequences were found in 57% (n = 51), 20% (n = 18), and 9% (n = 8) of the isolates, respectively. For the CRISPR2 loci, two leader sequences were identified. Concerning trailer sequences, nine sequences located downstream of CRISPR1 were identified (Supplementary Table S8). The three most common trailer sequences were found in 41% (n = 37), 31% (n = 28), and 13% (n = 12) of the isolates, respectively. For the CRISPR2 loci, two trailer sequences were identified.

CRISPR-Cas systems provide protection against phage invasion, to investigate how CRISPR profiles influence prophage content in the Egyptian GBS population, the assembled whole-genome sequences were analyzed with PHASTEST (Penadés et al., 2015). In total, 149 prophage sequences were identified across 90 GBS isolates. A total of 61 prophages were excluded from analysis since they were predicted as “incomplete” or “questionable” by PHASTEST. The number and size of intact prophages per genome are shown in Table 3. Among the analyzed GBS genomes, most (56.7%, 51/90) carried only one prophage, 14.4% (13/90) had two, 3.3% (3/90) had three, and 1.1% (1/90) four. No prophages were detected in 24.4% of the GBS isolates. The prevalence of prophages varied across different STs (Figure 5a). Isolates of ST1, ST4, and ST28 mainly carried one prophage, whereas ST12, ST14, ST19, and the African-specific ST932 showed more heterogeneity. Interestingly, no prophages were detected in the African-specific ST569.

Table 3
www.frontiersin.org

Table 3. Prophages in Egyptian GBS isolates.

Figure 5
Chart (a) is a bar graph showing the number of GBS isolates across various sequence types, categorized by prophage numbers from zero to four. Chart (b) is a phylogenetic tree depicting relationships among different GBS isolates, with branches labeled A to P, representing different clonal complexes. Both charts illustrate genetic variation and clustering.

Figure 5. Distribution and phylogeny of GBS prophages. (a) Prophage distribution across sequence types of Egyptian GBS isolates. (b) Neighbor Joining tree constructed on the genome comparison at nucleotide level of prophages found in Egyptian GBS strains by bootstrap analysis using Qiagen CLC Main Workbench 7.7.3. Multiple-sequence alignment was constructed using Qiagen CLC Main Workbench 7.7.3 with default settings. Assigned prophage groups from (A-P) are depicted as well as corresponding CRISPR1 array clusters. Prophages labeled with an asterisk originate from GBS strains not included in CRISPR1 array clustering.

Based on the alignment and the following constructed phylogenetic tree (Figure 5b), the identified prophages were assigned into prophage clusters A-P (Table 3). Clusters of prophage sequences with ≥75% identity were aligned (Table 1; Supplementary Table S3) and further compared with the NCBI nr/nt database of viruses and bacteria using MEGABLAST. Most resembled annotated Streptococcus phages, including phages of S. agalactiae, S. dysgalactiae, and S. pyogenes, and Enterococcus phages (Tables 1, 4). The 14 analyzed prophages from cluster A matched S. agalactiae phage Javan55 with 100% identity over 66%-67% genome coverage (Juhas, 2015). The 11 prophages from cluster E also matched Javan55 with 100% identity over a coverage of 67%, but had ≤50% mutual similarity to cluster A prophages. Prophages from cluster D and F matched S. agalactiae phage Javan29, each showing 100% identity and 68% genome coverage (Juhas, 2015); however, the mutual similarity of both clusters was ≤50%. Cluster B, J, and L prophages resembled S. agalactiae phage Javan51, Javan39, and Javan23, respectively (Juhas, 2015). Prophages of clusters G and P did not match to any annotated bacteriophage. Despite their mutual similarity of at least 87%, cluster H prophages did not consistently match a particular phage (Table 4). Cluster C, K, M, N, and O prophages matched various known phages including S. agalactiae phages phiStag1_20280_6_181 (Russell et al., 1969), vB_Sags-UPM1 (van der Mee-Marquet et al., 2018), phiGBSVK-D_GBSInt2.2 (Salloum et al., 2011), LF2 (Domelier et al., 2009), and phiGBSVK-F2_GBSInt11.1 (Salloum et al., 2011). Table 4 lists unclustered prophages and their closest annotated matches.

Table 4
www.frontiersin.org

Table 4. Similarity to previously characterized phages of prophage cluster H and prophages not clustered.

Prophage pattern correlated with the CRISPR1 clustering (Table 3; Figure 5b). Prophage clusters A and E were exclusive to GBS isolates of CRISPR1 cluster CC1-1–CC1-4; cluster B and O prophages to CC1–6 and CC1-7; prophage clusters D, F, J, and M to CC19-1, CC19-2, and CC19-3; cluster G prophages to CC23-1, K to CC12–1 and CC12-2; and prophages N and P to CC4–1 and CC4–2 or CC12-2, respectively; clusters C, H, and L spanned multiple CRISPR1 groups. No prophages were detected In CRISPR1 clusters CC17, singleton ST569, and singleton ST6.

Discussion

CRISPR plays a key role in adaptive immunity against foreign DNA sequences such as integrative and conjugative elements, phages, and plasmids (Lopez-Sanchez et al., 2012). We investigated the structure and content of CRISPR1 and CRISPR2 loci in a collection of 90 whole-genome sequenced colonizing GBS isolates from healthy Egyptian women. Our analysis confirmed the ubiquitous nature of CRISPR1 and revealed its high diversity and dynamism, whereas CRISPR2 was rare, less diverse, and less active, as previously reported (Lopez-Sanchez et al., 2012; Lier et al., 2015; Beauruelle et al., 2017; Beauruelle et al., 2021). In line with earlier findings (Lopez-Sanchez et al., 2012; Lier et al., 2015; Beauruelle et al., 2017; Beauruelle et al., 2021), we observed a strong congruence between CRISPR1-based genotyping and MLST, underscoring the utility of CRISPR typing in characterizing the population structure of vaginally carried GBS. The CRISPR1 spacer content efficiently reflected the GBS population structure. To assess the applicability of CRISPR typing in African GBS isolates more specifically, in silico PCR was performed, screening for the presence of the csn2 gene, which is exclusive to CRISPR-Cas type II-A systems (Makarova et al., 2020). Among 2835 S. agalactiae genomes, collected from nine different African countries, the type II-A system was easily detected in over 99% of genomes, demonstrating the widespread presence of this system and highlighting CRISPR typing as a robust tool for GBS population surveillance.

Lopez-Sanchez et al., 2012 (Lopez-Sanchez et al., 2012) demonstrated that CRISPR analysis not only enables GBS typing based on ancient spacers but also facilitates subtyping through the analysis of more recently acquired spacers, which is an even more effective method, as it allows clustering of isolates based on newer spacers. This precise subtyping may have significant epidemiological implications. For instance, we previously reported that serotype VI, a common lineage among our isolates, was almost exclusively associated with ST-14 (Shabayek et al., 2022). In the current study, we showed that all ST14 isolates, except one, clustered together based on CRISPR1 profiles. This isolate was the only ST14 isolate carrying serotype III and harboring both Alp2 and Epsilon surface proteins, whereas all other ST14 isolates carried serotype VI and harbored only the Epsilon surface protein. Furthermore, CRISPR1 clustering grouped ST28 isolates with a single ST1 isolate, all of these belonged to serotype II and harbored the Rib surface protein. Interestingly, all African-specific STs (486, 569, 932, 934) were clearly distinguishable from other lineages based on CRISPR1 profiles. Lopez-Sanchez et al. (2012) (Lopez-Sanchez et al., 2012) also suggested that CRISPR1 analysis could support subtyping according to geographical distribution. Therefore, CRISPR1-based subtyping may be useful for tracking regions specific GBS lineages and monitoring their global circulation.

CRISPR diversity is driven by spacer acquisition, deletion, and duplication, reflecting the activity of this adaptive immune system. In concordance with previous studies (Lopez-Sanchez et al., 2012; Lier et al., 2015; Beauruelle et al., 2017; Beauruelle et al., 2021), we observed that spacer acquisitions frequently occurred near the leader end of the CRISPR array. Consistent with Beauruelle et al. (2017) (Beauruelle et al., 2017), isolates showing polarized acquisitions patterns predominantly belonged to ST1. Additionally, previous studies (Deveau et al., 2008; Horvath et al., 2008; Lopez-Sanchez et al., 2012; Beauruelle et al., 2017) have shown that spacer deletions typically occur in the internal region of the array and are rarely found near the trailer end. Our findings support this, as most spacer deletions were observed in the internal region of the CRISPR locus. However, we also identified spacer deletions near the trailer end in a few ST23 isolates. Furthermore, we found that internal spacer duplication(s) were more frequent among the CC1 isolates—a pattern reported among CC23 isolates by Beauruelle et al (Beauruelle et al., 2017). Interestingly, identical spacer loci were observed only in CC17 isolates in our study, consistent with earlier findings (Lopez-Sanchez et al., 2012). This aligns with a study from 2017 (Beauruelle et al., 2017) that found no modifications in CRISPR1 loci among CC17 isolates. Similarly, Beauruelle et al. (2021) (Beauruelle et al., 2021) reported fewer spacer variations at the CRISPR1 locus among CC17 isolates, attributing this to the presence of highly homogenous and prevalent CC17 clones.

Spacer polymorphisms revealed varying degrees of heterogeneity among the ST and CC groups. In concordance with Lier et al. (2015) (Lier et al., 2015) and Beauruelle et al. (2021) (Beauruelle et al., 2021), we observed CRISPR1 clusters containing ST1, and to a lesser extent, those containing CC4, CC12, and CC19 were more heterogeneous, as evidenced by their separation into several subgroups. According to Da Cunha and colleagues (Da Cunha et al., 2014), this heterogeneity might be explained by a higher rate of recombination within certain CCs. In contrast, we found that CRISPR1 clusters containing the novel ST1954/CC17 clones and CC23 isolates were more homogenous. This finding is consistent with previous studies (Lier et al., 2015; Beauruelle et al., 2017; Beauruelle et al., 2021) which reported the lowest CRISPR1 polymorphism in CC17 followed by CC23. Moreover, Beauruelle et al. (2017) reported that CC23 isolates shared a relatively high number of common spacers. We observed a similar pattern of homogeneity in CRISPR1 clusters containing ST14 and the African-specific ST569. Beauruelle et al. (2021) and Lier et al. (2015) (Lier et al., 2015; Beauruelle et al., 2021) proposed that the level of spacer diversity in a CRISPR array is a useful indicator of the functional activity of the associated locus. CRISPR polymorphism also reflects a strain’s ability to acquire new spacers. Lier et al. (2015) (Lier et al., 2015) identified CC17 isolates as having significantly fewer spacers compared with other CCs.

Prophages contribute to virulence, pathogenicity, and the genomic diversity of GBS (Penadés et al., 2015; Davies et al., 2016).

They are typically acquired through horizontal gene transfer and play a major role in the acquisition of bacterial virulence factors. The integration and excision of prophages strongly influence evolution of pathogenic bacteria and may facilitate adaptation to new hosts and environmental challenges. However, the initiation of a phage lytic cycle through environmental stress kills the bacterial host, and the bacterial immunity system CRISPR-Cas provides sequence-specific immunity against phages. The presence of certain phages and the pattern of a specific phage content may thus be associated with distinct CRISPR profiles. Therefore, in addition to CRISPR profiling, the 90 Egyptian GBS isolates were analyzed to determine the presence and diversity of prophages. In total, 149 prophages were identified with most isolates harboring at least one integrated prophage and some genomes carrying multiple prophages. Among the 90 analyzed GBS isolates, 68 (∼76%) had at least one prophage integrated, with 13 isolates carrying two prophages, 3 isolates carrying three prophages, and 1 isolate carrying four prophages. Consistent with other studies on GBS prophages, we observed differences in prophage content between isolates of different STs (Figure 5a) (Salloum et al., 2011; van der Mee-Marquet et al., 2018). When analyzing the prophage content in relation to different CRISPR types, we also observed associations between certain prophages and specific CRISPR types, supporting the potential of CRISPR typing to efficiently differentiate distinct lineages within the GBS population. Among the prophage carriers, the highest prophage content was observed in ST19 (2.1 average prophages per genome) whereas the lowest was found in ST23 (0.4 prophages per genome). This variation may be associated with CRISPR locus activity, as an inverse correlation between prophage content and CRISPR spacer number has been reported in S. pyogenes (Nozawa et al., 2011). The integration of multiple prophages from different clusters into the bacterial genome may enhance genomic diversity, virulence, or fitness and appears to be linked to specific CRISPR profiles potentially conferring selective advantages to the host bacterium (Ramisetty and Sudhakari, 2019).

Vaccine development requires careful consideration of geographic variations in the population structure of GBS. The World Health Organization (WHO) has declared GBS vaccination among pregnant women as a priority in low- and middle-income countries (Seale et al., 2019). CRISPR typing offers a low-cost and simple alternative to MLST, as it requires analysis of only one locus, whereas MLST involves sequencing seven loci (Radtke et al., 2010; Sabat et al., 2013; Beauruelle et al., 2021). CRISPR typing has proven effective in evaluating the diversity and evolutionary dynamics of GBS vaginal carriage (Beauruelle et al., 2021). As previously suggested (Beauruelle et al., 2021), this method seems particularly valuable in middle- and low-income countries where data on the molecular characteristics of GBS isolates remain limited.

Conclusions

CRISPR1-based typing provides molecular-level insights into the population structure of GBS isolates rom vaginal carriage. In this study, we observed a strong congruence between MLST, CRISPR typing, and phage content in Egyptian GBS isolates. Compared with MLST, CRISPR typing appears to be a low-cost alternative with great discriminatory power, making it a valuable tool investigating GBS population dynamics. CRISPR1 diversity is driven by spacer acquisition, deletion, and duplication, reflecting the activity and adaptability of this protection system. Isolates with polarized acquisitions particularly those belonging to ST1 exhibited more heterogenous spacer profiles. In contrast, the novel ST1954/CC17 isolates showed a lower ability to acquire novel spacers and were the only clones with identical spacer profiles. These findings suggest that the evolution of CRISPR1 arrays may be linked to the dynamics of phylogenetic lineages. Given the widespread presence of CRISPR-Cas type II-A systems, CRISPR typing should be applicable to the vast majority of African GBS isolates. Notably, all African-specific STs in our study could be discriminated from other lineages. Subtyping based on the restricted phylogeographic distribution of specific lineages appears feasible. In summary, CRISPR1 typing is a simple and effective method to monitor the circulation of GBS clones at both regional and global scales.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ebi.ac.uk/ena, Supplementary Table S2.

Author contributions

SS: Methodology, Investigation, Conceptualization, Visualization, Validation, Writing – original draft, Writing – review & editing, Formal analysis, Data curation. DH: Formal analysis, Writing – original draft, Validation, Visualization, Data curation, Methodology, Investigation, Writing – review & editing. VV: Validation, Writing – review & editing, Software, Formal analysis, Writing – original draft, Methodology, Data curation, Investigation. UK: Validation, Formal analysis, Writing – review & editing, Data curation, Methodology. DJ: Data curation, Validation, Formal analysis, Project administration, Investigation, Writing – review & editing, Methodology, Funding acquisition. EJ: Data curation, Methodology, Formal analysis, Validation, Writing – review & editing. KL: Data curation, Methodology, Supervision, Validation, Writing – review & editing. SB: Supervision, Conceptualization, Project administration, Funding acquisition, Investigation, Writing – review & editing. BS: Methodology, Writing – review & editing, Writing – original draft, Supervision, Conceptualization, Project administration.

Funding

The author(s) declare financial support was received for the research and/or publication of this article. The Bill and Melinda Gates foundation funded the JUNO project. Whole genome sequencing of the GBS isolates was carried out within the JUNO project.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcimb.2025.1636071/full#supplementary-material

References

Altschul, S. F., Gish, W., Miller, W., Myers, E. W., and Lipman, D. J. (1990). Basic local alignment search tool. J. Mol. Biol. 215, 403–410. doi: 10.1016/s0022-2836(05)80360-2

PubMed Abstract | Crossref Full Text | Google Scholar

Bankevich, A., Nurk, S., Antipov, D., Gurevich, A. A., Dvorkin, M., Kulikov, A. S., et al. (2012). SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol.: J. Comput. Mol. Cell Biol. 19, 455–477. doi: 10.1089/cmb.2012.0021

PubMed Abstract | Crossref Full Text | Google Scholar

Barrangou, R. and Dudley, E. G. (2016). CRISPR-based typing and next-generation tracking technologies. Annu. Rev. Food Sci. Technol. 7, 395–411. doi: 10.1146/annurev-food-022814-015729

PubMed Abstract | Crossref Full Text | Google Scholar

Barrangou, R., Fremaux, C., Deveau, H., Richards, M., Boyaval, P., Moineau, S., et al. (2007). CRISPR provides acquired resistance against viruses in prokaryotes. Sci. (New York N.Y.) 315, 1709–1712. doi: 10.1126/science.1138140

PubMed Abstract | Crossref Full Text | Google Scholar

Beauruelle, C., Pastuszka, A., Horvath, P., Perrotin, F., Mereghetti, L., and Lanotte, P. (2017). CRISPR: A useful genetic feature to follow vaginal carriage of group B streptococcus. Front. Microbiol. 8. doi: 10.3389/fmicb.2017.01981

PubMed Abstract | Crossref Full Text | Google Scholar

Beauruelle, C., Treluyer, L., Pastuszka, A., Cochard, T., Lier, C., Mereghetti, L., et al. (2021). CRISPR typing increases the discriminatory power of streptococcus agalactiae typing methods. Front. Microbiol. 12. doi: 10.3389/fmicb.2021.675597

PubMed Abstract | Crossref Full Text | Google Scholar

Brokaw, A., Furuta, A., Dacanay, M., Rajagopal, L., and Adams Waldorf, K. M. (2021). Bacterial and host determinants of group B streptococcal vaginal colonization and ascending infection in pregnancy. Front. Cell. Infect. Microbiol. 11. doi: 10.3389/fcimb.2021.720789

PubMed Abstract | Crossref Full Text | Google Scholar

Capan, M., Mombo-Ngoma, G., Akerey-Diop, D., Basra, A., Würbel, H., Lendamba, W., et al. (2012). Epidemiology and management of group B streptococcal colonization during pregnancy in Africa. Wiener Klin. Wochenschrift 124 Suppl 3, 14–16. doi: 10.1007/s00508-012-0239-5

PubMed Abstract | Crossref Full Text | Google Scholar

Couvin, D., Bernheim, A., Toffano-Nioche, C., Touchon, M., Michalik, J., Néron, B., et al. (2018). CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins. Nucleic Acids Res. 46, W246–w251. doi: 10.1093/nar/gky425

PubMed Abstract | Crossref Full Text | Google Scholar

Da Cunha, V., Davies, M. R., Douarre, P. E., Rosinski-Chupin, I., Margarit, I., Spinali, S., et al. (2014). Streptococcus agalactiae clones infecting humans were selected and fixed through the extensive use of tetracycline. Nat. Commun. 5, 4544. doi: 10.1038/ncomms5544

Crossref Full Text | Google Scholar

Davies, E. V., Winstanley, C., Fothergill, J. L., and James, C. E. (2016). The role of temperate bacteriophages in bacterial infection. FEMS Microbiol. Lett. 363. doi: 10.1093/femsle/fnw015

Crossref Full Text | Google Scholar

Deltcheva, E., Chylinski, K., Sharma, C. M., Gonzales, K., Chao, Y., Pirzada, Z. A., et al. (2011). CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III. Nature 471, 602–607. doi: 10.1038/nature09886

PubMed Abstract | Crossref Full Text | Google Scholar

Deveau, H., Barrangou, R., Garneau, J. E., Labonté, J., Fremaux, C., Boyaval, P., et al. (2008). Phage response to CRISPR-encoded resistance in Streptococcus thermophilus. J. Bacteriol. 190, 1390–1400. doi: 10.1128/jb.01412-07

PubMed Abstract | Crossref Full Text | Google Scholar

Domelier, A. S., van der Mee-Marquet, N., Sizaret, P. Y., Héry-Arnaud, G., Lartigue, M. F., Mereghetti, L., et al. (2009). Molecular characterization and lytic activities of Streptococcus agalactiae bacteriophages and determination of lysogenic-strain features. J. Bacteriol. 191, 4776–4785. doi: 10.1128/jb.00426-09

PubMed Abstract | Crossref Full Text | Google Scholar

Horvath, P., Romero, D. A., Coûté-Monvoisin, A. C., Richards, M., Deveau, H., Moineau, S., et al. (2008). Diversity, activity, and evolution of CRISPR loci in Streptococcus thermophilus. J. Bacteriol. 190, 1401–1412. doi: 10.1128/jb.01415-07

PubMed Abstract | Crossref Full Text | Google Scholar

Jolley, K. A., Bray, J. E., and Maiden, M. C. J. (2018). Open-access bacterial population genomics: BIGSdb software, the PubMLST.org website and their applications. Wellcome Open Res. 3, 124. doi: 10.12688/wellcomeopenres.14826.1

PubMed Abstract | Crossref Full Text | Google Scholar

Jones, N., Bohnsack, J. F., Takahashi, S., Oliver, K. A., Chan, M. S., Kunst, F., et al. (2003). Multilocus sequence typing system for group B streptococcus. J. Clin. Microbiol. 41, 2530–2536. doi: 10.1128/jcm.41.6.2530-2536.2003

PubMed Abstract | Crossref Full Text | Google Scholar

Juhas, M. (2015). Horizontal gene transfer in human pathogens. Crit. Rev. Microbiol. 41, 101–108. doi: 10.3109/1040841x.2013.804031

PubMed Abstract | Crossref Full Text | Google Scholar

Khan, U. B., Dyster, V., Chaguza, C., van Sorge, N. M., van de Beek, D., Man, W. K., et al. (2024). Genetic markers associated with host status and clonal expansion of Group B Streptococcus in the Netherlands. Front. Microbiol. 15. doi: 10.3389/fmicb.2024.1410651

PubMed Abstract | Crossref Full Text | Google Scholar

Kunin, V., Sorek, R., and Hugenholtz, P. (2007). Evolutionary conservation of sequence and secondary structures in CRISPR repeats. Genome Biol. 8, R61. doi: 10.1186/gb-2007-8-4-r61

Crossref Full Text | Google Scholar

Letunic, I. and Bork, P. (2024). Interactive Tree of Life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. 52, W78–w82. doi: 10.1093/nar/gkae268

PubMed Abstract | Crossref Full Text | Google Scholar

Lier, C., Baticle, E., Horvath, P., Haguenoer, E., Valentin, A. S., Glaser, P., et al. (2015). Analysis of the type II-A CRISPR-Cas system of Streptococcus agalactiae reveals distinctive features according to genetic lineages. Front. Genet. 6. doi: 10.3389/fgene.2015.00214

PubMed Abstract | Crossref Full Text | Google Scholar

Lopez-Sanchez, M. J., Sauvage, E., Da Cunha, V., Clermont, D., Ratsima Hariniaina, E., Gonzalez-Zorn, B., et al. (2012). The highly dynamic CRISPR1 system of Streptococcus agalactiae controls the diversity of its mobilome. Mol. Microbiol. 85, 1057–1071. doi: 10.1111/j.1365-2958.2012.08172.x

PubMed Abstract | Crossref Full Text | Google Scholar

Madeira, F., Madhusoodanan, N., Lee, J., Eusebi, A., Niewielska, A., Tivey, A. R. N., et al. (2024). The EMBL-EBI Job Dispatcher sequence analysis tools framework in 2024. Nucleic Acids Res. 52, W521–w525. doi: 10.1093/nar/gkae241

PubMed Abstract | Crossref Full Text | Google Scholar

Makarova, K. S., Wolf, Y. I., Iranzo, J., Shmakov, S. A., Alkhnbashi, O. S., Brouns, S. J. J., et al. (2020). Evolutionary classification of CRISPR-Cas systems: a burst of class 2 and derived variants. Nat. Rev. Microbiol. 18, 67–83. doi: 10.1038/s41579-019-0299-x

PubMed Abstract | Crossref Full Text | Google Scholar

Nethery, M. A. and Barrangou, R. (2019). CRISPR Visualizer: rapid identification and visualization of CRISPR loci via an automated high-throughput processing pipeline. RNA Biol. 16, 577–584. doi: 10.1080/15476286.2018.1493332

PubMed Abstract | Crossref Full Text | Google Scholar

Nozawa, T., Furukawa, N., Aikawa, C., Watanabe, T., Haobam, B., Kurokawa, K., et al. (2011). CRISPR inhibition of prophage acquisition in Streptococcus pyogenes. PloS One 6, e19543. doi: 10.1371/journal.pone.0019543

PubMed Abstract | Crossref Full Text | Google Scholar

Penadés, J. R., Chen, J., Quiles-Puchalt, N., Carpena, N., and Novick, R. P. (2015). Bacteriophage-mediated spread of bacterial virulence genes. Curr. Opin. Microbiol. 23, 171–178. doi: 10.1016/j.mib.2014.11.019

PubMed Abstract | Crossref Full Text | Google Scholar

Project JUNO (2021). A global genomic survey of Streptococcus agalactiae (Hinxton: Wellcome Genome Campus). Available online at: https://www.gbsgen.net/.

Google Scholar

Radtke, A., Lindstedt, B. A., Afset, J. E., and Bergh, K. (2010). Rapid multiple-locus variant-repeat assay (MLVA) for genotyping of Streptococcus agalactiae. J. Clin. Microbiol. 48, 2502–2508. doi: 10.1128/jcm.00234-10

PubMed Abstract | Crossref Full Text | Google Scholar

Ramisetty, B. C. M. and Sudhakari, P. A. (2019). Bacterial ‘Grounded’ Prophages: hotspots for genetic renovation and innovation. Front. Genet. 10. doi: 10.3389/fgene.2019.00065

PubMed Abstract | Crossref Full Text | Google Scholar

Russell, L. B., Kim, S. Y., Cosgriff, B., Pentakota, S. R., Schrag, S. J., Sobanjo-Ter Meulen, A., et al. (2017). Cost-effectiveness of maternal GBS immunization in low-income sub-Saharan Africa. Vaccine 35, 6905–6914. doi: 10.1016/j.vaccine.2017.07.108

PubMed Abstract | Crossref Full Text | Google Scholar

Russell, H., Norcross, N. L., and Kahn, D. E. (1969). Isolation and characterization of Streptococcus agalactiae bacteriophage. J. Gen. Virol. 5, 315–317. doi: 10.1099/0022-1317-5-2-315

PubMed Abstract | Crossref Full Text | Google Scholar

Sabat, A. J., Budimir, A., Nashev, D., Sá-Leão, R., van Dijl, J., Laurent, F., et al. (2013). Overview of molecular typing methods for outbreak detection and epidemiological surveillance. Eur. Communicable Dis. Bull. 18, 20380. doi: 10.2807/ese.18.04.20380-en

Crossref Full Text | Google Scholar

Salloum, M., van der Mee-Marquet, N., Valentin-Domelier, A. S., and Quentin, R. (2011). Diversity of prophage DNA regions of Streptococcus agalactiae clonal lineages from adults and neonates with invasive infectious disease. PloS One 6, e20256. doi: 10.1371/journal.pone.0020256

PubMed Abstract | Crossref Full Text | Google Scholar

Seale, A. C., Baker, C. J., Berkley, J. A., Madhi, S. A., Ordi, J., Saha, S. K., et al. (2019). Vaccines for maternal immunization against Group B Streptococcus disease: WHO perspectives on case ascertainment and case definitions. Vaccine 37, 4877–4885. doi: 10.1016/j.vaccine.2019.07.012

PubMed Abstract | Crossref Full Text | Google Scholar

Shabayek, S., Vogel, V., Jamrozy, D., Bentley, S. D., and Spellerberg, B. (2022). Molecular epidemiology of group B streptococcus colonization in Egyptian women. Microorganisms 11. doi: 10.3390/microorganisms11010038

PubMed Abstract | Crossref Full Text | Google Scholar

Tamura, K., Stecher, G., and Kumar, S. (2021). MEGA11: molecular evolutionary genetics analysis version 11. Mol. Biol. Evol. 38, 3022–3027. doi: 10.1093/molbev/msab120

PubMed Abstract | Crossref Full Text | Google Scholar

Tettelin, H., Masignani, V., Cieslewicz, M. J., Donati, C., Medini, D., Ward, N. L., et al. (2005). Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial “pan-genome. Proc. Natl. Acad. Sci. USA 102, 13950–13955. doi: 10.1073/pnas.0506758102

PubMed Abstract | Crossref Full Text | Google Scholar

van der Mee-Marquet, N., Diene, S. M., Barbera, L., Courtier-Martinez, L., Lafont, L., Ouachée, A., et al. (2018). Analysis of the prophages carried by human infecting isolates provides new insight into the evolution of Group B Streptococcus species. Clin. Microbiol. Infect.: Off. Publ. Eur. Soc. Clin. Microbiol. Infect. Dis. 24, 514–521. doi: 10.1016/j.cmi.2017.08.024

PubMed Abstract | Crossref Full Text | Google Scholar

Verani, J. R., McGee, L., and Schrag, S. J. (2010). Prevention of perinatal group B streptococcal disease–revised guidelines from CDC, 2010. MMWR Recommendations Rep.: Morbid. Mortal. Weekly Rep. Recommendations Rep. 59, 1–36.

PubMed Abstract | Google Scholar

Wishart, D. S., Han, S., Saha, S., Oler, E., Peters, H., Grant, J. R., et al. (2023). PHASTEST: faster than PHASTER, better than PHAST. Nucleic Acids Res. 51, W443–w450. doi: 10.1093/nar/gkad382

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: GBS (Group B Streptococcus), CRISPR, Africa, epidemiology, phage (bacteriophage)

Citation: Shabayek S, Haider D, Vogel V, Khan UB, Jamrozy D, Jauneikaite E, LeDoare K, Bentley S and Spellerberg B (2025) CRISPR typing and phage content of colonizing Group B Streptococci from healthy Egyptian women. Front. Cell. Infect. Microbiol. 15:1636071. doi: 10.3389/fcimb.2025.1636071

Received: 27 May 2025; Accepted: 30 July 2025;
Published: 28 August 2025.

Edited by:

António Machado, University of the Azores, Portugal

Reviewed by:

Philippe Lanotte, Université de Tours, France
Caitlin Wiafe-Kwakye, University of Virginia, United States

Copyright © 2025 Shabayek, Haider, Vogel, Khan, Jamrozy, Jauneikaite, LeDoare, Bentley and Spellerberg. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Barbara Spellerberg, YmFyYmFyYS5zcGVsbGVyYmVyZ0B1bmlrbGluaWstdWxtLmRl

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.