ORIGINAL RESEARCH article
Genomic Insights Into Clinical Shiga Toxin-Producing Escherichia coli Strains: A 15-Year Period Survey in Jönköping, Sweden
- 1Division of Infectious Diseases, Department of Medicine Huddinge, Karolinska Institutet, Huddinge, Sweden
- 2State Key Laboratory of Infectious Disease Prevention and Control, National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China
- 3mEpiLab, School of Veterinary Science, Massey University, Palmerston North, New Zealand
- 4Division of Clinical Microbiology, Department of Laboratory Medicine, Karolinska Institutet, Huddinge, Sweden
- 5Department of Microbiology, School of Public Health, Southern Medical University, Guangzhou, China
- 6The Public Health Agency of Sweden, Solna, Sweden
- 7New Zealand Food Safety Science and Research Centre, School of Veterinary Science, Massey University, Palmerston North, New Zealand
- 8Laboratory Medicine, Jönköping Region County, Department of Clinical and Experimental Medicine, Linköping University, Jönköping, Sweden
- 9Division of Laboratory Medicine, Oslo University Hospital, Oslo, Norway
- 10Division of Laboratory Medicine, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
Shiga toxin-producing Escherichia coli (STEC) are important foodborne pathogens that can cause human infections ranging from asymptomatic carriage to bloody diarrhea (BD) and fatal hemolytic uremic syndrome (HUS). However, the molecular mechanism of STEC pathogenesis is not entirely known. Here, we demonstrated a large scale of molecular epidemiology and in-depth genomic study of clinical STEC isolates utilizing clinical and epidemiological data collected in Region Jönköping County, Sweden, over a 15-year period. Out of 184 STEC isolates recovered from distinct patients, 55 were from patients with BD, and 129 were from individuals with non-bloody stools (NBS). Five individuals developed HUS. Adults were more associated with BD. Serotypes O157:H7, O26:H11, O103:H2, O121:H19, and O104:H4 were more often associated with BD. The presence of Shiga toxin-encoding gene subtypes stx2a, stx2a + stx2c, and stx1a + stx2c was associated with BD, while stx1a was associated with milder disease. Multiplex virulence and accessory genes were correlated with BD; these genes encode toxins, adhesion, autotransporters, invasion, and secretion system. A number of antimicrobial resistance (AMR) genes, such as aminoglycoside, aminocoumarin, macrolide, and fluoroquinolone resistance genes, were prevalent among clinical STEC isolates. Whole-genome phylogeny revealed that O157 and non-O157 STEC isolates evolved from distinct lineages with a few exceptions. Isolates from BD showed more tendency to cluster closely. In conclusion, this study unravels molecular trait of clinical STEC strains and identifies genetic factors associated with severe clinical outcomes, which could contribute to management of STEC infections and disease progression if confirmed by further functional validation.
Shiga toxin-producing Escherichia coli (STEC) belong to a genetically and phenotypically diverse group of E. coli strains characterized by the production of one or more Shiga toxins (Stx) (Bryan et al., 2015). STEC may cause asymptomatic infection, bloody diarrhea (BD), and life-threatening hemolytic uremic syndrome (HUS) (Bruyand et al., 2018). The most predominant STEC serotype is O157:H7, which has been strongly associated with severe clinical symptoms (Preussel et al., 2013). However, non-O157 STEC strains are increasingly recognized as the main cause of sporadic cases or outbreaks worldwide, especially since 2011 when a STEC O104:H4 outbreak occurred in Germany and spread rapidly in European countries (Bielaszewska et al., 2011). A recent systematic review showed that non-O157 STEC are more common causes of acute diarrhea than the better-known O157 strains (Valilis et al., 2018). The most predominant non-O157 serogroups causing human infections are O26, O45, O103, O111, O121, and O145 in the United States, defined as the “big six” (Gould et al., 2013). These might differ from those of other countries and vary year to year. For example, O80 serogroup, along with O26, has emerged to become a predominant serogroup in 2015 among HUS patients reported in France (Ingelbeen et al., 2018). The latest European annual STEC surveillance report showed that 8,658 confirmed STEC cases were registered in 30 EU/EEA countries in 2018, among which Germany had the highest confirmed STEC cases (2,226), followed by Ireland (966) and Sweden (892) (European Centre for Disease Prevention and Control, 2020). It is noteworthy that a previous study estimated that the true annual number of STEC cases in Germany is 28,347, with a median of 4,969 cases due to O157 STEC and 22,019 cases due to non-O157 STEC, which is much higher than the number registered in European surveillance system (Kuehne et al., 2016). This is probably true for many other European countries as well.
Stx is the key virulence factor of STEC and comprises two types: Stx1 and Stx2 (corresponding genes are referred to as stx1 and stx2) (Bergan et al., 2012). The Stx1/Stx2 can further be classified into subtypes, among which Stx2a, Stx2c, and Stx2d were significantly associated with development of HUS, whereas other subtypes were linked to mild symptoms (Scheutz, 2014). Recently, the discovery of the new subtype Stx2k in diarrheal patients indicated the pathogenic potential of uncommon and emerging Stx subtypes (Yang et al., 2020). Besides Stx, other virulence factors also play a role in STEC pathogenicity. For example, intimin encoded by eae gene residing on the locus of enterocyte effacement (LEE) pathogenicity island plays a critical role in intestinal colonization. The LEE island encodes a type III secretion system (TTSS), which is responsible for the attaching and effacing (A/E) lesions on intestinal epithelia (Stevens and Frankel, 2014). The presence of both stx2 and eae is associated with a greater probability of triggering severe clinical symptoms (Werber et al., 2003). However, strains carrying stx and eae do not always cause a severe disease, suggesting the potential role of other virulence factors in the disease progression, which warrants further exploration.
The natural reservoir of STEC is the gastrointestinal tract of ruminant animals, particularly cattle. Human infection is commonly acquired through ingestion of contaminated beef, water, and vegetables or through contact with animals (Mughini-Gras et al., 2018). As the dose of infection is low, person-to-person transmission can also occur. STEC may be shed from the bowel of infected individuals after the resolution of symptoms. Previous studies have shown that patients tested stx positive for up to 256 days (Vonberg et al., 2013; Matussek et al., 2016). Long-term STEC carriers represent a chronic risk of person-to-person transmission; thus, their social and working life is legally restricted by the health authorities, posing a high psychological and socioeconomic burden. In Sweden, there is a legal requirement for at least one stx2-negative stool sample before children are allowed to return to kindergarten (Nolskog et al., 2017). In the United Kingdom, children below 5 years must not return to school until they have two negative stool cultures 24 h apart, while children older than 5 years must have 48 h of negative stool prior to returning to school (Walsh and Johnson, 2018). The possibility of finding molecular predictors for prolonged duration of shedding is of great importance to optimize control measures to limit the spread from person to person.
Previous studies have contributed to a deeper insight in the STEC infections in different populations in Europe, as well as a general understanding of pathogenic STEC strains associated with human infections (Buvens et al., 2012; Messens et al., 2015; Fierz et al., 2017; De Rauw et al., 2018; Nuesch-Inderbinen et al., 2018). We have previously reported STEC infection in adults over 10 years of age in Jönköping, Sweden, and we depicted the main virulence factors associated with clinical outcomes (Bai et al., 2018b). An earlier microarray analysis on targeted virulence factors of STEC isolates from children below 10 years of age showed that multiplex virulence genes were associated with BD compared with cases with non-BD, and several genes were associated with prolonged duration of bacterial shedding (Matussek et al., 2017). An in-depth analysis is essential to unveil the complete picture of how genetic factors correlate with clinical outcomes. In the current study, we investigated the molecular epidemiology of STEC isolates from infected individuals including diarrheal patients and contacts identified through contact tracing from April 2003 through 2017, in Region Jönköping County, Sweden. Whole-genome sequencing was done on all isolates to characterize the molecular features. Bacterial genomic data were analyzed in combination with the clinical and epidemiological data to explore the genetic factors that might be associated with severe clinical outcomes, and the phylogenetic relatedness of all isolates was assessed.
Materials and Methods
Collection of Shiga Toxin-Producing Escherichia coli Isolates and Metadata
In total, 195 STEC isolates were collected from diarrheal patients and individuals identified after tracing contacts of index cases from April 2003 through 2017 in Region Jönköping County, Sweden. Clinical data from STEC patients are collected through routine praxis used for the STEC surveillance performed in Region Jönköping County, Sweden. Formal consent is not required, as sample and data collection are part of the routine microbiological and contact tracing work in Region Jönköping County, Sweden.
Associations Between Age, Country of Infection, Duration of stx Shedding, and Bloody Diarrhea
The relationships between age and country of infection and the binary outcome variables BD (yes or no) and duration of stx shedding (>24 or ≤24 days) were assessed by considering these variables as covariates in mixed-effects logistic regression models, with outbreak group (OG) as a random effect to allow for the lack of statistical independence between cases belonging to the same outbreak. Models were fitted using the package lme4 in R (Bates et al., 2015).
Whole-Genome Sequencing and Assembly
For the 167 STEC isolates collected from April 2003 through 2014, bacterial genomic DNA was extracted using the EZ1 DNA Tissue Kit on EZ1 instrument (Qiagen, Hilden, Germany) according to the manufacturer’s protocol. DNA library preparation was done using Nextera chemistry (Illumina). The library was then pair-end (2 × 150 bp) sequenced using the Illumina HiSeq X platform (paired-end reads; read length 150 bp) at SciLifeLab (Stockholm, Sweden). For the 28 STEC isolates collected from 2015 through 2017, DNA was extracted using the MagDEA Dx SV reagent kit on magLEAD instrument (Precision System Science, Chiba, Japan). The DNA library preparation and purification were done using the AB Library Builder system and AMPure beads (Thermo Fisher Scientific, Waltham, MA, United States). The library was then sequenced using the Ion Torrent S5 XL platform (single-end reads; length 400 bp) at The Public Health Agency of Sweden as previously described (Lagerqvist et al., 2020). The quality of the raw reads was assessed with FastQC (version 0.11.8)1. Trimmomatic (version: 0.38) was used to trim the adapter sequences and low-quality bases (quality scores 10) from the beginning and end of the sequencing reads. Sequencing reads that were shorter than 30 bp were eliminated from further analyses (Bolger et al., 2014). The Illumina sequencing reads were de novo assembled with SKESA (version 2.3.0) with the adapter sequence mechanism disabled (Souvorov et al., 2018). The Ion Torrent sequencing reads were de novo assembled with SPAdes (version 3.12.0) in “careful mode” (Bankevich et al., 2012). The draft genome sequences were annotated with Prokka (version 1.11) (Seemann, 2014) using the built-in Escherichia-specific BLAST database. Eleven isolates failed in sequencing or yielded low-quality reads, which were excluded in subsequent analyses; thus, 184 STEC isolates were analyzed in this study.
Determination of stx Subtypes and Serotypes
The stx subtypes of STEC isolates were determined by ABRicate version 0.8.102 using default parameters. Briefly, an in-house stx subtyping database was created with ABRicate by integrating representative nucleotide sequences of all identified stx1 and stx2 subtypes, which included stx1 and stx2 subtypes previously reported by Scheutz et al. (2012); two recently identified stx2 subtypes, stx2h (Bai et al., 2018a) and stx2m (Yang et al., 2020); and one provisional subtype, stx2i (Lacher et al., 2016). The assemblies were then used to search against the stx subtyping database. Serotype was determined by comparing assemblies with the SerotypeFinder database using BLAST + v2.2.30 (Camacho et al., 2009).
Characterization of Virulence Factors and Antimicrobial Resistance Genes
The VFDB database3 was used for determination of virulence factors, and the CARD database4 was used for AMR factors. Gene presence/absence was determined using ABRicate version 0.8.10 with the following parameters: coverage ≥ 60% and identity ≥ 80%. The statistical association between virulence/AMR genes and isolate classifications was assessed with Fisher’s exact test using Statistica12 (StatSoft, Inc. Tulsa, OK, United States) for three classification models (age groups, clinical symptoms, and duration of stx shedding). Factors with corrected p-value (Benjamini–Hochberg) of association below 0.05 were considered statistically significant. For the clinical relevance of specific stx subtypes, Fisher’s exact test was done separately in different groups, with a p-value < 0.05 regarded as statistically significant.
Pan Genome-Wide Association Study
The pan genomes of 184 STEC isolates were calculated from the harmonized genome annotations produced by Prokka, using Roary5. The accessory genes were associated with clinical variables (age, clinical symptoms, and duration of stx shedding) using Scoary v1.6.16 (run with 1,000 permutation replicates) (Brynildsrud et al., 2016). Patterns of genes were reported as statistically significantly associated with a variable if they attained a Benjamini–Hochberg-corrected p-value of below 0.05. Given isolates from the same OG, which had an epidemiological link with each other, may introduce confounding, Scoary was further performed on a subset of isolates, including only the index case in each OG. In addition, multiple correspondence analysis (MCA) of pan genomes was performed using the gene presence/absence table generated from Roary. The R function MCA from R package FactoMineR was used for the analysis (Lê et al., 2008). The gene presence/absence table of all isolates was represented as categorical data. The total dimension of the data was reduced to 20 dimensions, and the first two dimensions that captured the highest variation were used to visualize the clustering. Default values were used for the rest of the parameters.
Whole-Genome Phylogenetic Analysis
The phylogenetic relationships of all 184 STEC isolates, together with six reference Escherichia coli genomes of predominant STEC serotypes (Supplementary Figure S1), were assessed by whole-genome multilocus sequence typing (wgMLST) and whole-genome phylogeny analysis. To define wgMLST allelic profiles, we used Fast-GeP6 (Zhang et al., 2018) with default settings, using the complete genome sequence of strain EDL933 (Acc. CP008957.1) as a reference. The whole-genome polymorphic sites-based phylogeny was inferred from the concatenated sequences of the coding sequences (CDSs) shared by all the whole-genome sequences. All the regions with elevated densities of base substitutions were eliminated, and a final maximum likelihood tree was generated by Gubbins (version 2.4.1) (Croucher et al., 2015) with default settings. The phylogenetic tree was annotated with relevant metadata using iTOL7.
The draft genome assemblies were submitted to GenBank under the BioProject PRJNA630106. Four strains included in this study have been previously described (Bai et al., 2019).
Shiga Toxin-Producing Escherichia coli Cases
Among 184 STEC isolates, 55 were from patients with BD, out of which four developed HUS. There were 129 isolates from individuals with non-bloody stools (NBS) including non-BD patients and asymptomatic carriers identified through contact contracting around an index case, out of which one developed HUS. Duration of stx shedding was available in 130 STEC-infected individuals. The median duration of stx shedding was 24 days (0–294 days), which was used to separate short (≤24 days) and long (> 24 days) duration of shedding. Sixty-two individuals exhibited a long duration of shedding, and 68 showed a short duration of shedding. Among the 184 STEC-infected individuals, 104 were children (<10 years old) and 80 were adults (≥10 years old). Contact tracing information was available for 73 out of 184 infected individuals, of which 32 were index cases. The index case and contacts, if any, were grouped into one OG; 48 OGs were assigned in total, of which 18 comprised more than one individual. After country of origin and OG were adjusted, adults were more associated with BD than children (odds ratio 4.8, 95% CI 1.2–18.9, p = 0.026), while children showed longer duration of stx shedding, which was not significant after adjusting for age, country of infection, and OG (p = 0.059) (Table 1). The travel history was available for all individuals, among which 123 were domestically infected while 61 were travel-associated. The possible source of infection was primarily from consumption of products from food-producing animals, such as milk, cheese, and sausage. The metadata of STEC isolates are shown in Supplementary Table S1.
Shiga Toxin-Producing Escherichia coli Serotypes and stx Subtypes in Correlation to Clinical Symptoms
In total, 51 different O:H serotypes were assigned among 184 STEC isolates. The most predominant serotypes were O157:H7 and O26:H11, comprising 34 isolates each, followed by O103:H2 and O121:H19, comprising 18 and 16 isolates, respectively (Figure 1 and Supplementary Table S1). The majority of isolates of serotypes O157:H7, O121:H19, and O104:H4 were correlated to BD (Figure 1). Two O104:H4 isolates, one O121:H19, one O98:H21, and one O157:H7 isolate were isolated from HUS patients.
Figure 1. Serotypes of clinical Shiga toxin-producing Escherichia coli (STEC) isolates in correlation with clinical symptoms. BD, patients with bloody diarrhea; NBS, individuals with non-bloody stools.
Seventeen different stx subtypes/combinations were identified, with stx1a (71 isolates), stx2a (27), stx2a + stx2c (20), and stx2c (11) being most predominant, followed by stx1c (9 isolates), stx1a + stx2c (8), stx1c + stx2b (8), and stx2b (8). The presence of stx2a, stx2a + stx2c, and stx1a + stx2c was statistically associated with BD, while stx1a was associated with NBS(p < 0.05) (Table 2). The presence of stx1a and stx2c was statistically associated with children group, while stx1a + stx2a was associated with the adults group (Table 2). No association was observed between stx types/subtypes and duration of stx shedding. Three out five isolates from HUS patients carried stx2a, one carried stx2a + stx2c, and one possessed stx1a. A correlation was observed between serotypes and stx subtypes. All 18 isolates of O103:H2 serotype carried the stx1a subtype; all 16 isolates of O121:H19 serotype carried the stx2a subtype, with one isolate carrying both stx2a and stx1a; 31 out of 34 O26:H11 isolates carried the stx1a subtype.
Table 2. Prevalence of stx types/subtypes in correlation with clinical symptomsa, age groupb, and duration of stx sheddingc.
Distribution of Virulence and Antimicrobial Resistance Factors in Clinical Shiga Toxin-Producing Escherichia coli Strains
STEC isolates from patients with BD demonstrated a high frequency of carriage for the greatest number of virulence genes. Using Fisher’s exact test corrected for multiple testing with the Benjamini–Hochberg-corrected p-value, we found 130 genes that were significantly associated with disease severity, among which 123 genes were positively associated with BD when compared with NBS, while seven genes were inversely associated with BD. These virulence factors can be classified into groups based on their functions: adhesin (eae, etpI, lifA, paa, and toxB), autotransporters (ehaA, espP, and upaC), toxins (stx1, stx2, astA, east1, and senB), invasion (tia), iron uptake factors, LEE-encoded TTSS and non-LEE-encoded TTSS effectors, hypothetical proteins, and other factors (Supplementary Table S2). However, no association was found between the virulence factors and prolonged duration of stx shedding or age groups.
Seventy AMR genes were identified in 184 STEC isolates, and 34 AMR genes were present in all isolates, which are associated with resistance to 10 classes of antimicrobial drugs, i.e., aminoglycoside, aminocoumarin, macrolide, fluoroquinolone, nucleoside, peptide, penam, tetracycline, fosfomycin, and nitroimidazole. Genes associated with resistance to fluoroquinolone antibiotic, such as crp, hns, acrB, marA, mdtM, and emrA were most commonly detected (Supplementary Table S3). Multidrug resistance gene emrE was statistically significantly associated with serotype O157:H7 (p < 0.001) (Table 3). fosA7 only existed in two O5:H9 strains, sat-1 and blaTEM–150 genes were only observed in one O35:H10 strain, and dfrA5 was only found in two O96:H19 strains (Supplementary Table S3).
Table 3. Prevalence of AMR genes in 184 clinical STEC isolates in correlation with clinical and bacterial variablesa.
Genome-Wide Association Study of Genetic Markers for Clinical Significance
The pan genome of 184 STEC isolates was composed of 26,474 genes. More than 800 accessory genes were found to be statistically significantly associated with BD, while no gene was statistically significantly associated with age groups or prolonged duration of stx shedding. Most of BD-associated genes encoded hypothetical proteins with unknown function (Supplementary Table S4a). Similarly, Scoary on a subset of 163 isolates including only index case from each OG showed that no accessory gene was statistically associated with age or duration of stx shedding, and multiple genes were statistically associated with BD compared with NBS (Supplementary Table S4b). MCA of pan genomes showed that all O157:H7 strains clustered together, while O26:H11 strains clustered closely with O103:H2, O121:H19, and other non-O157 serotypes. Five O117:H7 strains clustered separately from others. Isolates from BD and NBS groups scattered throughout different clusters except O117:H7 cluster, which included five strains from NBS (Figure 2). No clear separation was observed between groups of other clinical variables such as duration of stx shedding, age, and country of infection (data not shown).
Figure 2. Multiple correspondence analysis plot comparing pan genomes of 184 Shiga toxin-producing Escherichia coli (STEC) isolates. BD, patients with bloody diarrhea; NBS, individuals with non-bloody stools. Isolates from BD and NBS are indicated by the ring and square, respectively. The main serotypes were marked in different colors, as indicated.
A whole-genome phylogeny was constructed from alignment of concatenated CDSs of the 2,485 shared-loci found in all 184 STEC isolates and six Escherichia coli reference genomes representing main STEC serotypes (O157:H7, O145:H28, O111: H-, O26:H11, O104:H4, and O103:H2). The 184 STEC isolates were divided into two clades, O157 and non-O157, except that two O145:H28 isolates and one O8:H19 isolate clustered into O157 clade. Non-O157 clade further formed two subclades, with one comprising most isolates (136) and the other containing 11 isolates. Isolates with the same serotype clustered together. Within the O157:H7 clade, isolates carrying both stx2a and stx2c clustered closely (Supplementary Figure S1). Isolates from BD patients were mostly distributed in six clusters of predominant STEC serotypes, i.e., O157:H7, O26:H11, O121:H19, O103:H2, O111:H8, and O104:H4. However, isolates from patients with long duration of stx shedding scattered across different clusters. Out of 48 OGs, 18 included more than one isolate. Isolates from the same OG were grouped together, with four exceptions, i.e., OG55, OG16, OG53, and G5 (Figure 3).
Figure 3. Whole-genome phylogeny of Shiga toxin-producing Escherichia coli (STEC) isolates. Circular representation of the Gubbins phylogenetic tree generated from the concatenated sequences of the shared loci found in the wgMLST analysis. Gubbins tree was annotated with relevant metadata using iTOL. From the outer to inner circle, each represents serotypes, duration of bacterial shedding, clinical symptoms, and outbreak groups.
In Sweden, it has been mandatory to report human O157 STEC human infection since 1996 and all serotypes since July 2004. The number has been increasingly reported ever since all serotypes became mandatory to register in 2004 (Nolskog et al., 2017). To our knowledge, this is the first systematic genomic study of clinical STEC isolates since all serotypes have been reported in Sweden. Our study showed that non-O157 serotypes caused 81.5% of STEC infection in Region Jönköping County, Sweden, while O157 serotypes accounted for 18.5% of STEC cases, highlighting the clinical relevance of non-O157 STEC serotypes. We found that O157 and several predominant non-O157 serotypes, such as O26:H11, O103:H2, and O121:H19, were more often associated with severe symptoms such as BD. The incidence of HUS among STEC-infected individuals was 2.7% in this study. Among the five HUS cases, two were caused by serotype O104:H4, and the other three were caused by O157:H7, O121:H19, and O98:H21. This indicates a high pathogenic risk of non-O157 serotypes. It is noteworthy that two out of four O104:H4 isolates were from HUS patients who were part of the German outbreak in 2011 and clustered together with German outbreak strain 2011C-3493.
The presence of stx2a (with or without stx2c) is responsible for severe clinical outcomes such as HUS (Orth et al., 2007). It is noteworthy that one O98:H21 isolate from a HUS patient in this study carried only stx1a, indicating the potential risk associated with less virulent stx subtype strains. Notably, the presence of stx1a and stx2c was associated with children group, while stx1a + stx2a was associated with the adults group. Thus, the association between the adults group and BD observed in this study may be confounded by stx subtypes and other factors, for instance, the selection bias in the patients who seek care and were reported with STEC. Adults may be overrepresented in those who have BD, since adults are less likely to seek care for non-BD and other mild symptoms. Besides stx, 130 virulence factors were found to be associated with disease severity, among which 123 genes were overrepresented in isolates from patients with BD, while seven genes were related with NBS. Most BD-associated genes encoded TTSS, the best-characterized system disrupting host cell defense by delivering virulence factors into host cells (Pinaud et al., 2018). Intimin gene eae; TTSS-related genes espB, tir, map, and cesF; and regulation gene tir, which are commonly associated with high virulent O157:H7 serotype, were also distributed widely among non-O157 isolates in this study, suggesting a high pathogenicity of some non-O157 isolates. The EspF, which has emerged as a “Swiss army knife” of STEC pathogenesis, can target host mitochondria and the nucleolus, disrupt tight junctions, and induce hemorrhagic enteritis (Holmes et al., 2010). Interestingly, we found that espF was more frequent among non-O157 than O157 isolates, which was in accordance with a recent study (Baba et al., 2019). Notably, four isolates harbored heat-stable toxins (ST) encoding genes sta and stb, the virulence determinants for enterotoxigenic Escherichia coli (ETEC), exhibiting STEC/ETEC hybrid pathotype as previously reported (Bai et al., 2019). Further studies are warranted to understand the expression level of these genetic factors and their associations with disease severity.
The use of antibiotics is not recommended in STEC infections, as it may increase the risk of HUS development by inducing Stx production (Bielaszewska et al., 2012). However, AMR is a growing concern due to the widespread of E. coli resistant to all antibiotics used in human therapy and the dissemination of AMR genes through mobile genetic elements, which could lead to multiple drug resistance (MDR) (Oporto et al., 2019). In this study, 70 AMR genes were found in 184 clinical STEC isolates, and 25.7% genes were associated with MDR. The antibiotics resistance phenotype remains to be tested to understand the associations of AMR genotypes and phenotypes. For instance, most of the strains in this study carried beta-lactamase family gene ampC, which is widely distributed in Enterobacteriaceae. However, the chromosomal ampC genes are expressed constitutively at a low level in Escherichia coli and confers resistance to cefoxitin only when it is overproduced (Jaurin et al., 1981;Peter-Getzlaff et al., 2011). Fluoroquinolone resistance genes were most frequent (31.4%), which is noteworthy, as fluoroquinolones are commonly used to treat travelers’ diarrhea (Day et al., 2017). The emrE gene, which is related to macrolide antibiotic resistance, was overrepresented in isolates from short bacterial shedding and adults, although the difference was not statistically significant after Benjamini–Hochberg correction. A previous study showed that azithromycin, which belongs to ımacrolide antibiotic drug class, might be used safely to treat STEC O104:H4 long-term carriage (Nitschke et al., 2012). However, as the authors pointed out, this finding warrants confirmation for other STEC strains.
The whole-genome phylogeny demonstrated a distinctive evolutionary path between O157 and non-O157 STEC strains with a few exceptions; i.e., three O145:H28 strains clustered closely with O157 strains, which was in agreement with an earlier study showing that O145:H28 demonstrates a common evolutionary lineage with O157:H7 (Cooper et al., 2014). Interestingly, we found that one O8:H19 isolate from a non-BD patient also clustered in O157:H7 clade. These findings were supported by MCA of pan genomes showing that two O145:H28 strains and one O8:H19 strain clustered closely with O157:H7 strains. Further study is needed with more strains to understand if the O8:H19 strains share the same evolutionary path with O157:H7. We found that isolates from BD patients were mostly distributed in the six clusters comprising most predominant and clinical relevant serotypes. Isolates from the same OG clustered closely, indicating that they share a similar genome background. Isolates from long-term carriers scattered throughout different clusters, which to some extent support our findings that no bacterial genetic factor was significantly associated with duration of bacterial shedding. To the best of our knowledge, this is first study investigating the associations between bacterial genetic factors in the genome level and duration of bacterial shedding; our study implies that prolonged bacterial carriage in the gut might be more associated with host-related factors, which would be a valuable future work.
The current study has several limitations. The metadata such as duration of bacterial shedding and contact tracing of some individuals were lacking, which hampers a comprehensive understanding of associations between bacterial genetic factors and clinical traits. This study had selection bias in the patients who seek care and were reported with STEC. Adults are less likely to seek care for non-BD and, thus, may be overrepresented in those who have BD; this might confound the statistical associations between age of patients and bacterial genetic factors such as stx subtype. Moreover, statistical associations calculated between genetic factors in all STEC isolates and clinical variables might be skewed by isolates from the same OG, which had a close epidemiological link to each other. The statistical strategy in this study was different from that of our previous report where a naïve p-value was applied to find, as many as possible genes, potentially associated with clinical symptoms (Matussek et al., 2017). In this study, we used a corrected p-value to reduce the possibility of type I statistical errors (false-positive findings), which may result in a slight difference of associations observed in two studies. If we used the naïve p-value in this study, several genes would be associated with duration of shedding (data not shown), similar to the previous study (Matussek et al., 2017). Furthermore, the reference databases and cutoffs for determination of virulence and AMR genes vary among different studies; thus, the data from different studies should be compared and interpreted with caution.
To conclude, this study provides pathogenomic insight into clinical STEC isolates and unravels multiplex genetic factors associated with clinical outcomes. Further studies are needed to elucidate the functions of these genetic factors as well as their interactions with host in STEC disease progression. These knowledge could contribute to monitoring STEC infections and management of severe clinical outcomes.
Data Availability Statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.
Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent from the participants’ legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.
AM and XB designed the study. XB, JZ, and YH analyzed the data and drafted the manuscript. CJ, IH, and SM collected the clinical and epidemiological data. NF, AA, and SM contributed to the statistics analysis. SL, NF, and YX provided their expertise feedback. All the authors reviewed the draft and contributed significantly to the final manuscript.
This work was supported by grants from the Scandinavian Society for Antimicrobial Chemotherapy Foundation (SLS884041) and National Natural Science Foundation of China (81701977). All funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
We thank Olaf Dienus from the Department of Laboratory Medicine, Jönköping, Sweden, for laboratory work and general support in the project.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2021.627861/full#supplementary-material
- ^ https://github.com/s-andrews/FastQC
- ^ https://github.com/tseemann/abricate
- ^ http://www.mgc.ac.cn/VFs/
- ^ https://card.mcmaster.ca/
- ^ https://github.com/sanger-pathogens/Roary
- ^ https://github.com/jizhang-nz/fast-GeP
- ^ http://itol.embl.de
Baba, H., Kanamori, H., Kudo, H., Kuroki, Y., Higashi, S., Oka, K., et al. (2019). Genomic analysis of Shiga toxin-producing Escherichia coli from patients and asymptomatic food handlers in Japan. PLoS One 14:e0225340. doi: 10.1371/journal.pone.0225340
Bai, X., Mernelius, S., Jernberg, C., Einemo, I. M., Monecke, S., Ehricht, R., et al. (2018b). Shiga toxin-producing Escherichia coli infection in Jonkoping County, Sweden: occurrence and molecular characteristics in correlation with clinical symptoms and duration of stx shedding. Front. Cell Infect. Microbiol. 8:125. doi: 10.3389/fcimb.2018.00125
Bai, X., Zhang, J., Ambikan, A., Jernberg, C., Ehricht, R., Scheutz, F., et al. (2019). Molecular characterization and comparative genomics of clinical hybrid shiga toxin-producing and enterotoxigenic Escherichia coli (STEC/ETEC) strains in Sweden. Sci. Rep. 9:5619.
Bankevich, A., Nurk, S., Antipov, D., Gurevich, A. A., Dvorkin, M., Kulikov, A. S., et al. (2012). SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19, 455–477. doi: 10.1089/cmb.2012.0021
Bielaszewska, M., Idelevich, E. A., Zhang, W., Bauwens, A., Schaumburg, F., Mellmann, A., et al. (2012). Effects of antibiotics on Shiga toxin 2 production and bacteriophage induction by epidemic Escherichia coli O104:H4 strain. Antimicrob. Agents Chemother. 56, 3277–3282. doi: 10.1128/aac.06315-11
Bielaszewska, M., Mellmann, A., Zhang, W., Kock, R., Fruth, A., Bauwens, A., et al. (2011). Characterisation of the Escherichia coli strain associated with an outbreak of haemolytic uraemic syndrome in Germany, 2011: a microbiological study. Lancet Infect. Dis. 11, 671–676. doi: 10.1016/s1473-3099(11)70165-7
Bruyand, M., Mariani-Kurkdjian, P., Gouali, M., De Valk, H., King, L. A., Le Hello, S., et al. (2018). Hemolytic uremic syndrome due to Shiga toxin-producing Escherichia coli infection. Med. Mal. Infect. 48, 167–174.
Buvens, G., De Gheldre, Y., Dediste, A., De Moreau, A. I., Mascart, G., Simon, A., et al. (2012). Incidence and virulence determinants of verocytotoxin-producing Escherichia coli infections in the brussels-capital region, Belgium, in 2008-2010. J. Clin. Microbiol. 50, 1336–1345. doi: 10.1128/jcm.05317-11
Cooper, K. K., Mandrell, R. E., Louie, J. W., Korlach, J., Clark, T. A., Parker, C. T., et al. (2014). Comparative genomics of enterohemorrhagic Escherichia coli O145:H28 demonstrates a common evolutionary lineage with Escherichia coli O157:H7. BMC Genomics 15:17. doi: 10.1186/1471-2164-15-17
Croucher, N. J., Page, A. J., Connor, T. R., Delaney, A. J., Keane, J. A., Bentley, S. D., et al. (2015). Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins. Nucleic Acids Res. 43:e15. doi: 10.1093/nar/gku1196
Day, M., Doumith, M., Jenkins, C., Dallman, T. J., Hopkins, K. L., Elson, R., et al. (2017). Antimicrobial resistance in Shiga toxin-producing Escherichia coli serogroups O157 and O26 isolated from human cases of diarrhoeal disease in England, 2015. J. Antimicrob. Chemother. 72, 145–152. doi: 10.1093/jac/dkw371
De Rauw, K., Jacobs, S., and Pierard, D. (2018). Twenty-seven years of screening for Shiga toxin-producing Escherichia coli in a university hospital. Brussels, Belgium, 1987-2014. PLoS One 13:e0199968. doi: 10.1371/journal.pone.0199968
European Centre for Disease Prevention and Control (2020). Shiga Toxin/Verocytotoxin-Producing Escherichia coli (STEC/VTEC) Infection - Annual Epidemiological Report for 2018. ECDC: Surveillance Report. Sweden: European Centre for Disease Prevention and Control.
Fierz, L., Cernela, N., Hauser, E., Nuesch-Inderbinen, M., and Stephan, R. (2017). Characteristics of shigatoxin-producing Escherichia coli strains isolated during 2010-2014 from human infections in Switzerland. Front. Microbiol. 8:1471. doi: 10.3389/fmicb.2017.01471
Gould, L. H., Mody, R. K., Ong, K. L., Clogher, P., Cronquist, A. B., Garman, K. N., et al. (2013). Increased recognition of non-O157 Shiga toxin-producing Escherichia coli infections in the United States during 2000-2010: epidemiologic features and comparison with E. coli O157 infections. Foodborne Pathog. Dis. 10, 453–460.
Ingelbeen, B., Bruyand, M., Mariani-Kurkjian, P., Le Hello, S., Danis, K., Sommen, C., et al. (2018). Emerging shiga-toxin-producing Escherichia coli serogroup O80 associated hemolytic and uremic syndrome in France, 2013-2016: differences with other serogroups. PLoS One 13:e0207492. doi: 10.1371/journal.pone.0207492
Kuehne, A., Bouwknegt, M., Havelaar, A., Gilsdorf, A., Hoyer, P., Stark, K., et al. (2016). Estimating true incidence of O157 and non-O157 Shiga toxin-producing Escherichia coli illness in Germany based on notification data of haemolytic uraemic syndrome. Epidemiol. Infect. 144, 3305–3315. doi: 10.1017/s0950268816001436
Lacher, D. W., Gangiredla, J., Patel, I., Elkins, C. A., and Feng, P. C. (2016). Use of the Escherichia coli identification microarray for characterizing the health risks of Shiga toxin-producing Escherichia coli isolated from foods. J. Food Prot. 79, 1656–1662. doi: 10.4315/0362-028x.jfp-16-176
Lagerqvist, N., Lof, E., Enkirch, T., Nilsson, P., Roth, A., and Jernberg, C. (2020). Outbreak of gastroenteritis highlighting the diagnostic and epidemiological challenges of enteroinvasive Escherichia coli, County of Halland, Sweden, November 2017. Euro Surveill. 25:1900466.
Matussek, A., Einemo, I. M., Jogenfors, A., Lofdahl, S., and Lofgren, S. (2016). Shiga toxin-producing Escherichia coli in diarrheal stool of swedish children: evaluation of polymerase chain reaction screening and duration of shiga toxin shedding. J. Pediatric. Infect. Dis. Soc. 5, 147–151. doi: 10.1093/jpids/piv003
Matussek, A., Jernberg, C., Einemo, I. M., Monecke, S., Ehricht, R., Engelmann, I., et al. (2017). Genetic makeup of Shiga toxin-producing Escherichia coli in relation to clinical symptoms and duration of shedding: a microarray analysis of isolates from Swedish children. Eur. J. Clin. Microbiol. Infect. Dis. 36, 1433–1441. doi: 10.1007/s10096-017-2950-7
Messens, W., Bolton, D., Frankel, G., Liebana, E., Mc, L. J., Morabito, S., et al. (2015). Defining pathogenic verocytotoxin-producing Escherichia coli (VTEC) from cases of human infection in the European Union, 2007-2010. Epidemiol. Infect. 143, 1652–1661. doi: 10.1017/s095026881400137x
Mughini-Gras, L., Van Pelt, W., Van Der Voort, M., Heck, M., Friesema, I., and Franz, E. (2018). Attribution of human infections with Shiga toxin-producing Escherichia coli (STEC) to livestock sources and identification of source-specific risk factors, the Netherlands (2010-2014). Zoonoses Public Health 65, e8–e22.
Nitschke, M., Sayk, F., Hartel, C., Roseland, R. T., Hauswaldt, S., Steinhoff, J., et al. (2012). Association between azithromycin therapy and duration of bacterial shedding among patients with Shiga toxin-producing enteroaggregative Escherichia coli O104:H4. JAMA 307, 1046–1052. doi: 10.1001/jama.2012.264
Nuesch-Inderbinen, M., Morach, M., Cernela, N., Althaus, D., Jost, M., Mausezahl, M., et al. (2018). Serotypes and virulence profiles of Shiga toxin-producing Escherichia coli strains isolated during 2017 from human infections in Switzerland. Int. J. Med. Microbiol. 308, 933–939. doi: 10.1016/j.ijmm.2018.06.011
Oporto, B., Ocejo, M., Alkorta, M., Marimon, J. M., Montes, M., and Hurtado, A. (2019). Zoonotic approach to Shiga toxin-producing Escherichia coli: integrated analysis of virulence and antimicrobial resistance in ruminants and humans. Epidemiol. Infect. 147:e164.
Orth, D., Grif, K., Khan, A. B., Naim, A., Dierich, M. P., and Wurzner, R. (2007). The Shiga toxin genotype rather than the amount of Shiga toxin or the cytotoxicity of Shiga toxin in vitro correlates with the appearance of the hemolytic uremic syndrome. Diagn. Microbiol. Infect. Dis. 59, 235–242. doi: 10.1016/j.diagmicrobio.2007.04.013
Peter-Getzlaff, S., Polsfuss, S., Poledica, M., Hombach, M., Giger, J., Bottger, E. C., et al. (2011). Detection of AmpC beta-lactamase in Escherichia coli: comparison of three phenotypic confirmation assays and genetic analysis. J. Clin. Microbiol. 49, 2924–2932. doi: 10.1128/jcm.00091-11
Preussel, K., Hohle, M., Stark, K., and Werber, D. (2013). Shiga toxin-producing Escherichia coli O157 is more likely to lead to hospitalization and death than non-O157 serogroups–except O104. PLoS One 8:e78180. doi: 10.1371/journal.pone.0078180
Scheutz, F., Teel, L. D., Beutin, L., Pierard, D., Buvens, G., Karch, H., et al. (2012). Multicenter evaluation of a sequence-based protocol for subtyping Shiga toxins and standardizing Stx nomenclature. J. Clin. Microbiol. 50, 2951–2963. doi: 10.1128/jcm.00860-12
Vonberg, R. P., Hohle, M., Aepfelbacher, M., Bange, F. C., Belmar Campos, C., Claussen, K., et al. (2013). Duration of fecal shedding of Shiga toxin-producing Escherichia coli O104:H4 in patients infected during the 2011 outbreak in Germany: a multicenter study. Clin. Infect. Dis. 56, 1132–1140. doi: 10.1093/cid/cis1218
Werber, D., Fruth, A., Buchholz, U., Prager, R., Kramer, M. H., Ammon, A., et al. (2003). Strong association between shiga toxin-producing Escherichia coli O157 and virulence genes stx2 and eae as possible explanation for predominance of serogroup O157 in patients with haemolytic uraemic syndrome. Eur. J. Clin. Microbiol. Infect. Dis. 22, 726–730. doi: 10.1007/s10096-003-1025-0
Yang, X., Bai, X., Zhang, J., Sun, H., Fu, S., Fan, R., et al. (2020). Escherichia coli strains producing a novel Shiga toxin 2 subtype circulate in China. Int. J. Med. Microbiol. 310:151377. doi: 10.1016/j.ijmm.2019.151377
Keywords: Shiga toxin-producing Escherichia coli, whole genome sequencing, comparative genomics, bloody diarrhea, clinical outcomes, duration of bacterial shedding
Citation: Bai X, Zhang J, Hua Y, Jernberg C, Xiong Y, French N, Löfgren S, Hedenström I, Ambikan A, Mernelius S and Matussek A (2021) Genomic Insights Into Clinical Shiga Toxin-Producing Escherichia coli Strains: A 15-Year Period Survey in Jönköping, Sweden. Front. Microbiol. 12:627861. doi: 10.3389/fmicb.2021.627861
Received: 10 November 2020; Accepted: 05 January 2021;
Published: 05 February 2021.
Edited by:Narjol González-Escalona, United States Food and Drug Administration, United States
Reviewed by:Roger Stephan, University of Zurich, Switzerland
Joseph M. Bosilevac, United States Department of Agriculture (USDA), United States
Copyright © 2021 Bai, Zhang, Hua, Jernberg, Xiong, French, Löfgren, Hedenström, Ambikan, Mernelius and Matussek. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xiangning Bai, firstname.lastname@example.org
†These authors have contributed equally to this work