<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2023.1221668</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Precision metagenomics sequencing for food safety: hybrid assembly of Shiga toxin-producing <italic>Escherichia coli</italic> in enriched agricultural water</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Maguire</surname> <given-names>Meghan</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Ramachandran</surname> <given-names>Padmini</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/626945/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Tallent</surname> <given-names>Sandra</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/928710/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Mammel</surname> <given-names>Mark K.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/345956/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Brown</surname> <given-names>Eric W.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/175937/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Allard</surname> <given-names>Marc W.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/173563/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Musser</surname> <given-names>Steven M.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Gonz&#x000E1;lez-Escalona</surname> <given-names>Narjol</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/97881/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Center for Food Safety and Applied Nutrition, Office of Regulatory Science</institution>, <addr-line>College Park, MD</addr-line>, <country>United States</country></aff>
<aff id="aff2"><sup>2</sup><institution>Office of Applied Research and Safety Assessment, Food and Drug Administration</institution>, <addr-line>College Park, MD</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Alexandre Lamas, University of Santiago de Compostela, Spain</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Carlos Manuel Franco, University of Santiago de Compostela, Spain; Sabine Delannoy, Agence Nationale de S&#x000E9;curit&#x000E9; Sanitaire de l&#x00027;Alimentation, de l&#x00027;Environnement et du Travail (ANSES), France</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Narjol Gonz&#x000E1;lez-Escalona <email>narjol.gonzalez-escalona&#x00040;fda.hhs.gov</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>31</day>
<month>08</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1221668</elocation-id>
<history>
<date date-type="received">
<day>12</day>
<month>05</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>04</day>
<month>08</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2023 Maguire, Ramachandran, Tallent, Mammel, Brown, Allard, Musser and Gonz&#x000E1;lez-Escalona.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Maguire, Ramachandran, Tallent, Mammel, Brown, Allard, Musser and Gonz&#x000E1;lez-Escalona</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license> </permissions>
<abstract>
<p>Culture-independent metagenomic sequencing of enriched agricultural water could expedite the detection and virulotyping of Shiga toxin-producing <italic>Escherichia coli</italic> (STEC). We previously determined the limits of a complete, closed metagenome-assembled genome (MAG) assembly and of a complete, fragmented MAG assembly for O157:H7 in enriched agricultural water using long reads (Oxford Nanopore Technologies, Oxford), which were 10<sup>7</sup> and 10<sup>5</sup> CFU/ml, respectively. However, the nanopore assemblies did not have enough accuracy to be used in Single Nucleotide Polymorphism (SNP) phylogenies and cannot be used for the precise identification of an outbreak STEC strain. The present study aimed to determine the limits of detection and assembly for STECs in enriched agricultural water by Illumina MiSeq sequencing technology alone, followed by establishing the limit of hybrid assembly with nanopore long-read sequencing using three different hybrid assemblers (SPAdes, Unicycler, and OPERA-MS). We also aimed to generate a genome with enough accuracy to be used in a SNP phylogeny. The classification of MiSeq and nanopore sequencing identified the same highly abundant species. Using the totality of the MiSeq output and a precision metagenomics approach in which the <italic>E. coli</italic> reads are binned before assembly, the limit of detection and assembly of STECs by MiSeq were determined to be 10<sup>5</sup> and 10<sup>7</sup> CFU/ml, respectively. While a complete, closed MAG could not be generated at any concentration, a complete, fragmented MAG was produced using the SPAdes assembler with an STEC concentration of at least 10<sup>7</sup> CFU/ml. At this concentration, hybrid assembled contigs aligned to the nanopore-assembled genome could be accurately placed in a neighbor-joining tree. The MiSeq limit of detection and assembly was less sensitive than nanopore sequencing, which was likely due to factors including the small starting material (50 vs. 1 &#x003BC;g) and the dilution of the library loaded on the cartridge. This pilot study demonstrates that MiSeq sequencing requires higher coverage in precision metagenomic samples; however, with sufficient concentration, STECs can be characterized and phylogeny can be accurately determined.</p></abstract>
<kwd-group>
<kwd>foodborne pathogens</kwd>
<kwd><italic>Escherichia coli</italic></kwd>
<kwd>nanopore sequencing</kwd>
<kwd>short-read sequencing</kwd>
<kwd>pre-harvest agricultural water</kwd>
<kwd>metagenomics</kwd>
<kwd>hybrid assembly</kwd>
</kwd-group>
<counts>
<fig-count count="2"/>
<table-count count="5"/>
<equation-count count="0"/>
<ref-count count="52"/>
<page-count count="13"/>
<word-count count="9905"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Food Microbiology</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Precision metagenomics is an approach that customizes the analysis of a metagenomic sample for the detection and classification of a specific pathogen. The development of culture-independent methods for the detection of foodborne pathogens can expedite source tracking and reduce prospective corrective measures during outbreak scenarios (Loman et al., <xref ref-type="bibr" rid="B29">2013</xref>; Huang et al., <xref ref-type="bibr" rid="B23">2017</xref>; Brown et al., <xref ref-type="bibr" rid="B6">2019</xref>). For U.S. Food and Drug Administration (FDA)-designated zero-tolerance pathogens, such as <italic>Listeria monocytogenes</italic> and <italic>Salmonella</italic> spp., qPCR or metagenomic detection is sufficient to initiate microbiological isolate confirmation followed by regulatory action (Archer, <xref ref-type="bibr" rid="B3">2018</xref>; CFSAN<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref>). However, the pervasive nature of <italic>Escherichia coli</italic> necessitates further classification. There are more than 400 serotypes of Shiga toxin-producing <italic>E. coli</italic> (STECs) that can range in potential pathogenicity, as determined by the presence of a combination of <italic>stx</italic> (shiga toxin), <italic>eae</italic> (intimin), and other putative virulence genes (Kaper et al., <xref ref-type="bibr" rid="B25">2004</xref>; Garmendia et al., <xref ref-type="bibr" rid="B15">2005</xref>; Gonzalez-Escalona and Kase, <xref ref-type="bibr" rid="B18">2019</xref>; Gonzalez-Escalona et al., <xref ref-type="bibr" rid="B17">2019a</xref>,<xref ref-type="bibr" rid="B19">b</xref>; National Advisory Committee On Microbiological Criteria For Foods, <xref ref-type="bibr" rid="B36">2019</xref>). STECs are responsible for &#x0007E;2,400 hospitalizations per year causing severe diseases, ranging from diarrhea to hemolytic uremic syndrome (HUS) (Tarr et al., <xref ref-type="bibr" rid="B45">2005</xref>; Mellmann et al., <xref ref-type="bibr" rid="B33">2008</xref>; Scallan et al., <xref ref-type="bibr" rid="B42">2011</xref>; Beutin and Martin, <xref ref-type="bibr" rid="B5">2012</xref>).</p>
<p>Produce-related outbreaks have reportedly increased from 6% of all foodborne transmission in the 1990s to &#x0007E;18% in a survey from 2003 to 2014 (Sivapalasingam et al., <xref ref-type="bibr" rid="B43">2004</xref>; Fischer et al., <xref ref-type="bibr" rid="B13">2015</xref>). Agricultural water has been considered a potential contamination source due to adjacent land use, incomplete water sanitization, or wild animal activity (FDA;<xref ref-type="fn" rid="fn0002"><sup>2</sup></xref> Steele and Odumeru, <xref ref-type="bibr" rid="B44">2004</xref>; Monaghan and Hutchison, <xref ref-type="bibr" rid="B34">2012</xref>; Oliveira et al., <xref ref-type="bibr" rid="B38">2012</xref>; Allende and Monaghan, <xref ref-type="bibr" rid="B1">2015</xref>; Uyttendaele et al., <xref ref-type="bibr" rid="B47">2015</xref>). Current STEC detection protocols as outlined by the FDA Bacteriological Analytical Manual (BAM) Chapter 4A (Feng et al., <xref ref-type="bibr" rid="B12">2020</xref>) entail 24-h enrichment, qPCR detection of the <italic>stx1, stx2</italic>, and <italic>wzy</italic> genes of the O157 antigen, followed by several rounds of microbiological assays (TSAYE, TC-SMAC, and chromogenic agar plates), and single colony isolation used for whole-genome sequencing (WGS), which confirms the identity and pathogenic potential of the isolated STEC. This entire process can take &#x0007E;2 weeks of analysis time, which may exceed the shelf life of produce and, in particular, leafy greens.</p>
<p>To expedite the analysis time, we have investigated culture-independent sequencing methods for the detection and classification of STECs directly in agricultural water. Long-read nanopore sequencing classified the highly abundant microbial community (&#x0003E;1% read abundance) but failed to accurately detect reads belonging to O157 in unenriched agricultural water due to low concentrations (Maguire et al., <xref ref-type="bibr" rid="B31">2022</xref>). We have, therefore, suggested precision metagenomic analysis of enriched agricultural water instead, as the STEC concentration is increased to detectable levels by qPCR, according to the BAM Chapter 4A protocol. To investigate the use of long-read nanopore sequencing for precision metagenomics, it was necessary to establish the limits of detection and assembly of the technique (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>). However, detection is not sufficient for the classification of the STEC serotype or virulotype. The limit of nanopore complete, fragmented assembly of a metagenome-assembled genome (MAG) was equivalent to the limit of qPCR detection at &#x0007E;10<sup>5</sup> CFU/ml. At STEC concentrations above &#x0007E;10<sup>7</sup> CFU/ml, a complete, closed O157:H7 MAG was obtained with the chromosome in single contig and the plasmid in the second contig (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>).</p>
<p>Despite the benefits of nanopore sequencing, including affordability, portability, long reads, and real-time basecalling, the inherent error rate of nanopore sequencing (when using the fast-calling model) precludes the closed genomes from being used for phylogenic analysis. Short-read Illumina MiSeq sequencing technology, however, is highly accurate but creates a challenge in assembling highly repetitive regions, which can span several hundred base pairs (Bertrand et al., <xref ref-type="bibr" rid="B4">2019</xref>; Gonzalez-Escalona et al., <xref ref-type="bibr" rid="B17">2019a</xref>; Moss et al., <xref ref-type="bibr" rid="B35">2020</xref>). Hybrid assembly using nanopore long-read sequencing as scaffolds and MiSeq for accuracy improves the overall outcome and generates closed or fragmented MAGs with enough quality to be used in phylogeny analysis (Gonzalez-Escalona and Sharma, <xref ref-type="bibr" rid="B20">2020</xref>; Maguire et al., <xref ref-type="bibr" rid="B31">2022</xref>). However, as determined for long-read sequencing, the limits of detection and assembly for the Illumina MiSeq technology have yet to be determined as well. We expected the limits of detection and assembly of Illumina MiSeq to be different from nanopore for several reasons: (1) the starting material is small (1&#x02013;100 ng, depending on the DNA library preparation kit used vs. 1 &#x003BC;g for nanopore sequencing); (2) only a fraction of the library is used for sequencing, resulting in sampling bias (the entire material is used for nanopore sequencing instead); and (3) most MiSeq sequencing runs multiplex several samples (the exact level of detection or sequencing depth necessary for each sample has not been established). Therefore, we aimed to determine the limits of short-read Illumina MiSeq detection and MAG assembly using agricultural water artificially contaminated with STEC strain EDL933_2 (O157:H7), as previously determined for nanopore sequencing (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>). We further aimed to test the use of hybrid assemblies for the recovery of a complete or fragmented O157:H7 MAG. For this last part, we have evaluated the performance of three hybrid assembly software (Unicycler, OPERA-MS, and SPAdes).</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and methods</title>
<sec>
<title>Bacterial strains and media</title>
<p>We used a Shiga toxin-producing <italic>E. coli</italic> (STEC) EDL933 O157:H7 strain that was taken from our collection at CFSAN and is a variant of ATCC 43895 that has lost the <italic>stx2</italic> phage after several passages in the laboratory. We have termed this variant strain EDL933_2. EDL933_2 was grown overnight in static culture in tryptic soy broth (TSB) at 37&#x000B0;C.</p>
</sec>
<sec>
<title>Short-read metagenomic sequencing, contigs, assembly, and annotation</title>
<p>For the short-read metagenomic studies using the MiSeq instrument, we used the same DNA extracted during our previous publication (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>) and was used for determining both the detection and assembly limit for STEC EDL933_2 spiked into enrichments. The spiking experiment from that previous study was prepared as follows: 200 ml of an STEC-negative pre-harvest agricultural water sample was enriched according to the FDA BAM Chapter 4A protocol. An equal volume of 2&#x000D7; modified buffered peptone water with pyruvate (mBPWp) was added and incubated in static culture for 5 h at 37&#x000B0;C; then, an antimicrobial cocktail of acriflavin, cefsulodin, and vancomycin was added before an overnight static incubation (18&#x02013;24 h) at 42&#x000B0;C. An equal volume (1 ml) of overnight-enriched agricultural water was artificially contaminated with 10-fold serial dilutions of an overnight culture (10<sup>9</sup>-10<sup>5</sup> CFU/ml) of EDL933_2 diluted in TSB, resulting in a total of five samples (Water &#x0002B; Ecoli1&#x02013;5). Additionally, a sample consisting only of the enriched agricultural water (Water) was used as a negative control for the presence of EDL933_2. The number of CFUs in the stock EDL933_2 spiked culture was calculated by spreading dilution on tryptic soy agar (TSA) plates. The DNAs were extracted from 1 ml of each artificially contaminated sample, Water and Water &#x0002B; Ecoli1&#x02013;5, using the Maxwell RSC Cultured Cells DNA kit with a Maxwell RSC Instrument (Promega Corporation, Madison, WI) according to the manufacturer&#x00027;s instructions for Gram-negative bacteria with additional RNase treatment. DNA concentration was determined by a Qubit 4 Fluorometer (Invitrogen, Carlsbad, CA) according to the manufacturer&#x00027;s instructions. DNA quality was determined by the NanoDrop spectrophotometer (NanoDrop ND-1000 UV-Vis, Thermo Fisher Scientific, Waltham, MA) according to the manufacturer&#x00027;s instructions.</p>
<p>Short-read paired-end sequences were generated using Illumina MiSeq sequencing with a MiSeq V3 kit using 2 &#x000D7; 250 bp paired-end chemistry (Illumina Inc., San Diego, CA). DNA libraries were prepared using the DNA Prep kit with three samples in duplicate per cartridge according to the manufacturer&#x00027;s instructions. The starting material was 100 ng per sample, and 12 pmol was the final concentration loaded into the MiSeq instrument. A custom script (<xref ref-type="supplementary-material" rid="SM1">Supplementary Note</xref>) was developed to classify the reads by taxonomy using Centrifuge, similar to What&#x00027;s In My Pot (WIMP), as described with the long reads. The reads identified as <italic>E. coli</italic> by taxon number (see script) were extracted and saved in paired fastq files.</p>
<p><italic>De novo</italic> assembly of short reads alone was performed using the CLC Genomics Workbench (v20.0.2; Qiagen, Germantown, MD). Several <italic>de novo</italic> hybrid assemblies using both total and <italic>E. coli</italic> extracted (binned) long and short reads (from the same sample and dilution for both sequencing technologies) were generated using default parameters for SPAdes v3.13.1 (Antipov et al., <xref ref-type="bibr" rid="B2">2016</xref>), Unicycler v0.4.8 (Wick et al., <xref ref-type="bibr" rid="B50">2017</xref>), and OPERA_MS v19.07.01 (Bertrand et al., <xref ref-type="bibr" rid="B4">2019</xref>). The assembled contigs were classified by taxonomy by Kraken2 (Wood et al., <xref ref-type="bibr" rid="B51">2019</xref>) using GalaxyTrakr (Gangiredla et al., <xref ref-type="bibr" rid="B14">2021</xref>). The presence of the complete genome and synteny was checked using the Mauve genome aligner (Darling et al., <xref ref-type="bibr" rid="B11">2004</xref>) and compared to the reference genome generated previously using nanopore long reads (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>).</p>
</sec>
<sec>
<title>Closure of high-quality EDL933_2 genome by long- and short-read sequencing</title>
<p>For bioinformatic quality control purposes, we generated a high-quality closed genome of the strain (EDL933_2) used in the artificial contamination studies. The long-read sequencing as well as the assembly using only long reads were reported earlier (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>). The short reads were generated as above using a MiSeq Illumina sequencer. The high-quality EDL933_2 genome was generated by a hybrid assembly using long and short reads with Unicycler v0.4.8 (Wick et al., <xref ref-type="bibr" rid="B50">2017</xref>), as described previously (Maguire et al., <xref ref-type="bibr" rid="B32">2021b</xref>).</p>
</sec>
<sec>
<title><italic>In silico</italic> serotyping</title>
<p>Batch screening of the <italic>de novo</italic> assemblies was performed to analyze the major serotype present in each sample using Ridom SeqSphere&#x0002B; software v7.0.6 (Ridom, M&#x000FC;nster, Germany) using the genes deposited in the Center for Genomic Epidemiology (<ext-link ext-link-type="uri" xlink:href="http://www.genomicepidemiology.org/services/">http://www.genomicepidemiology.org/services/</ext-link>) for <italic>E</italic>. <italic>coli</italic> as part of their web-based tool, SerotypeFinder 2.0 (<ext-link ext-link-type="uri" xlink:href="https://cge.food.dtu.dk/services/SerotypeFinder/">https://cge.food.dtu.dk/services/SerotypeFinder/</ext-link>).</p>
</sec>
<sec>
<title><italic>In silico</italic> identification of virulence genes</title>
<p>The <italic>de novo</italic> assemblies were batch screened for virulence genes using Ridom SeqSphere&#x0002B; using the genes deposited in the NCBI Pathogen Detection Reference Gene Catalog (<ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/pathogens/isolates&#x00023;/refgene/">https://www.ncbi.nlm.nih.gov/pathogens/isolates&#x00023;/refgene/</ext-link>) and described in the study of Gonzalez-Escalona and Kase (<xref ref-type="bibr" rid="B18">2019</xref>).</p>
</sec>
<sec>
<title>Phylogenetic relationship of the strains by cgMLST analysis</title>
<p>The phylogenetic relationship of the strains was assessed by a core genome multilocus sequence typing (cgMLST) analysis using Ridom SeqSphere&#x0002B;. The genome of O157:H7 strain Sakai (NC_002695) was used as a reference. A cgMLST phylogenetic analysis of 4,200 O157:H7 genomes available at NCBI using 4,651 loci from Sakai showed that 17 strains clustered highly together to the strain EDL933_2 used in this study (this includes the four sample duplicates/strains and the EDL933_2 hybrid assemblies from this study). We used Nei&#x00027;s DNA distance method (Nei et al., <xref ref-type="bibr" rid="B37">1983</xref>) for calculating the matrix of genetic distance, taking into consideration only the number of same/different alleles in the core genes. A neighbor-joining (NJ) tree using pairwise ignoring missing values and the appropriate genetic distances was built after the cgMLST analysis. cgMLST uses the allele number of each locus for determining the genetic distance and builds the phylogenetic tree. The use of allele numbers reduces the influence of recombination in the dataset studied and allows for fast clustering determination of genomes.</p>
</sec>
<sec>
<title>Metagenomic data and high-quality EDL933_2 genome accession numbers</title>
<p>The Illumina metagenomic sequence data from this study are available in GenBank under BioProject number <ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="PRJNA639799">PRJNA639799</ext-link>. The genome for EDL933_2 was deposited under accession number <ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="CP120944">CP120944</ext-link>-<ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="CP120946">CP120946</ext-link>.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Microbial community identification by MiSeq sequencing</title>
<p>To be able to perform phylogenic and SNP analysis on the sequenced <italic>E. coli</italic> O157 strain, short-read sequencing by Illumina MiSeq technology was performed with the same DNA extractions used previously (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>). Artificial contamination of STEC-negative agricultural water enrichment with 10-fold dilutions of the <italic>E. coli</italic> O157:H7 EDL933_2 variant produced final concentrations of 7 &#x000D7; 10<sup>8</sup> CFU/ml (Water &#x0002B; Ecoli1) to 7 &#x000D7; 10<sup>4</sup> CFU/ml (Water &#x0002B; Ecoli5) plus a negative control sample of only enriched agricultural water (Water). As mentioned in the Materials and Methods section, each sample was run in duplicate in two different MiSeq sequencing runs, and the output resulted in an average of 8.7 million reads and 2.25 Gb yield per sample (<xref ref-type="table" rid="T1">Table 1</xref>). The sequenced reads for each sample were classified by taxonomy using Centrifuge (Kim et al., <xref ref-type="bibr" rid="B26">2016</xref>), which is the basis for the EPI2ME WIMP workflow used for the taxon classification of the nanopore reads (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>). We report the taxa accounting for &#x0003E;1% abundance of the total unique reads.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>MiSeq sequencing and assembly summary statistics.</p></caption> 
<table frame="box" rules="all">
<thead>
<tr style="background-color:&#x00023;919498;color:&#x00023;ffffff">
<th valign="top" align="left"><bold>Sample name</bold></th>
<th valign="top" align="center"><bold>EDL933_2 concentration (CFU/ml)</bold></th>
<th valign="top" align="center"><bold>Yield (Gb)</bold></th>
<th valign="top" align="center"><bold>Total reads</bold></th>
<th valign="top" align="center"><bold>Contigs</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Water A<xref ref-type="table-fn" rid="TN1"><sup>a</sup></xref></td>
<td valign="top" align="center">None</td>
<td valign="top" align="center">2.5</td>
<td valign="top" align="center">11,004,158</td>
<td valign="top" align="center">14,355</td>
</tr>
<tr>
<td valign="top" align="left">Water B<xref ref-type="table-fn" rid="TN1"><sup>a</sup></xref></td>
<td valign="top" align="center">None</td>
<td valign="top" align="center">1.6</td>
<td valign="top" align="center">6,933,448</td>
<td valign="top" align="center">12,974</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli1A<xref ref-type="table-fn" rid="TN1"><sup>a</sup></xref></td>
<td valign="top" align="center">7.00E&#x0002B;08</td>
<td valign="top" align="center">1.5</td>
<td valign="top" align="center">6,240,592</td>
<td valign="top" align="center">11,455</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli1B<xref ref-type="table-fn" rid="TN1"><sup>a</sup></xref></td>
<td valign="top" align="center">7.00E&#x0002B;08</td>
<td valign="top" align="center">1.8</td>
<td valign="top" align="center">8,243,832</td>
<td valign="top" align="center">10,015</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli2A<xref ref-type="table-fn" rid="TN1"><sup>a</sup></xref></td>
<td valign="top" align="center">7.00E&#x0002B;07</td>
<td valign="top" align="center">1.9</td>
<td valign="top" align="center">8,357,928</td>
<td valign="top" align="center">13,509</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli2B<xref ref-type="table-fn" rid="TN1"><sup>a</sup></xref></td>
<td valign="top" align="center">7.00E&#x0002B;07</td>
<td valign="top" align="center">2.1</td>
<td valign="top" align="center">9,562,322</td>
<td valign="top" align="center">11,235</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli3A<xref ref-type="table-fn" rid="TN2"><sup>b</sup></xref></td>
<td valign="top" align="center">7.00E&#x0002B;06</td>
<td valign="top" align="center">2.0</td>
<td valign="top" align="center">8,513,148</td>
<td valign="top" align="center">15,133</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli3B<xref ref-type="table-fn" rid="TN2"><sup>b</sup></xref></td>
<td valign="top" align="center">7.00E&#x0002B;06</td>
<td valign="top" align="center">2.1</td>
<td valign="top" align="center">8,772,570</td>
<td valign="top" align="center">15,138</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; E coli4A<xref ref-type="table-fn" rid="TN2"><sup>b</sup></xref></td>
<td valign="top" align="center">7.00E&#x0002B;05</td>
<td valign="top" align="center">2.3</td>
<td valign="top" align="center">9,936,638</td>
<td valign="top" align="center">13,851</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; E coli4B<xref ref-type="table-fn" rid="TN2"><sup>b</sup></xref></td>
<td valign="top" align="center">7.00E&#x0002B;05</td>
<td valign="top" align="center">2.2</td>
<td valign="top" align="center">9,561,540</td>
<td valign="top" align="center">13,431</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli5A<xref ref-type="table-fn" rid="TN2"><sup>b</sup></xref></td>
<td valign="top" align="center">7.00E&#x0002B;04</td>
<td valign="top" align="center">2.1</td>
<td valign="top" align="center">8,983,248</td>
<td valign="top" align="center">13,448</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli5B<xref ref-type="table-fn" rid="TN2"><sup>b</sup></xref></td>
<td valign="top" align="center">7.00E&#x0002B;04</td>
<td valign="top" align="center">2.1</td>
<td valign="top" align="center">9,125,196</td>
<td valign="top" align="center">13,486</td>
</tr></tbody>
</table>
<table-wrap-foot>
<p>Replicates for each sample are listed as A or B.</p>
<fn id="TN1">
<label>a</label>
<p>Samples in run 1.</p></fn>
<fn id="TN2">
<label>b</label>
<p>Samples in run 2.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>Centrifuge analysis of MiSeq short reads identified a highly diverse bacterial community. Most of the sample was comprised of the same nine highly abundant species as previously determined by nanopore and subsequent WIMP analysis but with slightly different proportions (<xref ref-type="fig" rid="F1">Figure 1</xref>). The duplicate MiSeq samples demonstrated approximately the same percent abundance. The nine bacterial species and their approximate proportions identified by MiSeq sequencing and Centrifuge analysis in the background water sample were <italic>Klebsiella pneumoniae</italic> (40%), <italic>Enterobacter cloacae</italic> (16%), <italic>Enterobacter kobei</italic> (7%), <italic>Enterobacter</italic> sp. ODB01 (6.5%), <italic>Enterobacter hormaechei</italic> (6%), <italic>Acinetobacter baumannii</italic> (5.8%), <italic>Citrobacter freundii</italic> (4.9%), <italic>Pseudomonas putida</italic> (3%), and <italic>Enterobacter xiangfangensis</italic> (1.9%). The percentage of species abundance (% of total reads) classified by Centrifuge was very similar between nanopore and MiSeq data for the water sample, except for <italic>K. pneumoniae</italic> and <italic>E. cloacae</italic> (<xref ref-type="fig" rid="F1">Figure 1</xref>).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Relative abundance of bacterial species associated with pre-harvest agricultural water with and without artificial <italic>Escherichia coli</italic> EDL933_2 contamination from MiSeq sequencing read output. Enriched pre-harvest agricultural water (Water) was artificially contaminated with 10-fold dilutions of <italic>E. coli</italic> EDL933_2 (&#x0002B;Ecoli) with a starting concentration of 7 &#x000D7; 10<sup>8</sup> CFU/ml (Water &#x0002B; Ecoli1). Extracted DNA was previously sequenced by nanopore, and bacterial species were identified using the EPI2ME WIMP workflow (Water Nanopore) (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>). MiSeq sequencing was performed in duplicate, and read output was classified by taxonomy using centrifuge (Kim et al., <xref ref-type="bibr" rid="B26">2016</xref>). Bacterial species contributing more than 1% of the unique identified reads are shown, and the sum of the remaining species identified is included as &#x0201C;Other.&#x0201D;</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1221668-g0001.tif"/>
</fig>
</sec>
<sec>
<title>Limit of detection and assembly using short-read MiSeq sequencing alone</title>
<p>To better understand the depth of sequencing and microbial community characterization of enriched agricultural water metagenomic samples, we aimed to establish the limits of detection and assembly of the MiSeq technology alone and in combination with the previous nanopore data. The agricultural water sample determined by the FDA BAM Chapter 4A method to be STEC-negative was confirmed as such by the Centrifuge taxonomy classification analysis of the short-read MiSeq output for duplicates of the water sample (WaterA and WaterB). There were 12,672 (0.17%) and 8,362 (0.18%) reads, respectively, belonging to generic <italic>E. coli</italic> in this sample, and none of these reads matched STEC O157:H7.</p>
<p>However, for the spiked samples, the percentage abundance of <italic>E. coli</italic> reads was proportional to the added STEC concentration, ranging from &#x0007E;60% in Water &#x0002B; Ecoli1 (A: 2,628,675 reads and B: 3,724,064 reads) to 0.2% in Water &#x0002B; Ecoli5 (A: 10,827 reads and B: 10,985 reads). While Centrifuge does not perform strain-level identification, we determined the limit of detection for <italic>E. coli</italic> O157 using MiSeq reads as samples containing the &#x0201C;<italic>Escherichia</italic> Stx1 converting phage.&#x0201D; In samples Water &#x0002B; Ecoli4A and B, 34 and 19 total reads were identified as <italic>Escherichia</italic> Stx1 converting phage, respectively. Therefore, the limit of detection of <italic>E. coli</italic> O157 in MiSeq reads was determined to be 10<sup>5</sup> CFU/ml when multiplexed. If only a single sample is run, then not only the detection limit but also the cost per sample will be higher.</p>
<p>To assess the limit of assembly, we closed the EDL933_2 genome obtained from the hybrid assembly of nanopore long reads and Illumina MiSeq short reads. We used this genome as a reference to ensure the accuracy of our <italic>in silico</italic> analyses to detect serotype (O157:H7) and virulence gene profiles, including <italic>stx1a</italic> and <italic>eae</italic> gamma-1 (<xref ref-type="table" rid="T2">Table 2</xref>). The genome assembly limit is the minimum number of reads necessary to produce either a complete, closed metagenome-assembled genome (MAG) with 20&#x000D7; coverage or a complete, fragmented genome that ensures accurate serotyping and a complete virulence profile. We performed a <italic>de novo</italic> assembly with the CLC Genomics Workbench on the total MiSeq read output, resulting in an average of 13,170 contigs per sample (<xref ref-type="table" rid="T1">Table 1</xref>). As expected, only the O9 serotype was detected in WaterA and WaterB samples, and no <italic>stx, eae</italic>, or additional virulence genes were found. The O157:H7 serotype, <italic>stx1a, eae</italic> gamma-1 genes, and complete virulotype were detected in samples Water &#x0002B; Ecoli1A and B and Water &#x0002B; Ecoli2A and B. At lower spiked levels, Water &#x0002B; Ecoli3A and B, only the un-spiked O9 serotype could be detected, and the <italic>stx</italic> and <italic>eae</italic> gene identification was lost. It was possible to detect some virulence genes but with inconsistent sample accuracy (<xref ref-type="table" rid="T2">Table 2</xref>). Therefore, the limit of assembly of MiSeq sequencing using all reads was determined to be 10<sup>7</sup> CFU/ml.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Serotype and virulence gene identification in <italic>de novo</italic> assembled contigs from Illumina MiSeq sequencing reads.</p></caption> 
<table frame="box" rules="all">
<thead>
<tr style="background-color:&#x00023;919498;color:&#x00023;ffffff">
<th valign="top" align="left"><bold>Sample<xref ref-type="table-fn" rid="TN3"><sup>a</sup></xref></bold></th>
<th valign="top" align="left"><bold>ST</bold></th>
<th valign="top" align="left"><bold>Serotype<xref ref-type="table-fn" rid="TN4"><sup>b</sup></xref></bold></th>
<th valign="top" align="left"><bold>Stx type</bold></th>
<th valign="top" align="left"><bold>Eae</bold></th>
<th valign="top" align="left"><bold><italic>ehxA</italic></bold></th>
<th valign="top" align="left"><bold><italic>espA</italic></bold></th>
<th valign="top" align="left"><bold><italic>espB</italic></bold></th>
<th valign="top" align="left"><bold><italic>espF</italic></bold></th>
<th valign="top" align="left"><bold><italic>espJ</italic></bold></th>
<th valign="top" align="left"><bold><italic>espK</italic></bold></th>
<th valign="top" align="left"><bold><italic>espP</italic></bold></th>
<th valign="top" align="left"><bold><italic>tccP</italic></bold></th>
<th valign="top" align="left"><bold><italic>etpD</italic></bold></th>
<th valign="top" align="left"><bold><italic>nleA</italic></bold></th>
<th valign="top" align="left"><bold><italic>nleB</italic></bold></th>
<th valign="top" align="left"><bold><italic>nleC</italic></bold></th>
<th valign="top" align="left"><bold><italic>tir</italic></bold></th>
<th valign="top" align="left"><bold><italic>katP</italic></bold></th>
<th valign="top" align="left"><bold><italic>pssA</italic></bold></th>
<th valign="top" align="left"><bold><italic>toxB</italic></bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">EDL933_2<xref ref-type="table-fn" rid="TN5"><sup>c</sup></xref></td>
<td valign="top" align="left">11</td>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="left">1a</td>
<td valign="top" align="left">Gamma-1</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left">Water A</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">O9</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left">Water B</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">O9</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli1A</td>
<td valign="top" align="left">11</td>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="left">1a</td>
<td valign="top" align="left">Gamma-1</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli1B</td>
<td valign="top" align="left">11</td>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="left">1a</td>
<td valign="top" align="left">Gamma-1</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli2A<xref ref-type="table-fn" rid="TN6"><sup>d</sup></xref></td>
<td valign="top" align="left">11</td>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="left">1a</td>
<td valign="top" align="left">Gamma-1</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli2B<xref ref-type="table-fn" rid="TN6"><sup>d</sup></xref></td>
<td valign="top" align="left">unk</td>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="left">1a</td>
<td valign="top" align="left">Gamma-1</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli3A</td>
<td valign="top" align="left">unk</td>
<td valign="top" align="left">O9</td>
<td valign="top" align="left">1a</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli3B</td>
<td valign="top" align="left">unk</td>
<td valign="top" align="left">O9</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr></tbody>
</table>
<table-wrap-foot>
<fn id="TN3">
<label>a</label>
<p>CFU/ml levels of EDL933_2 inoculation can be found in <xref ref-type="table" rid="T1">Table 1</xref>.</p></fn>
<fn id="TN4">
<label>b</label>
<p><italic>In silico</italic> serotype using genes defined by the Center for Genomic Epidemiology at the technical University of Denmark (DTU) (<ext-link ext-link-type="uri" xlink:href="https://cge.cbs.dtu.dk/services/SerotypeFinder/">https://cge.cbs.dtu.dk/services/SerotypeFinder/</ext-link>).</p></fn>
<fn id="TN5">
<label>c</label>
<p><italic>Escherichia coli</italic> O157:H7 EDL933_2 variant strain reference hybrid assembly for serotyping and virulotyping.</p></fn>
<fn id="TN6">
<label>d</label>
<p>Complete, fragmented genome assembly limit.</p></fn>
<p>unk&#x02014;incomplete profile.</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Precision metagenomic assembly limit using <italic>E. coli</italic>-binned MiSeq reads</title>
<p>We previously demonstrated that we were able to improve the limit of fragmented MAG assembly and generate a completely closed MAG by binning the nanopore reads belonging to a particular species, which we termed precision metagenomics (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>). This process used a custom Python script to extract the WIMP-identified &#x0201C;<italic>E. coli</italic>&#x0201D; reads into a separate fastq file used for <italic>de novo</italic> assembly. Therefore, we developed a similar custom script using the Centrifuge taxonomic classifier to extract the <italic>Escherichia</italic> MiSeq paired-end reads into two fastq files (R1 and R2; <xref ref-type="supplementary-material" rid="SM1">Supplementary Note</xref>). <italic>Escherichia</italic> identified reads decreased according to the spiked concentration, with 3,666,510 and 5,335,057 <italic>E. coli</italic> reads in Water &#x0002B; Ecoli1A and B samples, while samples Water &#x0002B; Ecoli4A/B and 5A/B contained similar read numbers than the negative control WaterA and B samples (<xref ref-type="table" rid="T3">Table 3</xref>). <italic>De novo</italic> assembly with the CLC Genomics Workbench produced between 319 and 2,884 contigs from the binned reads. Water &#x0002B; Ecoli1A and B and Water &#x0002B; Ecoli2A and B contigs produced assemblies with at least 48&#x000D7; coverage and 5.6 Mb, which is the size of the EDL933_2 genome (5.6 Mb). However, at lower spiked concentrations (Water &#x0002B; Ecoli3 and below), the coverage was below 10&#x000D7; and the complete genome could not be assembled (<xref ref-type="table" rid="T3">Table 3</xref>). Therefore, the limit of assembly using precision metagenomics was determined to be 10<sup>7</sup> CFU/ml, the same as using the total read output.</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p><italic>De novo</italic> assembly statistics from <italic>Escherichia coli</italic>-binned Illumina MiSeq sequencing reads.</p></caption> 
<table frame="box" rules="all">
<thead>
<tr style="background-color:&#x00023;919498;color:&#x00023;ffffff">
<th valign="top" align="left"><bold>Sample name<xref ref-type="table-fn" rid="TN7"><sup>a</sup></xref></bold></th>
<th valign="top" align="center"><bold><italic>Escherichia coli</italic> reads</bold></th>
<th valign="top" align="center"><bold>Contigs</bold></th>
<th valign="top" align="center"><bold>Coverage (X)<xref ref-type="table-fn" rid="TN8"><sup>b</sup></xref></bold></th>
<th valign="top" align="center"><bold>Total size (Mb)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Water A</td>
<td valign="top" align="center">45,312</td>
<td valign="top" align="center">388</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">0.48</td>
</tr>
<tr>
<td valign="top" align="left">Water B</td>
<td valign="top" align="center">69,430</td>
<td valign="top" align="center">319</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0.41</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli1A</td>
<td valign="top" align="center">3,666,510</td>
<td valign="top" align="center">358</td>
<td valign="top" align="center">157</td>
<td valign="top" align="center">5.51</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli1B</td>
<td valign="top" align="center">5,335,057</td>
<td valign="top" align="center">418</td>
<td valign="top" align="center">216</td>
<td valign="top" align="center">5.45</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli2A<xref ref-type="table-fn" rid="TN9"><sup>c</sup></xref></td>
<td valign="top" align="center">1,201,634</td>
<td valign="top" align="center">450</td>
<td valign="top" align="center">48</td>
<td valign="top" align="center">5.60</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli2B<xref ref-type="table-fn" rid="TN9"><sup>c</sup></xref></td>
<td valign="top" align="center">1,658,476</td>
<td valign="top" align="center">555</td>
<td valign="top" align="center">65</td>
<td valign="top" align="center">5.60</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli3A</td>
<td valign="top" align="center">137,880</td>
<td valign="top" align="center">2,852</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">4.35</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli3B</td>
<td valign="top" align="center">140,346</td>
<td valign="top" align="center">2,884</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">4.37</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli4A</td>
<td valign="top" align="center">65,145</td>
<td valign="top" align="center">662</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0.74</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli4B</td>
<td valign="top" align="center">61,901</td>
<td valign="top" align="center">613</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0.71</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli5A</td>
<td valign="top" align="center">56,493</td>
<td valign="top" align="center">353</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0.49</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli5B</td>
<td valign="top" align="center">57,789</td>
<td valign="top" align="center">377</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0.50</td>
</tr></tbody>
</table>
<table-wrap-foot>
<fn id="TN7">
<label>a</label>
<p>CFU/ml levels of EDL933_2 inoculation can be found in <xref ref-type="table" rid="T1">Table 1</xref>.</p></fn>
<fn id="TN8">
<label>b</label>
<p>Coverage is relative to the EDL933_2 variant strain genome size of &#x0007E;5.6 Mb.</p></fn>
<fn id="TN9">
<label>c</label>
<p>Complete, fragmented genome assembly limit.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Limit of assembly using the hybrid assembly of binned nanopore reads and binned MiSeq reads</title>
<p>We have previously demonstrated improved assembly output by using a hybrid assembly technique with a combination of nanopore long reads and MiSeq short reads (Gonzalez-Escalona and Sharma, <xref ref-type="bibr" rid="B20">2020</xref>). Numerous software programs have been developed for this purpose, and each can produce slightly different assemblies. Therefore, we tested three different hybrid assemblers: Unicycler (Wick et al., <xref ref-type="bibr" rid="B50">2017</xref>), SPAdes (Antipov et al., <xref ref-type="bibr" rid="B2">2016</xref>), and OPERA-MS (Bertrand et al., <xref ref-type="bibr" rid="B4">2019</xref>). Due to the large metagenomic input of the MiSeq reads, the hybrid assemblers with <italic>E. coli</italic>-binned nanopore reads and the total MiSeq output performed poorly (data not shown). Consequently, we used the <italic>E. coli</italic>-binned extracted MiSeq reads and binned nanopore reads to generate hybrid assemblies using the same three hybrid assemblers (<xref ref-type="table" rid="T4">Tables 4</xref>, <xref ref-type="table" rid="T5">5</xref>). <italic>In silico</italic> analysis of WaterA and B samples from each of the three assemblers detected only the O9 serotype (<xref ref-type="table" rid="T4">Table 4</xref>), and no virulence genes were identified (data not shown). For the spiked samples, the O157:H7 serotype, <italic>stx1a</italic> gene, and <italic>eae</italic> gamma-1 gene could be detected with a concentration as low as 10<sup>6</sup> CFU/ml assemblies generated with Unicycler and OPERA-MS. While the O157:H7 serotype could be identified at 10<sup>5</sup> CFU/ml with the SPAdes assembly, the <italic>stx</italic> and <italic>eae</italic> gene identification was less reliable between samples (<xref ref-type="table" rid="T5">Table 5</xref>).</p>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p>Spades hybrid assembly statistics using <italic>Escherichia coli</italic>-binned nanopore and MiSeq sequencing reads.</p></caption> 
<table frame="box" rules="all">
<thead>
<tr style="background-color:&#x00023;919498;color:&#x00023;ffffff">
<th valign="top" align="left"><bold>Sample<sup>a</sup></bold></th>
<th valign="top" align="left"><bold>ST</bold></th>
<th valign="top" align="left"><bold>Serotype</bold></th>
<th valign="top" align="left"><bold>Stx type</bold></th>
<th valign="top" align="left"><bold>Eae type</bold></th>
<th valign="top" align="left"><bold>Contig no. (O157 chromosome and plasmid)</bold></th>
<th valign="top" align="left"><bold>Percent EDL933_2 genome assembled</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">WaterA</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">O9</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">571</td>
<td valign="top" align="left">0%</td>
</tr>
<tr>
<td valign="top" align="left">WaterB</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">O9</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">472</td>
<td valign="top" align="left">0%</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli1A</td>
<td valign="top" align="left">11</td>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="left">1a</td>
<td valign="top" align="left">Gamma-1</td>
<td valign="top" align="left">484 (13 and 1)</td>
<td valign="top" align="left">100%</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli1B</td>
<td valign="top" align="left">11</td>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="left">1a</td>
<td valign="top" align="left">Gamma-1</td>
<td valign="top" align="left">256 (37 and 1)</td>
<td valign="top" align="left">100%</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli2A</td>
<td valign="top" align="left">11</td>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="left">1a</td>
<td valign="top" align="left">Gamma-1</td>
<td valign="top" align="left">313 (19 and 1)</td>
<td valign="top" align="left">100%</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli2B</td>
<td valign="top" align="left">11</td>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="left">1a</td>
<td valign="top" align="left">Gamma-1</td>
<td valign="top" align="left">353 (35 and 1)</td>
<td valign="top" align="left">100%</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli3A</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="left">1a</td>
<td valign="top" align="left">Gamma-1</td>
<td valign="top" align="left">559 (197 and 2)</td>
<td valign="top" align="left">95%</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli3B</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">O157:H7</td>
<td valign="top" align="left">1a</td>
<td valign="top" align="left">Gamma-1</td>
<td valign="top" align="left">572 (204 and 6)</td>
<td valign="top" align="left">95%</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli4A</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">O9</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">939</td>
<td valign="top" align="left">0%</td>
</tr>
<tr>
<td valign="top" align="left">Water &#x0002B; Ecoli4B</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">O9</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">938</td>
<td valign="top" align="left">0%</td>
</tr></tbody>
</table>
<table-wrap-foot>
<p><sup>a</sup>CFU/ml levels of EDL933_2 inoculation can be found in <xref ref-type="table" rid="T1">Table 1</xref>.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T5">
<label>Table 5</label>
<caption><p>Virulence gene identification in <italic>de novo</italic>, hybrid assembled contigs from binned nanopore reads and binned Illumina MiSeq sequencing reads.</p></caption> 
<table frame="box" rules="all">
<thead>
<tr style="background-color:&#x00023;919498;color:&#x00023;ffffff">
<th/>
<th valign="top" align="left"><bold>EDL933_2</bold></th>
<th valign="top" align="center" colspan="8"><bold>Unicycler</bold></th>
<th valign="top" align="center" colspan="8"><bold>Spades</bold></th>
<th valign="top" align="center" colspan="8"><bold>OPERA-MS</bold></th>
</tr>
<tr style="background-color:&#x00023;919498;color:&#x00023;ffffff">
<th/>
<th/>
<th valign="top" align="center" colspan="8"><bold>Water</bold> &#x0002B; <bold>Ecoli</bold></th>
<th valign="top" align="center" colspan="8"><bold>Water</bold> &#x0002B; <bold>Ecoli</bold></th>
<th valign="top" align="center" colspan="8"><bold>Water</bold> &#x0002B; <bold>Ecoli</bold></th>
</tr>
<tr style="background-color:&#x00023;919498;color:&#x00023;ffffff">
<th/>
<th/>
<th valign="top" align="left"><bold>1A</bold></th>
<th valign="top" align="left"><bold>1B</bold></th>
<th valign="top" align="left"><bold>2A</bold></th>
<th valign="top" align="left"><bold>2B</bold></th>
<th valign="top" align="left"><bold>3A</bold></th>
<th valign="top" align="left"><bold>3B</bold></th>
<th valign="top" align="left"><bold>4A</bold></th>
<th valign="top" align="left"><bold>4B</bold></th>
<th valign="top" align="left"><bold>1A</bold></th>
<th valign="top" align="left"><bold>1B</bold></th>
<th valign="top" align="left"><bold>2A</bold></th>
<th valign="top" align="left"><bold>2B</bold></th>
<th valign="top" align="left"><bold>3A</bold></th>
<th valign="top" align="left"><bold>3B</bold></th>
<th valign="top" align="left"><bold>4A</bold></th>
<th valign="top" align="left"><bold>4B</bold></th>
<th valign="top" align="left"><bold>1A</bold></th>
<th valign="top" align="left"><bold>1B</bold></th>
<th valign="top" align="left"><bold>2A</bold></th>
<th valign="top" align="left"><bold>2B</bold></th>
<th valign="top" align="left"><bold>3A</bold></th>
<th valign="top" align="left"><bold>3B</bold></th>
<th valign="top" align="left"><bold>4A</bold></th>
<th valign="top" align="left"><bold>4B</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><italic>astA</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>ehxA</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>espA</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>espB</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>espF</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>espJ</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>espP</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>tccP</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>etpD</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>gad</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>iha</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>iss</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>nleA</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>nleB</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>nleC</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>tir</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>katP</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>toxB</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>ecf1</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left">IEE</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>espK</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>pssA</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr>
<tr>
<td valign="top" align="left"><italic>air</italic></td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x0002B;</td>
<td valign="top" align="left">&#x02013;</td>
<td valign="top" align="left">&#x02013;</td>
</tr></tbody>
</table>
<table-wrap-foot>
<p>CFU/ml levels of EDL933_2 inoculation can be found in <xref ref-type="table" rid="T1">Table 1</xref>.</p>
</table-wrap-foot>
</table-wrap>
<p>Virulence gene identification, however, demonstrated marked differences in the three assemblies produced. <italic>In silico</italic> analysis of the Unicycler assembly was able to detect all of the virulence genes at a concentration of at least 10<sup>7</sup> CFU/ml, with the exception of Water &#x0002B; Ecoli1B, in which <italic>tccp</italic> was not found. Similarly, analysis of the SPAdes assembly indicated that a complete, fragmented MAG could be generated with a concentration of at least 10<sup>7</sup> CFU/ml. OPERA-MS, however, generated assemblies from which all virulence genes could not be identified consistently at any concentration (<xref ref-type="table" rid="T5">Table 5</xref>). Therefore, a limit of complete, fragmented assembly using a hybrid assembly of <italic>E. coli</italic>-binned nanopore reads and <italic>E. coli</italic>-binned MiSeq read output was established as 10<sup>7</sup> CFU/ml with the SPAdes assembler. A complete, closed MAG could not be generated at any concentration using any of the assemblers tested here.</p>
</sec>
<sec>
<title>Phylogenetic analysis of the SPAdes hybrid assemblies by cgMLST analysis</title>
<p>The phylogenetic relationship among the different hybrid assemblies obtained for each spiking level, that of strain EDL933_2, and other O157:H7 isolates was assessed by a cgMLST analysis using the genome of <italic>E. coli</italic> strain Sakai (NC_002695). The initial analysis failed because there were other <italic>E. coli</italic> contigs that interfered with the analysis (data not shown). To correct and eliminate these potential contigs present in the final assembly, we used the genome assembled using only the binned nanopore reads as a reference to re-organize the contigs from the hybrid assemblies using the Mauve aligner (Darling et al., <xref ref-type="bibr" rid="B11">2004</xref>). By using this method, we were able to correctly identify the contigs belonging to EDL 933_2 in the spiked hybrid assembly. The new phylogenetic analysis using the filtered EDL933_2 contigs showed that the hybrid assemblies for spiked levels above 10<sup>7</sup> CFU/ml were placed correctly in the phylogenetic tree (<xref ref-type="fig" rid="F2">Figure 2</xref>). Hybrid assemblies at 10<sup>6</sup> CFU/ml or below were too fragmented, and the phylogenetic analysis lost precision, with the selected contigs falling outside the EDL933_2_FDA cluster.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Neighbor-joining phylogenetic tree generated by a cgMLST analysis of the hybrid assemblies (HAs) for the different spiking levels and 4,200 other O157 <italic>Escherichia coli</italic> strains (available at NCBI). The genome of <italic>E. coli</italic> O157:H7 strain Sakai (NC_002695) was used as a reference. The final cgMLST analysis was based on 4,651 shared loci among those strains. The HA for the different spiking levels in this study is in red.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fmicb-14-1221668-g0002.tif"/>
</fig></sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>Most food safety laboratories are relying heavily on different methods of recovery and identification of foodborne pathogens to facilitate the fast identification and source tracking of specific strains during foodborne outbreaks. Among those foodborne outbreaks of high importance are those caused by STECs in produce. Due to the importance of agricultural water to produce and food safety, accurate detection and classification of STECs potentially present is of major importance, especially during a foodborne outbreak. Current methods for strain identification (usually by SNP or MLST analysis) during a foodborne outbreak require sequencing of pure strains by either short (usually Illumina) or long reads (usually PacBio or Oxford Nanopore Technologies). However, to obtain a single pure colony strain, there are several steps, such as initial detection [usually by real-time PCR (qPCR)] and extensive selective plating, before a single colony can be obtained. This is a time-consuming process that only provides confirmation of an isolate after almost 2 weeks of labor, when the associated contaminated product might be out of the market chain. By combining qPCR and short-read and long-read metagenomic analysis of the enrichment, we can definitively detect an STEC isolate and characterize its virulence potential in 3&#x02013;4 days. While this will not replace eventual confirmation by microbiological methods, this reduces the time for a prospective corrective measure by a complete week.</p>
<p>A previous study has determined the limits of detection and assembly for STECs in enriched water using long reads (nanopore) (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>). However, the same limits for Illumina MiSeq have not been determined yet. Our first goal from this study was to determine those. The importance of these determinations is paramount to evaluating the feasibility of the use of any technique to support outbreak investigations. Short reads have been used for culture-independent surveillance and have shown promising results in retrieving STEC, <italic>Salmonella</italic>, and <italic>L. monocytogenes</italic> genomes in as short as 24 h (Leonard et al., <xref ref-type="bibr" rid="B27">2015</xref>, <xref ref-type="bibr" rid="B28">2016</xref>; Ottesen et al., <xref ref-type="bibr" rid="B39">2020</xref>; Saltykova et al., <xref ref-type="bibr" rid="B41">2020</xref>; Townsend et al., <xref ref-type="bibr" rid="B46">2020</xref>; Buytaers et al., <xref ref-type="bibr" rid="B7">2021</xref>; Commichaux et al., <xref ref-type="bibr" rid="B9">2021</xref>; Zhang et al., <xref ref-type="bibr" rid="B52">2021</xref>; Vorimore et al., <xref ref-type="bibr" rid="B48">2023</xref>). However, those genomes were in high concentration in the sample (&#x0007E;80%&#x02212;90%), indicating that, when levels are &#x0007E;10<sup>8</sup> CFU/ml or higher, short reads will perform correctly (the target strain will be easily identified if only one strain is present). The main pitfall of the use of those short reads will be that the genome assembly will still be composed of many non-contiguous contigs, and many important traits or markers of the genome might be lost.</p>
<p>In this study, we started by taxonomically classifying the microorganisms present in the enriched water sample used as a negative control for the presence of STEC O157:H7 using short reads (Illumina MiSeq). In general, the proportions were similar to what was obtained previously for the same samples using long reads (nanopore) (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>), except for two microorganisms (<italic>K. pneumoniae</italic> and <italic>E. cloacae</italic>). This was unexpected, and the explanations could be as follows: (1) Illumina reads are shorter, which might lead to misclassification and overrepresentation of certain taxa; (2) while both analyses use Centrifuge, the RefSeq databases used might include some differences in classification and could contribute to the different classifications; (3) MiSeq loads only part of the library; (4) the efficiency in library preparations can vary with the GC content of the microorganisms; and (5) since most taxonomy classifiers rely on read numbers matching to the organism, in the case of short reads, some bigger bacterial genomes (&#x0003E;5 Mb) might appear to be more represented in the sample than smaller genomes. There is a need for further experiments to test the accuracy of strain-level identification by Centrifuge in metagenomics samples between nanopore and MiSeq sequencing outputs.</p>
<p>We continued by determining empirically the detection and assembly limits of the STEC strain EDL933_2 using Illumina MiSeq reads. The main difference with the experiment in our previous study, using long reads (nanopore), was that we multiplexed six samples per MiSeq run instead of a single sample per run. This allows us to perform sequencing of the same sample per duplicate per run. The fact that we used six samples per MiSeq run was because the limit of detection or assembly from metagenomic samples using that equipment was currently unknown. MiSeq detection limit using all reads was determined to be 10<sup>5</sup> CFU/ml for STECs (when using six samples in a V3 cartridge), and the assembly limit was &#x0007E;10<sup>7</sup> CFU/ml to obtain a fragmented MAG. Even after binning the reads matching to <italic>E. coli</italic>, the assembly limit remained the same, although with a lower number of contigs (<xref ref-type="table" rid="T1">Tables 1</xref>, <xref ref-type="table" rid="T3">3</xref>). When using a single sample per run, the estimated detection and assembly limits should therefore be six times higher than what we observed. The cost for a nanopore run is approximately USD$ 1,100, while an Illumina MiSeq run without multiplexing is approximately USD$ 1,500. All these variables (price, limits of detection, and assembly) should be taken into consideration when planning metagenomic studies using a MiSeq platform, specifically if multiplexing samples are used in a single run.</p>
<p>Assembling STEC genomes is a very complicated matter, and if the STECs are in mixed culture, it is even more difficult (Buytaers et al., <xref ref-type="bibr" rid="B7">2021</xref>; Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>; Jaudou et al., <xref ref-type="bibr" rid="B24">2022</xref>). We have previously shown that a completely closed STEC O157:H7 MAG can be recovered from enriched samples by sequencing using long reads (nanopore) at 10<sup>7</sup> CFU/ml and higher levels (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>). However, this completely closed MAG was not of high quality to be used for high-precision SNP analysis that could reveal the correct placement of the genome in a phylogenetic tree (results not shown). Therefore, we intended to test whether, by using a hybrid assembly approach, we could obtain similar results (completely closed MAG from a metagenomic sample) and with enough quality to be used for SNP analysis for source tracking as commonly used in outbreak investigations (Hoffmann et al., <xref ref-type="bibr" rid="B22">2016</xref>; Crowe et al., <xref ref-type="bibr" rid="B10">2017</xref>; Brown et al., <xref ref-type="bibr" rid="B6">2019</xref>; Saltykova et al., <xref ref-type="bibr" rid="B41">2020</xref>; Haendiges et al., <xref ref-type="bibr" rid="B21">2021</xref>; Vorimore et al., <xref ref-type="bibr" rid="B48">2023</xref>). For the hybrid assembly, we selected three known software (SPAdes) and pipelines (Unicycler and OPERA-MS). As mentioned in the Results section, neither of the assemblers performed adequately to meet our needs (recovery of the completely closed or fragmented O157:H7 STEC MAG) when using the complete datasets using default parameters. That is not to say that they performed inadequately for MAG hybrid assemblies overall for the entire metagenomic sample. That was not part of our tests. Considering that, the binned <italic>E. coli</italic> reads extracted from both long and short-read datasets were used instead for testing the performance of the hybrid assemblers. They performed with varying degrees of effectivity with SPAdes producing the better outcome (<xref ref-type="table" rid="T5">Table 5</xref>). They all showed the same limit of assembly (10<sup>7</sup> CFU/ml) but failed to recover a completely closed MAG for any of the spiking levels. This was contrary to what was observed when using long reads alone (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>), where a completely closed O157:H7 STEC MAG was recovered for spiking levels above 10<sup>7</sup> CFU/ml. A similar observation when using hybrid assemblies for metagenomic sequences was observed for <italic>L. monocytogenes</italic> in enrichments from ice cream (Commichaux et al., <xref ref-type="bibr" rid="B9">2021</xref>), where the hybrid assemblers using short reads as the first step of the assembly resulted in more accurate assemblies but were more fragmented and failed to recover a completely closed genome.</p>
<p>Another important consideration is that OPERA-MS requires 4 Gb of output data for samples with low complexity and 10 Gb for samples with high complexity. In our case, we had &#x0007E;2 Gb per sample, which might explain why our OPERA-MS assemblies were the worst performers of the three tested ones, suggesting that a single sample should be run in a MiSeq system if a better hybrid assembly is necessary to obtain better or more complete MAG assemblies. However, a different system with higher output per sample could be used (i.e., NextSeq 2000 or another short-read sequencer) to fulfill the software pipeline requirements, albeit with higher expenses per run. If a simple estimation of taxonomic diversity is desired, the MiSeq approach might suffice if lower depth per sample is not a problem.</p>
<p>One of the reasons why we picked OPERA-MS for the hybrid assembly was that the authors claimed that their pipeline was able to detect strains of the same organism in the metagenomic sample (Bertrand et al., <xref ref-type="bibr" rid="B4">2019</xref>). This is a very important and attractive feature that is desired when performing metagenomic analysis of enriched food or water samples. When food samples or water samples are analyzed, single colonies are picked and then tested for specific targets and sequenced (Walters et al., <xref ref-type="bibr" rid="B49">2013</xref>). This process might miss some other co-occurring strains present in the sample, and those strains might be of lower prevalence (Commichaux et al., <xref ref-type="bibr" rid="B9">2021</xref>). However, Unicycler was not designed for the assembly of metagenomic samples (Wick et al., <xref ref-type="bibr" rid="B50">2017</xref>), but since we extracted the <italic>E. coli</italic> reads from both long- and short-read sequencing data for each individual sample, we expected it to perform successfully when used for a single organism.</p>
<p>Most of the current use of WGS in bacterial-associated diseases is focused on the sequencing of isolated microorganisms from selective culture plates (Hoffmann et al., <xref ref-type="bibr" rid="B22">2016</xref>; Gobin et al., <xref ref-type="bibr" rid="B16">2018</xref>). Most clinical and diagnostic laboratories are moving toward the use of culture-independent diagnostic tests (CIDTs) to quickly identify within minutes the etiological agent causing the illness. However, contrary to the culture method, which takes longer and requires more effort, a physical isolate is not produced at the end of the diagnosis. The lack of a physical isolate dramatically affects how public health agencies can properly identify potential outbreak clusters in outbreak investigations. This, in turn, has a negative impact on public health and increases the time to resolve ongoing or potential outbreaks (Carleton et al., <xref ref-type="bibr" rid="B8">2019</xref>). A comparable situation occurs when trying to isolate foodborne bacteria from foods for surveillance or outbreak investigations, such as STECs, which take &#x0007E;2 weeks to obtain a single isolate before doing WGS on that isolate and complete the outbreak investigation (Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>). Consequently, there is a renewed effort to move to culture-independent subtyping and outbreak investigations for pathogen infections in both clinical and food investigations (Carleton et al., <xref ref-type="bibr" rid="B8">2019</xref>; Pe&#x000F1;a-Gonzalez et al., <xref ref-type="bibr" rid="B40">2019</xref>; Maguire et al., <xref ref-type="bibr" rid="B30">2021a</xref>). These methods or approaches need to be rapid, cheap (95&#x02013;300 USD), and highly accurate (Carleton et al., <xref ref-type="bibr" rid="B8">2019</xref>). In a previous publication, we were able to obtain completely closed STEC genomes from enriched samples (using a metagenomic approach); however, the technology employed (ONT) produced genomes that were not suitable for source tracking using an SNP approach. In this study, by adding Illumina reads and assembling the data using both Illumina and ONT data using a hybrid approach, we were able to obtain fragmented STEC O157:H7 MAGs with enough quality and accuracy to be able to be placed in the correct cluster when analyzed against 4,200 O157:H7 genomes available at NCBI (<xref ref-type="fig" rid="F2">Figure 2</xref>). Both replicates for each spike level (1e<sup>8</sup> and 7e<sup>7</sup> CFU/ml) clustered together with the genome generated for the completely circular closed genome (EDL933_2 FDA Unicycler) spiked strain and differed among them by a maximum of three SNPs.</p>
<p>Current developments in ONT sequencing are showing promising results, with newer sequencing kits producing data of higher quality (Q20&#x0002B;) (preliminary unpublished data by authors). However, more studies and advances are still needed to make it suitable to be used as an alternative method to culture-based approaches, as well as the creation of improved assemblers for that kind of data.</p>
</sec>
<sec sec-type="conclusions" id="s5">
<title>Conclusion</title>
<p>Overall, we tested the limits of detection and assembly for EDL933 O157:H7 in enriched irrigation water using a shotgun short-read sequencing approach (MiSeq Illumina). In our previous study, the nanopore sequencing detection and assembly limits were determined to be 10<sup>3</sup> and 10<sup>5</sup> CFU/ml, respectively. In this study, we showed that, for MiSeq sequencing, the detection and assembly limits when using six samples per MiSeq V3 kit were two 10-fold lower at 10<sup>5</sup> and 10<sup>7</sup> CFU/ml, respectively, when compared to nanopore sequencing. Furthermore, short-read sequencing (MiSeq Illumina) of the highest level of EDL933_2 spiking of 7 &#x000D7; 10<sup>8</sup> CFU/ml did not result in a completely closed genome as observed with the nanopore sequencing in our previous study. Nevertheless, the completely fragmented genome or fragmented MAG obtained with levels above 10<sup>7</sup> CFU/ml with short reads was enough to make a complete characterization of the STEC strain, including serotype and virulotype. In this study, we ran six samples per single MiSeq reagent cartridge, and the analysis of the results provided vital information regarding how many samples could be run successfully in a single cartridge and their coverage depth, answering several unknowns about the limit of detection and assemblies for samples with varying degrees of abundance in the sample. For example, if your target of interest in your sample is low, then we suggest running a single sample per run or sequencing them in other instruments (i.e., NextSeq 2000) that can produce higher output data. Still, the cost of your sampling/analysis ratio must be taken into consideration as well. Contrary to what we were expecting, a hybrid assembly approach did not produce a completely closed genome even at the highest concentration (7 &#x000D7; 10<sup>8</sup> CFU/ml), highlighting the complexity of STEC genome assemblies. Nonetheless, these fragmented MAG hybrid genomes clustered with the correct samples in an SNP tree of more than 4200 STEC O157:H7 genomes. Further studies using nanopore sequencing kits that show promising results in read quality and accuracy could help in solving these issues of closing high-quality STEC genomes from metagenomic samples to be used for outbreak investigations while the single purified strain is isolated for regulatory reasons. The impact of this single <italic>in silico</italic> purified strain would be remarkable, allowing to speed up the process of pulling potentially contaminated products from the shell earlier, reducing the number of illnesses associated with that product, and helping in root case analysis of the possible sources of contamination of such products. We must stress though that additional advances in the technology (more samples and output per run) and price reduction per run will make this approach more accessible for use in routine food testing at most laboratories.</p>
</sec>
<sec sec-type="data-availability" id="s6">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: <ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/genbank/">https://www.ncbi.nlm.nih.gov/genbank/</ext-link>, CP120944-CP120946 and PRJNA639799.</p>
</sec>
<sec sec-type="author-contributions" id="s7">
<title>Author contributions</title>
<p>NG-E and MMag conceived and designed the experiments and wrote the manuscript. NG-E, MMag, and MMam analyzed the data. NG-E, MMag, MMam, MA, EB, and SM contributed to reagents, materials, and analysis tools. MMag, MA, ST, EB, and NG-E revised and drafted the study.</p>
</sec>
</body>
<back>
<sec sec-type="funding-information" id="s8">
<title>Funding</title>
<p>This study was supported by funding from the MCMi Challenge Grants Program Proposal &#x00023;2018-646, CS Challenge Award &#x00023;2021-1464, and the FDA Foods Program Intramural Funds. MMag acknowledges a Research Fellowship Program (ORISE) for the Center for Food Safety and Applied Nutrition administered by the Oak Ridge Associated Universities through a contract with the U.S. Food and Drug Administration.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec sec-type="supplementary-material" id="s10">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmicb.2023.1221668/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmicb.2023.1221668/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.docx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/></sec>
<fn-group>
<fn id="fn0001"><p><sup>1</sup><ext-link ext-link-type="uri" xlink:href="https://www.fda.gov/media/102633/download">https://www.fda.gov/media/102633/download</ext-link> and <ext-link ext-link-type="uri" xlink:href="https://www.fda.gov/media/83177/download">https://www.fda.gov/media/83177/download</ext-link>.</p></fn>
<fn id="fn0002"><p><sup>2</sup><ext-link ext-link-type="uri" xlink:href="https://www.fda.gov/food/outbreaks-foodborne-illness/environmental-assessment-factors-potentially-contributing-contamination-romaine-lettuce-implicated">https://www.fda.gov/food/outbreaks-foodborne-illness/environmental-assessment-factors-potentially-contributing-contamination-romaine-lettuce-implicated</ext-link> and <ext-link ext-link-type="uri" xlink:href="https://www.fda.gov/food/outbreaks-foodborne-illness/investigation-summary-factors-potentially-contributing-contamination-romaine-lettuce-implicated-fall">https://www.fda.gov/food/outbreaks-foodborne-illness/investigation-summary-factors-potentially-contributing-contamination-romaine-lettuce-implicated-fall</ext-link>.</p></fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Allende</surname> <given-names>A.</given-names></name> <name><surname>Monaghan</surname> <given-names>J.</given-names></name></person-group> (<year>2015</year>). <article-title>Irrigation water quality for leafy crops: a perspective of risks and potential solutions</article-title>. <source>Int. J. Environ. Res. Public Health</source> <volume>12</volume>, <fpage>7457</fpage>&#x02013;<lpage>7477</lpage>. <pub-id pub-id-type="doi">10.3390/ijerph120707457</pub-id><pub-id pub-id-type="pmid">26151764</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Antipov</surname> <given-names>D.</given-names></name> <name><surname>Korobeynikov</surname> <given-names>A.</given-names></name> <name><surname>McLean</surname> <given-names>J. S.</given-names></name> <name><surname>Pevzner</surname> <given-names>P. A.</given-names></name></person-group> (<year>2016</year>). <article-title>hybridSPAdes: an algorithm for hybrid assembly of short and long reads</article-title>. <source>Bioinformatics</source> <volume>32</volume>, <fpage>1009</fpage>&#x02013;<lpage>1015</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btv688</pub-id><pub-id pub-id-type="pmid">26589280</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Archer</surname> <given-names>D. L.</given-names></name></person-group> (<year>2018</year>). <article-title>The evolution of FDA&#x00027;s policy on <italic>Listeria monocytogenes</italic> in ready-to-eat foods in the United States</article-title>. <source>Curr. Opin. Food Sci.</source> <volume>20</volume>, <fpage>64</fpage>&#x02013;<lpage>68</lpage>. <pub-id pub-id-type="doi">10.1016/j.cofs.2018.03.007</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bertrand</surname> <given-names>D.</given-names></name> <name><surname>Shaw</surname> <given-names>J.</given-names></name> <name><surname>Kalathiyappan</surname> <given-names>M.</given-names></name> <name><surname>Ng</surname> <given-names>A. H. Q.</given-names></name> <name><surname>Kumar</surname> <given-names>M. S.</given-names></name> <name><surname>Li</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Hybrid metagenomic assembly enables high-resolution analysis of resistance determinants and mobile elements in human microbiomes</article-title>. <source>Nat. Biotechnol</source>. <volume>37</volume>, <fpage>937</fpage>&#x02013;<lpage>944</lpage>. <pub-id pub-id-type="doi">10.1038/s41587-019-0191-2</pub-id><pub-id pub-id-type="pmid">31359005</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beutin</surname> <given-names>L.</given-names></name> <name><surname>Martin</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Outbreak of Shiga toxin-producing <italic>Escherichia coli</italic> (STEC) O104:H4 infection in Germany causes a paradigm shift with regard to human pathogenicity of STEC strains</article-title>. <source>J. Food Prot</source>. <volume>75</volume>, <fpage>408</fpage>&#x02013;<lpage>418</lpage>. <pub-id pub-id-type="doi">10.4315/0362-028X.JFP-11-452</pub-id><pub-id pub-id-type="pmid">22289607</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>E.</given-names></name> <name><surname>Dessai</surname> <given-names>U.</given-names></name> <name><surname>McGarry</surname> <given-names>S.</given-names></name> <name><surname>Gerner-Smidt</surname> <given-names>P.</given-names></name></person-group> (<year>2019</year>). <article-title>Use of whole-genome sequencing for food safety and public health in the United States</article-title>. <source>Foodborne Pathog. Dis</source>. <volume>16</volume>, <fpage>441</fpage>&#x02013;<lpage>450</lpage>. <pub-id pub-id-type="doi">10.1089/fpd.2019.2662</pub-id><pub-id pub-id-type="pmid">31194586</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Buytaers</surname> <given-names>F. E.</given-names></name> <name><surname>Saltykova</surname> <given-names>A.</given-names></name> <name><surname>Denayer</surname> <given-names>S.</given-names></name> <name><surname>Verhaegen</surname> <given-names>B.</given-names></name> <name><surname>Vanneste</surname> <given-names>K.</given-names></name> <name><surname>Roosens</surname> <given-names>N. H. C.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>Towards real-time and affordable strain-level metagenomics-based foodborne outbreak investigations using Oxford nanopore sequencing technologies</article-title>. <source>Front. Microbiol</source>. <volume>12</volume>, <fpage>738284</fpage>. <pub-id pub-id-type="doi">10.3389/fmicb.2021.738284</pub-id><pub-id pub-id-type="pmid">34803953</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carleton</surname> <given-names>H. A.</given-names></name> <name><surname>Besser</surname> <given-names>J.</given-names></name> <name><surname>Williams-Newkirk</surname> <given-names>A. J.</given-names></name> <name><surname>Huang</surname> <given-names>A.</given-names></name> <name><surname>Trees</surname> <given-names>E.</given-names></name> <name><surname>Gerner-Smidt</surname> <given-names>P.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Metagenomic approaches for public health surveillance of foodborne infections: opportunities and challenges</article-title>. <source>Foodborne Pathog. Dis</source>. <volume>16</volume>, <fpage>474</fpage>&#x02013;<lpage>479</lpage>. <pub-id pub-id-type="doi">10.1089/fpd.2019.2636</pub-id><pub-id pub-id-type="pmid">31170005</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Commichaux</surname> <given-names>S.</given-names></name> <name><surname>Javkar</surname> <given-names>K.</given-names></name> <name><surname>Ramachandran</surname> <given-names>P.</given-names></name> <name><surname>Nagarajan</surname> <given-names>N.</given-names></name> <name><surname>Bertrand</surname> <given-names>D.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>Evaluating the accuracy of <italic>Listeria monocytogenes</italic> assemblies from quasimetagenomic samples using long and short reads</article-title>. <source>BMC Genomics</source> <volume>22</volume>, <fpage>389</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-021-07702-2</pub-id><pub-id pub-id-type="pmid">34039264</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crowe</surname> <given-names>S. J.</given-names></name> <name><surname>Bottichio</surname> <given-names>L.</given-names></name> <name><surname>Shade</surname> <given-names>L. N.</given-names></name> <name><surname>Whitney</surname> <given-names>B. M.</given-names></name> <name><surname>Corral</surname> <given-names>N.</given-names></name> <name><surname>Melius</surname> <given-names>B.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Shiga toxin-producing <italic>E. coli</italic> infections associated with flour</article-title>. <source>N. Engl. J. Med</source>. <volume>377</volume>, <fpage>2036</fpage>&#x02013;<lpage>2043</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa1615910</pub-id><pub-id pub-id-type="pmid">29166238</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Darling</surname> <given-names>A. C.</given-names></name> <name><surname>Mau</surname> <given-names>B.</given-names></name> <name><surname>Blattner</surname> <given-names>F. R.</given-names></name> <name><surname>Perna</surname> <given-names>N. T.</given-names></name></person-group> (<year>2004</year>). <article-title>Mauve: multiple alignment of conserved genomic sequence with rearrangements</article-title>. <source>Genome Res</source>. <volume>14</volume>, <fpage>1394</fpage>&#x02013;<lpage>1403</lpage>. <pub-id pub-id-type="doi">10.1101/gr.2289704</pub-id><pub-id pub-id-type="pmid">15231754</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Feng</surname> <given-names>P.</given-names></name> <name><surname>Weagant</surname> <given-names>S. D.</given-names></name> <name><surname>Jinneman</surname> <given-names>K.</given-names></name></person-group> (<year>2020</year>). <source>BAM Chapter 4A: Diarrheagenic Escherichia coli</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="https://www.fda.gov/food/laboratory-methods-food/bam-chapter-4a-diarrheagenic-escherichia-coli">https://www.fda.gov/food/laboratory-methods-food/bam-chapter-4a-diarrheagenic-escherichia-coli</ext-link></citation>
</ref>
<ref id="B13">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Fischer</surname> <given-names>N.</given-names></name> <name><surname>Bourne</surname> <given-names>A.</given-names></name> <name><surname>Plunkett</surname> <given-names>D.</given-names></name></person-group> (<year>2015</year>). <source>Outbreak alert! 2015: A Review of Foodborne Illness in the US from 2004-2013</source>. <publisher-loc>Washington, DC</publisher-loc>: <publisher-name>Center for Science in the Public Interest</publisher-name>.</citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gangiredla</surname> <given-names>J.</given-names></name> <name><surname>Rand</surname> <given-names>H.</given-names></name> <name><surname>Benisatto</surname> <given-names>D.</given-names></name> <name><surname>Payne</surname> <given-names>J.</given-names></name> <name><surname>Strittmatter</surname> <given-names>C.</given-names></name> <name><surname>Sanders</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>GalaxyTrakr: a distributed analysis tool for public health whole genome sequence data accessible to non-bioinformaticians</article-title>. <source>BMC Genomics</source> <volume>22</volume>, <fpage>114</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-021-07405-8</pub-id><pub-id pub-id-type="pmid">33568057</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Garmendia</surname> <given-names>J.</given-names></name> <name><surname>Frankel</surname> <given-names>G.</given-names></name> <name><surname>Crepin</surname> <given-names>V. F.</given-names></name></person-group> (<year>2005</year>). <article-title>Enteropathogenic and enterohemorrhagic <italic>Escherichia coli</italic> infections: translocation, translocation, translocation</article-title>. <source>Infect. Immun</source>. <volume>73</volume>, <fpage>2573</fpage>&#x02013;<lpage>2585</lpage>. <pub-id pub-id-type="doi">10.1128/IAI.73.5.2573-2585.2005</pub-id><pub-id pub-id-type="pmid">15845459</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gobin</surname> <given-names>M.</given-names></name> <name><surname>Hawker</surname> <given-names>J.</given-names></name> <name><surname>Cleary</surname> <given-names>P.</given-names></name> <name><surname>Inns</surname> <given-names>T.</given-names></name> <name><surname>Gardiner</surname> <given-names>D.</given-names></name> <name><surname>Mikhail</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>National outbreak of Shiga toxin-producing <italic>Escherichia coli</italic> O157, H7. linked to mixed salad leaves, United Kingdom, 2016</article-title>. <source>Euro Surveill</source>. <volume>23</volume>, <fpage>17</fpage>&#x02013;<lpage>00197</lpage>. <pub-id pub-id-type="doi">10.2807/1560-7917.ES.2018.23.18.17-00197</pub-id><pub-id pub-id-type="pmid">29741151</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonzalez-Escalona</surname> <given-names>N.</given-names></name> <name><surname>Allard</surname> <given-names>M. A.</given-names></name> <name><surname>Brown</surname> <given-names>E. W.</given-names></name> <name><surname>Sharma</surname> <given-names>S.</given-names></name> <name><surname>Hoffmann</surname> <given-names>M.</given-names></name></person-group> (<year>2019a</year>). <article-title>Nanopore sequencing for fast determination of plasmids, phages, virulence markers, and antimicrobial resistance genes in Shiga toxin-producing <italic>Escherichia coli</italic></article-title>. <source>PLoS ONE</source> <volume>14</volume>, <fpage>e0220494</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0220494</pub-id><pub-id pub-id-type="pmid">31361781</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonzalez-Escalona</surname> <given-names>N.</given-names></name> <name><surname>Kase</surname> <given-names>J. A.</given-names></name></person-group> (<year>2019</year>). <article-title>Virulence gene profiles and phylogeny of Shiga toxin-positive <italic>Escherichia coli</italic> strains isolated from FDA regulated foods during 2010-2017</article-title>. <source>PLoS ONE</source> <volume>14</volume>, <fpage>e0214620</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0214620</pub-id><pub-id pub-id-type="pmid">30934002</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gonzalez-Escalona</surname> <given-names>N.</given-names></name> <name><surname>Meng</surname> <given-names>J.</given-names></name> <name><surname>Doyle</surname> <given-names>M. P.</given-names></name></person-group> (<year>2019b</year>). <source>Shiga Toxin-producing Escherichia coli. Food Microbiology: Fundamentals and Frontiers</source>, <edition>5th ed</edition>. <publisher-loc>Washington, DC</publisher-loc>: <publisher-name>American Society of Microbiology</publisher-name>. <pub-id pub-id-type="doi">10.1128/9781555819972.ch11</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonzalez-Escalona</surname> <given-names>N.</given-names></name> <name><surname>Sharma</surname> <given-names>S. K.</given-names></name></person-group> (<year>2020</year>). <article-title>Closing <italic>Clostridium botulinum</italic> group I genomes using a combination of short- and long-reads</article-title>. <source>Front. Microbiol</source>. <volume>11</volume>, <fpage>239</fpage>. <pub-id pub-id-type="doi">10.3389/fmicb.2020.00239</pub-id><pub-id pub-id-type="pmid">32153532</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haendiges</surname> <given-names>J.</given-names></name> <name><surname>Jinneman</surname> <given-names>K.</given-names></name> <name><surname>Gonzalez-Escalona</surname> <given-names>N.</given-names></name></person-group> (<year>2021</year>). <article-title>Choice of library preparation affects sequence quality, genome assembly, and precise <italic>in silico</italic> prediction of virulence genes in Shiga toxin-producing <italic>Escherichia coli</italic></article-title>. <source>PLoS ONE</source> 16, e0242294. <pub-id pub-id-type="doi">10.1371/journal.pone.0242294</pub-id><pub-id pub-id-type="pmid">33761524</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hoffmann</surname> <given-names>M.</given-names></name> <name><surname>Luo</surname> <given-names>Y.</given-names></name> <name><surname>Monday</surname> <given-names>S. R.</given-names></name> <name><surname>Gonzalez-Escalona</surname> <given-names>N.</given-names></name> <name><surname>Ottesen</surname> <given-names>A. R.</given-names></name> <name><surname>Muruvanda</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Tracing origins of the <italic>salmonella</italic> bareilly strain causing a food-borne outbreak in the United States</article-title>. <source>J. Infect. Dis</source>. <volume>213</volume>, <fpage>502</fpage>&#x02013;<lpage>508</lpage>. <pub-id pub-id-type="doi">10.1093/infdis/jiv297</pub-id><pub-id pub-id-type="pmid">25995194</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>A. D.</given-names></name> <name><surname>Luo</surname> <given-names>C.</given-names></name> <name><surname>Pena-Gonzalez</surname> <given-names>A.</given-names></name> <name><surname>Weigand</surname> <given-names>M. R.</given-names></name> <name><surname>Tarr</surname> <given-names>C. L.</given-names></name> <name><surname>Konstantinidis</surname> <given-names>K. T.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Metagenomics of two severe foodborne outbreaks provides diagnostic signatures and signs of coinfection not attainable by traditional methods</article-title>. <source>Appl. Environ. Microbiol</source>. <volume>83</volume>, <fpage>e02577</fpage>&#x02013;<lpage>16</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.02577-16</pub-id><pub-id pub-id-type="pmid">27881416</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jaudou</surname> <given-names>S.</given-names></name> <name><surname>Deneke</surname> <given-names>C.</given-names></name> <name><surname>Tran</surname> <given-names>M. L.</given-names></name> <name><surname>Schuh</surname> <given-names>E.</given-names></name> <name><surname>Goehler</surname> <given-names>A.</given-names></name> <name><surname>Vorimore</surname> <given-names>F.</given-names></name> <etal/></person-group>. (<year>2022</year>). <article-title>A step forward for Shiga toxin-producing <italic>Escherichia coli</italic> identification and characterization in raw milk using long-read metagenomics</article-title>. <source>Microb. Genom</source>. 8, mgen000911. <pub-id pub-id-type="doi">10.1099/mgen.0.000911</pub-id><pub-id pub-id-type="pmid">36748417</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kaper</surname> <given-names>J. B.</given-names></name> <name><surname>Nataro</surname> <given-names>J. P.</given-names></name> <name><surname>Mobley</surname> <given-names>H. L.</given-names></name></person-group> (<year>2004</year>). <article-title>Pathogenic <italic>Escherichia coli</italic></article-title>. <source>Nat. Rev. Microbiol</source>. <volume>2</volume>, <fpage>123</fpage>&#x02013;<lpage>140</lpage>. <pub-id pub-id-type="doi">10.1038/nrmicro818</pub-id><pub-id pub-id-type="pmid">15040260</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>D.</given-names></name> <name><surname>Song</surname> <given-names>L.</given-names></name> <name><surname>Breitwieser</surname> <given-names>F. P.</given-names></name> <name><surname>Salzberg</surname> <given-names>S. L.</given-names></name></person-group> (<year>2016</year>). <article-title>Centrifuge: rapid and sensitive classification of metagenomic sequences</article-title>. <source>Genome Res</source>. <volume>26</volume>, <fpage>1721</fpage>&#x02013;<lpage>1729</lpage>. <pub-id pub-id-type="doi">10.1101/gr.210641.116</pub-id><pub-id pub-id-type="pmid">27852649</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leonard</surname> <given-names>S. R.</given-names></name> <name><surname>Mammel</surname> <given-names>M. K.</given-names></name> <name><surname>Lacher</surname> <given-names>D. W.</given-names></name> <name><surname>Elkins</surname> <given-names>C. A.</given-names></name></person-group> (<year>2015</year>). <article-title>Application of metagenomic sequencing to food safety: detection of Shiga toxin-producing <italic>Escherichia coli</italic> on fresh bagged spinach</article-title>. <source>Appl. Environ. Microbiol</source>. <volume>81</volume>, <fpage>8183</fpage>&#x02013;<lpage>8191</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.02601-15</pub-id><pub-id pub-id-type="pmid">26386062</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leonard</surname> <given-names>S. R.</given-names></name> <name><surname>Mammel</surname> <given-names>M. K.</given-names></name> <name><surname>Lacher</surname> <given-names>D. W.</given-names></name> <name><surname>Elkins</surname> <given-names>C. A.</given-names></name></person-group> (<year>2016</year>). <article-title>Strain-level discrimination of Shiga toxin-producing <italic>Escherichia coli</italic> in spinach using metagenomic sequencing</article-title>. <source>PLoS ONE</source> <volume>11</volume>, <fpage>e0167870</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0167870</pub-id><pub-id pub-id-type="pmid">27930729</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Loman</surname> <given-names>N. J.</given-names></name> <name><surname>Constantinidou</surname> <given-names>C.</given-names></name> <name><surname>Christner</surname> <given-names>M.</given-names></name> <name><surname>Rohde</surname> <given-names>H.</given-names></name> <name><surname>Chan</surname> <given-names>J. Z.</given-names></name> <name><surname>Quick</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>A culture-independent sequence-based metagenomics approach to the investigation of an outbreak of Shiga-toxigenic <italic>Escherichia coli</italic> O104:H4</article-title>. <source>JAMA</source> <volume>309</volume>, <fpage>1502</fpage>&#x02013;<lpage>1510</lpage>. <pub-id pub-id-type="doi">10.1001/jama.2013.3231</pub-id><pub-id pub-id-type="pmid">23571589</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maguire</surname> <given-names>M.</given-names></name> <name><surname>Kase</surname> <given-names>J. A.</given-names></name> <name><surname>Roberson</surname> <given-names>D.</given-names></name> <name><surname>Muruvanda</surname> <given-names>T.</given-names></name> <name><surname>Brown</surname> <given-names>E. W.</given-names></name> <name><surname>Allard</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2021a</year>). <article-title>Precision long-read metagenomics sequencing for food safety by detection and assembly of Shiga toxin-producing <italic>Escherichia coli</italic> in irrigation water</article-title>. <source>PLoS ONE</source> <volume>16</volume>, <fpage>e0245172</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0245172</pub-id><pub-id pub-id-type="pmid">33444384</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maguire</surname> <given-names>M.</given-names></name> <name><surname>Kase</surname> <given-names>J. A. A.</given-names></name> <name><surname>Brown</surname> <given-names>E. W. W.</given-names></name> <name><surname>Allard</surname> <given-names>M. W. W.</given-names></name> <name><surname>Musser</surname> <given-names>S. M. M.</given-names></name> <name><surname>Gonzalez-Escalona</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2022</year>). <article-title>Metagenomic survey of agricultural water using long read sequencing: considerations for a successful analysis</article-title>. <source>Front. Environ. Sci</source>, <volume>10</volume>, <fpage>830300</fpage>. <pub-id pub-id-type="doi">10.3389/fenvs.2022.830300</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maguire</surname> <given-names>M.</given-names></name> <name><surname>Khan</surname> <given-names>A. S.</given-names></name> <name><surname>Adesiyun</surname> <given-names>A. A.</given-names></name> <name><surname>Georges</surname> <given-names>K.</given-names></name> <name><surname>Gonzalez-Escalona</surname> <given-names>N.</given-names></name></person-group> (<year>2021b</year>). <article-title>Closed genome sequence of a <italic>Salmonella enterica</italic> serotype senftenberg strain carrying the mcr-9 gene isolated from broken chicken eggshells in trinidad and tobago</article-title>. <source>Microbiol. Resour. Announc</source>. 10, e0146520. <pub-id pub-id-type="doi">10.1128/MRA.01465-20</pub-id><pub-id pub-id-type="pmid">34042489</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mellmann</surname> <given-names>A.</given-names></name> <name><surname>Bielaszewska</surname> <given-names>M.</given-names></name> <name><surname>K&#x000F6;ck</surname> <given-names>R.</given-names></name> <name><surname>Friedrich</surname> <given-names>A. W.</given-names></name> <name><surname>Fruth</surname> <given-names>A.</given-names></name> <name><surname>Middendorf</surname> <given-names>B.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>Analysis of collection of hemolytic uremic syndrome&#x02013;associated enterohemorrhagic <italic>Escherichia coli</italic></article-title>. <source>Emerging Infect. Dis</source>. <volume>14</volume>, <fpage>1287</fpage>&#x02013;<lpage>1290</lpage>. <pub-id pub-id-type="doi">10.3201/eid1408.071082</pub-id><pub-id pub-id-type="pmid">18680658</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Monaghan</surname> <given-names>J. M.</given-names></name> <name><surname>Hutchison</surname> <given-names>M. L.</given-names></name></person-group> (<year>2012</year>). <article-title>Distribution and decline of human pathogenic bacteria in soil after application in irrigation water and the potential for soil-splash-mediated dispersal onto fresh produce</article-title>. <source>J. Appl. Microbiol</source>. <volume>112</volume>, <fpage>1007</fpage>&#x02013;<lpage>1019</lpage>. <pub-id pub-id-type="doi">10.1111/j.1365-2672.2012.05269.x</pub-id><pub-id pub-id-type="pmid">22372934</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moss</surname> <given-names>E. L.</given-names></name> <name><surname>Maghini</surname> <given-names>D. G.</given-names></name> <name><surname>Bhatt</surname> <given-names>A. S.</given-names></name></person-group> (<year>2020</year>). <article-title>Complete, closed bacterial genomes from microbiomes using nanopore sequencing</article-title>. <source>Nat Biotechnol.</source> <volume>38</volume>, <fpage>701</fpage>&#x02013;<lpage>707</lpage>. <pub-id pub-id-type="doi">10.1038/s41587-020-0422-6</pub-id><pub-id pub-id-type="pmid">32042169</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><collab>National Advisory Committee On Microbiological Criteria For Foods</collab></person-group> (<year>2019</year>). <article-title>Response to questions posed by the food and drug administration regarding virulence factors and attributes that define foodborne Shiga toxin-producing <italic>Escherichia coli</italic> (STEC) as severe human pathogens (dagger)</article-title>. <source>J. Food Prot</source>. <volume>82</volume>, <fpage>724</fpage>&#x02013;<lpage>767</lpage>. <pub-id pub-id-type="doi">10.4315/0362-028X.JFP-18-479</pub-id><pub-id pub-id-type="pmid">30969806</pub-id></citation></ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nei</surname> <given-names>M.</given-names></name> <name><surname>Tajima</surname> <given-names>F.</given-names></name> <name><surname>Tateno</surname> <given-names>Y.</given-names></name></person-group> (<year>1983</year>). <article-title>Accuracy of estimated phylogenetic trees from molecular data. II. Gene frequency data</article-title>. <source>J. Mol. Evol</source>. <volume>19</volume>, <fpage>153</fpage>&#x02013;<lpage>170</lpage>. <pub-id pub-id-type="doi">10.1007/BF02300753</pub-id><pub-id pub-id-type="pmid">6571220</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oliveira</surname> <given-names>M.</given-names></name> <name><surname>Vinas</surname> <given-names>I.</given-names></name> <name><surname>Usall</surname> <given-names>J.</given-names></name> <name><surname>Anguera</surname> <given-names>M.</given-names></name> <name><surname>Abadias</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>Presence and survival of <italic>Escherichia coli</italic> O157:H7 on lettuce leaves and in soil treated with contaminated compost and irrigation water</article-title>. <source>Int. J. Food Microbiol</source>. <volume>156</volume>, <fpage>133</fpage>&#x02013;<lpage>140</lpage>. <pub-id pub-id-type="doi">10.1016/j.ijfoodmicro.2012.03.014</pub-id><pub-id pub-id-type="pmid">22483400</pub-id></citation></ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ottesen</surname> <given-names>A.</given-names></name> <name><surname>Ramachandran</surname> <given-names>P.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Brown</surname> <given-names>E.</given-names></name> <name><surname>Reed</surname> <given-names>E.</given-names></name> <name><surname>Strain</surname> <given-names>E.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Quasimetagenomic source tracking of <italic>Listeria monocytogenes</italic> from naturally contaminated ice cream</article-title>. <source>BMC Infect. Dis</source>. <volume>20</volume>, <fpage>83</fpage>. <pub-id pub-id-type="doi">10.1186/s12879-019-4747-z</pub-id><pub-id pub-id-type="pmid">31996135</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pe&#x000F1;a-Gonzalez</surname> <given-names>A.</given-names></name> <name><surname>Soto-Gir&#x000F3;n</surname> <given-names>M. J.</given-names></name> <name><surname>Smith</surname> <given-names>S.</given-names></name> <name><surname>Sistrunk</surname> <given-names>J.</given-names></name> <name><surname>Montero</surname> <given-names>L.</given-names></name> <name><surname>P&#x000E1;ez</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Metagenomic signatures of gut infections caused by different <italic>Escherichia coli</italic> pathotypes</article-title>. <source>Appl. Environ. Microbiol</source>. <volume>85</volume>, <fpage>e01820</fpage>&#x02013;<lpage>19</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.01820-19</pub-id><pub-id pub-id-type="pmid">31585992</pub-id></citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saltykova</surname> <given-names>A.</given-names></name> <name><surname>Buytaers</surname> <given-names>F. E.</given-names></name> <name><surname>Denayer</surname> <given-names>S.</given-names></name> <name><surname>Verhaegen</surname> <given-names>B.</given-names></name> <name><surname>Pierard</surname> <given-names>D.</given-names></name> <name><surname>Roosens</surname> <given-names>N. H. C.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Strain-level metagenomic data analysis of enriched <italic>in vitro</italic> and <italic>in silico</italic> spiked food samples: paving the way towards a culture-free foodborne outbreak investigation using STEC as a Case Study</article-title>. <source>Int. J. Mol. Sci</source>. <volume>21</volume>, <fpage>5688</fpage>. <pub-id pub-id-type="doi">10.3390/ijms21165688</pub-id><pub-id pub-id-type="pmid">32784459</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Scallan</surname> <given-names>E.</given-names></name> <name><surname>Hoekstra</surname> <given-names>R. M.</given-names></name> <name><surname>Angulo</surname> <given-names>F. J.</given-names></name> <name><surname>Tauxe</surname> <given-names>R. V.</given-names></name> <name><surname>Widdowson</surname> <given-names>M. A.</given-names></name> <name><surname>Roy</surname> <given-names>S. L.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Foodborne illness acquired in the United States&#x02013;major pathogens</article-title>. <source>Emerg. Infect. Dis</source>. <volume>17</volume>, <fpage>7</fpage>&#x02013;<lpage>15</lpage>. <pub-id pub-id-type="doi">10.3201/eid1701.P11101</pub-id><pub-id pub-id-type="pmid">21192848</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sivapalasingam</surname> <given-names>S.</given-names></name> <name><surname>Friedman</surname> <given-names>C. R.</given-names></name> <name><surname>Cohen</surname> <given-names>L.</given-names></name> <name><surname>Tauxe</surname> <given-names>R. V.</given-names></name></person-group> (<year>2004</year>). <article-title>Fresh produce: a growing cause of outbreaks of foodborne illness in the United States, 1973 through 1997</article-title>. <source>J. Food Prot</source>. <volume>67</volume>, <fpage>2342</fpage>&#x02013;<lpage>2353</lpage>. <pub-id pub-id-type="doi">10.4315/0362-028X-67.10.2342</pub-id><pub-id pub-id-type="pmid">15508656</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Steele</surname> <given-names>M.</given-names></name> <name><surname>Odumeru</surname> <given-names>J.</given-names></name></person-group> (<year>2004</year>). <article-title>Irrigation water as source of foodborne pathogens on fruit and vegetables</article-title>. <source>J. Food Prot</source>. <volume>67</volume>, <fpage>2839</fpage>&#x02013;<lpage>2849</lpage>. <pub-id pub-id-type="doi">10.4315/0362-028X-67.12.2839</pub-id><pub-id pub-id-type="pmid">15633699</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tarr</surname> <given-names>P. I.</given-names></name> <name><surname>Gordon</surname> <given-names>C. A.</given-names></name> <name><surname>Chandler</surname> <given-names>W. L.</given-names></name></person-group> (<year>2005</year>). <article-title>Shiga-toxin-producing <italic>Escherichia coli</italic> and haemolytic uraemic syndrome</article-title>. <source>Lancet</source>. <volume>365</volume>, <fpage>1073</fpage>&#x02013;<lpage>1086</lpage>. <pub-id pub-id-type="doi">10.1016/S0140-6736(05)71144-2</pub-id><pub-id pub-id-type="pmid">15781103</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Townsend</surname> <given-names>A.</given-names></name> <name><surname>Li</surname> <given-names>S.</given-names></name> <name><surname>Mann</surname> <given-names>D. A.</given-names></name> <name><surname>Deng</surname> <given-names>X. A.</given-names></name></person-group> (<year>2020</year>). <article-title>quasimetagenomics method for concerted detection and subtyping of <italic>Salmonella enterica</italic> and <italic>E. coli</italic> O157:H7 from romaine lettuce</article-title>. <source>Food Microbiol</source>. <volume>92</volume>, <fpage>103575</fpage>. <pub-id pub-id-type="doi">10.1016/j.fm.2020.103575</pub-id><pub-id pub-id-type="pmid">32950159</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Uyttendaele</surname> <given-names>M.</given-names></name> <name><surname>Jaykus</surname> <given-names>L.-A.</given-names></name> <name><surname>Amoah</surname> <given-names>P.</given-names></name> <name><surname>Chiodini</surname> <given-names>A.</given-names></name> <name><surname>Cunliffe</surname> <given-names>D.</given-names></name> <name><surname>Jacxsens</surname> <given-names>L.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Microbial hazards in irrigation water: standards, norms, and testing to manage use of water in fresh produce primary production</article-title>. <source>Compr. Rev. Food Sci. Food Saf</source>. <volume>14</volume>, <fpage>336</fpage>&#x02013;<lpage>356</lpage>. <pub-id pub-id-type="doi">10.1111/1541-4337.12133</pub-id></citation>
</ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vorimore</surname> <given-names>F.</given-names></name> <name><surname>Jaudou</surname> <given-names>S.</given-names></name> <name><surname>Tran</surname> <given-names>M. L.</given-names></name> <name><surname>Richard</surname> <given-names>H.</given-names></name> <name><surname>Fach</surname> <given-names>P.</given-names></name> <name><surname>Delannoy</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2023</year>). <article-title>Combination of whole genome sequencing and supervised machine learning provides unambiguous identification of eae-positive Shiga toxin-producing <italic>Escherichia coli</italic></article-title>. <source>Front. Microbiol</source>. <volume>14</volume>, <fpage>1118158</fpage>. <pub-id pub-id-type="doi">10.3389/fmicb.2023.1118158</pub-id><pub-id pub-id-type="pmid">37250024</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walters</surname> <given-names>S. P.</given-names></name> <name><surname>Gonzalez-Escalona</surname> <given-names>N.</given-names></name> <name><surname>Son</surname> <given-names>I.</given-names></name> <name><surname>Melka</surname> <given-names>D. C.</given-names></name> <name><surname>Sassoubre</surname> <given-names>L. M.</given-names></name> <name><surname>Boehm</surname> <given-names>A. B.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title><italic>Salmonella enterica</italic> diversity in central Californian coastal waterways</article-title>. <source>Appl. Environ. Microbiol</source>. <volume>79</volume>, <fpage>4199</fpage>&#x02013;<lpage>4209</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.00930-13</pub-id><pub-id pub-id-type="pmid">23624479</pub-id></citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wick</surname> <given-names>R. R.</given-names></name> <name><surname>Judd</surname> <given-names>L. M.</given-names></name> <name><surname>Gorrie</surname> <given-names>C. L.</given-names></name> <name><surname>Holt</surname> <given-names>K. E.</given-names></name></person-group> (<year>2017</year>). <article-title>Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads</article-title>. <source>PLoS Comput. Biol</source>. 13, e1005595. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1005595</pub-id><pub-id pub-id-type="pmid">28594827</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wood</surname> <given-names>D. E.</given-names></name> <name><surname>Lu</surname> <given-names>J.</given-names></name> <name><surname>Langmead</surname> <given-names>B.</given-names></name></person-group> (<year>2019</year>). <article-title>Improved metagenomic analysis with Kraken 2</article-title>. <source>Genome Biol</source>. <volume>20</volume>, <fpage>257</fpage>. <pub-id pub-id-type="doi">10.1186/s13059-019-1891-0</pub-id><pub-id pub-id-type="pmid">31779668</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Payne</surname> <given-names>M.</given-names></name> <name><surname>Kaur</surname> <given-names>S.</given-names></name> <name><surname>Lan</surname> <given-names>R.</given-names></name></person-group> (<year>2021</year>). <article-title>Improved genomic identification, clustering, and serotyping of Shiga toxin-producing <italic>Escherichia coli</italic> using cluster/serotype-specific gene markers</article-title>. <source>Front. Cell. Infect. Microbiol</source>. <volume>11</volume>, <fpage>772574</fpage>. <pub-id pub-id-type="doi">10.3389/fcimb.2021.772574</pub-id><pub-id pub-id-type="pmid">35083165</pub-id></citation></ref>
</ref-list>
</back>
</article>