Assessment and Comparison of Molecular Subtyping and Characterization Methods for Salmonella

Tang, Silin; Orsi, Renato H.; Luo, Hao; Ge, Chongtao; Zhang, Guangtao; Baker, Robert C.; Stevenson, Abigail; Wiedmann, Martin

doi:10.3389/fmicb.2019.01591

REVIEW article

Front. Microbiol., 12 July 2019

Sec. Food Microbiology

Volume 10 - 2019 | https://doi.org/10.3389/fmicb.2019.01591


Silin Tang ¹*
Renato H. Orsi ²
Hao Luo ¹
Chongtao Ge ¹
Guangtao Zhang ¹
Robert C. Baker ¹
Abigail Stevenson ¹
Martin Wiedmann ²

1. Mars Global Food Safety Center, Beijing, China
2. Department of Food Science, College of Agriculture and Life Sciences, Cornell University, Ithaca, NY, United States

Abstract

The food industry is facing a major transition regarding methods for confirmation, characterization, and subtyping of Salmonella. Whole-genome sequencing (WGS) is rapidly becoming both the method of choice and the gold standard for Salmonella subtyping; however, routine use of WGS by the food industry is often not feasible due to cost constraints or the need for rapid results. To facilitate selection of subtyping methods by the food industry, we present: (i) a comparison between classical serotyping and selected widely used molecular-based subtyping methods including pulsed-field gel electrophoresis, multilocus sequence typing, and WGS (including WGS-based serovar prediction) and (ii) a scoring system to evaluate and compare Salmonella subtyping assays. This literature-based assessment supports the superior discriminatory power of WGS for source tracking and root cause elimination in food safety incidents; however, circumstances in which use of other subtyping methods may be warranted were also identified. This review provides practical guidance for the food industry and presents a starting point for further comparative evaluation of Salmonella characterization and subtyping methods.

Introduction

A number of food safety incidents and recalls caused by Salmonella contamination have been associated with ready-to-eat low-moisture products (e.g., milk powder, raw almonds, dry seasonings, and peanut butter) (Pillai and Ricke, 2002; Maciorowski et al., 2004; Park et al., 2008; GMA, 2009; Hanning et al., 2009), and other food commodities (e.g., meat products, eggs, and vegetables) (Greig and Ravel, 2009; Wu et al., 2017; Ricke et al., 2018) in recent years. These cases highlight the need to reinforce Salmonella control measures in the food industry, including rapid and accurate tracking of pathogen contamination sources with appropriate subtyping tools. Tools used in incident investigations that can differentiate Salmonella beyond the species level (defined as Salmonella subtyping) are essential to improve control of this pathogen, as Salmonella contamination can occur from diverse sources at any stage of food production (Olaimat and Holley, 2012; Barco et al., 2013; Shi et al., 2015).

Conventional serotyping (White–Kauffmann–Le Minor scheme) has been used as a Salmonella subtyping method for >80 years (Salmonella Subcommittee of the Nomenclature Committee of the International Society for Microbiology, 1934; Grimont and Weill, 2007; Guibourdenche et al., 2010; Dera-Tomaszewska, 2012; Shi et al., 2015) and has been a certified approach for public health monitoring of Salmonella infections for over 50 years (CDC, 2015). This method classifies the genus Salmonella into serovars (also known as “serotypes”) based on surface antigens including somatic (O), flagellar (H), and capsular (Vi) antigens (Brenner et al., 2000). More than 2,500 serovars of Salmonella enterica, the Salmonella species responsible for virtually all salmonellosis cases, have been identified by conventional serotyping (Hadjinicolaou et al., 2009; Ferrari et al., 2017), but fewer than 100 serovars account for the vast majority of human infections (CDC, 2015). Due to the large variety of Salmonella serovars, a laboratory needs to maintain more than 250 different high-quality typing antisera and 350 different antigens for conventional serotyping of Salmonella (McQuiston et al., 2004; Fitzgerald et al., 2006). The turnaround time (i.e., time needed from isolate submission to a laboratory to receipt of the result) for serotyping a single isolate is usually >3 days. In some cases, it can take much longer (>12 days) as multiple antibody/agglutination reactions may be needed in a step-wise fashion to assign a final classification for complex serovars (Kim et al., 2006; Boxrud, 2010). Traditional serotyping is thus time-consuming and labor intensive, requiring well-trained, experienced technicians (Boxrud, 2010; Shi et al., 2015). Unfortunately, it can also be imprecise (McQuiston et al., 2011).
Moreover, the low discriminatory power of conventional serotyping may result in false-positive identification of relatedness between two unrelated isolates, as strains with the same serovar (such as the serovar Salmonella Enteritidis) may originate from multiple contamination sources. Further in-depth resolution beyond the serovar level is thus required for incident investigations (Ricke, 2017; Ricke et al., 2018). Various rapid molecular-based subtyping methods have been developed to provide faster, more discriminatory, and more accurate subtyping of Salmonella thus overcoming the limitations of traditional serotyping. Nevertheless, serovar data can still provide important historical epidemiological information, as certain serovars have specific virulence characteristics or may be associated with specific contamination sources (Ricke et al., 2018). Thus, it is important to link the subtypes identified by these molecular-based methods to Salmonella serovars.
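The antigenic formulae that define serovars can be read mechanically. The sketch below is illustrative only: it assumes the common colon-separated layout (O antigens : phase 1 H antigen : phase 2 H antigen), with square brackets marking a factor that may be present or absent and "-" marking a missing flagellar phase. The function name and the returned dictionary are hypothetical, and the sketch does not cover every notation used in the White–Kauffmann–Le Minor scheme.

```python
def parse_antigenic_formula(formula: str) -> dict:
    """Split an antigenic formula 'O:H1:H2' into its antigen groups."""
    parts = formula.strip().split(":")
    if len(parts) != 3:
        raise ValueError(f"expected three colon-separated fields, got {formula!r}")
    o_field, h1_field, h2_field = parts

    o_required, o_optional = [], []
    for factor in o_field.split(","):
        factor = factor.strip()
        # Square brackets mark an O factor that may be present or absent.
        if factor.startswith("[") and factor.endswith("]"):
            o_optional.append(factor[1:-1])
        else:
            o_required.append(factor)

    return {
        "O": o_required,
        "O_optional": o_optional,
        # "-" means the flagellar phase is not expressed.
        "H1": None if h1_field == "-" else h1_field,
        "H2": None if h2_field == "-" else h2_field,
    }


typhimurium = parse_antigenic_formula("1,4,[5],12:i:1,2")
monophasic = parse_antigenic_formula("4,5,12:i:-")
print(typhimurium)
print("monophasic variant lacks phase 2:", monophasic["H2"] is None)
```

Two isolates with the same parsed formula share a serovar, which is exactly why serotyping alone cannot separate unrelated isolates of a common serovar.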

There is no current global recommendation for the application of molecular characterization methods for Salmonella, although the food industry has applied both banding pattern-based and sequence-based subtyping methods for incident investigations. This review will provide (i) a comparison between classical serotyping and selected widely used molecular-based subtyping methods including pulsed-field gel electrophoresis (PFGE), multilocus sequence typing (MLST), and whole-genome sequencing (WGS, including WGS-based serovar prediction), and (ii) a scoring system to evaluate and compare Salmonella subtyping assays.

Banding Pattern-Based and Sequencing-Based Characterization Methods for Salmonella

There are two major types of molecular-based subtyping methods: (i) nucleotide banding pattern-based subtyping methods, representing the banding patterns generated from the restriction digestion or polymerase chain reaction (PCR) amplification of genomic or plasmid DNA (Wachsmuth et al., 1991; Hartmann and West, 1997) and (ii) sequencing-based subtyping, identifying variants at the single-nucleotide level of the selected gene markers or the entire genome of an isolate. A comparison of the resolution, turnaround time, ability of serovar prediction, cost, and feasibility of these methods is given below (Table 1).

TABLE 1

Method	Ability to identify or predict serovars	Ability to provide sensitive subtype discrimination	Time to results from a single colony	Commercial availability (time to results that can be expected from commercial labs)	Summary of value for food industry	Estimated reagent cost per isolate (instrument and labor cost not included)¹	Service cost per isolate (provided by commercial labs)¹
Classical White–Kauffmann serotyping	While Salmonella serovars are defined by White–Kauffmann serotyping, serotyping frequently results in misclassification (Petersen et al., 2002).	Very poor subtype discrimination; only valuable as subtyping method for rare and unusual serovars.	2–17 days (usually >5 days) (ECDC, 2015; Bopp et al., 2016)	2–4 weeks	Classical serotyping is likely to be replaced rapidly by WGS-based serovar prediction. Main value for industry is as a rapid confirmation and subtype screen if access exists to a lab that can provide rapid turnaround time.	$5–65 (ECDC, 2015; Bopp et al., 2016)	∼$175
Pulsed-field gel electrophoresis (PFGE)	Intermediate ability to predict serovars	Good subtyping discrimination for most serovars. Some PFGE patterns are very common within some serovars (e.g., Pattern 4 for Salmonella Enteritidis)	4–6 days (ECDC, 2015; Bopp et al., 2016)	2–3 weeks	Has been the gold standard subtyping method for Salmonella, is likely to be replaced rapidly by WGS, starting with public health authorities and food regulators.	$7–50 (ECDC, 2015; Bopp et al., 2016)	$130–200
Multiple locus variable number of tandem repeats (VNTR) analysis (MLVA)	Intermediate ability to predict serovars	Good subtyping discrimination for most serovars. May perform better than PFGE for some serovars but worse for others.	1–2 days	NA²	Has been used as a secondary subtyping method to compensate for the low discriminatory power of serotyping and PFGE for some Salmonella serovars; it is likely to be replaced rapidly by WGS, starting with public health authorities and food regulators.	$9–36 (Amirkhanian et al., 2006; Top et al., 2008; Schouls et al., 2009; ECDC, 2015)	NA²
Legacy multilocus sequence typing (legacy MLST)	Intermediate ability to predict serovars	Better than conventional serotyping and riboprinting, worse than PFGE and WGS.	1–2 days	2–3 days	Main value for industry is as a rapid confirmation and subtype screen, can be used to select the reference genome for WGS data analysis.	$30–82 (Ranieri et al., 2013; Shi et al., 2015)	∼$280
Whole-genome sequencing (WGS)	Currently available serovar-prediction software using WGS data work well for less common serovars. May not work for extremely rare serovars.	Best discrimination among molecular subtyping approaches	3–17 days (ECDC, 2015) (depends on sequencing capabilities. Usually 1 day after sequencing is finished).	2–8 weeks	For companies with high demand of isolates to be subtyped, WGS is probably the most affordable and fastest method that provides the best discrimination. In addition, in silico serotyping and in silico MLST can be done from the data to allow for comparison with historical isolates that have not been whole-genome-sequenced. Other information, such as presence of antibiotic resistance genes and virulence genes can be easily retrieved from the data. For companies with low demand, the costs of real-time sequencing may be prohibitive, requiring that old isolates must wait until more isolates are collected to be submitted together.	$60–230 (ECDC, 2015)	$100 (using Illumina HiSeq X series)–up to more than $500 (using Illumina MiSeq)

Overview of Salmonella characterization and subtyping methods.

¹These cost estimates per isolate are based on (i) previous cost estimation reports and studies, (ii) official prices available on the Internet, and/or (iii) personal communication with service providers and product vendors, as of June 2018. True costs may vary considerably based on the number of isolates tested per run, labor costs, region/country, etc. ²NA, not available.

Banding Pattern-Based Characterization Methods

Pulsed-Field Gel Electrophoresis (PFGE)

Pulsed-field gel electrophoresis was first described in 1984 and developed as a subtyping method for Salmonella in the 1990s (Threlfall and Frost, 1990; Figure 1). PFGE is currently the gold standard for PulseNet International, and has been used by public health authorities and food regulators for outbreak investigations and source tracking globally (including USCDC, USFDA, USDA, and ECDC) (Zou et al., 2010; Wattiau et al., 2011; PulseNet, 2014; CDC, 2016a). Alternative methods for Salmonella subtyping are commonly compared against PFGE (Call et al., 2008). However, PulseNet is transitioning from using PFGE and multiple locus variable number of tandem repeats analysis (MLVA) toward using WGS as the standardized genotyping method for foodborne pathogens (CDC, 2017a; Nadon et al., 2017). PulseNet International has defined standard PFGE protocols (PulseNet, 2013; CDC, 2017b) and maintains a database of Salmonella PFGE profiles with >350,000 PFGE patterns representing >500 serovars. These PFGE patterns predominantly represent isolates collected since 1996 in North America and Europe (Zou et al., 2013). PFGE has relatively high concordance with epidemiological relatedness with two decades of data accumulation (CDC, 2018a). However, the PulseNet database for PFGE patterns is not publicly available and can only be accessed by PulseNet participating laboratories.

FIGURE 1

The PFGE approach uses restriction enzymes that recognize specific restriction sites along the genomic DNA and fragment the DNA to sizes normally ranging from 20 to 800 kb (up to 2,000 kb) (Schwartz and Cantor, 1984; Singh et al., 2006). These large fragments are separated in a flat agarose gel by constantly changing the direction of the electric current (pulsed field), which causes the DNA to separate by size, generating a specific “fingerprint pattern” for a given isolate (Foley et al., 2009). The restriction enzymes XbaI, NotI, SpeI, and SfiI have been typically used for Gram-negative bacteria including Salmonella (Barg and Goering, 1993). The primary restriction enzyme used for Salmonella PFGE is XbaI. A public health laboratory usually has access to software [e.g., BioNumerics and GelCompar (Applied Maths, Sint-Martens-Latem, Belgium); Diversity Database Fingerprinting Software (Bio-Rad Laboratories, Hercules, CA, United States)], to analyze a PFGE pattern (Nsofor, 2016) and uploads PFGE patterns to a national database. PulseNet Central’s database managers then analyze the uploaded pattern to see if a new outbreak has emerged or whether the isolate is part of an ongoing outbreak (CDC, 2018a). To make inter-laboratory comparison of DNA patterns possible, standardized protocols, molecular size standards (Salmonella Braenderup H2812, ATCC BAA-664), software, and nomenclature of PFGE patterns are required (PulseNet, 2015a). The approximate cost of the equipment and reagents required by PFGE can be accessed on the PulseNet International – PFGE site (PulseNet, 2015b).
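The restriction step can be illustrated with an in silico digest. The following is a minimal sketch, not a PFGE analysis tool: it uses the XbaI recognition sequence (T^CTAGA), treats the input as a linear molecule for simplicity (a bacterial chromosome is circular), and runs on a short made-up sequence; the function name `digest_linear` is hypothetical.

```python
XBAI_SITE = "TCTAGA"   # XbaI recognition sequence (palindromic)
XBAI_CUT_OFFSET = 1    # XbaI cuts after the first T: T^CTAGA


def digest_linear(sequence: str, site: str = XBAI_SITE,
                  cut_offset: int = XBAI_CUT_OFFSET) -> list[int]:
    """Return fragment lengths (bp) after cutting a linear sequence at every site."""
    sequence = sequence.upper()
    cut_positions = []
    start = sequence.find(site)
    while start != -1:
        cut_positions.append(start + cut_offset)
        start = sequence.find(site, start + 1)
    # Fragment lengths are the gaps between consecutive cut positions.
    boundaries = [0] + cut_positions + [len(sequence)]
    return [end - begin for begin, end in zip(boundaries, boundaries[1:])]


# Toy sequence with two XbaI sites (real PFGE fragments are tens to hundreds of kb).
toy_sequence = "AAAATCTAGAGGGGGGTCTAGACC"
fragments = digest_linear(toy_sequence)
print(sorted(fragments, reverse=True))  # largest fragment first, as read from a gel
```

The sorted list of fragment sizes is the in silico analogue of the banding pattern: a point mutation that creates or destroys a single site changes more than one fragment length at once.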

Pulsed-field gel electrophoresis has been shown repeatedly to be more discriminatory than methods such as conventional serotyping, ribotyping, or MLST for many bacteria (Fakhr et al., 2005; Harbottle et al., 2006; Oloya et al., 2009; Soyer et al., 2010; Hauser et al., 2012). The combination of profiles generated by using additional restriction enzymes can enhance the value of this method for differentiating highly homogeneous Salmonella strains (Zheng et al., 2011); however, the cost increases as additional enzymes are used. PFGE can be used for subtyping of both Gram-positive (e.g., Listeria monocytogenes, Staphylococcus aureus) and Gram-negative (e.g., Salmonella, Escherichia coli, Shigella, Campylobacter jejuni) pathogenic bacteria. Typically, only the choice of the restriction enzyme and conditions for electrophoresis need to be optimized depending on the bacterial species investigated (PulseNet, 2015a).

Although various software platforms are available for PFGE pattern analysis, artifacts (e.g., brightly fluorescing spots) may lead to misidentification of bands. PFGE technology cannot usually be used to reliably visualize smaller fragments (e.g., <20.5 kb; Hunter et al., 2005) and has difficulty in differentiating bands differing by <5–10% in size due to the limited resolution of electrophoresis (Dijkshoorn et al., 2001; Persing et al., 2011). To address these issues, it has been recommended that users confirm PFGE pattern assignments using their experience and additional information to avoid incorrect band calling and systematic band shifts due to gel imperfections or imperfect reproducibility of electrophoretic conditions (Van Belkum et al., 2007). PFGE cannot be automated and requires high-level technical expertise; it is thus hampered by low throughput and may show low robustness and poor comparability of results between laboratories (Hyytia-Trees et al., 2007; Fabre et al., 2012; Kjeldsen et al., 2016).
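The effect of limited band resolution on pattern comparison can be sketched as follows. This is an illustrative assumption rather than the algorithm of any particular software package: bands are treated as the same band when their sizes differ by no more than a relative tolerance (5% here, the lower end of the range quoted above), and patterns are then compared with a Dice coefficient, a band-based similarity measure that is not discussed in the text. Function names and band sizes are hypothetical.

```python
def match_bands(bands_a, bands_b, tolerance=0.05):
    """Greedy one-to-one count of bands whose sizes agree within the tolerance."""
    unmatched_b = sorted(bands_b)
    matches = 0
    for size_a in sorted(bands_a):
        for index, size_b in enumerate(unmatched_b):
            if abs(size_a - size_b) <= tolerance * max(size_a, size_b):
                matches += 1
                del unmatched_b[index]   # each band may be matched only once
                break
    return matches


def dice_similarity(bands_a, bands_b, tolerance=0.05):
    """Dice coefficient: 2 * shared bands / total bands in both patterns."""
    if not bands_a and not bands_b:
        return 1.0
    shared = match_bands(bands_a, bands_b, tolerance)
    return 2 * shared / (len(bands_a) + len(bands_b))


# Hypothetical band sizes in kb for two isolates.
pattern_1 = [700, 450, 300, 120]
pattern_2 = [690, 450, 210, 120]
similarity = dice_similarity(pattern_1, pattern_2)
print(f"shared bands: {match_bands(pattern_1, pattern_2)}, Dice: {similarity:.2f}")
```

Widening the tolerance makes more bands count as identical, so the same two gels can be called indistinguishable or different depending on a setting the analyst chooses. This is one reason manual confirmation of pattern assignments is recommended.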

No genetic information such as virulence potential or presence of antimicrobial resistance genes can be provided by PFGE, as the DNA fragments are separated by size rather than sequence (Ferrari et al., 2017). Observed bands of comparable size might not represent the same sequence of DNA, and a small mutation in a restriction site may result in changes in multiple bands. “Relatedness” determined by PFGE thus may not represent a true phylogenetic relationship between isolates (CDC, 2018a). Typically, multiple distinct PFGE patterns can be identified among isolates classified into the same serovar. Polyphyletic serovars, which are derived from more than one common evolutionary ancestor or ancestral group (e.g., serovars Newport, Mississippi, Saintpaul, Kentucky), show high levels of PFGE diversity (Porwollik et al., 2004; Sukhnanand et al., 2005; Alcaine et al., 2006; Harbottle et al., 2006; Sangal et al., 2010). PFGE-based prediction of these serovars is unreliable if isolates in the database are not representative of all clades of the serovar. On the other hand, PFGE may cluster epidemiologically unrelated isolates into identical PFGE types (Barco et al., 2013) and may even provide similar or identical PFGE types for isolates that represent different, but genetically very similar serovars that have a common ancestor (Barco et al., 2013; Shi et al., 2015), such as Typhimurium (antigenic formula: 1,4,[5],12:i:1,2) versus Typhimurium var. Copenhagen (antigenic formula: 1,4,12:i:1,2) (Heisig et al., 1995; Hauser et al., 2011), and Typhimurium versus 4,5,12:i:- (Guerra et al., 2000; Soyer et al., 2009; Wiedmann and Nightingale, 2009; Hoelzer et al., 2010; Ranieri et al., 2013). Furthermore, differentiation of genetically homogeneous serovars such as serovar Enteritidis challenges the usefulness of PFGE in Salmonella subtyping activities (Olson et al., 2007; Zheng et al., 2007). 
Approximately 45% of serovar Enteritidis isolates reported to PulseNet display the same PFGE XbaI pattern (JEGX01.0004), although many of these isolates are not epidemiologically related (Zheng et al., 2007). It is important to mention that the serovars mentioned above (i.e., Enteritidis, Typhimurium, Newport, Mississippi, Saintpaul, and Kentucky) are ranked among the most common Salmonella serovars associated with human and animal salmonellosis globally (Galanis et al., 2006; CDC, 2009).
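The impact of one dominant pattern on discriminatory power can be quantified. The sketch below uses Simpson's index of diversity in the form commonly applied to typing methods, which is an addition here and not a measure used in the text: the probability that two isolates drawn at random from a collection are assigned different subtypes. The collection is hypothetical: 100 isolates, 45 sharing pattern JEGX01.0004 (mirroring the roughly 45% quoted above) and, as a best-case assumption, every other isolate having a unique pattern.

```python
from collections import Counter


def simpson_index(subtypes):
    """Probability that two randomly drawn isolates have different subtypes."""
    n = len(subtypes)
    if n < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(subtypes)
    # Ordered pairs of isolates that share a subtype.
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1 - same_type_pairs / (n * (n - 1))


# Hypothetical collection: 45 isolates share one pattern, 55 are unique.
isolates = ["JEGX01.0004"] * 45 + [f"pattern_{i}" for i in range(55)]
print(round(simpson_index(isolates), 3))
```

Even in this best case the index cannot exceed 0.8, because every pair drawn from within the dominant pattern is indistinguishable.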

Multiple Locus Variable Number of Tandem Repeats Analysis (MLVA)

Multiple locus variable number of tandem repeats analysis is a PCR-based typing method originating from forensic science where it is used for DNA “fingerprinting” samples of human origin. It has frequently been applied to scientific studies of prokaryotes as well as to microbial outbreak detection and source tracking (Lindstedt et al., 2003, 2013; Figure 1). MLVA is the second major genotyping tool (after PFGE) used in the PulseNet network (PulseNet, 2015c); prior to WGS, MLVA was one of the most popular subtyping methods used in public health surveillance and outbreak investigation of Salmonella, particularly in Europe (Torpdahl et al., 2007; Hopkins et al., 2011; Barco et al., 2013; Bauer et al., 2013; Lindstedt et al., 2013; Mughini-Gras et al., 2018). MLVA is usually performed following serotyping or PFGE for routine surveillance as a complementary technique for Salmonella subtyping (Torpdahl et al., 2007; Lienemann et al., 2015; Kjeldsen et al., 2016; CDC, 2017c; Ferrari et al., 2017), as it is challenging for PFGE to further differentiate isolates of genetically homogeneous serovars such as Salmonella Enteritidis (Kjeldsen et al., 2016). MLVA is especially used for typing Salmonella Typhimurium and Salmonella Enteritidis strains in reference or regulatory laboratories in Denmark, France, Germany, and the United States [e.g., CDC, USDA – Food Safety and Inspection Service (FSIS) laboratories] (Barco et al., 2013; Bauer et al., 2013).

Multiple locus variable number of tandem repeats analysis is serovar specific, thus different Salmonella serovars usually require different MLVA schemes (Kruy et al., 2011). The first step toward uniform standardization of the MLVA profiles was collectively taken by PulseNet International and ECDC in defining the standard protocols of MLVA for Salmonella Typhimurium and Salmonella Enteritidis (ECDC, 2011, 2016b; PulseNet, 2015c). These serovars account for 26% of the culture-confirmed human Salmonella infections reported by US Laboratory-based Enteric Disease Surveillance (LEDS) and >60% of the salmonellosis cases reported by ECDC (ECDC, 2016a; Kjeldsen et al., 2016; CDC, 2018b). This uniform standardization of the MLVA profiles allowed direct comparison between laboratories irrespective of the platform used for MLVA (Larsson et al., 2009). Validated MLVA standard protocols for additional Salmonella serovars of clinical importance worldwide are largely missing, making MLVA use for serovars other than Enteritidis and Typhimurium difficult. However, with the advent of and transition into WGS, further development of MLVA may not occur.

Multiple locus variable number of tandem repeats analysis assesses the variation in the number of tandem repeated DNA sequences referred to as “variable-number tandem repeats” (VNTRs) in multiple regions of the bacterial genome to characterize bacterial isolates. The number of repeats at a given VNTR locus may vary between different microorganisms and even among bacterial isolates of the same species and serovar (Lindstedt et al., 2003; Torpdahl et al., 2007; Ngoi et al., 2015). VNTR repeat units range in length from a few base pairs to over 100 base pairs, enabling the development of techniques that utilize variation in the size of VNTR loci to discriminate closely related isolates (Lindstedt et al., 2003; Torpdahl et al., 2007; Fabre et al., 2012). The improved discriminatory power of MLVA varies with the serovar and phage type investigated (Torpdahl et al., 2007; Lienemann et al., 2015); e.g., in a study in Denmark, MLVA could differentiate distinct clusters within the most common phage types of Salmonella Typhimurium such as DT104, DT120, and DT12 even though these isolates displayed comparable PFGE patterns (Torpdahl et al., 2007). Public health laboratories usually have access to software (e.g., BioNumerics, GeneMapper, the free Peak Scanner) for analysis of MLVA patterns (ECDC, 2011; PulseNet, 2015c). Minimum spanning trees are frequently applied to MLVA profiles, yielding maps of predicted relationships among isolates based on single-locus and dual-locus variants (Van Belkum et al., 2007). However, web-accessible MLVA databases are not widely used for international collaboration (Guigon et al., 2008).
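The arithmetic behind an MLVA profile can be sketched as follows. This is a simplified illustration under stated assumptions: the locus names, flanking-region lengths, and repeat-unit lengths are hypothetical placeholders rather than values from a standardized scheme, and the repeat count is estimated as (amplicon size minus flanking sequence) divided by the repeat-unit length, rounded to absorb small sizing errors from capillary electrophoresis.

```python
# Hypothetical loci: name -> (flanking sequence length in bp, repeat unit length in bp)
LOCI = {"locus_A": (100, 6), "locus_B": (150, 9), "locus_C": (80, 33)}


def repeat_count(amplicon_bp, flank_bp, unit_bp):
    """Estimate the number of tandem repeats from a measured amplicon size."""
    return round((amplicon_bp - flank_bp) / unit_bp)


def mlva_profile(amplicon_sizes):
    """Convert measured amplicon sizes into a repeat-count profile."""
    return {locus: repeat_count(amplicon_sizes[locus], *LOCI[locus]) for locus in LOCI}


def locus_differences(profile_a, profile_b):
    """Number of loci at which two profiles carry different repeat counts."""
    return sum(1 for locus in LOCI if profile_a[locus] != profile_b[locus])


# Hypothetical fragment sizes (bp) for two isolates.
isolate_1 = mlva_profile({"locus_A": 160.0, "locus_B": 213.0, "locus_C": 245.0})
isolate_2 = mlva_profile({"locus_A": 166.4, "locus_B": 212.6, "locus_C": 245.0})
print(isolate_1, isolate_2)
print("loci that differ:", locus_differences(isolate_1, isolate_2))
```

Profiles that differ at exactly one locus are single-locus variants, the relationship that minimum spanning trees use to link closely related isolates.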

Multiple locus variable number of tandem repeats analysis is cheaper, faster, and simpler to execute, and offers relatively high throughput compared with other molecular methods (Torpdahl et al., 2005, 2007; Lindstedt et al., 2007, 2013; Hopkins et al., 2011; Kruy et al., 2011). MLVA is less labor-intensive, less time-consuming, and easier to perform than PFGE and MLST, as the protocol requires only a regular PCR step followed by capillary electrophoresis (Torpdahl et al., 2007; Lindstedt et al., 2013). Reduced handling time of pathogenic bacteria is beneficial for large-scale investigations. MLVA is also suitable for automation using a pipetting robot work station, automated sequencer, and analytical software (Barco et al., 2013; Lindstedt et al., 2013; Ferrari et al., 2017). Moreover, MLVA demonstrates good international repeatability and reproducibility for specific serovars such as Salmonella Typhimurium and Salmonella Enteritidis (Larsson et al., 2013). The data generated by MLVA can be readily analyzed and standardized for inter-laboratory comparisons (Torpdahl et al., 2007; Hopkins et al., 2011; Lindstedt et al., 2013; Wuyts et al., 2013).

A major drawback of MLVA for Salmonella subtyping is that the most effective MLVA protocols described so far are serovar-specific (Barco et al., 2013; Ngoi et al., 2015; Kjeldsen et al., 2016); hence, isolates have to be serotyped prior to selecting a specific MLVA scheme for further subtyping (Kjeldsen et al., 2016). At least 27 MLVA schemes have been developed to subtype different Salmonella serovars, whereas only Salmonella Typhimurium and Salmonella Enteritidis MLVA assays have been standardized in Europe and in the PulseNet network (PulseNet, 2015c; Kjeldsen et al., 2016). Another drawback is that rapid evolution of the target loci may decrease the reliability of results provided by MLVA regarding the relationship between strains under investigation (Hopkins et al., 2007, 2011; Lindstedt et al., 2013). This might hamper the use of MLVA, particularly in long-term epidemiological studies (Lindstedt, 2005; Li et al., 2009).

Repetitive Element PCR (Rep-PCR)

Repetitive element PCR targets the repetitive elements of genomic DNA to discriminate bacterial isolates. This method has been developed using three families of repeat sequences for subtyping Salmonella, including “enterobacterial repetitive intergenic consensus” (ERIC) sequences, “the repetitive extragenic palindromic” (REP) sequences, and the “BOX” sequences (Gilson et al., 1990; Hulton et al., 1991; Martin et al., 1992). The PCR products amplified from genome regions containing these repetitive elements are analyzed by agarose gel electrophoresis, and the banding patterns generated are used to investigate the genetic relatedness between bacterial isolates (Sabat et al., 2013). The DiversiLab system (bioMérieux, Marcy-l’Etoile, France) automated the whole process of the Rep-PCR subtyping approach after 2000 and has been used for subtyping pathogens in hospitals worldwide (Healy et al., 2005; Chenu et al., 2012; Sabat et al., 2013; Figure 1). As the low reproducibility of the original Rep-PCR method may have resulted from variability in reagents and gel electrophoresis systems (Sabat et al., 2013), the application of the DiversiLab system with microfluidic capillary electrophoresis increased both the resolution and reproducibility of the Rep-PCR approach (Healy et al., 2005; Chenu et al., 2012; Sabat et al., 2013). However, the system has been discontinued, making Rep-PCR unavailable as a commercial platform.

The major advantages of this method include its relatively low cost (comparable to that of PFGE) and short turnaround time (within one day) (Sabat et al., 2013; Ngoi et al., 2015). However, the discriminatory power of Rep-PCR in subtyping Salmonella is reportedly lower than that of PFGE (Tiong et al., 2010; Thong and Ang, 2011; Elemfareji and Thong, 2013; Ngoi et al., 2015). Its relatively low reproducibility (which can at least be partially addressed by automation, such as in the DiversiLab system), and low accuracy of serovar prediction (Weigel et al., 2004; Wise et al., 2009) have limited its application in Salmonella subtyping.

Sequencing-Based Characterization Methods

Legacy Multilocus Sequence Typing (Legacy MLST)

Multilocus sequence typing is a nucleotide sequence-based approach that assesses DNA sequence variations (i.e., allelic type) of typically three, four, or seven selected well-conserved, housekeeping genes, usually using Sanger sequencing technology (Liu, 2010; Achtman et al., 2012). Schemes targeting seven genes are typically considered the “classical” MLST approach; this typing approach was originally proposed for isolates of Neisseria meningitidis (Liu, 2010). In this review, we focus on the most widely used Salmonella scheme targeting seven housekeeping genes [aroC, dnaN, hemD, hisD, thrA, sucA, and purE; hereafter denoted as legacy MLST to distinguish newer approaches (described below)] (Li et al., 2009; Yun et al., 2015). It was first introduced for Salmonella Typhi in 2002 (Kidgell et al., 2002), and extended to all Salmonella serovars in 2012 (Achtman et al., 2012; Figure 1). Legacy MLST is mainly used in research studies, assessing the population genetics and evolution of Salmonella. Public Health England (PHE) started adopting the seven-gene MLST (based on WGS data) approach as a replacement for traditional serotyping in 2015 (Ashton et al., 2016).

Historical MLST data including legacy MLST sequence types are maintained on EnteroBase (Alikhan et al., 2018). As of November 2017, the number of legacy MLST sequence types for Salmonella has reached 3,929 (Alikhan et al., 2018). Legacy MLST analysis can be conducted online by entering the sequences of amplified genes. Allelic variation at each locus is cataloged and a sequence type is assigned by comparing the allele set. The strains are characterized by their unique sequence type. With the advent of next-generation-sequencing technologies, legacy MLST data can also be extracted directly from WGS data using bioinformatics pipelines (Achtman et al., 2012; Ashton et al., 2016). The relatedness of isolates subtyped by legacy MLST can be displayed as a dendrogram or a minimum spanning tree based on the matrix of pairwise differences between their allelic profiles (Francisco et al., 2009), or as a phylogenetic tree built directly from the nucleotide alignment of the seven genes.
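Sequence type assignment amounts to a table lookup on the seven-gene allelic profile. The sketch below is a toy illustration: the gene names come from the legacy scheme described above, but the allele numbers, the lookup table, and the labels "ST-A" and "ST-B" are hypothetical and do not correspond to entries in any real database.

```python
from typing import Optional

MLST_GENES = ("aroC", "dnaN", "hemD", "hisD", "thrA", "sucA", "purE")

# Hypothetical lookup table: allelic profile (in MLST_GENES order) -> sequence type.
ST_TABLE = {
    (1, 1, 1, 1, 1, 1, 1): "ST-A",
    (1, 1, 2, 1, 1, 1, 1): "ST-B",
}


def assign_sequence_type(alleles: dict) -> Optional[str]:
    """Return the sequence type for a profile, or None if the profile is novel."""
    profile = tuple(alleles[gene] for gene in MLST_GENES)
    return ST_TABLE.get(profile)


def allelic_distance(alleles_a: dict, alleles_b: dict) -> int:
    """Number of loci at which two isolates carry different alleles."""
    return sum(1 for gene in MLST_GENES if alleles_a[gene] != alleles_b[gene])


isolate_x = dict.fromkeys(MLST_GENES, 1)
isolate_y = {**isolate_x, "hemD": 2}
isolate_z = {**isolate_x, "purE": 9}   # profile absent from the toy table

print(assign_sequence_type(isolate_x), assign_sequence_type(isolate_y),
      assign_sequence_type(isolate_z))
print("allelic distance x-y:", allelic_distance(isolate_x, isolate_y))
```

The pairwise allelic distances form the matrix from which a dendrogram or minimum spanning tree is built. Because only seven conserved loci are compared, many distinct strains collapse into the same sequence type.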

Legacy MLST can deliver results more rapidly than PFGE (Shi et al., 2015; Yun et al., 2015; Table 1), and the publicly available databases and online query system enable legacy MLST results to be highly reproducible and exchangeable between laboratories. However, legacy MLST shows lower discriminatory power than PFGE and MLVA, which limits its application to further discriminate isolates within a given serovar (Torpdahl et al., 2005; Alcaine et al., 2006; Foley et al., 2006; Harbottle et al., 2006; Hauser et al., 2012; Ngoi et al., 2015), and for source attribution (Barco et al., 2013). Protocols targeting sequences in genes that change more rapidly than housekeeping genes have been developed to improve the discriminatory power of legacy MLST (Ross and Heuzenroeder, 2005, 2008).

Clustered Regularly Interspaced Short Palindromic Repeat-Based Subtyping (CRISPR-Based Subtyping)

The clustered regularly interspaced short palindromic repeat (CRISPR) typing method uses the diversity of the content of CRISPR loci to distinguish bacterial strains. The application of the CRISPR system for subtyping foodborne pathogens is discussed in detail elsewhere (Shariat and Dudley, 2014; Shi et al., 2015; Barrangou and Dudley, 2016; Ferrari et al., 2017; Ricke et al., 2018). Although the CRISPR system has been applied to the subtyping of at least 100 Salmonella serovars (Shariat and Dudley, 2014; Barrangou and Dudley, 2016), this approach is not widely used by public health authorities or food regulators (Ferrari et al., 2017).

Clustered regularly interspaced short palindromic repeat loci contain variable lengths of CRISPR spacers obtained from foreign nucleic acids of plasmids or bacteriophages (Shariat and Dudley, 2014; Wright et al., 2017). These CRISPR spacers are acquired or lost during evolution of the pathogen over time in a sequential manner (Ricke et al., 2018), thus constructing a unique set of DNA sequence patterns that may provide sufficient resolution for pathogen subtyping (Fricke et al., 2011; Barrangou and Horvath, 2012; Shariat and Dudley, 2014; Wright et al., 2017). For subtyping, amplified CRISPR loci PCR products are sequenced by Sanger sequencing technology (Liu et al., 2011). The CRISPR spacer sequences are analyzed to assign each locus with an allelic type. The combination of the allelic types of analyzed CRISPR loci determines the isolate’s allelic profile (also referred to as the isolate’s sequence type) and is used to investigate the relationships between isolates (Liu et al., 2011).
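The step from a sequenced CRISPR locus to an allelic type can be sketched as follows. This is a deliberately simplified illustration: it assumes perfectly conserved direct repeats (real repeats can be degenerate), and the repeat, leader, trailer, and spacer sequences are short made-up strings. Allele numbers are assigned in order of first appearance, which is a hypothetical convention rather than that of any published scheme.

```python
def extract_spacers(locus: str, repeat: str) -> list[str]:
    """Return the spacers lying between consecutive copies of the direct repeat."""
    parts = locus.upper().split(repeat.upper())
    # Drop the leader (before the first repeat) and the trailer (after the last).
    return parts[1:-1]


def assign_allele(spacers: list[str], known_arrays: dict) -> int:
    """Give each distinct ordered spacer array its own allele number."""
    key = tuple(spacers)
    if key not in known_arrays:
        known_arrays[key] = len(known_arrays) + 1
    return known_arrays[key]


REPEAT = "GGTTTATCC"   # hypothetical direct repeat
locus_1 = "TTT" + REPEAT + "AAACT" + REPEAT + "CCGTA" + REPEAT + "TGCAT" + REPEAT + "GA"
locus_2 = "TTT" + REPEAT + "AAACT" + REPEAT + "TGCAT" + REPEAT + "GA"   # one spacer lost

known_arrays = {}
spacers_1 = extract_spacers(locus_1, REPEAT)
spacers_2 = extract_spacers(locus_2, REPEAT)
allele_1 = assign_allele(spacers_1, known_arrays)
allele_2 = assign_allele(spacers_2, known_arrays)
print(spacers_1, "-> allele", allele_1)
print(spacers_2, "-> allele", allele_2)
```

Loss or gain of a single spacer is enough to produce a new allelic type, while the spacers still shared between two arrays hint at how closely the isolates are related.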

The CRISPR approach has been shown to be feasible for subtyping of Salmonella (Liu et al., 2011; Fabre et al., 2012; DiMarzio et al., 2013; Shariat et al., 2013a, b, c; Almeida et al., 2017). Liu et al. (2011) developed a CRISPR–multi-virulence-locus sequence typing (MVLST) approach using virulence genes sseL and fimH with CRISPR1 and CRISPR2 loci; this approach was used to compare 171 isolates representing nine serovars (Typhimurium, Enteritidis, Newport, Heidelberg, Javiana, I 4,[5],12:i:-, Montevideo, Muenchen, Saintpaul) and was reported to be able to subtype Salmonella with resolution at the outbreak level. CRISPR–MVLST using different schemes of virulence genes has also been applied by others for subtyping Salmonella (DiMarzio et al., 2013; Shariat et al., 2013a; Almeida et al., 2017). The results from these studies suggest that CRISPR–MVLST has a higher discriminatory power than legacy MLST (Ferrari et al., 2017); however, discrimination is lower than PFGE in some cases (Almeida et al., 2015). While CRISPR typing has a relatively short turnaround time (comparable to MLST), current major drawbacks include high cost (Almeida et al., 2017; Ferrari et al., 2017), the lack of standardized protocols and databases, and limited research on the concordance between the diversity of Salmonella isolates reflected by CRISPR loci content and by the other standard subtyping methods (Shi et al., 2015).

Whole-Genome Sequencing (WGS)

Whole-genome sequencing captures DNA sequence changes across the entire genome of single microbial isolates. The data are useful to assess evolution, allowing accurate description of the genetic relatedness of isolates. The use of WGS for Salmonella subtyping in outbreak investigation and pathogen source tracking has been shown to be effective in a rapidly increasing number of studies (den Bakker et al., 2011, 2014; Allard et al., 2012; Leekitcharoenphon et al., 2014; Deng et al., 2015; Taylor et al., 2015; Hoffmann et al., 2016; Inns et al., 2016). WGS was first used to trace a Salmonella multistate outbreak in the United States in 2009 (CDC, 2019), and has been used for pathogen subtyping by the public health surveillance systems in the United States (Allard et al., 2018), Canada (Vincent et al., 2018), the United Kingdom (Ashton et al., 2016), Denmark (Kvistholm Jensen et al., 2016), and France (Moura et al., 2016). PulseNet International is also making efforts to implement WGS within the PulseNet network as a routine tool to replace PFGE and MLVA (Nadon et al., 2017; Figure 1). Both PHE (Ashton et al., 2016) and the US FDA (2018) have started using “real-time” WGS to subtype Salmonella isolates. CDC is also using WGS in state laboratories for Salmonella outbreak investigations (CDC, 2016b). WGS will be used increasingly for contamination incident investigations in the food industry, particularly as costs continue to fall and ease of use increases. WGS (as well as other sequencing approaches that use the same next-generation sequencing technologies used for WGS) also has a number of additional applications in the food industry, which will further drive implementation of these tools.
Examples of other applications include (i) monitoring ingredient supplies, (ii) identification of microbial persistence in processing environments, and (iii) prediction of antimicrobial resistance (including in Salmonella) and other relevant phenotypes, facilitating the improvement of sanitary management, microbial hazard control, and microbiological risk assessment (Allard et al., 2018; Rantsiou et al., 2018; Ricke et al., 2018).

Sequenced Salmonella genomes can be deposited and made publicly available on the National Center for Biotechnology Information site¹, the European Bioinformatics Institute site², or the DNA Data Bank of Japan site³, with data shared between all three (Kodama et al., 2012; Jagadeesan et al., 2019). NCBI provides phylogenetic tree-based clustering of all publicly available sequence data at the NCBI pathogen detection site⁴. These phylogenetic trees show the closest matches to any newly submitted data (Allard et al., 2018). NCBI also houses the data generated by the GenomeTrakr Network (FDA, 2018). This network was developed by the US FDA and NCBI as the first distributed network of laboratories to utilize WGS, with both genomic and geographic data, for foodborne pathogen characterization. This network includes the WGS laboratories of the CDC and USDA (Allard et al., 2016; Jackson et al., 2016). As of February 2019, there were over 184,000 genome sequences or raw sequencing data sets of S. enterica available on NCBI. WGS data analysis can also be performed off-line without using any public databases, an approach that may sometimes be preferred by industry.

Sequencing platforms that can currently be used for WGS include Illumina, Ion Torrent, Oxford Nanopore Technologies, and Pacific Biosciences (PacBio). Procedures to validate the complete workflow for S. enterica WGS with Illumina (MiSeq and HiSeq) and PacBio platforms, from subculture of isolates to bioinformatics analysis, have been reported by Portmann et al. (2018). The Illumina sequencing system is one of the most widely used sequencing platforms; it produces DNA-sequence reads 50–300 bp in length using sequencing-by-synthesis (SBS). This process uses fragmented DNA templates to detect single bases as they are incorporated during a DNA replication reaction on a solid surface flow cell (Illumina, 2019). For applications including comparative genomics and phylogeny, these short reads of DNA sequences can be aligned to a reference genome or de novo assembled into longer sequences called contigs (Loman and Pallen, 2015). The large amount of data generated by WGS combined with a complex data analysis process generally requires expertise in bioinformatics to deploy and run (Wyres et al., 2014; Deurenberg et al., 2017). However, software with a more user-friendly interface, such as CLC Genomics Workbench⁵, BioNumerics, and Geneious (Biomatters, New Zealand), is available, including for industry users with limited bioinformatics expertise, and an increasing number of user-friendly bioinformatics tools are being developed.

The rapid growth of WGS data in the publicly available databases allows industry to compare isolates with global entries of pathogen sequences used by food regulators and public health authorities (Allard et al., 2018; Rantsiou et al., 2018). Despite increasing availability of data analysis software, it is still challenging to generate consistent analytical reports due to the lack of standardized approaches to data analysis and interpretation (Clooney et al., 2016); for example, even with a standard software, choice of reference genomes can have considerable effects on the data analyses (Pightling et al., 2014). Furthermore, there are currently no clearly outlined safeguards to protect companies from regulatory action if shared WGS data show a relationship between pathogen isolates identified by a company and an outbreak isolate. Development of a mechanism for sharing data through anonymous hubs may allay concerns on confidentiality and encourage data sharing (FAO, 2016). This mechanism may also enable more effective data capture and analysis for monitoring trends and identifying related incidents.

The current cost of the entire WGS process, including DNA library preparation, sequencing, data analysis, and storage, is relatively high compared with the other molecular-based subtyping methods. The cost difference is more apparent when a small number of isolates are sequenced (as could be typical for the food industry). The cost of maintaining data analysis tools and bioinformatics personnel needs to be taken into consideration (Leekitcharoenphon et al., 2014; Ferrari et al., 2017; Nadon et al., 2017).

WGS-Analysis Procedures

Interpretation of WGS data for source tracking or outbreak investigation typically uses two approaches to represent results: (i) single-nucleotide polymorphism (SNP) or allelic differences (often presented as distance matrix tables), and (ii) phylogeny or clustering of the isolates. SNP or allelic differences show objectively the genetic distance between two isolates. Hence, if isolate A shows three SNPs or allelic differences to isolate B, and 26 SNPs or allelic differences to isolate C, then we can say that isolate A is more similar to isolate B than to isolate C. If one assumes that all three isolates evolved at the same rate, then we can say that isolates A and B are evolutionarily more closely related to each other than they are to C. However, this assumption (i.e., all isolates evolve at the same rate) may not always be true. Environmental conditions or mutations in the DNA repair system may influence the rate of genetic change accumulated in a genome; e.g., a Salmonella isolate persisting in a humid, nutrient-rich environment such as a chicken farm may multiply much faster than an isolate persisting in a dry food processing environment. This environmental difference will allow the “chicken farm” isolate to accumulate more mutations (per year or any other time unit) than the dry food processing environment isolate, because the “chicken farm” isolate will multiply more times during the same period than the dry food processing environment isolate. Moreover, mutations in genes involved in DNA repair may result in the so-called “mutator phenotypes” (also sometimes referred to as “hypermutators”). Mutator isolates accumulate mutations at a higher rate than non-mutator isolates (Muteeb and Sen, 2010). Hence, analyzing the number of SNP or allelic differences alone may result in misinterpretation of the results if the assumption that isolates evolved at the same rate does not hold true.
Phylogenetic or clustering analyses are thus better suited to an investigation, as these analyses group isolates by their similarities instead of their differences (Pightling et al., 2018). To infer the evolutionary relationship of the isolates within a data set, therefore, a phylogeny must be constructed. For more detailed and technical information on reconstructing bacterial phylogenies from WGS data, the reader is referred to two in-depth reviews on this subject (Collins and Xavier, 2017; Patané et al., 2018).
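The two ways of representing results can be sketched in Python as follows. The toy alignment is constructed only to reproduce the worked numbers above (A and B differ at three sites, A and C at 26), and the five-SNP threshold is a hypothetical value. Single-linkage clustering is used here as a simple stand-in for grouping isolates; it is not a phylogenetic reconstruction.

```python
from itertools import combinations
from typing import Dict, List, Tuple


def snp_distance(a: str, b: str) -> int:
    """Count sites where two aligned sequences carry different unambiguous bases."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for x, y in zip(a, b) if x != y and x in "ACGT" and y in "ACGT")


def distance_matrix(alignment: Dict[str, str]) -> Dict[Tuple[str, str], int]:
    """Pairwise SNP distances, stored symmetrically."""
    names = sorted(alignment)
    dist = {(n, n): 0 for n in names}
    for a, b in combinations(names, 2):
        d = snp_distance(alignment[a], alignment[b])
        dist[(a, b)] = dist[(b, a)] = d
    return dist


def single_linkage_clusters(
    dist: Dict[Tuple[str, str], int], names: List[str], threshold: int
) -> List[List[str]]:
    """Group isolates connected by chains of pairs at or below the threshold."""
    parent = {n: n for n in names}

    def find(n: str) -> str:
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path compression
            n = parent[n]
        return n

    for a, b in combinations(names, 2):
        if dist[(a, b)] <= threshold:
            parent[find(a)] = find(b)
    groups: Dict[str, List[str]] = {}
    for n in names:
        groups.setdefault(find(n), []).append(n)
    return sorted(sorted(g) for g in groups.values())


# Toy alignment of variable sites (hypothetical data).
alignment = {
    "A": "A" * 30,
    "B": "C" * 3 + "A" * 27,  # 3 differences from A
    "C": "A" * 4 + "G" * 26,  # 26 differences from A
}
dist = distance_matrix(alignment)
clusters = single_linkage_clusters(dist, sorted(alignment), threshold=5)
```

With these inputs, A and B fall into one cluster and C stands alone. As the text notes, such distances say nothing on their own about evolutionary rate, which is why a phylogeny is still needed to infer relationships.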

WGS Analysis Approaches for Serotyping

Genetic-based approaches have been developed for in silico determination of serovars, because the phenotypic determination of Salmonella serovars is costly, time-consuming, and labor-intensive. These in silico methods have relied on two main approaches: (i) indirect determination using genetic markers associated with particular serovars and (ii) direct determination using genes responsible for the expression of the somatic O (rfb gene cluster) and flagellar H (fljB and fliC) antigens. The latter method has the advantage of relying on the same genetic information that results in the phenotype assessed by traditional serotyping, while the former method may require validation for newly described serovars. These two approaches can also be combined for more reliable serovar prediction.

With the advent of whole-genome sequencing (WGS), in silico direct serovar determination has become the most used approach, and at least two Salmonella serovar databases and programs have been routinely used for in silico serotyping of Salmonella: SeqSero (Zhang et al., 2015) and SISTR (Yoshida et al., 2016a). SeqSero uses a database of 473 alleles representing 56 fliC antigenic types and 190 alleles representing 18 fljB antigenic types in a combined H-antigen database (Zhang et al., 2015). The somatic O-antigen database associated with SeqSero consists of 46 rfb gene cluster sequences corresponding to the 46 O-antigens identified in Salmonella (Zhang et al., 2015). The rfb database was specifically designed to be used with genome assemblies (as opposed to raw sequencing reads). A third database was specifically built for determination of the somatic O-antigen using raw sequencing reads (as opposed to genome assemblies). This third database consists of the genes wzx (encoding the O-antigen flippase), wzy (encoding the O-antigen polymerase), and other targets, all of which are found within the rfb gene cluster. In total, the authors claimed that the SeqSero scheme can theoretically identify 2,389 of the 2,577 serovars that were described in the White–Kauffmann–Le Minor scheme by the end of 2014 (Zhang et al., 2015). The inability to predict 188 serovars is due to the absence of the DNA sequences for the antigen-encoding genes corresponding to these serovars in the SeqSero database. Empirical data showed that the SeqSero database has an accuracy of 91.5–92.6% for serotype prediction (Zhang et al., 2015).
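The final step of direct serovar determination, mapping the called O and H antigens to a serovar name, can be sketched as a table lookup. This is a minimal illustration and not SeqSero's implementation: the antigen calls are assumed to be already available as strings, the function name is hypothetical, and the lookup table holds only a few antigenic formulae as listed in Table 3 of this review.

```python
from typing import Dict, Optional, Tuple

# Toy lookup keyed by (O antigens, phase 1 H antigens, phase 2 H antigens).
# "-" marks the absence of a phase 2 antigen. Entries follow Table 3.
ANTIGENIC_FORMULAE: Dict[Tuple[str, str, str], str] = {
    ("6,7", "g,m,s", "-"): "Montevideo",
    ("1,3,19", "g,s,t", "-"): "Senftenberg",
    ("4,12", "f,g,s", "-"): "Agona",
    ("6,8", "e,h", "1,2"): "Newport",
    ("3,10", "l,v", "1,7"): "Give",
}


def predict_serovar(o_antigen: str, h1: str, h2: Optional[str] = None) -> Optional[str]:
    """Return the serovar for an antigenic formula, or None if it is not in the table."""
    key = (o_antigen, h1, h2 if h2 else "-")
    return ANTIGENIC_FORMULAE.get(key)
```

A formula missing from the table returns `None`, which mirrors the limitation described above: a serovar cannot be predicted when its antigen-encoding sequences are absent from the database.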

SISTR is a platform for in silico analysis of Salmonella draft genome assemblies. SISTR includes the Salmonella Genoserotyping Array (SGSA) tool among other resources. SGSA relies on the allelic differences found within the rfb gene cluster for determination of 18 of the 46 somatic O-antigens, and fljB and fliC for determination of 41 flagellar H antigens (Yoshida et al., 2014). SGSA targets the identification of 90% (n = 2,190) of Salmonella serovars. When serovar determination using genoserotyping is not possible or is incomplete, SISTR also has the option to use the core genome MLST (cgMLST) scheme to infer the serovar based on phylogenetic context. The accuracy of SISTR in predicting Salmonella serovars has been assessed to be close to 95% (Yoshida et al., 2016a, b; Robertson et al., 2018).

Since SISTR can use genoserotyping and the cgMLST scheme to infer the serovar, higher confidence should be attributed to assignments where both genoserotyping and cgMLST agree on the serovar designation. Moderate confidence should be attributed to serovar assignments when only cgMLST is able to identify the serovar. When neither the genoserotyping nor cgMLST can identify the serovar, SeqSero may be used and may allow for serovar prediction.
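The confidence tiers described above can be written as a small decision function. This is a sketch of the logic in the text only, not part of SISTR; the function name and return labels are hypothetical. Cases the text does not rank (genoserotyping alone, or disagreement between the two calls) are flagged for manual review as an explicit assumption.

```python
from typing import Optional, Tuple


def combine_serovar_calls(
    genoserotype: Optional[str], cgmlst: Optional[str]
) -> Tuple[Optional[str], str]:
    """Combine genoserotyping and cgMLST serovar calls into (serovar, confidence)."""
    if genoserotype and cgmlst and genoserotype == cgmlst:
        # Both methods agree on the serovar designation.
        return genoserotype, "higher"
    if cgmlst and not genoserotype:
        # Only cgMLST could identify the serovar.
        return cgmlst, "moderate"
    if not genoserotype and not cgmlst:
        # Neither method produced a call; the text suggests trying SeqSero.
        return None, "unresolved: try SeqSero"
    # Assumption: the text does not rank these remaining cases.
    return None, "manual review"
```

For example, `combine_serovar_calls("Newport", "Newport")` yields a higher-confidence Newport call, while a cgMLST-only call is returned with moderate confidence.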

WGS Analyses for Subtype Characterization

Overview of WGS data analysis approaches

Different approaches can be used for analysis of WGS data for subtyping characterization related to source incident tracking. The most common approaches are based on (i) high-quality single-nucleotide polymorphism (hqSNP) identification and pairwise comparison of hqSNP differences, or (ii) whole-genome (wg)/cgMLST typing using pre-defined schemes (i.e., databases) containing allelic differences for either the pan (wg) or core (cg) genomes of Salmonella and subsequent pairwise comparison for assessing the number of allelic differences.

High-quality SNP analyses

High-quality SNP analyses rely on identification of SNP differences across a set of closely related isolates using raw sequence data, which are mapped to a closed or draft genome assembly (also referred to as the “reference genome”). Only SNPs that have been vertically transferred from an ancestral isolate to the current isolates are subject to the hqSNP analysis, while SNPs that were supposedly horizontally transferred are filtered out from the results. The reference can be a closely related genome outside the dataset, or a genome within the dataset. The analysis consists of two main steps: (i) mapping the raw sequence reads against the reference genome and (ii) SNP calling using stringent criteria to prevent the misidentification of sequencing errors or misaligned regions as SNPs (Davis et al., 2015; Katz et al., 2017). The choice of a closely related reference has been shown to be a key step in the analysis. Reference genomes that are not closely related to the set of isolates under investigation may result in underestimation of the number of SNPs, due to specific regions of the genome that may be present in the dataset under investigation, but that are missing in the reference genome (Pightling et al., 2014). There are at least two publicly available approaches that have been commonly used for hqSNP analysis: (i) the US FDA CFSAN (The Center for Food Safety and Applied Nutrition) SNP pipeline (Davis et al., 2015) and (ii) the US CDC-developed Lyve-SET hqSNP pipeline (Katz et al., 2017). These two pipelines rely on publicly available software to carry out the mapping and SNP calling steps and offer similar results despite some methodological differences, including different criteria for filtering out low-quality SNPs and masking regions supposedly acquired through horizontal gene transfer.
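The idea of "stringent criteria" in the SNP calling step can be sketched as a two-stage filter over candidate variants. All thresholds, field names, and the function name below are hypothetical and are not the defaults of the CFSAN SNP pipeline or Lyve-SET. The density filter is a simplified stand-in for the way such pipelines discard clustered SNPs that may reflect misalignment or horizontally acquired regions.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Candidate:
    pos: int             # position on the reference genome
    depth: int           # number of reads covering the position
    alt_fraction: float  # fraction of reads supporting the variant base


def filter_hq_snps(
    candidates: List[Candidate],
    min_depth: int = 10,
    min_alt_fraction: float = 0.9,
    window: int = 1000,
    max_in_window: int = 3,
) -> List[Candidate]:
    """Keep well-supported SNPs, then drop SNPs that sit in dense clusters."""
    passed = [
        c
        for c in sorted(candidates, key=lambda c: c.pos)
        if c.depth >= min_depth and c.alt_fraction >= min_alt_fraction
    ]
    dense = set()
    left = 0
    for right in range(len(passed)):
        # Shrink the window until it spans fewer than `window` bases.
        while passed[right].pos - passed[left].pos >= window:
            left += 1
        if right - left + 1 > max_in_window:
            dense.update(c.pos for c in passed[left : right + 1])
    return [c for c in passed if c.pos not in dense]


candidates = [
    Candidate(100, 50, 0.99),    # kept
    Candidate(5000, 4, 0.99),    # dropped: low depth
    Candidate(9000, 40, 0.60),   # dropped: mixed base calls
    Candidate(20000, 45, 1.00),  # dropped: part of a dense cluster
    Candidate(20010, 45, 1.00),
    Candidate(20020, 45, 1.00),
    Candidate(20030, 45, 1.00),
    Candidate(40000, 60, 0.98),  # kept
]
hq_positions = [c.pos for c in filter_hq_snps(candidates)]
```

Changing any of these parameters changes the SNP counts, which is one reason the same reference and settings must be reported when hqSNP results are compared between laboratories.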

High-quality SNP analysis has been applied in several outbreak investigations in the United States, Canada, and some European countries, including a Salmonella Enteritidis outbreak in the United Kingdom that was linked to a German egg producer (Inns et al., 2015). Historical Salmonella Typhimurium isolates from humans and foods involved in five outbreaks and consisting of five distinct MLVA subtypes were re-analyzed using hqSNP analysis by Octavia et al. (2015); in this study at least 11 isolates not previously linked to the outbreaks were ruled in based on fewer than two SNP differences to the isolates previously linked to the outbreaks. Another retrospective study used hqSNP to analyze a collection of 55 Salmonella Enteritidis isolates from seven epidemiologically characterized outbreaks and sporadic cases. One isolate not previously linked to any outbreak (i.e., sporadic) was identified to be part of one outbreak (“ruled in”) (Taylor et al., 2015). An investigation into a multi-state outbreak caused by Salmonella Poona was carried out in 2015 using PFGE and hqSNP analysis. Analysis by PFGE demonstrated three different patterns. However, WGS results showed that isolates with different PFGE patterns were genetically linked with fewer than six SNP differences (Kozyreva et al., 2016). Subtyping of Salmonella Dublin with PFGE was shown to have limited value in a recent outbreak investigation due to its low discriminatory power for this Salmonella serovar (Mohammed et al., 2016). The nine clinical isolates associated with the outbreak were indistinguishable by PFGE, but they were also indistinguishable from other unrelated Salmonella Dublin isolates.
The nine isolates linked to the outbreak clustered together with one to nine SNP differences when analyzed using hqSNP, and they could be distinguished from other isolates that shared the same PFGE pattern with epidemiologically unrelated isolates showing more than 50 SNP differences when compared to the outbreak isolates (Mohammed et al., 2016). These studies show that public health agencies are increasingly relying on hqSNP analysis for outbreak investigation, including tracking the source of outbreaks. High-quality SNP analysis clearly improves subtype accuracy and outbreak investigations by not only allowing for increased discriminatory power, but also reducing instances where closely related isolates are being classified as “different.”

wgMLST

Whole-genome MLST (wgMLST) analysis relies on the comparison of individual genomes against a database containing all known alleles for all the genes representing the pan genome of a defined group of strains (i.e., serovar, subspecies, species, and genus). The pan genome is defined as all the genes present in at least one genome from a defined group. Two main approaches can be used, and these are often used in combination: (i) assembly free mapping and (ii) assembly based mapping. Raw sequencing reads are directly mapped against the database in an assembly free approach. Hence, this approach does not require de novo assembly of the genome prior to its utilization. SRST2 (Inouye et al., 2014) and BWA-MEM (Li, 2013) are the most commonly used programs to carry out this task. Because this approach deals directly with the raw sequence reads, it allows filtering low-quality reads or specific nucleotides with low quality within a good-quality read. In an assembly based approach the raw sequence reads are first used to generate a high-quality draft genome (i.e., usually not a closed genome) using a genome assembler. Later, the draft genome (i.e., assembly) is used to find matches against the database. The program most commonly used to map the draft genome against the database is BLASTN (Altschul et al., 1990), although other options also exist. Independently from the approach used (i.e., assembly free or assembly based), the result of mapping a genome against a database is a list of the alleles found in the analyzed genome. When more than one genome is analyzed, the list of alleles from each genome can be compared and the number of allele differences can be computed. Assembly free and assembly based wgMLST allele assignment should match for high confidence. Results are often shown as a distance matrix of allele differences and a dendrogram constructed from this distance matrix. 
The wgMLST methods allow for comparison of non-closely related isolates from different groups since all genomes are compared against the same database, which is a great advantage of this method over hqSNP (Maiden et al., 2013; Nadon et al., 2017). A disadvantage of the method is that the database must be constructed and shared across different groups, who must agree in using the same database in order to make their results comparable (Nadon et al., 2017). Construction of such databases is also time-consuming and labor-intensive, with the difficulty increasing with the diversity of the organisms included in the same database (e.g., a database for S. enterica subspecies enterica serovar Agona will require less time and labor than a database for all S. enterica).
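The allele-calling logic described above can be sketched as follows: alleles are kept only where the assembly-free and assembly-based calls agree, and allele differences between two isolates are counted over loci called in both. Locus names, allele numbers, and function names are hypothetical; handling missing loci by skipping them is an assumption and not a rule stated by any particular scheme.

```python
from typing import Dict, Optional

Profile = Dict[str, Optional[int]]  # locus -> allele number (None = no confident call)


def consensus_calls(assembly_free: Profile, assembly_based: Profile) -> Profile:
    """Keep an allele only where both calling approaches agree."""
    loci = set(assembly_free) | set(assembly_based)
    return {
        locus: assembly_free.get(locus)
        if assembly_free.get(locus) == assembly_based.get(locus)
        else None
        for locus in sorted(loci)
    }


def allele_differences(p1: Profile, p2: Profile) -> int:
    """Count loci with different alleles, skipping loci not called in both profiles."""
    shared = [
        locus
        for locus in p1
        if p1[locus] is not None and p2.get(locus) is not None
    ]
    return sum(1 for locus in shared if p1[locus] != p2[locus])


# Hypothetical calls for one isolate from the two approaches.
free_calls = {"locus1": 5, "locus2": 12, "locus3": 7}
assembly_calls = {"locus1": 5, "locus2": 12, "locus3": 8}
isolate_x = consensus_calls(free_calls, assembly_calls)

# A second isolate carrying an accessory gene (locus4) absent from the first.
isolate_y = {"locus1": 5, "locus2": 3, "locus3": 7, "locus4": 1}
n_diff = allele_differences(isolate_x, isolate_y)
```

Here locus3 is discarded for isolate X because the two approaches disagree, so the isolates differ at one comparable locus (locus2). Repeating this over all pairs gives the distance matrix from which a dendrogram is built.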

Core genome MLST (cgMLST)

The cgMLST method is very similar to the wgMLST method. The major difference is the size and nature of the database. While the wgMLST database contains alleles for all genes in the pan genome of the defined group, the cgMLST database only contains alleles for those genes that are present in all (or almost all) genomes of the defined group (i.e., the “core genome”). Hence, a cgMLST database will not capture the genetic diversity present in the accessory genes (i.e., genes that are not present in all isolates) and tends to be much smaller than a corresponding wgMLST database. The advantages of using cgMLST are: (i) speed; because the cgMLST database is smaller than the wgMLST database, results can be obtained faster, and (ii) construction of the cgMLST database is generally easier than that of the wgMLST database, as typically fewer genomes are needed to identify the core genome than the pan genome of a group (den Bakker et al., 2010). While allele code schemes are used by some groups to summarize the differences observed among isolates subtyped by both cgMLST and wgMLST (Nadon et al., 2017), it generally is easier to define standard, stable cgMLST allele codes. This allele code scheme can be easily transferred in a spreadsheet and can be interpreted similarly to what has been in use for PFGE. An allele code scheme may not, however, be fully stable and may need to be revised as new cg- or wgMLST types are identified (Nadon et al., 2017). A disadvantage of cgMLST is that it may show reduced discriminatory power compared with wgMLST, as shown in a comparison between the Salmonella cgMLST and wgMLST schemes defined in EnteroBase (Alikhan et al., 2018), carried out using Salmonella Enteritidis historical isolates from a UK egg-associated outbreak (Inns et al., 2015), as well as closely related non-outbreak isolates identified previously (Dallman et al., 2016).
The 177 isolates from this dataset resulted in 177 unique sequence types by wgMLST (Simpson’s diversity index = 1.00) and 137 unique sequence types by cgMLST (Simpson’s diversity index = 0.98) (P < 0.05), showing the superior discriminatory power of wgMLST over cgMLST. However, both approaches grouped the isolates into identical clusters (Pearce et al., 2018).
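Simpson's diversity index, used above to compare discriminatory power, can be computed from the list of subtype assignments. The sketch below assumes the Hunter–Gaston formulation commonly used in subtyping studies, D = 1 − Σ nᵢ(nᵢ − 1) / (N(N − 1)), where nᵢ is the number of isolates of type i and N is the total number of isolates. The cited study may have used a different variant, and the function name is hypothetical.

```python
from collections import Counter
from typing import Hashable, Iterable


def simpsons_diversity(types: Iterable[Hashable]) -> float:
    """Probability that two isolates drawn without replacement have different types."""
    counts = list(Counter(types).values())
    n = sum(counts)
    if n < 2:
        raise ValueError("at least two isolates are required")
    return 1 - sum(c * (c - 1) for c in counts) / (n * (n - 1))


# 177 isolates that each receive a unique sequence type give the maximum index.
all_unique = simpsons_diversity(range(177))

# Hypothetical small example: four isolates, two of which share a type.
small_example = simpsons_diversity(["ST1", "ST1", "ST2", "ST3"])
```

When every isolate has its own type, as in the wgMLST result above, the index is exactly 1. The cgMLST value depends on how the 177 isolates are distributed among the 137 types, which is not given here, so it is not recomputed in this sketch.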

Comparison of hqSNP-based analysis and genomic MLST analysis

Theoretically, hqSNP analysis is the most discriminatory approach for molecular subtyping, as it investigates all possible SNPs between each pair of isolates in the dataset. The second most discriminatory approach is wgMLST, which is designed to investigate virtually all genes in the genomes; intergenic regions and genes not present in the wgMLST scheme will not be investigated and polymorphisms present in these regions will be missed. The cgMLST approach is the least discriminatory of the three as it relies on only a subset of the genes present in the wgMLST scheme. Hence, similarly to the wgMLST approach, polymorphisms present in intergenic regions and in genes not included in the cgMLST scheme will not be assessed (Chen et al., 2017). Both wgMLST and cgMLST are reference-independent which makes the results more reproducible and transferable than hqSNP analysis (Nadon et al., 2017). In order to reproduce the results obtained from hqSNP analysis, one needs to use the same reference and parameters that were used in the original analysis (Nadon et al., 2017). This is not an issue with wgMLST or cgMLST analysis as long as analyses use the same scheme containing the same genes and alleles to allow for comparisons. Transference and communication of the results also seem to be more complicated for hqSNP analysis than for cgMLST or wgMLST (Nadon et al., 2017). This is because hqSNP analysis, as compared to cgMLST or wgMLST analyses, requires more parameter settings, which must be communicated for better interpretation. wgMLST and cgMLST analyses are also typically integrated into commercially available software, while the hqSNP pipelines are available as free open software or integrated into commercial software. Free-of-charge hqSNP pipelines require UNIX-based systems and are run through the command line, which may require specialized expertise (Nadon et al., 2017). 
Commercially available software that can run cgMLST and wgMLST (e.g., BioNumerics) tends to be more user-friendly. BioNumerics uses a graphical user interface and can be installed on Microsoft Windows computers. The hqSNP analysis can easily be kept private as the analysis can be run within a closed dataset of genomes. The cgMLST and wgMLST analyses can also be kept private; however, this may require some additional infrastructure (i.e., a private cloud) to be built around the commercial software.

Comparison of Molecular Methods for Predicting the Serovar of Salmonella

A comparison of different molecular methods for predicting the serovar of Salmonella is shown in Table 2. Acceptable correlation between PFGE patterns and serovars has been described by several researchers (Weigel et al., 2004; Nde et al., 2006; Gaul et al., 2007; Kerouanton et al., 2007; Zou et al., 2010; Shi et al., 2015; Bopp et al., 2016). Shi et al. (2015) summarized the serovar-prediction accuracy of different molecular serotyping methods with studies from 1993 to 2013. The proportion of isolates that may not be accurately serotyped with PFGE is generally comparable to the proportion that is not typeable, or that requires extensive additional labor and reagents, using conventional serotyping (Bopp et al., 2016). Examples of serovars incorrectly predicted by PFGE are summarized below (Table 3). Overall, with PFGE patterns for approx. 500 Salmonella serovars in the PFGE pattern database (Ranieri et al., 2013; Shi et al., 2015) and the reported good correlation between PFGE patterns and serovars, PFGE-based serovar prediction should be possible for a large proportion of these serovars, but will not be possible for a large number of less common serovars not represented in the database.

TABLE 2

Number of isolates tested	Number of serovars tested	Isolate sources	Serovar-prediction accuracy (%)	References
PFGE
80	6	Turkey processing plant	99	Nde et al., 2006
68	10	Swine farms	84	Weigel et al., 2004
674	12	Swine	85	Gaul et al., 2007
866	8	Food animals, production facilities, and clinical samples	96	Zou et al., 2010
1,128	31	Food, animals, humans, natural environment, and processing plants	97	Kerouanton et al., 2007
46	40	Human and cattle	75	Ranieri et al., 2013
1,486	110	New York State Department of Health, isolates received in 2012; human clinics	96	Bopp et al., 2016
1,437	131	New York State Department of Health, isolates received in 2013; human clinics	91	Bopp et al., 2016
1,558	107	New York State Department of Health, isolates received in 2014; human clinics	90	Bopp et al., 2016
Legacy MLST
25	7	Chickens	92	Liu, 2010
66	1	Cattle, birds, horses, and other animals	99	Sukhnanand et al., 2005
110	25	Human and veterinary source	98	Torpdahl et al., 2005
152	33	Reference collection	100	Ben-Darif et al., 2010
4,257	554	Reference collection	88	Achtman et al., 2012
46	40	Human and cattle	91	Ranieri et al., 2013
42,400	624	SRA collection	91	Robertson et al., 2018
7,338	263	Human	96	Ashton et al., 2016
WGS (SeqSero)
308	72	CDC collection	99	Zhang et al., 2015
3,306	228	GenomeTrakr collection	93	Zhang et al., 2015
354	44	GenBank collection	92	Zhang et al., 2015
WGS (SISTR)
4,291	246	SRA and NCBI Assembly collections	95	Yoshida et al., 2016a
42,400	624	SRA collection	97	Robertson et al., 2018

Comparison of molecular characterization methods for prediction of Salmonella¹ serovars.

¹This table is revised from the information provided by the review of Shi et al. (2015).

TABLE 3

Major incorrectly predicted serovars	“O” antigens	Phase 1 “H” antigens	Phase 2 “H” antigens	References
Montevideo (clustered with Senftenberg)	6,7	g,m,s	No phase 2 antigen	Nde et al., 2006
Senftenberg (clustered with Montevideo)	1,3,19	g,s,t	No phase 2 antigen	Nde et al., 2006
Typhimurium var. Copenhagen (clustered with 4,[5],12:i:- and Typhimurium)	1,4,12	i	1,2	Gaul et al., 2007
4,5,12:i:- (clustered with Typhimurium var. Copenhagen and Typhimurium)	4,5,12	i	No phase 2 antigen	Gaul et al., 2007
Typhimurium (clustered with Typhimurium var. Copenhagen and 4,[5],12:i:-)	1,4,5,12	i	1,2	Gaul et al., 2007
Saintpaul (clustered with Typhimurium var. Copenhagen and Typhimurium)	1,4,5,12	e,h	1,2	Ranieri et al., 2013
Putten (clustered with Agona)	13,23	d	l,w	Gaul et al., 2007
Agona (clustered with Putten)	4,12	f,g,s	No phase 2 antigen	Gaul et al., 2007
Paratyphi B	1,4,5,12	b	1,2	Kerouanton et al., 2007
Give	3,10	l,v	1,7	Kerouanton et al., 2007
Newport	6,8	e,h	1,2	Kerouanton et al., 2007

Examples of serovars incorrectly predicted by PFGE.

Multiple locus variable number of tandem repeats analysis is not widely used for serovar prediction even though efforts have been made to develop MLVA subtyping schemes to subtype multiple serovars of Salmonella with one protocol (Van Cuyck et al., 2011; Kjeldsen et al., 2016). A universal MLVA scheme for most frequently isolated Salmonella serovars (accounting for 80% of the clinical isolates from humans in Europe) has been developed by Kjeldsen et al. (2016). In another study, an MLVA scheme identified 31 serovars (Van Cuyck et al., 2011). Nevertheless, further development of multiple-serovar MLVA schemes and robust MLVA profile databases is unlikely to occur given the benefits offered by WGS.

The serovar-prediction accuracy of Rep-PCR has been reported to range between 0 and 100%, indicating some limitations of this method (Shi et al., 2015). Ranieri et al. (2013) showed that Rep-PCR accurately predicted the serovar of 30 out of 46 isolates representing the top 40 Salmonella serovars isolated from human and non-human sources, an accuracy of 65%. This accuracy was lower than that obtained with PFGE or MLST when the same set of isolates was evaluated.

Ashton et al. (2016) compared the serovars predicted by using legacy MLST sequences extracted from WGS data to the results generated by conventional serotyping, for 7,338 isolates representing 263 serovars of Salmonella enterica subspecies I. The 10 most common serovars in this S. enterica subspecies I dataset were serovars Enteritidis, Typhimurium, Infantis, Typhi, Newport, Virchow, Kentucky, Stanley, Paratyphi A, and Java. They found that the serovar prediction accuracy of legacy MLST was 96%.

The overall serovar-prediction accuracy for the CRISPR subtyping approach has been reported to range from 78 to 90% (Liu et al., 2011; Fabre et al., 2012; Shi et al., 2015). More studies are needed to further assess serovar-prediction accuracy using CRISPR.

Given the range of serovars represented in the SeqSero and SISTR databases, WGS can theoretically be used to predict 2,389 and 2,190 of the 2,577 serovars described in the White–Kauffmann–Le Minor scheme when using the serovar prediction programs SeqSero (Zhang et al., 2015) and SISTR (Yoshida et al., 2016a), respectively. Using empirical data, the accuracy of serotype prediction with SeqSero and SISTR has been reported to be approx. 92 and 95%, respectively (Zhang et al., 2015; Yoshida et al., 2016a, b; Robertson et al., 2018). By comparison, traditional Salmonella serotyping had an accuracy of 73% when 33–36 independent laboratories performed serotyping of the same eight Salmonella strains representing seven different serovars (Petersen et al., 2002), suggesting that WGS-based methods may be more reliable than traditional serotyping for assigning Salmonella isolates to serovars. Nevertheless, further experimental studies are needed to continue to quantify the ability of WGS-based methods to identify Salmonella serovars.

Comparison of Molecular Methods for Subtype Differentiation of Salmonella

Molecular methods are used for subtyping Salmonella isolates that belong to the same serovar, as well as being used for serovar prediction. This section briefly provides some examples of comparative studies of subtyping methods. In one study, PFGE was compared to MLVA to subtype 163 non-typhoidal Salmonella isolates representing 15 serovars; MLVA differentiated the isolates into 79 MLVA subtypes while PFGE differentiated the same isolates into 87 subtypes. The Nei’s diversity index for MLVA was 0.979 compared to 0.999 for PFGE (Kjeldsen et al., 2016). However, for specific serovars (e.g., Salmonella Enteritidis) MLVA has been reported to provide improved discriminatory power over PFGE (Boxrud et al., 2007; Beranek et al., 2009; De Cesare et al., 2015). MLST has the advantage of being highly reproducible and easily transferable among laboratories. However, in a study of 110 Salmonella isolates from 25 serovars (Torpdahl et al., 2005), MLST resulted in 43 sequence types, while PFGE was able to differentiate the isolates into 73 PFGE subtypes. The downside of PFGE in this study was the inability to type 11 of the 110 (10%) isolates. In a study comparing different molecular methods to differentiate 52 Salmonella Enteritidis isolates, PFGE resulted in eight subtypes, while MLVA resulted in 18 subtypes and WGS resulted in 34 subtypes. The discriminatory power of PFGE, MLVA, and WGS was 0.81, 0.92, and 0.97 (Simpson’s index of diversity), respectively (Deng et al., 2015). In another study, PFGE and WGS were used to differentiate 55 Salmonella Enteritidis isolates; PFGE resulted in 10 subtypes; however, WGS was able to further differentiate the isolates into 45 unique subtypes (Taylor et al., 2015), showing the greater discriminatory power of WGS over PFGE. 
In a study of isolates from a Salmonella Poona outbreak (Kozyreva et al., 2016), 4 PFGE subtypes and 7 WGS subtypes were observed among the 16 isolates; in silico MLST using the WGS data resulted in one MLST sequence type. Phylogenetic analysis using WGS data showed that the distinct PFGE types did not necessarily correlate with increased genetic distance between isolates. Isolates that differed by 0 SNPs showed distinct PFGE subtypes, suggesting that PFGE results would be misleading for these isolates (Kozyreva et al., 2016). While the relative discriminatory power of different subtyping methods depends on the strains and serovars tested, WGS methods were consistently found to be most discriminatory, followed by PFGE. While some MLVA schemes provide enhanced discriminatory power over PFGE for some serovars, for other serovars PFGE may be more discriminatory than MLVA.
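The discriminatory power values quoted above are Simpson's index of diversity, which can be read as the probability that two isolates drawn at random from the test set are assigned to different subtypes. The following is a minimal sketch of that calculation using the commonly used Hunter-Gaston form; the function name and the two example subtype lists are hypothetical and are not taken from any of the cited studies.

```python
from collections import Counter


def simpsons_index_of_diversity(subtypes):
    """Simpson's index of diversity (Hunter-Gaston form) for subtype labels.

    D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)), where n_j is the number of
    isolates assigned to subtype j and N is the total number of isolates.
    """
    total = len(subtypes)
    if total < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(subtypes)
    # Number of ordered isolate pairs that share a subtype.
    same_type_pairs = sum(n * (n - 1) for n in counts.values())
    return 1.0 - same_type_pairs / (total * (total - 1))


# Hypothetical data: the same 10 isolates typed with two methods.
coarse_method = ["A"] * 6 + ["B"] * 3 + ["C"]   # 3 subtypes
fine_method = ["a", "a"] + list("bcdefghi")     # 9 subtypes

print(round(simpsons_index_of_diversity(coarse_method), 3))
print(round(simpsons_index_of_diversity(fine_method), 3))
```

In this invented example the method that splits the isolates into more, and more evenly sized, subtypes yields the higher index, which mirrors the ordering reported for PFGE, MLVA, and WGS above.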

Criteria to Evaluate and Validate Different Salmonella Characterization Methods

Molecular-based Salmonella characterization methods, including WGS, are evolving rapidly. Many of the characterization methods and technologies, as well as data analysis pipelines, are operated as research tools and are under continuous development. Evaluation of these tools for Salmonella investigation, especially for those serovars/strains highly relevant to food products and processing environments, is a prerequisite for the implementation of these methods. Methods that can be used by the food industry must be thoroughly validated before implementation to ensure reliability and consistency of the method when it is used across different laboratories. Validation should cover the end-to-end workflow for source tracking from isolate subculture to bioinformatic analysis, articulating the key quality requirements and criteria (Ferrari et al., 2017; Nadon et al., 2017; Portmann et al., 2018). Proposed criteria for evaluation of Salmonella characterization methods for potential routine use in the food industry are shown below (Table 4).

TABLE 4

Key criteria for evaluation	Description	Target	Key factors affecting performance	Quantitative evaluation (scale of 0–5)
Stability	Consistency of the typing result for an isolate after its primary isolation and during laboratory storage and subculture.	Typing results should be stable during laboratory storage and subculture; strain markers should not mutate too rapidly to change the strain’s position in the epidemiological context; data on the stability of the markers should be available.	Rapid mutations and recombination of the marker(s) during storage and subculture could lead to poor reproducibility.	0 – Extremely poor stability 1 – No data are available on stability 3 – Some limited data suggest that markers are stable 5 – Strong data are available supporting stability of markers (and/or data are available that can be used to correct for mutations or changes in markers during passage).
Typeability	Ability to assign a type to all isolates tested by it.	Typeability should be as high as possible.	Poor typeability could be found in assays using a scheme that does not cover genetic variation in full; typeability may also be reduced if some isolates show high endogenous nuclease activity.	0 – Extremely poor typeability (<80%) 1 – Data indicate between 80 and 90% typeability; or no evaluation of typeability performed 2 – Data indicate between 90 and 93% typeability 3 – Data indicate between 94 and 96% typeability 4 – Data indicate between 97 and 99% typeability 5 – Data indicate >99% typeability.
Discriminatory power	Ability to assign a different type to two unrelated strains; discriminatory power can be expressed using Simpson’s index of diversity (SID)	Discriminatory power should be as high as possible. For highly discriminatory methods, clustering using phylogenetic analysis tools can be used to define isolates that share a recent common ancestor.	Discriminatory power is highly dependent on the marker(s) selected for typing.	0 – Extremely poor discriminatory power (<80%, SID <0.80) 1 – Data indicate between 80 and 90% discriminatory power (SID 0.80–0.90); or no evaluation of discriminatory power performed 2 – Data indicate between 90 and 93% discriminatory power (SID 0.90–0.93) 3 – Data indicate between 94 and 96% discriminatory power (SID 0.94–0.96) 4 – Data indicate between 97 and 99% discriminatory power (SID 0.97–0.99) 5 – Data indicate >99% discriminatory power (i.e., SID > 0.99). Note: we recommend that data are generated using appropriate strain collection and >100 isolates.
Epidemiological concordance	Ability to reflect, agree with, and possibly further illuminate the available epidemiological information about the cases under study.	Epidemiological concordance should be as high as possible; strains from the same outbreak or strains that are otherwise linked by epidemiological evidence should be classified into the same subtype (or phylogenetically characterized as sharing a recent common ancestor).	Low epidemiological concordance could be found in assays that target “low stability markers” or in assays with limited discriminatory power, which will group together isolates that are epidemiologically unrelated.	0 – Extremely poor epidemiological concordance; <80% isolates are classified correctly. 1 – Poor epidemiological concordance; data indicate between 80 and 90% isolates are classified correctly; or no evaluation of epidemiological concordance performed 2 – Low epidemiological concordance; data indicate between 90 and 93% isolates are classified correctly 3 – Intermediate level of epidemiological concordance; data indicate between 94 and 96% isolates are classified correctly 4 – Good epidemiological concordance; data indicate between 97 and 99% isolates are classified correctly 5 – Strong epidemiological concordance; data indicate all isolates are classified correctly Note: we recommend that data are generated by using at least 20 sets of epidemiologically related isolates. Ideally, a given subtyping method classifies all of these isolates correctly.
Reproducibility	Ability to perform reproducibly in different laboratories and with different personnel.	Results should be highly reproducible (>99%).	Poor reproducibility could be the result of (i) a technically difficult assay (leading to technical errors by personnel, e.g., cross-contamination), (ii) reagents not standardized sufficiently, (iii) equipment not performing reproducibly, (iv) a poorly optimized typing system, (v) sensitivity of equipment or assay system to environmental factors (e.g., humidity, temperature), (vi) bias in observing, recording, analysis, and interpretation of the results, or (vii) assays targeting biologically highly variable markers (e.g., some of the surface antigens targeted by classical serotyping).	0 – Extremely poor reproducibility; <80%; meaning for >20% of isolates results are not reproducible between labs 1 – Poor reproducibility; data indicate between 80 and 90% of isolates results are reproducible between labs 2 – Low reproducibility; data indicate between 91 and 93% of isolates results are reproducible between labs 3 – Intermediate reproducibility; data indicate between 94 and 96% of isolates results are reproducible between labs 4 – Good reproducibility; data indicate between 97 and 99% of isolates results are reproducible between labs 5 – Strong reproducibility; data indicate >99% of isolates results are reproducible between labs Note: we recommend that data are generated based on an evaluation by at least four laboratories.
Repeatability	Ability to produce the same results in the same laboratory with the same equipment and personnel.	Results should be highly repeatable (>99%).	Poor repeatability could be the result of (i) a technically difficult assay (leading to technical errors by personnel, e.g., cross-contamination), (ii) reagents not standardized sufficiently, (iii) equipment not performing reproducibly.	0 – Extremely low repeatability (<90%; meaning for >10% of isolates results are not repeatable) 1 – No evaluation of repeatability performed 2 – Data indicate between 90 and 93% repeatability 3 – Data indicate between 94 and 96% repeatability; or repeatability evaluated with small number of isolates (<40) 4 – Data indicate between 97 and 99% repeatability 5 – Data indicate >99% repeatability Note: we recommend that repeatability evaluation be performed with at least 40 isolates, ideally with 100 isolates.
Serovar prediction ability	Ability to accurately predict the serovar of a given strain.	Range, as the number of identifiable serovars, and accuracy (i.e., percentage of isolates with correct serovar identification) should be maximized. Accuracy should be given priority over range as misclassification may lead to worse decisions than non-classification.	Poor serovar prediction could be a result of (i) limited database coverage of different serovars, (ii) low discriminatory power, (iii) low typeability, (iv) no standard protocol of serovar prediction with produced data.	0 – Extremely low serovar prediction accuracy (serovar is correctly predicted for <70% of serovars) 1 – No evaluation of serovar prediction ability, or weak prediction accuracy (data indicate between 70 and 80% serovar prediction accuracy) 2 – Data indicate between 80 and 85% serovar prediction accuracy 3 – Data indicate between 85 and 90% serovar prediction accuracy; or serovar prediction ability evaluated with small number of serovars 4 – Data indicate between 90 and 98% serovar prediction accuracy 5 – Data indicate >98% serovar prediction accuracy; serovars are correctly predicted for all common isolates² Note: we recommend that data are generated by using at least 40 different serovars, ideally more than 100 serovars.
Speed	Time to results from pure single colony	<5 days	Speed can be influenced by throughput, equipment, and data analysis program used for a given assay	0 – >1 month 1 – 3–4 weeks 2 – 2–3 weeks 3 – 1–2 weeks 4 – ≤5 days 5 – ≤2 days
Ease of use	Ease of use encompasses technical simplicity, workload, suitability for high throughput test, ease of data analysis, and result interpretation	Ease of use is important for the implementation of an assay in the internal laboratories of food industry, less important when using services provided by a commercial laboratory.	Poor ease of use is usually caused by the high level of expertise and experience required by a given assay, e.g., bioinformatics expertise to analyze data produced by the assay.	0 – The given assay requires extremely high level of expertise and experience in specific techniques (PhD level scientist with >4 days of specialized training) 3 – The given assay requires average level of expertise and experience of a microbiological technician 5 – No specific expertise or experience required; assay can be completed by high school diploma and <1 day training.
Cost	Total cost encompasses cost of equipment, reagents/consumables, data analysis platform, and staffing. For routine use, we usually just assess the reagent cost per isolate. Staffing costs can vary considerably in different regions/countries within a given turnaround time, and thus need to be assessed separately for the actual local situation.	A balance between efficiency/effectiveness and cost of a given assay is more important than pursuing low cost, because low cost may potentially lead to larger economic loss and extra investigation time caused by poor quality of typing result.	High cost per isolate for routine test is usually caused by high reagent cost and long turnaround time (leading to high staffing cost).	We recommend using the actual reagent cost per isolate plus staffing cost estimated with given turnaround time to compare the assay being validated to the currently/previously used methods by food industry; data here are based on costs from commercial laboratories in North America and Europe: 0 – >$1,000 per isolate 1 – $500–$1,000 per isolate 2 – $200–$500 per isolate 3 – $150–$200 per isolate 4 – $100–$150 per isolate 5 – ≤$100 per isolate

Proposed evaluation criteria for Salmonella characterization methods that may be used routinely in the food industry¹.

¹The parameters and information in this table are adapted from Van Belkum et al. (2007) and Wiedmann et al. (2014) to reflect industry-specific practical needs.

²The serovar typing ability of the conventional serotyping method (White–Kauffmann–Le Minor scheme) is around 90% when both its typeability and accuracy are taken into consideration (Bopp et al., 2016).
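As an illustration of how the 0–5 scales in Table 4 might be applied, the sketch below maps a percentage onto the bands given for typeability and discriminatory power, and a cost per isolate onto the cost bands. The function names are hypothetical. Table 4 leaves small gaps and overlaps at band edges (e.g., between 93 and 94%, or exactly 90%); the cut-offs chosen here to close them are an assumption, not part of the published table.

```python
def score_percent_band(value):
    """Score a percentage using the typeability / discriminatory power bands.

    value is a percentage (for discriminatory power, SID x 100), or None when
    no evaluation was performed, which Table 4 scores as 1.
    Band edges at 90, 94, and 97 are an assumption where the table is ambiguous.
    """
    if value is None:
        return 1
    if not 0 <= value <= 100:
        raise ValueError("percentage must be between 0 and 100")
    if value < 80:
        return 0
    if value < 90:
        return 1
    if value < 94:
        return 2
    if value < 97:
        return 3
    if value <= 99:
        return 4
    return 5


def score_cost_usd(cost_per_isolate):
    """Score cost per isolate (USD); shared band edges get the higher score."""
    for limit, score in ((100, 5), (150, 4), (200, 3), (500, 2), (1000, 1)):
        if cost_per_isolate <= limit:
            return score
    return 0


# SID values reported by Deng et al. (2015), expressed as percentages.
sid_percent = {"PFGE": 81, "MLVA": 92, "WGS": 97}
scores = {method: score_percent_band(v) for method, v in sid_percent.items()}
print(scores)
print(score_cost_usd(120.0))  # hypothetical cost per isolate
```

The Deng et al. (2015) values are used only to show the mapping; that study included 52 isolates, fewer than the >100 isolates recommended in Table 4 for scoring discriminatory power.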

Implementation of Molecular-Based Salmonella Subtyping Methods by the Food Industry

We consider that WGS is the most suitable method to characterize Salmonella for incident investigation at production facilities in the food industry. This opinion is based on comparison of the resolution, turnaround time, ability of serovar prediction, cost, and feasibility of the available methods. Bioinformatics is a key capability required for WGS. The food industry may choose to invest in in-house capability that can interface with outside resources (e.g., academic partners, industry partners, government agencies); however, there are also opportunities to outsource data analyses to commercial or academic partner labs. Both the CFSAN pipeline and the Lyve-SET pipeline have been widely tested and seem to provide comparable and reliable results for hqSNP analysis. Implementation of wgMLST and cgMLST within BioNumerics has been successfully completed for L. monocytogenes in the United States. A cgMLST scheme for Salmonella is publicly available from EnteroBase (https://enterobase.warwick.ac.uk/) and is likely to be implemented within BioNumerics in the future. Other data analysis methods such as genome distance analysis (Pinho et al., 2009; Auch et al., 2010) may also become future approaches that allow the food industry to develop data analysis capabilities for contamination source tracking.

The turnaround time of in-house WGS subtyping can be comparable to that of many conventional subtyping methods, including conventional serotyping and PFGE (Table 1). WGS, however, provides much more information about an isolate with one single experimental procedure, enabling full characterization of the pathogen (including in silico serovar prediction and antimicrobial resistance gene identification) and more accurate clustering/discrimination of the isolates investigated. This is faster than using multiple conventional subtyping approaches in a stepwise manner to obtain equivalent information. The cost of WGS is also comparable to that of the conventional subtyping tools, considering the high quality and volume of information provided by WGS within one experimental procedure. Once WGS is implemented as the subtyping method for Salmonella, in silico serotyping should be performed instead of traditional serotyping for determination of serovars. This approach will greatly reduce the costs and time associated with serotyping.

Legacy MLST targeting variants of seven housekeeping genes of Salmonella can be used in combination with WGS. While legacy MLST classification can be obtained using Sanger sequencing technology (also known as first-generation sequencing technology) within 1 week, it can also be obtained by using the sequence information extracted from WGS data. Although legacy MLST has lower discriminatory power than PFGE and MLVA, it is faster than PFGE when using an in-house Sanger sequencer such as Applied Biosystems Genetic Analysis Systems (Thermo Fisher Scientific). It is also more broadly applicable across Salmonella serovars than MLVA, which usually requires a specific scheme for each serovar. In addition, the serovar prediction ability of legacy MLST has been demonstrated to be comparable to that of PFGE (Tables 1, 3).
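Once allele numbers have been called for the seven loci (whether from Sanger reads or from WGS data), assigning a legacy MLST sequence type is essentially a table lookup. The sketch below illustrates that final step only. The locus names are those of the widely used seven-gene Salmonella scheme; the allele numbers, the sequence type labels, and the function name are invented placeholders, and in practice the profile table would come from a curated public database.

```python
# Loci of the seven-gene Salmonella MLST scheme.
LOCI = ("aroC", "dnaN", "hemD", "hisD", "purE", "sucA", "thrA")

# Placeholder profile table: allele combinations and labels are invented.
KNOWN_PROFILES = {
    (1, 1, 1, 1, 1, 1, 1): "ST-example-1",
    (1, 1, 2, 1, 1, 1, 5): "ST-example-2",
}


def assign_sequence_type(alleles, known_profiles=KNOWN_PROFILES):
    """Return the sequence type label for a dict of locus -> allele number.

    Returns None when any locus is missing (the isolate is untypeable with
    this scheme) and "novel" when the allele combination is not in the table.
    """
    if any(locus not in alleles for locus in LOCI):
        return None
    profile = tuple(alleles[locus] for locus in LOCI)
    return known_profiles.get(profile, "novel")


isolate = dict(zip(LOCI, (1, 1, 2, 1, 1, 1, 5)))
print(assign_sequence_type(isolate))
```

Because every isolate is compared on the same seven loci regardless of serovar, the same lookup applies across serovars, which is the practical sense in which legacy MLST is more broadly applicable than serovar-specific MLVA schemes.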

PFGE is currently still the “gold standard” and the most widely used Salmonella DNA fingerprinting method among public health authorities and food regulators for characterizing and tracking this pathogen in outbreaks, although it is being replaced by WGS. PFGE remains a valuable tool for foodborne pathogen characterization by the food industry while the transition to WGS occurs. PFGE has been repeatedly shown to be more discriminatory than methods such as conventional serotyping, automated ribotyping, or MLST for many bacteria including Salmonella. In addition to these methods, single-plex or multiplex PCR assays that can detect and identify specific Salmonella serotypes have been described (Kim et al., 2006; Akiba et al., 2011; Zhu et al., 2015; Xiong et al., 2018; Xu L. et al., 2018; Xu Y. et al., 2018); these tools provide an alternative approach for detection and identification of specific Salmonella serovars.

The results of any subtyping approach can be used to assess the relationship of isolates in an investigation. Nevertheless, the epidemiological context is indispensable in final decision making in incident investigation and to determine further actions for food safety management improvement. High-resolution WGS subtyping results should not be interpreted in the absence of epidemiological information.

The raw sequence data generated by molecular-based subtyping methods, especially WGS, require both physical and virtual space for storage. It is desirable to retain the original sequence reads (usually files of >200 MB for each Salmonella isolate) for potential future analysis using alternative data analysis methods or for a retrospective investigation. Commercial clouds can provide a storage solution, provided that special attention is paid to data security. A robust Internet connection and high bandwidth are needed to transfer WGS data if data storage is outsourced. Subtyping analysis needs to be supported by complete metadata providing the relevant epidemiological context to identify the root cause of the contamination. Thus, the capability for metadata collection, organization, and storage is needed together with building the capability for WGS. The metadata should include information such as the geographic and temporal background of the isolates, the sample type, and the sample source (e.g., raw ingredients, finished products, environment). The Consortium for Sequencing the Food Supply Chain, founded by IBM and Mars Incorporated, represents industrial groups putting effort into collecting genomic information on pathogenic bacteria across the food supply chain⁶. This consortium represents one part of the broader goal to increase knowledge of foodborne pathogens at the genomic level.
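The metadata and storage requirements described above can be sketched as a small record type plus a lower-bound storage estimate. This is an illustrative sketch only: the class, field names, identifiers, and storage location are hypothetical; the list of allowed sample sources simply reuses the examples given in the text; and the estimate uses the 200 MB figure as a minimum per isolate.

```python
from dataclasses import dataclass
from datetime import date

# Sample sources named as examples in the text; a real list would be longer.
SAMPLE_SOURCES = {"raw ingredient", "finished product", "environment"}


@dataclass(frozen=True)
class IsolateRecord:
    """Minimal metadata kept alongside the raw reads of one isolate."""
    isolate_id: str
    collected_on: date     # temporal background
    location: str          # geographic background
    sample_type: str
    sample_source: str
    reads_uri: str         # where the original sequence reads are stored

    def __post_init__(self):
        if self.sample_source not in SAMPLE_SOURCES:
            raise ValueError(f"unknown sample source: {self.sample_source!r}")


def minimum_read_storage_gb(n_isolates, mb_per_isolate=200):
    """Lower-bound storage estimate in GB, assuming >=200 MB per isolate."""
    return n_isolates * mb_per_isolate / 1000


record = IsolateRecord(
    isolate_id="ISO-0001",
    collected_on=date(2019, 1, 15),
    location="Plant A, line 2",
    sample_type="swab",
    sample_source="environment",
    reads_uri="s3://example-bucket/ISO-0001_R1.fastq.gz",
)
print(record.sample_source, minimum_read_storage_gb(500))
```

Validating the sample source when the record is created is one way to keep metadata consistent enough to be searched during a later retrospective investigation.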

Conclusion

The application of DNA-based methods for characterization of pathogens such as Salmonella has become common practice. Our literature-based assessment supports the superior discriminatory power of WGS and its advantages compared with other methods for Salmonella subtyping and source tracking for the food industry. We also identified circumstances under which use of other subtyping methods may be warranted. Implementation of molecular-based Salmonella characterization methods, including WGS, improves source tracking and root cause elimination; however, these methods require investment in bioinformatics capability. Routine use of WGS or complete replacement of current subtyping methods by WGS will require attention to key issues including standardization, robustness, and validation of the analytical methodology. High-resolution WGS subtyping of Salmonella promises to vastly improve the ability of the food industry to track and control Salmonella and is poised to become standard methodology in food safety for characterization of foodborne pathogens by public health authorities and food regulators. Nevertheless, standardization of WGS operation and data analysis, in particular source tracking analysis, is required at a global level. A common understanding of WGS and agreement on its application among the food industry, public health authorities, and food safety regulators are expected to guide the implementation of WGS in food safety management.

Statements

Author contributions

ST and MW conceived and designed the work. ST, RO, and HL collected the data, conducted data analysis, and interpreted it. ST, RO, HL, and MW drafted the article. MW, AS, RB, CG, and GZ critically revised the article.

Funding

The authors declare that this study received funding from Mars Global Food Safety Center. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.

Acknowledgments

We thank Peter Markwell, Dr. Bala Ganesan, and Dr. Kristel Hauben for comments that greatly improved the manuscript.

Conflict of interest

ST, HL, CG, GZ, RB, and AS were employed by the Mars Global Food Safety Center. MW serves as a compensated scientific advisor for BioMérieux, Mérieux NutriSciences, Mars, and Neogen and has served as a paid speaker for 3M and IBM. The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Footnotes

1. https://www.ncbi.nlm.nih.gov/sra

2. https://www.ebi.ac.uk/ena/

3. https://www.ddbj.nig.ac.jp/index-e.html

4. https://www.ncbi.nlm.nih.gov/pathogens/

5. https://www.qiagenbioinformatics.com

6. https://researcher.watson.ibm.com/researcher/view_group.php?id=9635

References

1
Achtman, M., Wain, J., Weill, F. X., Nair, S., Zhou, Z., Sangal, V. (2012). Multilocus sequence typing as a replacement for serotyping in Salmonella enterica. PLoS Pathog. 8:e1002776. doi: 10.1371/journal.ppat.1002776
2
Akiba, M., Kusumoto, M., Iwata, T. (2011). Rapid identification of Salmonella enterica serovars, Typhimurium, Choleraesuis, Infantis, Hadar, Enteritidis, Dublin and Gallinarum, by multiplex PCR. J. Microbiol. Methods 85, 9–15. doi: 10.1016/j.mimet.2011.02.002
3
Alcaine, S. D., Soyer, Y., Warnick, L. D., Su, W. L., Sukhnanand, S., Richards, J., et al. (2006). Multilocus sequence typing supports the hypothesis that cow- and human-associated Salmonella isolates represent distinct and overlapping populations. Appl. Environ. Microbiol. 72, 7575–7585. doi: 10.1128/aem.01174-06
4
Alikhan, N. F., Zhou, Z., Sergeant, M. J., Achtman, M. (2018). A genomic overview of the population structure of Salmonella. PLoS Genet. 14:e1007261. doi: 10.1371/journal.pgen.1007261
5
Allard, M. W., Bell, R., Ferreira, C. M., Gonzalez-Escalona, N., Hoffmann, M., Muruvanda, T., et al. (2018). Genomics of foodborne pathogens for microbial food safety. Curr. Opin. Biotechnol. 49, 224–229. doi: 10.1016/j.copbio.2017.11.002
6
Allard, M. W., Luo, Y., Strain, E., Li, C., Keys, C. E., Son, I., et al. (2012). High resolution clustering of Salmonella enterica serovar Montevideo strains using a next-generation sequencing approach. BMC Genomics 13:32. doi: 10.1186/1471-2164-13-32
7
Allard, M. W., Strain, E., Melka, D., Bunning, K., Musser, S. M., Brown, E. W., et al. (2016). Practical value of food pathogen traceability through building a whole-genome sequencing network and database. J. Clin. Microbiol. 54, 1975–1983. doi: 10.1128/JCM.00081-16
8
Almeida, F., Medeiros, M. I., Rodrigues, P., Dos, D., Falcao, J. P. (2015). Genotypic diversity, pathogenic potential and the resistance profile of Salmonella typhimurium strains isolated from humans and food from 1983 to 2013 in Brazil. J. Med. Microbiol. 64, 1395–1407. doi: 10.1099/jmm.0.000158
9
Almeida, F., Seribelli, A. A., da Silva, P., Medeiros, M. I. C., Dos Prazeres Rodrigues, D., Moreira, C. G., et al. (2017). Multilocus sequence typing of Salmonella typhimurium reveals the presence of the highly invasive ST313 in Brazil. Infect. Genet. Evol. 51, 41–44. doi: 10.1016/j.meegid.2017.03.009
10
Altschul, S. F., Gish, W., Miller, W., Myers, E. W., Lipman, D. J. (1990). Basic local alignment search tool. J. Mol. Biol. 215, 403–410. doi: 10.1006/jmbi.1990.9999
11
Amirkhanian, V., Lui, M., Guttman, A., Szantai, E. (2006). Cost Benefit Analysis of a Multicapillary Electrophoresis System American Laboratory. Available at: https://www.americanlaboratory.com/913-Technical-Articles/35725-Cost-Benefit-Analysis-of-a-Multicapillary-Electrophoresis-System/ (accessed June 1, 2006).
12
Ashton, P. M., Nair, S., Peters, T. M., Bale, J. A., Powell, D. G., Painset, A. (2016). Identification of Salmonella for public health surveillance using whole genome sequencing. PeerJ 4:e1752. doi: 10.7717/peerj.1752
13
Auch, A. F., Klenk, H., Göker, M. (2010). Standard operating procedure for calculating genome-to-genome distances based on high-scoring segment pairs. Stand. Genomic Sci. 2, 142–148. doi: 10.4056/sigs.541628
14
Barco, L., Barrucci, F., Olsen, J. E., Ricci, A. (2013). Salmonella source attribution based on microbial subtyping. Int. J. Food Microbiol. 163, 193–203. doi: 10.1016/j.ijfoodmicro.2013.03.005
15
Barg, N. L., Goering, R. V. (1993). Molecular epidemiology of nosocomial infection: analysis of chromosomal restriction fragment patterns by pulsed-field gel electrophoresis. Infect. Control Hosp. Epidemiol. 14, 595–600. doi: 10.1086/646645
16
Barrangou, R., Dudley, E. G. (2016). CRISPR-based typing and next-generation tracking technologies. Annu. Rev. Food Sci. Technol. 7, 395–411. doi: 10.1146/annurev-food-022814-015729
17
Barrangou, R., Horvath, P. (2012). CRISPR: new horizons in phage resistance and strain identification. Annu. Rev. Food Sci. Technol. 3, 143–162. doi: 10.1146/annurev-food-022811-101134
18
Bauer, N., Evans, P., Leopold, B., Levine, J., White, P. (2013). USDA-FSIS Subtyping Work Group: Current and Future Development and Use of Molecular Subtyping by USDA-FSI. Available at: https://www.fsis.usda.gov/wps/wcm/connect/6c7f71fd-2c0c-4ff0-b2bc-4977c7947516/Molecular-Subtyping-White-Paper.pdf?MOD=AJPERES (accessed October 27, 2018).
19
Ben-Darif, E., De Pinna, E., Threlfall, E. J., Bolton, F. J., Upton, M., Fox, A. J., et al. (2010). Comparison of a semi-automated rep-PCR system and multilocus sequence typing for differentiation of Salmonella enterica isolates. J. Microbiol. Methods 81, 11–16. doi: 10.1016/j.mimet.2010.01.013
20
Beranek, A., Mikula, C., Rabold, P., Arnhold, D., Berghold, C., Lederer, I., et al. (2009). Multiple-locus variable-number tandem repeat analysis for subtyping of Salmonella enterica subsp. enterica serovar Enteritidis. Int. J. Med. Microbiol. 299, 43–51. doi: 10.1016/j.ijmm.2008.06.002
21
Bopp, D. J., Baker, D. J., Thompson, L., Saylors, A., Root, T. P., Armstrong, L., et al. (2016). Implementation of Salmonella serotype determination using pulsed-field gel electrophoresis in a state public health laboratory. Diagn. Microbiol. Infect. Dis. 85, 416–418. doi: 10.1016/j.diagmicrobio.2016.04.023
22
Boxrud, D. (2010). Advances in subtyping methods of foodborne disease pathogens. Curr. Opin. Biotechnol. 21, 137–141. doi: 10.1016/j.copbio.2010.02.011
23
Boxrud, D., Pederson-Gulrud, K., Wotton, J., Medus, C., Lyszkowicz, E., Besser, J., et al. (2007). Comparison of multiple-locus variable-number tandem repeat analysis, pulsed-field gel electrophoresis, and phage typing for subtype analysis of Salmonella enterica serotype Enteritidis. J. Clin. Microbiol. 45, 536–543. doi: 10.1128/jcm.01595-06
24
Brenner, F. W., Villar, R. G., Angulo, F. J., Tauxe, R., Swaminathan, B. (2000). Salmonella nomenclature - Guest commentary. J. Clin. Microbiol. 38, 2465–2467.
25
Call, D. R., Orfe, L., Davis, M. A., Lafrentz, S., Kang, M. S. (2008). Impact of compounding error on strategies for subtyping pathogenic bacteria. Foodborne Pathog. Dis. 5, 505–516. doi: 10.1089/fpd.2008.0097
26
CDC (2009). US Centers for Disease Control and Prevention: Salmonella Surveillance: Annual Summary. Available at: https://www.cdc.gov/ncezid/dfwed/pdfs/salmonellaannualsummarytables2009.pdf (accessed October 27, 2018).
27
CDC (2015). Serotypes and the Importance of Serotyping Salmonella. Available at: https://www.cdc.gov/salmonella/reportspubs/salmonella-atlas/serotyping-importance.html (accessed February 8, 2019).
28
CDC (2016a). Frequently Asked Questions. Available at: https://www.cdc.gov/pulsenet/about/faq.html (accessed February 16, 2016).
29
CDC (2016b). Whole Genome Sequencing (WGS). Available at: https://www.cdc.gov/pulsenet/pathogens/wgs.html (accessed February 11, 2016).
30
CDC (2017a). PulseNet International - On the Path to Implementing Whole Genome Sequencing for Foodborne Disease Surveillance. Available at: https://www.cdc.gov/pulsenet/participants/international/wgs-vision.html (accessed June 9, 2017).
31
CDC (2017b). Standard Operating Procedure for PulseNet PFGE of Escherichia coli O157:H7, Escherichia coli non-O157 (STEC), Salmonella serotypes, Shigella sonnei and Shigella flexneri. Available at: https://www.cdc.gov/pulsenet/pdf/ecoli-shigella-salmonella-pfge-protocol-508c.pdf (accessed October 27, 2018).
32
CDC (2017c). US Centers for Disease Control and Prevention (CDC): PulseNet Methods - Multiple Locus Variable-Number Tandem Repeat Analysis (MLVA). Available at: https://www.cdc.gov/pulsenet/pathogens/mlva.html (accessed October 27, 2018).
33
CDC (2018a). Pulsed-Field Gel Electrophoresis (PFGE). Available at: https://www.cdc.gov/pulsenet/pathogens/pfge.html (accessed December 30, 2018).
34
CDC (2018b). US Centers for Disease Control and Prevention (CDC): National Enteric Disease Surveillance - Salmonella Annual Report, 2016. Available at: https://www.cdc.gov/nationalsurveillance/salmonella-surveillance.html (accessed December 30, 2018).
35
CDC (2019). Outbreak of Salmonella Infections Linked to Pet Hedgehogs. Available at: https://www.cdc.gov/salmonella/typhimurium-01-19/index.html (accessed May 30, 2019).
36
Chen, Y., Luo, Y., Carleton, H., Timme, R., Melka, D., Muruvanda, T. (2017). Whole genome and core genome multilocus sequence typing and single nucleotide polymorphism analyses of Listeria monocytogenes isolates associated with an outbreak linked to cheese, United States, 2013. Appl. Environ. Microbiol. 83:e00633-17. doi: 10.1128/AEM.00633-17
37
Chenu, J. W., Cox, J. M., Pavic, A. (2012). Classification of Salmonella enterica serotypes from Australian poultry using repetitive sequence-based PCR. J. Appl. Microbiol. 112, 185–196. doi: 10.1111/j.1365-2672.2011.05172.x
38
Clooney, A. G., Fouhy, F., Sleator, R. D., O’Driscoll, A., Stanton, C., Cotter, P. D., et al. (2016). Comparing apples and oranges?: next generation sequencing and its impact on microbiome analysis. PLoS One 11:e0148028. doi: 10.1371/journal.pone.0148028
39
Collins, C., Xavier, D. (2017). “Reconstructing the ancestral relationships between bacterial pathogen genomes,” in Bacterial Pathogenesis: Methods and Protocols, eds P. Nordenfelt and M. Collin (New York, NY: Springer), 109–137. doi: 10.1007/978-1-4939-6673-8_8
40
Dallman, T., Inns, T., Jombart, T., Ashton, P., Loman, N., Chatt, C. (2016). Phylogenetic structure of European Salmonella Enteritidis outbreak correlates with national and international egg distribution network. Microb. Genom. 2:e000070. doi: 10.1099/mgen.0.000070
41
Davis, S., Pettengill, J. B., Luo, Y., Payne, J., Shpuntoff, A., Rand, H., et al. (2015). CFSAN SNP Pipeline: an automated method for constructing SNP matrices from next-generation sequence data. PeerJ Comput. Sci. 1:e20. doi: 10.7717/peerj-cs.20
42
De Cesare, A., Krishnamani, K., Parisi, A., Ricci, A., Luzzi, I., Barco, L., et al. (2015). Comparison between Salmonella enterica serotype Enteritidis genotyping methods and phage type. J. Clin. Microbiol. 53, 3021–3031. doi: 10.1128/JCM.01122-15
43
den Bakker, H. C., Allard, M. W., Bopp, D., Brown, E. W., Fontana, J., Iqbal, Z., et al. (2014). Rapid whole-genome sequencing for surveillance of Salmonella enterica serovar Enteritidis. Emerg. Infect. Dis. 20, 1306–1314. doi: 10.3201/eid2008.131399
44
den Bakker, H. C., Cummings, C. A., Ferreira, V., Vatta, P., Orsi, R. H., Degoricija, L., et al. (2010). Comparative genomics of the bacterial genus Listeria: genome evolution is characterized by limited gene acquisition and limited gene loss. BMC Genomics 11:688. doi: 10.1186/1471-2164-11-688
45
den Bakker, H. C., Switt, A. I., Cummings, C. A., Hoelzer, K., Degoricija, L., Rodriguez-Rivera, L. D., et al. (2011). A whole-genome single nucleotide polymorphism-based approach to trace and identify outbreaks linked to a common Salmonella enterica subsp. enterica serovar Montevideo pulsed-field gel electrophoresis type. Appl. Environ. Microbiol. 77, 8648–8655. doi: 10.1128/AEM.06538-11
46
Deng, X., Shariat, N., Driebe, E. M., Roe, C. C., Tolar, B., Trees, E., et al. (2015). Comparative analysis of subtyping methods against a whole-genome-sequencing standard for Salmonella enterica serotype enteritidis. J. Clin. Microbiol. 53, 212–218. doi: 10.1128/JCM.02332-14
47
Dera-Tomaszewska, B. (2012). Salmonella serovars isolated for the first time in Poland, 1995-2007. Int. J. Occup. Med. Environ. Health 25, 294–303. doi: 10.2478/S13382-012-0038-2
48
Deurenberg, R. H., Bathoorn, E., Chlebowicz, M. A., Couto, N., Ferdous, M., García-Cobos, S. (2017). Application of next generation sequencing in clinical microbiology and infection prevention. J. Biotechnol. 243, 16–24. doi: 10.1016/j.jbiotec.2016.12.022
49
Dijkshoorn, L., Towner, K. J., Struelens, M. (2001). New Approaches for the Generation and Analysis of Microbial Typing Data. Amsterdam: Elsevier.
50
DiMarzioM.ShariatN.KariyawasamS.BarrangouR.DudleyE. G. (2013). Antibiotic resistance in Salmonella enterica serovar typhimurium associates with CRISPR sequence type.Antimicrob. Agents Chemother.574282–4289. 10.1128/AAC.00913-13
51
ECDC (2011). European Centre for Disease Prevention and Control (ECDC) Technical Document: Laboratory Standard Operating Procedure for MLVA of Salmonella enterica Serotype Typhimurium. Available at: https://ecdc.europa. eu/sites/portal/files/media/en/publications/Publications/1109_SOP_Salmonella _Typhimurium_MLVA.pdf(accessed October 27, 2018).
- Google Scholar
52
ECDC (2015). European Centre for Disease Prevention and Control (ECDC): Expert Opinion on the Introduction of Next-Generation Typing Methods for Food-and Waterborne Diseases in the EU and EEA.Stockholm: ECDC.
- Google Scholar
53
ECDC (2016a). European Centre for Disease Prevention and Control (ECDC): Salmonellosis - Annual Epidemiological Report 2016 [2014 Data]. Available at: https://ecdc.europa.eu/en/publications-data/salmonellosis-annual-epidemiological-report-2016-2014-data(accessed January 31, 2016).
- Google Scholar
54
ECDC (2016b). European Centre for Disease Prevention and Control (ECDC) Technical Document: Laboratory Standard Operating Procedure for Multiple-Locus Variable-Number Tandem Repeat Analysis of Salmonella enterica Serotype Enteritidis. Available at: https://ecdc.europa.eu/sites/portal/files/media/en/publications/Publications/Salmonella-Enteritidis-Laboratory-standard-operating-procedure.pdf(accessed October 27, 2018).
- Google Scholar
55
ElemfarejiO. I.ThongK. L. (2013). Comparative virulotyping of Salmonella typhi and Salmonella enteritidis.Indian J. Microbiol.53410–417. 10.1007/s12088-013-0407-y
56
FabreL.ZhangJ.GuigonG.Le HelloS.GuibertV.Accou-DemartinM. (2012). CRISPR typing and subtyping for improved laboratory surveillance of Salmonella infections.PLoS One7:e36995. 10.1371/journal.pone.0036995
57
FakhrM. K.NolanL. K.LogueC. M. (2005). Multilocus sequence typing lacks the discriminatory ability of pulsed-field gel electrophoresis for typing Salmonella enterica serovar typhimurium.J. Clin. Microbiol.432215–2219. 10.1128/JCM.43.5.2215-2219.2005
58
FAO (2016). Food and Agriculture Organization of the United Nations: Applications of Whole Genome Sequencing in Food Safety Management.Rome: FAO.
- Google Scholar
59
FDA (2018). GenomeTrakr Network. Available at: https://www.fda.gov/Food/FoodScienceResearch/WholeGenomeSequencingProgramWGS/ucm363134.htm(accessed March 21, 2019).
- Google Scholar
60
FerrariR. G.PanzenhagenP. H. N.ConteC. A.Jr. (2017). Phenotypic and genotypic eligible methods for Salmonella typhimurium source tracking.Front. Microbiol.8:2587. 10.3389/fmicb.2017.02587
61
FitzgeraldC.GheeslingL.CollinsM.FieldsP. I. (2006). Sequence analysis of the rfb loci, encoding proteins involved in the biosynthesis of the Salmonella enterica O17 and O18 antigens: serogroup-specific identification by PCR.Appl. Environ. Microbiol.727949–7953. 10.1128/aem.01046-06
62
FoleyS. L.LynneA. M.NayakR. (2009). Molecular typing methodologies for microbial source tracking and epidemiological investigations of Gram-negative bacterial foodborne pathogens.Infect. Genet. Evol.9430–440. 10.1016/j.meegid.2009.03.004
63
FoleyS. L.WhiteD. G.McDermottP. F.WalkerR. D.RhodesB.Fedorka-CrayP. J.et al (2006). Comparison of subtyping methods for differentiating Salmonella enterica serovar typhimurium isolates obtained from food animal sources.J. Clin. Microbiol.443569–3577. 10.1128/jcm.00745-06
64
FranciscoA. P.BugalhoM.RamirezM.CarricoJ. A. (2009). Global optimal eBURST analysis of multilocus typing data using a graphic matroid approach.BMC Bioinformatics10:152. 10.1186/1471-2105-10-152
65
FrickeW. F.MammelM. K.McDermottP. F.TarteraC.WhiteD. G.LeclercJ. E.et al (2011). Comparative genomics of 28 Salmonella enterica isolates: evidence for CRISPR-mediated adaptive sublineage evolution.J. Bacteriol.1933556–3568. 10.1128/JB.00297-11
66
GalanisE.Lo Fo WongD. M.PatrickM. E.BinszteinN.CieslikA.ChalermchikitT. (2006). Web-based surveillance and global Salmonella distribution, 2000-2002.Emerg. Infect. Dis.12381–388. 10.3201/eid1205.050854
67
GaulS. B.WedelS.ErdmanM. M.HarrisD. L.HarrisI. T.FerrisK. E.et al (2007). Use of pulsed-field gel electrophoresis of conserved XbaI fragments for identification of swine Salmonella serotypes.J. Clin. Microbiol.45472–476. 10.1128/jcm.00962-06
68
GilsonE.BachellierS.PerrinS.PerrinD.GrimontP. A.GrimontF.et al (1990). Palindromic unit highly repetitive DNA sequences exhibit species specificity within Enterobacteriacea.Res. Microbiol.1411103–1116. 10.1016/0923-2508(90)90084-4
69
GMA (2009). The Association of Food, Beverage and Consumer Products Companies (GMA): Control of Salmonella in Low-Moisture Foods. Available at: https://www.gmaonline.org/downloads/technical-guidance-and-tools/SalmonellaControlGuidance.pdf(accessed March 16, 2009).
- Google Scholar
70
GreigJ. D.RavelA. (2009). Analysis of foodborne outbreak data reported internationally for source attribution.Int. J. Food Microbiol.13077–87. 10.1016/j.ijfoodmicro.2008.12.031
71
GrimontP.WeillF. (2007). Antigenic Formulae of the Salmonella Serovars, 9th Edn. Paris: WHO Collaborating Centre for Reference and Research on Salmonella.
- Google Scholar
72
GuerraB.LaconchaI.SotoS. M.Gonzalez-HeviaM. A.MendozaM. C. (2000). Molecular characterisation of emergent multiresistant Salmonella enterica serotype [4,5,12:i:-] organisms causing human salmonellosis.FEMS Microbiol. Lett.190341–347. 10.1111/j.1574-6968.2000.tb09309.x
73
GuibourdencheM.RoggentinP.MikoleitM.FieldsP. I.BockemuhlJ.GrimontP. A.et al (2010). Supplement 2003-2007 (No. 47) to the White-Kauffmann-Le Minor scheme.Res. Microbiol.16126–29. 10.1016/j.resmic.2009.10.002
74
GuigonG.ChevalJ.CahuzacR.BrisseS. (2008). MLVA-NET–a standardised web database for bacterial genotyping and surveillance.Euro Surveill.13:18863. 10.2807/ese.13.19.18863-en
- CrossRef
- Google Scholar
75
HadjinicolaouA. V.DemetriouV. L.EmmanuelM. A.KakoyiannisC. K.KostrikisL. G. (2009). Molecular beacon-based real-time PCR detection of primary isolates of Salmonella typhimurium and Salmonella enteritidis in environmental and clinical samples.BMC Microbiol.9:97. 10.1186/1471-2180-9-97
76
HanningI. B.NuttJ. D.RickeS. C. (2009). Salmonellosis outbreaks in the United States due to fresh produce: sources and potential intervention measures.Foodborne Pathog. Dis.6635–648. 10.1089/fpd.2008.0232
77
HarbottleH.WhiteD. G.McDermottP. F.WalkerR. D.ZhaoS. (2006). Comparison of multilocus sequence typing, pulsed-field gel electrophoresis, and antimicrobial susceptibility typing for characterization of Salmonella enterica serotype Newport isolates.J. Clin. Microbiol.442449–2457. 10.1128/jcm.00019-06
78
HartmannF. A.WestS. E. (1997). Utilization of both phenotypic and molecular analyses to investigate an outbreak of multidrug-resistant Salmonella anatum in horses.Can. J. Vet. Res.61173–181.
- Pubmed Abstract
- Google Scholar
79
HauserE.TietzeE.HelmuthR.JunkerE.PragerR.SchroeterA.et al (2012). Clonal dissemination of Salmonella enterica serovar infantis in Germany.Foodborne Pathog. Dis.9352–360. 10.1089/fpd.2011.1038
80
HauserE.TietzeE.HelmuthR.MalornyB. (2011). Different mutations in the oafA gene lead to loss of O5-antigen expression in Salmonella enterica serovar typhimurium.J. Appl. Microbiol.110248–253. 10.1111/j.1365-2672.2010.04877.x
81
HealyM.HuongJ.BittnerT.LisingM.FryeS.RazaS.et al (2005). Microbial DNA typing by automated repetitive-sequence-based PCR.J. Clin. Microbiol.43199–207. 10.1128/JCM.43.1.199-207.2005
82
HeisigP.KratzB.HalleE.GraserY.AltweggM.RabschW.et al (1995). Identification of DNA gyrase A mutations in ciprofloxacin-resistant isolates of Salmonella typhimurium from men and cattle in Germany.Microb. Drug Resist.1211–218. 10.1089/mdr.1995.1.211
83
HoelzerK.SoyerY.Rodriguez-RiveraL. D.CummingsK. J.McDonoughP. L.Schoonmaker-BoppD. J. (2010). The prevalence of multidrug resistance is higher among bovine than human Salmonella enterica serotype Newport, Typhimurium, and 4,5,12:i:- isolates in the United States but differs by serotype and geographic region.Appl. Environ. Microbiol.765947–5959. 10.1128/AEM.00377-10
84
HoffmannM.LuoY.MondayS. R.Gonzalez-EscalonaN.OttesenA. R.MuruvandaT. (2016). Tracing origins of the Salmonella bareilly strain causing a food-borne outbreak in the united states.J. Infect. Dis.213502–508. 10.1093/infdis/jiv297
85
HopkinsK. L.MaguireC.BestE.LiebanaE.ThrelfallE. J. (2007). Stability of multiple-locus variable-number tandem repeats in Salmonella enterica serovar typhimurium.J. Clin. Microbiol.453058–3061. 10.1128/jcm.00715-07
86
HopkinsK. L.PetersT. M.de PinnaE.WainJ. (2011). Standardisation of multilocus variable-number tandem-repeat analysis (MLVA) for subtyping of Salmonella enterica serovar Enteritidis.Euro Surveill.16:19942. 10.2807/ese.16.32.19942-en
87
HultonC. S. J.HigginsC. F.SharpP. M. (1991). ERIC sequences: a novel family of repetitive elements in the genomes of Escherichia coli, Salmonella typhimurium and other enterobacteria.Mol. Microbiol.5825–834. 10.1111/j.1365-2958.1991.tb00755.x
88
HunterS. B.VauterinP.Lambert-FairM. A.Van DuyneM. S.KubotaK.GravesL.et al (2005). Establishment of a universal size standard strain for use with the pulsenet standardized pulsed-field gel electrophoresis protocols: converting the national databases to the new size standard.J. Clin. Microbiol.431045. 10.1128/JCM.43.3.1045-1050.2005
89
Hyytia-TreesE. K.CooperK.RibotE. M.Gerner-SmidtP. (2007). Recent developments and future prospects in subtyping of foodborne bacterial pathogens.Future Microbiol.2175–185. 10.2217/17460913.2.2.175
90
Illumina (2019). Illumina - Introduction to SBS Technology. Available at: https://www.illumina.com/science/technology/next-generation-sequencing/sequencing-technology.html(accessed October 27, 2018).
- Google Scholar
91
InnsT.AshtonP.Herrera-LeonS.LighthillJ.FoulkesS.JombartT. (2016). Prospective use of whole genome sequencing (WGS) detected a multi-country outbreak of Salmonella enteritidis.Epidemiol. Infect.145289–298. 10.1017/S0950268816001941
92
InnsT.LaneC.PetersT.DallmanT.ChattC.McFarlandN. (2015). A multi-country Salmonella enteritidis phage type 14b outbreak associated with eggs from a German producer: ‘near real-time’ application of whole genome sequencing and food chain investigations, United Kingdom, May to September 2014.Euro Surveill.20:21098. 10.2807/1560-7917.ES2015.20.16.21098
93
InouyeM.DashnowH.RavenL. A.SchultzM. B.PopeB. J.TomitaT.et al (2014). SRST2: rapid genomic surveillance for public health and hospital microbiology labs.Genome Med.6:90. 10.1186/s13073-014-0090-6
94
JacksonB. R.TarrC.StrainE.JacksonK. A.ConradA.CarletonH. (2016). Implementation of nationwide real-time whole-genome sequencing to enhance listeriosis outbreak detection and investigation.Clin. Infect. Dis.63380–386. 10.1093/cid/ciw242
95
JagadeesanB.Gerner-SmidtP.AllardM. W.LeuilletS.WinklerA.XiaoY. (2019). The use of next generation sequencing for improving food safety: translation into practice.Food Microbiol.7996–115. 10.1016/j.fm.2018.11.005
96
KatzL. S.GriswoldT.Williams-NewkirkA. J.WagnerD.PetkauA.SieffertC.et al (2017). A comparative analysis of the Lyve-SET phylogenomics pipeline for genomic epidemiology of foodborne pathogens.Front. Microbiol.8:375. 10.3389/fmicb.2017.00375
97
KerouantonA.MaraultM.LaillerR.WeillF. X.FeurerC.EspieE.et al (2007). Pulsed-field gel electrophoresis subtyping database for foodborne Salmonella enterica serotype discrimination.Foodborne Pathog. Dis.4293–303. 10.1089/fpd.2007.0090
98
KidgellC.ReichardU.WainJ.LinzB.TorpdahlM.DouganG.et al (2002). Salmonella typhi, the causative agent of typhoid fever, is approximately 50,000 years old.Infect. Genet. Evol.239–45. 10.1016/s1567-1348(02)00089-8
99
KimS.FryeJ. G.HuJ.Fedorka-CrayP. J.GautomR.BoyleD. S.et al (2006). Multiplex PCR-based method for identification of common clinical serotypes of Salmonella enterica subsp. enterica.J. Clin. Microbiol.443608–3615. 10.1128/jcm.00701-06
100
KjeldsenM. K.TorpdahlM.PedersenK.NielsenE. M. (2016). Development and comparison of a generic multiple-locus variable-number tandem repeat analysis with PFGE for typing of Salmonella enterica subsp. enterica.J. Appl. Microbiol.1191707–1717. 10.1111/jam.12965
101
KodamaY.ShumwayM.LeinonenR. (2012). The Sequence Read Archive: explosive growth of sequencing data.Nucleic Acids Res.4054–56. 10.1093/nar/gkr854
102
KozyrevaV. K.CrandallJ.SabolA.PoeA.ZhangP.Concepcion-AcevedoJ.et al (2016). Laboratory investigation of Salmonella enterica serovar Poona outbreak in California: comparison of pulsed-field gel electrophoresis (PFGE) and whole genome sequencing (WGS) results.PLoS Curr.8:ecurrents.outbreaks.1bb3e36e74bd5779bc43ac3a8dae52e6. 10.1371/currents.outbreaks.1bb3e36e74bd5779bc43ac3a8dae52e6
103
KruyS. L.van CuyckH.KoeckJ. L. (2011). Multilocus variable number tandem repeat analysis for Salmonella enterica subspecies.Eur. J. Clin. Microbiol. Infect. Dis.30465–473. 10.1007/s10096-010-1110-0
104
Kvistholm JensenA.NielsenE. M.BjorkmanJ. T.JensenT.MullerL.PerssonS.et al (2016). Whole-genome sequencing used to investigate a nationwide outbreak of listeriosis caused by ready-to-eat delicatessen meat, Denmark, 2014.Clin. Infect. Dis.6364–70. 10.1093/cid/ciw192
105
LarssonJ. T.TorpdahlM.Mlva working groupMollerN. E. (2013). Proof-of-concept study for successful inter-laboratory comparison of MLVA results.Euro Surveill.18:20566. 10.2807/1560-7917.ES2013.18.35.20566
106
LarssonJ. T.TorpdahlM.PetersenR. F.SorensenG.LindstedtB. A.NielsenE. M.et al (2009). Development of a new nomenclature for Salmonella typhimurium multilocus variable number of tandem repeats analysis (MLVA).Euro Surveill.14:19174. 10.2807/ese.14.15.19174-en
107
LeekitcharoenphonP.NielsenE. M.KaasR. S.LundO.AarestrupF. M. (2014). Evaluation of whole genome sequencing for outbreak detection of Salmonella enterica.PLoS One9:e87991. 10.1371/journal.pone.0087991
108
LiH. (2013). Aligning Sequence Reads, Clone Sequences and Assembly Contigs with BWA-MEM. Available at: http://arxiv.org/abs/1303.3997(accessed March 16, 2013).
- Google Scholar
109
LiW.RaoultD.FournierP. E. (2009). Bacterial strain typing in the genomic era.FEMS Microbiol. Rev.33892–916. 10.1111/j.1574-6976.2009.00182.x
110
LienemannT.KyyhkynenA.HalkilahtiJ.HaukkaK.SiitonenA. (2015). Characterization of Salmonella typhimurium isolates from domestically acquired infections in Finland by phage typing, antimicrobial susceptibility testing, PFGE and MLVA.BMC Microbiol.15:131. 10.1186/s12866-015-0467-8
111
LindstedtB. A. (2005). Multiple-locus variable number tandem repeats analysis for genetic fingerprinting of pathogenic bacteria.Electrophoresis262567–2582. 10.1002/elps.200500096
112
LindstedtB. A.HeirE.GjernesE.KapperudG. (2003). DNA fingerprinting of Salmonella enterica subsp. enterica serovar typhimurium with emphasis on phage type DT104 based on variable number of tandem repeat loci.J. Clin. Microbiol.411469–1479. 10.1128/JCM.41.4.1469-1479.203
113
LindstedtB. A.TorpdahlM.NielsenE. M.VardundT.AasL.KapperudG.et al (2007). Harmonization of the multiple-locus variable-number tandem repeat analysis method between Denmark and Norway for typing Salmonella typhimurium isolates and closer examination of the VNTR loci.J. Appl. Microbiol.102728–735. 10.1111/j.1365-2672.2006.03134.x
114
LindstedtB. A.TorpdahlM.VergnaudG.Le HelloS.WeillF. X.TietzeE.et al (2013). Use of multilocus variable-number tandem repeat analysis (MLVA) in eight European countries, 2012.Euro Surveill.18:20385. 10.2807/ese.18.04.20385-en
115
LiuF. (2010). Virulence Gene and CRISPR Multilocus Sequence Typing Scheme for Subtyping the Major Serovars of Salmonella enterica Subspecies enterica. Available at: https://etda.libraries.psu.edu/files/final_submissions/1960(accessed October 27, 2018).
- Google Scholar
116
LiuF.BarrangouR.GernersmidtP.RibotE. M.KnabelS. J.DudleyE. G.et al (2011). Novel virulence gene and clustered regularly interspaced short palindromic repeat (CRISPR) multilocus sequence typing scheme for subtyping of the major serovars of Salmonella enterica subsp. enterica.Appl. Environ. Microbiol.771946–1956. 10.1128/AEM.02625-10
117
LomanN. J.PallenM. J. (2015). Twenty years of bacterial genome sequencing.Nat. Rev. Microbiol.13787–794. 10.1038/nrmicro3565
118
MaciorowskiK. G.JonesF. T.PillaiandS. D.RickeS. C. (2004). Incidence, sources, and control of food-borne Salmonella spp. in poultry feeds.Worlds Poult. Sci. J.60446–457. 10.1079/WPS200428
- CrossRef
- Google Scholar
119
MaidenM. C.Jansen van RensburgM. J.BrayJ. E.EarleS. G.FordS. A.JolleyK. A.et al (2013). MLST revisited: the gene-by-gene approach to bacterial genomics.Nat. Rev. Microbiol.11728–736. 10.1038/nrmicro3093
120
MartinB.HumbertO.CamaraM.GuenziE.WalkerJ.MitchellT.et al (1992). A highly conserved repeated DNA element located in the chromosome of Streptococcus pneumoniae.Nucleic Acids Res.203479–3483. 10.1093/nar/20.13.3479
121
McQuistonJ. R.ParrenasR.Ortiz-RiveraM.GheeslingL.BrennerF.FieldsP. I.et al (2004). Sequencing and comparative analysis of flagellin genes fliC, fljB, and flpA from Salmonella.J. Clin. Microbiol.421923–1932. 10.1128/JCM.42.5.1923-1932.2004
122
McQuistonJ. R.WatersR. J.DinsmoreB. A.MikoleitM. L.FieldsP. I. (2011). Molecular determination of H antigens of Salmonella by use of a microsphere-based liquid array.J. Clin. Microbiol.49565–573. 10.1128/JCM.01323-10
123
MohammedM.DelappeN.O’ConnorJ.McKeownP.GarveyP.CormicanM.et al (2016). Whole genome sequencing provides an unambiguous link between Salmonella Dublin outbreak strain and a historical isolate.Epidemiol. Infect.144576–581. 10.1017/S0950268815001636
124
MouraA.CriscuoloA.PouseeleH.MauryM. M.LeclercqA.TarrC.et al (2016). Whole genome-based population biology and epidemiological surveillance of Listeria monocytogenes.Nat. Microbiol.2:16185. 10.1038/nmicrobiol.2016.185
125
Mughini-GrasL.FranzE.van PeltW. (2018). New paradigms for Salmonella source attribution based on microbial subtyping.Food Microbiol.7160–67. 10.1016/j.fm.2017.03.002
126
MuteebG.SenR. (2010). “Random mutagenesis using a mutator strain,” in In Vitro Mutagenesis Protocols, 3rd Edn, ed.BramanJ. (Totowa, NJ: Humana Press), 411–419. 10.1007/978-1-60761-652-8_29
127
NadonC.Van WalleI.Gerner-SmidtP.CamposJ.ChinenI.Concepcion-AcevedoJ.et al (2017). PulseNet International: vision for the implementation of whole genome sequencing (WGS) for global food-borne disease surveillance.Euro Surveill.22:30544. 10.2807/1560-7917.ES.2017.22.23.30544
128
NdeC. W.SherwoodJ. S.DoetkottC.LogueC. M. (2006). Prevalence and molecular profiles of Salmonella collected at a commercial turkey processing plant.J. Food Prot.691794–1801. 10.4315/0362-028X-69.8.1794
129
NgoiS. T.TehC. S.ChaiL. C.ThongK. L. (2015). Overview of molecular typing tools for the characterization of Salmonella enterica in Malaysia.Biomed. Environ. Sci.28751–764. 10.3967/bes2015.105
130
NsoforC. (2016). Pulsed-field gel electrophoresis (PFGE): principles and applications in molecular epidemiology: a review.Int. J. Curr. Res. Med. Sci.238–51. 10.1111/j.1863-2378.2009.01259.x
131
OctaviaS.WangQ.TanakaM. M.KaurS.SintchenkoV.LanR.et al (2015). Delineating community outbreaks of Salmonella enterica serovar Typhimurium by use of whole-genome sequencing: insights into genomic variability within an outbreak.J. Clin. Microbiol.531063–1071. 10.1128/JCM.03235-14
132
OlaimatA. N.HolleyR. A. (2012). Factors influencing the microbial safety of fresh produce: a review.Food Microbiol.321–19. 10.1016/j.fm.2012.04.016
133
OloyaJ.DoetkottD.KhaitsaM. L. (2009). Antimicrobial drug resistance and molecular characterization of Salmonella isolated from domestic animals, humans, and meat products.Foodborne Pathog. Dis.6273–284. 10.1089/fpd.2008.0134
134
OlsonA. B.AndrysiakA. K.TraczD. M.Guard-BouldinJ.DemczukW.NgL. K.et al (2007). Limited genetic diversity in Salmonella enterica serovar Enteritidis PT13.BMC Microbiol.7:87. 10.1186/1471-2180-7-87
135
ParkS. Y.WoodwardC. L.KubenaL. F.NisbetD. J.BirkholdS. G.RickeS. C.et al (2008). Environmental dissemination of foodborne Salmonella in preharvest poultry production: reservoirs, critical factors, and research strategies.Crit. Rev. Environ. Sci. Technol.3873–111. 10.1080/10643380701598227
- CrossRef
- Google Scholar
136
PatanéJ. S. L.MartinsJ.SetubalJ. C. (2018). “Phylogenomics,” in Comparative Genomics: Methods and Protocols, edsSetubalJ. C.StoyeJ.StadlerP. F. (New York, NY: Springer).
- Google Scholar
137
PearceM. E.AlikhanN. F.DallmanT. J.ZhouZ.GrantK.MaidenM. C. J.et al (2018). Comparative analysis of core genome MLST and SNP typing within a European Salmonella serovar Enteritidis outbreak.Int. J. Food. Microbiol.2741–11. 10.1016/j.ijfoodmicro.2018.02.023
138
PersingD. H.TenoverF. C.VersalovicJ.TangY. W.UngerE. R.RelmanD. A. (2011). Molecular Microbiology: Diagnostic Principles and Practice.Washington, DC: ASM Press.
- Google Scholar
139
PetersenA.AarestrupF. M.AnguloF. J.WongS.StöhrK.WegenerH. C.et al (2002). WHO global salm-surv external quality assurance system (EQAS): an important step toward improving the quality of Salmonella serotyping and antimicrobial susceptibility testing worldwide.Microb. Drug Resist.8345–353. 10.1089/10766290260469615
140
PightlingA. W.PetronellaN.PagottoF. (2014). Choice of reference sequence and assembler for alignment of Listeria monocytogenes short-read sequence data greatly influences rates of error in SNP analyses.PLoS One9:e104579. 10.1371/journal.pone.0104579
141
PightlingA. W.PettengillJ. B.LuoY.BaugherJ. D.RandH.StrainE.et al (2018). Interpreting whole-genome sequence analyses of foodborne bacteria for regulatory applications and outbreak investigations.Front. Microbiol.9:1482. 10.3389/fmicb.2018.01482
142
PillaiS. D.RickeS. C. (2002). Bioaerosols from municipal and animal wastes: background and contemporary issues.Can. J. Microbiol.48681–696. 10.1139/w02-070
143
PinhoA. J.BastosC. A. C.FerreiraP. J. S. G.GarciaS. P.AfreixoV. (2009). Genome analysis with inter-nucleotide distances.Bioinformatics253064–3070. 10.1093/bioinformatics/btp546
144
PortmannA. C.FournierC.GimonetJ.Ngom-BruC.BarrettoC.BaertL. (2018). A validation approach of an end-to-end whole genome sequencing workflow for source tracking of Listeria monocytogenes and Salmonella enterica.Front. Microbiol.9:446. 10.3389/fmicb.2018.00446
145
PorwollikS.BoydE. F.ChoyC.ChengP.FloreaL.ProctorE.et al (2004). Characterization of Salmonella enterica subspecies I genovars by use of microarrays.J. Bacteriol.1865883–5898. 10.1128/JB.186.17.5883-5898.2004
146
PulseNet (2013). Standard Operating Procedure for PulseNet PFGE of Escherichia coli O157:H7, Escherichia coli non-O157 (STEC), Salmonella Serotypes, Shigella sonnei and Shigella flexneri. Available at: http://www.pulsenetinternational.org/assets/PulseNet/uploads/pfge/PNL05_Ec-Sal-ShigPFGEprotocol.pdf(accessed March 24, 2013).
- Google Scholar
147
PulseNet (2014). PulseNet Europe. Available at: http://www.pulsenetinternational.org/networks/europe/(accessed November 4, 2014).
- Google Scholar
148
PulseNet (2015a). PulseNet Explained. Available at: http://www.pulsenetinternational.org/international/pulsenetexplained/(accessed May 13, 2015).
- Google Scholar
149
PulseNet (2015b). PulseNet Protocols A - Pulse Field Gel Electrophoresis (PFGE). Available at: http://www.pulsenetinternational.org/protocols/pfge/(accessed October 22, 2015).
- Google Scholar
150
PulseNet (2015c). PulseNet Protocols B - Molecular Typing for MLVA. Available at: http://www.pulsenetinternational.org/protocols/mlva/(accessed May 13, 2015).
- Google Scholar
151
RanieriM. L.ShiC.SwittM. A. I.den BakkerH. C.WiedmannM. (2013). Comparison of typing methods with a new procedure based on sequence characterization for Salmonella serovar prediction.J. Clin. Microbiol.511786–1797. 10.1128/JCM.03201-12
152
RantsiouK.KathariouS.WinklerA.SkandamisP.Saint-CyrM. J.Rouzeau-SzynalskiK.et al (2018). Next generation microbiological risk assessment: opportunities of whole genome sequencing (WGS) for foodborne pathogen surveillance, source tracking and risk assessment.Int. J. Food Microbiol.2873–9. 10.1016/j.ijfoodmicro.2017.11.007
153
RickeS. C. (2017). Insights and challenges of Salmonella infection of laying hens.Curr. Opin. Food Sci.1843–49. 10.1016/j.cofs.2017.10.012
- CrossRef
- Google Scholar
154
RickeS. C.KimS. A.ShiZ.ParkS. H. (2018). Molecular-based identification and detection of Salmonella in food production systems: current perspectives.J. Appl. Microbiol.125313–327. 10.1111/jam.13888
155
RobertsonJ.YoshidaC.KruczkiewiczP.NadonC.NichaniA.TaboadaE. N.et al (2018). Comprehensive assessment of the quality of Salmonella whole genome sequence data available in public sequence databases using the Salmonella in silico typing resource (SISTR).Microb. Genom.4:e000151. 10.1099/mgen.0.000151
156
RossI. L.HeuzenroederM. W. (2005). Discrimination within phenotypically closely related definitive types of Salmonella enterica serovar typhimurium by the multiple amplification of phage locus typing technique.J. Clin. Microbiol.431604–1611. 10.1128/JCM.43.4.1604-1611.2005
157
RossI. L.HeuzenroederM. W. (2008). A comparison of three molecular typing methods for the discrimination of Salmonella enterica serovar Infantis.FEMS Immunol. Med. Microbiol.53375–384. 10.1111/j.1574-695X.2008.00435.x
158
SabatA. J.BudimirA.NashevD.Sá-LeãoR.van DijlJ. M.LaurentF.et al (2013). Overview of molecular typing methods for outbreak detection and epidemiological surveillance.Euro Surveill.18:20380. 10.2807/ese.18.04.20380-en
159
Salmonella Subcommittee of the Nomenclature Committee of the International Society for, Microbiology (1934). The genus Salmonella Lignières, 1900.J. Hyg.34333–350. 10.1017/s0022172400034677
160
SangalV.HarbottleH.MazzoniC. J.HelmuthR.GuerraB.DidelotX.et al (2010). Evolution and population structure of Salmonella enterica serovar Newport.J. Bacteriol.1926465–6476. 10.1128/JB.00969-10
161
SchoulsL. M.SpalburgE. C.van LuitM.HuijsdensX. W.PluisterG. N.van Santen-VerheuvelM. J.et al (2009). Multiple-locus variable number tandem repeat analysis of Staphylococcus aureus: comparison with pulsed-field gel electrophoresis and spa-typing.PLoS One4:e5082. 10.1371/journal.pone.0005082
162
SchwartzD. C.CantorC. R. (1984). Separation of yeast chromosome-sized DNAs by pulsed field gradient gel electrophoresis.Cell3767–75. 10.1016/0092-8674(84)90301-5
163
ShariatN.DiMarzioM. J.YinS.DettingerL.SandtC. H.LuteJ. R.et al (2013a). The combination of CRISPR-MVLST and PFGE provides increased discriminatory power for differentiating human clinical isolates of Salmonella enterica subsp. enterica serovar Enteritidis.Food Microbiol.34164–173. 10.1016/j.fm.2012.11.012
164
ShariatN.SandtC. H.DiMarzioM. J.BarrangouR.DudleyE. G. (2013b). CRISPR-MVLST subtyping of Salmonella enterica subsp. enterica serovars Typhimurium and Heidelberg and application in identifying outbreak isolates.BMC Microbiol.13:254. 10.1186/1471-2180-13-254
165
ShariatN.SandtC. H.KirchnerM. K.TreesE.BarrangouR.DudleyE. G. (2013c). Subtyping of Salmonella enterica serovar Newport outbreak isolates by CRISPR-MVLST and determination of the relationship between CRISPR-MVLST and PFGE results.J. Clin. Microbiol.512328–2336. 10.1128/JCM.00608-13
166
ShariatN.DudleyE. G. (2014). CRISPRs: molecular signatures used for pathogen subtyping.Appl. Environ. Microbiol.80430–439. 10.1128/AEM.02790-13
167
ShiC.SinghP.RanieriM. L.WiedmannM.Moreno SwittA. I. (2015). Molecular methods for serovar determination of Salmonella.Crit. Rev. Microbiol.41309–325. 10.3109/1040841X.2013.837862
168
SinghA.GoeringR. V.SimjeeS.FoleyS. L.ZervosM. J. (2006). Application of molecular techniques to the study of hospital infection.Clin. Microbiol. Rev.19512–530.
- Pubmed Abstract
- Google Scholar
169
SoyerY.AlcaineS. D.Schoonmaker-BoppD. J.RootT. P.WarnickL. D.McDonoughP. L.et al (2010). Pulsed-field gel electrophoresis diversity of human and bovine clinical Salmonella isolates.Foodborne Pathog. Dis.7707–717. 10.1089/fpd.2009.0424
170
SoyerY.Moreno SwittA.DavisM. A.MaurerJ.McDonoughP. L.Schoonmaker-BoppD. J.et al (2009). Salmonella enterica serotype 4,5,12:i:-, an emerging Salmonella serotype that represents multiple distinct clones.J. Clin. Microbiol.473546–3556. 10.1128/JCM.00546-09
171
SukhnanandS.AlcaineS.WarnickL. D.SuW. L.HofJ.CraverM. P.et al (2005). DNA sequence-based subtyping and evolutionary analysis of selected Salmonella enterica serotypes.J. Clin. Microbiol.433688–3698. 10.1128/JCM.43.8.3688-3698.2005
172
TaylorA. J.LappiV.WolfgangW. J.LapierreP.PalumboM. J.MedusC. (2015). Characterization of foodborne outbreaks of Salmonella enterica serovar enteritidis with whole-genome sequencing single nucleotide polymorphism-based analysis for surveillance and outbreak detection.J. Clin. Microbiol.533334–3340. 10.1128/JCM.01280-15
173
ThongK. L.AngC. P. (2011). Genotypic and phenotypic differentiation of Salmonella enterica serovar Paratyphi B in Malaysia.Southeast Asian J. Trop. Med. Public Health421178–1189.
- Pubmed Abstract
- Google Scholar
174
ThrelfallE. J.FrostJ. A. (1990). The identification, typing and fingerprinting of Salmonella: laboratory aspects and epidemiological applications.J. Appl. Bacteriol.685–16. 10.1111/j.1365-2672.1990.tb02542.x
175

Summary

Keywords

Salmonella, subtyping, serotyping, WGS, PFGE, MLST, food industry

Citation

Tang S, Orsi RH, Luo H, Ge C, Zhang G, Baker RC, Stevenson A and Wiedmann M (2019) Assessment and Comparison of Molecular Subtyping and Characterization Methods for Salmonella. Front. Microbiol. 10:1591. doi: 10.3389/fmicb.2019.01591

Received

20 March 2019

Accepted

26 June 2019

Published

12 July 2019

Volume

10 - 2019

Edited by

Learn-Han Lee, Monash University Malaysia, Malaysia

Reviewed by

Min Yue, Zhejiang University, China; Dapeng Wang, Shanghai Jiao Tong University, China; Soohyoun Ahn, University of Florida, United States

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Silin Tang, silin.tang@effem.com

This article was submitted to Food Microbiology, a section of the journal Frontiers in Microbiology.

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
