Analysis of the Listeria monocytogenes Population Structure among Isolates from 1931 to 2015 in Australia

Listeriosis remains among the most important bacterial illnesses, with a high associated mortality rate. Efforts to control listeriosis require detailed knowledge of the epidemiology of the disease itself, and its etiological bacterium, Listeria monocytogenes. In this study we provide an in-depth analysis of the epidemiology of 224 L. monocytogenes isolates from Australian clinical and non-clinical sources. Non-human sources included meat, dairy, seafood, fruit, and vegetables, along with animal and environmental isolates. Serotyping, Multi-Locus Sequence Typing, and analysis of inlA gene sequence were performed. Serogroups IIA, IIB, and IVB comprised 94% of all isolates, with IVB over-represented among clinical isolates. Serogroup IIA was the most common among dairy and meat isolates. Lineage I isolates were most common among clinical isolates, and 52% of clinical isolates belonged to ST1. Overall 39 STs were identified in this study, with ST1 and ST3 containing the largest numbers of L. monocytogenes isolates. These STs comprised 40% of the total isolates (n = 90), and both harbored isolates from clinical and non-clinical sources. ST204 was the third most common ST. The high prevalence of this group among L. monocytogenes populations has not been reported outside Australia. Twenty-seven percent of the STs in this study contained exclusively clinical isolates. Analysis of the virulence protein InlA among isolates in this study identified a truncated form of the protein among isolates from ST121 and ST325. The ST325 group contained a previously unreported novel mutation leading to production of a 93 amino acid protein. This study provides insights in the population structure of L. monocytogenes isolated in Australia, which will contribute to public health knowledge relating to this important human pathogen.


INTRODUCTION
The burden of foodborne disease in Australia has been estimated at an annual cost of AUD$1.2 billion, comprising approximately 5.4 million cases of disease (The OzFoodNet Working Group, 2012). Australia is divided into eight different states or territories, each with its own health department coordinating surveillance for between 10 and 15 foodborne diseases, including listeriosis. As in many other countries, incidence rates of listeriosis are generally far lower than other common foodborne illness, such as campylobacteriosis or salmonellosis, with the rate varying between 0.3 and 0.4 cases per 100,000 since 2008, with most cases/highest rates occurring in age groups over the age of 60 (Rosen, 2002;The OzFoodNet Working Group, 2012). A similar association with high mortality rates is also shared by other countries, with 20-30% of hospitalized cases of invasive listeriosis resulting in fatality (Dalton et al., 2011;Scallan et al., 2011;The OzFoodNet Working Group, 2012). Recent evidence suggests that this burden may be higher, particularly for pregnant women or patients with neurolisteriosis (Charlier et al., 2017).
In addition to the public health burden, the associated costs to the food industry are high -Listeria monocytogenes is the main contaminant linked to food recalls in Australia due to microbial contamination, with 45% of these recalls from 2005 through to 2014 due to this organism (based on FSANZ 1 data); where ready-to-eat (RTE) meat products represented the largest group of recalled products.
In efforts to address these public health and economic costs, countries have employed complex surveillance systems designed to provide knowledge of the epidemiology and population dynamics of L. monocytogenes, and to improve detection and response to associated outbreaks of disease (Kirk et al., 2008;Wang et al., 2009;Fox et al., 2012). OzFoodNet was established in Australia in 2000, with governmentfunded epidemiologists appointed to states and territories to improve surveillance of foodborne disease (Kirk et al., 2008). From 2010 onward, OzFoodNet has enhanced national surveillance of listeriosis, and now collects additional data on invasive L. monocytogenes isolates, incorporating molecular sub-typing data to facilitate rapid and precise identification of disease clusters (The OzFoodNet Working Group, 2012). Although recent studies have yielded detailed insights into the population distribution of L. monocytogenes globally as well as source associations of important subgroups, such as the over-representation of sequence type (ST) 121 in food sources or of clonal complex (CC) 1 in clinical cases, this data is lacking in the context of Australia (Chenal-Francisque et al., 2011;Maury et al., 2016;Moura et al., 2016).
An important component of understanding the epidemiology of foodborne disease is knowledge of the occurrence and molecular ecology of strains isolated from food products, and indeed the food chain as a whole (Fox et al., 2012). Such information can facilitate insights into the distribution of certain strains within different food chains or environments, and enable relevant associations to be made. Temporal analysis of both clinical and non-clinical surveillance data can allow monitoring of the occurrence of individual strain sub-types or epidemic clones over time, and provides improved understanding of potential risk of disease, and where corrective efforts may be directed. Recent studies have provided insights into global, continental and/or national trends in this area, such as the association of ST121 to food sources, the predominance of CC2 and CC1 globally and association of CC1 with outbreaks of disease, or the dominance of the ST328 subgroup in India (Chenal-Francisque et al., 2011;Haase et al., 2013;Yin et al., 2015;Barbuddhe et al., 2016;Maury et al., 2016). In addition to this, genomic analysis can provide insights into characteristics such as strain virulence. The invasion protein InlA, for example, plays a key role in invasive listeriosis by mediating translocation across the intestinal epithelium (Schäferkordt and Chakraborty, 1997). Mutations in the inlA gene have been shown to impact pathogenesis, with premature stop codons (PMSCs) leading to reduced invasion of the infected host (Nightingale et al., 2005;Chen et al., 2011).
Previous studies have reported on the incidence of listeriosis across Australia, and identified risk groups, outbreaks, and risk factors associated with the disease (Dalton et al., 2011;The OzFoodNet Working Group, 2012). In this study, we present molecular typing analyses of L. monocytogenes isolated from clinical, environmental, and food sources. Data has been interrogated to identify the prevalent STs among the L. monocytogenes population in Australia, and dominant strains have been identified, including their associated food chains and links to clinical illness. This study provides knowledge about L. monocytogenes across clinical and non-clinical settings in Australia, and together with previous studies (Dalton et al., 2011;The OzFoodNet Working Group, 2012;Kwong et al., 2016) provides new insights into the epidemiology of listeriosis in Australia and the organism's associated genetic traits.

Multi-Locus Sequence Typing (MLST)
Previously described Multi-Locus Sequence Typing (MLST) data were available for 58 isolates (Nilsson et al., 2012;Haase et al., 2013); the STs of the other 166 isolates were determined using the seven housekeeping gene targets and primers, as previously described (Ragon et al., 2008). In silico analysis was performed on genome assemblies generated using the Illumina MiSeq platform (Illumina, San Diego, CA, USA) or the Ion Torrent platform (Life Technologies, USA). Consensus sequences for the housekeeping genes (abcZ, bglA, cat, dapE, dat, ldh, and lhk) were extracted for each isolate (Ridom SeqSphere+, Ridom GmbH). All phylogenetic analyses were carried out using BioNumerics (v7.5; Applied Maths, Sint-Martens-Latem, Belgium). Minimum spanning trees were generated using minimum spanning tree from categorical data analysis, including partitioning analysis to identify CCs. CCs were assigned to groups with single locus variants.
inlA Sequence and Phylogenetic Analysis The inlA gene sequence was extracted from the de novo assemblies of the 166 sequenced isolates, using Geneious (v9.0.5) software (Kearse et al., 2012). These were classified into groups and aligned in Geneious using the translation alignment algorithm (global alignment with Blosum62 cost matrix), with each group having a unique inlA sequence (Supplementary Figure S1). Phylogenetic analysis (1000 bootstraps) was then performed on the aligned sequences using SplitsTree software (Huson, 1998). Translated protein sequences were derived from the inlA gene sequence of each isolate, and examined for PMSCs.

SNP Typing
The 166 sequenced isolates were variant called by mapping the reads to the appropriate reference strain (IIA, NC_022568.1; IIB, NC_018587.1; IIC, NC_017546; IVB, NC_019556.1) using snippy 2 with the default settings of minimum 10x coverage and 90% minimum proportion of reads differing to reference. Core single nucleotide polymorphism (SNP) alignments were used for tree generation using the PHYML tree builder (Hasegawa-Kishino-Yano substitution model with 1,000 bootstraps) in the Geneious software package; (Guindon and Gascuel, 2003;Kearse et al., 2012).

Statistical Analysis
Statistical analysis was performed to investigate the association of ST with clinical, dairy food, or meat food sources, using χ 2 analysis (where n ≥ 10 for a subgroup). The significance of overrepresentation or under-representation was calculated relative to all isolates or lineage (either I or II), as appropriate.

Serotyping Analysis
Analysis of the serotype distribution identified three main serogroups accounting for 94% of isolates -the IIA grouping of 1/2a and 3a (41%), the IIB grouping of 1/2b and 3b serotypes (30%), and the IVB grouping, which included 4b, 4d, and 4e (23%; Figure 1A). The IIC grouping of 1/2c and 3c isolates comprised 5%, with IVA and IVD representing the remaining 1%. Serogroup IVB was the predominant group among cases of listeriosis, accounting for 56.6% of clinical isolates, and was significantly over-represented among clinical isolates (p < 0.0001; Figure 1B and Supplementary Table S2). In contrast to this, the occurrence of serogroup IVB isolates was significantly lower among meat category isolates (p < 0.001). Serogroup IIA was the largest group among dairy (39%), meat (51%), animal (62.5%) and vegetable (75%) isolates ( Figures 1C-F, respectively); interestingly among clinical isolates, however, IIA was significantly under-represented (Supplementary Table S2). Serogroup IIB was the second most common serogroup identified among isolates from meat, dairy and clinical categories (39.2, 33.9, and 18.9%, respectively), however, there was no significant distribution of this serogroup or serogroup IIC among any of these categories (Supplementary Table S2).

MLST Analysis
The 224 isolates differentiated into 39 different STs. The distribution of STs among isolates is presented with respect to their source (Figure 2). Seventeen lineage I STs, 22 lineage II STs, and a single lineage III ST (ST202) were identified. Lineage I had the highest proportion of isolates (n = 118, 52.7%), with 47.3% (n = 106) belonging to lineage II and a single lineage III isolate. ST3 (n = 55) and ST1 (n = 35) were the largest groupings identified, and comprised 40% of all isolates. Nine CCs were identified, indicated by the gray partitions and labels in Figure 2. Lineage II contained the majority of CCs (n = 5), with the remaining four CCs comprised of lineage I isolates. A total of 10 STs comprised exclusively clinical isolates: nine were lineage I STs, with a single lineage II ST (Figure 2). Of the 39 STs identified, 20 contained clinical isolates (51%, Figure 3). In the case of lineage I, 76% of STs contained clinical isolates -this was lower at 32% for lineage II. In contrast to this, of the 17 STs containing only isolates from non-clinical sources, 14 were lineage II, 2 were lineage I, with the remaining being lineage III (the source of the ST141 and ST145 isolates was unknown). The largest single-source categories were ST20 and ST19, comprising 6 and 4 animal isolates, respectively. Three STs only contained dairy isolates: ST101 (n = 3), ST122 (n = 2), and ST325 (n = 2).
With respect to all isolates in this study, χ 2 analysis (Supplementary Table S2) indicated lineage I isolates were overrepresented among clinical cases (p < 0.001), along with CC1 isolates (p < 0.0001). In contrast to this lineage II isolates were under-represented among clinical isolates (p < 0.001). While no clear association of dairy isolates with lineage, CC or ST was identified, meat isolates were over-represented among ST204 (p < 0.001). In the case of CC3, this subgroup was significantly over-represented among lineage I meat isolates (p < 0.001), contrasting to its under-representation among lineage I clinical isolates. There was no significant association of one lineage over another with respect to meat isolates.
inlA Characterization Analysis of the inlA nucleotide sequence of the 166 sequenced isolates identified 17 variants (InlA groups 1-17, Figure 4 and Supplementary Figure S1). This included 143 variable nucleotide positions, with 40 variable amino acid (AA) residues among the corresponding translated protein sequences. Two of the translated protein sequences, InlA group 4 (shared by all ST3 and ST896 isolates analyzed) and InlA group 15 (shared by all ST5, ST323, and ST324 isolates analyzed), were identical, although differing in three nucleotide positions in the gene sequence. Two of the InlA protein sequences, InlA group 10 (identified in all ST121 isolates analyzed) and InlA group 11 (identified in all ST325 isolates analyzed), contained PMSCs. The PMSC mutation in InlA group 10 isolates results in a protein of 492 AA, whereas the PMSC in InlA group 11 yields a 93 AA InlA protein (the full length InlA is 801 AAs). Clustering analysis of inlA gene sequences identified four closely related lineage I InlA groups which includes STs 3, 5, 59, 87, 323, 324, 382, and 896 (Figure 4). The inlA gene varied at just nine nucleotide positions across these groups, and with the exception of InlA group 14 (ST59) all of these groups contained clinical isolates. Analysis of overall variation by lineage shows greater heterogeneity among lineage II InlA alleles relative to lineage I (115 and 20 variable nucleotide positions among isolates of the same lineage, respectively).

SNP Subtyping
Analysis of genetic diversity among isolates from individual STs showed a high degree of variability (Figures 5-8). With the exception of ST12 (four SNPs among the five isolates, which includes the reference strain EGD), serogroup IIA STs were characterized by a high number of SNPs among isolates from individual STs, with ST121, ST155, ST204, and ST321 containing the most SNPs relative to other STs (636, 421, 420, and 306 SNPs, respectively; Figure 5). In contrast to this, serogroup IIB STs were highly conserved, as evidenced by only 69 SNPs shared among the 43 CC3 isolates (42 isolates from this study and the reference serotype 1/2b isolate, SLCC2755; Figure 6). The small number of serogroup IIC isolates comprised only two STs: ST9 and ST122. Although no epidemiological link was known for the two ST122 isolates, they differed by only a single SNP. Similar to serogroup IIB, the other lineage I serogroup IV was also dominated by a single ST (ST1). Although the overall number of SNPs was higher in CC1 (n = 133) relative to CC3 (n = 69), groups of highly conserved isolates were noted, including epidemiologically linked and unlinked isolates differing by 0-2 SNPs (Figure 8). In contrast to CC1, ST2 isolates were more genetically diverse with 238 SNP loci identified among this group.

DISCUSSION
Building a detailed understanding of pathogen population demographics, along with clinical and food chain association, is central to building a robust response and control system directed at minimizing the impact of foodborne pathogens to public health. The data presented in this study provides detailed insight into the epidemiology of L. monocytogenes in an Australian context by using application of molecular techniques including molecular serotyping, MLST, and analysis of the inlA virulence gene marker.
Serotyping was one of the first subtyping methods established to investigate L. monocytogenes epidemiology. It has been applied extensively as a rapid means to characterize isolates, with applications in both understanding the relevance of certain serotypes to illness in humans and animals, as well as being employed as a tool to assist in outbreak investigations (Jonquières et al., 1998). Examination of the serotypes of all isolates in this study identified the IIA, IIB, and IVB groupings as the major serotypes (41, 30, and 23%, respectively). This distribution is similar to that observed in other countries, where these groupings, along with the IIC grouping (1/2c and 3c) dominate the L. monocytogenes population (Bille and Rocourt, 1996;Wang et al., 2009). Analysis of clinical isolates from Portugal from 1997 to 2004 showed the same top three serogroups, with IVB the most common, followed by IIB and then IIA (72, 18, and FIGURE 2 | Distribution of STs identified from each source, with increasing circle size representing a larger number of isolates of that ST. Clonal complexes are partitioned (gray shading, and indicated by gray text). Connecting lines infer phylogenetic relatedness in terms of number of allelic differences (thick solid, 1; thin solid in partition, 2; thin solid outside partition, 3; broken, 4-6; dotted, no allelic matches). ST202 is lineage III.
Frontiers in Microbiology | www.frontiersin.org 11%, respectively) (Almeida et al., 2010). Of particular note is the absence of serogroup IIC among the Portuguese clinical isolates which comprised 8% of Australian clinical cases in this study. A serogroup hierarchy was noted among clinical isolates in this study with IVB being the most common, followed by IIB then IIA (56, 19, and 17%, respectively). Although lineage I (which includes IVB and IIB serogroups) has previously been identified as over-represented among clinical cases of listeriosis (Sauders et al., 2006), as was seen in this study, it should be noted that lineage II (which includes the IIA and IIC serogroups) has been equally significant in causing infection in some countries (Leong et al., 2015). This demonstrates the importance of lineage II strains in human listeriosis, and although approximately 25% of clinical isolates in this study were lineage II suggesting they remain an important etiological agent of listeriosis in Australia, it is worth noting that serogroup IIA was significantly under-represented among clinical isolates relative to the overall population structure (Supplementary Table S2).
The application of MLST to characterize L. monocytogenes has yielded many insights into the overall structure of the species population. It has served to establish genetic lineages (I, II, III, and IV), as well as define CCs of highly related L. monocytogenes strains, which have been shown to have global distribution (Meinersmann et al., 2004;Ragon et al., 2008). Analysis of the ST distribution among the population (Figure 2) identified two main groupings, comprising either lineage I or lineage II isolates. Such definitive separation of lineage I and II has previously  been observed using similar analysis of large L. monocytogenes ST datasets (Haase et al., 2013). The two largest ST groupings identified, ST3 and ST1 comprising 40% of all isolates (n = 90) are both lineage I. These have also been identified globally among the most common STs found in both clinical and non-clinical settings (Ragon et al., 2008;Haase et al., 2013). Clinical isolates were significantly over-represented among the CC1 sub-group, in contrast to an under-representation of CC3 isolates among lineage I clinical isolates (Supplementary Table S2). This strong association of CC1 clones with clinical illness has been identified in a recent large study which determined source associations and virulence of L. monocytogenes subpopulations , and it has been hypothesized that this subgroup may have increased virulence relative to others (Henri et al., 2016). That large study also noted ST121 and ST9 as significantly over-represented in food sources, and although all the six ST121 isolates in this study were similarly of food origin, no significant association of CC9 isolates in this study were observed with respect to clinical or food sources (Supplementary Table S2).
Many dominant subgroups identified in this study have also been identified as major subgroups in other studies, such as CC1, CC2, CC3, CC9, and CC155 (Chenal-Francisque et al., 2011). In addition to this geographical trends have previously been reported in relation to the L. monocytogenes population structure; for example in India a ST328 clone has been identified as widely disseminated among multiple sources including clinical cases (Barbuddhe et al., 2016). Interestingly in this study, ST204 was identified as the third most common ST with isolates from both clinical and a range of non-clinical sources. This is in contrast to other similar studies which have not identified, or recorded at lower abundance, ST204 isolates among L. monocytogenes populations from sources outside Australia (Chenal-Francisque et al., 2011;Haase et al., 2013;Unterholzner et al., 2013;Stessl et al., 2014). A notable exception being a recent study on L. monocytogenes isolated from various sources in Switzerland reporting ST204 isolates primarily from meat sources (Grohmann et al., 2003). ST204 isolates in this study were identified in a range of sources, including meat, dairy, environmental and clinical samples; in the case of ST204 isolates in this study meat sources were significantly over-represented (Supplementary Table S2). Further studies into the genomics of ST204 isolates identified a range of accessory genes involved in different stress response mechanisms among ST204 isolates, often harbored on mobile genetic elements . A high frequency of plasmid carriage has also been observed . Taken together, this may underlie the capacity for ST204 isolates to colonize a broad range of environmental niches.
Each of the five most common STs (with >10 isolates) identified all contained isolates from a clinical and a variety of non-clinical sources (Figure 2). The largest non-clinical ST identified was ST20, which was the 8th largest ST grouping overall, and only contained caprine animal isolates. In addition to ST20, ST19 also comprised isolates exclusively of animal origin, suggesting this may be a niche for these subgroups. Analysis of non-clinical categories identified in the study suggests that there may be a subset of STs linked with dairy sources (ST101, FIGURE 7 | Phylogenetic analysis of serogroup IIC core genome SNPs. STs containing multiple isolates are indicated by labeled rounded rectangles (the CC or ST is labeled on the outside right, and the number of SNPs is indicated by the number on the inside right of the rectangle).
ST122, and ST325), however, a larger dataset is required to further investigate these associations.
Of the STs containing clinical isolates, 50% did not contain isolates from other sources (Figure 3). This is consistent with the difficulty in attributing sporadic cases of listeriosis to a food source (Lee et al., 2013). This difficulty is exacerbated by the ubiquitous nature of the organism, and the wide variety of environmental niches it can colonize. A high proportion of STs with raw or RTE meat isolates (80%) also contained isolates from human listeriosis cases. It is noteworthy that meat products comprised the majority of food recalls due to L. monocytogenes contamination across Australia during the 2005-2014 period, with Food Standards Australia New Zealand (FSANZ) recall data showing L. monocytogenes was responsible for 45% of all food recalls due to microbial contamination in this period 3 . This association is further evidenced in the risk assessment study which suggested processed RTE meats could be responsible for up to 40% of Australian listeriosis cases (Ross et al., 2009). A similar examination of STs with dairy isolates in this study indicated 39% also included clinical isolates, lower than that observed with meat isolates (Figure 3). It should be noted, however, that although non-clinical isolates included a diverse representation from multiple States of Australia, the clinical isolates in this study were dominated by those isolates from Queensland. The application of PFGE or FIGURE 8 | Phylogenetic analysis of serogroup IVB core genome SNPs. CCs and STs containing multiple isolates are indicated by labeled rounded rectangles (the CC or ST is labeled on the outside right, and the number of SNPs is indicated by the number on the inside right of the rectangle). Isolates linked by colored shading are epidemiologically linked, with the number indicating the highest number of SNPs when comparing isolates in that subset.
whole genome sequencing to epidemiologically linked isolates and analysis with a broader representation of isolates from other Australian States may provide greater insight into strains shared by food and clinical sources. Continued surveillance should include quantitative microbial risk assessment to identify the contribution of different foods to the burden of human infection (Ross et al., 2009). In addition the application of source attribution modeling should be applied. This approach utilizes modeling, such as the Hald Bayesian risk attribution model, to combine highly discriminatory subtyping analysis (e.g., SNP analysis or Pulsed-Field Gel Electrophoresis) with data relating to human sporadic infection and the prevalence of those subtypes in different foods, to identify likely sources (Hald et al., 2004;Little et al., 2010). Such approaches can help identify key points to direct food safety strategies and public health interventions.
Internalin A (InlA) is a virulence protein associated with the invasive form of listeriosis (Schäferkordt and Chakraborty, 1997). This membrane-associated protein binds E-cadherin and facilitates the organism's entry into intestinal epithelial cells, a key step in the invasive process. Analysis of the inlA sequence from isolates in this study identified a heterogeneous nucleotide sequence among isolates, with 17 inlA sequence variants showing higher conservation among lineage I isolates (Figure 4). This cluster of closely related lineage I isolates was noted with a lower sequence variation relative to other InlA groups (InlA groups 4, 14, 15, and 16; Supplementary Figure S1). Seven of the eight STs in this closely related group included clinical isolates. Along with the lineage I InlA group 3 (ST1, ST2, and ST328) these lineage I isolates also clustered separately from other lineages (Figure 4). In contrast to lineage I isolates, lineage II isolates showed higher heterogeneity in their InlA sequences with a higher number of alleles showing greater genetic variation. In addition to this only 36% of these alleles (4/11) contained clinical isolates, suggesting this higher divergence in InlA tends to be associated with isolates from non-clinical sources. Previous studies have identified PMSCs in inlA, leading to truncated or secreted forms of the protein (Autret et al., 2001;Nightingale et al., 2005). These mutations have been associated with reduced invasiveness of strains harboring them. Typically the same mutations are shared by strains among an individual ST. Of the STs identified in this study, two had PMSCs: all ST121 isolates had a mutation AA 492 and all ST325 isolates had a mutation at AA 93 (both of which are lineage II STs). Although the ST121 PMSC mutation has been previously reported, to the authors' knowledge this is the first report of the novel AA 93 mutation. No ST121 or ST325 isolates in this study were from clinical sources. Although taken together this data suggests these STs may be less invasive in human patients, further study is required to confirm this.
Single nucleotide polymorphism typing analysis showed a genetically diverse serogroup IIA, with individual STs or CCs often sharing a large number of SNPs (up to 636 in the case of the most diverse ST, ST121). In addition, epidemiologically linked isolates often included a high number of SNP differences (ranging from 13 to 241). In contrast, a high degree of genetic conservation was observed among serogroup IIB isolates, particularly CC3 isolates (Figure 6). Serogroup IVB was also dominated by a single CC (CC1), which also included a clinical isolate linked to contaminated dairy products (differing by only a single SNP). The only other clinical case which was associated with a closely related food (also a dairy food) isolate was noted among ST5, although there were nine SNPs in this instance. This number is in agreement with observations in similar studies where epidemiologically linked isolates differed by less than 10 SNPs (Kwong et al., 2016). Among serogroup IIC, a clinical ST9 isolate shared only six SNPs with a food isolates, although no epidemiological link was known in this case to suggest an association. All other epidemiologically linked isolates were either exclusively from foods, or only isolated from clinical cases with no known associated food source. Although this study used SNP typing analysis to examine population structure, further investigations in our laboratory are ongoing into the loci of SNPs in relation to their impact on genetic functionality and associated phenotypes such as virulence and stress resistance.
Surveillance of foodborne disease in Australia has continued to evolve to maintain Australia's high food safety standards and well-coordinated public health program, notably through the establishment of OzFoodNet in 2000, and the enhanced listeriosis surveillance program implemented in 2010 (The OzFoodNet Working Group, 2012). Integrating detailed analysis of epidemiological data to national, and international, surveillance programs can serve to improve the efficacy and response to threats to human health posed by foodborne pathogens such as L. monocytogenes. This study highlights the association of subpopulations of L. monocytogenes to different sources in Australia, including those associated with clinical illness, and identifies genetic mutations suggesting attenuated virulence of certain subgroups in humans. The ubiquitous ecology of L. monocytogenes presents a significant challenge for source tracking, which is further exacerbated by the long incubation time of disease in patients following consumption of contaminated foods. Surveillance data where epidemiological linkages are known will help further understanding of key transmission routes and high risk foods. Additional analysis also including isolates from samples not highly represented in this study (e.g., fruit, vegetables, and seafood) will increase understanding of the distribution of L. monocytogenes subtypes through these different sources. These insights can be utilized to help maintain Australian public health and food safety.

AUTHOR CONTRIBUTIONS
AJ, NF, KG, JB, and EF conceived and designed the experiments. N-XF, RG, MB, TG, and CG performed wet laboratory experiments. AJ, JM, JB, and EF conducted bioinformatics analyses. AJ and EF drafted manuscript. All authors read and approved final manuscript and agree to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.

FUNDING
This work was co-funded by Commonwealth Scientific and Industrial Research Organisation, the Queensland Department of Health Forensic and Scientific Services Research and Development Fund and the Victorian Department of Environment and Primary Industries. This work included sequencing services provided by the Ramaciotti Centre for Genomics.

SUPPLEMENTARY MATERIAL
The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/article/10.3389/fmicb. 2017.00603/full#supplementary-material FIGURE S1 | Alignment of each inlA gene allele identified among isolates in this study. TABLE S1 | Isolates included in this study.
TABLE S2 | Distribution of major CCs, STs, or serogroups identified in this study from clinical, dairy or meat sources. The significance of this distribution to each source relative to either the overall population or to the associated lineage is shown, as determined by χ 2 analysis. Significance thresholds used are: * * p < 0.00001; * p < 0.001.