Skip to main content

ORIGINAL RESEARCH article

Front. Microbiol., 13 July 2021
Sec. Food Microbiology
This article is part of the Research Topic Campylobacter-associated Food Safety View all 32 articles

Core Genome MLST for Source Attribution of Campylobacter coli

\r\nLucas Harrison*Lucas Harrison1*Sampa MukherjeeSampa Mukherjee1Chih-Hao HsuChih-Hao Hsu1Shenia YoungShenia Young1Errol StrainErrol Strain1Qijing ZhangQijing Zhang2Glenn E. TillmanGlenn E. Tillman3Cesar MoralesCesar Morales3Jovita HaroJovita Haro3Shaohua ZhaoShaohua Zhao1
  • 1U.S. Food and Drug Administration, Center for Veterinary Medicine, Laurel, MD, United States
  • 2College of Veterinary Medicine, Iowa State University, Ames, IA, United States
  • 3U.S. Department of Agriculture, Food Safety and Inspection Service, Athens, GA, United States

Campylobacter species are among the leading foodborne bacterial agents of human diarrheal illness. The majority of campylobacteriosis has been attributed to Campylobacter jejuni (85% or more), followed by Campylobacter coli (5–10%). The distribution of C. jejuni and C. coli varies by host organism, indicating that the contribution to human infection may differ between isolation sources. To address the relative contribution of each source to C. coli infections in humans, core genome multilocus sequence type with a 200-allele difference scheme (cgMLST200) was used to determine cgMLST type for 3,432 C. coli isolated from food animals (n = 2,613), retail poultry meats (n = 389), human clinical settings (n = 285), and environmental sources (n = 145). Source attribution was determined by analyzing the core genome with a minimal multilocus distance methodology (MMD). Using MMD, a higher proportion of the clinical C. coli population was attributed to poultry (49.6%) and environmental (20.9%) sources than from cattle (9.8%) and swine (3.2%). Within the population of C. coli clinical isolates, 70% of the isolates that were attributed to non-cecal retail poultry, dairy cattle, beef cattle and environmental waters came from two cgMLST200 groups from each source. The most common antibiotic resistance genes among all C. coli were tetO (65.6%), blaOXA193 (54.2%), aph(3′)-IIIa (23.5%), and aadE-Cc (20.1%). Of the antibiotic resistance determinants, only one gene was isolated from a single source: blaOXA61 was only isolated from retail poultry. Within cgMLST200 groups, 17/17 cgMLST200-435 and 89/92 cgMLST200-707 isolates encoded for aph(3’)-VIIa and 16/16 cgMLST200-319 harbored aph(2’)-If genes. Distribution of blaOXA alleles showed 49/50 cgMLST200-5 isolates contained blaOXA498 while blaOXA460 was present in 37/38 cgMLST200-650 isolates. The cgMLST200-514 group revealed both ant(6)-Ia and sat4 resistance genes in 23/23 and 22/23 isolates, respectively. Also, cgMLST200-266 and cgMLST200-84 had GyrAT86I mutation with 16/16 (100%) and 14/15 (93.3%), respectively. These findings illustrate how cgMLST and MMD methods can be used to evaluate the relative contribution of known sources of C. coli to the human burden of campylobacteriosis and how cgMLST typing can be used as an indicator of antimicrobial resistance in C. coli.

Introduction

Campylobacter species are spiral or rod-shaped gram-negative bacteria with optimal growth conditions in microaerophilic environments at 30–47°C (Hsieh et al., 2018). Despite these growth requirements, Campylobacter species are often isolated from a wide range of sources including animal digestive tracts, freshwater reservoirs and packaged food products (Horrocks et al., 2009; Zhao et al., 2015; Dearlove et al., 2016). While many Campylobacter strains are carried by host organisms asymptomatically, a subset of strains can cause gastric enteritis in humans (Lee and Newell, 2006; Pascoe et al., 2020). This Campylobacter-associated disease is known as campylobacteriosis and is the leading cause of gastric enteritis worldwide (World Health Organization, 2018). In the United States, a 2019 report identified Campylobacter species as the most common foodborne pathogen with an incidence of 19.5 infections per 100,000 people (Tack et al., 2019). While Campylobacter bacteria are found in a wide range of sources, human exposure to retail meats has been identified as a major risk factor for campylobacteriosis (Osimani et al., 2017).

The contribution to human campylobacteriosis is not spread evenly among Campylobacter species. The two most common Campylobacter species associated with foodborne illness are C. jejuni and C. coli. These species are the causative agents in > 85% and 5–10% of human campylobacteriosis, respectively (Patrick et al., 2018). Data from the National Antimicrobial Resistance Monitoring System (NARMS) shows that these species differ both in their antibiotic resistance profiles and their prevalence among food animal sources (Food and Drug Administration, 2021). Antibiotic resistance profiles of C. coli from retail meats are more commonly identified as resistant to at least one class of antibiotics, with resistance to macrolides, lincosamides, and ketolides showing a noticeable difference between species (Zhao et al., 2010; Food and Drug Administration, 2021). NARMS surveillance data of food animal cecal samples in the US reveals C. coli as the dominant Campylobacter species in swine and poultry samples (Dessai, 2017).

While C. coli isolation rates are comparable between cecal samples of swine and chicken, differences in post-slaughter processing practices can affect their isolation rate from retail meats of these animals. In communities not localized near food animals, exposure to retail meat products is a greater contributor to the burden of human campylobacteriosis than exposure to the food animals themselves (Rosner et al., 2017). Among the retail meats, retail poultry represents the greatest risk factor for human infection (Food and Drug Administration, 2010; Rosner et al., 2017). While isolates were readily recovered from the caeca of cattle and swine, C. coli were rarely isolated (<1% prevalence) from beef and pork retail meat products. As a result, screening for Campylobacter species in retail beef and pork products was discontinued in NARMS retail meat program in 2008. While the prevalence of C. coli is low among these food sources, the percentage of cattle and swine populations colonized with C. coli implicate them as potential reservoirs for C. coli infection in humans (Dessai, 2017; Plishka et al., 2020).

Retail meat products may be contaminated with C. coli originating from other sources through a variety of routes (Rasschaert et al., 2020). For example, studies have detailed how contact between infected animals during transport, airborne contamination in processing plants, bacterial contamination of contact surfaces at processing sites or even transfer of material from one sample to another during mechanical separation all may contribute to cross-contamination of poultry products (Battersby et al., 2016; Oliveira et al., 2020; Rasschaert et al., 2020; Konduru et al., 2021). With the advent of molecular surveillance methods, individual strains may be identified and tracked throughout the food processing chain. One method that is used for source tracking is core genome multilocus sequence typing (cgMLST) (Maiden et al., 2013). The core genome of Campylobacter species is defined by 1,343 loci as determined by Cody et al. (2017). Allele profiles of these 1,343 loci have sufficient diversity throughout the population of Campylobacter species that they can successfully be used for strain typing using a cgMLST schema (Hsu et al., 2020). Strains with similar allele profiles can be clustered into groups by allowing for a threshold of allelic differences between strains. Our previous study evaluating C. jejuni populations using core genome MLST with an allele difference of 200 (cgMLST200) showed an excellent correlation with traditional MLST typing and had a greater utility for source attribution and correlation with AMR profiles (Hsu et al., 2020). This method classifies strains into different groups if the allelic identity of more than 200 of the 1342 loci of the Campylobacter core genome differ between strains.

Recently, a minimal multilocus distance (MMD) analysis of Campylobacter genomes was demonstrated to be effective for source attribution (Pérez-Reche et al., 2020). This MMD method can be optimized to process the same 1,343 loci of the Campylobacter core genome as the cgMLST200 typing scheme to classify strains. Unlike the cgMLST200 scheme, though, the core genomes of individual strains are used to generate the strain clusters that define each source. This can be used for source attribution by comparing the core genomes from strains of an unknown source to the core genome model of each known isolation source and obtaining the probabilities of the strain having originated from each. While this method has the benefit of targeting associations between the core genome and defined metadata categories (e.g., isolation source), it is not currently optimized to account for distinct subpopulations within a strain cluster. That is to say, while the MMD method is well suited to identify if a strain originated from a chicken source, it may not be able to discriminate between the antibiotic-resistant subpopulations found within the chicken dataset. The objective of this study was to use both cgMLST and MMD methods to characterize the relative contributions of C. coli sources to act as reservoirs for antibiotic resistance genes and the relative contribution of each source to human campylobacteriosis.

Materials and Methods

Bacterial Strains, Sequencing, and AMR Identification

Whole genome sequencing data representing 3,432 C. coli isolates from environmental sources including water, soil and wild birds, food animal sources, human and retail meat sources were collected through the NARMS program, the National Center for Biotechnology Information (NCBI) database and the European Nucleotide Archive (ENA) database. Sequences of human (n = 285) and environmental source (n = 145) isolates were downloaded directly from their respective online databases. Food animal isolates were obtained through cecal sampling by the United States Department of Agriculture Food Safety and Inspection Service (USDA-FSIS) as part of NARMS animal arm and consisted of 964 cattle, 898 swine, 475 chicken and 296 turkey isolates of C. coli. Retail meat isolates collected through the NARMS retail meat program contained 374 chicken and 11 turkey isolates. All strains included in this study were isolated between 1999 and 2019. Genomic DNA was extracted using the Qiagen DNeasy Blood and Tissue kit (Qiagen, Gaithersburg, MD) and genomes were sequenced on an Illumina MiSeq using v2 or v3 chemistry (Illumina, San Diego, CA).

C. coli genomes were assembled using the CLC Genomics Workbench version 8.0 (CLC bio Aarhus, Denmark). Contigs less than 200 bp were removed prior to assembly and error correction was performed using the map reads back to contigs method. AMR determinants, including both resistance genes and point mutations, were identified with AMRFinder using database version 2020-09-30.1 (Feldgarden et al., 2019). Coverage and identity parameters were set to default values, using a minimum coverage of the reference protein of 0.5 and a minimum identity for BLAST hits of either the curated threshold value specific to the hit, or 0.9 if no curated threshold value was present.

Core Genome MLST

The cgMLST groups for the C. coli dataset were assigned as previously described using an in house script (Hsu et al., 2020). Briefly, the cgMLST type for each strain was assigned using the allele sequences of the 1,343 loci that define the Campylobacter core genome. A BLAST search was performed to identify the alleles for each locus and up to 100 missing alleles was allowed for each genome. For strains where sequence data at the cgMLST loci was missing, the loci was assigned a value of “N.” Following cgMLST designation, the distance between core genomes was calculated based on the pairwise distance between allelic profiles and was measured in allelic difference. Strains were organized into groups allowing for a 200-allele difference between strains using a single-linkage clustering method.

The core genomes were used to generate a minimum spanning tree of the C. coli population using GrapeTree v1.5.0. A single table containing the core genome allele identities for each strain was used to generate the minimum spanning tree using GrapeTree v1.5.0. Each row of the table corresponded to a single strain and the table contained 1,343 columns, each representing one core genome loci. The cells were populated with the allele identity for each strain and loci pair. Using the MSTree V2 algorithm, the pairwise dissimilarity of core genome loci was compared between strains and used to generate the minimum spanning tree.

Source Attribution

A source attribution model of human-pathogenic and retail meat C. coli isolates was generated using the MMD methodology developed by Pérez-Reche et al. (2020). Within this project, MMD source attribution was used to evaluate the dissimilarity of cgMLST loci between isolates of a known source, such as the population of C. coli isolated from chicken caeca, to those isolated from humans or retail meat samples. The pairwise dissimilarity, quantified as the Hamming distance, was compared between all known sources and the individual strains from human or retail meats to determine the likelihood of having originated from each of the sources.

The core genome multilocus sequence profiles and the known-source metadata of the 3,234 C. coli isolates were evaluated using the MMD R script. Source attribution was determined by modifying the MMD configuration file to include the isolation sources present in the isolate metadata file and setting the source attribution flag as the analysis to be performed. MMD self-attribution used the same dataset as source-attribution, with the configuration file modified to flag self-attribution as the analysis and a user-determined source as the target for self-attribution. Self-attribution analysis then generated cgMLST models for each isolation source and mapped he self-attribution target population back to the isolation source their cgMLST most closely resembles.

Results

Antibiotic Resistance Profiles of C. coli

Screening C. coli sequence data with AMRFinder revealed 50 distinct alleles of antibiotic resistance genes and resistance-associated mutations among the isolates. While tet(O), blaOXA193 and aph(3’)-IIIa were the three most common antibiotic resistance genes, they were underrepresented in the environmental isolate subpopulation (Figure 1). The blaOXA61 gene was almost exclusively found in retail poultry isolates with only a single instance recovered from a cecal turkey isolate. While blaOXA594 and blaOXA460 were most common among cecal chicken and turkey populations at a rate of 9.8–18.1%, blaOXA594 was missing from the retail poultry isolates and blaOXA460 was only present in 0.5% of the population. A set of 344 blaOXA alleles could not be annotated at the allele level due to incomplete sequence and were not included in this project. Ribosomal mutations resulting in the substitutions L23_A2075G and L22_A103V associated with macrolide and lincosamide resistance did not show exclusivity for any isolation source, though L23_A2075G was more strongly represented in swine and human isolates (Supplementary Table 1). L23_A2075G was present at the greatest rate in cecal swine isolates at 20.9% and L22_A103V was most prevalent in environmental sources at 17.8%. The GyrA_T86I substitution associated with resistance to fluoroquinolones was more common, present in cecal cattle (61.1%), human (48.8%), cecal turkey (35.8%), retail poultry (26.2%), cecal chicken (17.7%), and environmental (7.8%) isolates.

FIGURE 1
www.frontiersin.org

Figure 1. Prevalence of AMR determinants in C. coli populations among different isolation sources.

Core Genome Multilocus Sequence Typing of C. coli

Campylobacter coli isolates were characterized by their core genome of 1,343 loci and visualized as a minimum spanning tree (Figure 2). Isolates from the same source co-localized among branches, though few branches were exclusive to a single source. Notably, while isolates from human sources were observed throughout the tree, they were present at the highest concentrations on the same branch that favored poultry isolates, including both cecal and retail isolates.

FIGURE 2
www.frontiersin.org

Figure 2. Minimum spanning tree of C. coli dataset as determined by their core genome. Each circle represents a cgMLST group and the size of the circle is proportional to the number of isolates in that group. Tree was generated in GrapeTree using a log depiction of branch length.

cgMLST200 Typing of C. coli

Organizing Campylobacter strains by core genome into groups that allow a 200-allele difference among isolates is an effective subtyping method for differentiating strains both among and between Campylobacter species. Our cgMLST analysis generated 1,085 cgMLST200 groups from our 3,432 C. coli isolates. Within the 1,085 cgMLST200 groups, 818 contain only one isolate each and were designated as singletons. The majority of these singleton strains were isolated from the caeca swine, indicating greater sequence diversity in the swine population compared to other isolation sources. Non-singleton cgMLST200 group analysis revealed that 126 of the remaining 267 cgMLST200 groups were identified among multiple sources. These results show that 88.4% of the cgMLST200 groups were specific to a single isolation source and only 11.6% could be found in multiple hosts.

Campylobacter cgMLST200 classification was then used to evaluate the similarity of strains from humans to strains recovered from other sources. Evaluating the cgMLST200 groups from each source showed that 25/285 C. coli clinical isolates belonged to a cgMLST200 group associated with only one other source (Figure 3). The largest contributor to human-pathogenic strains was a set of 8 cgMLST200 groups that contained isolates from cattle, chicken, turkey and swine sources. These accounted for 100/285 C. coli strains isolated from humans.

FIGURE 3
www.frontiersin.org

Figure 3. Distribution of cgMLST200 groups that contain human pathogenic strains of C. coli. The majority of human isolates belong to cgMLST200 groups containing isolates from a variety of sources. The first number listed in each source shows how many cgMLST200 groups contain human-pathogenic C. coli. The second number, in brackets, is the number of human-pathogenic strains in the cgMLST200 groups.

Strains recovered from swine were least similar to human isolates, with only 7.7% of the population belonging to human-pathogenic cgMLST200 groups (Table 1). Evaluating turkey, chicken and environmental populations revealed that 33.6, 26.3, and 20.1% of their populations shared cgMLST200 groups with humans, respectively. Cattle isolates showed the strongest representation of human-pathogenic cMLST200 groups, with 87.7% of the strains belonging to one of these groups. A single human-pathogenic cgMLST200 group (cgMLST200-234) also contained ∼two-thirds of the entire cattle population and accounted for the large difference in proportion between cattle and other sources (Supplementary Table 2).

TABLE 1
www.frontiersin.org

Table 1. Isolation source composition of cgMLST200 groups containing clinical strains of C. coli.

Several cgMLST200 groups showed strong correlation with the presence of antibiotic resistance determinants (Supplementary Table 1). Seven antibiotic resistance genes were conserved at > 95% prevalence in 6 cgMLST200 groups (Table 2). Further, strains from these cgMLST200 groups retained the corresponding antibiotic resistance genes regardless of the isolation source. Within the cgMLST200-5 group, blaOXA489 was found in strains isolated from swine, cattle, turkey, human and environmental sources. The cgMLST200-5 group was notable as well because 50/50 strains contained a mutation in 50S_L22A103V that can confer resistance to macrolide antibiotics. In addition, all 16 isolates in cgMLST200-266 group had a mutation in GyrAT86I associated with resistance to fluoroquinolone antibiotics (Table 3).

TABLE 2
www.frontiersin.org

Table 2. Prevalence of AMR genes in cgMLST200 groups.

TABLE 3
www.frontiersin.org

Table 3. Prevalence of AMR mutations in cgMLST200 groups.

Core Genome Minimal Multilocus Distance Analysis

MMD methodology was used to determine the contribution of each Campylobacter source to human campylobacteriosis. This approach differs from cgMLST200 typing in that MMD generates a model of core genome profiles that best represent each isolation source before performing source attribution of the C. coli clinical isolates. Because of this difference in methodology, source attribution calls of MMD can differ from those determined by cgMLST200 typing. A population analysis of human-pathogenic C. coli using MMD highlights cecal chicken isolates as the greatest contributor to campylobacteriosis and indicates cecal swine sources as the weakest contributor (Table 4). Further population analysis revealed that isolates collected from swine and environmental sources showed highest likelihood of correct self-attribution during method validation (Supplementary Figure 1).

TABLE 4
www.frontiersin.org

Table 4. Source attribution of human-pathogenic C. coli population.

In contrast to the population-wide analysis noted above, we then employed MMD to perform source attribution for the individual strains that were isolated from humans (Figure 4). Campylobacter contamination of retail meats can occur at multiple points from slaughter processing to packaging for retail consumption (Oliveira et al., 2020). To address the potential of a heterogeneous source population obtained from retail meats, clinical isolates were only sourced to cecal isolate populations that were obtained prior to processing. Only 4.2% of the clinical isolates could be attributed to a single source with 100% likelihood and 11.9% of the population could be attributed at >95% likelihood. C. coli were attributed to chicken and turkey sources at 36.5 and 9.1%, respectively, for a combined total of 45.6% from poultry sources. A total of 23.9% of human isolates were attributed to environmental sources and the remaining 18.6 and 11.9% were attributed to cattle and swine, respectively.

FIGURE 4
www.frontiersin.org

Figure 4. Source attribution of human-pathogenic C. coli to environmental and food animal cecal sources. Each bar represents a single C. coli isolate obtained from a human source and the color composition of the bar shows the likelihood of the strain as originating from the evaluated sources.

MMD was then used to determine the likely source of C. coli isolates from retail poultry samples. As expected, analysis of the individual strains revealed that 79.2% of C. coli recovered from retail poultry meats showed the strongest likelihood of attribution to cecal chicken or cecal turkey isolates (Figure 5). Interestingly, environmental sources showed the next highest likelihood of attribution, accounting for 13.8% of the population while the remaining strains were attributed to cattle and swine at 4.7 and 2.3%, respectively.

FIGURE 5
www.frontiersin.org

Figure 5. Source attribution of C. coli isolated from retail meats to environmental and food animal cecal sources. Each bar represents a single C. coli isolate obtained from a retail meat source and the color composition of the bar shows the likelihood of the strain as originating from the evaluated sources.

As a final step, we compared the self-attribution results from the turkey, chicken and retail poultry isolates between the non-poultry sources using the Tukey multiple comparison of means. This comparison was made to determine if any of the poultry sources were preferentially identified as having originated a non-poultry source to indicate a source of contamination. Turkey, chicken and retail poultry isolates were attributed to a cattle source at a similar rate of 8.7–10.3%. Similarly, turkey, chicken and retail poultry isolate were attributed to a swine source at a range of rates from 3.4 to 4.3%. Attribution to environmental sources showed a difference in attribution rates between turkey, chicken and retail poultry isolates. Chicken isolates were attributed to environmental sources at a rate of 0.4% and turkey isolates at 2.2% (Figure 6). Retail poultry isolates, however, were attributed to environmental sources at a rate of 8.0%.

FIGURE 6
www.frontiersin.org

Figure 6. A comparison of self-attribution results for poultry datasets to non-poultry sources as an indicator of population similarity. Blinded validation of poultry datasets misattributed poultry isolates to cattle and swine sources at similar rates. Misattribution of retail poultry isolates to environmental sources occurred at a higher rate (**p < 0.001) than the misattribution of cecal chicken or cecal turkey isolates.

Discussion

In this study, we used the C. coli core genome to categorize isolates with two independent methods. The first method, cgMLST200, is a subtyping scheme that organizes strains based on the similarity of the core genomes. We have demonstrated how strains with similar core genome profiles can show strong association with specific metadata categories, such as AMR determinants or isolation host. The second method, MMD analysis, evaluates the core genomes of all strains of a given metadata group (e.g., source) and establishes core genome profiles of each group. Using this method, we were able to evaluate each strain from our human and retail meat populations and identify the most likely source for attribution.

Comparing the results from each method reveals the methods’ relative strengths and limitations. The metadata-agnostic cgMLST200 typing schema clusters related strains into groups with 100% probability of membership, but the groups themselves are not necessarily congruent with any metadata feature. This makes cgMLST200 analyses well suited for our evaluation of populations against large metadata sets, such as associations to the entire library of AMR determinants. Another benefit of the cgMLST200 typing schema is that it provides a common naming system that categorizes Campylobacter strains by core genome similarity and can be applied to strains where only sequence data is known. The MMD approach uses metadata to define source attribution groups and provides the relative likelihood of any sample originating from any of the attribution groups. While this method is optimized to identify the likely source of a C. coli strain, it requires that the sources be made up of distinct populations. This trait prevented us from attributing human-pathogenic C. coli to the retail poultry isolates, which were a composite of chicken, turkey and putative contaminants from other sources.

The presence of strains from the retail poultry population attributed to swine, environmental and cattle isolates implicate retail poultry consumption as one route by which humans may be exposed to C. coli from environmental contaminants. A comparison of self-attribution results among retail poultry isolates to cecal samples showed no difference in their attribution rates to swine and cattle. This indicates that fraction of poultry isolates attributed to swine and cattle is not greatly affected by events post-slaughter. The fraction of C. coli isolated from all poultry samples attributed to environmental sources reveals that strains recovered from retail poultry are more likely than cecal samples to be attributed to our dataset of C. coli generated from environmental sources. Because there is variation in the rate of attribution to only one of the three sources, this indicates that the discrepancy is not due to allelic variability within the retail poultry isolates alone.

Comparing the mean cgMLST200 group size was one indicator of genomic diversity among our attribution sources. Mean cgMLST200 group size increased as core genome diversity between strains decreased below the 200-allele difference threshold. One limitation of evaluating genomic diversity using this metric is that interpretation of the results is dependent on the number of allele differences allowed. This limitation is illustrated when comparing our results between cattle and swine isolates. Although swine and cattle sources contained a similar number of isolates, there were ∼5 times as many cgMLST200 groups in swine. A closer look at the cattle data revealed that 80% of the cattle strains were classified into only two cgMLST200 groups. Taking this unequal distribution of strains among cgMLST200 groups into account, the genomic diversity of C. coli isolated from cattle outside these two cgMLST200 groups is similar to the swine isolates. While the majority of cattle isolates were classified into similar cgMLST200 groups, the opposite was observed in the swine isolates. Of the 617 cgMLST200 groups present in swine, 505 groups contained only 1 isolate each. Evaluating these data between isolation sources using a cgMLST200 schema alone provides a general metric for comparing the genomic diversity among isolates from different food sources.

While the MMD method provided detailed predictions of source attribution, the cgMLST200 typing scheme was able to cluster several large groups of strains with conserved AMR determinants. Combining these two methods to identify subpopulations encoding antibiotic resistant determinants was relevant to this project as, with the exception of blaOXA61 in retail poultry, the resistance determinants did not show a bias for any isolation source. This is in contrast with earlier AMR profiling of C. jejuni from food animals. The C. jejuni study showed that chromosomal mutations associated with macrolide resistance were absent from the cattle population while here we show these chromosomal mutations are present in C. coli from all isolation sources. This point is noteworthy because C. coli and C. jejuni are often isolated from the same environment and experience similar sets of evolutionary selective pressures.

In this study, we have shown how cgMLST200 typing can be used to inform the likelihood of certain antimicrobial resistance determinants in subpopulations of C. coli. Further, both methods used for source attribution indicated that swine sources contributed least to the human burden of campylobacteriosis. This is despite a high prevalence of C. coli isolated from swine cecal samples. The core genomes of poultry isolates showed the highest similarity to human isolates while environmental, cattle and swine isolates were increasingly dissimilar. Without supporting epidemiological data, however, we are unable to claim that any of the C. coli isolated from humans originated from a particular source. Data generated from these methods allow us to evaluate the genomic similarity of human-pathogenic C. coli strains to each of the isolation sources. Using both cgMLST200 and MMD analyses, we have demonstrated the utility of large-scale genomic analysis of C. coli for identifying risk factors of campylobacteriosis, including identifying the likely source of antimicrobial resistant C. coli infections.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author Contributions

ES, LH, and SZ were involved in the conception of the project and study design. CM, JH, SM, and SY collected sequence and metadata. SM was responsible for database organization and curation. C-HH, LH, SM, and SZ performed data analysis. C-HH, CM, ES, GT, JH, LH, QZ, SM, SY, and SZ revised and reviewed the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This research received no external funding and the APC was funded by the FDA.

Disclaimer

The views expressed in this article are those of the authors and do not necessarily reflect the official policy of the Department of Health and Human Services, the U.S. Food and Drug Administration, or the U.S. Government. Reference to any commercial materials, equipment, or process does not in any way constitute approval, endorsement, or recommendation by the Food and Drug Administration.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2021.703890/full#supplementary-material

References

Battersby, T., Whyte, P., and Bolton, D. (2016). The pattern of Campylobacter contamination on broiler farms; external and internal sources. J. Appl. Microbiol. 120, 1108–1118. doi: 10.1111/jam.13066

PubMed Abstract | CrossRef Full Text | Google Scholar

Cody, A. J., Bray, J. E., Jolley, K. A., McCarthy, N. D., and Maiden, M. C. (2017). Core genome multilocus sequence typing scheme for stable, comparative analyses of Campylobacter jejuni and C. coli human disease isolates. J. Clin. Microbiol. 55, 2086–2097. doi: 10.1128/jcm.00080-17

PubMed Abstract | CrossRef Full Text | Google Scholar

Dearlove, B. L., Cody, A. J., Pascoe, B., Méric, G., Wilson, D. J., and Sheppard, S. K. (2016). Rapid host switching in generalist Campylobacter strains erodes the signal for tracing human infections. ISME J. 10, 721–729. doi: 10.1038/ismej.2015.149

PubMed Abstract | CrossRef Full Text | Google Scholar

Dessai, U. (2017). NARMS at the USDA: Monitoring Antimicrobial Resistance in Food Animals at Slaughter and Processing. Washington, DC: United States Department of Agriculture Food Safety and Inspection Service.

Google Scholar

Feldgarden, M., Brover, V., Haft, D. H., Prasad, A. B., Slotta, D. J., Tolstoy, I., et al. (2019). Validating the AMRFinder tool and resistance gene database by using antimicrobial resistance genotype-phenotype correlations in a collection of isolates. Antimicrob. Agents Chemother. 63, e00483-19.

Google Scholar

Food and Drug Administration (2010). National Antimicrobial Resistance Monitoring System-Enteric Bacteria (NARMS): 2007 Executive Report. Rockville, MD: Department of Health and Human Services, US FDA.

Google Scholar

Food and Drug Administration (2021). NARMS Now. Avaliable online at: https://www.fda.gov/animal-veterinary/national-antimicrobial-resistance-monitoring-system/narms-now-integrated-data (accessed May 27, 2021).

Google Scholar

Horrocks, S., Anderson, R., Nisbet, D., and Ricke, S. (2009). Incidence and ecology of Campylobacter jejuni and coli in animals. Anaerobe 15, 18–25. doi: 10.1016/j.anaerobe.2008.09.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Hsieh, Y.-H., Simpson, S., Kerdahi, K., and Sulaiman, I. M. (2018). A comparative evaluation study of growth conditions for culturing the isolates of Campylobacter spp. Curr. Microbiol. 75, 71–78. doi: 10.1007/s00284-017-1351-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Hsu, C.-H., Harrison, L., Mukherjee, S., Strain, E., McDermott, P., Zhang, Q., et al. (2020). Core genome multilocus sequence typing for food animal source attribution of human Campylobacter jejuni infections. Pathogens 9:532. doi: 10.3390/pathogens9070532

PubMed Abstract | CrossRef Full Text | Google Scholar

Konduru, B., Sagi, S., and Parida, M. (2021). Dry reagent-based multiplex real-time PCR assays for specific identification of chicken, mutton, beef and pork in raw and processed meat products. Eur. Food Res. Technol. 247, 737–746. doi: 10.1007/s00217-020-03662-1

CrossRef Full Text | Google Scholar

Lee, M. D., and Newell, D. G. (2006). Campylobacter in poultry: filling an ecological niche. Avian Dis. 50, 1–9. doi: 10.1637/7474-111605r.1

PubMed Abstract | CrossRef Full Text | Google Scholar

Maiden, M. C., Van Rensburg, M. J. J., Bray, J. E., Earle, S. G., Ford, S. A., Jolley, K. A., et al. (2013). MLST revisited: the gene-by-gene approach to bacterial genomics. Nat. Rev. Microbiol. 11, 728–736. doi: 10.1038/nrmicro3093

PubMed Abstract | CrossRef Full Text | Google Scholar

Oliveira, M., Tiwari, B. K., and Duffy, G. (2020). Emerging technologies for aerial decontamination of food storage environments to eliminate microbial cross-contamination. Foods 9:1779. doi: 10.3390/foods9121779

PubMed Abstract | CrossRef Full Text | Google Scholar

Osimani, A., Aquilanti, L., Pasquini, M., and Clementi, F. (2017). Prevalence and risk factors for thermotolerant species of Campylobacter in poultry meat at retail in Europe. Poult. Sci. 96, 3382–3391. doi: 10.3382/ps/pex143

PubMed Abstract | CrossRef Full Text | Google Scholar

Pascoe, B., Schiaffino, F., Murray, S., Méric, G., Bayliss, S. C., Hitchings, M. D., et al. (2020). Genomic epidemiology of Campylobacter jejuni associated with asymptomatic pediatric infection in the Peruvian Amazon. PLoS Negl. Trop. Dis. 14:e0008533. doi: 10.1371/journal.pntd.0008533

PubMed Abstract | CrossRef Full Text | Google Scholar

Patrick, M., Henao, O., Robinson, T., Geissler, A., Cronquist, A., Hanna, S., et al. (2018). Features of illnesses caused by five species of Campylobacter, foodborne diseases active surveillance network (FoodNet)–2010–2015. Epidemiol. Infect. 146, 1–10. doi: 10.1017/s0950268817002370

PubMed Abstract | CrossRef Full Text | Google Scholar

Pérez-Reche, F. J., Rotariu, O., Lopes, B. S., Forbes, K. J., and Strachan, N. J. (2020). Mining whole genome sequence data to efficiently attribute individuals to source populations. Sci. Rep. 10, 1–16.

Google Scholar

Plishka, M., Sargeant, J. M., Greer, A. L., Hookey, S., and Winder, C. (2020). The Prevalence of Campylobacter in Live Cattle, Turkey, Chicken, and Swine in the United States and Canada: a systematic review and meta-analysis. FoodbornePathog. Dis. 18, 230–242. doi: 10.1089/fpd.2020.2834

PubMed Abstract | CrossRef Full Text | Google Scholar

Rasschaert, G., De Zutter, L., Herman, L., and Heyndrickx, M. (2020). Campylobacter contamination of broilers: the role of transport and slaughterhouse. Int. J. Food Microbiol. 322:108564. doi: 10.1016/j.ijfoodmicro.2020.108564

PubMed Abstract | CrossRef Full Text | Google Scholar

Rosner, B. M., Schielke, A., Didelot, X., Kops, F., Breidenbach, J., Willrich, N., et al. (2017). A combined case-control and molecular source attribution study of human Campylobacter infections in Germany, 2011–2014. Sci. Rep. 7, 1–12.

Google Scholar

Tack, D. M., Marder, E. P., Griffin, P. M., Cieslak, P. R., Dunn, J., Hurd, S., et al. (2019). Preliminary incidence and trends of infections with pathogens transmitted commonly through food—Foodborne Diseases Active Surveillance Network, 10 US sites, 2015–2018. Morb. Mortal. Weekly Rep. 68, 369–373. doi: 10.15585/mmwr.mm6816a2

PubMed Abstract | CrossRef Full Text | Google Scholar

World Health Organization (2018). Campylobacter. Avaliable online at: https://www.who.int/news-room/fact-sheets/detail/campylobacter (accessed March 01, 2020).

Google Scholar

Zhao, S., Mukherjee, S., Chen, Y., Li, C., Young, S., Warren, M., et al. (2015). Novel gentamicin resistance genes in Campylobacter isolated from humans and retail meats in the USA. J. Antimicrob. Chemother. 70, 1314–1321. doi: 10.1093/jac/dkv001

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhao, S., Young, S., Tong, E., Abbott, J., Womack, N., Friedman, S., et al. (2010). Antimicrobial resistance of Campylobacter isolates from retail meat in the United States between 2002 and 2007. Appl. Environ. Microbiol. 76, 7949–7956. doi: 10.1128/aem.01297-10

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Campylobacter coli, source attribution, cgMLST, food safety, antibiotic resistance, MMD

Citation: Harrison L, Mukherjee S, Hsu C-H, Young S, Strain E, Zhang Q, Tillman GE, Morales C, Haro J and Zhao S (2021) Core Genome MLST for Source Attribution of Campylobacter coli. Front. Microbiol. 12:703890. doi: 10.3389/fmicb.2021.703890

Received: 30 April 2021; Accepted: 21 June 2021;
Published: 13 July 2021.

Edited by:

Xiaonan Lu, McGill University, Canada

Reviewed by:

Jasna Kovac, The Pennsylvania State University (PSU), United States
Linda Antionette Bester, University of KwaZulu-Natal, South Africa

Copyright © 2021 Harrison, Mukherjee, Hsu, Young, Strain, Zhang, Tillman, Morales, Haro and Zhao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Lucas Harrison, THVjYXMuSGFycmlzb25AZmRhLmhocy5nb3Y=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.