Key Transitions in the Evolution of Rapid and Slow Growing Mycobacteria Identified by Comparative Genomics

Mycobacteria have been classified into rapid and slow growing phenotypes, but the genetic factors that underlie these growth rate differences are not well understood. We compared the genomes of 157 mycobacterial species, representing all major branches of the mycobacterial phylogenetic tree to identify genes and operons enriched among rapid and slow growing mycobacteria. Overlaying growth phenotype on a phylogenetic tree based on 304 core genes suggested that ancestral mycobacteria had a rapid growth phenotype with a single major evolutionary separation into rapid and slow growing sub-genera. We identified 293 genes enriched among rapid growing sub-genera, including genes encoding for amino acid transport/metabolism (e.g., livFGMH operon) and transcription, as well as novel ABC transporters. Loss of the livFGMH and ABC transporter operons among slow growing species suggests that reduced cellular amino acid transport may be growth limiting. Comparative genomic analysis suggests that horizontal gene transfer, from non-mycobacterial genera, may have contributed to niche adaptation and pathogenicity, especially among slow growing species. Interestingly, the mammalian cell entry (mce) operon was found to be ubiquitous, irrespective of growth phenotype or pathogenicity, although protein sequence homology between rapid and slow growing species was low (<50%). This suggests that the mce operon was present in ancestral rapid growing species, but later adapted by slow growing species for use as a mechanism to establish an intra-cellular lifestyle.


INTRODUCTION
Mycobacteria are common environmental organisms, but some species are significant human pathogens. Mycobacterium tuberculosis is responsible for tuberculosis in humans (World Health Organization, 2019), and M. leprae and M. lepromatosis, the causes of leprosy, have become dependent on the human host for survival and dispersal (Monot et al., 2009; Singh et al., 2015). Most mycobacteria are found in soil or water, but they can occupy a variety of environmental niches. Mycobacteria are routinely classified as rapid or slow growers based on their in vitro growth characteristics (Kim et al., 2013). Slow growing species typically require more than 7 days before colonies become visible on solid media, while rapid growing species form colonies on selective media within 2-5 days (Kim et al., 2013). The slow growing phenotype has been associated with an intra-cellular lifestyle and pathogenicity, while rapid growing species are mainly environmental and include only a limited number of opportunistic pathogens (Philley and Griffith, 2015). Studies using genetic markers suggested that slow growing species represent a genetically distinct group that evolved from rapid growing species (Wee et al., 2017), while the ancestral M. abscessus-chelonae clade is highly divergent from other rapid growing species and has developed unique colonization and disease causing mechanisms (Medjahed et al., 2010; Tortoli, 2012).
The availability of high quality sequences for a large and steadily increasing number of genomes of mycobacterial species in publicly accessible databases provided opportunities for comparative genomics analyses, including a detailed assessment of key differences between rapid and slow growing species. Previous phylogenetic analyses supported a single evolutionary split into rapid and slow growing phenotypes (Tortoli et al., 2017). However, the identification of slow growing species among rapid growers suggested that growth phenotype might be variable and not necessarily related to a major "metabolic transition". In addition, recognition of intermediate growth rates in some phylogenetic clades (e.g., the M. terrae complex) demonstrated that growth rate is a complex trait (Vasireddy et al., 2016).
With the goal of providing a better understanding of the evolutionary relationship between the different mycobacterial clades, a new classification scheme has been proposed (Gupta et al., 2018). The proposed scheme redefines all mycobacteria into five sub-genera with members of the distantly related Abscessus-Chelonae clade referred to as Mycobacteroides, while the majority of other rapid growers are classified as Mycolicibacterium. The M. terrae complex, which includes slow and intermediate (5-7 days) growers (Tortoli, 2014), was included in the Mycolicibacter sub-genus. The vast majority of slow growing species and the major human pathogens, including M. tuberculosis, were classified in the Mycobacterium sub-genus. However, this analysis did not capture the underlying mechanisms that may explain growth phenotype differences, especially the key differences that occurred during the split between major rapid and slow growth phenotypes.
In order to fill this knowledge gap, we applied pangenome comparative genomics to explore the deep evolutionary origins of mycobacteria, with a specific focus on the genomic differences observed between rapid and slow growing sub-genera.

Rooting of the Mycobacterial Phylogeny
An initial phylogenetic analysis was performed comparing the five Mycobacterium sub-genera against other members of the Actinobacteria phylum, of which the five proposed mycobacteria genera are members. A phylogenetic tree was constructed using 30 conserved genes encoding for ribosomal proteins from the complete genomes of M. abscessus (sub-genus Mycobacteroides), M. smegmatis (sub-genus Mycolicibacterium), M. sinense (sub-genus Mycolicibacter), M. triviale (sub-genus Mycolicibacillus) and M. tuberculosis (sub-genus Mycobacterium), representing all the major Mycobacteria sub-genera, against six Actinobacteria species including Nocardia brasiliensis (NZ_KB907307), Rhodococcus fascians (NZ_CP015235), Corynebacterium diphtheriae (NZ_LN831026), Amycolatopsis mediterranei (NC_014318), Pseudonocardia dioxanivorans (NC_015312) and Nakamurella multipartita (NC_013235.1). Nakamurella multipartita was used to root the tree based on previous phylogenetic analysis (Lewin et al., 2016). BLAST and custom scripts were used to extract the nucleotide sequence of 30 conserved genes, which were aligned with Muscle v3.8 (Edgar, 2004) and then concatenated. The concatenated alignment was filtered using Gblocks (Castresana, 2000). The phylogenetic tree was built using the Maximum likelihood method with the General Time Reversible (GTR) model implemented in RAxML (Stamatakis, 2014). Bootstrap values were calculated using 1000 replicates.

Whole Genome-Based Phylogenetic Analysis
We included all sequenced Mycobacterium species with high-quality assemblies available in the NCBI database, as of 31 January 2019. Available genome assemblies were filtered using an assembly quality criterion (<1000 contigs). For each species, the assembly with the smallest number of scaffolds was used as the species representative. The only exception was the use of M. vulneris NCXM0100000 instead of the better-assembled M. vulneris CCBG00000000, given that the better-assembled genome likely represents a mislabeled species (Tortoli, 2018). Our final dataset included genome assemblies for 157 species (Supplementary Table S1). The genomes were uniformly reannotated using the Broad Institute's prokaryotic annotation pipeline to ensure a consistent annotation protocol for optimal genome comparison.
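The selection rule described above (discard assemblies with 1000 or more contigs, then keep the assembly with the fewest scaffolds per species) can be sketched as follows. This is a minimal illustration and not the authors' script: the record fields (`species`, `accession`, `contigs`, `scaffolds`) and the example values are hypothetical, and the manual M. vulneris exception is not modelled.

```python
from typing import Dict, List

MAX_CONTIGS = 1000  # quality threshold stated in the text (<1000 contigs)


def pick_representatives(assemblies: List[dict]) -> Dict[str, dict]:
    """Keep assemblies under the contig limit, then choose, per species,
    the assembly with the fewest scaffolds."""
    best: Dict[str, dict] = {}
    for asm in assemblies:
        if asm["contigs"] >= MAX_CONTIGS:
            continue  # fails the assembly quality filter
        current = best.get(asm["species"])
        if current is None or asm["scaffolds"] < current["scaffolds"]:
            best[asm["species"]] = asm
    return best


# Hypothetical records, for illustration only
example = [
    {"species": "sp_A", "accession": "asm1", "contigs": 1500, "scaffolds": 1400},
    {"species": "sp_A", "accession": "asm2", "contigs": 120, "scaffolds": 40},
    {"species": "sp_A", "accession": "asm3", "contigs": 60, "scaffolds": 12},
    {"species": "sp_B", "accession": "asm4", "contigs": 1, "scaffolds": 1},
]
representatives = pick_representatives(example)
```

Ties on scaffold count keep the first assembly encountered; a real pipeline would probably need an explicit tie-breaking rule.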
For detailed assessment of the evolutionary relationship between different Mycobacterium species, SynerClust v1 (Georgescu et al., 2018) was employed to perform orthogroup clustering across the 157 assembled sequences, resulting in a set of 304 single-copy core genes. Orthologs were defined as genes that were vertically inherited and have the same function. Sequences for each orthogroup were individually aligned using Muscle v3.8 and then concatenated to build a Maximum likelihood phylogenetic tree using the GTR model with 1000 bootstrap replicates.
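The concatenation step can be illustrated with a short sketch. It assumes each single-copy orthogroup has already been aligned (for example with Muscle) and is held as a mapping from taxon name to aligned sequence; the function name and the toy sequences are hypothetical, and the resulting supermatrix would still need to be passed to a tree-building program.

```python
from typing import Dict, List


def concatenate_alignments(alignments: List[Dict[str, str]]) -> Dict[str, str]:
    """Join per-gene alignments into one supermatrix row per taxon.

    Each input maps taxon name -> aligned sequence; all sequences within
    one gene alignment must have the same length."""
    taxa = sorted(alignments[0])
    parts: Dict[str, List[str]] = {taxon: [] for taxon in taxa}
    for aln in alignments:
        if sorted(aln) != taxa:
            raise ValueError("single-copy core genes must be present in every taxon")
        if len({len(seq) for seq in aln.values()}) != 1:
            raise ValueError("sequences within one alignment must have equal length")
        for taxon in taxa:
            parts[taxon].append(aln[taxon])
    return {taxon: "".join(chunks) for taxon, chunks in parts.items()}


# Toy alignments for two genes and two taxa (illustrative only)
gene1 = {"taxon_a": "ATG-CA", "taxon_b": "ATGGCA"}
gene2 = {"taxon_a": "TTGA", "taxon_b": "TT-A"}
combined = concatenate_alignments([gene1, gene2])
```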

Pangenome Analysis
Since the phylogenetic tree identified a major evolutionary split between rapid and slow growing phenotypes, SynerClust orthogroups (Georgescu et al., 2018) were used to compare the pangenome of rapid versus slow growing mycobacteria on opposite branches of the split. We adopted the proposed sub-genera scheme (Gupta et al., 2018) when referring to different groups of mycobacterial species. Firstly, we used relaxed criteria by searching for orthogroups present in >80% of rapid growing species (sub-genus Mycolicibacterium) and in <20% of slow growing species (sub-genera Mycobacterium, Mycolicibacillus, and Mycolicibacter). Rapid growing species from the Mycobacteroides sub-genus were excluded from this comparative analysis, as they were phylogenetically divergent. Three Mycolicibacterium species (M. doricum, M. farcinogenes, and M. tusciae) with an atypical slow growing phenotype were also excluded. In addition, M. algericum from the Mycolicibacter sub-genus was excluded as an outlier rapid growing species. However, species with intermediate growth rates in specific environmental conditions were included (Sahraoui et al., 2011). The 80/20 cut-off was used to ensure that genes were not excluded due to poor assembly or other errors and still allowed for accurate identification of genes enriched (uniquely conserved) in rapid growing species (Schreiber et al., 2017). The same approach was used to identify genes enriched among slow growing species.
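The relaxed 80/20 criterion can be expressed compactly. The sketch below assumes orthogroup membership is available as a mapping from orthogroup ID to the set of species carrying it; the function name, the orthogroup IDs and the species labels are hypothetical, and the statistical filtering (q-values) applied in the study is not reproduced here.

```python
from typing import Dict, List, Set


def enriched_orthogroups(presence: Dict[str, Set[str]],
                         target: List[str],
                         background: List[str],
                         min_target: float = 0.80,
                         max_background: float = 0.20) -> List[str]:
    """Return orthogroups present in more than `min_target` of the target
    species and in fewer than `max_background` of the background species."""
    hits = []
    for group, carriers in presence.items():
        frac_target = sum(s in carriers for s in target) / len(target)
        frac_background = sum(s in carriers for s in background) / len(background)
        if frac_target > min_target and frac_background < max_background:
            hits.append(group)
    return sorted(hits)


# Toy data: six rapid and six slow growing species (labels are illustrative)
rapid = ["r1", "r2", "r3", "r4", "r5", "r6"]
slow = ["s1", "s2", "s3", "s4", "s5", "s6"]
presence = {
    "og_transporter": {"r1", "r2", "r3", "r4", "r5", "s1"},  # 5/6 rapid, 1/6 slow
    "og_core": set(rapid) | set(slow),                       # present everywhere
    "og_virulence": set(slow),                               # slow growers only
}
rapid_enriched = enriched_orthogroups(presence, rapid, slow)
slow_enriched = enriched_orthogroups(presence, slow, rapid)
```

Swapping the `target` and `background` arguments gives the slow-grower comparison, mirroring the statement that the same approach was used in both directions.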
Uniquely enriched gene clusters, with q-values less than 0.05, identified in either rapid or slow growing species, were classified into Clusters of Orthologous Groups (COGs) using the WebMGA interface (Wu et al., 2011). COG results were parsed for matches with e-value of >1 × 10⁻⁵ and checked to see if a gene matched to multiple COG models, in which case the model with the best match was used. If a COG model was included in multiple classes, then each class was recorded, as it could represent a protein with multiple functional roles. We also confirmed that all members of the gene cluster matched the same COG model. Fisher's exact tests were performed to assess significant differences in the number of functional categories between rapid and slow growing species. Stringent criteria were then applied to search for orthogroups present in both rapid growing sub-genera (Mycobacteroides and Mycolicibacterium), but absent in all members of the slow growing sub-genera (Mycolicibacillus, Mycolicibacter, and Mycobacterium). The presence and absence of these gene operons across genera was visualized using iTOL (Letunic and Bork, 2019).
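The two bookkeeping rules described above (keep the best COG match per gene, and count every class of a multi-class COG model) might look like the following sketch. The hit tuples, the COG model IDs and the class assignments are hypothetical placeholders; parsing of actual WebMGA output and the e-value filter are deliberately left out.

```python
from collections import Counter
from typing import Dict, List, Tuple


def count_cog_classes(hits: List[Tuple[str, str, float]],
                      cog_classes: Dict[str, str]) -> Counter:
    """hits: (gene, cog_model, e_value) tuples.

    For each gene keep the hit with the lowest e-value, then count every
    functional class letter assigned to that COG model."""
    best: Dict[str, Tuple[str, float]] = {}
    for gene, cog, evalue in hits:
        if gene not in best or evalue < best[gene][1]:
            best[gene] = (cog, evalue)
    counts: Counter = Counter()
    for cog, _ in best.values():
        for letter in cog_classes[cog]:  # multi-class models count once per class
            counts[letter] += 1
    return counts


# Hypothetical hits and class assignments, for illustration only
hits = [
    ("gene1", "COG_X", 1e-30),
    ("gene1", "COG_Y", 1e-10),  # weaker match for gene1, ignored
    ("gene2", "COG_Z", 1e-50),
]
cog_classes = {"COG_X": "E", "COG_Y": "K", "COG_Z": "EP"}
class_counts = count_cog_classes(hits, cog_classes)
```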

Comparative Genomics and Genomic Island Identification
Pairwise whole genome comparison of M. tuberculosis H37Rv against selected rapid and slow growing strains was performed using the Artemis comparison tool (Carver et al., 2005) and visualized using the BLAST ring image generator (BRIG) (Alikhan et al., 2011). Two other slow growing (M. intermedium and M. paratuberculosis) and three rapid growing (M. smegmatis, M. neoaurum, and M. fortuitum) mycobacteria with completed and closed genomes were selected for this analysis to broadly represent both branches of the major evolutionary split that occurred between rapid and slow growing species. M. tuberculosis H37Rv was selected as the reference genome for this analysis (Cole et al., 1998; Camus et al., 2002). Regions of differences (RODs) that were identified were compared against the assemblies of the 157 mycobacterial genomes using BLASTn, to observe the distribution across the different sub-genera.

Rapid Growing Species Are Ancestral
The phylogeny of Actinobacteria has confirmed that mycobacteria are evolutionarily related to Rhodococcus and Nocardia species (Supplementary Figure S1). It also indicated that the Mycobacteroides sub-genus, represented by M. abscessus, is the most ancestral mycobacterial sub-genus. Therefore, we used the Mycobacteroides sub-genus to root our core gene based phylogenetic tree of all 157 mycobacterial genome sequences (Figure 1). Similar to previous findings, our core gene phylogeny identified five major mycobacterial sub-genera (Gupta et al., 2018). Ancestral strains belonging to the Mycobacteroides and Mycolicibacterium sub-genera included almost all of the rapid growers, with a single evolutionary split separating the vast majority of species with a rapid or slow growing phenotype. The predominantly fast-growing Mycolicibacterium sub-genus also contained three slow growing species, M. doricum, M. farcinogenes, and M. tusciae, interspersed on separate terminal branches. All other slow growing species were members of the three sub-genera located on the other arm of the major phylogenetic split: Mycobacterium, Mycolicibacillus, and Mycolicibacter. M. algericum was the only rapid growing species within these three sub-genera, located on a terminal branch of the Mycolicibacter sub-genus.

Rapid Growing Species Were Enriched in Genes Related to Amino Acid Transport/Metabolism and Transcription
We identified 293 genes that were highly enriched among rapid growing species and 309 among slow growing species. Classification of enriched genes into COG functional categories (Figure 2) revealed that genes related to amino acid transport and metabolism (31 genes in rapid growers vs. 16 genes in slow growers) and transcription (26 genes in rapid growers vs. 14 genes in slow growers) were highly enriched among rapid growers.
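A comparison of this kind can be checked with a two-sided Fisher's exact test. The sketch below is a plain standard-library implementation; the layout of the 2x2 table (genes in a category versus all other enriched genes, for the 293 rapid-enriched and 309 slow-enriched sets) is an assumption made for illustration, since the exact contingency tables used in the study are not stated here.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table."""
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x: int) -> float:
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # small relative tolerance guards against floating-point ties
    total = sum(p for p in (prob(x) for x in range(lo, hi + 1))
                if p <= p_obs * (1 + 1e-9))
    return min(1.0, total)


# Assumed table: amino acid transport/metabolism genes vs. all other
# enriched genes (31 of 293 in rapid growers, 16 of 309 in slow growers)
p_amino_acid = fisher_exact_two_sided(31, 293 - 31, 16, 309 - 16)

# Classic sanity check ("lady tasting tea" table), known p = 34/70
p_tea = fisher_exact_two_sided(3, 1, 1, 3)
```

Where many categories are tested at once, a multiple-testing correction would normally be applied to the resulting p-values.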
The enriched and conserved genes in the rapid growing species included 19 genes that are arranged into four operons, all encoding transporter functions (Figure 3). Three of these operons (livFGMH, ABC operon 1, and ABC operon 2) were either annotated as amino acid transporters or predicted to be amino acid transporters based on COG annotations (Table 1). The livFGMH operon, which consists of five genes that transport branched chain amino acids across the lipid rich mycobacterial cell wall, was universally present in the rapid growing sub-genera Mycobacteroides and Mycolicibacterium. The other two operons (ABC operon 1 and 2) are uncharacterized ABC transporters predicted by COG annotation to have a role in amino acid transport, and were found in the rapid growing Mycolicibacterium sub-genus only. We also noted that the shaACDEFG operon, associated with ion transport and pH balance, was uniquely absent from the slow growing Mycobacterium sub-genus, which includes most pathogenic species.
Using the stringent criteria, we identified 40 additional gene orthologs present in the majority (>98%) of the members of the Mycobacteroides and Mycolicibacterium sub-genera, but absent in all members of slow growing species (Mycobacterium, Mycolicibacillus, and Mycolicibacter sub-genera). Unlike the livFGMH operon, these genes do not cluster into operons and are scattered across the chromosome. The majority of these genes have functions related to cell respiration (dehydrogenases) and other poorly characterized metabolism roles (Table 2).
FIGURE 1 | Phylogenetic tree of all well characterised Mycobacterium species. This Maximum likelihood tree of 157 well-characterized Mycobacterium species is based on nucleotide alignment of 304 single copy genes. It shows five distinct sub-genera and indicates that slow growers evolved from more ancestral fast growing species. Bootstrap values are shown on nodes with less than 100% support.
To understand additional factors potentially involved in slow growth rate, we sought to identify unique genetic properties shared among the individual terminal branch outliers with a slow-growth phenotype found within the rapid growing Mycolicibacterium sub-genus. However, no consistent and significant features could be determined, likely due to the limited number of outlier species with a slow growing phenotype and the possibility of different mechanisms employed by each outlier, given that they are each located on a terminal branch.

Mammalian Cell Entry (mce1) Operon Found in Rapid and Slow Growing Mycobacteria
Among the 309 enriched genes in the slow growing species, genes associated with the ESX-5 Type VII secretion system, PPE family and the mammalian cell entry (mce) operon were most prevalent. The mce1 operon comprises two yrbE genes (yrbE1A and yrbE1B) and six mce genes (mceA1, mceB1, mceC1, mceD1, mceE1, and mceF1) (Zhang and Xie, 2011). Homologs of the mce1 operon were also detected in rapid growing species, although the gene clustering approach used in this study separated the rapid grower homologs into a separate ortholog group. In order to examine the differences in the mce1 operon in more detail, the protein sequences encoded by the mce1 genes were extracted from the genome of M. tuberculosis H37Rv and BLASTp was used to assess the sequence variability across all 157 genomes (Figure 4). Within the Mycobacterium sub-genus, mce1 proteins shared >80% sequence identity, with almost 100% identity among species belonging to the M. tuberculosis complex. Shared sequence identity with the Mycolicibacillus, Mycolicibacter, and Mycolicibacterium sub-genera was significantly reduced for the six mce genes (<60%). The two yrbE genes, which encode membrane proteins similar to ABC transporter permeases, have homologs with sequence identity in the range of 65-75% in most Mycobacterium species; however, sequence identity with the ancestral Mycobacteroides sub-genus was less than 14%.
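For readers unfamiliar with the identity values quoted above, the quantity can be illustrated on a pair of already-aligned protein sequences. This is a simplified sketch: BLASTp reports identity over its own local alignments, whereas the function below (a hypothetical helper with toy sequences) simply counts matching columns in a supplied pairwise alignment.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of alignment columns with identical residues.

    Columns where both sequences have a gap are ignored; a column where
    only one sequence has a gap counts as a mismatch."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == "-" and y == "-")]
    if not columns:
        raise ValueError("alignment has no informative columns")
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)


# Toy alignment: 4 identical columns out of 6
identity = percent_identity("MKT-LV", "MKSALV")
```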

Horizontal Gene Transfer in Slow Growing Mycobacteria
It has been hypothesized that horizontal gene transfer (HGT) played an important role in the evolution of pathogenic slow growing phenotypes. In order to investigate the role of HGT in the evolution of slow growing species, we performed a whole-genome alignment using BLAST of the well-annotated M. tuberculosis H37Rv genome compared against two other slow growing pathogens (M. intermedium and M. paratuberculosis) and three rapid growing species (M. smegmatis, M. neoaurum, and M. fortuitum) to identify regions where elements have inserted into the M. tuberculosis H37Rv genome (Figure 5). In total, 16 RODs were identified and BLASTn was used to confirm the distribution of these RODs across the 157 genomes.
Six RODs (GI-LeuT, ROD-1, ROD-2, ROD-5, ROD-6, and ROD-10) are present in at least 70% of slow growing species, with notable absences in M. leprae and M. lepromatosis, which have undergone significant genome reduction (Monot et al., 2009; Singh et al., 2015). Matches to these regions were observed in <10% of rapid growing species. The other 10 RODs were either found to be exclusive to the M. tuberculosis complex, which is comprised of a group of almost clonal species including M. bovis and M. africanum, or sporadic in both slow and rapid growing genomes (Supplementary Table S2). One of the RODs commonly found in slow growers, but missing from rapid growers, was identified as a potential genomic island (GI-LeuT) containing genes that encode biofilm regulators and a number of hypothetical proteins. The other five RODs contained genes encoding hypothetical proteins (ROD-1 and ROD-6) and/or PPE family proteins (ROD-2 and ROD-5), while ROD-10 encodes a number of membrane proteins (PPE and ESX secretion proteins) involved in host-pathogen interactions. A second genomic island, GI-LeuX, found only in species of the M. tuberculosis complex, carried three transposase genes along with ESX and PPE genes. Other RODs found only in the M. tuberculosis complex include ROD-4, which encodes an ABC transporter and is associated with a spike in the GC content indicating likely acquisition via HGT. PPE family proteins were encoded on ROD-9, ROD-13, and ROD-14. ROD-11 encodes a single toxin-antitoxin system and contains a Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) region (Freidlin et al., 2017). ROD-8 is located near the mce1 operon and encodes multiple toxin-antitoxin systems.

DISCUSSION
This report illustrates the capacity of comparative genomics to explain differences between mycobacterial growth phenotypes using comprehensive sets of high-quality genomes. The phylogenetic tree presented here was built from a concatenated alignment of 304 single copy genes from ortholog clusters generated using a novel SynerClust-based orthogroup method, which allowed us to leverage genomic synteny to help identify orthologs across species. This high resolution phylogenetic tree was congruent with previous phylogenetic analyses that used different methods and separated mycobacteria into five major sub-genera (Tortoli et al., 2017; Gupta et al., 2018). The novel comparative genomics methods employed allowed us to focus with high granularity on key genetic differences between rapid and slow growing species.
Our detailed mycobacterial phylogeny suggests a single deep evolutionary split between rapid and slow growing species, with ancestral species being rapid growers, with only a handful of terminal branch exceptions. These include M. farcinogenes, M. doricum, and M. tusciae being slow growing species located within a rapid sub-genus and M. algericum identified as a rapid growing M. terrae-complex species located within the slow and intermediate growing Mycolicibacter sub-genus (Sahraoui et al., 2011). M. tusciae and M. doricum both exhibit substantially slower growth rates than other members of their fast-growing clade. M. tusciae takes 4 weeks to grow on Middlebrook agar (Tortoli et al., 1999), and M. doricum takes 2 weeks on Lowenstein-Jensen medium (Tortoli et al., 2001). This suggested ongoing growth phenotype evolution, which is also illustrated by M. farcinogenes that forms colonies in 5-10 days compared to the closely related M. senegalense that forms colonies in 2-5 days (Hamid, 2014). However, switches in growth phenotypes are rare and the majority of species within specific sub-genera maintain the growth rate of their ancestors, with a single major evolutionary transition from rapid to slow growing phenotypes.
This study focussed on genomic differences between the rapid growing Mycolicibacterium sub-genus and slow growing sub-genera on the other branch of this evolutionary split, excluding species with a discordant growth phenotype. It was found that amino acid transporters were highly conserved in rapid growing species, but absent in slow growing species. Amino acid transporters are membrane-bound proteins that mediate the transfer of amino acids into and out of cells, with critical roles in regulating energy metabolism, protein synthesis and redox balance (Kandasamy et al., 2018). The ATP-binding livFGMH operon specifically transports branched chain amino acids such as leucine, isoleucine and valine, which are important for bacterial growth (Conner and Hansen, 1967). The livFGMH operon is conserved in all rapid growing species, including the outgroup Mycobacteroides sub-genus, but absent from all slow growing sub-genera. Using stricter cut-offs, the enriched orthogroups were filtered to identify an additional 40 genes found only in rapid growing species (Table 2). The majority of these genes have predicted dehydrogenase functions, although their metabolic functions remain poorly characterized. Importantly, this study identified two ATP-binding transporter operons unique to rapid growers, predicted to be involved in amino acid transport based on COG annotations. Both operons consisted of four genes, including an ATP-binding protein, a substrate binding protein and two permeases (transmembrane proteins). ABC transporters are amongst the most common ATP powered transporters found in bacteria and are linked to the transport of a wide variety of substrates and lipids across the membrane bilayer (Ford and Beis, 2019). The ability to utilize a variety of substrates and metabolites is likely to be important for sustaining rapid growth (Kandasamy et al., 2018), with a strong signal that sub-optimal amino acid transport may be particularly growth limiting.
These two ABC operons were found to be conserved only in the Mycolicibacterium sub-genus and absent in Mycobacteroides. They were seemingly acquired when mycobacteria diverged from the Mycobacteroides outgroup and were then lost from species that evolved into slow growers. The likely gain and loss of these transporters, during this key evolutionary transition, increases interest in their specific function.
The shaACDEFG operon was selectively absent from the more pathogenic Mycobacterium sub-genus. It encodes a Na+/H+ antiporter that regulates cellular pH under extreme conditions (Kosono et al., 2005). A recently published genomic comparison of 28 rapid and slow growing species revealed several genes making up the livFGMH and shaACDEFG operons, and genes that encode Msp porins, to be absent in pathogenic Mycobacterium species (Wee et al., 2017). However, this small study excluded other slow growing genera and our study demonstrates that, unlike the livFGMH operon that is universally absent from all slow growing genera, the shaACDEFG operon is only absent from the more pathogenic Mycobacterium sub-genus. This would suggest that the shaACDEFG operon was deleted as mycobacteria developed a more pathogenic intracellular lifestyle.
A key virulence mechanism for M. tuberculosis is the ability to invade host macrophages, which is in part mediated by the mammalian cell entry (mce1) operon. While the mce1 operon is considered a key feature of slow growing pathogenic mycobacteria, homologs of the yrbE and mce genes are also found in distant rapid growing species such as M. smegmatis and M. abscessus (Kumar et al., 2005; Sassi and Drancourt, 2014). Interestingly, the gene clustering approach used in this study divided the mce1 operon from rapid and slow growing species into two ortholog clusters, with limited homology between the slow growing M. tuberculosis complex and the ancestral rapid growing M. abscessus-chelonae complex. This level of variability suggests that the mce1 operon probably fulfilled a different function in ancestral rapid growing species. It may have provided ancestral mycobacteria with the ability to enter amoeba cells (Zhang and Xie, 2011), paving the way for the adoption of a pathogenic intra-cellular lifestyle with ongoing evolution of the mce1 operon facilitating macrophage entry. The potential advantage afforded is supported by the presence of four duplicated mce1 operons in pathogenic mycobacterial species, which suggests strong positive selective pressure. It has been suggested that the smaller genome size observed in more pathogenic slow growers resulted from the loss of genes required for essential nutrient uptake and metabolism in the environment (Devulder et al., 2005), which were no longer required after they acquired the ability to live within host cells.
FIGURE 4 | The mce1 operon is highly variable across all five mycobacterial sub-genera. The left-hand panel depicts the phylogeny as in Figure 1. The right panel shows a gradient heatmap based on protein sequence identity for each protein encoded by the mce1 operon using BLASTp.
Frontiers in Microbiology | www.frontiersin.org
In contrast to our observation of whole operons uniquely conserved in rapid growers, we identified only scattered genes coding for the ESX and PPE protein families to be highly enriched among slow growers. These genes produce proteins that are involved in host interaction and are mainly secreted by ESX Type VII secretion systems, which are considered to be important for virulence (Simeone et al., 2015). Numerous hypothetical and metabolic genes were also detected on 16 genomic islands that likely represent horizontal gene transfer; six were common in slow growing species but rarely observed in rapid growers. Evidence of horizontal gene transfer includes the presence of a tRNA at the boundary of the island, as well as the presence of genes encoding transposase and distinct differences in GC content.
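One of the signals mentioned above, a local deviation in GC content, can be screened for with a simple sliding window. The sketch below is only illustrative: the window size, step and deviation threshold are arbitrary assumptions, the toy sequence is synthetic, and a real genomic island search would combine this with the other evidence listed (flanking tRNA genes, transposases).

```python
from typing import List, Tuple


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty string)."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def gc_windows(seq: str, window: int, step: int) -> List[Tuple[int, float]]:
    """GC fraction of each full-length window, keyed by window start."""
    return [(start, gc_fraction(seq[start:start + window]))
            for start in range(0, len(seq) - window + 1, step)]


def deviating_windows(seq: str, window: int, step: int,
                      threshold: float) -> List[int]:
    """Start positions of windows whose GC fraction differs from the
    whole-sequence GC fraction by more than `threshold`."""
    overall = gc_fraction(seq)
    return [start for start, gc in gc_windows(seq, window, step)
            if abs(gc - overall) > threshold]


# Synthetic example: a GC-rich backbone with an AT-rich insert at 40-59
toy = "GC" * 20 + "AT" * 10 + "GC" * 20
flagged = deviating_windows(toy, window=20, step=20, threshold=0.5)
```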
Several of the islands identified are consistent with islands found using an automated detection method for the M. tuberculosis genome (Becq et al., 2007), but we detected several additional genomic islands (RODs 2, 5, 6, 7, and 12). In particular, ROD-1 carries genes that encode biofilm regulators and an ESX secretion protein that is an ESX Type VII secretion system substrate. The ESX Type VII secretion system has been shown to be critical for virulence (Houben et al., 2012, 2014). ROD-2, which encodes highly polymorphic PPE proteins, offers another example of horizontal gene transfer in slow growers. The ROD comparison was limited by the inclusion of only select high quality assembled genomes, since fragmented genomes will lower the accuracy of genomic island detection. Although the RODs identified would be influenced by strain selection, strains were selected from representative metabolic groups and divergent branches of the evolutionary tree to maximize the insight gained.

CONCLUSION
In conclusion, this study provides a detailed description of key genetic differences between rapid and slow growing mycobacteria using a novel comparative genomics approach. We identified four operons that are uniquely conserved in rapid growing sub-genera of mycobacteria, which could be targeted in future microbiological and pharmacological studies to elucidate their role in growth phenotype determination. Potential genes that could be knocked out or silenced to observe growth rate differences include those linked to amino acid transport, such as the livFGMH and ABC transporter operons. The mce1 operon was found to be ubiquitous among mycobacteria, but has evolved significantly among slow growing species capable of sustained intra-cellular survival, suggesting a crucial role in this lifestyle transition.

DATA AVAILABILITY STATEMENT
Publicly available datasets were analyzed in this study. These data can be found at https://www.ncbi.nlm.nih.gov/ with accession numbers: NC_010397.1, NZ_CP007220.