Sugar Metabolism of the First Thermophilic Planctomycete Thermogutta terrifontis: Comparative Genomic and Transcriptomic Approaches

Xanthan gum, a complex polysaccharide comprising glucose, mannose and glucuronic acid residues, is involved in numerous biotechnological applications in cosmetics, agriculture, pharmaceuticals, food and petroleum industries. Additionally, its oligosaccharides were shown to possess antimicrobial, antioxidant, and few other properties. Yet, despite its extensive usage, little is known about xanthan gum degradation pathways and mechanisms. Thermogutta terrifontis, isolated from a sample of microbial mat developed in a terrestrial hot spring of Kunashir island (Far-East of Russia), was described as the first thermophilic representative of the Planctomycetes phylum. It grows well on xanthan gum either at aerobic or anaerobic conditions. Genomic analysis unraveled the pathways of oligo- and polysaccharides utilization, as well as the mechanisms of aerobic and anaerobic respiration. The combination of genomic and transcriptomic approaches suggested a novel xanthan gum degradation pathway which involves novel glycosidase(s) of DUF1080 family, hydrolyzing xanthan gum backbone beta-glucosidic linkages and beta-mannosidases instead of xanthan lyases, catalyzing cleavage of terminal beta-mannosidic linkages. Surprisingly, the genes coding DUF1080 proteins were abundant in T. terrifontis and in many other Planctomycetes genomes, which, together with our observation that xanthan gum being a selective substrate for many planctomycetes, suggest crucial role of DUF1080 in xanthan gum degradation. Our findings shed light on the metabolism of the first thermophilic planctomycete, capable to degrade a number of polysaccharides, either aerobically or anaerobically, including the biotechnologically important bacterial polysaccharide xanthan gum.


INTRODUCTION
Planctomycetes is a bacterial phylum, comprising only a few cultivated species, while a large number of ribosomal RNA sequences from various uncultured planctomycetes have been observed in the SILVA Ref database (release 128, Quast et al., 2013). Altogether, around a hundred sequenced planctomycetes genomes are available today, including 38 genome sequences of cultivated and ca. twice that amount of uncultivated ones (IMG database, March 2017, Markowitz et al., 2012. Cultivated planctomycetes with validly published names comprise 2 classes, 3 orders, 5 families, 25 genera, and 29 species. Until recently, all of these were characterized as strictly aerobic, heterotrophic and peptidoglycan-less microorganisms, which reproduce by budding and grow at mesophilic and slightly psychrophilic conditions (Liesack et al., 1986;Fuerst and Sagulenko, 2011). However, all of these features were reconsidered during the past few years. Their cell walls have been shown to contain a uniquely thin peptidoglycan layer (Jeske et al., 2015), representatives of the novel class Phycisphaerae divide by binary fission (Fukunaga et al., 2009;Kovaleva et al., 2015) instead of budding, and, finally, a few thermophilic and facultative anaerobic representatives were recently isolated Slobodkina et al., 2015). Even though no autotrophic planctomycetes were isolated and cultivated so far, members of the third class-level lineage, represented by uncultivated anammox planctomycetes (van de Graaf et al., 1995), are thought to fix CO 2 via the acetyl-CoA pathway (Strous et al., 2006).
Thermogutta terrifontis was characterized as the first thermophilic representative of the phylum Planctomycetes (Slobodkina et al., 2015). Among two species of the genus one, T. hypogea, was isolated from a subsurface environment of a Beatrix gold mine, SAR, while the second, T. terrifontis, was isolated from a microbial mat, developed in a terrestrial hot spring of Kunashir island (Far-East of Russia). As other cultivated planctomycetes, T. terrifontis grew well on various carbohydrates including oligo-and polysaccharides. At the same time its capability of anaerobic growth by either fermentation or anaerobic respiration was a novel finding among the representatives of this phylum. Yet, nothing is known on the mechanisms underlying these novel capabilities. T. terrifontis strain R1 has been shown to grow on xanthan gum (Slobodkina et al., 2015) -a complex polysaccharide synthesized by Xanthomonas campestris, comprising a beta-1,4-glucan backbone and mannosyl-glucuronyl-mannose side chains. Xanthan gum itself has numerous biotechnological applications in cosmetics, agriculture pharmaceuticals, food and petroleum industries (García-Ochoa et al., 2000). Moreover, its oligosaccharides were described as elicitors, stimulating plants in their defense response against pathogens (Liu et al., 2005); as antimicrobial compounds (Qian et al., 2006) and as antioxidants (Xiong et al., 2013). At the moment, not much is known about its decomposition pathways, especially on mechanisms and involved proteins. A few key enzymes needed to break xanthan gum side chains (xanthan lyases, beta-glucuronyl hydrolases, and alpha-mannosidases) are known, yet still no glycosidases acting on the glucan backbone of xanthan gum have been characterized. Here, we reconstructed the central carbohydrate metabolism of T. terrifontis strain R1 using genomic and transcriptomic sequencing with a special emphasis on xanthan gum degradation.

MATERIALS AND METHODS
Cultures and DNA/RNA Extraction Thermogutta terrifontis R1 cells were grown under microaerophilic conditions for 2-4 days in glass bottles, sealed with butyl rubber plug with aluminum cap, on modified Pfenning medium supplemented with xanthan gum or trehalose as a growth substrate. Mineral growth medium (Podosokorskaya et al., 2011) was prepared aerobically; 50 mg l −1 of yeast extract (Sigma) was added as a source of indefinite growth factors. Atmospheric air was in the gas phase and no reducing agents or resazurin were added. pH was adjusted to 6.1 with 5 M NaOH. Xanthan gum (1 g/l) [KELTROL R T (food grade Xanthan Gum, Lot#2F5898K) CP Kelco] or trehalose (2 g/l) (Sigma, T9531) were added from sterile stock solutions before inoculation.
For genome sequencing 2000 ml of T. terrifontis grown culture was centrifuged at 17664 G, and cells pellet was harvested. Total DNA was extracted from the cell pellet by freezing and thawing in TNE buffer (Tris 20 mM, NaCl 15 mM, EDTA 20 mM). After treating with lysozyme, RNase A, SDS, and proteinase K the DNA was extracted with phenol/chloroform and precipitated with EtOH and dissolved in 2 mM TE buffer .
For the transcriptomic experiment, cultures on both xanthan gum and trehalose were grown for 4 days. Then, three samples of each culture (cultivated on xanthan gum and trehalose) were used for RNA extraction. RNA was extracted using TRI-reagent (Sigma), following standard protocol with the addition of two freeze and thaw cycles of the cells in TRI-reagent, as well as addition of chloroform and washing twice with 75% EtOH. RNA pellet was dissolved in RNAase-free water.

Sequencing, Assembly, and Mapping
The genome was sequenced using a combination of Illumina GA II, and Roche/454 sequencing platforms. All general aspects of 500 bp paired-end (PE) library construction and sequencing, as well as 454 single-end sequencing can be found at the corresponding company website. A hybrid assembly of Illumina and 454 datasets was done using the Newbler assembler (Margulies et al., 2005). The initial Newbler assembly consisting of seven unique contigs was used to identify repeat regions that were subsequently screened out. At this point a 2 kb mate-pair Illumina library was constructed and sequenced and obtained paired end information was used to arrange multiple screened contigs into a single scaffold using the Phred/Phrap/Consed software package (Gordon et al., 1998). This package was also used for further sequence assembly and quality assessment in the subsequent finishing process. Sequence gaps between contigs that represented repeats were filled with Dupfinisher (Han and Chain, 2006), and a single scaffold was manually created and verified using available paired-end information. Illumina reads were used to correct potential base errors and increase consensus quality.
Together, the combination of the Illumina and 454 sequencing platforms provided 320× coverage of the genome.

Genome Annotation
The assembled chromosome was uploaded to the RAST server (Aziz et al., 2008) for de novo gene prediction using Glimmer-3 (Delcher et al., 2007) and initial detection of homologs. Furthermore, the predicted genes were searched against the protein databases Pfam 27.0 (Finn et al., 2014), COG 2003(Galperin et al., 2015, MEROPS 9.12 (Rawlings et al., 2016), CAZy/dbCAN (Yin et al., 2012;Lombard et al., 2014), and TCDB (Saier et al., 2014) in order to expand the initial RAST annotation. The positive hits from these searches were manually curated using Uniprot/NCBI BLASTp. Signal peptides were predicted with the SignalP 4.1 web server (Petersen et al., 2011), and transmembrane helices were predicted with the TMHMM 2.0 web server (Krogh et al., 2001). Infernal 1.1.1 (Nawrocki and Eddy, 2013) was used in conjunction with the covariance models from Rfam 12.0 (Nawrocki et al., 2015) to search for non-coding RNA genes.

Transcriptome Sequencing and Assignment
Extracted total RNA was converted to cDNA by reverse transcriptase. Total cDNA was sequenced from the three samples of each culture by strand-specific paired-end Illumina sequencing using an insert size of 270 bp and read length of 90 bp. RNA-seq reads were mapped to the genome using BWA ver 0.7.8 (Li and Durbin, 2009) requiring properly mapped pairs. Read assignment to genes was done using the featureCounts program from the Subread package (Liao et al., 2013). Only uniquely assigned read pairs were counted. Differential gene expression between the two groups was measured using edgeRun (Dimont et al., 2015) and genes were called differentially expressed using a BH-corrected p-value of 0.05.

Phylogenetic Analysis
Phylogenetic analyses were performed according to Sorokin et al. (2016) using the maximum likelihood method in MEGA6 (Tamura et al., 2013). Initial multiple amino acid sequence alignments were done in Mafft 7 (Katoh and Standley, 2013).

Genome Assembly and General Genome Characteristics
The genome of T. terrifontis strain R1 was sequenced and assembled into a single circular chromosome with a length of 4,810,751 bp and GC content of 57.34%. Genome annotation was performed using the RAST server and Infernal. In total, 4,504 protein coding genes were found in the genome, of which 2,412 could not be annotated by our database search and are therefore designated as "hypothetical protein." Both RAST and Infernal identified the same set of the 3 rRNAs and 46 tRNAs. Additionally, the ribonuclease P (RNase P), SRP, and tmRNA genes were identified by Infernal. No homolog was found for the non-coding 6S RNA gene. A recent computational screen for 6S RNA across all bacterial phyla (Wehner et al., 2014) reported the absence of 6S RNA in Pirellula staleyi and Rhodopirellula baltica, which are the closest related species to T. terrifontis in the 16S rRNA phylogeny (Slobodkina et al., 2015). This suggests that this gene is also likely to be absent in T. terrifontis. The Infernal search also revealed three riboswitches: cyclic di-GMP-I (RF01051), cobalamin (RF00174), and fluoride (RF01734).
The genome was submitted to GenBank with the accession number CP018477.

Transcriptome Sequencing and General Transcriptome Characteristics
Thermogutta terrifontis R1 cells were cultured in growth media containing trehalose or xanthan gum, each in triplicates (see section "Materials and Methods"). Transcriptome sequencing using Illumina paired-end sequencing resulted in between 11.5 and 12.1 m read pairs for the 2 × 3 replicates. Across these, between 91.3 and 98.5% of the read pairs could be mapped uniquely to the genome. Differential expression analysis reported that 665 genes are up-and 617 genes are down-regulated on xanthan gum compared to trehalose-grown culture (Figure 1 and Supplementary Table S1).
Our analysis of the T. terrifontis R1 genome revealed 101 genes encoding glycosidases (GHs), 14 genes encoding polysaccaride lyases (PLs) and 3 genes encoding carbohydrate esterases (CEs) (Supplementary Table S2). Among these, 54 genes encode proteins that were predicted to be secreted outside the cells (whether anchored on the cells surface or being released into the culture broth). No dominant CAZy (GH, PL, or CE) families (Lombard et al., 2014) were observed among T. terrifontis R1 CAZymes, yet the most numerous were GH5 (10 proteins) and putative glycosidases (9 proteins), including DUF1080 domain. Detailed analysis of the CAZymes specificities revealed following activities: trehalose can be degraded by trehalose synthase (Qu et al., 2004) acting in opposite direction (THTE_2039). Sucrose hydrolysis may occur by the action of intracellular fructosidase (THTE_0696). Alpha-1,4-bonds and alpha-1,6-bonds in starch can be hydrolyzed by a number of GH13 and GH77 glycosidases (THTE_1477, THTE_2143, THTE_3153, and THTE_3783), producing maltooligosaccharides and finally D-glucose. Cellobiose can be hydrolyzed by the putative beta-glucosidase (THTE_0963), or one of the GH2 and GH5 glycosidases with currently uncertain function. Xylan can be decomposed to xylooligosaccharides FIGURE 1 | Differently expressed genes between xanthan gum-grown and trehalose-grown Thermogutta terrifontis R1 cells. Green and red dots indicate significantly up-regulated and down-regulated genes in xanthan gum culture compared with trehalose culture (P-value < 0.05). Annotated data points show locus tag identifiers of differentially expressed genes encoding proteins that are involved either in xantham gum / trehalose degradation pathways or in the central carbon metabolism. Y axis: log2 Fold Change between trehalose and xanthan gum-grown cultures. X axis: Read count of each gene per million mapped reads, averaged across all samples.
by means of endoxylanases (THTE_2600 and THTE_3961) and to xylose by beta-xylosidases (THTE_0688, THTE_1819, THTE_1884, and THTE_2108). Pectin degradation occurs, most probably, by the action of a pectate lyase (THTE_1993) and several polygalacturonases (THTE_0436, THTE_1516, and THTE_2121), releasing D-galacturonic residues, that are further metabolized to D-glyceraldehyde 3-phosphate and pyruvate (see below). A large number of glycosidases was predicted to be involved in xanthan gum hydrolysis (see section "Xanthan Gum and Trehalose Utilization Pathways, Revealed by Comparative Genomic and Transcriptomic Analyses").

Genome-Scale Reconstruction of Central Carbohydrate Metabolism
According to the results of genome analysis, the final products of oligo-and polysaccharides decomposition were predicted to comprise glucose, fructose, mannose, xylose, galacturonate, and glucuronate. Some of these (glucose, mannose, xylose) as well as galactose were also shown to be used as growth substrates by T. terrifontis R1 according to Slobodkina et al., 2015. D-Glucose and D-fructose oxidation seems to occur via the Embden-Meyerhof (EM) pathway (Figures 2, 3 and Supplementary Table S3). Interestingly, the genome contains four genes encoding phosphofructokinases: one ATPdependent (THTE_2190) and three pyrophosphate-dependent (THTE_0093, THTE_1056, THTE_2629). Since no genes for fructose-1,6-bisphosphatase were found, and PPi-dependent phosphofructokinases are thought to be reversible, at least one of them should be a part of gluconeogenesis. Additionally, analysis of the nearest characterized homologs supports two of them to be involved in xylose utilization (see below). The Entner-Doudoroff pathway seems to be inoperative due to the absence of the gene Frontiers in Microbiology | www.frontiersin.org encoding 6-phosphogluconate dehydratase, a key enzyme of the pathway. Glucose-1-dehydrogenase, gluconokinase, and gluconate dehydratase genes are also absent in the genome.
The first step of galactose utilization -phosphorylation to galactose-1-phosphate -is catalyzed by galactokinase (THTE_0177). Next, the putative galactose-1-P-uridyltransferase (THTE_3784) transfers the UDP-group from UDP-glucose to galactose-1-P, producing UDP-galactose and glucose-1-P. This protein belongs to the type 1 galactose-1-P-uridyltransferases family, however, its closest characterized homolog was ADPglucose:phosphate adenylyltransferase (UniProt ID Q9FK51). While phylogenetic analysis (Supplementary Figure S1) supports this finding, the sequence identity of these two proteins is rather low (Identity 35%, Coverage 97%), leaving the function of this enzyme unclear. However, since no other putative galactose-1-P-uridyltransferase genes were found, the assignment of the function to THTE_3784 remains plausible. Finally UDPgalactose is converted to UDP-glucose by the UDP-glucose 4-epimerase (THTE_2863), whereas the glucose-1-phosphate is converted to glucose-6-phosphate by the phosphoglucomutase (THTE_3829).
Xylose utilization was predicted to occur as follows: xylose is isomerized to xylulose by xylose isomerase (THTE_2111); xylulokinase (THTE_0598) phosphorylates xylulose to xylulose-5-phosphate, which finally enters the pentose-phosphate pathway. All genes encoding proteins of both oxidative and synthetic parts of this pathway were found in the genome with transaldolase gene as an exception (Figure 3 and Supplementary  Table S4). Yet, sedoheptulose-7-phosphate (S-7-P), formed under action of transketolase, could be phosphorylated by PPi-dependent phosphofructokinases THTE_0093 and THTE_2629, of which the nearest characterized homolog from Methylococcus capsulatus (UniProt Q609I3, Reshetnikov et al., 2008) was shown to reversibly phosphorylate S-7-P with higher activity and affinity than fructose-6-phosphate (F-6-P). The resulting sedoheptulose-1,7-bisphosphate could be eliminated to erythrose-4-phosphate and dihydroxyacetone-phosphate by fructose-1,6-bisphosphate aldolase (THTE_1419) as it was proposed by Susskind et al. (1982) and Schellenberg et al. (2014). D-Galacturonate, released in the course of pectin degradation, is presumably oxidized to glyceraldehyde-3-phosphate and FIGURE 3 | Enzymes involved in sugar turnover. Color boxes indicate the differential expression of the genes, indicated by the transcriptomic analysis. Green: up-regulation with xanthan gum; Red: down-regulation with xanthan gum; Blue: genes are not significantly differentially expressed between xanthan gum and trehalose.
The metabolism of xanthan gum degradation productsglucose, glucuronate, and mannose -is described in Section "Xanthan Gum and Trehalose Utilization Pathways, Revealed by Comparative Genomic and Transcriptomic Analyses." Pyruvate, generated in the course of degradation of sugars and sugar acids, is further oxidized to acetyl-CoA in the reactions catalyzed by the pyruvate dehydrogenase complex (Figure 3): pyruvate dehydrogenase (E1) (THTE_3014), dihydrolipoamide acetyltransferase (E2) (THTE_2674), and lipoamide dehydrogenase (E3) (THTE_2676).
It has been shown by Slobodkina et al. (2015), that the products of T. terrifontis R1 glucose fermentation were hydrogen, lactate and acetate. Lactate could be produced from pyruvate by lactate dehydrogenase (THTE_3348), while the mechanism of acetate formation remains unclear. Although two acetate kinases were found (THTE_1319 and THTE_2274), no genes coding for phosphate-acetyl transferase were detected in the genome. It is therefore possible that acetate could be formed due to the action of CoA-acylating aldehyde dehydrogenase (THTE_1321), catalyzing the NADH-dependent reduction of acetyl-CoA to acetaldehyde (Toth et al., 1999), and aldehyde dehydrogenase (THTE_2212) catalyzing the oxidation of acetaldehyde to acetate along with formation of NADH (Ho and Weiner, 2005). Finally, acetate could be formed under the action of putative ADP-forming acetyl-CoA synthetase (THTE_2996), as it was shown for few hyperthermophilic archaea (Musfeldt et al., 1999;Musfeldt and Schönheit, 2002). Surprisingly, the genome encoded an ATP-dependent acetyl-CoA synthase (THTE_1589), which catalyzes the irreversible activation of acetate, whereas acetate was not listed among the substrates, supporting the growth of T. terrifontis R1 in Slobodkina et al. (2015).
Hydrogen formed by T. terrifontis R1 in the course of fermentation apparently results from the operation of group 3c [NiFe]-hydrogenase (Vignais and Billoud, 2007) THTE_4311-4313 and/or [FeFe]-hydrogenases (Vignais and Billoud, 2007) THTE_2884, THTE_2882, THTE_2881, THTE_3842-THTE_3844. On the other hand, according to our analysis, all the genomes of planctomycetes, available in the IMG database (34 genomes of planctomycetes with assigned genus and species names. The analysis was performed 03.07.17), lack genes of [FeFe]-hydrogenase. Since, T. terrifontis is the first planctomycete known to synthesize hydrogen in the course of fermentation, and it is currently the only one in which genes for [FeFe]-hydrogenases have been found, at least some of its [FeFe]-hydrogenases could be involved in hydrogen production.
All genes, coding the citrate cycle (TCA cycle) enzymes were found in the T. terrifontis R1 genome (Figures 2, 3 and Supplementary Table S5).

Genome-Scale Reconstruction of Nitrate Reduction
Genes for all three subunits of the respiratory cytoplasmic nitrate reductase Nar (Simon and Klotz, 2013) were found in the genome. The alpha subunit (NarG) THTE_1509 belongs to the deep lineage within the Nar-DMSO cluster (Supplementary Figure S3) of the molybdopterine superfamily (Duval et al., 2008). THTE_1508 and THTE_1507 encode the other two subunits NarH and NarI, respectively, while THTE_1506 encodes a chaperon subunit (TorD). The NarGHI complex might form a supercomplex with an electrogenic membrane-bound NADH dehydrogenase (Simon and Klotz, 2013, Figure 2 and Supplementary Table S6). No diheme subunit NarC was found, yet the genes of cytochrome b/c 1 complex (complex III, THTE_1510-THTE_1512) are located in close vicinity to the NarGHI genes (THTE_1509-THTE_1507), what might reflect the involvement of the complex III in the electron and proton transfer during T. terrifontis anaerobic growth with nitrate. Nitrite is reduced to ammonium by means of non-electrogenic periplasmic membrane-bound nitrite reductase Nrf, the catalytic subunit NrfA and the membrane-bound subunit NrfH (Simon and Klotz, 2013) of which are encoded by THTE_1450 and THTE_1449, respectively.

Genome-Scale Reconstruction of Aerobic Respiration
The complete aerobic respiratory electron transfer chain (ETC), including H + -translocating NADH-dehydrogenase (complex I), succinate dehydrogenase (complex II), cytochrome b/c 1complex (complex III), and terminal cytochrome c oxidase aa 3type (complex IV) was found (Figure 2 and Supplementary  Table S6). We did not find the typical cytochrome c gene or the plastocyanin gene, involved in transferring electrons from complex III to complex IV, yet THTE_3354 encodes a putative large (258 amino acids) cytochrome c containing two monoheme domains. No genes encoding terminal quinol oxidases (bd-type, bo 3 -type, or ba 3 -type) were found in the genome.

Xanthan Gum and Trehalose Utilization Pathways, Revealed by Comparative Genomic and Transcriptomic Analyses
In order to decipher the mechanisms of xanthan gum degradation, T. terrifontis R1 was grown on xanthan gum and trehalose (as the control), and the transcriptomes were sequenced and analyzed for genes that are up-regulated in the cultures with xantham gum as the substrate. Trehalose is a disaccharide consisting of 1-1-alpha-linked glucose molecules, and it was chosen as one of the simple sugars, supporting growth of the strain. Interestingly, T. terrifontis R1 genomic analysis revealed no genes coding for known trehalose-hydrolyzing enzymes of GH15, GH37, and GH65 families. Furthermore, two GH13 proteins (THTE_1477 and THTE_3153) have no trehaloseconverting enzymes among their nearest characterized relatives. Therefore, the only remaining reasonable candidate involved in decomposition of trehalose is a trehalose synthase of GT4 family (THTE_2039), acting in reverse direction, leading to a release of D-glucose and NDP-D-glucose molecules. The level of its expression in cells, grown on trehalose and xanthan-gum was similar (Figure 3), what could be explained by reversibility of its action (Qu et al., 2004;Ryu et al., 2005) at various growth conditions: trehalose degradation when trehalose is being sole substrate and trehalose synthesis when other substrates are used.
Among the enzymes currently known to be involved in xanthan gum decomposition, there are few removing terminal mannose residues xanthan lyases (Hashimoto et al., 1998;Ruijssenaars et al., 1999), belonging to the PL8 family, and an endoxanthanase (xanthan-specific endoglucanase, , hydrolyzing the glucan backbone ( Figure 4A). The latter has been biochemically characterized, yet its sequence is still unknown, preventing structure analysis and evolutionary reconstructions using sequence comparison.
For cleavage of the GlcA(β1-2α)Man linkage the action of beta-glucuronidase is needed. All known beta-glucuronidases belong to the GH1, GH2, GH30, and GH79 families. While no genes encoding GH1, GH30, and GH79 proteins were found, five genes encoding GH2 family glycosidases (THTE_0208, THTE_2104, THTE_2820, THTE_3433, and THTE_3824) were revealed in the T. terrifontis R1 genome. The family GH2 contains a number of enzymes with various specificities. We compared T. terrifontis GH2s with the previously characterized members of the GH2 family, to predict whether the putative five T. terrifontis GH2 proteins act as beta-glucuronidases. All five genes formed a monophyletic group, adjoined to the cluster with characterized betagalactosidases and beta-glucuronidases (Supplementary Figure S5). Given that one of these (THTE_2104) was significantly up-regulated in the cells growing on xanthan gum, the beta-glucuronidase activity seems to be characteristic of at least this one, yet possibly all five GH2 from this monophyletic group possess this activity. It should be noted that THTE_2104 and also THTE_0208 were predicted to be secreted.
The Man(1α-β3)Glc linkage could be hydrolyzed by an alphamannosidase of GH38 family (THTE_2605). Another option is an extracellular putative alpha-galactosidase of GH36 family FIGURE 4 | Comparison of (A) known (with xanthan lyase) and (B) proposed novel (without xanthan lyase) xanthan gum degradation pathways. ? question mark indicates no sequence data available for these enzymes. Please, see the text for details.
(THTE_1560), whose characterized homologs are known to hydrolyze a number of oligosaccharides of various structures (Merceron et al., 2012). Both THTE_2605 and THTE_1560 genes were up-regulated during the growth on xanthan gum.
Finally, the hydrolysis of a Glc(β1-4)Glc linkage in xanthan gum backbone could be catalyzed by the GH5 enzymes THTE_0890, THTE_1171 and THTE_3688, however, only one of them was predicted to be extracellular (THTE_1171) and none of them were up-regulated on xanthan gum.
Search of the other putative glycosidases of T. terrifontis R1 revealed nine proteins (THTE_0474, THTE_1038,  THTE_1454,  THTE_1561,  THTE_2480,  THTE_2853, THTE_3338, THTE_4089, and THTE_4163) containing a domain of unknown function (DUF1080). All of these proteins, except THTE_1454 and THTE_3338, were predicted to be extracellular, and two of them (THTE_1561 and THTE_4089) were up-regulated on xanthan gum. These proteins may be representatives of a novel family of glycosidases according to TOPSAN annotation 1 . Such a high number of genes encoding putative glycosidases with unknown function in a xanthan gum degrading microorganism might be an indication on their involvement in the process, most probably for hydrolysis of the backbone linkage ( Figure 4B). Interestingly, proteins containing DUF1080 are highly overrepresented among all Planctomycetes (Supplementary Table S7) in comparison with other organisms. This, together with our observation that xanthan gum being a selective substrate for many planctomycetes, suggest an important role of DUF1080 proteins in xanthan gum degradation. Finally, five of nine T. terrifontis R1 DUF1080 proteins, including THTE_1561, have a CBM66 domain, which was found mainly among Firmicutes representatives and helps binding the terminal fructoside residue in fructans (Cuskin et al., 2012). Yet, the DUF1080 and CBM66 domains overlap each other, indicating two different designations of the same domain occurred.
The predicted products of xanthan gum degradation are mannose, glucuronic acid, and glucose. Although no mannokinase genes were found in the T. terrifontis R1 genome, mannose could be phosphorylated to mannose-6-P by the variety of its putative glucokinases from the ROK family (THTE_0095, THTE_2175, THTE_3207, THTE_4164), representatives of which are known to be capable of acting on various hexoses (Conejo et al., 2010;Nakamura et al., 2012). Mannose-6-phosphate upon conversion to fructose-6-phosphate by mannose-6-phopshate isomerase (THTE_1892) enters the EM pathway (Figures 2, 3). However, none of the genes coding these proteins were up-regulated on xanthan gum in our experiment, possibly due to their wide specificity and hence constitutive expression.
In most cases, the majority of enzymes involved in both the trehalose and xanthan gum-dependent pathways of the central carbohydrate metabolism were expressed on the same level or were down-regulated in the cells grown on xanthan gum. This could be due to the lower structural complexity of trehalose in comparison with xanthan gum, which requires fewer degradation steps and determines easier import into the cell: one transporter and one enzymatic step are enough to transport and decompose trehalose to the basic metabolites (D-glucose and NTP-α-D-glucose) compared with most certainly multiple transporters and four steps of xanthan gum decomposition, coupled with two-and five-step mannose and glucuronate, respectively, conversions to EM pathway metabolites (Figure 3). Finally, the majority of flagellar, as well as pili IV and secretion system proteins, were up-regulated (Supplementary Table S1) in the cells grown on xanthan gum. This might be a reflection of viscosity of the substrate, which force cells to be more agile and capable of binding to the substrate.

CONCLUSION
The central carbohydrate metabolism of T. terrifontis R1 was deciphered using genomic and transcriptomic approaches, and the novel xanthan gum degradation pathway was proposed. The pathway involves endomannanases/beta-mannosidases instead of xanthan lyases as well DUF1080 proteins for the hydrolysis of xanthan gum backbone. Surprisingly, the genes coding DUF1080 proteins were highly abundant in T. terrifontis R1, as well as in many other Planctomycetes genomes. These results are relevant due to lack of the information on microorganisms degrading xanthan gum and its degradation pathways, which has so far been limited to few representatives of Actinobacteria and Firmicutes and their enzymes. Yet, further studies including proteomics of xanthan gum-growing cultures, as well as, purification and characterization of the respective enzymes are needed to verify the predicted pathway.

DATA AVAILABILITY
The T. terrifontis R1 genome is available via GenBank accession number CP018477.

AUTHOR CONTRIBUTIONS
IK, XP, and EB-O conceived the study. SG and PM contributed to cultivation and transcriptome sequencing. AS, VK, PM, and AK contributed to genome sequencing and assembly. AE, PM, and IK contributed to genome and transcriptome analysis. AE, PM, XP, and IK wrote the manuscript.

FUNDING
The work was supported by the European Union 7th Framework Programme FP7/2007. The work of AE, EB-O and IK was supported by RSF grant 17-74-30025. The work of VK was supported by the Federal Agency for Scientific Organisations of Russia.