Metabolic Potential, Ecology and Presence of Associated Bacteria Is Reflected in Genomic Diversity of Mucoromycotina

Mucoromycotina are often considered mainly in pathogenic context but their biology remains understudied. We describe the genomes of six Mucoromycotina fungi representing distant saprotrophic lineages within the subphylum (i.e., Umbelopsidales and Mucorales). We selected two Umbelopsis isolates from soil (i.e., U. isabellina, U. vinacea), two soil-derived Mucor isolates (i.e., M. circinatus, M. plumbeus), and two Mucorales representatives with extended proteolytic activity (i.e., Thamnidium elegans and Mucor saturninus). We complement computational genome annotation with experimental characteristics of their digestive capabilities, cell wall carbohydrate composition, and extensive total lipid profiles. These traits inferred from genome composition, e.g., in terms of identified encoded enzymes, are in accordance with experimental results. Finally, we link the presence of associated bacteria with observed characteristics. Thamnidium elegans genome harbors an additional, complete genome of an associated bacterium classified to Paenibacillus sp. This fungus displays multiple altered traits compared to the remaining isolates, regardless of their evolutionary distance. For instance, it has expanded carbon assimilation capabilities, e.g., efficiently degrades carboxylic acids, and has a higher diacylglycerol:triacylglycerol ratio and skewed phospholipid composition which suggests a more rigid cellular membrane. The bacterium can complement the host enzymatic capabilities, alter the fungal metabolism, cell membrane composition but does not change the composition of the cell wall of the fungus. Comparison of early-diverging Umbelopsidales with evolutionary younger Mucorales points at several subtle differences particularly in their carbon source preferences and encoded carbohydrate repertoire. Nevertheless, all tested Mucoromycotina share features including the ability to produce 18:3 gamma-linoleic acid, use TAG as the storage lipid and have fucose as a cell wall component.


INTRODUCTION
Mucoromycotina subphylum comprises three orders: Umbelopsidales, Endogonales and Mucorales (Spatafora et al., 2016). While Umbelopsidales and Mucorales group mostly saprotrophic fungi living in the soil, on dung and litter, Endogonales are known to establish symbiotic interactions with plants (Desirò et al., 2017). The ancestors of extant Mucoromycotina were among the first colonizers of land. These early-branching fungi possess all of the traits commonly acknowledged as distinctive characteristics of the fungal kingdom such as apical growth, presence of ergosterol in the membranes and a cell wall made of chitin and betaglucan (Richards et al., 2017). Yet, their cell wall differs from ascomycetes and basidiomycetes by the presence of fucose and high amounts of N-acetylglucosamine and glucuronic acid (Bartnicki-Garcia, 1968;Mélida et al., 2015) which is typical for other Opisthokonta rather than fungi. Mucoromycotina representatives form a fast-growing mycelium of haploid hyphae. The sexual phase includes the fusion of two gametangia and formation of a resting spore called zygospore (Spatafora et al., 2016).
Apart from decomposing organic matter as saprotrophs, some Mucoromycotina are capable of forming mutualistic and mycorrhiza-like associations with Haplomitriopsida liverworts (Bidartondo et al., 2011;Field et al., 2015). Others are parasites of plants and animals (Partida-Martinez and Hertweck, 2005;Ibrahim and Spellberg, 2014). The ecology of Mucoromycotina is poorly studied, which hinders understanding of their role in the ecosystem (Richardson and Rautemaa-Richardson, 2019).
Some Mucorales are involved in life-threatening, opportunistic infections with mortality rates reaching up to 40-80% (Cornely et al., 2019). There are general traits which predispose microorganisms to become opportunistic pathogens, e.g., thermotolerance and ability to evade immune cells. Many non-pathogenic fungi are adapted to higher temperatures due to living in decomposing organic matter, often warmed up due to rotting processes (Maheshwari et al., 2000;Neher et al., 2013), which allows them to survive inside the animal warm-blooded body.
Here we describe the genomes of six Mucoromycotina species representing separated saprotrophic lineages within the subphylum (i.e., Umbelopsidales and Mucorales). Despite numerous studies on Mucoromycotina in a pathogenic context their basic biology remains understudied. We selected two Umbelopsis isolates from soil: U. isabellina and U. vinacea, two soil-derived Mucor isolates: M. circinatus and M. plumbeus, and two Mucorales representatives: Thamnidium elegans and M. saturninus with proteolytic capabilities enabling them to colonize dung and animal substrate (Hanagasaki and Asato, 2018). Two strains were obtained from the CBS-KNAW culture collection and the remaining four were selected from the Mucoromycotina collection of University of Warsaw Herbarium. We complement genome analysis with a phenotypic description of their digestive potential, their cell wall carbohydrate composition, and total lipid profiles. Finally, we link the presence of endohyphal bacteria with observed characteristics.

Genome Assembly and Annotation
Obtained assemblies showed diverse levels of fragmentation depending on genome size and abundance of repeats. Both Umbelopsis genomes assembled into fewer than 200 scaffolds whereas Mucor assemblies were significantly more fragmented (Table 1).
Genome completeness was verified using single copy fungal orthologous genes searched by BUSCO (Simão et al., 2015).

Repetitive Elements
The largest group of TEs found in the genomes of Mucorales, with more than 200 copies per genome are class II elements belonging to Tc1/Mariner superfamily (Supplementary Table 1) but other DNA-repeats were also ubiquitous. EnSpm, PIF/Harbinger, hAT-Ac MuLE, Merlin, PiggyBac were present from one up to a hundred copies in all sequenced Mucorales. EnSpm, PIF/Harbinger, hAT-Ac and MuLE are ubiquitous in fungi in general, whereas Merlin elements are characteristic for basal fungal lineages and were apparently lost in Dikarya (Muszewska et al., 2017b). Unlike Mucorales, Umbelopsidales genomes had just a few copies of Tc1/Mariner, hAT-Ac, and a single Ginger element. Class I retrotransposon landscape is dominated by LTR retrotransposons from Ty3/Gypsy and LINE/L1 elements which were found in all genomes except for U. vinacea which lacks the L1s. Surprisingly the omnipresent Ty1/Copia elements were identified only in the genome of M. plumbeus (two copies). Also, all sequenced species had LTR/DIRS elements, characteristic for their tyrosinase integrase and are absent from Dikarya (Muszewska et al., 2013). Mucor genomes harbor also numerous Helitrons, known for their rolling circle replication mechanism. The presence of Helitrons has been observed in other Mucor species previously (Lebreton et al., 2020). Noteworthy, Helitrons often hijack neighboring genes and are efficient vectors of HGT in other fungi (Castanera et al., 2014). Overall, Umbelopsis genomes contained ten-fold less transposable elements per genome compared to Mucorales, which is expected taking into account the differences in genome size.

RNAi and Other Defense Mechanisms
Fungal genomes are usually protected via diverse mechanisms that include fungal repeat-induced point mutation (RIP), methylation induced premeiotically (MIP), meiotic silencing of unpaired DNA (MSUD) and quelling. It has been described that Mucormycotina possesses RNAi pathway components (Cervantes et al., 2013) which are also important for fungus biology (Nicolás et al., 2015;Calo et al., 2017). We scanned the sequenced genomes with HMM profiles of RNAi core enzymes, namely Dicer, Argonaute and RdRP, and demonstrate that all are present in all six fungal genomes, with duplication of RdRP in all Mucorales (from three to five copies per genome) and a single copy in both Umbelopsis isolates (Supplementary Table 2). U. vinacea, M. circinatus, M. plumbeus harbored duplicated Argonaute proteins. Self-non-self and the fungal immune system based on Nucleotide Oligomerization Domain (NOD)-like receptors (NLRs) could be identified neither in the six genomes described in this work nor in 14 predicted Mucormycotina proteomes available at NCBI. The typical NLR central domains NACHT or NB-ARC seem to be absent from those genomes.

Detection of Paenibacillus Bacteria in Thamnidium Genome
Initial Thamnidium assembly contained genome fragments characterized by two clearly distinct GC content ratios and for that reason was additionally analyzed as a metagenome. It is composed of two easily distinguishable fractions, one belonging to a presumed fungal host and the latter to an associated bacterium representing Paenibacillus, Firmicutes. Fungal and bacterial genomes were re-assembled separately. Remaining five Mucoromycotina assemblies did not contain significant amounts of sequences with sequence similarity to non-Mucoromycotina taxa.

Paenibacillus Features
The bacterial genome is moderately complete (BUSCO score 85.7%) and encodes 5815 genes. Its closest relative, Paenibacillus sp. 7523-1, despite DDH similarity above 70% threshold for two out of three DDH calculation formulas implemented in TYGS (67.2;88.3;72.9), differs in GC content by 2.35% which supports the separation of the newly sequenced strain as a new species (Supplementary Figure 1 and Supplementary Table 11). A phylogenetic tree inferred from 16S RNA of related isolates shows its proximity to P. illinoensis isolates despite differences in GC content (Figure 1). GC content of the identified Paenibacillus genome could be altered by misplaced fungal reads. Despite discarding all reads mapping on eukaryote genomes when assembling the Paenibacillus genome it is still possible that some fungal fragments, not similar to other eukaryotic sequences, could not be filtered out.
The bacterial genome has only one complete antimicrobial locus, a rifamycin-inactivating phosphotransferase RphD, identified by scanning using ABRicate. We found a partial beta-lactam resistance pathway lacking the central betalactamase (four copies of beta-N-acetylhexosaminidase, BlaI family transcriptional regulator, oligopeptide transport system ATP-binding protein from oppA to oppF, penicillin-binding protein 1A and 2A). Additionally, the Peanibacillus genome encodes a single M56 peptidase and 15 proteins bearing an S12 domain. Family M56 includes BlaR1 which is the antirepressor in beta-lactamase operon (Zhang and Chambers, 2004). Family S12 groups carboxypeptidases B vital for cell wall synthesis and remodeling, as well as beta-lactamases (Tang, 2018). Taken together, all these traces point at a potential beta-lactam resistance of Paenibacillus. Glycopeptide antibiotic resistance may also be present in some form since partial vancomycin resistance operon was also identified, once more only regulation and accessory components are preserved. The possible activity of these proteins and their relevance for resistance is not known.
The Paenibacillus genome harbors vitamin biosynthesis pathways, e.g., cobalamin (B12), riboflavin (B2), menaquinone (K2), and thiamine (B1). It also has all the necessary genes for molybdenum cofactor synthesis and manipulation. Paenibacillus has many quorum sensing genes and those coding for flagellar and mobility proteins. Its metabolic potential is described below in parallel with its fungal host and remaining fungal genomes.
The Paenibacillus genome encodes 3 copies of Peptidase_U32 (PF01136), a collagenase that may facilitate meat degradation by Thamnidium elegans which is one of the few fungi known for this property (Dashdorj et al., 2016). This protein is found mostly in Firmicutes and is absent from eukaryotic genomes. The U32 collagenases are considered as virulence factors in animalinfecting bacteria (Navais et al., 2014). The U32 collagenase (PrtC) together with urease subunit alpha (UreB) are parts of the Helicobacter pylori arsenal used in epithelial cell invasion (Kavermann et al., 2003). Paenibacillus sp. genome harbors all three urease subunits (alpha, beta and gamma). Moreover, we identified genes coding other urea processing enzymes including: cyanuric acid amidohydrolase, biuret amidohydrolase (BiuH), urea carboxylase and two copies of allophanate hydrolase (AtzF). The Paenibacillus genome encodes also one copy of ulilysin Peptidase_M43 (PF05572) with possible gelatinase function (Tallant et al., 2007), and 33 copies of Peptidase_M23 (PF01551), which includes mostly bacterial peptidoglycan hydrolases but also prokaryotic collagenases (Sasagawa et al., 1995). Genes Coding for CAZymes, Proteases and Transport-Related Proteins

Proteases
The overall profile of encoded proteases is very similar in all sequenced fungi (Figure 2), with high numbers of encoded pepsin-like A01A peptidases, FtsH-like M41 peptidases, M48 peptidases active on di-and tripeptides, proteasome peptidases T1, lysosomal peptidases C26, and ubiquitin-specific proteases C19. These proteases, except for pepsin, contribute to intracellular protein turnover and regulation. Pepsin and subtilisin proteases are found in high copy numbers in all of the genomes pointing at a high degradation potential of these saprotrophic organisms. In all analyzed fungal genomes, we found an expansion of the C44 family. Proteins from this family are homologs of glutamine-fructose-6-phosphate transaminase (GFPT) and its precursor. GFTP is known to control the flux of glucose into the hexosamine pathway which plays a crucial role in the regulation of chitin synthesis in fungi (Maia, 1994). There is a two-fold expansion of S8A serine proteases in Mucorales when compared to Umbelopsidales (Supplementary Table 3). The same observation applies to family I4 which groups inhibitors of S8 peptidases. Umbelopsidales possess several families of cysteine peptidases (C15, C40, C110) and metalloproteases (M14A and M20A) absent from Mucorales genomes. The metalloproteases are likely involved in protein degradation whereas the cysteine peptidases have no obvious function in fungi.
Among sequenced Mucorales, the average number of proteincoding genes per peptidase family is elevated, which could be a consequence of the whole genome duplication (WGD) described in Mucor and Phycomyces by Corrochano and coworkers (Corrochano et al., 2016). However, BUSCO results show a low level of duplicated genes in all of the isolates. The genome of Thamnidium stands out as it encodes fewer proteases than remaining Mucorales. However, it may benefit from enzymes provided by its endohyphal bacterium which has genes encoding meat crumbling collagenase U32 and plenty of typically bacterial proteases representing families A36, A25, S15, S55, S66, S51, M29, and U57.
NagA α-N-acetylgalactosaminidase, NagZ β-Nacetylhexosaminidase (GH109) involved in peptidoglycan recycling occur in multiple copies not only in Paenibacillus genome but also in sequenced fungal genomes. It is worth noting that Peanibacillus has three times more copies (30) of Nag genes than studied fungi. No proteins from this family had been characterized in Eukaryota.

Metabolic Clusters, Secondary Metabolites, and Cofactors
The genome of Paenibacillus harbors NRPS, terpene, bacteriocin, T3PKS and S-layer-glycan producing clusters. Mucoromycotina were long considered devoid of secondary metabolite clusters. A review by Voigt et al. (2016) showed genetic determinants for natural product synthesis present in all analyzed genomes. Our newly sequenced genomes encode between 3 and 8 secondary metabolite clusters (according to AntiSMASH scans) belonging to different classes. Interestingly, all of the genomes encode terpene clusters (Supplementary Table 6) which potentially could produce new natural products like those isolated from Mortierella (Baldeweg et al., 2019).
Umbelopsidales additionally have a trans-2,3-dihydro-3hydroxyanthranilate isomerase (PhzF) involved in phenazine biosynthesis with yet unknown biological product in fungi (Blankenfeldt et al., 2004). This gene is also present in other fungi and in Endogonales but seems to be missing in Mucorales. Umbelopsidales produce also a citronellol/citronellal dehydrogenase which converts citronellol to citronellic acid an odorous compound with antimicrobial properties. All four Mucorales isolates have a single gene coding for salicylate hydroxylase which is involved in plant host manipulation by Epichloë (Bastias et al., 2017) and other fungi.

Sex Locus
Sequenced isolates belong to heterothallic genera (Lee and Heitman, 2014) and genomic screening shows a single sex locus per genome. Generally, these loci contain a single high mobility group (HMG)-domain transcription factor gene (sexP or sexM), flanked by genes for an RNA helicase (rnhA), a triosephosphate transporter (tptA) and an alginate lyase (agl) (Lee and Idnurm, 2017).
The sex locus of T. elegans and M. plumbeus is organized like in other Mucorales with genes in the following order agl/tptA/sex/rnhA (Lee and Idnurm, 2017). M. saturninus lacks the tptA gene and has a sex locus architecture (agl/sexP/rnhA) like M. mucedo (Wetzel et al., 2012). M. circinatus has remains of an integrase instead of the sex gene between tptA and rnhA genes (agl/tptA/rve/rnhA). Mobile element insertions in the sex locus have already been documented in Phycomyces blakesleeanus (Idnurm et al., 2008).
The rnhA gene seems to be excluded from the sex locus in both Umbelopsidales genomes, in which the sex locus is organized like in Mucorales but followed by a gene with an additional protein of unknown function belonging to DUF2405 (PF09435) family (agl/tptA/sexP/DUF2405). Schulz et al. (2016) described a similar architecture with the DUF2405 for Umbelopsis ramaniana from JGI database.

Carbon Assimilation Profiles
Carbon assimilation profiles obtained for six Mucoromycotina strains by screening on Biolog FF microplates are summarized in Supplementary Table 7. None of the analyzed strains was able to use a full set of 95 tested carbon sources. Each strain was able to grow on 40 to 70 different substrates and had a unique carbon assimilation profile (Figure 4) with T. elegans being the most versatile degrader (Figure 5).  Umbelopsis spp. were more efficient in the utilization of carbohydrates, while Mucorales representatives showed the fastest growth rate on amino acids. Umbelopsis fungi were able to utilize Adonitol, d-Galacturonic Acid, Maltitol, β-Methyl-D-Glucoside and d-Raffinose whereas none of the four Mucorales grew on these substrates. Additionally, similarly to Mortierella elongata AG77 (Uehling et al., 2017), they represented elevated growth rate on other carbohydrates, like d-Galactose, d-Mannose, l-Arabinose, l-Rhamnose, d-Trehalose and lipid (Tween 80), which is consistent with predicted gene models (see the chapter on CaZymes -the presence of rhamnosidase GH78, α-mannosidase GH92, exo-α-L-1,5-arabinanase GH93, α-L-fucosidase GH95). All four Mucor species can utilize D-Malic Acid and L-Malic Acid while these substrates seem inaccessible for Umbelopsis spp., which have only 3 copies of lactate dehydrogenase whereas Mucor spp. have from four to six enzymes from this family. Only T. elegans was able to utilize m-Inositol, Sedoheptulosan, β-Hydroxy-butyric Acid, p-Hydroxyphenyl-acetic Acid and L-Threonine. Previous reports showed efficient lactose assimilation by T. elegans what was not replicated in this study (Vamvakaki et al., 2010)Additionally, Thamnidium is distinguished by very efficient development on carboxylic acids. This ability may be explained by the presence of endobacteria whose proteome harbors representatives of M14 and M20 carboxypeptidase families. M14A is present uniquely in both Umbelopsis genomes while M14C occurs only in Paenibacillus. Also, peptidase T (M20B) is present exclusively in Paenibacillus, whereas fungal genomes have several copies of the remaining M20 subfamilies. Some of the analyzed compounds are available only to Paenibacillus, based on genomic evidence, e.g., 3-oxoacid CoA-transferase missing from sequenced Mucorales is present in Dikarya and bacteria, including Paenibacillus.

Fungal Lipids
For all six Mucorales, we determined the composition of sterols, fatty acids and phospholipids (Supplementary Tables 8-10). The presence of ergosterol, an important component of fungal plasma membranes, contributing to their stability, was confirmed in all 6 fungal biomasses.
Using LC-MS/MS we determined the composition of phospholipids (PLs) present in the biomass of analyzed fungi (Figure 7, Supplementary Table 9). We identified 84 PLs species belonging to 6 classes: phosphatidic acid (PA), phosphatidylcholine (PC), phosphatidylethanolamine (PE), phosphatidylglycerol (PG), phosphatidylinositol (PI), and phosphatidylserine (PS). Sixty-two of them were chosen for subsequent quantitative assessments. This analysis of the PLs for selected strains revealed that PC and PE were predominant for selected Mucoromyotina and constituted up to 56% and 41% of the total cell PLs, respectively. PA, a lipid signal which usually constitutes a minute portion of PLs, in T. elegans and M. saturninus was found at 8% level. According to several reports, the increased levels of PA in living cells are a consequence of biotic and abiotic stress (Darwish et al., 2009;Bernat et al., 2014). The open question remains whether T. elegans and M. saturninus experienced stressing conditions during colony growth in the lab, or the observed increased level of PA is native for them as an adaptation to grow on animal substrate.
An important parameter describing the physical characteristics of biological membranes is PC/PE ratio. PC has FIGURE 7 | Composition of phospholipids in the analyzed strains.
Frontiers in Microbiology | www.frontiersin.org a bigger head group than PE. Tighter packing of PE and its acyl chains negatively influences fluidity of membranes as opposed to PC (Renne and de Kroon, 2018). PC can be synthesized from PE, so they are closely related, and the balance between PE and PC is crucial for maintaining the physiological structure of the cell membrane. Among selected species, T. elegans showed the lowest PC/PE ratio (0.79), which may indicate a relatively greater stiffening of the membrane compared to other strains, which is consistent with lowered fatty acids unsaturation index.
Other lipids which are important for the fungal metabolism are acylglycerols, triacylglycerols (TAGs) and diacylglycerols (DAGs) (Figure 8 and Supplementary Table 10). Yeasts store lipids mainly in the form of TAGs, and to less extent, DAGs (Lastovetsky et al., 2016). Triacyglycerols have been revealed, by far, as the most abundant lipid compound of Mucoromycotina (Fakas et al., 2007)Neutral lipids and triacylglycerols in particular constitute the main lipid fraction in U. isabellina and Cunninghamella echinulata (Fakas et al., 2007;Gardeli et al., 2017). In all six fungal strains, both acylglycerols were found. All analyzed strains accumulated more TAGs than DAGs but in T. elegans this ratio was significantly shifted toward 3:1 TAG to DAG, in contrast to the 5:1 in M. plumbeus or even 99:1 in M. saturninus. Since TAG synthesis in fungi requires PA and DAG, both present in T. elegans, observed DAG/TAG ratio might be a consequence of inhibition of acyl-CoA-dependent diacylglycerol acyl-transferase (DGA1) (Markgraf et al., 2014). Moreover, it seems that the acylglycerols were less enriched in polyunsaturated fatty acids, especially 18:3, than phospholipids. Unsaturation index of fatty acids for phospholipid class was higher than for total and neutral lipids for different Mucoromycotina species (C. echinulata, T. elegans, U. isabellina, Mucor sp.) tested by others researchers (Fakas et al., 2007;Vamvakaki et al., 2010;Gardeli et al., 2017)with γ-linolenic acid found in higher quantities in the PLs class.
Lastovetsky and co-workers (Lastovetsky et al., 2016) reported that PE/TAG ratio plays a crucial role in establishing and maintaining symbiosis of the fungus Rhizopus microsporus (Mucoromycotina) and its Mycetohabitans endobacteria. PE/TAG ratio close to 1:1 is characteristic for symbiosis and departure from that balance might shift the interaction toward antagonism. Interestingly, it was observed that among six strains, the PE/TAG ratio for T. elegans was closer to 1:1 (2.21) compared to the average (6.4) for all tested fungi. Analyzed strains all contain several copies of diacylglycerol kinase DGK genes (3-5) which is deemed responsible for maintaining the balance between TAG and PE.
The genome of Paenibacillus sp. encodes processive diacylglycerol beta-glucosyltransferase required for the synthesis of beta-diglucosyl-DAG -a predominant glycolipid found in Bacillales (Jorasch et al., 1998), as well as the Ugp snglycerol-3-phosphate transport system which transports glycerol-3-phosphate, essential for phospholipid biosynthesis.

Cell-Wall Carbohydrates
A quantitative analysis of the cell wall carbohydrates revealed the presence of high amounts of glucosamine and fucose, and low amounts of mannose, galactose, and glucose compared to an ascomycetous fungus Trichoderma reesei TU-6 (Figure 9). Glucosamine content in their cell wall was up to 10 fold higher compared to Trichoderma. High glucosamine fraction can be a highlight of the chitin-chitosan cell wall typical for Mucoromycotina (Bartnicki-Garcia, 1968).
Chitin together with glucans participates in the rigidity of the cell wall. The total content of glucosamine (representing chitin/chitosan fraction) and glucose (representing glucan fraction) in four Mucorales: M. plumbeus, M. saturninus,  M. circinatus and T. elegans was above 80% of the cell wall carbohydrates. The amount of these two carbohydrates in Umbelopsis strains was lower and reached about 66 and 71% in U. isabelina and U. vinacea, respectively. Trichoderma had only 55% of these sugars in the cell wall and, in addition, glucose was the dominant one.
Melida and coworkers (Mélida et al., 2015) reported that genes encoding chitin biosynthesis (CHS) and its modification (CDA) were present in more copies in Phycomyces blakesleeanus and Rhizopus oryzae, compared to Neurospora crassa (Ascomycota), as well as fewer genes encoding glucan synthases. These results were confirmed in other studies on Mucorales (Lecointe et al., 2019) and our results show that respective CaZymes are abundant both in sequenced Umbelopsidales and Mucorales representatives. This could explain the possibility of synthesizing a huge amount of chitin in Mucoromycotina strains compared to Trichoderma.
Furthermore, all Mucoromycotina contain fucose, a carbohydrate characteristic for this subphylum, especially in such a high amount. Our study revealed that two Umbelopsis strains had from twice to three-fold more fucose when compared to M. circinatus and M. saturninus, respectively. Presence of fucan in the cell wall was previously reported for P. blakesleeanus and R. oryzae (Mélida et al., 2015). During the synthesis of fucan, fucose is transferred from GDP-fucose to polysaccharides by α-fucosyltransferase and this enzyme was detected in the membrane fraction of M. circinelloides and partially characterized (Lecointe et al., 2019).
The α-fucosyltransferase encoding genes were found in 2 and 4 copies in the genomes of P. blakesleeanus and R. oryzae, respectively, and they were not detected in N. crassa, which is in accordance with the fact that no fucose was detected in the cell wall of Neurospora (Mélida et al., 2015) and Trichoderma (this study). Differences in the copy number of α-fucosyltransferase encoding genes do not explain the significant differences in the amount of fucose in the cell wall of Mucor spp. (4-6 copies) and Umbelopsis spp. (3-4 copies). All six genomes have 1-3 copies of GDP-L-fucose synthase.
Thamnidium elegans despite the associated bacteria shows a cell wall carbohydrate profile similar to remaining Mucorales and other Mucoromycotina. This suggests that the interaction with a bacterial partner alters the metabolism, cell membrane composition but not the exoskeleton of the fungus.
In the present study, we aimed to point several differences between Umbelopsidales and Mucorales taxa. Besides limited genome size, Umbelopsis fungi have been demonstrated to be often associated with bacteria and to produce unsaturated fatty acids (Fakas et al., 2009). Our results build on top of these observations. Sequenced Umbelopsis taxa have compact genomes, almost devoid of repetitive sequences, and contain a moderate number of genes. They have a relatively high number of metabolism-related enzymes, especially glycohydrolases compared to Mucorales, but fewer peptidases. Moreover, we showed that all analyzed Mucoromycotina display comparable enzymatic capabilities tested on Biolog FF microplates and predicted from genomic data regardless of their genome size. Mucorales and Umbelopsidales representatives are mostly soil-inhabiting saprotrophs and require a broad spectrum of secreted enzymes which is reflected in high count of encoded peptidases, glycohydrolases and transporters. Nevertheless, they differ in the ability to use particular carbon sources, produce specific proteases, and have different ratios of different classes of phospholipids.
On one hand, Umbelopsis spp. in general have fewer secreted peptidases like pepsins and sublitisins than Mucorales. On the other, they encode several families of metalloproteases and cysteine proteases with unknown functions which are absent from Mucorales. The compact genomes of Umbelopsidales are richer in glycohydrolases and CaZymes. In consequence, they have a tendency to use carbohydrates efficiently and grow fast on these carbon sources. When compared to Mucorales, they produce higher amounts of 18:1 fatty acid and have more fucose in their cell wall. However, the meaning of these findings remains to be understood. Despite sharing an ecological niche, these two orders differ in genome size, associated bacteria and degrading capabilities but, nonetheless, share clear synapomorphic traits. Our experimental results support the validity of 18:3 lipids as a chemotaxonomic marker of Mucoromycotina and fucose as a specific component of their cell wall (Mélida et al., 2015;Lecointe et al., 2019).
Previous genomic studies covered diverse Mucorales representatives (Ma et al., 2009;Tang et al., 2015;Lebreton et al., 2020) whereas Umbelopsidales genomes were published in the form of brief genomic reports (Takeda et al., 2014) without a description of the genomic content. In this study, we aimed to fill this knowledge gap by bringing together genomic analyses with phenotype and biochemical studies, especially in the context of how these fungi function in their environment. Vast carbohydrate related enzyme repertoire observed in Umbelopsidales can be related to their ecology. According to the GlobalFungi database (Větrovský et al., 2020) the representatives of the genus Umbelopsis are present in 6155 out of 20009 global amplicon samples (ca 30%). They are detected mainly in Europe, North America, and Australia, most often in soil, root, and shoots probes from forest or grassland biomes. The representatives of this genus are well-known late wood colonizers, that probably feed on the substrates which were decomposed by other organisms able to degrade complex substrates like cellulose (Richardson, 2009). However, Umbelopsis representatives are also often isolated from living plant material or forest soil (Sheng et al., 2019) and are considered to be plant growth-promoting organisms and root endophytes (Tejesvi et al., 2013;Huang et al., 2015). They can alter plant metabolism leading to the enhanced production of complex metabolites which are not produced without the endophyte (Qin et al., 2018). U. isabellina and U. vinacea described in this study also encoded metabolic clusters including terpenoid clusters.
The representatives of Umbelopsidales were also shown to represent relatively high resistance to some heavy metals like Zn, Mn, Ni or Pb (Janicki et al., 2018) and xenobiotics, such as herbicides . The resistance may be correlated with the presence of numerous ABC transporters encoded in all analyzed genomes. Moreover, these genomes have a single arsenite resistance protein homologous to Absidia repens BCR42DRAFT_142507 and consisting of ARS2 (PF04959), DUF4187 (PF13821), RRM_1 (PF00076), SERRATE_Ars2_N (PF12066). Such domain architecture is conserved from chytrids to Entomophtoromycotina and Glomeromycotina. Interestingly, the genomes had neither Dikarya-type nor animal metallothioneins.
Umbelopsidales have also been detected in the deep-sea sediments from Magellan seamounts constituting 3.8% of all OTUs (Yang et al., 2020). Although this finding is surprising as Umbelopsis representatives are well known terrestrial organisms, their presence may be explained by association with plant material. Interestingly, 0.85% of all amplicon samples in which Umbelopsis spp. were detected according to GlobalFungi database (Větrovský et al., 2020), are also originating from marine biome. The understanding of this pattern needs further research.
Some representatives of Umbelopsidales are recently considered as effective single cell oils (SCOs) producers as they are capable of producing high amounts of lipids (75% to 84% in dry cell weight; w/w), including polyunsaturated fatty acids (PUFAs). These substances of high dietary and pharmaceutical importance are also considered as precursors for the synthesis of lipid-based biofuels. Although several Mucoromycota representatives were reported to synthesize PUFAs, U. isabellina cultivated on glucose has presented exceptionally high lipid production (comparable to the highest values achieved for genetically engineered SCO-producing bacterial strains) (Papanikolaou and Aggelis, 2019). In opposition to previous experiments (Gardeli et al., 2017), in our study the U. isabelina strain WA67209 growth on glucose was not more efficient than on xylose, and it was extensively assimilating neither glycerol, sucrose nor xylitol. Carbohydrate metabolism is closely related to metabolism of lipids (Dubois-Brissonnet et al., 2016) and lipid production depends on growth phase, their remodeling and their interplay with the synthesis of cellular polysaccharides. However, analyzed strains displayed a typical composition of PUFAs with a predominance of oleic acid (18:1), and higher levels of gamma-linolenic acid (18:3) in Mucorales compared to Umbelopsidales. Surprisingly, Thamnidium mycelium showed a relatively high saturation index possibly due to interaction with Paenibacillus. Bacteria produce more saturated fatty acids during biofilm formation yet it remains to be elucidated if the bacteria-fungus interaction has a similar effect on fatty acid composition in both partners (Dubois-Brissonnet et al., 2016). Thamnidium genome contains a high number of lipid metabolism-related genes and there is no clear explanation for the high level of fatty acid saturation. This phenotype was particularly unexpected since T. elegans has been used for biotechnological production of gamma-linoleic acid on diverse substrates especially in low temperature (Stredansky et al., 2000;Liu and Jin, 2008;Zikou et al., 2013). Similarly, the carbohydrate assimilation profiles of the analyzed strain WA18081 differed from other studies on this species Pawłowska et al., 2019). It may be hypothesized that an extended set of lipid-processing enzymes in Thamnidium is required in order to balance the bacterial impact on lipid homeostasis and shift it back toward fungal characteristics.
Interestingly, several representatives of Umbelopsidales have recently been shown to be colonized by EHB from Burkholderiaceae (Okrasińska et al., 2021). Moreover, the metagenomic analysis of another oleaginous fungus -Mortierella elongata and its endosymbiont Mycoavidus cysteinexigens showed that bacteria alters the metabolism of the fatty acids of the host. Endosymbiont was shown not only to cause declines in the storage of carbohydrates, organic acids and nitrogenous metabolites but also to be involved in the catabolism of fungal fatty acids and changes volatile compounds profiles of the fungus (Uehling et al., 2017). Among six sequenced Mucoromycotina, we found numerous bacterial reads which assembled into a complete genome only in Thamnidium elegans. The presence of an associated bacteria is reflected in, among else, DAG/TAG lipids composition and utilization of carbohydrates which are accessible exclusively to Dikarya and bacteria. This phenomenon together with a limited set of carbohydrate-processing enzymes present in Thamnidium excludes the possibility of bacterial contamination and supports the hypothesis of intimate interaction with detected Paenibacillus. Further studies are needed to elucidate the molecular basis of the interaction and identify the predisposing features of the bacterial and fungal partners.
Most of the initial reports on intracellular fungal bacteria were based on microscopic observation of uncultivable bacteria inside fungal hyphae (Macdonald and Chandler, 1981). Nowadays, endohyphal bacteria (EHB) can be efficiently identified using genome sequencing methods. The traces of bacterial presence have been found in other published genomes of basal fungi (Naranjo Ortiz and Others, 2019). Identification of EHB can expand our scarce knowledge on the frequency and host range of fungal-bacteria interactions. Our finding of Paenibacillus sp. associated with Thamnidium elegans is in line with this trend. Diverse Bacillus bacteria were found living with truffles (Barbieri et al., 2005;Perlińska-Lenart et al., 2020). Paenibacillus has been reported from Laccaria bicolor (Bertaux et al., 2003), Sebacina vermifera (Sharma et al., 2008), and is known to produce pre-symbiotic and symbiotic interactions with Glomus (Bidondo et al., 2011).
The genus Paenibacillus was erected from Bacillus by Ash et al. (1993); it belongs to the family Paenibacillaceae and comprises 253 highly variable species 1 . The representatives of this genus were isolated from a wide range of sources of plant and animal origin. Paenibacillus tend to occupy a similar niche to Mucoromycotina molds as they inhabit soil and dung. The best-studied species -P. larvae is known to cause lethal disease of honeybees. However, some other species are known for their plant growth promotion capacities (via siderophores or phytohormones synthesis), others produce a variety of 1 https://lpsn.dsmz.de/genus/paenibacillus antimicrobials and insecticides. Bacteria from this genus were also shown to produce a plethora of enzymes, like amylases, cellulases, hemicellulases, lipases, pectinases, oxygenases or dehydrogenases (Grady et al., 2016). The proteome of P. larvae includes a wide range of virulence factors including proteases and toxins (Erban et al., 2019). Paenibacillus validus stimulated the growth of Glomus intraradices (Hildebrandt et al., 2006) and P. vortex facilitates dispersal of Aspergillus fumigatus (Ingham et al., 2011). Although the representatives of Paenibacillus are known to promote plant growth and secrete several antimicrobial compounds, the endohyphal strain from our study did not encode any antibiotic compounds except for rifamycininactivating phosphotransferase. Rather, it provided multiple enzymes that significantly expanded the digestive capabilities of Thamnidium while reducing its genome size. Observed genome shrinking cannot be explained solely by the fragmented assembly of Thamnidium (its incompleteness is estimated at approximately 5%) because the differences in CaZyme and peptidase abundance from remaining isolates are far greater.
One of the major differences between Thamnidium and the remaining isolates is in the lipid composition potentially contributing to cell membrane stiffness. EHB are known to influence host lipid production and bias the TAG/PE ratio (Lastovetsky et al., 2016). Thamnidium had the lowest TAG/PE ratio among tested isolates which might be a sign of symbiotic interaction with Paenibacillus, yet the noted ratio was still far from the 1:1 "symbiotic equilibrium" described in Lastovetsky's report (Lastovetsky et al., 2016). It is not known whether the TAG/PE values estimated for symbiosis between Rhizopus microsporus and Mycetohabitans are valid also for other models. It is an open question of how intimate and stable is the interaction between T. elegans and Paenibacillus sp. The altered traits of Thamnidium elegans compared to remaining isolates regardless of their evolutionary distances could be explained by the presence of an associated bacterium classified to Paenibacillus. In contrast to lipids, the cell wall carbohydrate composition of T. elegans remained unchanged. What we observed is that Thamnidium has several metabolic parameters altered but its morphology remained unchanged compared to strains without detectable bacterial partners.
The ancestors of extant Mucoromycotina were present among the first land colonizers and had the ability to access decomposing material. Genome sequencing and phenotyping of Mucorales and Umbelopsidales enabled us to look at the differences of these two old lineages within Mucoromycotina. There are several differences, particularly in their carbon source preferences and encoded carbohydrate repertoire, which hints at subtle niche differentiation. Importantly, predicted digestive capabilities are in line with experimental validation. Early diverging Mucoromycotina representatives possess features characteristic of fungi including ergosterol present in the membranes and a cell wall made of chitin and beta-glucan. Additionally, all studied Mucoromycotina representatives produce 18:3 gamma-linoleic acid and encrust their cell wall with fucose, both of which traits can be a handy discriminant for marking their presence in environmental samples.

Isolates
Six non-pathogenic, common and soil-borne representatives of Mucorales and Umbelopsidales were chosen for sequencing. Two of them represented the Umbelopsis genus, isolated from forest soil in Warsaw (Poland). Other two taxa (i.e., M. circinatus and M. plumbeus) were also soil-derived isolates from both Americas but representing Mucorales order. Finally, two Mucorales representatives that are well known for their proteolytic activity (i.e., Mucor saturninus and Thamnidium elegans) were also selected ( Table 2). Related organisms were tested for biotechnological usage and reference genomic information for some of them (e.g., U. isabellina) was already available. The species-level identification of all isolates was confirmed by sequencing of ITS rDNA fragments (according to the protocol proposed by Walther and co-workers (Walther et al., 2013)) prior to genome sequencing.

Phenotypic Microarray Plates
FF phenotypic microarray plates (Biolog Inc., United States) were used to test the capacity of 6 strains to grow on 95 different carbon sources. Carbon sources were grouped into guilds according to Preston-Mafham et al. (2002). All fungal strains were cultured on Potato Dextrose Agar for 7 days and further their swabbed spores were suspended in FF inoculation fluid (deficient amount of carbon sources) to produce a final optical density of 0.036 A at 590 nm. Spores' suspensions were then inoculated on FF microplates and incubated in the aerobic SpectrostarNano universal plate reader (BMR Labtech, Germany) for 96 h at 20 • C. The analysis of each strain was done in three replicates. The metabolic activity was measured kinetically by determining the colorimetric reduction of a tetrazolium dye. Colorimetric values for wells containing carbon substrates were blanked against the control well. The result was considered positive when a difference between the metabolic activities of the first and last day of incubation was observed in all three repetitions. The mean values and standard deviations of AUC (area under the curve) were calculated for each strain and each guild of carbon sources. The metabolic activity of each species on a particular substrate was represented as a heatmap of log-transformed mean AUC values. All analyses were performed in RStudio using packages ggplot2 and vegan v2.4.2).

Lipids Analysis
Extraction Lipids from fungal cultures of the stationary phase of growth were extracted according to the method proposed by Folch et al. (1957) with some modifications. The fungal biomass was filtered and 0.1 mg was transferred into Eppendorf tubes containing glass beads, 0.66 mL of chloroform and 0.33 mL of methanol. The homogenization process using a ball mill (FastPrep) was carried out for 1 min. The mixture was extracted for 2 min. In order to facilitate the separation of two layers, 0.2 mL of 0.9 % saline was added. The lower layer was collected and evaporated.

Phospholipid Determination
The polar lipids were measured using an Agilent 1200 HPLC system (Santa Clara, CA, United States) and a 4500 Q-TRAP mass spectrometer (Sciex, Framingham, MA, United States) with an ESI source. For the reversed-phase chromatographic analysis, 10 µL of the lipid extract was injected onto a Kinetex C18 column (50 mm × 2.1 mm, particle size: 5 µm; Phenomenex, Torrance, CA, United States). The mobile phase consisted of 5-mM ammonium formate in water (A) and 5-mM ammonium formate in methanol (B). The solvent gradient was initiated at 70% B, increased to 95% B over 1.25 min, and maintained at 95% B for 6 min before returning to the initial solvent composition over 3 min. The column temperature was maintained at 40 • C, and the flow rate was 500 µL min −1 . The instrumental settings of mass spectrometer were as follows: spray voltage -4500 V, curtain gas (CUR) 25, nebulizer gas (GS1) 50, turbo gas (GS2) 60, and ion source temperature of 600 • C. The data analysis was performed with the Analyst TM v1.6.2 software (Sciex, Framingham, MA, United States). Two approaches were applied to identify PLs: targeted and untargeted. The untargeted approach was performed with the precursor ion scanning (precursor for m/z 153) survey scan, triggering the EPI experiments. On the basis of the untargeted analysis, a comprehensive list of the multiple reaction monitoring (MRM) transitions was generated.

Acylglycerols
Diacylglycerols and TAGs analysis was undertaken by liquid chromatography coupled to mass spectrometry (LC-MS) with electrospray ionization (ESI) on an QTRAP 4500 (Sciex). A Kinetex C18 column (see phospholipids determination) and mobile phases consisting of water (A), a mixture of acetonitrile:isopropanol (5:2, v/v) (B) and 5 mM ammonium formate with 0.1 % formic acid were used. The solvent gradient was initiated at 35% B, increased to 100% B over 4 min, and maintained at 100% B for 11 min before returning to the initial solvent composition over 2 min. The column temperature was maintained at 40 • C, and the flow rate was 600 µL min −1 . The QTRAP 4500 was operated with positive ionization at an electrospray voltage of 5500 V and a targeted multiple reaction monitoring (MRM) approach containing transitions for known precursor/product mass-to-charge ratio. Under these conditions, the TAG ionize as ammonium adducts.

Fatty Acid Analysis
A lipid sample was diluted in 1.5 mL of methanol and transferred to a screw-capped glass test tube. To the lipid solution, 0.2 mL of toluene and 0.3 mL of the 8.0% HCl solution were added (Ichihara and Fukubayashi, 2010). The tube was vortexed and then incubated at 45 • C overnight. After cooling to room temperature, 1 mL of hexane and 1 mL of water were added for the extraction of fatty acid methyl esters (FAMEs). The tube was vortexed, and then, 0.3 mL of the hexane layer was moved to the chromatographic vial. 1.6 µL of the extract samples were analyzed using gas chromatography.
A FAMEs analysis was performed with an Agilent Model 7890 gas chromatograph, equipped with a 5975C mass detector. The separation was carried out in the capillary column HP 5 MS methyl polysiloxane (30 m × 0.25 mm i.d. × 0.25 mm ft). The column temperature was maintained at 60 • C for 3 min, then increased to 212 • C at the rate of 6 • C min −1 , followed by an increase to 245 • C at the rate of 2 • C min −1 , and finally to 280 • C at the rate of 20 • C min −1 . The column temperature was maintained at 280 • C for 10 min. Helium was used as the carrier gas at the flow rate of 1 ml min −1 . The injection port temperature was 250 • C. The split injection was employed. Fungal fatty acids were identified by comparison with the retention times of the authentic standards (Sigma, Supelco) and the results were expressed as a percentage of the total amount of fatty acids.
Sterol analysis was undertaken using the QTRAP 3200 (Sciex) mass spectrometer connected to a 1200 series HPLC system. A Kinetex C18 column (see phospholipids determination) was used. The solvents were: water and methanol, both containing 5 mM ammonium formate. Analytes were eluted with the following gradient: 40% solvent B from 0 to 1 min, 100% solvent B from 1 to 4 min, 40% solvent B from 4.0 to 4.1 min, 40% solvent B from 4.1 to 6 min with the flow rate 0.8 ml min −1 . The QTRAP instrument was set to the positive ion mode, with the atmospheric pressure chemical ionization (APCI) temperature of 550 • C.

Cell Wall Preparation and Determination of Cell Wall Carbohydrates
Fungi were cultivated in PDB medium, washed with 10 mM Tris/HCl, pH 7.5, suspended in the same buffer, disintegrated with 0.5 mm glass beads in the presence of a protease inhibitor cocktail (Sigma-Aldrich) and centrifuged at 1500 × g for 10 min. The resulting pellet containing cell walls was washed with icecold 1 M NaCl until the disappearance of absorbance at 260-280 nm (Nemčovič and Farkaš, 2001).
The lyophilized cell wall was hydrolyzed o/n in 4 M trifluoroacetic acid (TFA) at 100 • C. After cooling on ice, samples were centrifuged at 17 000 × g for 5 min at 4 • C. The supernatant was dried under N2 and washed twice with pure methanol. After removing methanol with N2, the pellet was resuspended in Mili Q water and purified on a Millipore Filter Device (0.45 µm pores) by centrifugation at 16 000 × g for 4 min. Samples were stored at −20 • C. Monosaccharides were determined by highperformance anion-exchange chromatography using a Dionex ICS-3000 Ion Chromatography System with a Carbo Pac PA10 analytical column. Neutral sugars were eluted with 18 mM NaOH at 0.25 ml/min (Zdebska and Kościelak, 1999).

Culture Conditions and DNA Extraction
All fungal strains were cultured on 4% Potato Dextrose Agar for 7 days at 20 • C. Total genomic DNA was extracted from 30 mg of fresh mycelium following a CTAB-based chloroform extraction protocol (Doyle, 1991) and cleaned-up following caesium chloride density gradient centrifugation method (Garber and Yoder, 1983). DNA quality and concentration were estimated by 1% agarose gel electrophoresis and NanoDrop R (Thermo Fisher Scientific). Quantification of purified DNA was performed on a Qubit Fluorometer. The identity of all strains was confirmed by the preliminary sequencing of the internal transcribed spacer (ITS rDNA) region and with standard morphological identification procedures.

Sequencing
DNA was sequenced at the High Throughput Sequencing Facility of UNC, Chapel Hill, NC, United States.
Whole-genome sequencing was accomplished using a hybrid approach, combining Illumina short-read data with PacBio longread data.
Total cellular DNA was sheared using a Covaris E220 sonicator to achieve fragments with an average size of 500 bp. Then libraries were prepared using the Kapa Hyper kit. Libraries were size selected for insert fragments around 500 base pairs using Pippin Prep automatic DNA size selection system (Sage Science). Libraries were analyzed and quantified using a LabChip GX automated electrophoresis system (Caliper) and pooled. The pools were sequenced on Illumina MiSeq sequencer pairedend sequencing (2 × 300 cycles) to obtain longer reads and on Illumina HiSeq sequencer 2500 (2 × 150 cycles) to obtain more coverage.
For the PacBio RSII data, ten microgram aliquots of genomic DNA were sheared in a Covaris g-TUBE to a target fragment size of 20 kb using the shearing conditions provided in the Covaris g-TUBE user manual. The protocol for preparing a 20 kb library (Pacific Biosciences Procedure and Checklist-20 kb Template Preparation Using Blue Pippin TM Size-Selection system) was subsequently followed, using 5 µg of purified, sheared DNA as starting material. Template concentration was calculated using the Qubit fluorometer and the average size was determined by BioAnalyzer trace analysis and served as input to the Annealing and Binding Calculator v.2.1.0.2 (Pacific Biosciences) to prepare SMRTbell-template annealing and polymerase-template binding reactions, as well as the final dilution of the polymerase-bound template complex for sample plate loading and spike-in of control DNA. The PacBio reads were filtered to a minimum read length of 100 bp and a minimum read quality score of 0.85.
BlobTools2 (Laetsch and Blaxter, 2017) was used to partition the scaffolds, on the basis of read coverage, G/C content, and taxonomic affiliation. The bacterial and fungal reads were assembled separately.
Phylogenetic Analysis of Bacteria Found in the Genome of Thamnidium 16S rRNA sequence was extracted from the bacterial reads found in Thamnidium using blast. Then it was combined with publicly available 16S sequences of several species and strains of Paenibacillus and Bacillus (GB accession numbers can be found on Figure 1). Sequences were then aligned using MAFFT (Katoh and Standley, 2013) and trimmed using trimal-automated 1 (Capella-Gutiérrez et al., 2009). Then the best evolution model was detected using modeltest-ng across all evolutionary models (Darriba et al., 2020) (TrN+I+G4 model selected based on AIC, BIC and AICc criteria) and phylogenetic tree was calculated using raxml-ng (Kozlov et al., 2019) with 1000 bootstrap replicates. The tree was then rooted using four Bacillus sequences.

DATA AVAILABILITY STATEMENT
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: NCBI BioProject, accession no: PRJNA668042. Raw reads for all fungal strains are available in the SRA database under accession numbers SRR12875449-SRR12875464. Assemblies and annotations are deposited under accession numbers JAEPQZ000000000-JAEPRE000000000 (

AUTHOR CONTRIBUTIONS
AM and JP designed the study. AM performed assembly, annotation, and sequence analyses. JK, PB, AO, KSt, JP, and AM interpreted the data and drafted the manuscript. KSt, JP, and AM wrote the manuscript. EM and PM performed wholegenome sequencing. OD, UZ, and SP performed experimental procedures and prepared the samples. TA-P and KSz performed carbon source usage experiments. UP-L and JK analyzed cell wall composition. PB analyzed lipid composition. JP analyzed carbon source usage. All authors contributed to the article and approved the submitted version.

ACKNOWLEDGMENTS
We thank Gustavo Henrique Goldman and Marcin Grynberg for their insight and comments about the manuscript.