The Genomic Potentials of NOB and Comammox Nitrospira in River Sediment Are Impacted by Native Freshwater Mussels

Freshwater mussel assemblages of the Upper Mississippi River (UMR) sequester tons of ammonia- and urea-based biodeposits each day and aerate sediment through burrowing activities, thus creating a unique niche for nitrogen (N) cycling microorganisms. This study explored how mussels impact the abundance of N-cycling species with an emphasis on Candidatus Nitrospira inopinata, the first microorganism known to completely oxidize ammonia (comammox) to nitrate. This study used metagenomic shotgun sequencing of genomic DNA to compare nitrogen cycling species in sediment under a well-established mussel assemblage and in nearby sediment without mussels. Metagenomic reads were aligned to the prokaryotic RefSeq non-redundant protein database using BLASTx, taxonomic binning was performed using the weighted lowest common ancestor algorithm, and protein-coding genes were categorized by metabolic function using the SEED subsystem. Linear discriminant analysis (LDA) effect sizes were used to determine which metagenomes and metabolic features explained the most differences between the mussel habitat sediment and sediment without mussels. Of the N-cycling species deemed differentially abundant, Nitrospira moscoviensis and “Candidatus Nitrospira inopinata” were responsible for creating a distinctive N-cycling microbiome in the mussel habitat sediment. Further investigation revealed that comammox Nitrospira had a large metabolic potential to degrade mussel biodeposits, as evidenced the top ten percent of protein-coding genes including the cytochrome c-type biogenesis protein required for hydroxylamine oxidation, ammonia monooxygenase, and urea decomposition SEED subsystems. Genetic marker analysis of these two Nitrospira taxons suggested that N. moscoviensis was most impacted by diverse carbon metabolic processes while “Candidatus Nitrospira inopinata” was most distinguished by multidrug efflux proteins (AcrB), NiFe hydrogenase (HypF) used in hydrogen oxidation and sulfur reduction coupled reactions, and a heme chaperone (CcmE). Furthermore, our research suggests that comammox and NOB Nitrospira likely coexisted by utilizing mixotrophic metabolisms. For example, “Candidatus Nitrospira inopinata” had the largest potentials for ammonia oxidation, nitrite reduction with NirK, and hydrogen oxidation, while NOB Nitrospira had the greatest potential for nitrite oxidation, and nitrate reduction possibly coupled with formate oxidation. Overall, our results suggest that this mussel habitat sediment harbors a niche for NOB and comammox Nitrospira, and ultimately impacts N-cycling in backwaters of the UMR.

Freshwater mussel assemblages of the Upper Mississippi River (UMR) sequester tons of ammonia-and urea-based biodeposits each day and aerate sediment through burrowing activities, thus creating a unique niche for nitrogen (N) cycling microorganisms. This study explored how mussels impact the abundance of N-cycling species with an emphasis on Candidatus Nitrospira inopinata, the first microorganism known to completely oxidize ammonia (comammox) to nitrate. This study used metagenomic shotgun sequencing of genomic DNA to compare nitrogen cycling species in sediment under a well-established mussel assemblage and in nearby sediment without mussels. Metagenomic reads were aligned to the prokaryotic RefSeq non-redundant protein database using BLASTx, taxonomic binning was performed using the weighted lowest common ancestor algorithm, and protein-coding genes were categorized by metabolic function using the SEED subsystem. Linear discriminant analysis (LDA) effect sizes were used to determine which metagenomes and metabolic features explained the most differences between the mussel habitat sediment and sediment without mussels. Of the N-cycling species deemed differentially abundant, Nitrospira moscoviensis and "Candidatus Nitrospira inopinata" were responsible for creating a distinctive N-cycling microbiome in the mussel habitat sediment. Further investigation revealed that comammox Nitrospira had a large metabolic potential to degrade mussel biodeposits, as evidenced the top ten percent of proteincoding genes including the cytochrome c-type biogenesis protein required for hydroxylamine oxidation, ammonia monooxygenase, and urea decomposition SEED subsystems. Genetic marker analysis of these two Nitrospira taxons suggested that N. moscoviensis was most impacted by diverse carbon metabolic processes while "Candidatus Nitrospira inopinata" was most distinguished by multidrug efflux proteins (AcrB), NiFe hydrogenase (HypF) used in hydrogen oxidation and sulfur reduction coupled reactions, and a heme chaperone (CcmE). Furthermore, our research suggests that comammox and NOB Nitrospira likely coexisted by utilizing mixotrophic metabolisms. For example, "Candidatus Nitrospira inopinata" had the INTRODUCTION Water quality of the Upper Mississippi River (UMR) has been documented for decades (Lerch et al., 2015), yet the UMR basin contributes over 50,000 metric tons of bioactive nitrogen (N) to the Gulf of Mexico each year (Donner and Kucharik, 2008). Research has shown that microbial communities are impacted by the addition of bioactive N (Hallin et al., 2009), and subsequently alter N-biogeochemical cycling in the UMR through nitrification and denitrification processes (Mulholland et al., 2008;Shange et al., 2012). Enhancing the vertical exchange between overlying water and groundwater (i.e., water-sediment interface) of UMR backwater channels has been proposed to significantly enhance N removal (Strauss et al., 2006;Gomez-Velez et al., 2015), particularly because biotic removal of N reaches a maximum efficiency of 40% as N loads increase in large streams (Mulholland et al., 2008) and denitrification rates plateau as nitrate (NO 3 -N) reaches 5 mg/L in backwater channels (Kreiling et al., 2011). Taken together, these findings emphasize the large N-cycling potential of benthic organisms, by enhancing the flux of nutrients into sediment for microbial transformations (Vaughn and Hakenkamp, 2001;Atkinson et al., 2013).
Freshwater mussels (order Unionidae) native to the UMR live in assemblages of 3-5 mussels m −2 , collectively filter billions of gallons of water, and remove tons of N-containing biomass from overlying water each day (Newton et al., 2011). In addition to the ecosystem services of water filtration and enhancing nutrient exchange rates across the water-sediment interface (Atkinson et al., 2014), mussel excretion of feces and pseudofeces (biodeposition products) sequesters ammonia (NH 3 ) and carbon (C) into sediment porewater (Newton et al., 2011;Bril et al., 2014Bril et al., , 2017. As a result, mussel assemblages are attributed with creating "hotspots" of N and C in surrounding sediment (Atkinson and Vaughn, 2015), and create a microbial niche ripe for nitrification at the interface of oxic and anoxic conditions (Black et al., 2017).
Nitrifying organisms capable of mixotrophy may pose an advantage in a mussel-influenced habitat, owing to the adaptation of switching metabolic functions when conditions change from oxic to anoxic (Matantseva and Skarlato, 2013;Daebeler et al., 2014). It was previously thought that nitrite (NO − 2 ) oxidizing bacteria (NOB) were restricted to oxic environments where ammonia (NH 3 ) oxidizing bacteria (AOB) produce NO − 2 , but recent genomic analyses have expanded the known metabolic functions of conventional NOB Nitrospira. For example, the NO − 2 oxidizing species Nitrospira moscoviensis is genetically capable of cyanate degradation (Palatinszky et al., 2015), aerobic hydrogen oxidation (Koch et al., 2014), and formate oxidation coupled with nitrate (NO − 3 ) reduction (Koch et al., 2015). Nitrification was further expanded after discovering that Nitrospira moscoviensis can produce NH 3 and CO 2 by way of urea hydrolysis, and can reciprocally feed NH 3 to urease-lacking AOB and receive NO − 2 in return (Koch et al., 2015). Furthermore, the N-cycle was transformed after the discovery of a single organism capable of complete NH 3 oxidation (comammox) (van Kessel et al., 2015) and confirmation of genes required for complete nitrification encoded by "Candidatus Nitrospira inopinata" (Daims et al., 2015), and potentially even sulfur reduction (Camejo et al., 2017).
In a previous study, we showed that sediment of a wellestablished mussel habitat in UMR backwaters contained an enhanced niche for Nitrospirae in addition to a greater abundance of microorganisms indicative of an oxic-anoxic niche, like anaerobic ammonium oxidizers (anammox). This presumed oxic-anoxic niche was detected closer to the watersediment interface in the mussel habitat, since the relative abundance of anammox bacteria peaked at shallow (3 cm) sediment depths with mussels, but were more abundant in deeper (5 cm) sediments in the no-mussel treatment (Black et al., 2017). Furthermore, the 16S rRNA amplicon survey showed fewer differences among N-cycling phylotypes in shallow sediment with mussels and deeper sediment without mussels (i.e., intrasample differences), and the fewest differences when comparing the shallow mussel sediment against the deeper no-mussel sediment (inter-sample differences). In response, this study used the deeper no-mussel sediment as the most stringent baseline to assess mussel influences on the N-cycling community with mussels. We employed metagenomic shotgun sequencing of total DNA corresponding with the aforementioned oxic-anoxic niche sediment components, with the goal of identifying the N-cycling species most impacted by mussels. We hypothesized that sediment from the mussel habitat would contain an increased abundance of nitrifying taxons, presumably due to an enhanced genomic potential for ammonia oxidation.

Sediment Collection and DNA Isolation
Our study sites were located in the backwaters of the UMR navigation pool 16, where a consistently populated mussel assemblage has been studied for decades (USACE, 1981(USACE, , 1984Young et al., 2005;Morales et al., 2006). Sediment cores were obtained from the mussel habitat (41.452804, −90.763299) and upstream sediment (41.451540, −90.753275) lacking mussels (Black et al., 2017); both sites had similar hydraulics and sediment composition (Young et al., 2005), and will be considered as treatments "with-mussels" and with "no-mussels" according to previous studies (Mirto et al., 2000;Danovaro et al., 2004). Cores were removed from each site using a 2-in diameter, post-driver with a polypropylene liner (Multi-State Sediment Sampler, Art's Manufacturing and Supply, Inc.; American Falls, ID, United States), and an ethanol flame-sterilized 3/8-in diameter drill bit was used to penetrate the cores at depths of 3 and 5 cm. For each core, samples (0.25 g sediment) were removed in quadruplicate (n = 4, 3 cm depth with mussels; n = 4, 5 cm depth without mussels) and stored in sterile bead beating tubes overnight at -20 o C. Genomic DNA was isolated (PowerSoil DNA Isolation Kit; MoBio Laboratories, Inc., Carlsbad, CA, United States) and stored at −20 • C. Following verification of DNA quality and quantity (NanoDrop 1000; Thermo Fisher Scientific, Waltham, MA, United States), genomic DNA was sequenced at the University of Iowa Institute for Human Genetics (IIHG). As mentioned earlier, selection of these samples were informed by evidence suggesting oxic-anoxic interface niches were located at 5 cm sediment depth without mussels and 3 cm depth beneath mussels (Black et al., 2017). Samples chosen for this experiment correspond to the following 16S rRNA amplicon sequencing data at MG-RAST: without-mussels-mgm4705698.3, mgm4705704.3, mgm4705686.3, mgm4705697.3 with-mussels mgm4705708.3, mgm4705672.3, mgm4705699.3, mgm4705680.3.

Metagenomic Shotgun Sequencing
For each sample, 120 ng of genomic DNA in 60 µL of 10 mM Tris-HCl, pH 8.0 buffer, was placed into 1.5 mL RNase-/DNase-free, low binding microcentrifuge tubes. Library creation steps were performed by the IIHG Genomics Division, and included DNA shearing using the Covaris Adaptive Focused Acoustics TM process (Covaris E220 Focused-ultrasonicator; Covaris, Inc., Woburn, MA, United States), and DNA fragment purification and end polishing (KAPA Hyper prep kits; Kapa Biosystems, Inc., Wilmington, MA, United States) prior to ligation to indexed adaptors. The library size distribution was validated using the Agilent 2100 Bioanalyzer Instrument (Agilent Technologies, Santa Clara, CA, United States), and quantified using the q-PCR KAPA library amplification module following manufacturer instructions (Kapa Biosystems, Inc.). The indexed libraries were normalized, pooled, and clustered on a flow cell using the cBOT Cluster Generation System (Illumina, Inc., San Diego, CA, United States) and sequenced on the Illumina HiSeq 4000 System (Illumina, Inc.) in high output mode (1 lane, 2 × 150 bp). FASTQ files are accessible at ENA (Study Accession: PRJEB23134) and NCBI repositories (BioProject ID: PRJNA414922), and MG-RAST contains QA/QC and analyses of metagenomes (MG-RAST project mgp21252).

Bioinformatics Pipeline
FastQC (Andrews, 2010) was used for quality control and revealed an average sequence abundance of 42,606,252± 2,365,531 sequences, sequence lengths of 151 bp, and 60% (±0.84%) GC content for no-mussel samples, and 41,402,435 ± 3,444,191 sequences, with sequence lengths of 151 bp, and 60% (±1.31%) GC content for samples with mussels. For taxonomic and functional binning of reads, we employed the streamlined DIAMOND (Buchfink et al., 2015) and MEGAN (Huson et al., 2007) pipeline specialized for microbiome shotgun sequencing analyses (Huson et al., 2016). First, RefSeq (O'Leary et al., 2016) non-redundant (nr) archaeal and bacterial protein sequences (Release80) were concatenated to construct a database for BLASTx alignments in DIAMOND using the "make.db" command, and pairwise alignments were performed using the default BLASTx settings (BLOSUM62 matrix, γ = 0.267, K = 0.041, Penalties = 11/1). The aligned reads from DIAMOND were imported into MEGAN using daa-meganizer (Huson et al., 2016), keeping only the top 100 matches per read. The weighted lowest common ancestor (LCA) algorithm was used for taxonomic binning using default settings in MEGAN6 (Huson et al., 2016) (min score = 50.0, max expected = 0.01, top percent = 10.0%, min support percent = 0.05, min support = 1, 80% coverage for weighted LCA algorithm) and classified according to NCBI taxonomy IDs (Nov 2016 release). Furthermore, aligned reads were assigned functional roles using accession mapping files for SEED subsystems (Overbeek et al., 2005) (SEED May 2015 annotation). The resulting files contained all reads, alignments, taxonomic and functional classifications, and were normalized for sampling read depth (normalized to 12,726,950 reads per sample) and assigned metadata categories, "no-mussel" and "with-mussel." Reads aligned to N-cycling genomes were extracted from MEGAN for differential abundance analysis using LefSe (Segata et al., 2011). The most differentially abundant genomes belonged to "Candidatus Nitrospira inopinata" (GCF_001458695.1) and Nitrospira moscoviensis (GCF_001273775.1), and were further assembled and annotated in Unipro UGENE (Okonechnikov et al., 2012) and depicted using DNAPlotter (Carver et al., 2009). The UGENE workflow included mapping reads to indexed reference genomes using BWA MEM (Li and Durbin, 2009) with default settings, followed by filtering and sorting the BAM files using SAMtools , and a final quality control step using FastQC (Andrews, 2010).

LDA Effect Size
A linear discriminant analysis (LDA) method was used to assess which genomes and genomic features were most discriminative of the freshwater mussel habitat. N-cycling taxonomies of interest were chosen based on previous research (Pfister et al., 2014) and included AOB and NOB phylotypes with the prefix "nitro, " N-reducing phylotypes designated by "denitrificans" or "nitroreducens, " and anammox candidate genera, Brocadia and Jettenia. The relative abundance of reads aligned to these N-cycling taxons were assessed for LDA effect size (LEfSe) (Segata et al., 2011) to determine which N-cycling taxonomic features were most responsible for differences in the mussel habitat microbiomes. All samples were labeled by class ("mussel" and "no-mussel") and features were compared for differential distribution using the non-parametric factorial Kruskal-Wallis sum-rank test (α = 0.05) and normalized to a total read count of 1 M. Features deemed differentially abundant were compared for significant effect size using the pairwise Wilcoxon ranksum test (α = 0.05; "all against all"), and ranked features according to greatest effect size. A minimum LDA score of 2.0 was chosen as a cutoff for significant features to limit analysis to the most distinctive metagenomic traits. "Candidatus Nitrospira inopinata" and Nitrospira moscoviensis were shown to be the most distinctive genomes with mussels, and follow-up genetic marker tests were performed for SEED assignments to address which metabolic functions may be responsible for this differentiation.

N-cycle Taxonomic Composition
The DIAMOND/MEGAN pipeline revealed metagenomic reads assigned to N-cycling organisms (Figure 1) were slightly more abundant with mussels (157,275 ± 17,503 reads) than without mussels (136,884 ± 20,982 reads), and the mussel habitat contained more reads belonging to N-cycling bacterial lineages (Figure 1) with Nitrospirae representing the most bacterial species. In both treatments, "Candidatus Methanoperdens nitroreducens" represented the largest number of archaeal metagenomic reads but was not differentially abundant between treatments. NO − 2 oxidizing organisms represented the largest N-cycling group, with Nitrospira moscoviensis, Nitrospira defluvii, and "Candidatus Nitrospira inopinata" comprising an average of 22, 15, and 11% of N-cycling metagenomic reads in the mussel habitat, respectively. Steroidobacter denitrificans was also a highly abundant component of the N-cycling community (12-14%) but was not differentially abundant between treatments.
Linear discriminant analysis effect size of the metagenomic reads assigned to N-cycle taxons further emphasized the major differences in the mussel habitat (Figure 1). Of the taxons considered, bacterial lineages experienced the most increases with mussels (LDA = 4.27, P = 0.043), and the most differentially abundant species were Nitrospira moscoviensis (LDA = 3.80, P = 0.021) and "Candidatus Nitrospira inopinata" (LDA = 3.63, P = 0.021). One other group of nitrifying taxons were differentially greater with mussels, and belonged to the Nitrosococcus genus (LDA = 2.20, P = 0.021). Multiple denitrifying taxons were greater with mussels, including Methylomonas denitrificans, Denitrobacterium detoxificans, Competibacter denitrificans, and a Gammaproteobacterial sulfur oxidizing symbiont, Thiohalorhabdus denitrificans. However, these denitrifying species were lower in abundance than Nitrospira, and thus were ranked lower as biomarker species. Protein functional assignments of "Candidatus Nitrospira inopinata" and N. moscoviensis were also assessed for distinctive features between the mussel and no-mussel treatments, with the goal of discovering niche differentiating functions responsible for the enhanced abundance of NOB and comammox Nitrospira genomes (McGill et al., 2007).

Nitrospira moscoviensis Genomic Potential
A total of 435,151 SEED protein functions were assigned to the genome of N. moscoviensis (normalized to 36,562 per sample) and was dominated by 5 SEED categories: carbohydrates, cofactors, vitamins, prosthetic groups, pigments, amino acids and derivatives, protein metabolism, and DNA metabolism (Supplementary Table S1). The 25% most abundant SEED subsystems included those indicative of metabolic activity and growth, such as peptidoglycan and cytoskeleton biosynthesis, respiration and carbon fixation, DNA replication and repair (Supplementary Table S1). Highly abundant SEED proteins unique to N. moscoviensis in the no-mussel treatment included those potentially functioned in motility, chemotaxis, biotin synthesis, and thiamin metabolism ("5-FCL-like protein"), while the mussel habitat treatment was dominated by genes encoding folate and cysteine biosynthesis, carbon cycling ("alpha carboxysome"), and the DNA regulatory proteins YebC and proteasomes (Supplementary Table S1).
Of the N-metabolism functional assignments (Table 1), formate hydrogenases were the most abundant N-cycling category for both treatments, and numerous enzymatic functions were relatively more abundant in the mussel habitat treatment. These included genes encoding an NH + 4 permease, NO reductase proteins (NorD and NorQ), formate dehydrogenase subunits (beta and chain D), NO − 2 /NO − 3 transporters and sensors, periplasmic nitrate reductases (NapG and NapF), and urease proteins (UreA and ureF) ( Table 1).

N. moscoviensis Genetic Markers
Genetic code processing was a major potential function of N. moscoviensis in the mussel habitat, as evidenced by increased functional proteins used in DNA, RNA, and protein metabolism (LDA up to 3.32). Two of the most distinct features were genes encoding YebC-like DNA-binding regulatory proteins, an ATPdependent DNA helicase protein (PcrA), and ribonuclease H III (Supplementary Table S2 and Figure 2), in addition to other proteins like DNA polymerase III and LSU ribosome. Unclassified hypothetical proteins (FIG039061) related to heme utilization was a highly ranked SEED subsystem (LDA = 3.33, P = 0.043) and likely was due to a large abundance a gene encoding a modular heme utilizing protein (NITMOv2_0147), with other iron-based genetic markers including Cytochrome C553 and an Fe-S cluster regulator (IscR). Other differentially abundant features were related to carbon cycling SEED subsystems (LDA = 2.99, P = 0.021) and included genes encoding ribulose phosphate-3 epimerase used for carbon fixation, and two glycogen synthesis enzymes, 4-alpha-glucanotransferase (MalQ), and 1,4-alpha glucan (glycogen) branching enzyme (GH-13 type) (GlgB).
Contrastingly, the N. moscoviensis potential protein functions without mussels were largely marked by genetic markers of stress response. The glutathione-regulated K + efflux system used in potassium metabolic processes (LDA = 3.28, P = 0.043) FIGURE 1 | Nitrogen-cycling taxonomies assessed for linear discriminant analysis (LDA) effect size are colored based on phylum, as specified in the legend. Taxons shown with a " * " icon and salmon-colored background had statistically significant LDA scores (LDA > 2, P < 0.05). Rings surrounding the phylogenetic tree depict the relative abundance of reads assigned to the respective species. Ring color intensity represents relative read count with mussels (salmon-colored; "M") and with no mussels (turquoise-colored; "NM"). The opacity of "Read count" ring-segments corresponds to the greatest taxonomic abundance. Several N-cycling taxons were differentially abundant with mussels and LDA effect sizes are represented by the height of black bars in the outer-most ring.
were the most definitive SEED subsystems of the no-mussel N. moscoviensis genome. Furthermore, the glutathione-regulated K + efflux system protein family (KefB) is activated in the presence of methylglyoxal (Ferguson et al., 1995), and the metabolism of methylglyoxal was also a genetic marker of the no-mussel treatment (LDA = 2.45, P = 0.043). These results may suggest a stress response genetic marker, as glutathione-regulated potassium efflux systems are often utilized to counteract electrophilic compounds during stress (Tötemeyer et al., 1998;Bott and Love, 2004). Additionally, superoxide dismutase was another highly ranked functional marker and may suggest an enhanced stress from reactive oxygen. Other stress genetic markers included the carbon starvation stress SEED subsystem (LDA = 2.85, P = 0.021) and an enhanced abundance of genes encoding carbon starvation protein A (CstA) (Supplementary Table S2). Other genetic markers suggest a potential metabolic ability to respond to diverse carbon compounds, including genes encoding gluconolactonase of the Entner Duodoroff pathway (LDA = 2.99, P = 0.021), acetoacetate metabolism with an enhanced regulatory protein, AtoC, and an acetyl-CoA biosynthesis enzyme, pyruvate decarboxylase (Supplementary Table S2). Differential features also included the potential metabolism of complex carbon sources, such as lactose (LDA = 2.40, P = 0.021) via 2oxoglutarate decarboxylase, mannose by way of mannose-1-P guanylyltransferase, maltose and maltodextrin degradation via alpha amylase, and N-acetylglucosamine (LDA = 2.96, P = 0.043) with beta-hexosaminidase enzymes.
N metabolic genes were lowly abundant for N. moscoviensis genomes in both treatments, but a NO − 3 /NO − 2 sensor protein was a functional marker of the mussel habitat (LDA = 2.61, P = 0.014). This enhanced ability to transport NO − 2 and NO − 3 across the cell membrane could be linked to potential energy conservation processes or metabolic reactions beyond N-cycling. For example, the Nxr protein of N. moscoviensis is contained within the periplasm and does not require transportation of NO − 2 or NO − 3 into the cell (Spieck et al., 1998). Instead, N. moscoviensis likely transports NO − 2 and NO − 3 into the cell for assimilation and outside the cell to protect against excess NO − 2 (Lücker et al., 2010).
In comparison, the no-mussel treatment showed an enhance genetic ability to uptake and store N by way of encoding a cyanate ABC transporter (LDA = 2.40, P = 0.043) and asparagine synthetase (AsnB) (Supplementary Table S2), respectively. Differential abundance of cyanate transporters would indicate N. moscoviensis could have increased ability to obtain an TABLE 1 | Nitrospira moscoviensis N-cycling protein functions from the mussel habitat in relative abundance (RPKM) and as a proportion of SEED enzymatic function.

SEED subsystems and protein functions
Average read abundance (RPKM) Relative proportion of protein function alternative source of N via the cyanase enzyme (Koch et al., 2015) or could be reciprocally fed to cyanase-lacking nitrifiers (Palatinszky et al., 2015). The co-increases of genes encoding AsnB and a cyanate transporter suggests that N. moscoviensis may have utilized alternative N sources in the sediment without mussels, perhaps due to inorganic N limitations in its environment.

Comammox Nitrospira Genomic Potential
A total of 163,253 SEED functionalities were assigned to the genome of "Candidatus Nitrospira inopinata" (normalized to 13,278 per sample) and the top 25% most prominent SEED subsystems did not differ substantially between treatments (Supplementary Table S3 and Figure 3). The "restrictionmodification system" represented the largest potential functional category for the no-mussel treatment, while the potential for thiamin metabolism ("5-FCL-like protein"), and stress response ("commensurate regulon activation") SEED assignments were the most abundant for the genome with mussels. The metabolic potential for "urea decomposition" was more abundant without mussels, while "biogenesis of c-type cytochromes" was greater in the mussel treatment. The "ammonia monooxygenase" potential function was the 4 th most abundant SEED subsystem for both mussel and no-mussel treatments. Both treatments FIGURE 2 | The assembled N. moscoviensis genome is depicted with tick mark intervals of 225 kBp, and tracks are composed of the following components, starting with the outermost rings: Forward strand coding regions (gray), reverse strand coding regions (gray), the top 25% most abundant enzymes with mussels (dark orange), genomic read coverage from the mussel habitat (salmon colored), genomic coverage without mussels (turquoise colored), the top 25% most abundant enzymes with no mussels (dark green), GC coverage, and GC skew. The N-cycling genes with greatest abundance, GlnEB (NITMOv2_1289, NITMOv2_1290), nitrite oxidoreductase C (NITMOv2_3624), and periplasmic nitrate reductase (NITMOv2_3626) are marked with red lines over the outer rings. Gene names are shown for those with the largest LDA effect size (LDA > 3.0). For the mussel habitat, these include the YebC-like proteins, YchF and thrS, heme utilization protein, and DNA helicase (PcrA). Without mussels, these include the CPA2 protein families (kefBC in N. moscoviensis), 6-phosphogluconolactonase (pgl), a Fe-Mn superoxide dismutase (sod), asparagine synthetase (asnB), glucose methanol choline oxidoreductase (gmc), pyruvate decarboxylase (cfp), and carbon starvation protein (cstA). Table S3), including genes encoding a resistance-nodulationcell division (RND) efflux system inner membrane transporter, AmoC, alcohol dehydrogenase, and glycogen utilizing enzymes (Supplementary Table S3). Numerous N-cycling functions of "Candidatus Nitrospira inopinata" were highly abundant in the mussel habitat (Table 2), including genes encoding NOand N 2 O-forming enzymes, Amo, and NO − 2 /NO − 3 transforming enzymes (Nxr, NapG, and Nas). Although the total abundance of genes encoding Amo was slightly larger in the mussel treatment, this trend was most evident for AmoA and AmoB. Genes encoding urea transporters (UrtB and UrtC) and urease proteins (UreD, UreG, UreA, and UreF) were relatively more abundant in the mussel treatment, though the "urea decomposition" subsystem did not follow this overall trend. Finally, N-cycling genes shared by N. moscoviensis and "Candidatus Nitrospira inopinata" were assessed for differential abundance to reveal if N-cycling genes could explain how both species were distinct to the mussel habitat, despite competing for similar N substrates. The protein functions present in both species include NH + 4 transporters and permeases, coppercontaining nitrite reductase (NirK), a "NnrS protein involved in response to NO, " NO reductase, periplasmic NO − 3 reductase, NO − 2 /NO − 3 sensor and response regulator proteins, Nxr, urea transporters, and urease enzymes. A mean rank multiple comparison analysis (Kruskal-Wallis test with Dunn's multiple test correction) revealed that the sum of genes encoding urea transporters (Urt) (Mean rank difference = 53.8, P = 0.002) and copper-containing nitrite reductases (NO-forming) (NirK) (Mean rank difference = 54.0, P = 0.002) were more abundant from the "Candidatus Nitrospira inopinata" genome (mussel habitat) compared to N. moscoviensis. It should be noted that other studies have detected nirK in Nitrospira but we are not aware of studies detecting NO production (Camejo et al., 2017). FIGURE 3 | Candidatus Nitrospira inopinata assembly shown with tick mark intervals of 160 kBp. Tracks within the genome are composed of the following, starting with the outermost rings: Forward strand coding regions, reverse strand coding regions, the top 25% most abundant enzymes with mussels (dark orange), genomic coverage with mussels (salmon colored), top 25% most abundant SEED subsystems in both treatments (royal blue), genomic coverage without mussels (turquoise colored), the top 25% most abundant enzymes with no mussels (dark green), GC coverage, and GC skew. N-cycling genes are designated with red lines across the CDS rings and the most abundant genes [amoC3 (NITINOP_0766), nirK (NITINOP_3146), hao (NITINOP_0065)] are designated with text. For simplicity, genes with LDA > 3 are named (acrB, hypF, and thiE), and therefore limits to the mussel habitat treatment (text shown above salmon-colored ring).

shared numerous abundant SEED assignments (Supplementary
Additionally, it has been suggested that nirK from AOB could catalyze the oxidation of NO to NO − 2 (Kuypers et al., 2018), but we cannot make conclusions on enzymatic activity from metagenomic evidence alone.

Comammox Nitrospira Genetic Markers
The most discerning differences in the genome of "Candidatus Nitrospira inopinata" with mussels included a gene encoding the RND efflux system inner membrane transporter (AcrB) (Supplementary Table S4 and Figure 3) as part of the SEED subclass "FOL commensurate regulon activation" (LDA = 3.71, P = 0.043). Numerous other potential stress response and virulence pathways were greater in the mussel treatment (Supplementary Table S4), such as genes encoding a RND efflux system membrane fusion protein (TtgA), a membrane protease with a mechanism for aminoglycoside resistance (HflK), an integral inner membrane protein type IV secretion complex (VirB), and "death on curing protein" (Doc) as part of the pathway of YdcE and YdcD toxin programmed cell death (LDA = 2.46, P = 0.014). Results also indicated a larger metabolic potential for metal tolerance in the mussel treatment, namely copper tolerance (LDA = 2.26, P = 0.047) by way of a gene encoding a periplasmic divalent cation tolerance protein (CutA).
Other genetic markers in the mussel habitat had indirect links to N metabolism through potential for urea cycling and cytochrome biosynthesis. A gene encoding a NiFe hydrogenase (HypF) assembly protein responsible for regulating the sulfurreducing hydrogenase gene set (hydBGDA and hybD) was greater with mussels, and was recently suggested as giving comammox Nitrospira the potential function of hydrogen 2 | "Candidatus Nitrospira inopinata" N-cycling protein functions from the mussel habitat as relative abundance (RPKM) and as a proportion of SEED enzymatic function.

SEED subsystems and protein functions
Average read abundance (RPKM) Relative proportion of protein function SEED counts are shown as a percentage of the total classified protein function in the mussel habitat.
oxidation coupled to sulfur reduction in anaerobic conditions (Camejo et al., 2017). Genes encoding a cytochrome c-type biogenesis protein (CcmF) were greater with mussels, and is a heme chaperone required for biogenesis of cytochrome c-type proteins located immediately downstream of hydroxylamine dehydrogenase (hao) (Figure 3). Another genetic functional marker greater with mussels encodes carbamoyl phosphate synthetase (CarB) and would effectively add urea-derived NH 3 into central metabolism. SEED subsystems of "Candidatus Nitrospira inopinata" most indicative of the no-mussel treatment were those potentially functioned in potassium homeostasis (LDA = 3.53, P = 0.043), alpha carboxysome (LDA = 3.04, P = 0.043), and DNA metabolic CRISPR function (LDA = 3.04, P = 0.043). The only significantly different protein-coding gene classified in the "potassium homeostasis" subsystem encoded a potassium transporting ATPase (KdpC), and is a catalytic chaperone for high-affinity potassium uptake (Irzik et al., 2011). Greater function potential for rubrerythrin, a stress response protein used to combat oxidative stress (LDA = 2.45, P = 0.021) (Cardenas et al., 2016), was greater in the no-mussel treatment, and may be linked to greater abundances of genes encoding a recombination and repair protein (RecO) and excinuclease ABC genes in DNA repair pathways (LDA = 2.38, P = 0.021). N-cycling genes were not found to be a significant feature differentiating comammox Nitrospira genomes in the no-mussel treatment.

DISCUSSION
In support of our research goal, we provided metagenomic evidence of niche partitioning features to explain how two species of Nitrospira were greater in UMR sediment with mussels. This aligns with our previous 16S rRNA amplicon sequencing study which detected a 10% greater abundance of Nitrospirae in the mussel habitat (Black et al., 2017). Furthermore, this metagenomic research showed greater abundances of species capable of urea hydrolysis and clarifies our previous assumptions that an alternative source of N allowed coincreases in anammox, AOB, and NOB phylotypes. We found that Nitrospira moscoviensis and "Candidatus Nitrospira inopinata" were the most differentiating N-cycling taxons in this mussel habitat niche, and these organisms likely co-existed because of metabolic flexibility beyond the conventional N-cycle.

Nitrospira moscoviensis Marked by Genetic Diversity and Potential for Carbon Metabolism
The most definitive genetic marker of N. moscoviensis from the mussel habitat encoded YebC (LDA = 3.49) and may suggest the potential for enhanced genetic diversity. For example, the YebC protein regulates genetic recombination and resolution of Holliday Junctions (Zhang et al., 2012), and enzymes classified within the YebC subsystem, YchF and ThrRS, function as translational control factors (Teplyakov et al., 2003;Zhou et al., 2016). These results suggest that N. moscoviensis in the mussel habitat had the genetic potential to synthesize proteins and respond to environmental variations (Hershey et al., 2012). Additionally, it makes sense that enhanced DNA repair using "UvrD-related helicases" and "two cell division clusters relating to chromosome partitioning" were also top differentiating features in the mussel habitat. Increased DNA repair and recombination genetic markers would also have permitted N. moscoviensis to respond to changing environments by way of genetic diversity. To this point, flexible metabolism and a mixotrophic lifestyle enables N. moscoviensis to be ecologically successful in numerous other environments (Koch et al., 2015).
We demonstrated that gene abundances associated with carbon metabolic processes of N. moscoviensis were impacted in the mussel habitat, as evidenced by increased Calvin cycle and glycogen metabolism genetic markers while the no-mussel treatments contained carbon stress protein genetic markers. CstA is a membrane protein predicted to be involved in peptide uptake when carbon availability is low (Rasmussen et al., 2013). Experimental evidence has also shown the cstA gene is upregulated during carbon starvation (Dubey et al., 2003) and allows for increased cellular growth by importing peptides as sources of C and N (Vastermark et al., 2014). Furthermore, it is possible that N. moscoviensis was genetically capable of responding to nutrient fluctuations in the no-mussel environment due to the glutathione-regulated potassium efflux gene (kefC) and methylglyoxal metabolism genetic markers. Evidence suggests that glutathione activates KefC potassium channels to protect against the toxic effects of methylglyoxal, and often occurs in response to limited phosphate or excessive carbon concentrations (Ferguson et al., 1998;Booth, 2005). It is possible that N. moscoviensis were C-stressed and nutrient-stressed in sediments without mussels, therefore the environment selected for the NOB Nitrospira possessing CstA and KefC proteins. Previous studies have shown that mussel biodeposits contain fairly consistent carbon and nutrient ratios (van Broekhoven et al., 2015) while others indicate biodeposit decomposition varies seasonally (Jansen et al., 2012), so future research should address if nutrient composition of mussel habitat sediments correspond to real-time metabolic activity of N. moscoviensis.
Furthermore, it is possible that mussel habitat sediments housed an enhanced niche for N. moscoviensis by containing a diverse source of carbon rather than nitrogen compounds. Contrary to our initial hypothesis, formate hydrogenase enzymes were a highly abundant N-cycling SEED category, while the gene families nirK, ure and urt, represented the lowest N-cycling genetic potential. These results suggest that N. moscoviensis had greater genetic potential to degrade to formate than from reciprocal feeding of mussel-derived urea. Taken together, these results signify N. moscoviensis in the mussel habitat had greatest potential for carbon degradation, NO − 2 oxidation, and likely thrived from fermentation products commonly found in sediments at the interface of oxic and anoxic conditions.

Comammox Nitrospira Marked by Potential for Multidrug Efflux Pumps and Diverse Metabolism
The comammox genome was marked with multidrug resistance efflux pumps (CmeB/AcrB), and metal resistance (Zn and Cu) in the mussel habitat, and suggests there may have been a selective pressure to use defense mechanisms. These multidrug resistance genes could be a response to one or more substrates and would enable resistance toward numerous antimicrobial substrates including antibiotics and heavy metals (Lin et al., 2002;Anes et al., 2015). Furthermore, our detected genetic markers for increased Phd-Doc toxin-antitoxin genes, are attributed with biochemical processes including antibiotic resistance (Liu et al., 2008). One explanation for these findings may be related to mussels hosting antibiotic resistant symbionts (Cooke, 1976;Liu et al., 2016), multidrug-resistant pathogens, (da Silveira et al., 2016), and enhanced horizontal gene transfer within their gut microbial community (Grevskott et al., 2017). Previous research has also shown that mussels bioaccumulate various metals (Naimo, 1995;Rzymski et al., 2014), including elements found in this genetic marker study, copper and zinc (Jorge et al., 2018). It is possible that horizontal gene transfer of multidrug resistance was facilitated by symbionts in mussel biodeposits, or from the decay of mussel tissue after death. However, this study was not designed to pinpoint antimicrobial stressors in the mussel habitat, so we cannot definitively say that commamox Nitrospira obtained these features as a direct result of their niche.
The genetic marker analysis also revealed that phosphorus metabolism was a distinctive feature of comammox Nitrospira in the mussel habitat. It is possible that phosphorus metabolic potential was influenced by mussel excretion products regenerating phosphate within the sediment, but would ultimately depend on the nutrient content and type of suspended food consumed by mussels (Jansen et al., 2012). Regeneration of phosphate in mussel habitat sediments would give comammox Nitrospira a selective advantage over AOA and AOB which do not encode an alkaline phosphatase (Palomo et al., 2018). Another genetic marker potentially explaining the success of comammox Nitrospira was the NiFe hydrogenase maturation protein (HypF) found in the mussel habitat. Recent studies have reported that comammox have the potential to oxidize H 2 and use sulfur as an electron acceptor in anaerobic conditions (Camejo et al., 2017), and this further emphasizes the importance of metabolic versatility in the UMR mussel habitat.

NOB and Comammox Nitrospira Coexisted in a Mussel Habitat Niche
Our results from the mussel habitat showed that "Candidatus Nitrospira inopinata" had the largest genomic potentials for ammonia oxidation, urea decomposition, and NO − 2 redox reactions. In comparison to N. moscoviensis, "Candidatus Nitrospira inopinata" was genetically equipped to obtain electrons from multiple N compounds. Furthermore, since the comammox genome contained large abundances of N-cycling genes compared to N. moscoviensis, this suggests that these two organisms were likely successful in the mussel habitat by utilizing different metabolic functions. In the mussel habitat, "Candidatus Nitrospira inopinata" had an order of magnitude greater capacity for urea decomposition than Nitrospira moscoviensis, with the most evident differences shown for urease proteins and urea transporters. It is likely that the comammox Nitrospira genome had greater metabolic potentials for urea decomposition since it contains the whole gene set encoding a urea ABC transporter (urtABCDE). This high affinity urea transporter is shared among most Nitrospira species (Ushiki et al., 2017) and likely enables an adaptation to environments with variable and micromolar concentrations of urea. In comparison, N. moscoviensis only has the potential to encode UrtA, and not accessory proteins UreD, UreE, UreF, and UreG (Koch et al., 2015).
Not only do these results suggest that Ca. N. inopinata had a greater genetic potential to degrade urea, but also may have outcompeted the potential for reciprocal feeding between N. moscoviensis and other ammonia oxidizers. Another explanation for this finding could be that Ca. N. inopinata has over an order of magnitude greater affinity for NH 3 than canonical AOB and is well suited for low NH 3 and variable environments (Kits et al., 2017). The importance of scavenging urea and NH 3 is emphasized by the fact that "Candidatus Nitrospira inopinata" cannot survive on NO − 2 alone, since the organism lacks the ability to utilize NO − 2 as a source of N (Daims et al., 2015).
Although comammox Nitrospira would have the advantage over N. moscovinesis by scavenging NH 3 and urea, both organisms could occupy the same niche due to their unique metabolic flexibilities. NOB Nitrospira thrive in oxic-anoxic niches where formate is produced by fermentation, and would not be outcompeted by comammox Nitrospira clade A, which lack a formate dehydrogenase enzyme (Hu and He, 2017). Furthermore, N. moscovinesis can simultaneously produce and consume NO − 2 , by coupling formate oxidation to NO − 3 reduction while aerobically oxidizing NO − 2 (Koch et al., 2015). This may explain our findings that Nitrospira moscoviensis had high genomic potentials for NO − 2 oxidation (Nxr), NO − 3 reduction (NapG), and formate hydrogenation (Hyf, Fds).
Surprisingly, we detected an order of magnitude greater nirK compared to nxr in "Candidatus Nitrospira inopinata" and signifies that NO could be a major product of nitrification in this mussel habitat niche. Although previous studies have not documented gaseous NO x production from nirK belonging to Nitrospira, comammox organisms do have the genetic capability for NO − 3 reduction to NO − 2 and NO (Camejo et al., 2017). Furthermore, N. moscoviensis showed genomic potential to respond to and degrade NO and N 2 O, potentially removing the major gaseous products of nitrification.
Our findings agree with other comparative genomic studies showing that Nitrospira species are functionally diverse and provides new insights on the niche separation between comammox and NOB (Hu and He, 2017). Although many genomic features differed between N. moscoviensis and Ca. N. inopinata in this study, both had greater genetic potentials for type IV pili in the mussel habitat. This feature would enhance the ability of these organisms to respond to environmental changes and form protective flocs and biofilms (Hospenthal et al., 2017). Aggregation of these two species is feasible and consistent with their ecophysiological versatility (Kits et al., 2017). Ultimately, our results indicated that NOB-and comammox-Nitrospira were genetically capable of coexisting in the mussel habitat through niche differentiation features, but potentially also synergistically coupled their metabolic features.

CONCLUSION
This research used genomic markers to show that Nitrospira moscoviensis and "Candidatus Nitrospira inopinata" predominated an N-cycling oxic-anoxic niche in sediment of a mussel habitat. Our research showed that formate oxidation coupled to NO − 3 reduction by NOB Nitrospira may have enabled co-existence with comammox Nitrospira in this sediment niche. The mussel habitat harbored comammox Nitrospira with enhanced RND efflux transporters and metal resistance, phosphorus metabolism, and showed evidence of hydrogen oxidation, while decreasing the genomic potential for potassium homeostasis and oxidative stress. For N. moscoviensis, the mussel habitat contained greater abundances of translational control genes and heme utilization while the no-mussel treatment showed genomic evidence of carbon stress. Both NOB and comammox Nitrospira were marked by diverse metabolism in the mussel habitat and may have contributed toward the increased abundance of both organisms. More research is needed to determine the biogeochemical signatures of the mussel habitat that may be responsible for these various genetic markers. Ultimately, this study provided metagenomic evidence showing that niche partitioning and mixotrophic metabolism allowed NOB and commamox Nitrospira to coexist in mussel habitat sediment.

AUTHOR CONTRIBUTIONS
EB and CJ contributed conception and design of the study. EB performed the statistical analysis and bioinformatics. EB and CJ wrote sections of the manuscript. All the authors contributed to manuscript revision, read and approved the submitted version.