Characterization of a Metal-Resistant Bacillus Strain With a High Molybdate Affinity ModA From Contaminated Sediments at the Oak Ridge Reservation

A nitrate- and metal-contaminated site at the Oak Ridge Reservation (ORR) was previously shown to contain the metal molybdenum (Mo) at picomolar concentrations. This potentially limits microbial nitrate reduction, as Mo is required by the enzyme nitrate reductase, which catalyzes the first step of nitrate removal. Enrichment for anaerobic nitrate-reducing microbes from contaminated sediment at the ORR yielded Bacillus strain EB106-08-02-XG196. This bacterium grows in the presence of multiple metals (Cd, Ni, Cu, Co, Mn, and U) but also exhibits better growth compared to control strains, including Pseudomonas fluorescens N2E2 isolated from a pristine ORR environment under low molybdate concentrations (<1 nM). Molybdate is taken up by the molybdate binding protein, ModA, of the molybdate ATP-binding cassette transporter. ModA of XG196 is phylogenetically distinct from those of other characterized ModA proteins. The genes encoding ModA from XG196, P. fluorescens N2E2 and Escherichia coli K12 were expressed in E. coli and the recombinant proteins were purified. Isothermal titration calorimetry analysis showed that XG196 ModA has a higher affinity for molybdate than other ModA proteins with a molybdate binding constant (KD) of 2.2 nM, about one order of magnitude lower than those of P. fluorescens N2E2 (27.0 nM) and E. coli K12 (25.0 nM). XG196 ModA also showed a fivefold higher affinity for molybdate than for tungstate (11 nM), whereas the ModA proteins from P. fluorescens N2E2 [KD (Mo) 27.0 nM, KD (W) 26.7 nM] and E. coli K12[(KD (Mo) 25.0 nM, KD (W) 23.8 nM] had similar affinities for the two oxyanions. We propose that high molybdate affinity coupled with resistance to multiple metals gives strain XG196 a competitive advantage in Mo-limited environments contaminated with high concentrations of metals and nitrate, as found at ORR.

A nitrate-and metal-contaminated site at the Oak Ridge Reservation (ORR) was previously shown to contain the metal molybdenum (Mo) at picomolar concentrations. This potentially limits microbial nitrate reduction, as Mo is required by the enzyme nitrate reductase, which catalyzes the first step of nitrate removal. Enrichment for anaerobic nitrate-reducing microbes from contaminated sediment at the ORR yielded Bacillus strain EB106-08-02-XG196. This bacterium grows in the presence of multiple metals (Cd, Ni, Cu, Co, Mn, and U) but also exhibits better growth compared to control strains, including Pseudomonas fluorescens N2E2 isolated from a pristine ORR environment under low molybdate concentrations (<1 nM). Molybdate is taken up by the molybdate binding protein, ModA, of the molybdate ATP-binding cassette transporter. ModA of XG196 is phylogenetically distinct from those of other characterized ModA proteins. The genes encoding ModA from XG196, P. fluorescens N2E2 and Escherichia coli K12 were expressed in E. coli and the recombinant proteins were purified. Isothermal titration calorimetry analysis showed that XG196 ModA has a higher affinity for molybdate than other ModA proteins with a molybdate binding constant (K D ) of 2.2 nM, about one order of magnitude lower than those of P. fluorescens N2E2 (27.0 nM) and E. coli K12 (25.0 nM). XG196 ModA also showed a fivefold higher affinity for molybdate than for tungstate (11 nM), whereas the ModA proteins from P. fluorescens N2E2 [K D (Mo) 27.0 nM, K D (W) 26.7 nM] and E. coli K12[(K D (Mo) 25.0 nM, K D (W) 23.8 nM] had similar affinities for the two oxyanions. We propose that high molybdate affinity coupled with resistance to multiple metals gives strain XG196 a competitive advantage in Molimited environments contaminated with high concentrations of metals and nitrate, as found at ORR.

INTRODUCTION
Molybdenum (Mo) is an essential metal for the growth of virtually all known life forms, including humans, plants and microorganisms, as it is required for the function of several key enzymes involved in the cycling of N, C, and S (Hamlin, 2016;Schwarz, 2016;Maia et al., 2017). Tungsten (W), an antagonist of Mo, is more uncommon in nature but required in some enzymes, most notably in archaea. Physiologically-relevant oxidation states of Mo and W are + 4, + 5, and + 6 (Maia et al., 2017). There are five distinct enzyme families that use Mo and/or W, represented by nitrogenase (Mo only, although some use vanadium), xanthine oxidase (Mo only), the sulfite oxidase (Mo only), DMSO reductase (most family members use Mo, a few use W) and tungsten-containing oxidoreductase (WOR, W only) (Hille et al., 2014;Maia et al., 2017). In most microorganisms, molybdate is taken up into the cell by the molybdate ATP-binding cassette or Mod transporter (ModABC), which can also take up tungstate (Grunden and Shanmugam, 1997;Self et al., 2001).
In the nitrogen cycle, Mo is utilized in three key steps, N 2 -fixation (by nitrogenase), nitrite oxidation (by nitrite oxidoreductase) and nitrate reduction (by nitrate reductase) (Zhang and Gladyshev, 2008). Hence Mo is required for the biological removal of nitrate from contaminated environments as the reductase is a key enzyme in both the denitrification (yielding N 2 ) and dissimilatory nitrate reduction to ammonium (DNRA) pathways (Zhang and Gladyshev, 2008). Consequently, in natural environments, the availability of Mo can limit nitrate removal (Barron et al., 2009;Glass et al., 2012). Mo limitation can also negatively impact nitrate removal in contaminated environments, which can be caused by the extensive use of nitrate-containing fertilizers, the release of nitrate-containing industrial wastes, as well as mining and other anthropogenic activities leading to problems for human health and natural environments (Spalding and Exner, 1993;Kellman and Hillaire-Marcel, 2003;Diaz and Rosenberg, 2008;Gruber and Galloway, 2008;Powlson et al., 2008;Thorgersen et al., 2015;Zhang et al., 2015).
The Oak Ridge Reservation (ORR) in Tennessee, United States contains a nitrate-contaminated waste site -the S-3 ponds. These are four adjacent (∼9.5 million liters each) earthen reservoirs used for the disposal of waste liquids that had been produced from the Y-12 nuclear plant for more than 30 years (Brooks, 2001). The waste liquids contained high, and potentially toxic, concentrations of nitrate (up to 1.2 M) and a wide variety of metals, such as iron (up to 21 mM), aluminum (up to 180 mM), magnesium (up to 28 mM), and uranium (up 1.3 mM) (Brooks, 2001). In 1983, the waste liquids in the S-3 ponds were adjusted to about pH 9, and the precipitates formed were allowed to settle before the liquid was removed (Brooks, 2001;Revil et al., 2013). In 1988, the S-3 ponds were filled and capped and now serve as a parking lot (Revil et al., 2013). However, the area is still heavily contaminated and groundwater in the contamination plume emanating from the former S-3 ponds is at low pH (as low as 3.0) and contains high concentrations of nitrate (up to 230 mM), much higher than the surrounding pristine groundwater (less than 32 µM) (Nolan et al., 1998;Ge et al., 2019). In addition, the contaminating plume has elevated concentrations of over 20 metals, including uranium (up to 580 µM) (Smith et al., 2015;Thorgersen et al., 2015). In stark contrast, extremely low concentrations of Mo (in the picomolar range) were measured in this highly contaminated groundwater. It was demonstrated experimentally that the pM concentrations of Mo in ORR contaminated groundwater were likely a result of molybdate adsorption and incorporation into Fe-and Albased minerals that are formed as the groundwater from the highly contaminated area (pH < 1) mixes with the surrounding groundwater (Moura et al., 2004;Ge et al., 2019).
Hence, a fundamental question is whether microorganisms that thrive in the unique ORR environment contaminated with high concentrations of metals and nitrate, yet containing only picomolar levels of Mo, have unique features that enhance Mo utilization. Herein, we describe the characterization of a novel nitrate-reducing Bacillus, designated strain EB106-08-02-XG196 (hereafter XG196), that was isolated from a sample of nitrateand metal-contaminated ORR sediment (EB-106) located 21 m downstream of the S-3 ponds area . XG196 is resistant to high concentrations of a metal mixture that was designed to mimic the ORR contaminated groundwater. More importantly, it is also much less sensitive to Mo-limitation than other ORR isolates, including four other EB-106 strains and a microbe obtained from a non-contaminated ORR environment. The molecular basis for the ability of XG196 to thrive under Mo-limited conditions was investigated.

Nitrate Reductase Activity
Nitrate reductase activities of the EB-106 isolates were determined using whole cell suspensions (Filiatrault et al., 2013). Strains were grown anaerobically in Hungate tubes and cells were collected between mid-log phase and early stationary phase, then 15 µL of 5 mg/ml chloramphenicol was added to 1.5 ml of culture to inhibit protein synthesis. Cells were washed twice and re-suspended in buffer (50 mM phosphate buffer, pH 7.2) and the OD 660 was determined. 200 µL of cells were mixed with 25 µL of methyl viologen (0.5 mg/ml) in an anaerobic sealed cuvette at 25 • C. 100 µL of reaction solution (4 mg/ml Na 2 S 2 O 4 , 4 mg/ml NaHCO 3 and 100 mM KNO 3 ) was added to start the reaction. In control reaction buffer, Na 2 S 2 O 4 was replaced with water. After incubation at room temperature for 5 min, the mixtures were vortexed in air to stop the reaction by oxidizing the electron donors (Na 2 S 2 O 4 and reduced methyl viologen). The amount of nitrite produced was measured by adding 100 µL of sulfanilic acid (1% w/v in 20% HCl) to 30 µL of each reaction mixture followed by 100 µL of N-(1-naphthyl)ethylenediamine-HCl (1.3 mg/ml). The OD 540 of each sample supernatant was measured and the amount of nitrite was calculated according to nitrite standards. The OD 420 of the samples was also measured to account for light scattering by residual cells and cell fragments. Nitrate reductase specific activity is expressed as units/OD 660 , in which units are calculated using the formula 100 × [OD 540 -(0.72 × OD 420 )]/(T × V), T is time in minutes and V is reaction volume in milliliters (Filiatrault et al., 2013;Thorgersen et al., 2019).

Carbon Sources Utilization for Anaerobic Growth Analysis
Growth on various carbon sources was determined at 25 • C under anaerobic conditions using the standard medium lacking yeast extract and the organic mixture but containing either formate, acetate, ethanol, lactate, succinic acid, fumarate, xylose, xylitol, glucose, fructose, maltose, sodium benzoate, sodium 4-hydroxybenzoate, potassium sodium tartrate, proline, phenylalanine, arginine, threonine, leucine, glutamate, or glutamine (all at 2 mM) with and without nitrate (KNO 3 , 20 mM). Growth was measured in 400 µl wells on a 100-well plate (Bioscreen sterile plates HONEYCOMB, Thermo Fisher Scientific, Waltham, MA, United States) using a Bioscreen C (Thermo Labsystems, Thermo Fisher Scientific, Waltham, MA, United States) placed in a PLAS LABS anaerobic chamber under a 5% H 2 and 95% Ar atmosphere. Optical density (OD 600 ) of cultures in each wells were measured every 5 min, after the plate was shaken using the Bioscreen C to resuspend cells. 1 https://blast.ncbi.nlm.nih.gov Mo Accumulation Analysis EB-106 isolates were grown in 500 ml of defined media with 1 µM Mo ((NH 4 ) 2 MoO 4 ) and harvested at mid log phase, washed three times with 10 ml of Tris buffer (Tris 50 mM, pH 8.0, containing 100 mM NaCl) and then resuspended in Tris buffer. Cells were lysed by sonication, then were spun down at 10,000 × G for 15 min and the supernatants were used for further centrifugation. The cytoplasmic extract (S100) was obtained after centrifugation at 100,000 × G for 1 h in a Beckman Coulter Optima L-90 ultracentrifuge. The membrane fraction was resuspended in 2 ml of Tris buffer. Both S100 and membrane fractions were diluted (1:40) with trace grade 2% nitric acid (VWR, Radnor, PA, United States) and incubated overnight prior to analysis by inductively coupled plasma mass spectrometry (ICP-MS) analysis to quantify Mo (Lancaster et al., 2014;Scott et al., 2015). Protein concentrations were measured using the Bradford assay (Bio-Rad protein assay kit, Bio-Rad, Berkeley, CA, United States). The amount of Mo accumulated is expressed as nmoles per gram of protein (nmol/g).

Molybdenum-Limited Growth
For the Mo-depleted medium, a solution was prepared that contained 1.3 mM KCl, 2 mM MgSO 4 , 0.1 mM CaCl 2 , and 0.3 mM NaCl together with the vitamins and minerals described above except that molybdenum and tungsten were not added (Widdel and Bak, 1992). Fe(NO 3 ) 3 (20 mM) was then added, which acidifies the solution to pH ∼ 2.5. The pH was then adjusted to pH 6.7 using trace grade NaOH (2.0 M) to induce precipitation of ferric hydroxide. As previously described , the Fe precipitates any contaminating Mo present in the medium components. The Mo-depleted growth medium was prepared by adding trace grade Fe(NO 3 ) 3 (7.4 µM), Na 2 SO 4 (2 mM), NaHCO 3 (30 mM), and NaH 2 PO 4 (5 mM) and inoculated with 1% (vol/vol) washed XG77, XG146, XG95, XG201, or XG196 cells grown in media with no Mo added. Growth in this medium with and without added Mo (0.1, 0.5, 1, 5, 10, 50, or 500 nM Na 2 MoO 4 ) was measured in quadruplet using the Bioscreen C described above. Mo and W competition analysis of XG196 and Pseudomonas fluorescens N2E2 under nitrate reducing conditions were performed using the same media with added Mo (0, 5, 50, 500, 5000, or 50000 nM Na 2 MoO 4 ) and added W (0, 50 nM, 5 or 500 µM Na 2 WO 4 ).

16S rDNA Sequencing and Phylogenetic Analysis
The 16S rDNA of isolate XG196 was amplified by PCR using universal bacterial primers 27F (5 -AGA GTT TGA TCC TGG CTC AG-3 ) and 1492R (5 -ACG GCT ACC TTG TTA CGA CTT-3 ) from Integrated DNA Technologies, Coralville, IA, United States. DNA sequencing was carried out by GENEWIZ, South Plainfield, NJ, United States. The sequence was first analyzed by BLAST 3 (Altschul et al., 1990), which indicated that XG196 is a Bacillus strain. The sequence was uploaded to the Ribosomal Database Project (RDP 4 ) (Cole et al., 2013). The RDP tool Seqmatch was run to find the closest relatives of isolate XG196 and the Bacillus type strains with high quality 16S rDNA sequences (>1200 bp). The two closest relatives of XG196 and a total of 187 Bacillus type strains with one out group strain were selected to build the 16S rRNA phylogenetic tree by IQ-TREE using maximum likelihood ( 5 Nguyen et al., 2014). GTR + F + R6 model was selected by ModelFinder (Kalyaanamoorthy et al., 2017) and 1000 times of bootstrapping was run using UFBoot (Hoang et al., 2017).

Phylogenetic Analysis of Molybdate and Tungstate Binding Proteins
Accession numbers of ModA (family IPR005950) and WtpA (family IPR022498) were downloaded from the InterPro database 6 (Mitchell et al., 2018). Information, such as sequence, mass, protein name, gene name, taxonomic lineage, cross-reference in PDB, cross-reference in KEGG, PubMed ID, etc. were all downloaded together. Proteins with candidadus/candidate organisms, uncultured organisms, fragment proteins, wrong/poorly-labeled organisms, and duplicates were removed from the list. Two lists (A and B) of strains were selected from downloaded candidates for ModA phylogenetic analysis. List A uses strains with ATCC (American Type Culture Collection) or DSM (Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH) reference IDs. For list B, all downloaded protein sequences were clustered by CD-HIT 7 at 60% sequence identity (Huang et al., 2010). In both lists, all archaea, eukaryote sequences and sequences with KEGG cross-reference or 3D structures were kept. Several Bacillus type-strains and the top two closest isolate XG196 ModA relatives and ORR isolate P. fluorescens N2E2 from non-contaminated area were also kept. ModA/WtpA sequences from list A (617 sequences, Supplementary Material list A) and B (4623 sequences, Supplementary Material list B) were all used for tree building. Multiple sequence alignment was done by Clustal Omega 8 (Madeira et al., 2019). IQ-tree were used to build the phylogenetic tree by maximum likelihood (Nguyen et al., 2014).
LG + F + R10 model and WAG + R9 model were selected for list A and B ModA tree building by ModelFinder (Kalyaanamoorthy et al., 2017). 2000 and 3000 times of bootstrapping was run for list A and B ModA tree using UFBoot (Hoang et al., 2017). Signal peptide prediction analysis was performed for all list A sequences by SignalP5.0 9 (Armenteros et al., 2019).

Multi-Alignment and Structural Modeling Analysis of ModA
Multiple sequence alignments of XG196 ModA and selected proteins were first run by Clustal Omega 10 (Madeira et al., 2019) and further analyzed with selected ModA proteins with structural data from the PDB 11 using ESPript 3.0 12 (Robert and Gouet, 2014). The structures of the ModA proteins from Pyrococcus furiosus ATCC 43587 ModA (PDB: 3CG1) and Escherichia coli K12 ModA (PDB: 1AMF) were used for comparison with XG196 ModA. Mean identity and mean similarity of protein sequences were also calculated by ESPript 3.0. SWISS-MODEL 13 (Brooks, 2001) was used to predict the model of XG196 using template ModA (PDB: 2H5Y) from Xanthomonas axonopodis pv. citri 306. UCSF Chimera 14 (Pettersen et al., 2004) was used to visualize the model.

Expression and Purification of Recombinant ModA Proteins
ModA genes were amplified by PCR from the genomes of XG196, P. fluorescens N2E2 and E. coli K12. The primers are listed in Supplementary Table S2. The forward primer for the ModA gene of XG196 was designed to omit the N-terminal 20 amino acids, which include a signal peptide and a putative 7 http://weizhongli-lab.org/cd-hit/ 8 https://www.ebi.ac.uk/Tools/msa/clustalo/ 9 http://www.cbs.dtu.dk/services/SignalP/ 10 https://www.ebi.ac.uk/Tools/msa/clustalo/ 11 https://www.rcsb.org/ 12 http://espript.ibcp.fr/ESPript/ESPript/ 13 https://swissmodel.expasy.org/ 14 http://www.cgl.ucsf.edu/chimera lipoprotein-attachment sight (Cys20). The forward primers for ModA genes of P. fluorescens N2E2 and E. coli K12 were designed to omit the N-terminal signal peptides, the first 23 and 25 amino acids, respectively. Signal sequences and lipoproteinattachment site were predicted by SignalP-5.0 15 (Armenteros et al., 2019). The PCR amplicons were cloned into the pET24a (+) plasmid (Novagen). ModA proteins were expressed in E. coli Rosetta 2 (DE3)pLysS (Novagen) cells in LB media supplemented with kanamycin (50 µg/ml). Recombinant gene expression was induced at an OD 600 ∼ 0.6 with 0.5 mM IPTG and the growth temperature was reduced from 37 to 25 • C. Cells were harvested after 16 h and resuspended in start buffer (Tris 20 mM, pH 7.6, 100 mM NaCl, 5 mM imidazole). Cells were lysed by sonication and centrifuged to remove unlysed cells. The supernatant fractions were loaded onto a HisTrap FF crude column (GE health care) pre-equilibrated with start buffer and washed with two column volumes of wash buffer (Tris 20 mM, pH 7.6, 100 mM NaCl, 30 mM imidazole) and the recombinant ModA proteins were then eluted with elution buffer (Tris 20 mM, pH 7.6, 100 mM NaCl, 300 mM imidazole). ModA proteins were further purified by gel filtration using a Superdex 200 HiLoad 16/60 prep grade column (GE health care) equilibrated with Tris 20 mM, pH 7.6, containing 250 mM NaCl. Fractions containing the purified ModA protein as determined by SDS-PAGE were buffer exchanged to a low salt buffer (Tris 20 mM, pH 7.6, 90 mM NaCl) using an Amicon Ultra-15 10K centrifugal filter device at 4 • C for 16 h for further ITC analysis. Mo in 40 µM of protein samples before and after dialysis were measured by ICP-MS. Trace grade of Tris (MilliporeSigma, St. Louis, MO, United States) and NaCl (MilliporeSigma, St. Louis, MO, United States) were used in protein purification and dialysis.

Isothermal Titration Calorimetry (ITC) Analysis
Molybdate (100 mM Na 2 MoO 4 ) and tungstate (100 mM Na 2 WO 4 ) stock solutions were prepared in trace grade ITC buffer (Tris 20 mM, pH 7.6, 90 mM NaCl) and then diluted to a final concentration of 0.3 or 0.4 mM using ITC buffer. ITC analysis was performed using a Malvern MicroCal PEAQ-ITC (Malvern Panalytical, Malvern, United Kingdom) at 25 • C. Molybdate or tungstate were injected into the sample chamber (300 µL) containing 30 or 40 µM ModA to give a final molar ratio of oxyanion to ModA of 2:1. Displacement titrations were carried out by titrating molybdate or tungstate with chromate-saturated ModA (containing twofold of chromate from Na 2 CrO 4 ) (Sigurskjold, 2000). Data were analyzed by Malvern MicroCal PEAQ-ITC analysis software (Malvern Panalytical, Malvern, United Kingdom). Each test was done twice and the average data were used.

Metagenome Annotation and Analysis of Nitrate-Reducing Bacteria in ORR
Previously published metagenome sequence reads of samples from ORR groundwater were obtained from the NCBI database under BioProject PRJNA513876 (Tian et al., 2020). Metagenomic reads were preprocessed using BBtools version 38.60 (no references known 16 ) to remove Illumina adapters, perform quality filtering and trimming, and remove PhiX174 spike-ins. The script bbduk.sh was run with parameters ktrim = r k = 23 mink = 11 hdist = 1 ref = adapters.fa tbo tpe 2 to remove any remaining standard Illumina adapters given in adapters.fa. The script was run again with parameters bf1 k = 27 hdist = 1 qtrim = rl trimq = 17 cardinality = t ref = phix174_Illumina.fa to perform quality filtering and trimming, and to remove Illumina PhiX174 spike ins given in the file phix174_Illumina.fa. We assembled the reads using SPAdes version 3.13.0 (Bankevich et al., 2012) with parameters -meta -k 21,33,55,77,99,127. We predicted proteincoding genes using Prodigal v 2.6.3 (Hyatt et al., 2010) with parameters -n -p single. Predicted protein-coding genes were annotated on the contigs using eggNOG mapper (v2) with default parameters (Huerta-Cepas et al., 2017). The number of predicted genes for each protein of interest was normalized by the number of raw reads obtained from metagenome sequencing.

Isolation and Physiological Characterization of Bacillus Strain XG196
In order to isolate nitrate-reducing microbes with a high affinity for molybdate from the metal-and nitrate-contaminated ORR site, sediments from the contaminated EB-106 vertical core were used for enrichment and isolation. This 8-m core, taken about 21 m downstream of the contamination source (the S-3 ponds), was cut into 22 cm segments under anaerobic conditions Moon et al., 2020). The EB-106 core covered the vadose zone (0-300 cm, the area between the land surface and water table), the capillary fringe (300-350 cm, the subsurface layer between vadose zone and the water table), and saturated zone (350-800 cm, the region below the water table) of the soil (Figure 1A). The groundwater passing through the saturated zone of the EB-106 core flows from the contamination site and is considered to be highly contaminated. A total of 88 unique nitrate-reducing bacteria were isolated from EB-106 sediment samples under nitrate-reducing conditions in a medium containing a combination of carbon sources (2 mM of formate, acetate, ethanol, lactate, succinate and glucose, together with 0.1 g/L yeast extract) and various levels of metal contaminants (no, 0.5 × MM or 1.0 × MM). Five strains, XG77, XG95, XG146, XG196 and XG201, were selected for further characterization based on their ability to grow anaerobically on nitrate, their nitrate reductase activities and metal resistance properties. XG77, XG95, and XG196 were identified as Bacillus strains, while XG146 and XG201 were identified as Ensifer and Enterobacter strains, respectively, by 16S rDNA sequences ( Figure 1B). All five were isolated from the contaminated saturated zone (below 350 cm) (Figures 1A,B).
Growth of the EB-106 isolates was determined under nitratereducing growth conditions in the presence of increasing concentrations of a single metal (Cd, Ni, Cu, Co, Mn, or U) or the MM metal mixture containing all six metals, which mimics the concentrations of metals found in the ORR contaminated groundwater (Supplementary Table S1). The effects of the metals on growth was determined by calculating the IC 50 values. Generally, strain XG196 had the highest metal tolerance of the five stains to the metal contaminants in the EB-106 sediments. Specifically, isolate XG196 had the highest IC 50 values when grown with Ni 2+ (119 µM), Co 2+ (220 µM), Mn 2+ (>900 µM), U 6+ (2,000 µM) and the metal mixture (1.2×) and the second highest IC 50 value when grown with Cu 2+ (94 µM) (Supplementary Figure S2). Strain XG196 also grew in the presence of very high concentrations of nitrate and nitrite, with IC 50 values of 299 and 99 mM, respectively (Supplementary Figure S2).
To analyze the dependence of growth under nitrate-reducing conditions on Mo, the five EB-106 strains and one strain previously isolated from non-contaminated ORR groundwater (P. fluorescens N2E2) were grown with increasing concentrations of molybdate in Mo depleted media prepared with trace metal grade chemicals in order to lower the amount of contaminating Mo to picomolar concentrations in cultures (∼400 pM, Ge et al., 2019). As shown in Figure 2A, strain XG196 showed the highest percentage of maximum growth (84% of highest OD 600 ) even when no Mo was added to the medium, while the other EB-106 sediment strains tested required at least 1 nM Mo to reach ≥ 74% of maximum growth. P. fluorescens N2E2, which has been used as a reference strain in other ORR contamination studies (Thorgersen et al., 2015;Ge et al., 2019), had the lowest percentage (as low as 43%) of maximum growth when less than 1 nM Mo was added.
Tungstate is a competitive inhibitor of molybdate transport (Grunden and Shanmugam, 1997;Hu et al., 1997;Self et al., 2001). A Mo/W competitive growth analysis of isolate XG196 and P. fluorescens N2E2 under nitrate reducing conditions showed that low concentrations of W (up to 50 nM) do not affect the nitrate-dependent growth of XG196 but limits the growth of P. fluorescens N2E2 to only 20% of the maximum (Figure 2B). At higher W concentrations, W inhibits nitrate-dependent growth of both isolate XG196 and P. fluorescens N2E2. However, strain XG196 requires the addition of less Mo to resume maximal growth. For example, when 5 µM W was added to their media, XG196 only required 50 nM Mo to reach maximum growth, but P. fluorescens N2E2 required at least two orders of magnitude more Mo (5,000 nM; Figure 2B). Our hypothesis is that XG196 has a much higher affinity for molybdate than the other strains tested, especially that of strain N2E2. The environment from which strain N2E2 was isolated has much higher molybdate concentrations (approximately 10 nM) than the contaminated groundwater (Mo < 1 nM) (Smith et al., 2015;Thorgersen et al., 2015). A higher affinity for molybdate could give a growth advantage to XG196 by nitrate reduction under Molimited conditions.

Genomic and 16S rDNA Analysis of XG196
The draft genome of strain XG196 contained 6,010,169 bp in 55 contigs longer than 500 bp with a 38.35% G + C content. A total of 5721 coding sequences were predicted. The genome sequencing information from strain XG196 was submitted to the National Center for Biotechnology Information (NCBI) genome database and the accession number is JABWSY000000000. Nitrate reduction-related genes were annotated in the XG196 genome, including for nitrate reductase (napA and napB), copper-containing nitrite reductase (aniA) and nitrous-oxide reductase (nosZ), while the gene encoding nitric oxide reductase (nor) was missing. Some assimilatory nitrate reduction-related genes (nasC, nasD, and nasE) were also present in the genome. Genes encoding the molybdate ABC transport system (modA and modB) were also identified. The 16S rDNA sequence (1487 bp) of strain XG196 identified the organism as a member of the Bacillus genus. In order to characterize it at the species level, a total of 190 16S rDNA sequences, which include those of two XG196 close relatives, 186 Bacillus type strains and one out group strain, were used to build a phylogenetic tree using maximum likelihood by IQ-TREE (Supplementary Figure S3)

Phylogenetic Analysis of the Molybdate Binding Protein (ModA) of XG196
ModA is the molybdate-binding protein component of the molybdate ModABC transporter. We hypothesize that the ability of XG196 to grow by nitrate reduction using an extremely low concentration of Mo [<1 nM in contaminated groundwater close to S-3 ponds area ] is because its ModA has an unusually high affinity for molybdate. Phylogenetic analysis of XG196 ModA based on protein sequences of about 600 ATCC and DSM strains, including those from Archaea, Bacteria and Eukaryota, showed that it is, indeed, distinct from those of the ORR isolate P. fluorescens N2E2 (N2E2) and of E. coli K12 (Figure 3). The same conclusion was reached by a similar analysis using over 4,000 strains (Supplementary Figure S4). Most ModA proteins in proximity to XG196 ModA on the phylogenetic tree originate from other Bacillus strains, most of which were also isolated from soil, but their sequence identities are only about 50% (Supplementary Table S4). The two closest relatives of XG196 ModA are from Rhodococcus qingshengii (entry: A0A4R6A6K9, 85.9% identity) and Bacillus sp. 7884-1 (entry: A0A268JZS1, 85.9% identity) by UniProt BLAST (Figure 4 and Supplementary Table S4 , and Azotobacter vinelandii (PDB: 1ATG, WO 2− 4 ) have been determined (Figures 3, 5) (Hu et al., 1997;Lawson et al., 1998;Santacruz et al., 2006). Each binds a single molybdate (or tungstate) ion. In addition, some archaea are able to utilize tungsten, a metal seldom used in biology, in their pyranopterin-containing enzymes (other than Mo-dependent nitrate reductase) (Cabello et al., 2004;Bevers et al., 2006). These tungsten-utilizing microorganisms take up tungstate using a transporter (WtpA) that is highly homologous to ModA (Supplementary Figure S5) Figure S5) (Hollenstein et al., 2009).
XG196 ModA was modeled using ModA (PDB: 2H5Y) from X. axonopodis pv. citri 306 as the template, which has 37% sequence identity with XG196 ModA and contains molybdate as the ligand in the crystal structure. Based on the residues invoved in molybdate binding in X. axonopodis pv. citri 306 ModA, Ser36, Ser63, Ala149, Val176, and Tyr194 of XG196 ModA are predicted to directly bind molybdate via hydrogen bonds (Figure 5 and Supplementary Figure S6). However, from the modeling it is not clear why the XG196 protein has increased affinity for the metal. Although archaeal and bacterial WtpA/ModA proteins are evolutionally distant, the residues involved in metal binding are partially conserved, suggesting a similar ligand binding mechanism (Figure 3 and Supplementary Figure S5). Multi-alignment analysis of the ModA proteins from E. coli K12, XG196, two close relatives of XG196 ModA (from Rhodococcus qingshengii and Bacillus sp. 7884-1), other Bacillus ModA proteins from phylogenetic analysis (Figure 3 and Supplementary Table S4) and of EB-106 isolate XG77, isolated from sediments of similar depth with isolate XG196 (Figures 1A,B), are shown in Supplementary Figure S7. The mean sequence identity and similarity of these ModA sequences is about 12.2 and 65.4%, respectively. The sizes of these ModA proteins are similar (about 250 residues) and their sequences are conserved at 11 out of 12 the molybdate binding residues found in E. coli K12 ModA (Ala34, Ala35, Ser36, Ser63, Ala82, Val147, Pro148, Ala149, Asp175, Val176, and Tyr/Phe194), the exception being position Ser/Gly/Ala62. It seems that ModA proteins are quite similar, particularly XG196 ModA and other Bacillus ModA proteins, and novel attributes of the XG196 protein are not obvious, especially in the deduced oxyanion binding site.

ITC Analysis of ModA Proteins
To determine their molybdate-binding properties, the genes encoding the ModA proteins from XG196, N2E2 and E. coli were expressed in, and the recombinant proteins were purified from, E. coli. ICP-MS analysis showed that XG196 ModA(40 µM) can naturally bind about 67 nM of Mo even when trace grade chemicals were used, higher than what N2E2 ModA (15 nM) and E. coli ModA (9 nM) can bind (Supplementary Figure S8). After dialysis in low salt ITC buffer, all ModA proteins can pick up a little bit more Mo from the ITC buffer (XG196 ModA to 82 nM, N2E2 ModA 19 nM, and E. coli ModA 10 nM). On average, XG196 ModA, N2E2 ModA and E. coli ModA bound 0.002, 0.0005, and 0.0003 of molybdate per protein, respectively, which are far away from being saturated. ITC analysis showed that these proteins contain a single binding site for molybdate (values were 1.10 ± 0.01, 0.95 ± 0.08, and 0.92 ± 0.01, respectively). However, the molybdate binding curves showed that XG196 ModA had a K D value for molybdate of 2.21 ± 1.03 nM, which is about one order of magnitude lower than those of N2E2 (27.0 ± 6.2 nM) and E. coli (25.01 ± 3.7 nM) ModA (Table 1 and Supplementary Figure S9). Hence, XG196 ModA has a much higher affinity for molybdate, consistent with results from the physiological study showing that XG196 is able to grow by nitrate reduction using Mo concentrations (<1 nM) that limit the growth of other bacteria, including N2E2. The tungstate-binding affinity of XG196 ModA was about fivefold higher than that for molybdate (K D 11.15 ± 1.34 nM), and about half of the tungstate dissociation constent values for the ModA proteins of N2E2 and FIGURE 3 | Phylogenetic analysis of ModA. Rooted phylogenetic tree of ModA and WtpA from Bacteria (pink, inner circle), Archaea (green, inner circle), and Eukaryota (blue, inner circle). Outer circle indicates different signal peptide type (LIPO_Sec/SPII in purple, TAT-Tat/SPI in yellow, SP_Sec/SPI in blue and other in gray). ModA proteins with PDB 3D structure data were indicated as black stars. Clades where XG196 ModA (blue clade), N2E2 ModA (purple clade), and E. coli ModA (green clade) belong to were also labeled in different colors.
E. coli (26.6 ± 2.0 and 23.7 ± 0.6 nM, respectively; see Table 1 and Supplementary Figure S9). The stoichiometry of tungstate binding to each of these proteins was also 1:1, as found for molybdate. Hence, the lower binding affinity for molybdate than tungstate of XG196 ModA is consistent with the better growth of the organism under nitrate-reducing conditions than N2E2 when tungstate is present (Figure 2B).

Gene Abundance of Mo-Related Proteins in ORR Groundwater
To better understand the utilization of Mo in the ORR environment, the abundances of ModA genes and genes encoding representative proteins from the four families of Mo proteins were analyzed in ORR groundwater samples from both contaminated and background wells. As shown in Table 2, the abundance of Mo-related genes are generally higher in ORR contaminated groundwater samples. In particular, the abundance of modA (encoding ModA) and napA/narG (encoding dissimilatory nitrate reductase Mo-containing subunit) are significantly higher in contaminated well FW021, FW104 and FW106 (modA 27 to 39.9 copies per 10 8 reads, napA 11.2 to 32.5 copies per 10 8 reads, and narG 26.4 to 42.6 copies per 10 8 reads) than in background well FW300, FW301, and FW305 (modA 4.1 to 27 copies per 10 8 reads, napA 1.6 to 8.9 copies per 10 8 reads, and narG 1.8 to 10.5 copies per 10 8 reads). In contrast, the abundance of nasA (encoding assimilatory nitrate reductase Mo-containing subunit), dmsA (encoding DMSO reductase Mocontaining subunit), xdhB (xanthine oxidase/dehydrogenase) and sorA (encoding sulfite oxidoreductase Mo-containing subunit) were only slightly higher in contaminated wells, while the abundance of nifK (encoding nitrogenase) is similar in both contaminated and background wells ( Table 2). The higher
abundance of modA, napA, and narG relative to other Mo-related protein genes in the contaminated wells is likely an adaptive advantage given the high nitrate concentrations (0.02-13.3 mM), which are about 1000-fold higher than in the background wells (0.1-1.8 µM).

DISCUSSION
The ORR S-3 ponds contamination plume is unique as it contains high concentrations of nitrate (up to 230 mM in groundwater) and various metals (Cd, Ni, Cu, Co, Mn, U, etc.) at low pH (∼3) (Brooks, 2001;Revil et al., 2013). Yet, we previously showed that in this unique environment, Mo is generally limiting for microbial nitrate reduction (Thorgersen et al., 2015). Previous studies have revealed that complex microbial communities survive in this contaminated site (Abulencia et al., 2006;Vishnivetskaya et al., 2011). The overall goal of this research was to elucidate the molecular mechanisms that give certain microorganisms competitive advantages in these extreme habitats. Bacillus strain XG196 was isolated from contaminated core EB-106 that was drilled adjacent to the origin of the contamination (the S-3 ponds). XG196 was shown to grow by nitrate reduction in the presence of an exceedingly low concentration of Mo that contaminated its defined medium from the inoculum and the chemicals that make the media (to which no Mo was added). The ability to grow with limited Mo appears to be due to its molybdate-binding protein, ModA, which has a very high affinity for molybdate (K D ∼ 2 nM). This is the lowest K D value yet reported for any ModA to date and it is also the first ModA characterized from a Bacillus strain. Previous studies have typically reported molybdate affinities with ModA proteins that are more than an order of magnitude lower (Corcuera et al., 1993;Bevers et al., 2006;Smart et al., 2009;Aryal et al., 2012). A similarly high affinity but for tungstate was reported for the Wtp protein of W-dependent P. furiosus, a member of the archaea domain. Its binding affinity of molybdate is about fivefold lower (K D = 11 ± 5 nM) than that found here for XG196 ModA (Bevers et al., 2006). The molybdate binding affinity of E. coli ModA measured in this study (∼25 nM) is consistent with what has been reported by others (K D = 20-26 nM; Corcuera et al., 1993;Imperial et al., 1998). The K D value for molybdate of N2E2 ModA is about 27 nM, consistent with the poor nitrate-reducing growth observed in Mo-limited media compared to XG196. In addition, the ModA proteins of E. coli and N2E2 have very similar K D values for both molybdate and tungstate, hence, neither protein is able to distinguish between these two oxyanions, consistent with what has been reported for E coli ModA (Rech et al., 1996;Imperial et al., 1998). In contrast, XG196 ModA has a fivefold higher affinity for molybdate compared to tungstate, which could give the organism a selective advantage in scavenging molybdate for growth in the presence of tungstate as seen in the Mo/W competition growth studies herein (Figure 2).
Phylogenetic analysis showed that XG196 ModA is distinct from previously described ModA proteins, including that of E. coli K12 (Corcuera et al., 1993;Aryal et al., 2012), the WtpA/ModA proteins from the bacterium Azotobacter vinelandii (Lawson et al., 1998), and the archaea P. horikoshii (Hollenstein et al., 2009) and P. furiosus (Bevers et al., 2006). Multialignment analysis indicates that XG196 ModA is quite similar to those of other Bacillus species based on their sequence (65.4% mean similarity) and their deduced oxyanion binding sites (Supplementary Figure S7). However, it is hard to conclude that all of these Bacillus ModA proteins have molybdate affinities as high as that of XG196 ModA since the molybdate-binding residues are highly conserved. Unfortunately, modeling of XG196 ModA (Supplementary Figure S6) did not shed light on why it has a much higher affinity for molybdate than structurallycharacterized proteins. ModA proteins contain a signal peptide at the N terminus that enables the protein to be transported across the membrane. ModA signal peptides fall into one of four different groups: Sec/SPI, Sec/SPII, Tat/SPI, and other (Nielsen et al., 2019). Surprisingly, the ModA from XG196 grouped with the ModA proteins from archaea, and these are all predicted to be lipopeptides and belong to the Sec/SPII group, while N2E2 and E. coli ModA proteins belong to the Sec/SPI group with non-lipopeptides (Figure 3). Substrate-binding lipoproteins are widely observed in gram-positive bacteria (Sutcliffe and Russell, 1995;Hutchings et al., 2009). It is believed that the lipopeptides can tether substrate-binding proteins in order to prevent their loss into the growth environment because of the absence of the retentive outer membrane in gram-positive bacteria (Sutcliffe and Russell, 1995). At present, not enough information is available to distinguish "high" affinity molybdate transporters (like XG196 ModA) from "low" affinity ones (like those of N2E2 and E. coli ModA) based only on sequence similarity or the deduced molybdate-binding residues. Structural determinations of high affinity ModA proteins in addition to that of XG196 will be required to elucidate the molecular basis as to why these particular proteins bind molybdate so tightly.
Strain XG196 exhibited higher nitrate reductase specific activity than the other EB-106 strains XG95, XG146, and XG201 (Supplementary Figure S1) and accumulated more Mo in its cytoplasm than XG77, XG95, XG146, and XG201 (Supplementary Figure S10). This could be the result of the higher molybdate affinity of its ModA, which must provide more than sufficient Mo for the biosynthesis of functional pyranopterin cofactor in nitrate reductase (Schwarz and Mendel, 2006) when Mo is limited in the environment. XG196 also accumulated the second highest concentration of Mo in the membrane fractions compared to the other EB-106 strains, which might be the result of a high nitrate reductase concentration in the membrane because of more than sufficient Mo taken up from environment. However, these results might not be directly related to the high affinity of ModA for molybdate. Nitrate reductases with high specific activities or high affinity molybdate storage proteins described in previous studies (Pienkos and Brill, 1981;Grunden and Shanmugam, 1997) could also contribute to XG196 being able to grow robustly under nitrate reducing conditions with limited Mo. Further study is required to clarify this issue.
Mo is removed from groundwater in the ORR contaminated area but not from the non-contaminated area as a result of Fe and Al precipitation . The low Mo concentrations (picomolar range) in the ORR contaminated environment is unusual but not unique. Low Mo concentrations (5-70 nM) occur in naturally-acidic groundwater (pH 2.4 to 2.9) (Nordstrom, 2015), in an acid mine drainage (<10 nM) (Sánchez-España et al., 2016), in harbors (<20 nM) as a result of sedimentary processes (Morford et al., 2007), and in various aquifers, including the Yorkshire Chalk aquifer (<10 nM) due to co-precipitation with or adsorption to sulfide minerals under strong reducing conditions (Smedley et al., 2014). These environments have significantly lower Mo concentrations than most freshwater and open seawater systems, which are typically > 300 nM (Smedley and Kinniburgh, 2017). Limiting Mo concentrations in natural water systems could lead to other environmental problems, for example, by affecting critical steps in the nitrogen cycle, such as nitrate reduction, leading to nitrate accumulation or to slowing down of nitrate removal from contaminated water or soil systems.
There are several factors that affect nitrate reduction in the ORR contaminated environment besides lack of the essential metal Mo. These include the acidic conditions, high nitrate concentrations, the presence of heavy metal contaminants, and limited availability of carbon sources to serve as electron donors for nitrate reduction (Smith et al., 2015;Thorgersen et al., 2015;Ge et al., 2019). Other factors, such as O 2 concentrations in the soil and groundwater (Zumft, 1997;Qu et al., 2016), temperature and denitrifier community composition (Wallenstein et al., 2006), can also affect the efficiency of nitrate reduction. Meanwhile, the higher abundance of genes encoding the molybdate transport protein (modA) and assimilatory nitrate reductase Mo-containing subunits (napA/narG) in nitrate-contaminated wells indicates enhanced nitrate reduction in the ORR contaminated groundwater. The high abundance of modA could result in a greater uptake of molybdate into cells for the biosynthesis of dissimilatory nitrate reductase, enabling microorganisms to survive in the nitrate-contaminated and Mo-limited ORR environment. These numerous complex environmental factors make it difficult to study the relationships between nitrate reduction and natural microbial communities. There are therefore many unanswered questions at present that can be addressed in part by characterizing novel microbial stains with unique molecular mechanisms, as reported here for XG196 and its ModA protein.
Such microorganisms could also be instrumental in developing novel methods to remove contaminating nitrate in complex waste environments.

DATA AVAILABILITY STATEMENT
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: "https://www.ncbi.nlm. nih.gov/nuccore/JABWSY000000000."