Metagenomic analysis reveals that modern microbialites and polar microbial mats have similar taxonomic and functional potential

Within the subarctic climate of Clinton Creek, Yukon, Canada, lies an abandoned and flooded open-pit asbestos mine that harbors rapidly growing microbialites. To understand their formation we completed a metagenomic community profile of the microbialites and their surrounding sediments. Assembled metagenomic data revealed that bacteria within the phylum Proteobacteria numerically dominated this system, although the relative abundances of taxa within the phylum varied among environments. Bacteria belonging to Alphaproteobacteria and Gammaproteobacteria were dominant in the microbialites and sediments, respectively. The microbialites were also home to many other groups associated with microbialite formation including filamentous cyanobacteria and dissimilatory sulfate-reducing Deltaproteobacteria, consistent with the idea of a shared global microbialite microbiome. Other members were present that are typically not associated with microbialites including Gemmatimonadetes and iron-oxidizing Betaproteobacteria, which participate in carbon metabolism and iron cycling. Compared to the sediments, the microbialite microbiome has significantly more genes associated with photosynthetic processes (e.g., photosystem II reaction centers, carotenoid, and chlorophyll biosynthesis) and carbon fixation (e.g., CO dehydrogenase). The Clinton Creek microbialite communities had strikingly similar functional potentials to non-lithifying microbial mats from the Canadian High Arctic and Antarctica, but are functionally distinct, from non-lithifying mats or biofilms from Yellowstone. Clinton Creek microbialites also share metabolic genes (R2 < 0.750) with freshwater microbial mats from Cuatro Ciénegas, Mexico, but are more similar to polar Arctic mats (R2 > 0.900). These metagenomic profiles from an anthropogenic microbialite-forming ecosystem provide context to microbialite formation on a human-relevant timescale.

Biologically-induced mineralization involves the microbial alteration of water chemistry causing mineral saturation and precipitation (Dupraz et al., 2009). Microbial processes that cycle carbon, particularly within microenvironments, are important for inducing carbonate precipitation under appropriate chemical conditions (e.g., alkaline pH and sufficient cations; Spanos and Koutsoukos, 1998;Dupraz et al., 2009). For instance, cyanobacteria can cause alkalinization through photosynthesis, thereby driving pH to more alkaline conditions that favor carbonate precipitation (Thompson and Ferris, 1990). (Equation 1) HCO − 3 + H 2 O + hv → CH 2 O + OH − + O 2 ↑. Microbial cell walls and exopolymeric substances (EPS) may provide surfaces for mineral nucleation and aid in concentrating cations (e.g., Ca 2+ ) due to adsorption by negatively charged functional groups (e.g., R-COO − ) (Schultze-Lam et al., 1996). Additionally, heterotrophic bacteria can increase the availability of dissolved inorganic carbon (DIC) for carbonate precipitation through the degradation of organics (Von Knorre and Krumbein, 2000).
Although aragonite is supersaturated in the Clinton Creek open-pit pond, studies of non-marine environments exhibiting calcifying cyanobacteria show that a 9.5 to 15-fold supersaturation with respect to calcite is required for precipitation to occur (Arp et al., 2001). Such biological activity, especially in microenvironments where carbonate precipitation may be occurring, may increase pH, and/or increase cation and DIC concentrations.
Microbial processes that cycle carbon may also induce carbonate precipitation under certain geochemical conditions (e.g., alkaline pH and sufficient cations; Dupraz et al., 2009). Photosynthesis by cyanobacteria may result in the alkalization of microenvironments by producing hydroxyl anions, causing an increase in pH (Thompson and Ferris, 1990;Ludwig et al., 2005;Tesson et al., 2008); whereas, degradation of organics may increase the concentration of DIC (Slaughter and Hill, 1991;Van Lith et al., 2003).
In the present study we examined the microbial communities associated with microbialites found in a flooded and abandoned open-pit asbestos mine (64 • 26 ′ 42 ′′ N, 140 • 43 ′ 25 ′′ W) referred to as Clinton Creek, and located in the subarctic, ∼77 km northwest of Dawson City, Yukon, Canada (Figure 1), and which was previously studied to elucidate the geology of asbestos deposits (Htoon, 1979) and for its potential for sequestering carbon dioxide in mine wastes (Wilson et al., 2009). The microbialites at Clinton Creek are unusual in that they have estimated accretion rates of up to ∼5 mm per year (Power et al., 2011a), much higher than other modern microbialite-forming systems including Highbourne Cay (∼0.33 mm per year) (Planavsky and Ginsburg, 2009), Shark Bay (0.4 mm per year) (Chivas et al., 1990), and Pavilion Lake (0.05 mm per year) (Brady et al., 2009). Consequently, the Clinton Creek microbialites should be excellent models for understanding the biological processes responsible for microbialites formation.
Studies have examined the diversity of microbialites using both metagenomic and 16S rDNA sequencing. Metagenomic studies have focused mainly on marine systems (Reid et al., 2000;Burns et al., 2004;Papineau et al., 2005;Allen et al., 2009;Goh et al., 2009;Khodadad and Foster, 2012;Mobberley et al., 2013), with Cuatro Ciénegas being the only reported metagenomic investigation of freshwater microbialites (Breitbart et al., 2009). In contrast, 16S rDNA sequencing has been used to examine the diversity of freshwater microbialites in Lake Van (López-García et al., 2005), Pavilion Lake (Russell et al., 2014), Ruidera Pools (Santos et al., 2010), Lake Alchichica (Couradeau et al., 2011), and Cuatro Ciénegas (Centeno et al., 2012). The extent to which microbialite communities are similar or different from those in surrounding sediments and waters remains an unresolved but important question. The microbial communities in the surrounding sediments provide the environmental context needed to better constrain common and unique aspects of microbialite community structure and function. This information may be used to uncover conserved patterns of microbial community assembly and the metabolic pathways mediating microbialite formation under different environmental conditions.
In the present study, we use a metagenomic approach to explore the structure and function of Clinton Creek microbialites in relation to adjacent sediments in order to examine the metabolic drivers of microbialite growth in this freshwater system. We focus on metabolic pathways mediating photosynthetic or heterotrophic carbonate precipitation and the taxonomic distribution of these pathways. We then compare the metagenomic data from Clinton Creek microbialites and sediments to diverse non-lithifying mats, sediments, and microbialites to better define the conserved microbialite community structure and function.

Site Description and Water Chemistry
The conditions at the Clinton Creek mine are highly conducive to microbialite formation and are described extensively in  Power et al. (2011a), and are summarized briefly below. The photic zone likely occupies the full depth of the open pit pond and there is minimal nutrient input due to the lack of surrounding soil. Sediments are composed of chrysotile, quartz, muscovite, kaolinite, as well as minor amounts of aragonite and trace calcite. The microbialites are columnar, up to 15 cm in height, and are primarily composed of aragonite with spherulitic fabric (Figure 1D). The open pit water is subsaline (Na + 17.6-35.7 mg L −1 and K + 2.7-5.2 mg L −1 ), oligotrophic (undetectable phosphate), alkaline in pH (8.4), possessing a cation concentrations distribution of Mg 2+ >> Ca 2+ >> Na + > K + > Si 4+ , while anions concentrations were SO 2− 4 >> DIC > Cl − ( Table 1). As is common in microbialite forming systems (Dupraz et al., 2009;Lim et al., 2009), the water is oligotrophic with very low iron concentrations and undetectable phosphate which is common in microbialite forming systems (Dupraz et al., 2009;Lim et al., 2009). The water is supersaturated with respect to aragonite (saturation index = 0.6), the dominant mineral forming the microbialites, as well as calcite [CaCO 3 ].

Sampling, DNA Extraction, Purity, and Concentration Measurements
Microbialites and sediment samples were obtained in July 2011. Triplicate sediment and microbialite samples were taken ∼10 m apart. Microbialites were ground with mortar and pestle under liquid nitrogen prior to DNA extraction. Community genomic DNA was extracted from triplicate 10 g microbialite and sediment subsamples using a PowerMax soil DNA isolation kit (MoBio Laboratories, Inc., Carlsbad, CA, USA), following the manufacturer's instructions. DNA concentrations were determined using a Nanodrop-3300 (ThermoFisher, Nandrop Wilmington, DE) with PicoGreen R reagent according to the manufacturer's instructions (Invitrogen, Carlsbad, CA). Purity of extracted DNA and samples was determined by absorbance (260/280 and 260/230 ratios) using a Nanodrop-1000 (ThermoFisher, Nandrop Wilmington, DE). Genomic DNA from each replicate was pooled prior to Illumina library construction.

Illumina Hiseq/Miseq Library Construction Quality Control and Quantification
For Illumina library construction, DNA was sheared by ultrasonication (Covaris M220 series, Woburn, MA), and the fragments end-paired, A-tailed (Lucigen NxSeq DNA prep kit, Middleton, WI), and ligated to TruSeq adapters (IDT, Coralville, Iowa); small fragments were removed twice using magnetic beads (Beckman Coulter, Danvers, MA) (White III et al., 2013a,b;White III and Suttle, 2013). No PCR enrichment was used to amplify libraries to avoid PCR duplication bias. Libraries were checked for size and adapter-dimers using a Bioanalyzer HighSens DNAchip (Agilent). Libraries were Year pH Alkalinity

Analysis of Illumina Sequencing Data
Raw Illumina data was screened for PhiX spike-in contaminants sequencing data using Bowtie2 then removed using Picard tools (White III et al., 2013a,b;White III and Suttle, 2013). Reads were quality checked using FastQC, then paired-end reads merged by FLASH and assembled with the Ray assembler (kmer size: 39 and 55) (Boisvert et al., 2010(Boisvert et al., , 2012White III et al., 2013a,b;White III and Suttle, 2013). The assemblies were selected based on the number of contigs (>100 bp), N50/N90 values, longest contig, and total length (bp) of the assembly ( Table 2). Based on these analyzes, a kmer size 39 was used for all further analysis ( Table 2). A kmer size of 55 generally yielded longer but fewer contigs, which would not allow for a comparable differential analysis between microbialites and sediments ( Table 2). Only contigs with >2x read coverage were used in analysis with average coverage of 3x for both the microbialite and sediment contigs. Nevertheless, only 0.64 and 1.74% of the raw reads from the sediment and microbialite metagenomes, respectively, assembled into contigs, indicating that both environments had complex microbial communities. FragGeneScan was used to predict and translate contig open reading frames (ORFs) (Rho et al., 2010) and ProPas (Wu and Zhu, 2012) was used to calculate predicted protein isoelectric points (pI). The assemblies were annotated using Metagenomic Rapid Annotations using Subsystems Technology (MG-RAST) (Meyer et al., 2008). MG-RAST analysis of the contigs, used BLAT (BLAST-like Alignment Tool) annotations based on hierarchical classification against M5RNA (MG-RAST ribosomal specific database), SEED subsystems and RefSeq databases with a minimum e-value cutoff of 10 −5 , a minimum percent identity cutoff of 60%, and a minimum alignment length cutoff of 15 base pairs. The SEED, RefSeq and M5RNA (MG-RAST rRNA database) classifications were normalized using relative count abundances for each sample. Principal component analysis (PCA) for the normalized RefSeq classifications (top 25) used R (version 3.0.3) libraries Ecodist (dissimilarity-based functions for ecological analysis), and pvclust (hierarchical clustering with pvalues via multiscale bootstrap resampling) using ward clustering and Bray-Curtis distance metric at a thousand replicates (Suzuki and Shimodaira, 2006). PCA for the normalized RefSeq classifications was plotted using R library ggplot2 (Wickham, 2009). A dotplot of the normalized RefSeq classifications (top 25), was completed using R libraries Reshape2, using the melt function, then plotted using ggplot2 (Wickham, 2009). The annotations were parsed by custom python scripts and analyzed using statistical analysis of metagenomic profiles (STAMP) (Parks and Beiko, 2010). MG-RAST annotations using SEED subsystems for Clinton Creek microbialite and sediment contigs were loaded into STAMP and compared for metabolic potential using a one sided G-test (w/Yates' + Fisher's), alternative to the chi-squared, with asymptomatic confidence intervals (0.95) using Benjamini-Hochberg FDR procedure (Parks and Beiko, 2010). In addition to MG-RAST, metabolic pathways were predicted using MetaPathways, a modular pipeline for gene prediction and annotation that uses pathway tools and the MetaCyc database to construct environmental pathway/genome databases (ePGBDs) (Konwar et al., 2013). Metapathways uses the seedand-extend homology search algorithm LAST (local alignment search tool) for annotations of ORFs with a minimum of 180 bp and minimum alignment length cutoff of 50 (Kiełbasa et al., 2011). Venn diagrams were constructed from predicted MetaCyc pathways based on normalized pathway size and the number of ORFs associated with each pathway using R libraries then plotted using ggplot2 (Wickham, 2009).

Metagenomic Data Depositing
All the data used in this study is freely available and available for public access from the MG-RAST metagenomics analysis server. From MG-RAST, it is listed in the project name Yukon microbialites under the names Clinton Creek microbialite (ID 4532705.3) and sediment contigs (ID 4532704.3).

Non-database Based Community Properties
Based on GC content Clinton Creek the microbialite microbial communities are clearly distinct from the sediment microbial communities. The GC content was higher in the microbialites compared to the sediments; whereas, protein isoelectric points (pI) were similar ( Figure S1). The GC content was lower in sediments likely due to the higher presence of low GC containing microbes belonging to phyla such as Bacteroides and Firmicutes. The higher GC content in the microbialite data is likely due to the high relative abundance of sequences assigned to anoxic photoheterotrophic Alphaproteobacteria, including Rhodobacterales (59-65% GC content) and Rhodomicrobium (62.2% GC content). GC content across bacterial genomes can be highly variable amongst microbial taxa, although amino-acid usage is typically similar, which is similar to our data on GC and pI content observed in the metagenomes (Lightfield et al., 2011; Figure S1).

Community Composition
The microbial communities within Clinton Creek are distinct from each other (Figure 2) and are dominated by differing compositions of Proteobacteria and Cyanobacteria.
Proteobacteria comprised >50% of the ORFs and >35% of the 16S sequences recovered from the Clinton Creek sediments and microbialites (Figure 2). The microbialite contigs were dominated by anoxic photoheterotrophic Alphaproteobacteria (e.g., Rhodobacterales) (Figure 2A). In contrast, sediments contigs had greater abundance nitrogen-fixing Gammaproteobacteria (e.g., Pseudomonas spp.) (Figure 2A). Alphaproteobacteria are commonly found amongst, and are likely a critical component of, the microbialiteforming microbial consortium due to their role in nitrogen fixation, even in the presence of heterocystous cyanobacteria (Havemann and Foster, 2008). It has been suggested that prior to the evolution of cyanobacteria, anoxic phototrophs like Rhodobacterales could have had a role in the formation of Precambrian stromatolites (Bosak et al., 2007).
Deltaproteobacteria represented ∼10% of the predicted Proteobacterial ORFs (based on RefSeq) from the sediments and microbialites (Figure 2A). The microbialite contigs consisted mainly of Myxococcus spp. whereas, members of the Desulfuromonadales dominated in the sediments. Myxococcus spp. are abundant in a variety of microbialite-forming systems and can mediate precipitate carbonate through the release of ammonium (Jimenez-Lopez et al., 2011). The microbialites and sediments had similar representation from Desulfurovibrionales, Desulfobacterales and Syntrophobacterales, the major orders of dissimilatory sulfate reducers. The dissimilatory sulfate-reducers in the Deltaproteobacteria may be critical drivers of the "the alkalinity engine, " thereby inducing carbonate precipitation (Gallagher et al., 2012). Finding the major dissimilatory sulfate-reducing groups of bacteria (Desulfurovibrionales, Desulfobacterales, and Syntrophobacterales) in Clinton Creek microbialites is not surprising; however, their abundances were similar in the sediments, including genes involved in dissimilatory sulfate-reduction. Thus, the sediments and ground water likely generate alkalinity and could also be the source(s) of these dissimilatory sulfate-reducing bacteria in microbialites. For example, sulfate-reducing bacteria may be transported as spores from other environments and then disperse in microbialite cyanobacterial mats.
Cyanobacteria were the fourth most abundant group, comprising 6.1% of the total contigs in the microbialites (Figure 2A). The microbialites had 4-fold more protein coding ORFs that were classified as cyanobacteria than the sediments (6.1-1.4%, Figure 2A). The cyanobacterial ribosomal sequences were detected in the microbialites only (based on M5RNA database) and from only filamentous cyanobacteria genera, which include Tolypothrix, Leptolyngbya, and unclassified Antarctic cyanobacteria. In contrast, no Cyanobacteria ribosomal sequences (e.g., rDNA) were recovered from the sediments (Figure 2A; M5RNA). No Cyanobacteria ribosomal sequences (e.g., rDNA) were detected in the sediments due to very low abundance sediments. Microbialite contigs based on RefSeq classification had higher abundances of filamentous genera including Microcoleus, Lyngbya, Nodularia, and Anabaena, and more unicellular calcifying Synechococcus, than the sediments. The sediments had fewer filamentous cyanobacteria genera as a whole and fewer unicellular cyanobacteria (e.g., Synechococcus) FIGURE 2 | Microbial community structure of Clinton Creek metagenomes. (A) Dot pot of representative taxonomic groups from Clinton Creek sediments and microbialites using RefSeq (protein coding ORFs) and M5RNA (rRNA, MG-RAST rRNA database) in log relative abundances. Samples were clustered (top) by ward clustering matrix using bootstraping of one thousand replications with Bray-Curtis distance cut-offs. "Other" denotes low abundance taxa that were <1% of the total ORF or rRNA, individually, but were all combined here into one point. (B) PCAs of top 25 taxonomic groups from Clinton Creek sediments and microbialites by RefSeq (ORFs) and M5RNA (rRNA, MG-RAST rRNA database) classification using ward clustering matrix followed by bootstrapping of one thousand replications with Bray-Curtis distance cut-offs.
contigs. Cyanobacteria likely drive microbialite formation in Clinton Creek by increasing carbon biomass in the form of carbon-rich EPS, which supports the growth of the entire heterotrophic microbial consortium through carbon fixation, which in turn contributes to carbonate precipitation by increasing the saturation index (Dupraz and Visscher, 2005;Braissant et al., 2007;Dupraz et al., 2009;McCutcheon et al., 2014).
The phylum Gemmatimonadetes was present in both the sediments and microbialite contigs, and comprised 7-8% of the protein coding ORFs (Figure 2A). To our knowledge, this is the first report of protein sequences from Gemmatimonadetes in microbialites based on metagenomic data, although they were not restricted to that environment. The Gemmatimonadetes contigs annotated mainly as hypothetical proteins; however, positive Gemmatimonadetes annotations included ATPases, Zn-dependent peptidases and glucose/ sorbosone dehydrogenase-like genes. Glucose/sorbosone dehydrogenases transform various sugar moieties into vitamins, including L-ascorbic acid (vitamin C), or can make D-glucono-1,5-lactone from D-glucose, which can acidify the extracellular environment, which may lead to dissolution of carbonate by heterotrophic process (Dupraz and Visscher, 2005;Miyazaki et al., 2006;Fender et al., 2012). Although their estimated relative abundance is not high, this could in part be because there are few representative Gemmatimonadetes genomes in databases. Ultimately, whether they are involved in microbialite formation, or are just opportunists, or lead to dissolution, needs to be elucidated.
The microbialite and sediment microbial communities were dominated by bacteria with low abundances of eukaryotes and archaea (Table 3). From RefSeq annotations, <1% of microbialite and sediment contigs were of archaeal origin ( Table 3, RefSeq), and no archaeal ribosomal genes were detected in either the sediment or microbialite contigs (  Cay marine microbialites and the freshwater microbialites from Cuatro Ciénegas, had low abundances (<1%) of archaea and eukaryotes (Breitbart et al., 2009;Khodadad and Foster, 2012;Mobberley et al., 2013). Eukaryotes were rare, as they make up <1% of the sediment and microbialite contigs of Clinton Creek (Table 3), although common taxa such as diatoms, dinoflagellates, cryptomonads, chlamydomonadales, and fungi were detected. Diatoms and other protists have been observed by microscopy and detected in the metagenomic data from Clinton Creek, but their contribution to the formation of microbialite structures requires further study (Power et al., 2011a). Diatoms may influence carbonate precipitation through photosynthetic alkalinization (Tesson et al., 2008), akin to processes found in cyanobacteria, and/or through the ammonification of amino acids (Castanier et al., 1999). Clinton Creek microbialites had very low sequence abundance (<0.1%) of metazoans including nematoda, cryptomonads, platyhelminthes, microsporidia, cnidaria (e.g., hydra) and arthropods (e.g., insects). Our sequence data supports prior microscopy data that similarly showed low abundances of metazoans (Power et al., 2011a). With such a low metazoan abundance, the destructive impact of grazing on the Clinton Creek microbialites is presumably very low. Phosphorus was undetectable down to the parts per million detection limit in Clinton Creek (data not shown). It has been suggested that limitation of phosphorus affects metazon growth in microbialites (Elser et al., 2005). Metazoan grazing is the "prime suspect" in the global decline of microbialites as they remove cyanobacterial mats, thereby negatively impacting microbialite formation by removing the main carbon source and structural components (Grotzinger, 1990).

Metabolic Potential
The metabolic potential of the Clinton Creek microbialite metagenome predicts photosynthetic dominance, whereas the sediment metagenomes contained more heterotrophic metabolism (e.g., respiration) ( Figure 3A). SEED subsystem level I (i.e., highest functional classification group) annotations indicated that carbohydrate metabolism relating to carbon fixation, DNA metabolism and photosynthesis pathways were significantly more abundant in the microbialites than sediments (Figure 4A, FDR p < 0.01). Lower level SEED subsystem predictions (level III) further revealed a higher abundance of photosynthetic pathways (e.g., photosystem II reaction centers and carotenoids and chlorophyll biosynthesis) in microbialites than sediments (Figure 3B, FDR p < 0.01). These photosynthetic pathways in microbialites were annotated as filamentous cyanobacteria genera such as Microcoleus, Lyngbya, Nodularia, and Anabaena, which were not found in the sediments.
The metapathway pipeline was used for MetaCyc pathway annotations to complement SEED functional gene annotations. MetaCyc predicted pathways revealed that most pathways were shared between microbialites and sediments ( Figure 4A). Only 13 pathways were restricted to the sediments and 240 pathways were identified in the microbialites, while 358 pathways were shared ( Figure 4A). The hundred most abundant shared pathways were housekeeping genes with functions such as protein, nucleic acid, lipid, and carbohydrate biosynthesis and degradation ( Figure 4B). In the microbialites, MetaCyc annotations predict higher levels of glutamine degradation I, which results in the donation of nitrogen in the form of ammonium, while glutamine biosynthesis appears to be higher in the sediments ( Figure 4B). Both the sediments and the microbialites are able to recycle ammonium through ammonium assimilation cycle I-II ( Figure 4B). Ammonium donation provides nitrogen, which feeds the primary photosynthetic production of the filamentous cyanobacterial mats in microbialites, which in turn could lead to further carbonate precipitation.
Isotopic analysis of the carbonate minerals composing the microbialite may indicate a dominant process, e.g., alkalinization by phototrophs vs. increased CO 2 supply via heterotrophic degradation of organic matter. However, microbialites form though complex interactions between the physical and chemical factors with the microbial community. For instance, calcite composing the Pavilion Lake microbialites is enriched in 13 C by 2.5 ± 0.5‰ relative to calcite that may precipitate in isotopic equilibrium with lake water DIC, (Brady et al., 2009), indicating that alkalinization driven by cyanobacteria. The biomass-associated aragonite within the Clinton Creek microbialites was modestly enriched in 13 C by 0.8‰ relative to aragonite exhibiting no biomass, which is indicative of carbonate precipitation in association with phototrophs, including cyanobacteria (Power et al., 2011a). Electron microscopy of the microbialites confirmed that phototrophs were associated with carbonate that is enriched in 13C (Power et al., 2011a). A greater proportion of heterotrophic activity within the microbialites may explain why microbialite aragonite was isotopically lighter than periphyton found in the open pit. Omelon et al. (2013) hypothesize that microbialites become progressively lithified as the photosynthetically derived carbonate becomes in-filled through subsequent carbonate precipitation by heterotrophic activity. Similarly, Andres et al. (2006) suggest heterotrophs play a more direct role than phototrophs in the lithification stromatolites from Highborne Cay, Bahamas as indicated by isotopically light aragonite (Andres et al., 2006).

Comparative Metagenomic Analysis
Clinton Creek microbialite and sediment metagenomes are more functionally related to polar mats than microbialites isolated from marine or tropical ecosystems. Using SEED subsystem level III, PCA indicates better clustering to polar mats and sediments from the Arctic and Antarctica (Figure 5). The Clinton Creek samples cluster most closely Markham Ice shelf and Ward Hunt Ice shelf mats isolated from the Canadian High Arctic (Figure 5; Varin et al., 2012). Markham mats are functionally the most similar based on strong correlation to Clinton Creek microbialites based on SEED subsystem level III (Figure 6, R 2 = 0.952). Overall, polar mats (e.g., Markham, Ward Hunt, and McMurdo) had strong correlation of pathways in SEED than other ecosystems (Figure 6, R 2 > 0.900). Markham mats like Clinton Creek microbialites are dominated by Proteobacteria, and have Gemmatimonadetes present at 3% of the total 16S clones (Bottos et al., 2008). Polar mats, whether on microbialites or on ice shelves, appear to have functional gene similarities; this likely relates to handling shifts in temperature, including temperatures well below freezing (−10 • C; Varin et al., 2012).
SEED based functional genes present in Clinton Creek microbialites were also analyzed across both freshwater microbialites from Cuatro Ciénegas and Highbourne Cay stromatolite metagenomes. Clinton Creek microbialites had weak correlations to Pozas Azules II metagenome (Figure 6, R 2 = 0.702), followed by Rio Mesquites (Figure 6, R 2 = 0.633), and Highbourne Cay stromatolite (Figure 6, R 2 = 0.018). Arctic polar mats had stronger correlations for SEED pathways than Cuatro Ciénegas microbialites and Highbourne Cay stromatolite metagenomes. Cuatro Ciénegas microbialites and Highbourne Cay are in tropical climates, which would remove many pathways related to cold-adaptation which are present in Clinton Creek and polar mats (Varin et al., 2012). The Highbourne Cay marine stromatolite had the lowest correlation of SEED functional gene classifications to Clinton Creek microbialites, which further suggests that marine microbialites differ from freshwater microbialites.
Clinton Creek samples were distinct from non-lithifying Octopus and Mushroom spring mats from Yellowstone ( Figure 5). These data reveal that Clinton Creek microbialites are closely related to polar mats, due possibly to cold-adaptation, and differ greatly from tropical microbialites. This reveals that under the correct chemistry (e.g., alkaline pH, high DIC, and dissolved Ca 2+ or Mg 2+ ), and with low numbers of metazoans, polar mats on ice shelves could have at least the metabolic potential to make microbialites.

Clinton Creek Geochemistry
The key chemical parameters with regard to CaCO 3 precipitation are pH, and concentrations of Ca 2+ and DIC. These parameters influence the degree of saturation of a solution as given by the Saturation Index (SI), which is defined as SI = log(IAP/K sp ), where the IAP is the Ion Activation Product and K sp is the solubility product of a given mineral. Rates of mineral nucleation and precipitation are generally greater with increasing degree of saturation (De Yoreo and Vekilov, 2003). Speciation calculations using PHREEQC (Parkhurst and Appelo, 1999) determined that the average SI for aragonite in the Clinton Creek open pit water is 0.6 vs. 0.72 for calcite of Pavilion Lake (Brady et al., 2009). This may explain the extremely rapid accretion rate in Clinton Creek, which is two orders of magnitude faster than Pavilion Lake microbialites (Brady et al., 2009), and one order faster than Highbourne Cay microbialites (Planavsky and Ginsburg, 2009). Given a similar CaCO 3 saturation index as Pavilion Lake, the relatively rapid formation of Clinton Creek microbialites cannot be explained by the bulk chemical parameters of the open pit water. Furthermore, the microbialites and surrounding sediments experience nearly the same environmental conditions (e.g., nutrient availability, bulk water chemistry, and lighting). Consequently, we are able to differentiate between the environmental and microbial controls on carbonate precipitation through a comparative analysis of the microbialites vs. the surrounding sediment using metagenomic analysis. The surrounding sediments do contain some aragonite (Power et al., 2011a); however, it is clear that carbonate precipitation rates are much faster in the microbialites given their greater abundance of aragonite. These finding suggest that microbialite formation in Clinton Creek is indeed driven by the local microbial community. Microbial metabolism is expected to significantly modify the water chemistry in the interstitial waters of the microbialites (Dupraz et al., 2009). On a geologic and even a human timescale, Clinton Creek microbialites are exceedingly young, and it may be that their rapid accretion rates will not extend into the future.
Our data suggest that polar mats have the metabolic potential to make microbialites under the correct chemical conditions. Further work is needed to definitively ascertain specific microbe influence in terms of the speed of microbialite formation. In the future, this may provide an avenue for us to engineer microbial communities to store atmospheric carbon through biolithification, especially given the recent, anthropogenic origin of the Clinton Creek site. Biogenic carbonate deposits are the largest reservoirs of carbon on Earth and could provide a cost-efficient method of carbon sequestration for greenhouse FIGURE 6 | Scatter plots of functional gene annotations using SEED subsystem level III. One sided G-test (w/Yates' + Fisher's) with asymptomatic confidence intervals (0.95) using Benjamini-Hochberg FDR procedure in STAMP. Each dot represents a unique functional classification gene. gas emissions (Falkowski et al., 2000). Passive carbonation and carbon capture has been documented within the Clinton Creek mine tailings, leading to the proposition that microbiallymediated carbonate precipitation is a means to ameliorate carbon emissions from mining operations (Power et al., 2011a).

Conclusions
The northernmost microbialites known are located at subarctic Clinton Creek (Yukon, Canada). DNA from representative microbialites was extracted and directly sequenced, without bias from DNA amplification, and used to produce the largest set of assembled metagenomic data from a freshwater microbialite-forming ecosystem. The data revealed a high proportion of photosynthetic genes that were absent in the surrounding sediments, implying that microbialite formation is driven by photosynthesisinduced alkalinization, which is supported by 13C isotopic enrichment (Power et al., 2011b). Predicted metabolic pathways overlapped extensively between microbialite and sediment communities, particularly with respect to housekeeping genes; however, they have distinct core communities with microbialites dominated by Alphaproteobacteria (mainly anoxic phototrophs like Rhodobacterales) and sediments dominated by Gammaproteobacteria (mainly heterotrophic nitrogen-fixing Pseudomonas spp.).
While Clinton Creek microbialites shared some functional potential with microbialites from Cuatro Ciénegas, they shared far greater relation to Arctic mats (e.g., Markham and Ward Hunt), possibly due to cold-adaptation facilitated by long winters. The shared metabolic potential between Clinton Creek microbialites and polar mats from ice shelves, suggests that under favorable geochemical conditions, (e.g., alkaline pH, high DIC, and dissolved Ca 2+ or Mg 2+ ), Arctic mats have the metabolic potential to form microbialites.
This study illustrates that cyanobacteria generate alkalinity and support heterotrophic communities, which have the potential to drive the formation of microbialites at Clinton Creek. Together, this suggests that an anthropogenic environment can foster microbial communities capable of mediating carbonate precipitation, and that these microbes could offer an effective means of carbon sequestration (Power et al., 2011a,b). Microbially-mediated carbonate precipitation is an environmentally safe and novel process that could be harnessed to provide a cost-efficient strategy for the long-term storage of anthropogenic greenhouse gasses (e.g., CO 2 ).