Highly Diverse Aquatic Microbial Communities Separated by Permafrost in Greenland Show Distinct Features According to Environmental Niches

The Greenland Analog Project (GAP) study area in the vicinity of Kangarlussuaq, Western Greenland, was sampled for surface water and deep groundwater in order to determine the composition and estimate the metabolic features of the microbial communities in water bodies separated by permafrost. The sampling sites comprised a freshwater pond, talik lake, deep anoxic groundwater, glacier ice and supraglacial river, meltwater river and melting permafrost active layer. The microbial communities were characterized by amplicon sequencing of the bacterial and archaeal 16S rRNA genes and fungal ITS1 spacer. In addition, bacterial, archaeal and fungal numbers were determined by qPCR and plate counts, and the utilization pattern of carbon and nitrogen substrates was determined with Biolog AN plates and metabolic functions were predicted with FAPROTAX. Different sample types were clearly distinguishable from each other based on community composition, microbial numbers, and substrate utilization patterns, forming four groups, (1) pond/lake, (2) deep groundwater, (3) glacial ice, and (4) meltwater. Bacteria were the most abundant microbial domain, ranging from 0.2–1.4 × 107 16S rRNA gene copies mL-1 in pond/lake and meltwater, 0.1-7.8 × 106 copies mL-1 in groundwater and less than 104 copies mL-1 in ice. The number of archaeal 16S and fungal 5.8S rRNA genes was generally less than 6.0 × 103 and 1.5 × 103, respectively. N2-fixing and methane-oxidizing Actinomycetes, Bacteroidetes and Verrucomicrobia were the dominant microorganisms in the pond/lake samples, whereas iron reducing Desulfosporosinus sp. dominated the deep anaerobic groundwater. The glacial ice was inhabited by Cyanobacteria, which were mostly Chloroplast-like. The meltwater contained methano- and methylotrophic Proteobacteria, but had also high relative abundances of the nano-sized Parcubacteria. The archaea composed approximately 1% of the 16S rRNA gene pool in the pond/lake samples with nano-sized Woesearchaeota as the dominating taxon, while in the other sample types archaea were almost negligent. Fungi were also most common in the pond/lake communities, were zoospore-forming Chytridiomycetes dominated. Our results show highly diverse microbial communities inhabiting the different cold Greenlandic aqueous environments and show clear segregation of the microbial communities according to habitat, with distinctive dominating metabolic features specifically inhabiting defined environmental niches and a high relative abundance of putatively parasitic or symbiotic nano-sized taxa.


INTRODUCTION
Arctic environments are highly susceptible to the effects of global climate change. The effects can be seen as increased glacier melt and glacier thinning, melting of the permafrost to greater depths and increased generation of melt water (Hanna et al., 2008). The permafrost has a major impact on hydrology in Arctic regions since it separates the surface water from the deep subsurface water, providing to two from each other separate water environments (e.g., White et al., 2007;Bosson et al., 2013;Kane et al., 2013). However, in specific spots or areas called taliks, flow between surface and deep subsurface water may be possible because the ground remains unfrozen below these taliks throughout the year.
Warming of the arctic has a profound effect on arctic lake ecosystems. Even a few degrees raise in temperature may increase the open water period of arctic lakes, which may result in greater primary production rates (Vincent et al., 2009). Greater primary production is followed by increase (blooms) in biota consuming the primary produced compounds, which affect the light penetration to deeper water layers and eventually increases the amount of dead biomass sinking to the bottom of the lake where it is decomposed and causes alterations in the oxygen levels of the lake (Scheffer and Carpenter, 2003). Decreasing light penetration would also lead to decrease in the original rate of primary production, turning the lakes from CO 2 sinks to CO 2 sources (Mariash et al., 2018). Increased organic matter could severely change the food web composition in oligotrophic lakes from mostly bacteria driven in oligotrophic water to microbial eukaryotic driven with increasing organic carbon content (Cotner and Biddanda, 2002;Mariash et al., 2018). Heterotrophic bacteria have been shown to dominate the microbial communities in oligotrophic aquatic environments, such as arctic lakes, where they have crucial roles in the mineralization of primary produced organic compounds as well as recycling of carbon (reviewed by Cotner and Biddanda, 2002). In oligotrophic lakes the organic carbon is generally dissolved, in comparison to eutrophic lakes, where the organic carbon is bound in particulate matter (Biddanda et al., 2001).
Bioavailable nitrogen is introduced to the ecosystem through nitrogen fixation. In Svalbard tundra and lake sediment the availability of ammonia, nitrate, organic carbon and organic nitrogen were strong drivers for the diversity of the bacterial communities (Wang et al., 2016). In Greenlandic tundra soil, moisture was the main factor driving the nitrogen fixation, but increasing soil temperature had a negative effect on the nitrogen fixation rate (Rousk et al., 2018). Nevertheless, bedrock may also be a significant source of nitrogen compounds in aquatic ecosystems as different types of rock, e.g., metasedimentary rock types, may contain significant (up to 1 g kg −1 ) concentrations of nitrogen (Holloway and Dahlgren, 2002). In tundra ponds in Alaska, phosphorus was the more critically limiting nutrient, whereas nitrogen was constantly supplied from the bottom sediment (Alexander et al., 1989). Severe phosphorus limitation was also reported for two lakes in SW Greenland, especially in spring (Brutemark et al., 2006). A study on more than 50 lakes and other surface water types (supraglacial and subglacial meltwater) in the Kangarlussuaq area also reported severe P and N limitations in the water (Henkemans, 2016). Thus, oligotrophic lakes are highly susceptible to increased input of nitrogen and phosphorus, which may lead to increased growth of the algal communities and this may lead to an elevation in the nitrogen and phosphorus concentrations in the water resulting from decaying biomass (Ogbebo et al., 2009).
The motivation for studying the microbial communities in aquatic environments above and below the permafrost in Greenland is connected to the safety of deep geological repositories (DGRs) for high-level radioactive waste. DGRs are designed to keep the spent nuclear fuel isolated from the surface biosphere on a time scale of hundreds of thousands of years. Over this time frame, glacial conditions are expected in northern latitudes, such as the Nordic countries and Canada (e.g., SKB, 2011;Posiva, 2012). A thick ice cover may influence groundwater flow due to increased ice load. Melt water from the ice may also penetrate in to the bedrock environment bringing nutrients and oxygen down to repository depths (e.g., SKB, 2011;Posiva, 2012). Greenland with its ice sheet and permafrost was used as an analogous environment in the Greenland Analog Project (GAP) to investigate the hydrological, hydrogeological and geochemical processes that a nordic DGR can experience during glacial conditions resembling the retreat phase of a glacial cycle Harper et al., 2016). The aims were to investigate how and if the surface waters influence the deep groundwater using both geochemical and microbiological tools. In addition, we aimed to characterize the composition of arctic microbial communities in different aquatic environments and estimate the metabolic potential of these communities in order to gain more information on the little studied arctic microbial flora. Our hypotheses were; (1) the deep subsurface microbial communities are distinctly different from the surface water communities, and (2) the different environment types contain defined, environmentspecific communities, even though the environments may be in direct contact with each other, such as the inflow to the Talik lake and the Talik lake water. FIGURE 1 | The study area is located on the west coast of Greenland (A) in the Kangarlussuaq area (B), which is indicated by the red square in A. The sampling area (C) is indicated by a red square in B. The sampling sites are indicated by colored symbols in C. The sample codes are explained in Table 1. The white bar in the lower left corner of the figures represent 1000, 100, and 1 km in A, B, and C, respectively. DWP, drill water pond; TLU, talik lake up; TLM, talik lake mid; DH-GAP01, drillhole DH-GAP01; DH-GAP04, drillhole DH-GAP04; ICE, glacial ice from Issunguata Sermia; SGR, Supra Glacial River; MWR, melt water river directly below the glacier; ISR, Issunguata Sermia River; ITL, inflow to Talik lake. Maps from Google Earth 2017.

Site Description
The Kangerlussuaq area is situated on the west coast of Greenland, approximately 50 km north of the Arctic circle. Local bedrock belongs to the Nagssugtoqidian Orogen area, which is composed of 1800-1900 Ma old Archaean ortho-gneisses containing minor amounts of amphibolite and metasedimentary rocks, that have undergone metamorphosis during the Palaeo-Proterozoic (van Gool et al., 2002;Garde and Marker, 2010). The nearest ice tongues of the Greenland Ice Sheet (GrIS) are situated only 20 km east of the Kangerlussuaq village (Figure 1). Here the ice sheet terminates on land at a distance of 150 km from the coast. The mean annual air temperature is −5.1 • C and the annual precipitation 173 mm for the period 1977-2011 (Cappelen, 2018). The two main outlet glaciers in the area are Isunnguata Sermia and the Russell glacier (Figure 1). In the GAP study area, the ice sheet thickness reaches approximately 1500 m with a mean value of approximately 800 m (Lindbäck et al., 2014). The ice flow is generally in the direction from east to west with a mean surface velocity of the 150 m/year. The outlet glaciers here are land-terminated and isolated from marine influence.
The vegetation consists of dwarf-shrub heath, which grow lower and less dense with increasing altitude and proximity to the ice sheet. The ice sheet melt season, with surface melt and runoff, lasts from May to September, during which time the ice sheet loses 3-4 m of thickness. The climate in the study area in low Arctic continental with continuous permafrost and is highly impacted by the presence of the GrIS. The study area is described in detail in Claesson Liljedahl et al. (2016) and Harper et al. (2016).
Surface waters (glacial ice and meltwaters, melt water ponds, lakes, rivers) and subsurface waters (deep groundwater) have been extensively studied in the Kangarlussuaq area (Henkemans, 2016). The waters are typically highly nitrogen deficient with nitrate concentrations generally below the detection limit of 0.2 mg L −1 . The surface waters are furthermore severely depleted in phosphorus, with P concentrations generally below 10 µg L −1 . However, the deep groundwater from three different sections (described below) of borehole and DH-GAP04 was shown to contain 42-4320 µg L −1 P, although the concentration of P in groundwater from borehole DH-GAP01 was below 10 µg L −1 .
A total of 12 different samples, representing lake and pond water (samples DWP, TLU, TLM), deep subsurface water (samples DH-GAP01, DH-GAP04.up/.mid/.low) and ice and meltwater (samples ICE, SGR, MWR, ISR, and ITL) were collected during September 6-14, 2014 (Table 1 and Figure 1) when the permafrost is thinnest. The Talik lake area, from where samples ITL, TLU, TLM, and DH-GAP01 originate is situated at an elevation of 369 m above sea level (masl) measured from the lake surface, and approximately 800 m from the ice sheet margin. The lake has a surface area of 0.37 km 2 and a catchment area of 1.56 km 2 and a maximum depth of 29.9 m. The catchment area is dominated by glacial till and glaciofluvial deposits overlain by eolian silt and fine sand. Water flows into the lake from the melting active layer of the permafrost and may also be influenced by the under lying groundwater (Johansson et al., 2015). Drill hole DH-GAP01 was drilled in 2009. It is situated 20 m from the lake and was drilled at an angle to reach below the lake. The drill hole has a length of 221 m and reaches a vertical depth of 191 m below land surface. It was drilled using sodium fluorescein (C 2 OH 10 Na 2 O 5 ) spiked water from the Talik lake. An inflatable packer is installed at 129 m vertical depth below land surface (drill hole length 150 m), allowing for sample collection from 129 -191 m vertical depth, yielding a sampling section with a volume of 174 L. Sample intake and sensors for in situ pressure, temperature and electric conductivity (EC) are located at 161 m borehole length (138 m vertical depth) . The lithology of this section consists mostly of felsic gneiss, with inclusions of intermediate gneiss at drill hole length 110-130 m and mafic gneiss below 180 m (Pere, 2014). Fractures are frequent and the fracture filling mineral is pyrite. The groundwater is anaerobic.
Drill hole DH-GAP04 is situated just at the margin of the Isunnguata Sermia outlet glacier (Figure 1). This drill hole was drilled in June 2011, has a length of 687 m and reaches a vertical depth of 645 m below land surface. Water from the Drill Water Pond (DWP), a small pond situated at approximately 1 km southwest of DH-GAP04, spiked with sodium fluorescein, was used as drilling water. Two inflatable packers are installed dividing the drill hole into three sections. The packers are installed at vertical depths of 561 and 571 m below land surface (drill hole lengths 593.5 and 605.5 m, respectively), isolating a 44 L section (DH-GAP04.mid) including two fracture zones . The section above the upper packer (DH-GAP04.up) extends from the base of the permafrost at 400 m depth to the upper packer, forming a 840 L drill hole section with groundwater inflow from 3 fracture systems (borehole length 548, 551, and 584 m, vertical depth 518, 522, 554 m, respectively), with a sample intake, EC and in situ pressure sensors installed at 541 m borehole length (512 m vertical depth). The lower section (DH-GAP04.low) has a volume of approximately 350 L containing 3 weak fractures at 638, 670, and 682 m drill hole length, equal to 604, 632, and 644 m vertical depth. Sample intake, EC and in situ pressure sensors are located within 2 m of the lower packer . The lithology of the DH-GAP04 drill hole was intermediate gneiss at the sampling depths (Pere, 2014). The fracture filling mineral is gypsum at the depth of sampling. For more details on boreholes DH-GAP01 and DH-GAP04, see e.g., Pere (2014), Harper et al. (2016), Kontula et al. (2016). The groundwater is anaerobic.

Sampling for Chemistry
The temperature, pH and conductivity of the water samples were measured immediately in the field with a portable pH, conductivity and temperature meter (Oakton), with exception of the ice sample, which was thawed in the laboratory before measurement. Sodium fluorescein had been added to the drilling fluid when the drill holes were drilled in order to monitor the possible contamination of the deep groundwater by the drilling fluid. By purging the drill holes and allowing them to fill with groundwater from the fracture zones, the sodium fluorescein concentration and thus the drilling fluid impact eventually decreases. Samples for measuring the fluorescein content were collected into plastic bottles and wrapped in aluminum foil in order to protect the fluorescein from deterioration. Water samples for analysis of anions were collected as such directly in to unused factory-clean plastic bottles (Nalgene) and were filtered in the laboratory prior to analysis through 0.45 µm pore-size cellulose acetate filters. Samples for cationic analyses were filtered in the field through 0.45 µm poresize cellulose acetate filters in to factory-clean plastic bottles (Nalgene) and acidified with ultra-clean nitric acid that had been rinsed three times with filtered sample water before sample collection. The anaerobic samples from DH-GAP01 and DH-GAP04 samples were collected anaerobically under nitrogen atmosphere. The samples for geochemical analyses were sent to Teollisuuden Voima Oyj (TVO), Finland, where the cations were analyzed with ICP-OES (iCAP 6500 Thermo Fisher Scientific). Anions (Br,Cl,F,NO 3 , and SO 4 ) were analyzed with ion chromatography . The alkalinity of the samples was determined with titrimetric analysis (Metrohm 905 Titrando) and the concentration of HCO 3 was calculated from this analysis. Non-purgeable organic carbon (NPOC) and Dissolved inorganic carbon (DIC) were filtered through 0.45 µm pore-size cellulose acetate filters and analyzed with a TOC analyzer at TVO with Shimadzu TOC-V CPH. Chemical analyses were done from three subsamples of each sample.

Samples for Microbiology
Water samples were collected in triplicate from each sampling site. The biomass from the water samples was collected on to Sterivex TM filter units using a peristaltic pump equipped with sterile silicone tubing. The samples on the filters were immediately fixed with 5 mL MoBio LifeGuard TM solution in order to preserve the nucleic acids and placed on coolers in a cooling box for a maximum time of 6 h until the samples could be placed in a freezer at −20 • C. The samples were transported frozen from Greenland to Finland for processing.
In addition, two parallel 50 mL water samples were collected from each site into sterile, oxygen-free glass infusion bottles equipped with an air tight butyl rubber stopper (Bellco) and aluminum crimp cap. The water samples were collected in sterile syringes equipped with a hypodermic needle. The water samples were inserted into the sealed bottle by pushing the needle through the rubber stopper. These live water samples were kept refrigerated at 4 • C and transported on ice to Finland. The storage time before processing, due to logistical necessity, was at most 2 weeks.

Estimation of Number of Cultivable Microorganisms
Nutrient agar, TSA and R2A (Sigma) agar plates for cultivation of heterotrophic microorganisms were prepared from ready media mixes according to the manufacturer's recommendations. In addition, autotrophic microorganisms were targeted using modified Medium 72 agar plates 1 without methanol. A tenfold dilution series was prepared from each sample in sterile 0.9% NaCl solution. Aliquots of 100 µl of the undiluted sample, 10 −1 and 10 −2 dilutions were spread on duplicate agar plates in a laminar flow hood. The agar plates were incubated at +1 • C (+/−1 • C) and the appearance of colonies was checked for weekly. 1 http://bccm.belspo.be/catalogs/lmg-media-details?CultureMediaID=72

Biolog AM
The capacity of the indigenous microbial flora of the different water samples for hydrolyzing carbon and nitrogen compounds was tested on Biolog AM 96-well plates (Biolog, Hayward, CA, United States) intended also for anaerobic microorganisms. The Biolog AM plates were prepared in an anaerobic cabinet in order to protect the plates and the samples from oxygen. The sealed sample bottles were opened in an anaerobic cabinet and aliquots 100 µl sample water were pipetted into each well on four parallel Biolog AM plates per sample. Two of the four plates were inserted into anaerobic pouches equipped with an oxygen indicator in order to maintain anoxic conditions during incubation, and two of the plates were incubated in ambient atmosphere. The Biolog AM plates were kept at 10 • C and checked weekly for changes in color in the wells.

Nucleic Acid Isolation
For DNA extraction the Sterivex filter units were opened in a laminar flow hood using sterilized pliers. The membranes were removed with sterile scalpels and tweezers and put in 15 mL sterile screw cap cone tubes (Corning) for DNA extraction. DNA was extracted from the filters using the NucleoSpin Soil DNA isolation kit (Macherey-Nagel, Germany). First the beads from one bead tube were inserted to the cone tube containing the Sterivex TM membrane and two reaction volumes of SL1 buffer was added to the tube, and the sealed tubes were horizontally shaken using a vortex shaker for 10 min. After 5 min centrifugation at 4000 rpm in an Eppendorf 5810R table top centrifuge the supernatant was collected and the extraction procedure continued as recommended by the manufacturer. The DNA was eluted in 50 µl of buffer SE and the DNA concentrations were measured using a Nanodrop 1000 spectrophotometer (Thermo Fisher Scientific). The mean DNA concentrations for the different sample types varied between 4.0 ± 1.0 ng µl −1 in DH-GAP04-UP to 10.2 ± 6.8 ng µl −1 in ITL. The extracted DNA was stored at −80 • C.

Estimation of Microbial Community Size by Quantitative PCR
The size of the microbial community in the different sampling sites was estimated by bacterial and archaeal 16S rRNA gene qPCR as described in Bomberg et al. (2016). The fungal community sizes were estimated targeting the fungal 5.8 rRNA gene using a TaqMan approach, using primers 5.8F1 and 5.8R1 and probe 5.8P1 (Haugland and Vesper, 2002) as described in Rajala et al. (2016).

Amplicon Library Preparation
The amplification libraries for high throughput sequencing with Ion Torrent PGM were prepared by PCR from the DNA samples. Bacterial 16S genes were amplified with primers S-D-Bact-0341b-S-17/S-D-Bact-0785-a-A-21 (Herlemann et al., 2011), targeting the variable region V3-V4 region of the 16S rDNA gene, archaeal 16S genes with primers S-D-Arch-0349-a-S-17/S-D-Arch-0787a-A-20 (Klindworth et al., 2013), targeting the V4 region of the gene and fungal internal transcribed spacer (ITS) gene markers with primer pair ITS1 and ITS2 targeting the fungal ITS1 region (White et al., 1990;Gardes and Bruns, 1993). PCR amplification was performed in parallel 25 µl reactions for every sample containing 1× MyTaq TM Red Mix (Bioline, London, United Kingdom, 20 pmol of each primer, up to 25 µl molecularbiology-grade water (Sigma) and 2 µl of template. The PCR program consisted of an initial denaturation step at 95 • C for 3 min, 35 cycles for bacteria and fungi and 40 cycles for archaea of 15 s at 95 • C, 15 s at 50 • C, and 15 s at 72 • C. A final elongation step of 30 s was performed at 72 • C. The PCR products were verified with agarose gel electrophoresis. Amplicons were sent to Ion Torrent sequencing with PGM equipment (Bioser, Oulu, Finland) and amplicons were purified before sequencing by the staff at Bioser. The sequences have been submitted to the European Nucleotide Archive (ENA) under Study Accession Number PRJEB30970.

Sequence Processing and Analysis
The fastq sequence files obtained from Ion Torrent sequencing were first converted to fasta and qual files using the QIIME v. 1.9 software (Caporaso et al., 2010) and the sequence analysis was continued using the mothur software, v. 1.33.4 (Schloss et al., 2009). Adapters, barcodes and primers were removed and the sequence reads were trimmed to a minimum length of 250 nucleotides using mothur. No barcode differences, no ambiguous nucleotides and a maximum of 8 nucleotide homopolymers were allowed. A qwindowaverage of 25 and a qwindowsize of 50 were used on the PGM read data in order to remove erroneous reads from the data set. Chimeric sequence reads were removed with Chimera Slayer in mothur using the Silva 128 database as template (Quast et al., 2012). A phylip distance matrix was built according to the aligned sequences using the dist.seqs command in mothur with a cutoff value of 0.03. Sequence reads were clustered in to operational taxonomic units (OTU) sharing 97% identity within each OTU. The OTUs were classified against the Silva.seed_v128 database in mothur using the wang classification method. Functional profiles for the bacterial and archaeal communities were predicted using FAPROTAX (Louca et al., 2016). ITS1 sequences were analyzed using the QIIME v 1.9 software. Adapters, barcodes and primers were removed from the sequence reads, and chimeric sequence reads were removed from the dataset with the USEARCH algorithm (Edgar, 2010) by de novo detection and through similarity searches against the UNITE reference dataset (Version sh_refs_qiime_ver7_dynamic_s_20.11.2016_dev) (Kõljalg et al., 2013;Unite Community, 2017;Nilsson et al., 2018) with fungal sequences. Sequences that failed to hit the sequence database with a minimum of 60% sequence identity were discarded as sequencing errors. The open reference strategy in QIIME was used for OTU picking and taxonomic classification of the sequences.
Alpha-diversity measures (number of OTUs detected in the dataset -observed OTUs, estimated number of OTUs that could in total be detected in the environment from which the data set originates -Chao1 OTU richness, and the Shannon diversity index describing the OTU richness and evenness of OTU distribution in the examined environment) were calculated based on the absolute number of sequence reads per OTU using the Phyloseq package in R (McMurdie and Holmes, 2013; R Development Core Team, 2018) and visualized using ggplot2. The similarity of the archaeal, bacterial and fungal communities between the different sample sites was tested by principal coordinate analysis (PCoA) using the Phyloseq package in R using the Bray-Curtis dissimilarity model. Eigen values for the variance explained by the PCoA dimensions were calculated on 999 random repeats.
Significant difference between the mean number of bacterial and archaeal 16S rRNA gene copies, fungal 5.8S rRNA copies and colony forming units (cfu) per mL sample water was tested using one-way ANOVA, Tukey's Q test and Kruskal-Wallis test using the PAST3 software (Hammer et al., 2009). In addition, the differences between microbial communities above and below the permafrost and between different sample types was tested with PERMANOVA using the Bray-Curtis dissimilarity model and 9999 permutations in PAST3.

Chemical Parameters of the Samples
The pH of the different water samples varied between 6.3 in the ICE sample (measured from molten ice in the laboratory) and 8.7 in the DH-GAP04.mid sample ( Table 2). The conductivity (EC) of the water was highest, 3.71 mS/cm, in the DH-GAP04.low sample and lowest in the ICE and melt water samples. The deep subsurface DH-GAP04.up, .mid, and .low samples had the highest salinity, total concentration of sulfur and sulfate, while the pond and lake water contained the highest concentrations of dissolved inorganic carbon (DIC), bicarbonate and nonpurgeable organic carbon (NPOC) ( Table 2).
The highest concentration of bacterial 16S rRNA genes and fungal 5.8S rRNA genes correlated positively and significantly (r > 0.8, p < 0.02 and r > 0.6, p < 0.02 for bacteria and fungi, respectively) with the highest concentrations of DIC and bicarbonate. The highest number of cfu:s obtained on TSA correlated positively and significantly (r > 0.59, p < 0.05) with the highest concentration of Ca, S tot , Sr, SO 4 , and TDS, and the highest number of cfu:s on R2A correlated positively and significantly (r > 0.6, p < 0.04) with the highest concentration of Ca.

Alpha Diversity
The number of bacterial, archaeal and fungal sequences varied in the samples between 3.0 × 10 3 -7.9 × 10 3 , 1.0 × 10 1 -8.6 × 10 3 and 3.0 × 10 3 -7.9 × 10 3 , respectively (Supplementary Table S1 and Supplementary Figure S1). The highest number of observed bacterial OTUs (1.5 × 10 3 ) was observed in MWR, while the lowest number (2.2-4.5 × 10 2 ) were detected in the deep subsurface samples and ITL. Nevertheless, the highest number of Chao1 estimated bacterial OTUs (4.6 × 10 3 ) was estimated for the ITL sample and the lowest for the deep subsurface samples. The ITL bacterial community also had the highest Shannon diversity index of 8.8 while the lowest Shannon diversity indices 3.6-6.1) were calculated for the deep subsurface samples and ICE.
The number of archaeal observed OTUs was highest in the DWP, MWR, ISR and ITL (8.8 × 10 2 -1.2 × 10 3 ) and lowest in the deep subsurface and ICE, where the number of archaeal sequences was also low. The highest number of Chao1 estimated archaeal OTUs was also detected in DWP. The Shannon diversity index was highest in the lake/pond samples and MWR, ISR, and ITL and lowest in the deep subsurface and ICE samples.
The highest number of fungal ITS OTUs was observed in the lake/pond samples and MWR, ISR and ITL, while the lowest OTU numbers were obtained from the deep subsurface samples, ICE and SGR. The same was seen in the Chao1 estimations, with the exception that DH-GAP01 and ICE were also estimated to contain approximately 1.0 × 10 3 OTUs. The Shannon index for the fungal communities was above 4 in all other samples except TLU and the deep subsurface samples.

Bacterial Community
The bacterial community profiles differed greatly between the pond/lake, deep subsurface and ice/melt water samples. Altogether 52 bacterial Phyla were detected ( Figure 3A). Nevertheless, specific bacterial phyla clearly dominated in the different habitats. In the pond/lake samples (DWP, TLU, and TLM), the majority of the bacterial communities consisted of Actinobacteria (Figure 3A), especially taxa belonging to the family Sporichthyaceae (unidentified Sporichthyaceae 15-26% and hgcl clade 18 -28% of the bacterial 16S rRNA gene sequence reads, Supplementary Figure S2A), and Candidate genus Planktophila (4.8 -8.2%). In addition, the Talik lake samples (TLU and TLM) contained a large proportion of Verrucomicrobia sequences belonging to the genus Methylacidiphilum (11-12%) (Supplementary Figure  S2B). Alphaproteobacteria belonging to the SAR11 clade were present as a major group in the Talik lake (9.4 -11.4%) and as a small fraction (1.4%) also in the DWP sample.
In the deep subsurface samples, the major bacterial constituent was Firmicutes belonging to the genus Desulfosporosinus (23 -61% of the sequence reads, Supplementary Figure S2C), and the DH-GAP04 samples also contained Thermoanaerobacterales bacteria classified as family SRB2 (13 -37% of the sequence reads). Nevertheless, DH-GAP01 also contained 3% deltaproteobacteria belonging to the Desulfovibrio, 5.1 and 7.2% alphaproteobacteria belonging to Janthinobacteria and Undibacteria, respectively (Supplementary Figure S3A).
In the ITL sample, the majority of the bacterial sequences belonged to Parcubacteria (13.5 -62% of all bacterial sequence reads) (Figure 3A), of which approximately 15% (of all bacterial sequence reads) belonged to a genus of the Candidatus class Nomurabacteria and an 10% to an unidentified Parcubacteria genus. Proteobacteria (19 -50% of the sequence reads) belonged mostly to Methylobacter (6.4%) (Supplementary Figure S3D). Bacteroidetes bacteria were present in all samples to some extent, 0.6 -22%, most of which were Flavobacteria (Figure 3 and Supplementary Figure S2D).
The difference in the bacterial communities could also be seen in the PCoA plots (Figure 4A)  the same direction as the concentration of Fe and Al, and the ICE and SGR samples to the middle. Interestingly, the DH-GAP01 samples also fall to the middle of this plot.
The two-way PERMANOVA analysis showed that the bacterial communities of the deep subsurface samples differed significantly (p = 0.0001) from the bacterial communities in water samples from above the permafrost. The bacterial communities identified from each sample type, i.e., lake/pond, deep subsurface, and ice and meltwater, also differed significantly (p = 0.0001) from each other.
The archaeal communities divide in to four distinct groups in the PCoA analysis ( Figure 4B). The lake/pond samples form a cluster to the lower right corner of the plot, together with the highest concentration of inorganic and organic carbon (NPOC), total N concentration and buffering capacity of the water, while the MWR and ISR cluster together to the left of the plot. The SGR archaeal community cluster to the upper right of the plot while the deep subsurface samples fall in to a loose group to the middle right of the graph. The ITL samples appear to fall closer to the deep subsurface samples, but this is most likely a bias due to the low number of archaeal sequences obtained from the deep subsurface samples.
The two-way PERMANOVA analysis showed that the archaeal identified from samples above and below the permafrost, as well as between different sample types (lake/pond, deep subsurface, and ice and meltwater) differed significantly (p = 0.0001).
The fungal communities followed the trend of the bacterial and archaeal communities in the PCoA analysis ( Figure 4C). The TLU and TLM samples grouped together and the SGR, MWR and ISR formed a cluster of their own. The DH-GAP01, DH-GAP04, ICE and, surprisingly, ITL samples clustered together in the direction of salinity, sulfate and total S concentrations, while the DWP samples fell into a separate group closer to the deep subsurface samples than the other lake samples.
In accordance to the bacterial and archaeal communities, also the fungal communities identified from samples above and below the permafrost, as well as between different sample types (lake/pond, deep subsurface, and ice and meltwater) differed significantly (p = 0.0001) from each other when tested with two-way PERMANOVA.

Metabolic Profiles
The Biolog AM 96-well substrate plates were used to screen the capacity to utilize different carbon and nitrogen substrates. Substrate hydrolysis was recorded in the oxic tests in all other samples, but ICE (Figure 5). In the anoxic tests, substrates were hydrolyzed only by DWP and ITL communities. Between 32 and 86 of the 95 different substrates were hydrolyzed aerobically ( Figure 5). The least number of substrates, 32 and 38, were hydrolyzed by the TLU and DH-GAP04.mid communities, respectively. The highest number of substrates, 80 and 86, were hydrolyzed by the ISR and ITL communities, respectively. In the anaerobic tests, 19 and 28 substrates were hydrolyzed by the ITL and DWP communities, respectively. The two-way PERMANOVA showed that the substrate utilization patterns were distinctly different in the surface water samples (lake/pond, ice and meltwater) compared to the deep subsurface samples (p = 0.0009), as well as between the different sample types (lake/pond, ice and meltwater, deep subsurface) (p = 0.0001) and the anaerobic vs. aerobic substrate utilization (p = 0.0001). In the PCoA analyses the anaerobic DWP and ITL samples clustered together as a separate group from the rest, with the DWP in the direction of the total N, NPOC and Fe concentrations ( Figure 4D). The aerobic samples clustered to the lower half of the plot with the samples with the lowest number of hydrolyzed substrates in the left part of the cluster and to the right the samples with the highest number of hydrolyzed substrates. The greatest difference in substrate utilization was observed in the utilization pattern of carbohydrates, as the melt water communities were able to hydrolyze almost all tested substrates, while only D-Cellobiose, beta-Cyclodextrin, Dextrin, alpha-D-Glucose, Maltose, Maltotriose, L-Rhamnose, Sucrose and D-trehalose were hydrolyzed in at least 50% of the reactions from the lake/pond samples, and D-Cellobiose, beta-Cyclodextrin, L-Fructose, D-Galactose, alpha-D-Glucose, Maltose, D-Melibiose, L-Rhamnose, Sucrose and D-trehalose were used in at least 50% of the tests of the deep subsurface samples. Gentibiose was hydrolyzed anaerobically in DWP, but not aerobically. Carboxylic acids were generally hydrolyzed in all sample types, but interestingly, formic acid was not used in any sample. Acetate was used in the DWP, but not in the Talik lake samples. In addition, acetate was sporadically used in the meltwater sample type, but not in the deep subsurface environment, with the exception of one of the replicate reactions from DH-GAP04-LOW. D-Galacturonic acid, D-Glucuronic acid, L-Malic acid, Propionic acid, Pyruvic acid, and Succinic acid were used in all samples, of which D-Galacturonic acid, L-Malic acid and Pyruvic acid were also hydrolyzed anaerobically in DWP and ITL. Beta-hydroxybutyric acid, Succinamic acid, and Urocanic acid was used in all but one sample. Of the alcohols, D-Arabitol was used by all communities, also anaerobically in DWP and ITL. Of the amino acids, L-asparagine, L-glutamic acid, L-serine, and L-valine plus L-aspartic acid were utilized aerobically in all samples, while Glycyl-L-Proline was used only in two of four deep subsurface samples and the MWR, ISR, and ITL. L-Threonine was generally only used in the MWR, ISR, and ITL. Glycosides were sporadically used in all samples, but arbutin was especially popular and used by all, except the TLU and MWR. Interestingly, all tested glycosides, except Amygdalin, was used was used anaerobically in DWP. In contrast, Amygdalin was hydrolyzed aerobically in DWP. Thymidine and Uridine was used only in the melt water samples MWR, ISR and ITL, and only aerobically. DLalpha-Glyserol Phosphate was used only in DWP, MWR, ISR, and ITL. Glucose-6-Phosphate was used in all sample types, although in a lesser degree in the Talik lake and the DH-GAP04 samples. The amino sugars N-acetyl-D-galactosamine and N-acetyl-Dglucosamine were used to some degree in all sample types, but not in the Talik lake. N-acetyl-D-glucosamine was also hydrolyzed anaerobically in DWP and ITL.
The FAPTOTAX predictions showed that chemoheterotrophy was the most common metabolic strategy in all identified bacterial communities, with the exception of the ICE, where chloroplasts and photosynthesis was most common (Figure 6A and Supplementary Table S2). Fermentation, sulfate and sulfur respiration were also prominent in the deep subsurface communities, whereas methylotrophy, methanotrophy and hydrocarbon degradation were prominent metabolic strategies in the Talik lake (TLU and TLM) communities and in the melt water communities (MWR, ISR, and ITL). Ureolysis, in addition to chemoheterotrophy, was the most prominent metabolic strategy in the SGR, although this was not seen in the ICE or the MWR samples. In the archaeal communities, aceticlastic and CO 2 -reducing-H 2 -oxidizing methanogenesis was the most common predicted metabolic strategy ( Figure 6B and Supplementary Table S3). However, in the DWP and ITL, chemoheterotrophy was more prominent in the archaeal communities than methanogenesis.

DISCUSSION
We discovered a surprisingly wide microbial diversity in the permanently cold aquatic environments of western Greenland, ranging from glacier ice and melt water to river, lake and permafrost active layer melt water and deep groundwater from below the permafrost (Figure 1). The microbial communities clustered into 3 -4 more or less distinct community types according to the different habitat types, namely lake/pond (DWP, TLU, and TLM), deep groundwater (DH-GAP01 and DH-GAP04 samples) and the melt water samples (SGR, MWR, and ISR) with ICE and ITL often grouping with different samples, depending on the analysis (Figure 4). Adams et al. (2014) showed that the bacterial communities in the inflow, lake water, and outflow of a lake system differed significantly from each other. The authors suggested that the environmental parameters and the prevailing microbial communities had a greater role in determining the microbial community composition than the input of new organisms from inflow water. It has been shown that the deep subsurface water of the Kangarlussuaq area of Greenland originates from glacier melt water . However, the microbial communities inhabiting the deep groundwater differ greatly in composition from the communities in the water above the permafrost layer.

Lake/Pond
Bacteria The lake/pond samples (DWP, TLU, and TLM) were mostly affected by the concentration of DIC and the alkalinity ( Figure 4A). Controversially, they contained high proportions, between 50.7 and 72.7%, of usually nitrogenfixing Actinomycetes, such as Planktophila, typical in alkaline lakes (Jezbera et al., 2009) and Frankiales. According to the FAPROTAX analysis, these bacteria are responsible for the chemoheterotrophy in the lake. Bacteroidetes were also abundant in these samples, which also contribute to the chemoheterotrophy predicted in the bacterial communities ( Figure 6A and Supplementary Table S2). Flavobacteria-like bacteria (also belonging to the Bacteroidetes) generally co-occur with periods of high heterotrophic activity and growth, and they have the ability to adapt to high-nutrient conditions (Newton and McMahon, 2011). Such conditions may have been prevailing in the lake ecosystem at the time of our sampling campaign, as we visited the site at the end of the growth season. Nitrogen fixation was not predicted by the FAPROTAX analysis for the Actinomycetes or Bacteroidetes phyla, but whole genome sequences of species belonging to the phylum Bacteroidetes (Inoue et al., 2015) have contained genes for nitrogen fixation.
Cyanobacteria were present in all lake/pond samples, but in the DWP their relative abundance was especially high. The Cyanobacteria were mostly similar to chloroplasts, indicating the importance of phytoplankton as primary producers in this ecosystem. The Talik lake samples, TLU and TLM, also contained a high relative abundance of methane-oxidizing Verrucomicrobia belonging to the novel Methylacidiphilum genus (Khadem et al., 2012), which also fixes nitrogen. The known Methylacidiphilum strains are extremely acidophilic, growing at pH as low as 1. However, in our study, approximately 12% of the bacterial sequences detected in the Talik lake samples (TLU and TLM) belonged to this genus, indicating a broader pH tolerance by this genus, or the presence of a novel type of Methylacidiphilum. This was also supported by the FAPROTAX predictions, which identified the Methylacidiphilum as both a methanotroph and a methylotroph ( Figure 6A and Supplementary Table S2).

Archaea
The archaeal communities in the lake/pond samples were most affected by the concentration of organic carbon (NPOC), total N, inorganic carbon (DIC and HCO 3 ) and alkalinity ( Figure 4B). Woesearchaeota were the most prominent archaeal lineage present in the lake/pond (DWP, TLU, and TLM) samples. This novel archaeal phylum was first detected in deep sea hydrothermal vents (DHVEG-6, Takai and Horikoshi, 1999), but has since then been shown to be wide spread in aquatic environments. Woesearchaeota were for example found to be frequent and abundant archaeal inhabitants of oligotrophic, high-altitude lake water in the Pyrenees (Ortiz-Alvarez and Casamayor, 2016). Ortiz-Alvarez and Casamayor (2016) also suggested that the Woesearchaeota preferred environmental conditions that also promoted high bacterial diversity, which might explain why the organic carbon, total N and inorganic carbon concentrations affected the archaeal community in this habitat. Interestingly, the Woesearchaea live in aquatic habitats covering a pH range of at least 4.4 -10.1 as reported by Ortiz-Alvarez and Casamayor (2016). Castelle et al. (2015) showed in a study of single cell genome re-construction that the Woesearchaea have very small genomes completely lacking genes for several common metabolic pathways, such as glycolysis/gluconeogenesis, and appears to have only partial pathways for the biosynthesis of nucleotides and amino acids. It has been proposed that this group of archaea live symbiotic/parasitic lives. Between 33.4 and 40.4% of the archaeal sequences in the lake/pond samples belonged to yet unidentified archaea. Due to novel high throughput sample handling and sequencing techniques the number of genomes of uncultured archaeal lineages are expanding almost exponentially (Adams et al., 2014). Thus, the genes contained in these new genomes as well as the functions and metabolic properties or phylogenetic affiliations of the genomes of the uncultured archaea may not yet be defined, which is reflected in the databases. Due to their novelty, Woesearchaea have not yet been included in the latest version of the FAPROTAX database.

Fungi
The fungal community did not appear to be affected by organic or inorganic carbon concentrations, nor by nitrogen concentration, only by salinity and sulfate concentrations ( Figure 4C). The main fungal component in the lake/pond (DWP, TLU, and TLM) samples belonged to the zoospore-forming Chytridiomycota. These fungi are typical aquatic inhabitants and are important decomposers of recalcitrant material, thus contributing to the nutrient cycle by releasing more easily digestible compounds from e.g., plant cell walls to the general microbial community. In addition, the DWP contained a high proportion of saprobic Ramicandelaber fungi belonging to the Zygomycetes phylum (Ogawa et al., 2001). These fungi were quite recently described and are generally found in soil, but now also in aquatic habitats. It is possible that the DWP, being small and shallow, is more profoundly influenced by the surrounding soil than the Talik lake, thus having a surplus of Ramicandelaber. However, the active layer melt water infiltrating into the Talik lake (ITL) did not have a high relative abundance of these fungi, which suggests that the Ramicandelaber is not originating from the soil. Or, it may also be possible that the soil surrounding the DWP differs from the soil in the vicinity of the Talik lake. The Talik lake environment have been glacier free for longer than the DWP, which may affect the chemical composition of the soil, which could be less favorable for the Ramicandelaber fungi compared to the younger soil surrounding the DWP.
The lake/pond samples had the highest concentration of bacterial 16S rRNA and fungal 5.8S rRNA genes but the lowest cfu concentrations. The TLU and TLM also showed among the lowest number of utilized substrates in the Biolog AM tests.

Bacteria
The bacterial community of the deep groundwater (DH-GAP01, DH-GAP04.up/.mid/.low) consisted mostly of Firmicutes (37.2 -94.3%), which in contrast were almost absent in the surface water samples. The Desulfosporosinus genus had the highest relative abundance, with exception of DH-GAP04.low, contributing with up to 61% of the bacterial 16S rRNA gene sequences in these samples. The other significant firmicutes group belonging to the SRB2 cluster of the Thermoanaerobacterales family, contributed with 13.3 -37.2% of the bacterial sequences in DH-GAP04 samples, but was absent in DH-GAP01. Desulfosporosinus species are known sulfate reducers with the capacity to also reduce Fe(III), nitrate, elemental sulfur ant thiosulfate (Pester et al., 2012;Sánchez-Andrea et al., 2015). The Thermoanaerobacterales SRB2 cluster is also highly likely sulfate reducers, although many Thermoanaerobacterales species do not reduce sulfate (e.g., Slobodkina et al., 2017). Desulfosporosinus species also have a broad range of possible electron donors and may even turn to fermentation in the absence of reducible electron acceptors (Sánchez-Andrea et al., 2015). The FAPROTAX predictions supported this as the Desulfosporosinus were the predominant bacteria deemed capable of fermentation, sulfate and sulfur compound respiration (Figure 6A and Supplementary Table S2). This makes the Desulfosporosinus very versatile bacteria able to survive in the harsh conditions of the arctic deep subsurface. Nixon et al. (2017) showed that Desulfosporosinus were important iron reducers in sub glacial arctic sediments, and this taxon appears to be wide spread in arctic anaerobic environments around the globe (Blanco et al., 2012). However, the FAPROTAX predictions did not show iron reduction for the Desulfosporosinus, neither did the deep subsurface community appear to be influenced by iron ( Figure 4A). Instead, the samples containing the highest relative abundance of Desulfosporosinus were also most affected by the concentration of sulfate and total Sulfur.

Archaea
The archaea in the deep subsurface environments contributed with approximately 1% of the 16S rRNA gene pool. The main archaeal groups detected were the Ianarchaeum (DH-GAP01, DH-GAP04.mid), the Woesearchaeota (mainly DH-GAP04.up/.mid/.low) and the Bathyarchaeota (mainly DH-GAP04.mid). Ianarchaea, like the Woesearchaea, are minimal microbial cells dependent on other microbial species to support their growth (Golyshina et al., 2017). Nanorchaeota have been shown to grow together with specific Thermoplasma species in samples and cultures derived from acidic streamers, but in the deep Greenlandic subsurface groundwater the Nanorchaeota did not co-occur with any Thermoplasma. Due to the low abundance of archaea in these habitats, it is possible that these interactions were not seen because we worked close to the detection limit of the archaea. Bathyarchaeota, found in the DH-GAP04.mid and DH-GAP04.low samples have recently been described as a new archaeal phylum, formerly known as the Miscellaneous Crenarchaeotic Group, MCG (Evans et al., 2015). Based on single cell genome analyses this group of archaea have the necessary genes for methanogenesis, revealing that methanogenesis may be more widely spread among archaeal groups than previously thought. In addition, the fact that these archaea live in deep groundwater with an ambient temperature of 0-2 • C may also suggest psychrophilic methanogenesis by these archaea.

Fungi
The fungal community in the deep subsurface samples was almost negligent. It should be considered, however, that the qPCR standard used for the estimation of fungal biomass is based on number of spores, and that spores may have several rRNA gene operons per genome. Nevertheless, the fungal communities were most affected by salinity and sulfate concentrations ( Figure 4C).
Cladosporium was present in all samples (5.9 -12.0 %), except DH-GAP04.low. This fungus is commonly found as a constituent of indoor mold communities and as parasites and pathogens of fungi and plants, respectively. Because the fungal community in the deep subsurface samples were very scarce, it is possible that the Cladosporium sequences in these samples are derived from contaminants. Nevertheless, these samples also contained different kinds of fungi previously detected in arctic environments, such as the Helotiales (Walker et al., 2011), Phaeococcomyces black yeast growing on rock surfaces (Gadd, 2007) and a variety of decomposer fungi (Singh and Singh, 2012). Despite their low abundance in the deep subsurface water, these fungi may be important decomposers in biofilms on rock surfaces in the subsurface environments. In addition, fungi weather rock releasing important nutrients for the use by the rest of the microbial community. Fossils of filamentous fungal structures have recently been revealed from deep subsurface rock . The heterotrophic anaerobic fungi inhabiting the anoxic deep subsurface release hydrogen through their metabolism, thus also supporting other microbial growth (Drake and Ivarsson, 2018). It has also been shown in igneous crystalline deep rock environments that there is a coupling between the anaerobic, rock-weathering fungi and sulfate reducing bacteria (Drake et al., 2017).
The deep subsurface waters had (together with ICE and SGR) the lowest number of bacterial and archaeal 16S rRNA and fungal 5.8S rRNA genes in this study, with the exception of the number of archaea in DH-GAP04.up and DH-GAP04.mid, where the archaeal number was approximately the same as in ISR and ITL. In contrast to the low gene copy numbers, the highest numbers of bacterial colonies were detected in DH-GAP04.up and DH-GAP04.mid, despite these samples being anoxic. The microbial communities hydrolyzed between 33 and 53 of the 95 substrates provided on the Biolog plate, similarly to the lake/pond communities. In addition to the sulfate reducing bacteria and the fastidious archaea, a subpopulation of heterotrophic aerobic bacteria inhabited this ecosystem.

Bacteria
The glacial ICE had a surprisingly high number of bacterial 16S rRNA gene copies, 4.8 × 10 4 mL −1 . The population consisted mostly of Cyanobacteria, more exactly to Chloroplasts, which may originate from ice algae. Tanaka et al. (2016) found a large population of green algae and cyanobacteria in Siberian glaciers and showed that the biomass of algae increased with increasing temperature. Algae and cyanobacteria are primary producers, i.e., produce carbon compounds in their photosynthesis. In addition, these microorganisms fix atmospheric nitrogen. The produced carbon and nitrogen compounds are released in to the melt water and may be carried long distances from the point of primary production. Kohlbach et al. (2016) showed that the substrates produced by ice algae sustain ecological (macrobiota) key species in the Arctic Ocean, where the ice algae contribute with up to 25% of the primary produced carbon. This carbon strongly affects food webs in this ecosystem. In our study, the effect may be seen in the increase of microbial 16S rRNA gene copy numbers in the MWR and ISR, which are down flow from the glacier.
Proteobacteria was the most abundant bacterial phylum in the SGR, MWR and ISR, with beta-and gammaproteobacterial types as the most common. Psychrotrophic Polaromonas (1.1 -7.4%) (Mattes et al., 2008;Darcy et al., 2011) and iron and manganese reducing Albidiferax (4.8 -9.0%) (Lu et al., 2013;Akob et al., 2014;Kaden et al., 2014) were detected in all samples. The MWR and ISR also contained iron reducers belonging to the Rhodoferax (1.2 -4.2%) (Kaden et al., 2014), some of which have been shown to be psychrotolerant and facultatively anaerobic (Finneran et al., 2003). This is well in agreement with the chemical parameters, as the bacterial communities in MWR, ISR and ITL were most strongly affected by the concentration of iron ( Figure 4A). The Rhodoferax also contributed the majority of the phototrophy predicted to be present in the ISR and ITL (Figure 6A and Supplementary Table S2). Interestingly, the MWR and the ISR had a high relative abundance of methylotrophic bacteria belonging to the Methylotenera (6.3 -8.8%) and Methylobacter (10.7 -17.0%). Methylobacter species frequently been detected in arctic environments (Wartiainen et al., 2006). Although generally aerobic, Methylobacter species have been shown to perform anaerobic methane oxidation in subarctic lake sediments and to closely associate with iron reducers (Martinez-Cruz et al., 2017) and Methylotenera have also been shown to perform methane oxidation in arctic lake sediment (He et al., 2012). In addition, ISR had 5.4% of the 16S rRNA gene sequences belonging to Crenotrix, which have recently been shown to oxidize methane aerobically, but also anaerobically while reducing nitrate (Oswald et al., 2017). The FAPROTAX predictions also identified the Methylotenera, Methylobacter, and Crenotrix as the main methylotrophs in these samples ( Figure 6A and Supplementary Table S2). Undibacterium, a genus ubiquitously found in aquatic  and soil (Li et al., 2017) environments was detected at high relative abundance (20.6%) only in SGR. Massilia, which contributed with a significant portion (12.7%) of the bacterial 16S rRNA gene pool in SGR, have been found in soil environments and especially as colonizers of plant roots (Ofek et al., 2012). Massilia could also play a significant role in the cycling of nitrogen compounds in the melting surface environment on glaciers, as they were predicted to be the main ureolytic microorganisms in the SGR (Figure 6A and Supplementary Table S2). Urea may be released from the decomposition of nitrogenous organic matter (Berman et al., 1999), which in this case could originate from the abundant Cyanobacteria found in the ICE. Our results indicate a more widespread distribution of habitats for this bacterium.
The MWR and ISR contained a high relative abundance of Parcubacteria (9.2 -17.2%), which is a novel bacterial superphylum. These bacteria have reduced genome sizes, leading to limited metabolic capabilities, and they strongly rely on fermentation for their energy demand (León-Zayas et al., 2017). They are adapted to oxidative stress and they have genes for nitrate reduction. Due to their reduced metabolic capabilities in addition to the presence of genes for several adhesion and attachment proteins, it is assumed that they lead either symbiotic or parasitic lifestyles, but the assumption leans more toward symbiotic (Nelson and Stegen, 2015). The Bacteroidetes and Actinomycetes communities in these samples consisted of similar genera as in the lake/pond samples and were responsible for the chemoheterotrophic cycling of nutrients ( Figure 6A and Supplementary Table S2). In the ITL samples, Parcubacteria dominated the bacterial communities with a relative abundance of up to 62.0%. The Proteobacteria were the second most abundant bacterial phylum, and also in ITL, Methylobacter contributed with 6.4% of the bacterial 16S rRNA gene sequences, indicating their important role in the oxidation of methane and methylated compounds in this environment (Figure 6A and Supplementary Table S2).

Archaea
The ICE had hardly any archaea according to the qPCR and only a few sequences from one ICE sample replicate, all belonging to the Nitrosoarchaeum, were obtained. The SGR had 2.5 × 10 2 mL −1 archaeal 16S rRNA gene copies consisting almost exclusively of methanogens belonging to the Methanosaeta (37.3%) and Methanospherula (58.7%). The MWR had the highest abundance of Archaea measured in this study, with up to 1.2 × 10 4 mL −1 archaeal 16S rRNA gene copies. The ISR had 1 order of magnitude less archaea, but the archaeal community structure in the MWR and ISR were very similar. The majority of the archaea belonged to the earlier mentioned Woesearchaeota. Methanogens belonging to the Methanoregula (17.5 -19.1%) and Methanosaeta (3.3 -5.1%) were also prominent. However, ammonia oxidizing Nitrosoarchaeum (Li et al., 2016) (5.9 -8.2%) and anaerobic methane oxidizing Methanoperedens (formerly known as ANME-2D archaea) contributed with 5.2 -7.3% of the archaeal 16S rRNA gene sequences. Putative methanogenic Bathyarchaeota (Evans et al., 2015) were also present in these samples (5.8 -6.9%). This further implies that the methane and nitrogen cycling microbiota are important parts of the microbial communities of the melting arctic ( Figure 6B and Supplementary Table S3). The ITL archaeal community was similar to that of the lake/pond samples.

Fungi
The fungal community in the ICE sample consisted of only 7.9 × 10 1 mL −1 spore equivalents. In the SGR, MWR, ISR and ITL the fungal communities were over 10 times larger. Yeasts, such as Cryptococcus and Rhodotorula (Basidiomycetes) have previously been found in subglacial ice of high Arctic glaciers (Butinar et al., 2007), and the authors reported up to 4.0 × 10 3 cfu mL −1 melt ice. Other yeasts, such as Microbotryomyces, were also found in these samples. Genera of the Glomeraceae were abundant and found exclusively in this group of samples. This phylum contains fungi that grow only in symbiotic mycorrhizal relationships with plants. They may have ended up in these environments as spores or sloughed off from plant roots from the surrounding terrestrial habitats. The rust fungus Cronartium was only detected in ITL, probably because the melt water from the active layer of the permafrost runs through shrub vegetated soil, from where the rust has infected the plants. Fungi belonging to the Anaeromyces were found only in the SGR, MWR, ISR, and ITL samples, but only at low relative abundances. Nevertheless, this fungus is obligately anaerobic (Breton et al., 1990), but in our samples found in aerobic habitats.
The number of cfu in the SGR, MWR, ISR and ITL samples were in the range of the deep subsurface samples, but the number of bacterial 16S rRNA gene copies and fungal 5.8S rRNA gene copies were 1 order of magnitude higher, with exception for the bacteria in the SGR. The ISR and ITL microbial communities hydrolyzed the highest number of substrates in the Biolog AM test, indicating a high metabolic variety of the microbial communities in these samples. The lack of autotrophically growing bacteria also support the dominance of heterotrophic strategies in the high Arctic aquatic systems.

SUMMARY
The microbial communities in the different habitat types differed both in taxonomical and metabolic characteristics. The most important microbial process in the lake/pond environment at the time of sampling was chemoheterotrophy. These heterotrophic bacteria have key positions in the degradation of organic matter and nutrient cycling in oligotrophic aquatic environments. Nitrogen fixation capacity was predicted to be low, according to the metabolic predictions, indicating that the bioavailable nitrogen is cycled through the breakdown of biomass, but may also originate from geological deposits in the host rock. The primary producing phytoplankton are also important key species because they provide photosynthetically produced carbon for the use of the rest of the microbial community. The relatively high abundance of fungi in the lake/pond environments also indicate that dead biomass needs to be recycled. Methane released from the anaerobic sediments in the lake/pond environment is oxidized by the methylotrophic Verrucomicrobia.
The deep subsurface biosphere functions mostly through sulfate or iron reduction or fermentation. Thus, the both the microbial community composition as well as the metabolic profiles of the communities in the deep subsurface differ from the microbial communities of the surface biosphere.
The algae of the glacial ice are important primary producers for the whole aquatic system as the ice melts. These algae release photosynthetically produced carbon compounds and nitrogen compounds from their nitrogen fixation, which is then transported away from the glacier. They may also function as a nitrogen source in the melt water through biomass breakdown. The melt water streams are rich in methane oxidizing bacterial and archaeal types, which may function together with iron reducing bacteria when oxidizing methane. The high relative abundance of methane oxidizers and iron reducers in the melt water streams may thus decrease the general methane emissions from arctic aquatic environments.
Most of the archaea and some of the bacteria detected in this study affiliated with newly described groups, which have extremely small genomes. It is still debated whether these microorganisms are symbionts or parasites. However, their high abundance in cold arctic aquatic environments may mean that they are symbionts. By serving as symbionts to larger microorganisms, these micro-archaea and -bacteria may provide their hosts with specific enhancement or advantage for better survival in these conditions. These small genomesize microorganisms appear to have in common the capacity for fermentation coupled with the release of hydrogen, which may aid the host.
Fungi and yeast degrade organic macromolecules in arctic conditions by secretion of cold-adapted enzymes. They also weather rock, which releases important mineral nutrients, and release hydrogen, which is a preferred energy source for many microorganisms. Thus, fungi contribute strongly to nutrient cycling in these environments.

CONCLUSION
In conclusion, our results demonstrate that the different water types (lake/pond, deep subsurface, ice and melt water) differ from each other both by chemical and biological measurements, despite the fact that a great part of the waters originate from glacier melt water. The infiltration of water from one habitat type to the other, does not affect the microbial communities as much as the environmental conditions of the habitat do. For example, the harsh conditions in the subsurface below the permafrost does not support survival of most of the aerobic surface microorganisms. However, some cultivable aerobic microorganisms survive in the anaerobic deep subsurface, although in the deep subsurface conditions, they form a minority of the microbial community.

AUTHOR CONTRIBUTIONS
MB, AK, and TL conceived the study and performed the sampling. MB did the microbiology work and analyses. AK and TL were responsible for the chemistry data. LC gave valuable background information of the site and prepared the deep boreholes for sampling. MB wrote the manuscript. AK, TL, and LC commented on the manuscript.

FUNDING
This work was supported by the Academy of Finland (MB, Grant No. 261220), Posiva Oy and Svensk Kärnbränslehantering AB.