Genome-Centric Analysis of Microbial Populations Enriched by Hydraulic Fracture Fluid Additives in a Coal Bed Methane Production Well

Coal bed methane (CBM) is generated primarily through the microbial degradation of coal. Despite a limited understanding of the microorganisms responsible for this process, there is significant interest in developing methods to stimulate additional methane production from CBM wells. Physical techniques including hydraulic fracture stimulation are commonly applied to CBM wells; however, the effects of specific additives contained in hydraulic fracture fluids on native CBM microbial communities are poorly understood. Here, metagenomic sequencing was applied to the formation waters of one hydraulically fractured and several non-fractured CBM production wells to determine the effect of this stimulation technique on the in-situ microbial community. The hydraulically fractured well was dominated by two microbial populations belonging to the class Phycisphaerae (within phylum Planctomycetes) and candidate phylum Aminicenantes. Populations from these phyla were absent or present at extremely low abundance in non-fractured CBM wells. Detailed metabolic reconstruction of near-complete genomes from these populations showed that their high relative abundance in the hydraulically fractured CBM well could be explained by the introduction of additional carbon sources, electron acceptors, and biocides contained in the hydraulic fracture fluid.


INTRODUCTION
Over the last decade, coal bed methane (CBM) has emerged as an important resource for meeting rising global energy demands. It is anticipated that consumption of natural gas will grow by 1.5% each year until 2040, the fastest growth of any fossil fuel resource (U.S. Energy Information Administration, 2013). CBM is generated through biotic and abiotic processes; however, analysis of methane isotopic compositions from CBM reservoirs worldwide suggests that the majority of methane is derived from microbial activity, especially at shallow depths (Scott, 2002; Strąpoć et al., 2011; Golding et al., 2013). Despite its economic importance, our understanding of the microbial communities responsible for the conversion of coal to methane is limited, hampering our ability to engineer strategies for stimulating native microbial communities to produce additional methane.
To extract CBM, a vertical well is drilled 200-1000 m into a coal bed. Water and gas are simultaneously extracted from the well and the gas is separated from the water at the surface. In cases where the natural permeability of the coal does not allow for economical rates of extraction, stimulation techniques such as hydraulic fracture are commonly applied. Hydraulic fracture involves the injection of a fluid mixture into the well at high pressure to fracture the coal (Australian Department of the Environment, 2014). The hydraulic fracture fluid mix often contains biocides to inhibit the growth of undesirable microorganisms, namely sulfate reducers, which may cause corrosion of the well bore. Flow paths created by the new fractures are held open by a proppant (e.g., sand, ceramic, or walnut husks) contained in the fracturing fluid. A gelling agent, typically a polysaccharide polymer, is commonly included in the hydraulic fracture fluid to suspend the proppant and ensure that it disperses evenly within the seam. In order to remove the fracturing fluid from the well, a breaker (e.g., hydrogen peroxide, diammonium peroxydisulfate, or a hemicellulase enzyme) is added to the well to depolymerize the gelling agent. Once the fracturing fluid is removed from the well, a production pump is installed at the wellhead to begin dewatering of the CBM well.
Here, community profiles for 11 wells from across the Surat Basin were subjected to metagenomic sequencing and characterization to identify strategies to enhance CBM production. One well, PK-28, had been subjected to hydrofracture stimulation and showed clear differences in community composition from the other wells sampled. The use of additives such as gelling agents, breakers, and biocides in the hydraulic fracture process is commonplace, but it is unknown how these additives may affect CBM community structure. Sugar polymer-based gelling agents and sulfate-based breakers may enable the growth of microorganisms capable of using these compounds, while additives such as biocides are likely to select for specific microbial populations. Metabolic reconstruction of microbial populations enriched in the PK-28 well strongly suggests that this shift in community composition is the result of exposure to hydrofracture fluid additives.

MATERIALS AND METHODS
Sample Collection
Biomass was collected from 11 previously characterized CBM production wells in the Surat Basin, Australia for metagenomic characterization (Evans et al., 2015). Water chemistry and isotope measurements were also collected for comparison and are described in detail by Baublys et al. (2015). Prior to microbial sampling, temperature, pH, and conductivity were measured using an Accumet multimeter (Fisher Scientific model 13636AP85). When these readings stabilized (∼10-20 min), between 10 and 50 l of production water were filtered through two sequential 142 mm stainless steel filter housings (#YY3014236, Millipore, MA, USA) containing a 20 µm polypropylene prefilter followed by a 0.22 µm nitrocellulose filter. Both filters were folded aseptically, placed into separate falcon tubes, and frozen on dry ice for transport back to the laboratory.

DNA Extraction, Sequencing, and Binning
Metagenomic libraries were prepared using the Illumina Nextera XT DNA Sample Preparation kit and sequenced on two-fifths of a lane on the Illumina HiSeq2000 platform in rapid mode (2 × 100 bp paired end; 500 bp fragment size), producing an average of 4.1 Gb of paired-end data for each sample. Adapter clipping and merging of overlapping reads were performed using SeqPrep v2013-12-17 (https://github.com/jstjohn/SeqPrep). Nesoni v0.99 (https://github.com/VictorianBioinformaticsConsortium/nesoni) was used to remove homopolymers, quality trim bases with a Phred score <20, and discard trimmed reads ≤30 bp. Assembly of the metagenome was performed using CLC Genomics Workbench v6.5 with default parameters.
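The trimming thresholds described above (Phred score <20, discard reads ≤30 bp) can be illustrated with a minimal sketch. This is not Nesoni's actual algorithm, which is not described in the text; it is a simple end-trimming routine under the assumptions that qualities are Phred+33 encoded and that only low-quality bases at the read ends are removed. The function names `phred_scores` and `trim_read` are hypothetical.

```python
def phred_scores(quality_string, offset=33):
    """Decode an ASCII quality string into Phred scores (Phred+33 assumed)."""
    return [ord(ch) - offset for ch in quality_string]


def trim_read(seq, quality_string, min_quality=20, max_discard_len=30):
    """Trim low-quality bases from both ends of a read.

    Returns (trimmed_seq, trimmed_quality), or None when the trimmed read
    is max_discard_len bases or shorter (i.e. reads <=30 bp are discarded).
    Illustrative only: this is not the Nesoni implementation.
    """
    scores = phred_scores(quality_string)
    start, end = 0, len(seq)
    # Advance past low-quality bases at the 5' end.
    while start < end and scores[start] < min_quality:
        start += 1
    # Retreat past low-quality bases at the 3' end.
    while end > start and scores[end - 1] < min_quality:
        end -= 1
    trimmed = seq[start:end]
    if len(trimmed) <= max_discard_len:
        return None
    return trimmed, quality_string[start:end]
```

A read whose high-quality core is 35 bp long is kept, while one with a 30 bp core is discarded, matching the "≤30 bp" rule stated in the text.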
Microbial community profiles for all metagenomes were generated by identifying sequencing reads from the 16S rRNA gene and mapping them to the Greengenes database using CommunityM (https://github.com/dparks1134/CommunityM) at a 97% threshold to define OTUs. Binning of the PK-28 metagenome was carried out using DBB v1.0.0 (https://github.com/dparks1134/DBB), which recruits scaffolds into population genomes based on similarity in GC-content, coverage, and tetranucleotide frequency. Genome completeness and contamination were estimated using the CheckM v0.9.6 lineage-specific workflow with default parameters.
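Two of the sequence features used for binning, GC-content and tetranucleotide frequency, can be computed per scaffold as in the following sketch. It only illustrates the features themselves and does not reproduce how DBB combines them with coverage. The function names are hypothetical, and the sketch assumes that ambiguous bases are ignored and that reverse-complement tetranucleotides are not merged.

```python
from collections import Counter
from itertools import product


def gc_content(seq):
    """Fraction of G and C among unambiguous bases (0.0 if there are none)."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / acgt


def tetranucleotide_frequencies(seq):
    """Relative frequency of each of the 256 tetranucleotides in a scaffold.

    Windows containing ambiguous bases (e.g. N) are skipped. Illustrative
    only: reverse complements are counted separately here.
    """
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=4)]
    counts = Counter(seq[i:i + 4] for i in range(len(seq) - 3))
    total = sum(counts[k] for k in kmers)
    return {k: (counts[k] / total if total else 0.0) for k in kmers}
```

Scaffolds from the same population genome are expected to give similar values for both features, which is the property that composition-based binning relies on.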

Statistical Analysis
A heatmap showing the relative abundances of all OTUs present at a minimum of 1% in at least one sample was generated using STAMP (Parks et al., 2014). All statistical analyses were performed in R v3.1.2 (R Core Team, 2013). Differences in OTU composition were further explored through principal components analysis of Hellinger transformed OTU relative abundances (Legendre and Gallagher, 2001) using the CRAN package vegan (Dixon, 2003).
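The Hellinger transformation applied before ordination takes the square root of each OTU's relative abundance within a sample. The sketch below shows the transformation in plain Python rather than with the R package vegan used in the study. The function names are hypothetical, and the input is assumed to be a samples-by-OTUs table of non-negative abundances.

```python
import math


def hellinger_transform(table):
    """Hellinger-transform a samples-by-OTUs abundance table.

    Each value becomes sqrt(abundance / sample_total), so the squared
    values in every row sum to 1.
    """
    transformed = []
    for row in table:
        total = sum(row)
        if total == 0:
            raise ValueError("sample has no counts")
        transformed.append([math.sqrt(value / total) for value in row])
    return transformed


def euclidean_distance(a, b):
    """Euclidean distance between two transformed samples.

    On Hellinger-transformed rows this is the Hellinger distance, the
    dissimilarity that a principal components analysis of the
    transformed data preserves.
    """
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
```

Because the transformation works on within-sample proportions, two samples with the same composition but different sequencing depth map to the same point, which is one reason it suits abundance profiles derived from metagenomes of unequal size.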

Phylogenetic Identification of Population Genomes
In order to determine the phylogenetic affiliation of each metagenome bin, an approximate maximum-likelihood phylogenetic tree was constructed using FastTree v2.1.7 (Price et al., 2010) from a concatenated set of 83 bacterial single-copy marker genes (Soo et al., 2014) extracted from all PK-28 population genomes ≥70% completeness with ≤10% contamination as well as all IMG v4.0 genomes (Markowitz et al., 2012). Single-copy marker genes were identified and extracted from genomes using HMMER v3.1 (Finn et al., 2011). True maximum-likelihood trees were then re-inferred with RAxML (Stamatakis, 2006) including only IMG genomes of interest from 100 bootstrap replicates, using the PROTGAMMAWAG substitution model.
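The quality thresholds used to select population genomes (≥70% completeness, ≤10% contamination) can be illustrated with a simplified estimate based on single-copy marker genes. This is a sketch under stated assumptions, not the CheckM method: CheckM uses lineage-specific, collocated marker sets, whereas here completeness is simply the percentage of markers found at least once and contamination is the percentage of extra copies. All names are hypothetical.

```python
from collections import Counter


def estimate_quality(marker_hits, marker_set):
    """Return (completeness %, contamination %) from single-copy marker hits.

    marker_hits: iterable of marker names detected in a bin, repeated once
                 per detected copy.
    marker_set:  collection of expected single-copy marker names.
    Simplified illustration; CheckM's actual estimate is more involved.
    """
    counts = Counter(hit for hit in marker_hits if hit in marker_set)
    n_markers = len(marker_set)
    present = sum(1 for marker in marker_set if counts[marker] >= 1)
    extra_copies = sum(counts[m] - 1 for m in marker_set if counts[m] > 1)
    return 100.0 * present / n_markers, 100.0 * extra_copies / n_markers


def passes_thresholds(completeness, contamination,
                      min_completeness=70.0, max_contamination=10.0):
    """Apply the >=70% completeness and <=10% contamination cut-offs."""
    return completeness >= min_completeness and contamination <= max_contamination
```

For a hypothetical set of ten markers, a bin containing eight of them with one duplicated would score 80% completeness and 10% contamination, and so would just pass both cut-offs.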
Maximum likelihood trees were also constructed with RAxML from 16S rRNA gene sequences recovered from previous studies using the GTRGAMMA substitution model. For one recovered population, Aminicenantes-PK28, the 16S rRNA gene tree was constructed for the Aminicenantes phylum from 100 bootstrap replicates using near-full length (>1400 bp) 16S rRNA genes recovered in previous studies (Rinke et al., 2013; Farag et al., 2014; Gies et al., 2014; Sharon et al., 2015). The 16S rRNA gene sequences obtained from Sharon et al. (2015) were mined from a metagenome where an Aminicenantes population genome was recovered, but the 16S rRNA gene and population genome could not be linked (Sharon et al., 2015). Consequently, all four recovered 16S rRNA gene fragments identified in the metagenome were included in the tree. A 16S rRNA gene fragment (∼250 bp) extracted from the Aminicenantes-PK28 population genome was placed into the full length reference tree by parsimony insertion in ARB (Ludwig et al., 2004). Members of the phylum Acidobacteria were used as an outgroup based on a previous analysis showing this phylum to be a sister group to the Aminicenantes (Rinke et al., 2013).

RESULTS
CBM Formation Water Sampling and Community Profiling from Metagenomes
Metagenomic datasets averaging 4.1 ± 0.6 Gb of paired-end data were generated for formation waters collected from 11 CBM wells located in the Surat Basin, Queensland, Australia (Figure 1; Table 1). One of these wells, PK-28, had been subjected to hydraulic fracture stimulation in September of 2011. The hydraulic fracture fluid was injected and removed after ∼2 weeks. However, no gas or water was extracted from the PK-28 well until July 2013, approximately 4 months prior to sampling. In order to identify differences in the microbial community composition of the hydraulically fractured and non-fractured wells, community profiles for each formation water sample were generated by classifying 16S rRNA gene sequences from the metagenomic datasets (Figure 2). Operational taxonomic units (OTUs) from the actinobacterial order OPB41 (2-30%) and methanogens from the Euryarchaeotal family Methanobacteriaceae (0-39%) were typically dominant in all wells. In contrast, the PK-28 microbial community was dominated by OTUs belonging to the Planctomycetes class Phycisphaerae (9%), the candidate phylum Aminicenantes order OPB95 (11%), the actinobacterial order OPB41 (10%), and hydrogenotrophic methanogens from the family Methanobacteriaceae (11%). Comparison of the PK-28 community composition to that of the other wells using principal components analysis showed that PK-28 clustered away from the other wells, indicating that its overall microbial community was atypical compared to the rest of the basin (Figure 3). The difference in the PK-28 community composition was primarily driven by the Aminicenantes and Phycisphaerae populations. The Aminicenantes were identified only in wells BB-3, WP-3, and BV-9, while the Phycisphaerae were identified in all wells other than WP-3 and BV-9. However, both groups only reached an abundance of >0.1% in PK-28. Wells WP-3 and AG-13 also appeared to be somewhat atypical compared to the rest of the basin (Figure 3). These wells showed higher abundances of thermophilic populations from the family Thermodesulfovibrionaceae and genus Methanothermobacter, as well as a higher abundance of the class OPB41.

PK-28 Population Genome Binning
De novo assembly of the paired-end data for PK-28 produced 52,312 scaffolds ≥500 bp with an N50 value of 3831 bp. A total of 11 population genomes with ≥70% completeness and ≤10% contamination were obtained by partitioning scaffolds based on GC-content, tetranucleotide frequency, and coverage (Table 2). These genomes span the majority of dominant populations identified in the 16S rRNA gene community profile, with the exception of Caldiserica. The coverage of the population genomes generally matched the expected relative abundances, with coverage being highest for members of the family Methanobacteriaceae, followed by the Phycisphaerae and Aminicenantes. The Aminicenantes (Aminicenantes-PK28) and Phycisphaerae (Phycisphaerae-PK28) population genomes were targeted for detailed metabolic characterization to determine why these microorganisms were enriched in the hydraulically fractured well. Both the Aminicenantes-PK28 and Phycisphaerae-PK28 population genomes have been deposited in IMG under IDs 2593339135 and 2593339136, respectively.
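The N50 statistic reported for the assembly is the scaffold length at which the cumulative length of scaffolds, taken from longest to shortest, first reaches half of the total assembly length. A minimal sketch follows; the function name `n50` is hypothetical, and the example lengths in the note below are invented for illustration rather than taken from the PK-28 assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths.

    Scaffolds are sorted from longest to shortest and accumulated until
    the running total reaches at least half of the summed length; the
    length of the scaffold that crosses this point is the N50.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("no scaffold lengths given")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
```

For the toy lengths 2, 2, 2, 3, 3, 4, 8, and 8 (total 32), the two longest scaffolds already account for half of the assembly, so the N50 is 8.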

Phylogenetic Placement of Phycisphaerae-PK28 Population Genome
The approximate maximum-likelihood phylogenetic tree constructed with FastTree placed Phycisphaerae-PK28 within the Planctomycetes phylum next to Phycisphaera mikurensis (Fukunaga et al., 2009). A true maximum likelihood tree inferred using all IMG Phycisphaerae genomes confirms this placement (Figure 4A). In order to more precisely determine its taxonomic affiliation, a 16S rRNA gene tree was constructed (Figure 4B) from the full-length rRNA gene sequence from Phycisphaerae-PK28 and additional Planctomycete sequences obtained from the Greengenes database (Desantis et al., 2006). This analysis placed Phycisphaerae-PK28 in the candidate order MSBL9.

Phylogenetic Placement of Aminicenantes-PK28 Population Genome
The approximate maximum-likelihood phylogenetic tree constructed with FastTree placed Aminicenantes-PK28 within the candidate phylum Aminicenantes (Figure 5A). Three Aminicenantes genomes have been sequenced to date (Rinke et al., 2013; Sharon et al., 2015), but there are no cultured representatives of this lineage. Previous phylogenetic analysis of the Aminicenantes using 16S rRNA gene sequences (>800 bp in length) identified several putative subgroups within the candidate phylum, including four proposed classes and eight orders (Farag et al., 2014). Reconstruction of this phylogeny with the addition of 16S rRNA gene sequences from the three publicly available Aminicenantes genomes revealed that these genomes belong to two distinct orders, HMMV and SHA-124, within the class OP8-1 (Figure 5B). Parsimony insertion of a 16S rRNA gene fragment from the Aminicenantes-PK28 population genome places it within the order OPB95, within the proposed class OP8-1.

Carbon Metabolism
Differences in the PK-28 well community could result from the introduction of additional carbon sources in the hydraulic fracture fluid, enriching microorganisms best able to utilize the foreign organic matter. The vast majority of the fluid is made up of water and inorganic proppant. In addition, the galactomannan polymer guar was used as a gelling agent. In order to determine whether the introduction of the galactomannan contributed to the enrichment of the Aminicenantes and Phycisphaerae groups, the presence of genes for the utilization of galactomannan as a carbon substrate was examined (Figures 6, 7). All genes required for the endo-hydrolysis of the mannan backbone of galactomannan (endo-β-mannanase) were identified in Phycisphaerae-PK28 (Figure 6), but not Aminicenantes-PK28 (Figure 7), and included representatives of glycosyl hydrolase (GH) families 5 and 76. In contrast, genes for the hydrolysis of terminal mannose residues (i.e., β-mannosidase) were identified in both population genomes, including GH families 2 and 113. The presence of β-galactosidases from GH family 2 in both population genomes, and GH 16 in Phycisphaerae-PK28, suggests that both Aminicenantes-PK28 and Phycisphaerae-PK28 are able to cleave the galactose side groups from guar. Hydrolysed mannose and galactose residues are likely to be fed into glycolysis. For example, hexokinase and mannose-6P-isomerase in both microorganisms can be used to convert mannose to fructose-6P, an intermediate in glycolysis. In Aminicenantes-PK28, metabolism of galactose follows the Leloir pathway, whereby β-D-galactose is converted to UDP-glucose by galactose mutarotase, galactokinase, galactose-1-phosphate uridylyltransferase, and UDP-galactose-4-epimerase. Although neither galactose-1-phosphate uridylyltransferase nor UDP-galactose-4-epimerase was identified in Phycisphaerae-PK28, the presence of galactose mutarotase and galactokinase, as well as a sodium/galactose symporter, suggests that a route similar to the Leloir pathway is used to degrade galactose. In both microorganisms, the pyruvate generated through glycolytic degradation of mannose and galactose may be converted to acetyl-CoA by the action of pyruvate-ferredoxin oxidoreductase for use in a number of biosynthetic reactions. Alternatively, the presence of pyruvate-formate lyase in Phycisphaerae-PK28 suggests that pyruvate may instead be converted to formate. Although no specific mechanism for generating formate was found in Aminicenantes-PK28, putative genes for formate dehydrogenase (i.e., hydrogenase-3 and formate hydrogenlyase) were identified and could be used to convert formate to hydrogen and carbon dioxide as terminal products of fermentation.

FIGURE 2 | Heatmap of the relative abundance of community members (operational taxonomic unit; OTU) from each of eleven formation waters sampled for microbial community profiling. Each row represents an OTU clustered at 97% identity. Only OTUs present at ≥1% relative abundance in at least one sample are shown. Reads that did not match the reference database at ≥97% identity were designated as unmapped.

FIGURE 3 | PCA of Hellinger transformed OTU relative abundances for each formation water sample. Clustering of PK-28 away from the other wells appears to be driven by the abundance of Phycisphaerae-PK28 and Aminicenantes-PK28, which are not present at >0.1% in any other well (Figure 2). Plus signs represent individual OTUs and circles represent well samples.

Alternative Sugar Substrates
In general, both Aminicenantes-PK28 and Phycisphaerae-PK28 appear to be adapted to utilizing a variety of complex sugar polymers. An analysis of glycosyl hydrolases (GHs), carbohydrate binding modules (CBMs), carbohydrate esterases (CEs), and polysaccharide lyases (PLs) in all PK-28 population bins revealed that both Aminicenantes-PK28 and Phycisphaerae-PK28 contained higher proportions of carbohydrate active enzymes compared to other members of the PK-28 microbial community, suggesting that they are highly adapted to utilizing sugar polymers as a carbon and energy source (Table 3). Aminicenantes-PK28 and Ignavibacteriae-PK28 (population genome 1) also devoted a high proportion of their genomes to carbohydrate degradation. However, Ignavibacteriae-PK28 was not present at high proportion in the PK-28 microbial community (∼1.5%).

... that Phycisphaerae-PK28 is able to hydrolyse xylan. Xylose monomers liberated from this process can be converted by xylose isomerase (xylA), hexokinase, and ribulose-3P-epimerase (rpe) to D-ribulose-5P, an intermediate in the pentose phosphate pathway that can be directed into glycolysis.

FIGURE 5 | Maximum likelihood phylogenetic trees constructed from (A) 83 bacterial single-copy marker genes and (B) near-full length 16S rRNA gene sequences obtained from the sequence read archive (Farag et al., 2014), as well as from sequenced genomes from Rifle Creek (Sharon et al., 2015) and Sakinaw Lake (Gies et al., 2014). Only 16S rRNA gene sequences >1400 bp were included in order to ensure overlap in alignment with the short fragment from Aminicenantes-PK28. A dashed line is used to indicate that this sequence was inserted by maximum parsimony. This analysis places Aminicenantes-PK28 into the order OPB95 within the class OP8-1. White, gray, and black circles represent nodes with 70-80%, 80-90%, and >90% bootstrap support values, respectively.

Amino Acid Metabolism
Oligopeptide transporters are present in both Aminicenantes-PK28 and Phycisphaerae-PK28, and both microorganisms appear to be able to utilize select amino acids, such as glycine (glycine cleavage system), glutamate (glutamate dehydrogenase, gldh; and glutamine synthetase, gs), and aspartate (aspartate transaminase, ast). Additionally, genes encoding multiple proline transporters (ABC-type and proline permease) were identified in Aminicenantes-PK28. The presence of genes encoding pyrroline-5-carboxylate reductase (pcra) and aspartate transaminase (ast) suggests that proline is converted to glyoxylate and pyruvate. In Phycisphaerae-PK28, nearly all of the 21 peptidases identified were linked to cell signaling or the modification/maturation of specific proteins, rather than peptide degradation. In contrast, 36 of the 80 peptidases identified in Aminicenantes-PK28 are associated with the degradation of oligopeptides, including representatives from peptidase families M3, M14, M20, M28, M55, S14, S16, S41, S46, C1B, C69, and T1B. Interestingly, five genes encoding representatives of peptidase family M23, which is used to degrade the cell walls of other bacteria, were identified in Aminicenantes-PK28, suggesting a possible role in peptide scavenging from dead cells.

Nitrogen, Sulfur, and Oxygen Metabolism
In order to determine whether Aminicenantes-PK28 or Phycisphaerae-PK28 could carry out either aerobic or anaerobic respiration, the presence of genes for oxidative phosphorylation (electron transport cytochromes), dissimilatory sulfate and sulfite reduction (dissimilatory sulfite reductase; dsr), and dissimilatory nitrate and nitrite reduction (dissimilatory nitrate reductase, nar; or nitrite reductase, nrf) was examined. The absence of these genes suggests that neither Aminicenantes-PK28 nor Phycisphaerae-PK28 is able to respire using these electron acceptors. However, genes for the assimilatory acquisition of sulfur and nitrogen were identified. For example, genes for assimilatory sulfate reduction were present in both microorganisms, including sulfate adenylyltransferase (sat), adenylylsulfate kinase (cysC), phosphoadenylylsulfate reductase (cysH), and a putative assimilatory sulfite reductase (sir). Although sulfate may be present in low concentrations in coal strata, peroxydisulfate was also introduced as a breaker to depolymerize the guar gelling agent and may contribute to the cell sulfur pool. A full operon for an iron-molybdenum nitrogenase (nif) was identified, as well as a nitrogenase-associated rnf electron transport complex, suggesting that Phycisphaerae-PK28 is able to fix nitrogen.

FIGURE 7 | Metabolic reconstruction of the Aminicenantes-PK28 population genome. The sugar polymers galactomannan (guar) and polygalacturonan (pectin) may be degraded. Galactomannan is likely to be directed toward glycolysis for energy production. Although polygalacturonases were identified, it is not clear how Aminicenantes-PK28 processes galacturonate. Several peptidases involved in the degradation of oligopeptides were identified, including family M23, which is involved in the lysis of bacterial cells. These findings suggest that Aminicenantes-PK28 may scavenge non-viable cells. Abbreviations used: fumC, fumarate hydratase; sdh, succinate dehydrogenase; scs, succinyl-CoA synthetase; ogdc, oxoglutarate dehydrogenase complex; idh, isocitrate dehydrogenase; can, citrate hydro-lyase; cs, citrate synthase; mdh, malate dehydrogenase; pfor, pyruvate-ferredoxin oxidoreductase; ast, aspartate transaminase; gs, glutamate synthetase; gldh, glutamate dehydrogenase; gcs, glycine cleavage system; shmt, serine hydroxymethyltransferase; sdh, serine dehydrogenase; fdh, formate dehydrogenase.

Effect of Biocide
Kathon, a mixture of 5-chloro-2-methyl-4-isothiazolin-3-one and 2-methyl-4-isothiazolin-3-one, was included in the fracturing fluid to inhibit microbial growth. The chemical mechanism of this biocide is complex, but it is known to act by disrupting the cell membrane, cleaving thiol bonds, generating free radicals, and inactivating a number of key metabolic enzymes, including pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase, succinate dehydrogenase, NADH dehydrogenase, lactate dehydrogenase, and alcohol dehydrogenase (Williams, 2007). Of these enzymes, Aminicenantes-PK28 and Phycisphaerae-PK28 appear to contain only genes for 2-oxoglutarate dehydrogenase.

Water Chemistry and Isotopic Analysis
Geochemical parameters with the potential to influence microbial community structure were measured (Table 4) as part of a larger investigation into the geochemistry of Surat Basin CBM production waters (Baublys et al., 2015). Some wells were sampled at multiple time points as part of a time series, with one time point paired with samples for microbial analysis. Few systematic trends were evident across the basin, but carbonate tended to be lower in wells located in the western Surat Basin (avg. 1031) compared to the east (avg. 1787). The pH of the wells ranged from 7.6 in well WP-3 to 8.67 in CX-10. Most wells showed temperature values of ∼35 °C, with wells AG-13, AG-31, WP-3, and PK-28 reaching above 40 °C. Conductivity showed more variability, ranging from 4.40 mS in AG-13 to 13.41 mS in WP-3, indicating substantially higher salinity in that well. Consistent with this finding, WP-3 also shows the highest levels of sodium, chloride, potassium, magnesium, and total iron. As described by Baublys et al. (2015), trends within the isotopic data (Table 5) are primarily reflective of the region from which the water is derived. Consistent with the injection of additional water and carbon into the well, PK-28 shows a younger water age than any other well and a higher percentage of modern carbon.

DISCUSSION
Stimulation of additional biogenic methane from CBM production wells is likely to require a detailed understanding of the in situ microbial communities. Although a number of studies have characterized the microbial communities present in unperturbed CBM production wells, this is the first study to examine a CBM microbial community after hydraulic fracture stimulation. Clear differences in community composition were identified between PK-28 and wells that had not been exposed to hydraulic fracture additives. Metagenomic analysis revealed strong links between potential carbon substrates introduced in the hydraulic fracturing fluid and the metabolism of the dominant bacterial populations. These findings suggest that hydraulic fracturing has a marked effect on the composition and metabolism of CBM microbial communities. The most significant compositional difference between PK-28, the hydraulically fractured well, and the 10 other CBM production wells was the presence of representatives from the bacterial candidate phylum Aminicenantes (11%) and class Phycisphaerae (9%) within the phylum Planctomycetes. These were present at <0.5% relative abundance in all non-fractured wells. Metabolic reconstruction of the Aminicenantes-PK28 and Phycisphaerae-PK28 genomes revealed the presence of genes for the degradation of galactomannan (i.e., guar), a common additive in hydraulic fracture fluid. Orem et al. (2014) have shown that the organic constituents of hydraulic fracture fluid can persist in the coal bed for several months after the fluids have been removed. Therefore, as water had only been extracted from the well for ∼4 months, after having been allowed to incubate for 2 years, it is likely that galactomannan polymer still resided in the well (Struchtemeyer and Elshahed, 2012). In addition, estimates for the doubling time of microbes present in the deep subsurface biosphere under energy-starved conditions range from a few years to several millennia (Hoehler and Jørgensen, 2013; Onstott et al., 2014), suggesting that the microbial community structure of the well is likely to remain largely static for years after the galactomannan is removed. Genes for the endo-hydrolysis of galactomannan (i.e., endo-mannanases) were identified in the Phycisphaerae-PK28 genome, indicating that it is primarily responsible for the depolymerization of guar into short oligosaccharides. These genes were identified in only one other member of the PK-28 community, Ignavibacteriae-PK28, and it is unclear why this microorganism is not more abundant. However, we can speculate that Ignavibacteriae-PK28 was more susceptible to the biocide. The presence of β-galactosidases and β-mannosidases in the Aminicenantes-PK28 and Phycisphaerae-PK28 genomes suggests that both microorganisms are able to use mannose and galactose produced by galactomannan degradation. In Aminicenantes-PK28, a putative phosphotransferase system for the uptake of mannose was identified that could be used to import mannose into the cell. No such system was identified in Phycisphaerae-PK28. The application of a peroxydisulfate breaker to partially hydrolyse the gelling agent in the hydraulic fracture fluid prior to removal from the well was likely to release free mannose and galactose monomers for consumption, as well as cell sulfur for Phycisphaerae-PK28. The presence of genes encoding pyruvate-formate lyase in Phycisphaerae-PK28, and formate dehydrogenase in Aminicenantes-PK28, suggests that formate, hydrogen, and carbon dioxide are major end products of fermentation. This would provide an avenue for a syntrophic association with the dominant hydrogenotrophic methanogens in the PK-28 community belonging to the family Methanobacteriaceae. In support of this hypothesis, a previous 16S rRNA gene amplicon based analysis of the water column of Sakinaw Lake (Canada) showed a statistical correlation between the Aminicenantes and hydrogenotrophic members of the order Methanomicrobiales (Gies et al., 2014). The unique ability of Aminicenantes-PK28 and Phycisphaerae-PK28 to ferment galactomannan in syntrophic association with a hydrogenotrophic methanogen may have provided a selective advantage allowing these rare microorganisms to become enriched. Further support for the role of Aminicenantes and Phycisphaerae-PK28 in in-situ galactomannan degradation could be generated through the establishment of enrichment cultures seeded with CBM formation waters growing on galactomannan as a carbon substrate, potentially containing Kathon as a selective agent.

Reproduced with permission from Baublys et al. (2015). All concentrations are listed in mg/L. An asterisk indicates that the measurements were taken at the same time as samples for microbial analysis.
The ability to utilize a diverse array of polysaccharides, including guar-like polysaccharides, has been identified as a defining feature of Planctomycetes, which are found in a variety of environments, including fresh and marine waters, hot springs, soils, and hydrocarbon-contaminated environments (Yakimov et al., 2006; Abed et al., 2010, 2011; Lage and Bondoso, 2011; Tekere et al., 2013). Metabolic analysis of Phycisphaerae-PK28 showed that in addition to galactomannan, it has the potential to hydrolyse the xylose polymer xylan, as well as pectin. The ability to utilize complex sugars has been demonstrated previously in members of the Planctomycetes present within macroalgae-associated biofilms, and more specifically within the Phycisphaerae (Lage and Bondoso, 2014). For example, Algisphaera agarilytica, isolated from the surface of macroalgae, was shown to use agar, a galactose polymer, as a carbon source (Yoon et al., 2014), and Tepidisphaera mucosa, isolated from a hot spring, was shown to utilize pectin, galactomannan (i.e., locust bean gum), xylose, and galactose, but not xylan (Kovaleva et al., 2014). The Phycisphaerae-PK28 genome is consistent with these previous observations for members of the Phycisphaerae.

Reproduced with permission from Baublys et al. (2015). An asterisk indicates that the measurements were taken at the same time as samples for microbial analysis.
In contrast, very little is known about the ecology of the Aminicenantes, as no cultured representatives exist for direct characterization and metabolic analysis of the three publicly available genome sequences has been extremely limited (Rinke et al., 2013; Gies et al., 2014; Sharon et al., 2015). Efforts to isolate the Aminicenantes or genomically characterize representative taxa are hampered by their low abundance in most communities. Previous analysis of over 3100 16S rRNA gene amplicon datasets mined from NCBI's sequence read archive (SRA) showed that the Aminicenantes were present in a quarter of all datasets, but they did not exceed 1% relative abundance in >99% of the data sets examined (Farag et al., 2014). Although present at low relative abundance, the Aminicenantes were identified frequently in fresh water, marine, and hydrocarbon-impacted environments, leading researchers to speculate on their role in these environments (Farag et al., 2014). Limited metabolic reconstruction of an Aminicenantes genome recovered from an acetate-contaminated aquifer (Rifle, Colorado, U.S.A.) revealed that it contained several glycosyl hydrolases (Sharon et al., 2015), but no investigation of the function of those genes was conducted. It was concluded that the Rifle Creek Aminicenantes may degrade carbon through either fermentation or aerobic respiration based on the presence of genes involved in aerobic respiration (respiratory Complex I, II, and III). This microorganism is also proposed to participate in hydrogen metabolism and assimilatory sulfite reduction. Analysis of a separate Aminicenantes genome recovered from Sakinaw Lake (Canada) revealed a partial set of genes for the Wood-Ljungdahl pathway (Gies et al., 2014). The authors speculated that the Sakinaw Lake Aminicenantes is capable of using this pathway in reverse to consume acetate and generate CO2 in syntrophic association with a hydrogenotrophic methanogen.
In contrast to these previous findings, the population genome of Aminicenantes-PK28 does not indicate that it has the ability to perform aerobic respiration or produce CO2 via the Wood-Ljungdahl pathway. Instead, Aminicenantes-PK28 appeared to be capable only of anaerobic carbohydrate and amino acid fermentation, producing CO2 through the oxidation of formate. Interestingly, a broad range of peptidase families was identified in Aminicenantes-PK28, suggesting that amino acid fermentation may be a key feature of its metabolism. For example, peptidases from family M23, which are capable of lysing the cell walls of other bacteria, were identified in Aminicenantes-PK28 and may indicate that this microorganism acts as a scavenger of dead cells in CBM formation waters.
In this study, we have presented evidence that specific additives within the hydraulic fracture fluid are responsible for a major shift in community composition, one that favors the enrichment of microorganisms from the rare biosphere that are able to utilize galactomannan. The observed enrichment of novel representatives of the class Phycisphaerae and candidate phylum Aminicenantes may also be coupled to their ability to work in syntrophic association with hydrogenotrophic methanogens and to the introduction of specific biocides into the well. It is possible that Aminicenantes-PK28 and Phycisphaerae-PK28 are resistant to the Kathon biocide used in PK-28. Their resistance may result from the lack of genes known to be targeted by this biocide. Although both microorganisms possess 2-oxoglutarate dehydrogenase, other pathways may be used to accommodate its inhibition. For example, aspartate transaminase could be used to generate oxaloacetate for use in the TCA cycle. In addition, the unique cell wall structure of members of the phylum Planctomycetes (Fuerst and Sagulenko, 2011; Devos, 2014) may confer resistance to membrane disruption by Kathon. However, it is also possible, and perhaps more likely, that neither microorganism is biocide resistant and that both simply recolonized the seam after the Kathon had degraded or dispersed to a low concentration. In this case, the microorganisms best able to utilize guar as a carbon substrate would recolonize most quickly.
In addition to PK-28, wells WP-3 and AG-13 also appeared to cluster away from the other wells (Figure 3), indicating that they harbor atypical microbial communities compared to the rest of the Surat Basin. Neither of these wells was subjected to hydrofracture stimulation and neither showed enrichment in either Phycisphaerae or Aminicenantes lineages. Instead, both wells showed enrichment in thermophilic members of the bacterial family Thermodesulfovibrio and the methanogenic archaeal genus Methanothermobacter, as well as members of the class OPB41 from the Actinobacteria. The observed enrichment in Thermodesulfovibrio and Methanothermobacter in wells with considerably higher than average temperatures (>40 °C) is consistent with the optimum growth range of these lineages (Henry et al., 1994; Wasserfallen et al., 2000). Therefore, it is likely that these wells are atypical because their temperature is conducive to the enrichment of thermophiles. Additionally, WP-3 displayed a number of geochemical parameters, such as pH, conductivity, and total iron, that could be responsible for the observed microbial community shift. Additional basin-wide surveys are needed to identify the geochemical factors that govern CBM microbial community structure.
This study provides a basis for understanding how specific additives commonly used in hydraulic fracture fluid may alter CBM microbial communities. However, it is important to note that the findings of this study are specific to the set of additives used and may not be applicable to all CBM wells. Further, only one hydrofractured well was available for sampling. Therefore, examination of additional hydraulically fractured CBM production wells will be necessary to confirm these findings and to determine how the microbial community is affected under different stimulation scenarios. A longitudinal study is also warranted to document community composition before hydraulic fracture stimulation and for several months afterward, to determine whether the community is capable of returning to an unperturbed state.

AUTHOR CONTRIBUTIONS
SR, PE, DP, SG, and GT all contributed to the study design and helped to draft the manuscript. PE participated in sample collection and contributed to the analysis of the data. DP contributed to the bioinformatic analysis of this work, particularly in the area of population genome binning. All authors approved the final manuscript.