Alkane hydroxylase gene (alkB) phylotype composition and diversity in northern Gulf of Mexico bacterioplankton

Natural and anthropogenic activities introduce alkanes into marine systems where they are degraded by alkane hydroxylases expressed by phylogenetically diverse bacteria. Partial sequences for alkB, one of the structural genes of alkane hydroxylase, have been used to assess the composition of alkane-degrading communities, and to determine their responses to hydrocarbon inputs. We present here the first spatially extensive analysis of alkB in bacterioplankton of the northern Gulf of Mexico (nGoM), a region that experiences numerous hydrocarbon inputs. We have analyzed 401 partial alkB gene sequences amplified from genomic extracts collected during March 2010 from 17 water column samples that included surface waters and bathypelagic depths. Previous analyses of 16S rRNA gene sequences for these and related samples have shown that nGoM bacterial community composition and structure stratify strongly with depth, with distinctly different communities above and below 100 m. Although we hypothesized that alkB gene sequences would exhibit a similar pattern, PCA analyses of operational protein units (OPU) indicated that community composition did not vary consistently with depth or other major physical-chemical variables. We observed 22 distinct OPUs, one of which was ubiquitous and accounted for 57% of all sequences. This OPU clustered with AlkB sequences from known hydrocarbon oxidizers (e.g., Alcanivorax and Marinobacter). Some OPUs could not be associated with known alkane degraders, however, and perhaps represent novel hydrocarbon-oxidizing populations or genes. These results indicate that the capacity for alkane hydrolysis occurs widely in the nGoM, but that alkane degrader diversity varies substantially among sites and responds differently than bulk communities to physical-chemical variables.

Several different enzyme systems that vary in substrate chain length and reaction mechanism initiate bacterial n-alkane catabolism (van Beilen and Funhoff, 2007). The particulate (or membrane-associated) non-heme iron alkane hydroxylases (alkane 1-monooxygenases) oxidize substrates with chain lengths ≥C 5 -C 16 . These "alkB" hydroxylases are widely distributed among bacteria (Vomberg and Klinner, 2000;van Beilen et al., 2003;Liu and Shao, 2005;Liu et al., 2007;van Beilen and Funhoff, 2007;Wasmund et al., 2009). They are encoded by three genes, alkB for the catalytically active alkane hydroxylase, and alkG and alkT for rubredoxin and rubredoxin reductase, respectively, (Cappelletti et al., 2011). Though they are variable overall, alkB gene sequences contain sufficient conservation for the design of broad spectrum PCR primers, which yield amplicons that contain diagnostic histidine motifs (Kloos et al., 2006). AlkB sequence conservation has been exploited in a variety of molecular ecological studies to assess the distribution and diversity of alkane degraders in hydrocarbon-contaminated soils and sediments (van Beilen et al., 2003;Harayama et al., 2004;Kloos et al., 2006;van Beilen and Funhoff, 2007). However, surprisingly few studies have explored alkane degraders in marine systems (Wasmund et al., 2009;Wang et al., 2010a). Wasmund et al. (2009) analyzed alkB diversity in genomic extracts obtained from hydrocarbon seep-associated sediments in the Timor Sea. They observed numerous novel sequences, many of which were related to, but distinct from known alkane oxidizers within the γ-Proteobacteria and Actinobacteria. Diversity was greater in sediments from shallower depths (<100 m) than deeper depths (>400 m), and alkB gene copy numbers were elevated in sediments nearest hydrocarbon seeps. Guibert et al. (2012) analyzed alkane degraders in intertidal and shallow sub-Antarctic coastal sediments, and like Wasmund et al. (2009) observed novel phylotypes that appeared to represent a temperature-selected community. In addition, they identified alkB phylotypes that were proposed as biomarkers for Antarctic alkane degradation. In contrast, Païssé et al. (2011) found no clear relationship between alkB expression and hydrocarbon contamination in sediments from coastal Berre lagoon that were chronically polluted by hydrocarbons. However, this study only investigated polluted sediments, so its relevance for unpolluted systems is uncertain.
Thus far, analyses of alkane degraders in the water column have mostly involved culture-based studies supplemented with determinations of isolate alkB sequences (e.g., Wang et al., 2010a;Choi and Cho, 2013), although Wang et al. (2010b) also showed that alkB gene abundance ranged from 3 × 10 3 l −1 to 3 × 10 5 l −1 in surface waters around Xiamen Island. In addition, Lu et al. (2012) have used gene probes (GeoChip) to show that relative to uncontaminated waters, alkB genes were enriched in the hydrocarbon plume of the Macondo well oil spill. Lu et al. (2012) also attributed alkB sequences in the plume to various Proteobacteria (e.g., Bdellovibrio, Roseobacter, and Rhodospirillum), Firmicutes, and Actinobacteria (e.g., Gordonia and Rhodococcus), including rather enigmatically the obligate mammalian pathogens, Mycobacterium bovis and M. tuberculosis; representatives of Alcanivorax were either undetectable or present in low abundances.
While clearly informative, these studies have not included spatially extensive analyses of alkB distribution and diversity, or comparative analyses of patterns for alkB and other genetic markers, e.g., 16S rRNA genes. Thus, it is unclear whether alkane-degrading communities as defined by alkB are structured similarly to bulk bacterioplankton communities in unpolluted systems, or whether they respond to different variables. To help address this uncertainty, we have analyzed alkB gene sequences derived from clone libraries prepared from genomic extracts of bacterioplankton samples distributed across the northern Gulf of Mexico (nGoM) shelf at depths from 2 m to 1700 m. We have previously used a pyrosequencing-based analysis of 16S rRNA genes from the same and additional samples to characterize nGoM bacterioplankton diversity . Results from the latter study indicated that composition was stratified by depth, and that known alkane-degrading genera (especially members of the γ-Proteobacteria) occurred throughout the water column. Therefore, we hypothesized that patterns for alkB composition and diversity would mirror those for 16S rRNA genes, and for γ-Proteobacteria and Actinobacteria in particular.

SAMPLE COLLECTION AND alKB ANALYSIS
Bacterioplankton DNA was collected during the March 2010 R/V Cape Hatteras cruise GC-5 (30 • 07 N, 088 • 02 W to 27 • 39 N, 093 • 39 W) as described in greater detail by King et al. (2013) and Tolar et al. (2013). Genomic DNA was extracted from bacterioplankton collected by filtering about 1 liter of seawater through 0.2 μm filters. DNA extracts were used for alkB PCR in reactions containing 12.1 μL of PCR grade water, 2.5 μL 10X High Fidelity PCR buffer (Invitrogen), 0.2 μL 25 mM dNTP mixture, 1 μL 50 mM MgSO 4 , 5 μL 5X bovine serum albumin (Promega), 1.5 μL each of 10 mM stocks of forward and reverse primers, 0.2 μL Platinum Taq DNA Polymerase High Fidelity (Invitrogen), and 1 μL template DNA. The primers used were alkB-1f 5 -AAYACNGCNCAYGARCTNGGNCAYAA and alkB-1r 5 -GCRTGRTGRTCNGARTGNCGYTG (Kloos et al., 2006). The PCR program consisted of an initial denaturation step (94 • C, 3 min), followed by 26 cycles of 94 • C (1 min), 61 • C (1 min), 68 • C (45 s), with a final extension of 10 min at 68 • C. Amplicons were visualized by electrophoresis on 0.8% agarose gels. Bands of the correct size were excised from the gels, and DNA was extracted from the gel slices with a Zymoclean gel DNA recovery Kit (Zymo Research) according to the manufacturer's instructions. DNA concentrations of the cleaned reactions were determined using a Nanodrop spectrophotometer.
Cloning reactions were carried out using a CloneJET PCR cloning kit according to the manufacturer's instructions. Three (3) μL of ligation mix was added to 25 μL Genlantis SmartCells and incubated on ice for 30 min before heat shock at 42 • C for 45 s. Room temperature SOC medium (125 μL) was added and the cells were incubated at 37 • C for 1 h while shaking at 225 rpm. Cells were plated on LB plates containing ampicillin, and colonies (at least 30 per sample) were picked for screening after overnight incubation at 37 • C. Picked colonies were added to PCR reactions containing 10.5 μL PCR grade water, 12.5 μL Promega GoTaq® Green Master Mix, and 1 μL each of F pJET primer and R pJET primer from 10 μM stocks. The PCR program consisted of an initial denaturation step (95 • C, 3 min), followed by 30 cycles of 94 • C (30 s), 60 • C (30 s), 72 • C (90 s), with a final extension of 10 min at 72 • C. Amplicons were visualized by electrophoresis on a 1% agarose gel and purified with UltraClean PCR Clean-up kits (MoBio, Folsom, CA) according to the manufacturer's instructions and then sequenced bi-directionally at the LSU Genomics Facility.

DATA ANALYSIS
AlkB clone sequences and reference sequences downloaded from the FunGene alkB database were aligned with ClustalW (Larkin et al., 2007) and adjusted manually as needed. Clone sequences were screened for the presence of two diagnostic histidinecontaining motifs (HNXXHH and HSDHH) that contribute to the alkB protein active site. Sequences with both motifs were retained; sequences lacking a motif were retained if they were substantially similar to other sequences, and otherwise at least 150 residues in length. Sequences not meeting these criteria were eliminated from further consideration. After manually adjusting the alignment, a maximum likelihood phylogenetic tree was created using MEGA ver. 5.05 (Tamura et al., 2011) with 100 bootstrap replications. A distance matrix of clone sequences was prepared using Phylip (Felsenstein, 1989) as input for the Mothur platform (Schloss et al., 2009), which grouped the alkB sequences into OPUs using a distance cutoff of 0.20 (Wasmund et al., 2009;Guibert et al., 2012). Mothur was used to determine relative abundance of alkB OPUs and to compare distributions among sites. Statistical analyses of alkB data were carried out using Mothur and R (R Development Core Team., 2008).

ACCESSION NUMBERS
Sequences have been deposited in GenBank with accession numbers KF163175-KF613575.

RESULTS AND DISCUSSION
A total of 508 clone sequences were obtained from 17 samples representing 9 stations and 14 depths with approximately equal numbers of clones from each. However, after curating the sequences the total number of validated alkB gene clones was reduced to 401. Rarefaction analysis indicated OPU "discovery" for each of the libraries was saturated or nearly saturated. Samples from two sites, A6-2 m and F6-2 m, were excluded from additional statistical analyses because the numbers of validated clone sequences (8 and 2, respectively) were too few for meaningful comparisons. Although both of these sites harbored alkB-containing populations based on results from a prior survey of 16S rRNA genes , they appeared to be dominated by taxa that contained divergent genes that were nonetheless amplified with alkB primers. The resulting PCR products contained the HSDHH motif, but lacked HNXXH and otherwise differed substantially from all other validated sequences. The function of these genes is unknown, and could not be inferred from BLAST analysis.
Two additional samples from station A6 (20 m and 1700 m), and one from station D5 (2 m), did not yield alkB amplicons at all, in spite of repeated efforts. Reasons for amplification failure with these samples are unclear, since a separate study  showed that they each yielded 16S rRNA gene amplicons, including sequences assigned to known alkane-degrading genera (e.g., Alcanivorax; King et al., 2013) that were also observed in other samples based on both 16S rRNA and alkB gene sequences. Although alkB concentrations in the three negative samples might be at or below detection by our PCR protocol, the total amounts of DNA used in each of the PCR reactions was similar, and there is no a priori reason to expect substantial variation in relative alkB concentrations among samples from similar locations or depths. It also seems unlikely that phylotypes not susceptible to amplification dominated the alkB-containing communities in the negative samples. Thus, the lack of amplicons for A6-2 m, A6-1700 m and D5-2 m remains enigmatic.

OPU ABUNDANCE AND CLASSIFICATION
Sequences from the 17 positive samples were clustered into 22 OPUs using a distance cutoff of 0.20. Guibert et al. (2012) used a cutoff of 0.20 and reported 30 OPUs from 202 clones pooled from 5 libraries obtained during an analysis of coastal sub-Antarctic sediments. Wasmund et al. (2009) reported even greater richness (53 OPUs) from 246 clones from Timor Sea sediments, also pooled from 5 libraries and based on a cutoff of 0.20.
The lower AlkB OPU richness for nGoM bacterioplankton relative to Timor Sea and sub-Antarctic sediment samples (Wasmund et al., 2009;Guibert et al., 2012) is consistent with patterns observed for bacterioplankton and sediment communities as a whole, since the latter typically support greater richness than the former based on 16S rRNA gene sequence analyses (Kemp and Aller, 2004). The difference between bacterioplankton and sediment AlkB OPU richness might simply reflect differences in the number of cells extracted for analysis, since even small sediment masses can support much larger communities than the water column. In addition (or alternatively), lower habitat diversity in the water column might select for fewer hydrocarbon-oxidizing phylotypes. Whether or not these differences in richness affect responses to hydrocarbon inputs is uncertain.
An additional 85 clone sequences that formed 17 OPUs were distributed among polyphyletic clades comprised of sequences from Actinobacteria, Gammaproteobacteria, and Firmicutes (Figure 1). These 17 OPUs appeared to represent novel phylotypes, some of which were associated with, but distinct from, novel phylotypes from the Timor Sea (Wasmund et al., 2009). However, the lack of congruence between 16S rRNA and alkB gene phylogenies (Jurelevicius et al., 2013), and the apparent mobility of alkB (van Beilen et al., 2001(van Beilen et al., , 2003(van Beilen et al., , 2005Smits et al., 2002;Wang et al., 2010b), precluded more specific inferences about OPU affiliations. Irrespective of their phylogeny, however, these unclassified OPUs have not been observed in other AlkB clone libraries (e.g., Wasmund et al., 2009;Guibert et al., 2012). This suggests that the nGoM supports a few widely distributed and dominant alkane degraders along with less abundant, more geographically constrained populations.
A relatively small number of clones (49, 12.2% of the total) forming 3 OPUs (4, 5 and 13) could not be assigned to phyla even tentatively (Figure 1). Sequences of these clones contained the signature AlkB histidine motifs, but their divergence might indicate altered substrate ranges relative to other AlkB proteins, or even different functions. Regardless, these OPUs appear unique to the nGoM.
Of the classified AlkB sequences identified in this study, the most abundant were associated with Gammaproteobacteria, and an Alcanivorax-Marinobacter OPU in particular (Figures 1, 2;

Frontiers in Microbiology | Aquatic Microbiology
December 2013 | Volume 4 | Article 370 | 4 FIGURE 2 | Phylum level alkB composition by sample. Blue represents Proteobacteria, red represents Actinobacteria, and green represents unclassified. Table 1). Gammaproteobacterial AlkB has also been reported as the dominant phylotype in analyses of Timor Sea sediments and surface waters near Xiamen Island (Wasmund et al., 2009;Wang et al., 2010b). In addition, Hazen et al. (2010) noted that hydrocarbon-oxidizing Gammaproteobacteria, albeit not Alcanivorax, dominated microbial communities in the Macondo well oil plume. Thus, Gammaproteobacteria appear to dominate the hydrocarbon oxidizers in many natural marine systems as well as those exposed to chronic or acute hydrocarbon inputs. In contrast, soils appear to harbor more diverse alkanedegrading communities, with a predominance of Actinobacteria (e.g., Jurelevicius et al., 2013).

OPU DISTRIBUTION
Alcanivorax-like AlkB sequences in the dominant OPU 1 were detected in all samples but A6-700 m, even though this sample contained Alcanivorax 16S rRNA based on results from a previous analysis  and Figure 3). At this site, Alcanivorax alkB genes might have been at or near detection limits for our PCR conditions, since 16S rRNA gene sequences attributed to Alcanivorax accounted for only 0.07% of the total reads. In contrast, Alcanivorax-like alkB gene sequences were observed in 7 samples (A6-2 m, D5-50 m, D5-100 m, D5-450 m, E6-2 m, F6-2 m, and MR1-2 m) where Alcanivorax 16S rRNA genes were not detected (Figure 3). The lack of concordance between Alcanivorax 16S rRNA and Alcanivorax-like alkB gene sequences at these sites might be explained by the presence of Alcanivorax-like alkB genes in taxa other than Alcanivorax (e.g., Marinobacter). More generally, there was no significant correlation between the relative abundance of Alcanivorax-like alkB sequences and the relative abundance of Alcanivorax 16S rRNA gene sequences (Figure 3). PCA analysis showed that the composition of AlkB OPU assemblages did not vary consistently with depth or sampling station (Figure 4A), since there were no coherent clusters with separation on axes 1 or 2. These results were consistent with inferences from a similarity plot constructed using the structure-based θ YC calculator (SI Figure 1), which also indicated that AlkB OPUs were not clustered by location or depth. Similarly, results from a canonical correlation analysis (CCorA) revealed no consistent relationships among OPUs and depth (SI Figure 2), or several other physical-chemical variables available for the samples (e.g., salinity, fluorescence, beam attenuation (a measure of particle density), dissolved oxygen, pH and temperature). Removing the most abundant OPUs from the PCA analysis (OPU1 and OPU3) resulted in a cluster of shallow water samples (≤100 m depth) that included one deeper water sample (A6-350 m); the remaining deep-water samples were dispersed across both axes (SI Figure 3). This suggests that the less abundant OPUs in surface waters might form similar assemblages across sites, but that variability in the more abundant OPUs obscures patterns.
Results similar to those for a PCA with all AlkB OPUs were obtained from a PCA analysis of the relative abundances in each sample of 16S rRNA gene sequences for Alcanivorax, Marinobacter, Pseudomonas, Hydrocarboniphaga, and Kordiimonas, the primary alkane-oxidizing reference taxa with which some of the clone sequences were identified (and SI Figure 4 and King et al., 2013). The distribution of these taxa did not vary consistently with depth. In contrast, PCA analysis of 16S rRNA gene sequences for the bulk bacterioplankton communities in these same samples showed that they separated distinctly by depth ( Figure 4B) as previously reported for a larger set of nGoM samples Tolar et al., 2013).
These results collectively indicate that the composition of nGoM alkane-degrading communities varies in response to as yet unidentified biological or abiological factors that do not change  King et al., 2013). Samples from depths >100 m, ≤100 m and the Mississippi River plume are indicated by blue, black, and red, respectively. Composition data were analyzed after an arcsine transformation.
consistently with sample depth. This pattern differs from that for bulk bacterioplankton, the composition of which appears to be governed by depth-dependent variables Tolar et al., 2013). In particular, the relative abundance of nGoM Gammaproteobacteria, which constitute the majority of classifiable alkane degraders based on 16S rRNA gene and alkB sequences, depends strongly on depth . However, a greater relative abundance of Gammaproteobacteria does not necessarily imply a greater abundance of alkane degraders, since there was limited, albeit significant, positive correlation between the two (r = 0.518, p = 0.033; SI Figure 5). For example, Gammaproteobacteria at D5-900 m constituted >40% of the total bacterial community , yet only 0.1% of this community was identified as alkane degraders. In contrast, samples from A6-160 m, A6-350 m, A6-700 m, B5-450 m, and H6-45 m harbored greater relative abundances of alkane degraders, but lower relative abundances of Gammaproteobacteria .

INDICES OF AlkB DIVERSITY
Statistical measures of diversity (e.g., Chao1, ACE, Shannon and Inverse Simpson's indices, and evenness indices) also did not vary consistently with depth or sampling station (Figure 5, SI Table 1). Values for the Shannon index, for example, fell between 0.0 (MR1-2 m, 1 OPU) and 1.79, but there were no systematic differences among samples. Likewise, there were no systematic differences among samples for other diversity indices. In this respect, spatial trends for AlkB diversity were comparable to those for bulk nGoM bacterioplankton based on 16S rRNA gene sequences; diversity metrics for the latter also did not vary with depth or geographic location on the shelf . The absence of spatial patterns in nGoM alkB gene sequence diversity suggests that at least during the sampling period (March, 2010), assembly of alkane-degrading communities depended on a common mechanism (e.g., neutral assembly; Emerson and Gillespie, 2008) irrespective of local conditions. A different pattern might emerge, however, during summer stratification and the establishment of hypoxia in surface waters. Changes in substrate availability and quality, temperature and dissolved oxygen could enrich specific members of surface but not deeper communities resulting in more distinct spatial patterns. Data from the Macondo Well oil plume also indicate that the introduction of hydrocarbons from spills or natural sources can create bloom conditions that alter both richness (Hazen et al., 2010;Valentine et al., 2010) and evenness (Lu et al., 2012;Rivers et al., 2013).

SUMMARY AND CONCLUSIONS
In summary, results from alkB gene sequence analyses show that community structure (composition, richness and diversity) of nGoM alkane-degrading bacteria varies among sites independently of depth and location. The absence of vertical structure contrasts with distinct patterns observed for bulk bacterioplankton communities, but is consistent with the distribution of alkane-degrading genera identified by 16S rRNA gene sequences. Two OPUs, one including Alcanivorax borkumensis-like AlkB sequences and a second comprised of Marinobacter-like AlkB sequences, accounted for nearly two-thirds of all sequences; the former was widely distributed. In contrast to marine sediment AlkB, nGoM bacterioplankton AlkB appeared OPU-depauperate, and only a small percentage of sequences were not identifiable as Proteobacteria or Actinobacteria. Inconsistencies between AlkB distribution and the distribution of 16S rRNA gene sequences attributable to alkane degraders suggest that both markers may be needed to assess the composition and structure of hydrocarbonoxidizing communities.

SUPPLEMENTARY MATERIAL
The Samples from depths >100 m, ≤100 m and the Mississippi River plume are indicated by blue, black, and red, respectively. The alkane-degrading genera included Alcanivorax, Marinobacter, Pseudomonas, Hydrocarbonophaga, and Kordiimonas. Composition data were analyzed after an arcsine transformation.

SI Figure 5 | Relative abundance of gammaproteobacterial
alkane-degrading genera identified on the basis of 16S rRNA gene sequences for the sites used in this study as a function of the relative abundance of all gammaproteobacteria in the same samples. Data from King et al. (2013).