Analysis of Composition and Structure of Coastal to Mesopelagic Bacterioplankton Communities in the Northern Gulf of Mexico

16S rRNA gene amplicons were pyrosequenced to assess bacterioplankton community composition, diversity, and phylogenetic community structure for 17 stations in the northern Gulf of Mexico (nGoM) sampled in March 2010. Statistical analyses showed that samples from depths ≤100 m differed distinctly from deeper samples. SAR11 α-Proteobacteria and Bacteroidetes dominated communities at depths ≤100 m, which were characterized by high α-Proteobacteria/γ-Proteobacteria ratios (α/γ > 1.7). Thaumarchaeota, Firmicutes, and δ-Proteobacteria were relatively abundant in deeper waters, and α/γ ratios were low (<1). Canonical correlation analysis indicated that δ- and γ-Proteobacteria, Thaumarchaeota, and Firmicutes correlated positively with depth; α-Proteobacteria and Bacteroidetes correlated positively with temperature and dissolved oxygen; Actinobacteria, β-Proteobacteria, and Verrucomicrobia correlated positively with a measure of suspended particles. Diversity indices did not vary with depth or other factors, which indicated that richness and evenness elements of bacterioplankton communities might develop independently of nGoM physical-chemical variables. Phylogenetic community structure as measured by the net relatedness (NRI) and nearest taxon (NTI) indices also did not vary with depth. NRI values indicated that most of the communities were composed of OTUs more distantly related to each other in whole community comparisons than expected by chance. NTI values derived from phylogenetic distances of the closest neighbor for each OTU in a given community indicated that OTUs tended to occur in clusters to a greater extent than expected by chance. This indicates that “habitat filtering” might play an important role in nGoM bacterioplankton species assembly, and that such filtering occurs throughout the water column.


INTRODUCTION
The northern Gulf of Mexico (nGoM) supports some of the most economically and ecologically valuable marine and coastal ecosystems in North America (Rabalais et al., 1996). A number of process-based studies (mostly within the Mississippi River plume) have analyzed bacterial numbers, respiration rates, production rates, and nitrogen transformations (e.g., Chin-Leo and Benner, 1992; Pakulski et al., 1995; Pomeroy et al., 1995; Jochem, 2001, 2003; Liu et al., 2004, 2009; Malmstrom et al., 2004; Hewson et al., 2006), and shown that bacteria form critical linkages within the food webs of these systems (Dagg et al., 2006). Carbon and nitrogen transformations are particularly important, not only because of their contributions to the "microbial loop," but because they affect the development and persistence of an extensive oxygen-depleted or hypoxic zone (Pakulski et al., 1995; Chen et al., 2001; Dagg et al., 2006). This zone, which develops seasonally and threatens the integrity of both planktonic and benthic systems on the nGoM shelf, depends on elevated primary production supported by riverine nitrogen inputs, and high bacterial respiration rates that reduce oxygen concentrations.
Additional studies have shown that nGoM bacterioplankton mediate the impacts of acute and chronic anthropogenic disturbances to nGoM waters, including nitrogen loading and aromatic and aliphatic hydrocarbon inputs from a variety of sources (Hall et al., 2008; Hazen et al., 2010; Senn et al., 2010; Valentine et al., 2010; Edwards et al., 2011; Kessler et al., 2011). Recent data derived from assessments of the Deepwater Horizon (DWH) oil spill have shown that bacterioplankton within an oil and gas plume at depths of about 1000-1200 m responded rapidly to hydrocarbon inputs, were dominated by γ-Proteobacteria, included taxa most closely related to genera known for hydrocarbon degradation, and were distinct from non-plume bacterioplankton.
Nonetheless, surprisingly little is known about nGoM bacterioplankton composition and structure. Jones et al. (2010) have shown that Proteobacteria dominate bacterioplankton in nearshore waters of the west Florida shelf, and that community compositions change during blooms of Karenia brevis. Olapade (2010) also identified α-Proteobacteria as dominant members of nGoM bacterioplankton, but his sites were all in shallow nearshore waters, and only one (from Carrabelle) was characterized by seawater salinities. However, this site cannot be used to extrapolate to the nGoM broadly since it is adjacent to a developed beach (Olapade, 2010).
In an effort to characterize nGoM bacterioplankton communities, we have analyzed 44 samples obtained during mid-March 2010 at 17 stations distributed along a longitudinal gradient with transects from inshore to offshore, including the Mississippi River plume, and depths from 2 to 1700 m. Results showed that α- and γ-Proteobacteria, Bacteroidetes, and Actinobacteria dominated surface (≤100 m) communities, with minor contributions from Planctomycetes and Verrucomicrobia. α- and γ-Proteobacteria also dominated deeper communities (>100 m), but these communities included relatively large fractions of Thaumarchaeota. Although members of the SAR11 clade were abundant in all but a Mississippi River plume surface sample, the distribution of OTUs identified at an evolutionary distance of 0.03 varied substantially among sites, with relatively few taxa shared widely.

SAMPLE COLLECTION AND DNA EXTRACTION
Samples were collected in March 2010 during R/V Cape Hatteras cruise GC-5 as described by Tolar et al. (in revision; see also Figure A1 in Appendix for station locations and Table A1 in Appendix for selected sample data). Samples were obtained using Niskin bottles and a General Oceanics rosette sampling system equipped with a SBE25 CTD data package. They were pressure filtered through 0.22 µm Durapore filters (about 1 L at ∼60 kPa; Millipore, Inc., New Bedford, MA, USA). Filters were frozen at −20˚C in 2 mL of lysis buffer (0.75 M sucrose, 40 mM EDTA, 50 mM Tris; pH 8.3). After enzymatic hydrolysis (lysozyme and proteinase K with sodium dodecyl sulfate), DNA in 800 µL of lysate was purified by phenol-chloroform extraction. Purified DNA was stored in 50 µL of Tris-EDTA buffer (pH 8) at −80˚C (Bano and Hollibaugh, 2002).

PCR AND PYROSEQUENCING
Triplicate PCR reactions for each sample consisted of 12 µL of PCR grade water, 2.5 µL 10× High Fidelity PCR Buffer (Invitrogen), 0.75 µL 25 mM dNTP mix, 1 µL 50 mM MgSO4, 5 µL 5× Bovine Serum Albumin (Promega), 1.5 µL each of forward and reverse primers (10 mM stocks), 0.2 µL Platinum Taq DNA Polymerase High Fidelity (Invitrogen), and 0.5 µL template DNA. Primers 515F and 806R were used to amplify Bacteria and Archaea 16S rRNA genes (Liu et al., 2007; Jones et al., 2009; Bergmann et al., 2011). Primer 515F included the Roche 454-B pyrosequencing adapter and a GT linker. Primer 806R included the Roche 454-A sequencing adapter, a 12-bp unique barcode (Hamady et al., 2010), and a GG linker. The PCR program consisted of an initial denaturation step (94˚C; 3 min), followed by 26 cycles of 94˚C (1 min), 54˚C (1 min), 68˚C (2 min), with a final extension of 10 min at 68˚C. Amplicons were visualized by electrophoresis on a 1% agarose gel. Products from triplicate reactions were pooled and purified with UltraClean PCR Clean-up kits (MoBio; Folsom, CA, USA) according to the manufacturer's instructions. After quantifying DNA concentrations of the cleaned reactions using a Nanodrop spectrophotometer, equal masses of PCR product for each sample were combined and shipped to the Environmental Genomics Core Facility at the University of South Carolina, where pyrosequencing was performed on a Roche 454 automated sequencer with titanium chemistry.
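As a bookkeeping aid, the reaction recipe and thermal program above can be written down as data and summed. The sketch below is not part of the published protocol: the function names are hypothetical, ramp times between temperatures are ignored, and the master-mix helper assumes that template DNA is added to each tube separately.

```python
# Per-reaction volumes in microliters, copied from the recipe in the text.
REACTION_MIX_UL = {
    "PCR-grade water": 12.0,
    "10x High Fidelity PCR Buffer": 2.5,
    "25 mM dNTP mix": 0.75,
    "50 mM MgSO4": 1.0,
    "5x Bovine Serum Albumin": 5.0,
    "forward primer (515F)": 1.5,
    "reverse primer (806R)": 1.5,
    "Platinum Taq DNA Polymerase High Fidelity": 0.2,
    "template DNA": 0.5,
}

# Thermal program as (temperature in deg C, hold time in minutes).
INITIAL_DENATURATION = (94, 3)
CYCLE_STEPS = [(94, 1), (54, 1), (68, 2)]
N_CYCLES = 26
FINAL_EXTENSION = (68, 10)


def total_volume_ul(mix):
    """Sum of the component volumes for a single reaction."""
    return round(sum(mix.values()), 2)


def programmed_hold_minutes():
    """Total programmed hold time; ramping between steps is not counted."""
    per_cycle = sum(minutes for _, minutes in CYCLE_STEPS)
    return INITIAL_DENATURATION[1] + N_CYCLES * per_cycle + FINAL_EXTENSION[1]


def master_mix_ul(mix, n_reactions, exclude=("template DNA",)):
    """Scale shared components for n_reactions (assumption: template added per tube)."""
    return {name: round(volume * n_reactions, 2)
            for name, volume in mix.items() if name not in exclude}


if __name__ == "__main__":
    print("volume per reaction (uL):", total_volume_ul(REACTION_MIX_UL))
    print("programmed hold time (min):", programmed_hold_minutes())
    print("master mix for triplicates:", master_mix_ul(REACTION_MIX_UL, 3))
```

Summing the listed components gives a reaction volume just under 25 µL, and the programmed holds add up to a little under 2 h per run.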

DATA ANALYSIS
Sequences were analyzed using PANGEA (Giongo et al., 2010) and Mothur (Schloss et al., 2009) pipelines. After trimming the initial set of reads (435,290 sequences), PANGEA produced 239,983 sequences (>150 bp and ≥20 quality score) with identifiable barcodes distributed among the 44 samples (minimum sequence number per sample = 764, maximum = 9154; mean = 5454, SE = 362). A total of 38,045 reads could not be associated with a barcode. PANGEA identified the phylogenetic affiliations of the reads using MEGABLAST with a database of 170,273 Bacteria and Archaea isolate 16S rRNA sequences obtained from the Ribosomal Database Project. PANGEA clustered unidentified sequences for each sample using CD-HIT and threshold values (D) of 0.80, 0.90, 0.95, 0.97, and 0.99. OTUs comprised of sequences that could be identified with MEGABLAST were clustered similarly for each sample, and these clusters were combined with unidentified OTUs to produce the final composition for each sample. Spatial patterns in the resulting sample compositions, including OTU abundances, were analyzed using multivariate statistics [e.g., principal components analysis (PCA), non-metric multidimensional scaling, and canonical correlation analysis (CCorA)] with XLSTAT after excluding cyanobacterial and mitochondrial sequences, and removing OTUs represented by singletons within the 44-sample set.
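The final filtering step described above (excluding unwanted lineages and dropping singleton OTUs before multivariate analysis) can be sketched as follows. This is an illustration only, not the PANGEA or XLSTAT procedure: the table layout and function names are hypothetical, and a "singleton" is assumed to mean an OTU whose total count across the whole sample set is 1.

```python
def filter_otu_table(otu_table, excluded=frozenset()):
    """Drop excluded OTUs and OTUs seen only once across all samples.

    otu_table: {sample_name: {otu_id: read_count}}
    excluded:  OTU ids to remove outright, e.g. those classified as
               cyanobacterial or mitochondrial (assumed to be known already).
    """
    totals = {}
    for counts in otu_table.values():
        for otu, n in counts.items():
            totals[otu] = totals.get(otu, 0) + n
    keep = {otu for otu, n in totals.items() if n > 1 and otu not in excluded}
    return {
        sample: {otu: n for otu, n in counts.items() if otu in keep and n > 0}
        for sample, counts in otu_table.items()
    }


def relative_abundances(counts):
    """Convert the read counts of one sample to fractions of its total."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {otu: n / total for otu, n in counts.items()}


if __name__ == "__main__":
    # Made-up counts for two samples, for illustration only.
    table = {
        "s1": {"otu_a": 5, "otu_b": 1, "otu_d": 1, "otu_cyano": 4},
        "s2": {"otu_a": 2, "otu_c": 1, "otu_d": 1, "otu_cyano": 3},
    }
    filtered = filter_otu_table(table, excluded={"otu_cyano"})
    print(filtered)
    print(relative_abundances(filtered["s1"]))
```

Relative abundances are computed after filtering, so the fractions for each sample sum to 1 over the retained OTUs.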
Sequences were also processed using the Mothur platform (Schloss et al., 2009). After trimming barcodes and primers, and filtering for quality, sequences ≥150 bp were aligned to the Greengenes ProkMSA aligned database. Chimeric sequences were identified with the Chimera Slayer algorithm from the Broad Institute and removed. The pre.cluster command was used to minimize errors introduced during pyrosequencing. Cyanobacterial sequences were excluded from analysis. A total of 109,867 sequences comprised of 15,071 unique sequences were included in the analysis. For OTU-based analyses of sample composition, the "average neighbor" clustering algorithm was used to group sequences at a similarity level of 97%. The taxonomy function in Mothur was used with the Ribosomal Database Project (RDP) training set to identify each OTU for phylotype-based analyses. Because the RDP database used for analysis lacked Thaumarchaeota sequences, and because this phylum is highly represented in the Gulf of Mexico (Tolar et al., in revision), representative sequences for all OTUs that were identified as Archaea at a distance of 0.03 were analyzed with BLAST against the Greengenes prokMSA database to verify their identity. We also used the NCBI database to conduct a manual BLAST analysis of the representative sequences for the most abundant OTUs at a distance of 0.03 to verify identities, and to assess membership within SAR11, SAR86 and SAR92, and SAR324 clades for α-, γ-, and δ-Proteobacteria, respectively. These results were further verified by analyzing representative sequences in the ARB-SILVA database using the classify function. Mothur was also used to generate diversity indices based on samples with normalized numbers of sequences, to select specific phylogenetic groups for further analysis, and to provide input for Fast UniFrac (Hamady et al., 2010), which was used to assess spatial patterns in bacterioplankton communities.
The phylogenetic structure of nGoM bacterioplankton was also analyzed using Phylocom (Webb et al., 2002) to calculate the "net relatedness index" (NRI) and "nearest taxon index" (NTI). NRI is a standardized estimate of the mean pairwise phylogenetic distances for all pairs of OTUs in a sample compared to the mean pairwise distances of a random or null set of OTUs; NRI provides a measure of the extent of tree-wide clustering, including terminal and deep branches. NTI is a similarly standardized measure, but assesses the phylogenetic distance of each OTU in a sample to its nearest neighbor OTU; NTI provides a measure of terminal clustering independent of deeper clustering (Webb et al., 2002). Both indices were generated with 9999 randomized runs and included OTU frequencies.
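The two indices defined above can be illustrated with a minimal sketch. It is not the Phylocom implementation: it treats OTUs as present or absent (the analysis reported here included OTU frequencies), it uses one simple null model (random draws of the same number of OTUs from the pool), and the function names and toy distance matrix are hypothetical.

```python
import random
import statistics
from itertools import combinations


def mean_pairwise_distance(dist, members):
    """MPD: mean phylogenetic distance over all pairs of OTUs in the community."""
    pairs = list(combinations(members, 2))
    return sum(dist[i][j] for i, j in pairs) / len(pairs)


def mean_nearest_taxon_distance(dist, members):
    """MNTD: mean distance from each OTU to its closest relative in the community."""
    nearest = [min(dist[i][j] for j in members if j != i) for i in members]
    return sum(nearest) / len(nearest)


def standardized_index(metric, dist, members, runs=999, seed=0):
    """Return -(observed - mean(null)) / sd(null) for the chosen metric.

    With mean_pairwise_distance this is an NRI-style value; with
    mean_nearest_taxon_distance it is an NTI-style value. Positive values
    indicate clustering and negative values indicate overdispersion.
    """
    if len(members) < 2:
        raise ValueError("a community needs at least two OTUs")
    rng = random.Random(seed)
    pool = range(len(dist))
    observed = metric(dist, members)
    null = [metric(dist, rng.sample(pool, len(members))) for _ in range(runs)]
    spread = statistics.stdev(null)
    if spread == 0:
        raise ValueError("null distribution has no variance")
    return -(observed - statistics.mean(null)) / spread


def toy_distances():
    """Hypothetical 8-OTU pool: two clades of four, 0.1 within and 1.0 between."""
    return [[0.0 if i == j else (0.1 if i // 4 == j // 4 else 1.0)
             for j in range(8)] for i in range(8)]


dist = toy_distances()
community = [0, 1, 4, 5]  # two pairs of close relatives, one pair per clade
nri = standardized_index(mean_pairwise_distance, dist, community)
nti = standardized_index(mean_nearest_taxon_distance, dist, community)
```

In this toy pool, a community built from pairs of close relatives spread across both clades yields a negative NRI-style value together with a positive NTI-style value: members are distant from one another tree-wide, yet each has a close neighbor at the tips.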
Sequence data have been deposited with MG-RAST (metagenomics.anl.gov) at accession numbers 4509220.3-4509263.3. Metadata are available via the project page, "Analysis of composition and structure of coastal to mesopelagic bacterioplankton communities in the nGoM."
Although the relative abundances of major phyla and sub-phyla (or classes) changed with depth, members of the ubiquitous SAR11 clade dominated the α-Proteobacteria throughout the water column, accounting for about 69% of all α-Proteobacteria sequences. Representatives of Alteromonas and Pseudoalteromonas dominated the γ-Proteobacteria, with contributions of 31 and 6%, respectively, by members of the widely distributed SAR86 and SAR92 clades. These groups did not vary consistently with depth or geographic location.
Several other groups exhibited consistent variations in relative abundance with depth. Relative abundances of δ-Proteobacteria, Bacilli, and Clostridia were low in surface waters, but increased below 100 m (Figures 3A,B). In spite of their anaerobic character, δ-Proteobacteria and Clostridia represented ∼2-4 and 1-6%, respectively, of the sequences from oxic deep-water samples. Most of the sequences affiliated with δ-Proteobacteria (about 66%) belonged to the SAR324 clade. In addition, Thaumarchaeota increased in relative abundance below 100 m (Figure 3C). Thaumarchaeota accounted for ∼10-25% of deep-water sequences in general, and 27% of sequences at a 1700-m deep site; they also constituted a relatively constant fraction of the Archaea (about 75%, Figure A2 in Appendix), with unclassified Archaea and a small percentage of Euryarchaeota accounting for the remainder.
Thaumarchaeotal sequences were comprised of two clades: one contained sequences that were most closely related to a shallow water/sediment group of Nitrosopumilales, while the second consisted of sequences most closely related to a deep-water group of Cenarchaeales. The former accounted for most (80-100%) of the Thaumarchaeota sequences in surface waters, but the contribution of this clade decreased linearly (r² = 0.634, p < 0.0001) with increasing depth below 110 m, reaching a minimum of 29% at 1700 m (Figure A3 in Appendix).
The distributions of the 10 most abundant OTUs (defined at an evolutionary distance = 0.03), which collectively accounted for >44% of the sequences analyzed by the Mothur pipeline, were consistent with distributions at phylum and class levels (e.g., Figure 4A). An OTU identified as Candidatus Pelagibacter ubique (SAR11 clade, α-Proteobacteria) occurred in all samples, and dominated the bacterioplankton at depths ≤110 m; however, at greater depths, its relative abundance declined markedly. Similar profiles were observed for other α-Proteobacteria OTUs (e.g., Rhodobacteraceae), and for an OTU identified as Yeosuana (Bacteroidetes). In contrast, OTUs identified as γ-Proteobacteria (e.g., Alteromonas, Pseudoalteromonas, and a representative of SAR86) and Thaumarchaeota increased in relative abundance below 100 m, although their distribution was variable (Figure 4B).
Principal component analyses based on UniFrac indices (Figure 5) and composition (Figure A4 in Appendix) revealed two sample clusters distinguished on the first PCA axis. These two clusters were comprised of samples from depths ≤100 and >100 m, respectively, and were consistent with trends observed in depth profiles of bacterioplankton composition (e.g., Figures 1, 3, and 4). Within each of these clusters, samples were not further differentiated based on depth or location, with the exception of station MR1-2 m, which consistently differed from all other samples. A canonical correlation analysis indicated that Clostridia, δ- and γ-Proteobacteria, and Thaumarchaeota were positively correlated with depth, and α-Proteobacteria were inversely correlated with depth (Figure 6). α-Proteobacteria and Bacteroidetes were positively correlated with temperature and dissolved oxygen, while Actinobacteria, β-Proteobacteria, and Verrucomicrobia were positively correlated with beam attenuation, a measure of suspended particles in the water column.
In spite of consistent differences in bacterioplankton composition between surface and deeper waters, diversity indices were not spatially structured. Neither richness indices (Figure A5 in Appendix) nor evenness and dominance indices (Shannon and inverse Simpson's) correlated significantly with depth (Figure 7A; Table A2 and Figure A5 in Appendix). Likewise, two measures of phylogenetic community structure, the NRI and NTI, did not vary consistently with depth (Figure 7B). NRI values were ≤−1.96 for most samples, which indicated that communities were significantly overdispersed. Overdispersion occurred because the OTUs of any given community were less related to each other in a community-wide phylogenetic comparison than expected for a community assembled randomly from a given pool of OTUs. In contrast, NTI values were mostly >1.96. This indicated that communities were significantly clustered. Clustering occurred because in any given community the nearest phylogenetic neighbors of each of its OTUs were more closely related than expected for a randomly assembled community (Figure 7B).
FIGURE 6 | Results from a canonical correlation analysis using relative abundances of phyla and classes (e.g., as in Figure 1) as determined by PANGEA, and salinity, pH, depth, dissolved oxygen, fluorescence (a measure of chlorophyll concentration), and beam attenuation (a measure of particle content) for each sample.
The controls of nGoM α-Proteobacteria distribution are uncertain, as are reasons for the increased relative abundance of γ-Proteobacteria with depth. Multiple linear regression analyses show that neither depth, temperature, dissolved oxygen, pH, fluorescence (a measure of chlorophyll concentration), nor particle concentration (beam attenuation) individually accounts for α-Proteobacteria variability. However, the interaction of temperature and oxygen explains >70% of this variability (r² = 0.760, p < 0.0001). While this suggests that α-Proteobacteria might be sensitive to the combined effects of several variables that affect metabolic activity, factors other than or in addition to those included here likely play important roles.
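To make the notion of an interaction term concrete, the sketch below forms the product of temperature and dissolved oxygen and computes r² for a one-predictor least-squares fit. It is an assumption that the interaction was modeled as a simple product; the regression software and exact model behind the values reported above are not specified here, and the numbers in the example are invented for illustration rather than taken from the nGoM data.

```python
def interaction_term(temperature, oxygen):
    """Element-wise product, the usual form of a two-variable interaction."""
    return [t * o for t, o in zip(temperature, oxygen)]


def r_squared(x, y):
    """Coefficient of determination for the least-squares line y = b0 + b1 * x."""
    n = len(x)
    if n != len(y) or n < 3:
        raise ValueError("x and y need equal length and at least three points")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("x and y must both vary")
    return (sxy * sxy) / (sxx * syy)


if __name__ == "__main__":
    # Invented values: temperature (deg C), oxygen (arbitrary units),
    # and a response constructed to depend only on their product.
    temperature = [20.0, 18.0, 10.0, 5.0]
    oxygen = [200.0, 190.0, 150.0, 180.0]
    product = interaction_term(temperature, oxygen)
    response = [0.01 * p + 5.0 for p in product]
    print("r2, temperature alone:", r_squared(temperature, response))
    print("r2, oxygen alone:", r_squared(oxygen, response))
    print("r2, temperature x oxygen:", r_squared(product, response))
```

In this constructed case neither variable alone fits the response exactly while their product does, which is the kind of pattern an interaction term is meant to capture.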
Both the individual variables noted above and interactions among variables have limited explanatory power for γ-Proteobacteria distributions (e.g., maximum r² = 0.244, p = 0.003). This indicates that nGoM α- and γ-Proteobacteria respond to different ecological determinants. The availability of aliphatic and aromatic hydrocarbons emanating from extensive hydrocarbon deposits on the nGoM continental slope (Sassen et al., 2001; Milkov and Sassen, 2003; Liu et al., 2009) might account for the increased relative abundance of γ-Proteobacteria in deeper waters of the nGoM compared to other systems. Several prior studies in other systems have shown that γ-Proteobacteria respond strongly to hydrocarbon availability (Gerdes et al., 2005; Al-Awadhi et al., 2007; Berthe-Corti and Nachtkamp, 2010), and Hazen et al. (2010), Valentine et al. (2010), and Kessler et al. (2011) have shown that various γ-Proteobacteria formed blooms at depths of about 1000-1200 m in response to the DWH hydrocarbon plume. In support of a role for hydrocarbon availability as a structuring factor for deep γ-Proteobacteria, members of the hydrocarbon-oxidizing genus Pseudoalteromonas (Melcher et al., 2002) are among the most abundant phylotypes in deep nGoM waters (Figure 4B).
The increase in relative abundance of both γ- and δ-Proteobacteria with depth (Figures 3A and 4B) might also be supported by chemolithoautotrophic growth as proposed by Swan et al. (2011). However, Swan et al. (2011) characterized clades of γ-Proteobacteria (ARCTIC96BD-19 and Agg47) that are poorly represented in the nGoM, and it is not yet known if some of the better represented groups, e.g., SAR86 and SAR92, grow chemolithoautotrophically. Initial genomic analyses suggest that SAR86 lacks genes essential for CO2 fixation (Dupont et al., 2012). In contrast, Swan et al. (2011) provided evidence for chemolithoautotrophic metabolism in a δ-Proteobacteria clade that is important in deep-water nGoM communities. This clade, SAR324, accounts for about 66% of nGoM δ-Proteobacteria. Thus, chemolithoautotrophy might account for changes in δ-Proteobacteria, while other factors determine the distribution of γ-Proteobacteria.
The abrupt increase in Thaumarchaeota relative abundance below 100 m reported here (Figure 3C) agrees with results of a separate qPCR study of Archaea and Bacteria 16S rRNA genes at the same sites (Tolar et al., in revision). Tolar et al. (in revision) show that compared to Bacteria, absolute and relative Thaumarchaeota abundances increase in deep samples. An earlier study of nGoM Archaea at a single station (samples from 10, 400, and 900 m) also observed that Marine Group I Crenarchaeota (i.e., Thaumarchaeota) dominate at depth, while Group II-β Euryarchaeota dominate in surface waters (Liu et al., 2009).
Although multiple reports have documented increased Thaumarchaeota abundance with increasing water column depth, the controls of this pattern remain unclear. Some have proposed that Thaumarchaeota are better adapted to environments characterized by low metabolic energy fluxes (see Valentine, 2007), and thus are better able than other lineages to couple growth to the limited resources (e.g., ammonium and perhaps some heterotrophic substrates) available below the epipelagic zone (Pester et al., 2011).
This hypothesis has implications for the spatial partitioning of thaumarchaeal clades reported by others (Santoro et al., 2010; Hu et al., 2011a,b; Yakimov et al., 2011) and also observed here (Figure A3 in Appendix). In particular, a "shallow water" clade consisting of phylotypes from surface waters and sediments has been distinguished from a "deep-water" clade found largely in meso- to bathypelagic samples (Beman et al., 2008; Hu et al., 2011a,b; Tolar et al., in revision). For nGoM samples, deep-clade phylotypes increase in relative abundance abruptly below 100 m, and then continue to increase linearly with depth (Figure A3 in Appendix). This might indicate that the surface-clade phylotypes are less well adapted than deep-clade phylotypes to low metabolic energy fluxes. Differences in gene expression for members of each clade in surface and deep samples could provide an indication of adaptations and competitiveness.

SPATIAL STABILITY OF DIVERSITY WITH DEPTH
Although nGoM bacterioplankton composition changes substantially with depth, neither richness (e.g., Chao 1), nor evenness (e.g., Shannon index) and dominance (e.g., inverse Simpson's index) measures of diversity show any distinct geographic trend (Figure 7A; Table A2 and Figure A5 in Appendix). The absence of trends within nGoM bacterioplankton at a regional scale is consistent with patterns reported for samples at a global scale. In particular, no distinct general trends as a function of depth or location have been reported for Chao 1 and the Shannon and inverse Simpson's indices for assemblages ranging from epipelagic to bathypelagic depths in the sub-tropical Pacific to the Arctic Ocean (summarized in Table 2, Stevens and Ulloa, 2008), although site-specific trends have been described for Archaea and Bacteria (e.g., Brown et al., 2009). This similarity suggests common patterns for community assembly regardless of location, depth, temperature, or other variables. Comparisons among data sets should be treated cautiously, however, since diversity indices have well known limitations (Hill et al., 2003). These include sensitivity to sequencing effort, numbers of OTUs, and patterns of rank abundance within a given community. Nonetheless, Hill et al. (2003) have shown that Chao 1 and the Shannon index can provide robust measures of changes among samples or systems.
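For readers unfamiliar with the three indices named above, a minimal sketch of their standard definitions follows. It is not the Mothur code used for this study: natural logarithms are assumed for the Shannon index, the inverse Simpson's index is computed from simple proportions, and the bias-corrected form of Chao 1 is used; which variants underlie the reported values is an assumption.

```python
import math


def shannon(counts):
    """Shannon index H' = -sum(p_i * ln p_i) over OTUs with nonzero counts."""
    total = sum(counts)
    return -sum((n / total) * math.log(n / total) for n in counts if n > 0)


def inverse_simpson(counts):
    """Inverse Simpson's index 1 / sum(p_i^2)."""
    total = sum(counts)
    return 1.0 / sum((n / total) ** 2 for n in counts if n > 0)


def chao1(counts):
    """Bias-corrected Chao 1: S_obs + F1 * (F1 - 1) / (2 * (F2 + 1)).

    F1 and F2 are the numbers of OTUs observed exactly once and twice.
    """
    observed = sum(1 for n in counts if n > 0)
    singletons = sum(1 for n in counts if n == 1)
    doubletons = sum(1 for n in counts if n == 2)
    return observed + singletons * (singletons - 1) / (2 * (doubletons + 1))


if __name__ == "__main__":
    even = [10, 10, 10, 10]    # four equally abundant OTUs
    skewed = [37, 1, 1, 1]     # one dominant OTU plus three singletons
    for label, counts in (("even", even), ("skewed", skewed)):
        print(label, shannon(counts), inverse_simpson(counts), chao1(counts))
```

For a perfectly even community of S OTUs, the Shannon index equals ln(S) and the inverse Simpson's index equals S, which is why both are read as combined measures of richness and evenness.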
To promote comparisons for the nGoM, we have used the Mothur pipeline (Schloss et al., 2009) to produce normalized samples comprised of equal numbers of sequences randomly selected from the pool of sequences available for each sample. We have then compared theoretical minimum and maximum Shannon indices with observed values (Figure 7A). Minimum values assume a single dominant OTU, and that all others occur as singletons. Maximum values assume that all OTUs are equally abundant. With one notable exception, a sample from 760 m, most of the assemblages are characterized by Shannon indices that range between about 50 and 70% of theoretical maxima. This provides further support for the notion that the structure of nGoM bacterioplankton communities develops independently of major physical-chemical variables (e.g., depth, temperature, oxygen) and even some biological variables (e.g., community composition, chlorophyll concentration).
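The normalization and the theoretical bounds described above can be sketched as follows. This is an illustration under stated assumptions, not the Mothur procedure: sequences are drawn without replacement, natural logarithms are used, and the function names are hypothetical.

```python
import math
import random


def subsample(counts, depth, seed=0):
    """Randomly draw `depth` sequences without replacement from one sample.

    counts: {otu_id: read_count}; returns a new {otu_id: read_count}.
    """
    pool = [otu for otu, n in counts.items() for _ in range(n)]
    if depth > len(pool):
        raise ValueError("sample has fewer sequences than the requested depth")
    picked = random.Random(seed).sample(pool, depth)
    result = {}
    for otu in picked:
        result[otu] = result.get(otu, 0) + 1
    return result


def shannon(counts):
    """Shannon index (natural log) from an iterable of read counts."""
    counts = [n for n in counts if n > 0]
    total = sum(counts)
    return -sum((n / total) * math.log(n / total) for n in counts)


def shannon_bounds(n_sequences, n_otus):
    """Theoretical minimum and maximum Shannon index for N sequences in S OTUs.

    Minimum: one dominant OTU and every other OTU a singleton.
    Maximum: all OTUs equally abundant, which gives ln(S).
    """
    if n_otus < 1 or n_sequences < n_otus:
        raise ValueError("need at least one sequence per OTU")
    dominant = n_sequences - (n_otus - 1)
    h_min = shannon([dominant] + [1] * (n_otus - 1))
    h_max = math.log(n_otus)
    return h_min, h_max


def fraction_of_maximum(counts):
    """Observed Shannon index as a fraction of its theoretical maximum."""
    counts = [n for n in counts if n > 0]
    _, h_max = shannon_bounds(sum(counts), len(counts))
    return shannon(counts) / h_max if h_max > 0 else 0.0
```

A sample would first be passed through `subsample` at the common depth, and the resulting counts then compared against `shannon_bounds` for the same numbers of sequences and OTUs.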

SPATIAL STABILITY OF PHYLOGENETIC COMMUNITY STRUCTURE
Two measures of phylogenetic community structure, NRI and NTI, provide additional insights about nGoM bacterioplankton communities. For this study, NRI values have been calculated using the full set of nGoM OTUs for null models; however, the choice of null model (see http://phylodiversity.net/phylocom/phylocom_manual.pdf) did not affect the outcome.
Like nGoM diversity indices, neither NRI nor NTI varies consistently in space (Figure 7B). This suggests that substantial changes in numerous physical-chemical and biological variables have little effect on phylogenetic community structure. Negative NRI values for all samples indicate that nGoM bacterioplankton communities are phylogenetically overdispersed throughout the water column. Overdispersion arises in a community when its OTUs are more distantly related to each other across terminal and deep branches of a community phylogenetic tree than expected by chance. Multiple processes have been proposed to account for overdispersion, including competitive exclusion when traits that determine community membership are phylogenetically conserved, and habitat filtering when such traits arise in different lineages by convergence or horizontal gene transfer (Webb et al., 2002; Vamosi et al., 2009).
Distinguishing between these or other processes that might lead to overdispersion requires a deeper understanding of the factors that constrain bacterioplankton richness than is currently available. However, since oligotrophs dominate the marine microbiota, heterotrophic substrate limitation might promote competitive exclusion, particularly if different groups of bacteria specialize on different carbon sources. Lauro et al. (2009) have recently identified genomic/metagenomic markers for copiotrophs and oligotrophs. Phylogenetic analysis of these markers in nGoM and other bacterioplankton could provide a means to test relationships between substrate utilization and phylogenetic community structure.
In contrast to uniformly negative NRI values, uniformly positive NTI values (Figure 7B) indicate that nGoM OTUs occurring at the tips of the community trees form clusters more closely related than expected by chance. OTU clustering has been attributed to "habitat filtering," which occurs when one or more environmental variables determine the patterns of community assembly. However, since clustering characterizes all nGoM samples, the environmental variables responsible for habitat filtering are uncertain, and likely vary among phylogenetic groups and with depth.
Habitat filtering has also been suggested as a structuring agent for surface bacterioplankton communities based on analyses of 16S rRNA gene sequences obtained from globally distributed samples (Barberán and Casamayor, 2010; Pontarp et al., 2012). For many of these communities, positive values have been obtained for both NRI and NTI, although in some cases, as in the nGoM, negative NRI and positive NTI values have also been reported. The more consistently positive NRI values in global scale studies (Barberán and Casamayor, 2010; Pontarp et al., 2012) might have arisen in part from differences in the null communities used to calculate NRI (Webb et al., 2002). Null communities used in global scale studies have been derived from a global OTU pool, while the nGoM analysis has been based on a regional pool. Nonetheless, NTI values from all samples are consistent with environmental selection as a driver for assembly of both surface and deep communities.

SUMMARY AND CONCLUSION
An extensive analysis of nGoM bacterioplankton (17 stations, 44 discrete samples, depths from 2 to 1700 m) showed that distinct assemblages characterized ≤100 and >100 m depths. SAR11 α-Proteobacteria and Bacteroidetes were prominent in the former, while γ-Proteobacteria, Firmicutes, and Thaumarchaeota were prominent in the latter. Though composition varied substantially with depth, diversity indices did not, which indicated that the structure of nGoM bacterioplankton communities was relatively stable across large gradients in physical-chemical and biological variables. Phylogenetic community structure was also relatively stable, with no variation evident among stations or depths. NTI values indicated that habitat filtering played a role in community assembly at all depths, while NRI values indicated that other processes, e.g., competitive exclusion, contributed as well. Collectively, these results offer the first synoptic insights into the composition and diversity of nGoM bacterioplankton, and provide a basis for understanding their dynamics at a regional scale.

ACKNOWLEDGMENTS
This work was supported partially by GoMRI-LSU and the National Science Foundation (OCE-0943278 and ANT-0838996 to James T. Hollibaugh). We thank L. Powers for assistance with sample collection and CTD data processing. We thank the crew of the R/V Cape Hatteras and scientists from the Gulf Carbon 5 cruise who provided data for this study, especially C. Fichot, W.-J. Cai, and W.-J. Huang. Funding for GulfCarbon was from NSF awards OCE-0752110 (W.-J. Cai) and OCE-0752254 (S. Lohrenz).
FIGURE A2 | Thaumarchaeota sequences as a percentage of total sequences plotted versus total Archaea sequence percentages. Linear regression trend line indicates that Thaumarchaeota account for about 76% of all Archaea sequences (y = −1.14 + 0.755x, r² = 0.946) regardless of sample site or depth.