<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2019.00993</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>The Water Microbiome Through a Pilot Scale Advanced Treatment Facility for Direct Potable Reuse</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Kantor</surname> <given-names>Rose S.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/636865/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Miller</surname> <given-names>Scott E.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/727493/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Nelson</surname> <given-names>Kara L.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/485454/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Civil and Environmental Engineering, University of California, Berkeley</institution>, <addr-line>Berkeley, CA</addr-line>, <country>United States</country></aff>
<aff id="aff2"><sup>2</sup><institution>Engineering Research Center for Re-inventing the Nation&#x2019;s Urban Water Infrastructure</institution>, <addr-line>Berkeley, CA</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Frederik Hammes, Swiss Federal Institute of Aquatic Science and Technology, Switzerland</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Catherine J. Paul, Lund University, Sweden; Katherine S. Dowdell, University of Michigan, United States</p></fn>
<corresp id="c001">&#x002A;Correspondence: Kara L. Nelson, <email>karanelson@berkeley.edu</email></corresp>
<fn fn-type="other" id="fn002"><p>This article was submitted to Microbiotechnology, Ecotoxicology and Bioremediation, a section of the journal Frontiers in Microbiology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>08</day>
<month>05</month>
<year>2019</year>
</pub-date>
<pub-date pub-type="collection">
<year>2019</year>
</pub-date>
<volume>10</volume>
<elocation-id>993</elocation-id>
<history>
<date date-type="received">
<day>02</day>
<month>11</month>
<year>2018</year>
</date>
<date date-type="accepted">
<day>18</day>
<month>04</month>
<year>2019</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2019 Kantor, Miller and Nelson.</copyright-statement>
<copyright-year>2019</copyright-year>
<copyright-holder>Kantor, Miller and Nelson</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Advanced treatment facilities for potable water reuse of wastewater are designed to achieve high removal levels of specific pathogens, as well as many other constituents. However, changes to the microbial community throughout treatment, storage, and distribution of this water have not been well characterized. We applied high-throughput amplicon sequencing, read-based, assembly-based, and genome-resolved metagenomics, and flow cytometry to investigate the microbial communities present in a pilot-scale advanced water treatment facility. Advanced treatment of secondary-treated wastewater consisted of ozonation, chloramination, microfiltration, reverse osmosis (RO), advanced oxidation (UV/H<sub>2</sub>O<sub>2</sub>), granular activated carbon (GAC) filtration, and chlorination. Treated water was fed into bench-scale simulated distribution systems (SDS). Cell counts and microbial diversity in bulk water decreased until GAC filtration, and the bacterial communities were significantly different following each treatment step. Bacteria grew within GAC media and contributed to a consistent microbial community in the filtrate, which included members of the <italic>Rhizobiales</italic> and <italic>Mycobacteriaceae</italic>. After chlorination, some of the GAC filtrate community was maintained within the SDS, and community shifts were associated with stagnation. Putative antibiotic resistance genes and potential opportunistic pathogens were identified before RO and after advanced oxidation, although few if any members of the wastewater microbial community passed through these treatment steps. These findings can contribute to improved design of advanced treatment trains and management of microbial communities in post-treatment steps.</p>
</abstract>
<kwd-group>
<kwd>direct potable reuse</kwd>
<kwd>metagenomics</kwd>
<kwd>16S rRNA gene sequencing</kwd>
<kwd>drinking water microbiome</kwd>
<kwd>advanced water treatment</kwd>
<kwd>antibiotic resistance</kwd>
</kwd-group>
<contract-num rid="cn001">1028968</contract-num>
<contract-sponsor id="cn001">National Science Foundation<named-content content-type="fundref-id">10.13039/100000001</named-content></contract-sponsor>
<counts>
<fig-count count="6"/>
<table-count count="2"/>
<equation-count count="0"/>
<ref-count count="67"/>
<page-count count="15"/>
<word-count count="0"/>
</counts>
</article-meta>
</front>
<body>
<sec><title>Introduction</title>
<p>Regions facing water scarcity are turning to new sources of drinking water, including wastewater that has been purified through advanced treatment to meet potable water quality standards (<xref ref-type="bibr" rid="B48">Raucher and Tchobanoglous, 2014</xref>). Advanced treatment follows secondary or tertiary wastewater treatment and typically includes MF and RO, followed by advanced oxidation (UV/H<sub>2</sub>O<sub>2</sub>) to degrade remaining chemical contaminants and inactivate pathogens (<xref ref-type="bibr" rid="B19">Gerrity et al., 2013</xref>).</p>
<p>In direct potable reuse systems, purified water is blended into the drinking water distribution system after minimal storage time, and therefore microbial risks must be considered carefully. Current regulations focus on virus and protozoan cyst removal, and ensuring that the risk level for these pathogens in advanced treated water is as low as for conventional drinking water. For example, California has implemented the so-called 12/10/10 rule for indirect potable reuse, requiring treatment trains to meet specific log<sub>10</sub> removals of viruses, <italic>Cryptosporidium</italic> oocysts, and <italic>Giardia</italic> cysts, respectively (<xref ref-type="bibr" rid="B7">California Code of Regulations, 2014</xref>), and regulations for direct potable reuse are under development (<xref ref-type="bibr" rid="B55">State Water Resources Control Board, 2016</xref>; <xref ref-type="bibr" rid="B41">Pecson et al., 2017</xref>). However, bacteria are also of concern, and bacterial communities established during treatment have been shown to influence communities found in distributed water (<xref ref-type="bibr" rid="B44">Pinto et al., 2012</xref>). Critically, unlike human viruses and enteric protozoa, bacteria can replicate during and after treatment, and their growth is dependent on a variety of factors including disinfectant residual and nutrient concentrations during distribution (<xref ref-type="bibr" rid="B35">Nescerecka et al., 2014</xref>; <xref ref-type="bibr" rid="B45">Prest et al., 2016a</xref>,<xref ref-type="bibr" rid="B46">b</xref>). To understand how advanced treatment affects microbial water quality, it is necessary to examine removal and growth of bacteria across treatment trains and in distribution.</p>
<p>In addition to culture-based methods (e.g., heterotrophic plate counts) and direct biomass quantification methods (e.g., adenosine triphosphate and flow cytometry), water engineers are increasingly making use of high-throughput DNA sequencing technologies and microbial ecology analyses to study the effects of drinking water treatment and distribution on microbial communities. Amplicon sequencing is used to inventory the microbial species present in water or biofilm in terms of taxonomic identity and relative abundance through use of a common marker sequence, typically one or several regions of the 16S ribosomal RNA (rRNA) gene (<xref ref-type="bibr" rid="B58">Vignola et al., 2017</xref>; <xref ref-type="bibr" rid="B31">Liu et al., 2018</xref>). Metagenomics, the reconstruction of genes and genomes from uncultured environmental microorganisms, has also been applied to drinking water treatment and distribution (<xref ref-type="bibr" rid="B43">Pinto et al., 2016</xref>; <xref ref-type="bibr" rid="B67">Zhang et al., 2017</xref>; <xref ref-type="bibr" rid="B36">Oh et al., 2018</xref>). However, there are few studies of microbial communities in the water of potable reuse treatment trains and in distribution systems fed with advanced purified water (<xref ref-type="bibr" rid="B50">Salveson et al., 2018</xref>; <xref ref-type="bibr" rid="B54">Stamps et al., 2018</xref>). Given that variations in treatment design and post-treatment processes may impact microbial communities, multiple studies of different treatment trains will be needed to advance the field.</p>
<p>We studied a pilot-scale advanced water treatment facility in El Paso, Texas. Here, we report on DNA sequencing-based analyses of microbial communities sampled across the advanced treatment train and chlorinated SDS fed with the advanced treated water. In a separate manuscript, we will report in more detail on changes in total and intact cells via flow cytometry, as well as metrics of microbial growth capacity.</p>
<p>With our analyses, we demonstrate the utility and pitfalls of high-throughput sequencing to study potable reuse treatment trains and simulated distribution, in which: (1) low-biomass samples are highly sensitive to contamination; (2) high resolution of sequences is critical; and (3) the engineering goals require information about absolute abundance. To meet these challenges, we report observations about our sequencing controls, make use of recent advances allowing resolution of Amplicon Sequencing Variants (ASVs) that in some cases correspond to near-complete MAGs, and combine sequencing-based relative abundance with absolute cell counts. We use this information to examine changes in microbial community composition through advanced treatment and water distribution to identify populations that may persist through treatment, and to search for potential pathogens. We use metagenomic data to investigate antibiotic resistance potential before and after treatment and to explore possible reasons for the growth of specific organisms in highly purified water.</p>
</sec>
<sec id="s1" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec><title>Experimental Facilities</title>
<p>We sampled a pilot-scale advanced purification facility in El Paso, Texas that operated from June 8, 2015 to January 29, 2016 and treated 0.14 million gallons per day. The feed to the plant was secondary-treated wastewater, which was ozonated to a target concentration of 3.5 mg/L (&#x223C;5 min storage time, after which there was no detectable residual) and chloraminated to a target residual of 2&#x2013;4 mg/L as Cl<sub>2</sub> to reduce fouling of the MF membranes. The flow was split and treated by parallel MF units (Pall module type UNA-620A and Evoqua Memcor CPII L40N), and effluents were recombined in a storage tank. Water leaving the storage tank was then split equally and fed into parallel NF (Dow, model NF90-400/34i) and RO (Hydranautics, model ESPA2-LD) processes. The recombined flow was treated by an ultraviolet/advanced oxidation process (UV/AOP). UV was supplied via Trojan UV Swift SC B08<sup>TM</sup> at a dose of 840 mJ/cm<sup>2</sup>, and oxidation was achieved by addition of hydrogen peroxide to a target concentration of 4 mg/L. UV/AOP treated water was then split for parallel filtration by three types of GAC (see <xref ref-type="supplementary-material" rid="SM1">Supplementary Methods</xref>).</p>
<p>On alternating days, effluents from each of the GAC filters were collected in sterile bottles, transported on ice to the laboratory, and chlorinated within 6 h in a 5 L glass reservoir using sodium hypochlorite (ACS grade, Spectrum Chemical MFG Corp.) to achieve a CT of 30 mg-min/L and a free chlorine residual of approximately 0.8&#x2013;1.0 mg/L as Cl<sub>2</sub> at the end of the 30-min batch chlorination (unless otherwise noted). Chlorinated water was then transferred to three separate storage containers and continuously pumped to three annular reactors to simulate distribution (referred to throughout as SDS). Prior to operation, the SDS were sterilized and prepared as previous described (<xref ref-type="bibr" rid="B61">Wang et al., 2013</xref>). SDS were operated in the dark at a hydraulic residence time of 18 h (flow rate &#x223C;0.92 mL &#x22C5; min<sup>&#x2212;1</sup>) and ambient temperature of &#x223C;22&#x00B0;C. The inner cylinder rotation speed was set to 50 rpm, resulting in a shear stress of &#x223C;0.15 N &#x22C5; m<sup>&#x2212;2</sup> on the inner cylinder surface. All components of the simulated system (including the chlorination and storage reservoirs) were enclosed in aluminum foil to block light. Reactor operation is further described in <xref ref-type="supplementary-material" rid="SM1">Supplementary Methods</xref>. Several days before the end of SDS operation, SDS reactors were intentionally left stagnant for 24 h, after which feeding and mixing were resumed to allow stagnated reactor effluent to be sampled. Stagnation was induced only this once in the entire SDS operation.</p>
</sec>
<sec><title>Sample Collection</title>
<p>Bulk water was sampled weekly from November 2015 through January 2016, during months 6&#x2013;8 of pilot facility operation. Samples were collected in autoclave-sterilized 1-L Nalgene bottles from secondary wastewater feed to the pilot facility and from taps located after each treatment step: after chloramination, post-MF storage tank, NF permeate, combined storage after parallel NF/RO, after each of three parallel GAC column filters, and after each of three parallel SDS (<xref ref-type="fig" rid="F1">Figure 1</xref> and <xref ref-type="table" rid="T1">Table 1</xref>). Samples with residual chloramine disinfectant were quenched with excess sodium thiosulfate. Sampling of just the RO permeate did not yield sufficient DNA for sequencing, and therefore sampling was changed to the NF/RO combined tap in an attempt to concentrate more biomass onto a single filter. Sample taps were flushed for following periods: secondary wastewater and chloramine (&#x003E;5 min), MF/UF filtrate (&#x003E;15 min), NF and NF/RO combined permeate (&#x003E;30 min), and GAC filtrate (&#x003E;15 min). Water samples were filtered onto 0.22 &#x03BC;m mixed cellulose ester filters (Millipore) (see <xref ref-type="supplementary-material" rid="SM4">Supplementary Table S1</xref> for filtration volumes). All filtration equipment (glass vacuum flask, glass funnel, glass filter supports) was autoclave-sterilized prior to use. Filters were handled using flame-sterilized tweezers. Filtered samples were stored in sterile 5-mL screw cap tubes (Axygen<sup>TM</sup>, Thermo Fisher Scientific).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Treatment train and experimental design. Secondary treated wastewater was fed to a pilot-scale advanced water treatment train which included parallel microfiltration (MF) and ultrafiltration (UF), parallel nanofiltration (NF), and reverse osmosis (RO), UV/advanced oxidation process (AOP), and three parallel granular activated carbon columns (see <xref ref-type="supplementary-material" rid="SM1">Supplementary Methods</xref>). Distribution was modeled by three parallel flows through batch chlorination, storage, and annular reactors. Red numbers indicate sample locations corresponding to identifiers in <xref ref-type="table" rid="T1">Table 1</xref>.</p></caption>
<graphic xlink:href="fmicb-10-00993-g001.tif"/>
</fig>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Counts and locations of samples.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left">Location</th>
<th valign="top" align="center">Identifier (<xref ref-type="fig" rid="F1">Figure 1</xref>)</th>
<th valign="top" align="center">Total amplicon</th>
<th valign="top" align="center">Post-QC amplicon</th>
<th valign="top" align="center">Metagenomics</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Secondary wastewater</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left">Chloramine</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">MF</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">NF/RO</td>
<td valign="top" align="center">4A, 4B</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">GAC filtrate (3)</td>
<td valign="top" align="center">5A, 5B, 5C</td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left">GAC media (3)</td>
<td valign="top" align="center">6A, 6B, 6C</td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left">SDS (3)</td>
<td valign="top" align="center">7A, 7B, 7C</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left">Amplification blank</td>
<td valign="top" align="center"></td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">Extraction blank</td>
<td valign="top" align="center"></td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">Field blank</td>
<td valign="top" align="center"></td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">&#x2013;</td>
<td valign="top" align="center">&#x2013;</td>
</tr>
<tr>
<td valign="top" align="left">Zymobiomics mock</td>
<td valign="top" align="center"></td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">1</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Total and intact cell counts in bulk water grab samples were determined by flow cytometry as described in <xref ref-type="supplementary-material" rid="SM1">Supplementary Methods</xref>. Biofilm samples were collected during plant shutdown by swabbing the internal surfaces of pipes downstream from each treatment process using sterile cotton swabs, but samples did not yield enough DNA for further analysis (data not shown). GAC media samples were collected by disassembling GAC columns immediately following the end of plant operation on January 28, 2016 (day 234 of the study). Media was poured into 50 mL tubes and stored at &#x2212;80&#x00B0;C until DNA extraction. Media from columns 1 and 2 was from the top of the column, while column 3 was sampled from the middle. Sampling information is summarized in <xref ref-type="table" rid="T1">Table 1</xref> and presented in detail in <xref ref-type="supplementary-material" rid="SM4">Supplementary Table S1</xref>.</p>
</sec>
<sec><title>DNA Extraction, Library Preparation, and Sequencing</title>
<p>An optimized nucleic acid extraction protocol was used on the concentrated bulk water samples because very low biomass was expected in some pilot facility and annular reactor bulk water samples. Briefly, filter membranes were torn and placed into Lysing Matrix A tubes (MP Biomedicals) with flame-sterilized tweezers. A modified phenol-chloroform-isoamyl alcohol extraction was used as previously described, with the exception of the 70&#x00B0;C water bath pre-heat of the filter (<xref ref-type="bibr" rid="B44">Pinto et al., 2012</xref>; <xref ref-type="bibr" rid="B2">Bautista-de Los Santos et al., 2016</xref>). Field blanks were created by filtering 1 L of autoclaved de-ionized water onto 0.22 &#x03BC;m mixed cellulose ester filters (Millipore) and then processed identically to field samples.</p>
<p>Genomic DNA (gDNA) was extracted from the GAC media samples using the MoBio DNeasy PowerSoil Kit (Qiagen) according to manufacturer&#x2019;s instructions, with the following modifications: samples were incubated in lysis buffer at 65&#x00B0;C for 15 min prior to proceeding with the extraction protocol, bead-beating was performed at maximum speed for 10 min using a vortex adapter, the two inhibitor removal steps were combined into a single step to limit loss of gDNA, and the spin filter was incubated at 55&#x00B0;C for 10 min with nuclease-free water prior to elution to increase gDNA eluted. Extraction blanks were created by processing PCR-grade water alongside with and identically to field samples. Media samples yielded DNA for both amplicon and metagenomic sequencing. The amplicon libraries suffered from contamination and were excluded from analysis.</p>
<p>Extraction yields were assessed via Qubit dsDNA HS assay kit (Thermo Fisher Scientific). When necessary, gDNA was concentrated using a SpeedVac, and inhibitors were reduced with OneStep PCR Inhibitor Removal Kit (Zymo Research, Irvine, CA, United States).</p>
<p>Library preparation for amplicon sequencing followed the Schloss Lab MiSeq wet-lab protocol for amplification of the V4 region of the 16S rRNA gene<sup><xref ref-type="fn" rid="fn01">1</xref></sup> (<xref ref-type="bibr" rid="B25">Kozich et al., 2013</xref>) with slight modification (<xref ref-type="supplementary-material" rid="SM4">Supplementary Table S2</xref>). Briefly, the V4 region was amplified using uniquely barcoded 515F and 806R primers with Phusion Hot Start II polymerase, HF buffer, dimethyl sulfoxide (3.75% final concentration) and 1&#x2013;2 &#x03BC;l of genomic DNA. Triplicate 25 &#x03BC;l reactions were combined and concentrated by SpeedVac. For difficult reactions, up to five replicates were pooled. The dual-barcoded libraries (including sampling, extraction, and PCR negative control libraries) were normalized using the SequalPrep Normalization Plate kit (Invitrogen), pooled, and sequenced on a MiSeq with V3 chemistry for 600 cycles, yielding paired-end 250 bp reads. Mock-community control libraries were constructed using the ZymoBIOMICS Microbial Community DNA Standard (Zymo Research, Irvine, CA, United States) and included in amplicon and metagenomic sequencing (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S1</xref>).</p>
<p>Library preparation for metagenomic sequencing was performed at the Functional Genomics Laboratory at UC Berkeley. Briefly, 500 bp insert libraries were constructed using the KAPA Hyper prep kit with a PCR amplification step. Sequencing was performed on an Illumina HiSeq 4000 at the Vincent J. Coates Genomics Sequencing Laboratory at UC Berkeley, yielding 150 bp paired-end reads.</p>
</sec>
<sec><title>Amplicon Sequence Data Processing</title>
<p>Reads were demultiplexed and mapped to PhiX (21.7%). Forward reads were processed using the DADA2 pipeline v1.4.0 (<xref ref-type="bibr" rid="B8">Callahan et al., 2016</xref>) according to <ext-link ext-link-type="uri" xlink:href="http://benjjneb.github.io/dada2/bigdata.html">http://benjjneb.github.io/dada2/bigdata.html</ext-link> to produce ASVs. Briefly, based on evaluation of read quality using FastQC<sup><xref ref-type="fn" rid="fn02">2</xref></sup>, reads were truncated to 230 nucleotides (nt) and left-trimmed to remove 15 nt from the 5&#x2032; end, resulting in 215 nt sequences. Reads were also truncated where quality dipped below <italic>Q</italic> = 11, and any reads &#x003C;200 nt were removed. On average 70% of reads remained after trimming. Error modeling used the first 2 million filtered reads, and samples were denoised individually using this error model. The denoised sequences were screened for chimeras with removeBimeraDenovo, which removed 3153 sequences accounting for 4.3% of all reads. Taxonomy was assigned using the Ribosomal Database Project&#x2019;s naive Bayesian classifier (<xref ref-type="bibr" rid="B62">Wang et al., 2007</xref>) within DADA2. The classifier was trained on Silva v128 (<xref ref-type="bibr" rid="B47">Quast et al., 2013</xref>) and set to use a minimum bootstrap confidence of 50 to assign a given rank (default minBoot = 50). Error-corrected, taxonomically classified data were analyzed with the Phyloseq (v1.20.0) package in R (<xref ref-type="bibr" rid="B33">McMurdie and Holmes, 2013</xref>) (see <xref ref-type="supplementary-material" rid="SM2">Supplementary Data Sheet S2</xref> and <ext-link ext-link-type="uri" xlink:href="https://github.com/rosekantor/16S_metagenomics_notebooks">https://github.com/rosekantor/16S_metagenomics_notebooks</ext-link>).</p>
<p>Data from bulk water samples and corresponding field, extraction, and amplification negative controls were prefiltered for successful sequencing (&#x003E;10000 reads), and low-biomass samples were removed prior to analyses due to obvious contamination with secondary wastewater (two samples) and MF filtrate (one sample). This prefiltering yielded 37 samples and 3 negative controls. DESeq2 analysis was performed with all ASVs found in any prefiltered negative control or sample. Of 108 ASVs that were shared between any sample and any negative control, 7 were significantly enriched in samples over negative controls (DESeq2 p-adjust &#x003C;0.01). These may represent cross-contamination of negative controls with samples or different but closely related organisms present in samples and negative controls. We removed from samples all ASVs found in negative controls except for these 7. After filtering contaminant reads, one sample and one negative control with fewer than 300 reads were removed, and the two remaining amplification and extraction controls contained 2 and 3 ASVs, respectively.</p>
<p>Rarefaction curves were plotted for bulk water samples using <italic>rarecurve</italic> in the vegan R package (v2.4.4). The DADA2 pipeline removes singletons because they cannot be reliably distinguished from sequencing errors, precluding the use of alpha diversity metrics that rely on singletons. Given that the rarefaction curves plateau for all samples (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S2</xref>), total observed ASVs were used as a metric of alpha diversity. Beta diversity was analyzed via principle coordinates analysis (PCoA) using Bray-Curtis dissimilarity and variables contributing to differences between communities were identified with PERMANOVA using <italic>adonis</italic> in the vegan package for R. Additional PERMANOVA testing using Aitchison distance found the same level of significance as was observed when using Bray-Curtis dissimilarity of percent relative abundance data. Calculating and testing for homogeneity of group dispersions used the <italic>betadisper</italic> and <italic>permutest</italic> functions within the vegan package in R (groups were sample locations).</p>
<p>To estimate abundances of genera containing opportunistic pathogens, the relative abundances of all ASVs (&#x003E;0.05% relative abundance in any sample) belonging to the genera <italic>Legionella</italic> and <italic>Mycobacterium</italic> were summed for each sample at each location in the treatment train. The minimum, median, and maximum relative abundances were then determined for each location. To generate the range of absolute abundances, we multiplied minimum, median, and maximum relative abundances by the minimum, median, and maximum total cell counts ever observed at a given location across the duration of the study.</p>
</sec>
<sec><title>Metagenomic Sequence Processing</title>
<p>Metagenomic read processing used FastQC (Babraham Bioinformatics, see text footnote 2) to inspect quality, bbmap (Bushnell<sup><xref ref-type="fn" rid="fn03">3</xref></sup>) to remove PhiX and adapter sequences, and sickle (<xref ref-type="bibr" rid="B27">Joshi and Fass, 2011</xref>) for quality trimming. Read-based phylogenetic characterization was performed with MetaPhlAn2 using default parameters (<xref ref-type="bibr" rid="B56">Truong et al., 2015</xref>). Assembly was performed independently for each sample using idba_ud (<xref ref-type="bibr" rid="B42">Peng et al., 2012</xref>) with the &#x201C;&#x2013;pre_correction&#x201D; option. Annotation of scaffolds ( 1 kbp was performed with Prokka v1.12 (<xref ref-type="bibr" rid="B51">Seemann, 2014</xref>) and RNAmmer (<xref ref-type="bibr" rid="B28">Lagesen et al., 2007</xref>).</p>
<p>Metagenomic analysis was conducted in Anvi&#x2019;o (<xref ref-type="bibr" rid="B17">Eren et al., 2015</xref>) according to the metagenomic workflow for v3 and v4. For each assembly (scaffolds &#x2265;2.5 kbp), an Anvi&#x2019;o contig database was generated and a profile database was created that included read-mapping information from each sample. Taxonomy was added using Centrifuge (v1.0.2-beta). Binning was performed manually based on hierarchical clustering by sequence composition and differential coverage in Anvi&#x2019;o. Genome bins &#x003E;70% complete with &#x003C;10% contamination according to CheckM (<xref ref-type="bibr" rid="B40">Parks et al., 2015</xref>) were de-replicated to generate a non-redundant set of MAGs using dRep (<xref ref-type="bibr" rid="B37">Olm et al., 2017</xref>) with primary clustering at 95% ANI and secondary clustering at 99% ANI, requiring 60% coverage of the larger genome in each pairwise comparison. MAGs were taxonomically identified based on their position in a concatenated gene tree (see <xref ref-type="supplementary-material" rid="SM1">Supplementary Methods</xref>), Centrifuge results, and CheckM taxonomy placement (<xref ref-type="supplementary-material" rid="SM4">Supplementary Tables S3</xref>, <xref ref-type="supplementary-material" rid="SM4">S4</xref>). Non-redundant MAGs were curated to remove misassemblies using ra2.py (<xref ref-type="bibr" rid="B5">Brown et al., 2015</xref>)<sup><xref ref-type="fn" rid="fn04">4</xref></sup> and open reading frames were predicted using Prodigal in single genome mode (<xref ref-type="bibr" rid="B22">Hyatt et al., 2010</xref>). Annotation of predicted proteins was conducted using USEARCH (<xref ref-type="bibr" rid="B15">Edgar, 2010</xref>) against the KEGG database (<xref ref-type="bibr" rid="B23">Kanehisa et al., 2016</xref>). C1-carbon utilization was determined by manual searches of KEGG-based annotations against select proteins representative of the pathways of interest (<xref ref-type="supplementary-material" rid="SM4">Supplementary Table S5</xref>).</p>
<p>To investigate shared microbial community members across samples, reads from each sample were stringently mapped to (1) assemblies of each sample (scaffolds &#x2265;1 kbp), (2) scaffolds &#x2265;1 kbp containing unique RpsC sequences, and (3) the non-redundant MAGs (see <xref ref-type="supplementary-material" rid="SM1">Supplementary Methods</xref>). Stringent mappings were also used to calculate indices of replication (iRep values) (<xref ref-type="bibr" rid="B6">Brown et al., 2016</xref>) for each MAG in each sample using iRep<sup><xref ref-type="fn" rid="fn05">5</xref></sup> with default settings.</p>
<p>Antibiotic resistance genes were detected via HMMsearch of the predicted proteins against the ResFams core database of hidden Markov models (HMMs) (<xref ref-type="bibr" rid="B20">Gibson et al., 2015</xref>). Within each resistance gene family, ResFam hits were sorted by length and clustered at 99% identity using USEARCH -cluster_fast (<xref ref-type="bibr" rid="B15">Edgar, 2010</xref>). Tabulated results were searched for any clusters containing proteins from both secondary wastewater and post-NF/RO samples. Ammonium monooxygenase genes were identified using TIGRFAM HMMs, and custom HMMs were used to search for Adenovirus and JC Polyomavirus sequences (see <xref ref-type="supplementary-material" rid="SM1">Supplementary Methods</xref>).</p>
<p>A phylogenomic tree for the Mycobacteriaceae was constructed with the Anvi&#x2019;o phylogenomics pipeline, using thirty reference genomes downloaded from NCBI representing each of the major clades described in <xref ref-type="bibr" rid="B21">Gupta et al. (2018)</xref>. A subset of 108 of the single copy genes/domains from <xref ref-type="bibr" rid="B9">Campbell et al. (2013)</xref> were identified via HMMsearch (<xref ref-type="bibr" rid="B13">Eddy, 2009</xref>), and individual alignments were constructed with MUSCLE (<xref ref-type="bibr" rid="B14">Edgar, 2004</xref>). A concatenated alignment was manually curated to remove C- and N-termini overhangs and regions of poor alignment. The resulting alignment was used to construct a phylogenetic tree with RAxML (<xref ref-type="bibr" rid="B53">Stamatakis, 2014</xref>) using 100 bootstraps.</p>
</sec>
</sec>
<sec><title>Results</title>
<p>Bulk water was sampled over a 3-month span at a pilot-scale advanced treatment facility that processed secondary wastewater to potable quality via ozonation, chloramination, MF, RO or NF, UV-hydrogen peroxide AOP, and GAC filtration (<xref ref-type="fig" rid="F1">Figure 1</xref>). Samples were collected after each treatment process, and sample totals for 16S rRNA gene (V4 region) amplicon and metagenomic sequencing are shown in <xref ref-type="table" rid="T1">Table 1</xref>. GAC media were also sampled and passed QC for metagenomics but not amplicon sequencing (<xref ref-type="table" rid="T1">Table 1</xref>). Amplicon sequencing for 37 bulk water samples yielded a total of 2900 unique ASVs. After data decontamination based on negative controls (see methods), sample read-counts ranged from 3499 to 78462, and negative controls contained fewer than 1000 reads. The majority of samples were taken between days 184&#x2013;228 of pilot plant operation.</p>
<p>Metagenomic sequencing was performed for 12 samples including three secondary wastewater feed samples from different days, GAC filtrates sampled from three parallel columns on the same day, GAC media samples from three parallel columns on the same day, and bulk water samples from three SDS on the same day (<xref ref-type="table" rid="T1">Table 1</xref> and <xref ref-type="supplementary-material" rid="SM4">Supplementary Table S3</xref>). Illumina paired-end sequencing of total genomic DNA yielded between 3.45 and 7.05 Gbp of raw reads per sample, and between 52 and 94% of trimmed reads could be mapped to their respective assemblies (scaffolds &#x2265;1 kbp). Hierarchical clustering based on differential coverage and sequence composition aided in reconstruction of 317 bins across all samples, and de-replication of 72 high-quality bins resulted in a set of 38 non-redundant MAGs (<xref ref-type="supplementary-material" rid="SM4">Supplementary Tables S3</xref>, <xref ref-type="supplementary-material" rid="SM4">S4</xref>). These bins collectively represented as little as 8% of the wastewater community (these assemblies had many incomplete bins), but as much as 73% of the GAC filtrate and SDS communities, based on percentages of reads mapped.</p>
<sec><title>Microbial Diversity and Absolute Abundance</title>
<p>Absolute abundances and alpha diversity of the bacterial community followed similar trajectories throughout the treatment train. Flow-cytometric cell counts decreased from secondary wastewater through NF/RO and then increased after GAC filtration, indicating growth on filter media (<xref ref-type="fig" rid="F2">Figure 2A</xref>). Cell counts in GAC filtrate and SDS were 3 to 4 log<sub>10</sub> lower than the secondary wastewater feed. Despite chlorination following GAC filtration, the median intact cell count in the three SDS was 2.7 &#x00D7; 10<sup>3</sup> cells &#x22C5; mL<sup>&#x2212;1</sup>. Intact cell counts were often lower than total cell counts, indicating that some sequence information was derived from non-viable cells. Median richness decreased across the treatment train with a slight increase after GAC (<xref ref-type="fig" rid="F2">Figure 2B</xref>). Observed ASVs were used as a proxy for richness within each sample because rarefaction curves plateaued for nearly all bulk water samples (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S2</xref>) and singletons had been removed during data processing. Between secondary wastewater and SDS, median observed ASVs decreased from 536 to 34 (a 1.09 log<sub>10</sub> reduction) (<xref ref-type="fig" rid="F2">Figure 2B</xref>).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption><p>Cell counts and diversity in bulk water across the treatment train. <bold>(A)</bold> Counts of total (gray) and intact (black) cells based on flow cytometry, and <bold>(B)</bold> counts of observed amplicon sequence variants (ASVs). Solid line in both panels indicates chlorination and storage prior to feeding into simulated distribution systems (SDS). Dashed lines indicate minimum detection limits for total and intact cell counts (see <xref ref-type="supplementary-material" rid="SM4">Supplementary Table S7</xref> for supporting data). Sample numbers for flow cytometry and amplicon sequencing (after data decontamination) are noted for each location. In panel <bold>(A)</bold>, MF and NF/RO cell counts represent interprocess storage tanks after the respective process.</p></caption>
<graphic xlink:href="fmicb-10-00993-g002.tif"/>
</fig>
</sec>
<sec><title>Microbial Communities Across Treatment and Simulated Distribution</title>
<p>There were significant differences in the taxonomic composition of bulk water bacterial communities from different sample locations (<xref ref-type="fig" rid="F3">Figure 3A,B</xref>). By principal coordinate analysis (PCoA), bacterial communities clustered by location within the treatment train (<xref ref-type="fig" rid="F4">Figure 4</xref>). This clustering was supported by PERMANOVA, which suggested differences between communities were partially explained by sample location (PERMANOVA <italic>R</italic><sup>2</sup> = 0.57; <italic>p</italic> &#x003C; 0.001; same level of significance using proportion-normalized or rarefied data), but not sample date (<italic>R</italic><sup>2</sup> = 0.15; <italic>p</italic> &#x003C; 0.08). Some of the significance of sample location in this test may be due to heterogeneous group dispersions of samples within different locations (permutest <italic>p</italic> = 0.053 with 10000 permutations) (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S3</xref>). Notably, MF and GAC had the lowest distance to the centroid of their groups, suggesting the most consistent communities. NF/RO permeate and negative controls did not include enough successfully amplified and decontaminated samples to provide information about within-location variability (<italic>n</italic> = 2 for each).</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption><p>Changes in the bacterial composition of bulk water through advanced treatment and simulated distribution. <bold>(A)</bold> Barplot and <bold>(B)</bold> heatmap of amplicon data across the treatment train. <bold>(A)</bold> Samples are bars grouped by location in the treatment train. Bar height indicates relative abundances and Family-level taxonomy of ASVs, and each segment within a bar represents a unique ASV and is colored by Family (I.S., &#x201C;Incertae Sedis&#x201D;). <bold>(B)</bold> Samples are columns and ASVs are rows. Color indicates percent relative abundance on a log-scale (black indicates non-detection). Both panels report the same 64 ASVs present at a minimum relative abundance of 2% in at least one sample (see <xref ref-type="supplementary-material" rid="SM4">Supplementary Table S6</xref> for additional data). Sample descriptions (<italic>x</italic>-axis) indicate secondary wastewater entering advanced treatment (&#x201C;Feed&#x201D;), chloramine treated (&#x201C;Treated&#x201D;), MF permeate in storage tank (&#x201C;Perm&#x201D;), NF permeate in storage tank (&#x201C;Comb_Perm&#x201D;), NF permeate (&#x201C;NF_Perm&#x201D;), filtrate from one of three parallel GAC filters (1&#x2013;3, grouped by brackets), or bulk water from one of three SDS (1&#x2013;3, grouped by brackets). Sample day (days since pilot plant start-up) is indicated following the underscore for each sample. Asterisks indicate samples taken from the three SDS after 24-h stagnation.</p></caption>
<graphic xlink:href="fmicb-10-00993-g003.tif"/>
</fig>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption><p>Bacterial community dissimilarity by location. Principle coordinate analysis (PCoA) of amplicon data based on Bray-Curtis dissimilarity. Locations within the advanced treatment train are indicated by color and shape and include secondary wastewater (&#x201C;WW_2ndary&#x201D;), chloramine treated (&#x201C;Chloramine&#x201D;), MF permeate in storage tank (&#x201C;MF&#x201D;), NF permeate and NF permeate in storage tank (&#x201C;NF-RO&#x201D;), filtrates from three parallel GAC filters (&#x201C;GAC&#x201D;) and bulk water from three SDS (&#x201C;SDS&#x201D;).</p></caption>
<graphic xlink:href="fmicb-10-00993-g004.tif"/>
</fig>
<p>After each treatment step, the taxonomy of the most prevalent ASVs changed, and overall membership also changed (<xref ref-type="fig" rid="F3">Figure 3B</xref>). One exception was that secondary wastewater communities appeared similar to some samples taken after chloramine treatment. This effect might be expected as ozonation and chloramination inactivated cells (<xref ref-type="fig" rid="F2">Figure 2A</xref>) but may not have significantly damaged their DNA. Relative abundances and phylogenetic classifications based on metagenomic data yielded a somewhat similar picture of the microbial community, except for among low-biomass samples. We used a read-based method (MetaPhlAn2) (<xref ref-type="bibr" rid="B56">Truong et al., 2015</xref>; <xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S4</xref>), unique RpsC sequences from assembled data (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S5</xref>), and non-redundant MAGs from across all samples (<xref ref-type="fig" rid="F6">Figure 6A</xref>) to assess the community composition and structure within metagenomes compared with amplicon data.</p>
<p>In the secondary wastewater before and after chloramine treatment, the dominant ASVs (as high as 57% relative abundance), most abundant RpsC sequence, and most abundant MAGs corresponded to the family Neisseriaceae. Meanwhile, MetaPhlAn2 identified Rhodocyclaceae as the most abundant Family in secondary wastewater (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S4</xref>). One post-chloramine sample and all MF filtrate samples were overwhelmingly dominated by a single ASV classified as Mycobacteriaceae. In the NF permeate sample, the most prevalent ASV was classified as Pseudonocardiaceae (61.0% relative abundance), while in the NF/RO permeate storage tank, an ASV classified as Methylobacteriaceae was highly abundant (92.1% relative abundance) (<xref ref-type="fig" rid="F3">Figure 3A</xref>). Abundant ASVs, metagenomic reads, RpsC sequences, and MAGs in the GAC filtrates were classified as Rhizobiales (including Bradyrhizobiaceae, Hyphomicrobiaceae, Methylocystaceae, and Methylobacteriaceae) (<xref ref-type="fig" rid="F3">Figure 3A</xref>, <xref ref-type="fig" rid="F6">6</xref> and <xref ref-type="supplementary-material" rid="SM1">Supplementary Figures S4</xref>, <xref ref-type="supplementary-material" rid="SM1">S5</xref>). The SDS communities were more variable over time than GAC communities, likely due to variation of the chlorine residual and a 24-h stagnation event just prior to the final sampling (day 227). In the two sample days prior to stagnation, SDS 1 and SDS 3 communities contained abundant ASVs, RpsCs, and MAGs classified as Burkholderiales/Comamonadaceae, Methylobacteriaceae, and Bradyrhizobiaceae. By amplicon sequencing, the community in SDS 2 prior to stagnation (SDS 2, day 205) appeared similar to the GAC filtrate (<xref ref-type="fig" rid="F3">Figure 3</xref>), but by all metagenomic analyses, the same sample appeared more similar to the other SDS samples prior to stagnation (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figures S4</xref>, <xref ref-type="supplementary-material" rid="SM1">S5</xref> and <xref ref-type="fig" rid="F6">Figure 6A</xref>). Based on our amplicon data with limited sampling, all three SDS communities were different before and after stagnation (<xref ref-type="fig" rid="F3">Figure 3A,B</xref>).</p>
<p>Seven ASVs found in negative controls and in samples from all locations were not removed during data decontamination because they were significantly enriched (adjusted <italic>p</italic> &#x003C; 0.005) in samples relative to controls. Two of these ASVs, classified as <italic>Bradyrhizobium</italic> and Hyphomicrobiaceae, were abundant in low biomass samples, especially GAC filtrates (33 and 22% median relative abundances) and some SDS samples. In metagenomic data, these two taxa were also highly abundant in GAC media and filtrate but not SDS (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S5</xref> and <xref ref-type="fig" rid="F6">Figure 6A</xref>).</p>
<p>In addition to bacterial genomes, several eukaryotic nuclear and plastid sequences were detected with metagenomics. Most notably, all three GAC media metagenomes contained identical sequences from a single Chlorophyta bin. Based on best BLAST hits of chloroplast 16S, 23S, and co-binned nuclear 28S rRNA genes to NCBI-nr, the genome was likely related to the Sphaeropleales. The chloroplast RpsC sequence from this bin was detected by read-mapping in all three GAC media metagenomes (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S5</xref>), and the presence of algae is consistent with observations of green growth on walls of the media columns. An additional Chlorophyta species with 100% identity to <italic>Parachlorella kessleri</italic> across both the chloroplast 16S rRNA gene and RpsC sequence (NCBI accessions FJ968741.1 and YP_003058311.1) was present in the metagenome of only GAC media from column 3 (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S5</xref>), and this column was designed to filter water 69% more slowly than the others (see <xref ref-type="supplementary-material" rid="SM1">Supplementary Methods</xref>).</p>
</sec>
<sec><title>Persistence of Bacteria Through Advanced Treatment</title>
<p>To examine potential persistence of bacteria from secondary wastewater and early treatment steps into post-NF/RO samples, we investigated the reoccurrence of specific ASVs, and used metagenomic read-mapping to assemblies, to unique RpsC-containing scaffolds, and to non-redundant MAGs. ASVs were defined as recurring if they were detected with &#x003E;50 reads in at least two samples from a given location. The 50-read cut-off was used as minimal protection against barcode bleed (also known as sample cross-talk) from Illumina sequencing (<xref ref-type="bibr" rid="B34">Mitra et al., 2015</xref>; <xref ref-type="bibr" rid="B16">Edgar, 2016</xref>). The criterion for recurrence in two samples was used to protect against one-off cross-contamination of samples or barcodes. There was no intersection between the recurring ASVs in the secondary wastewater or MF samples and ASVs found in the SDS (all three SDS treated as one location) (<xref ref-type="fig" rid="F5">Figure 5</xref>). A total of four ASVs were shared between pre-NF/RO samples (secondary wastewater and MF) and post-NF/RO samples (GAC, all three treated as one location) (<xref ref-type="fig" rid="F5">Figure 5</xref> and <xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S7</xref>). One of these four ASVs was the Neisseriaceae sequence highly abundant in secondary wastewater. This ASV was detected but not prevalent in samples after NF/RO, having a maximum of 160 reads or 1.3% relative abundance in any post-NF/RO sample (GAC column 3, day 184). Two other recurring ASVs, classified as Methylocystaceae and Bradyrhizobiaceae were most prevalent in GAC and SDS samples and were widespread in these samples. These ASVs were also detected before NF/RO with as many as 309 reads (1.5%) and 2168 reads (8.1%), respectively, in pre-NF/RO samples. The Bradyrhizobiaceae and Neisseriaceae ASVs were also present in negative controls, but were not removed by data decontamination (see section &#x201C;Materials and Methods&#x201D;). Lastly, the fourth ASV shared between pre- and post-NF/RO samples was classified as Comamonadaceae, and was relatively rare in all samples. This ASV constituted less than 0.44% relative abundance in any pre-NF/RO and less than 2.4% in any post-NF/RO sample (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S7</xref>).</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption><p>Recurring ASVs shared between different locations in the treatment train. Counts include only ASVs that appear in at least two samples from the same location with at least 50 reads. All parallel GAC samples and all SDS samples were included together as single &#x201C;locations&#x201D; for this analysis. Read counts and abundances of the four ASVs shared between secondary wastewater (&#x201C;WW_2ndary&#x201D;) and MF can be found in <xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S7</xref>.</p></caption>
<graphic xlink:href="fmicb-10-00993-g005.tif"/>
</fig>
<p>Metagenomic analyses with stringent read-mapping also revealed little to no overlap between secondary wastewater and post-NF/RO samples (GAC media, GAC filtrate, and SDS bulk water) (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S8</xref>). At most, 0.011% of metagenomic reads from any given secondary wastewater sample were stringently mapped to the assemblies of any post-NF/RO samples (=1 mismatch per read across scaffolds &#x2265;1 kbp). Likewise, fewer than 0.10% of reads from any post-NF/RO sample mapped to a secondary wastewater assembly (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S8</xref>). Based on read-mapping to scaffolds containing the marker gene RpsC, no overlap was detected between the secondary wastewater and the post-NF/RO (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S5</xref>). Mapping reads to 38 non-redundant MAGs gave the same result (<xref ref-type="fig" rid="F6">Figure 6A</xref>).</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption><p>Metagenome assembled genomes (MAGs) relative abundance, indices of replication, ARG contents, and selected C1-compound utilization. <bold>(A)</bold> Relative abundance of 38 MAGs (row) across the 12 samples (columns) based on stringent read mapping. Cell color indicates log-10 normalized coverage per 10 million reads, and white indicates non-detection (detection threshold required at least 1&#x00D7; coverage across &#x003E;5% of genome length). MAGs are named based on their placement in a concatenated ribosomal protein tree (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S6</xref>), with numerical identifiers added for uniqueness when necessary. Hierarchical clustering of MAGs (<italic>y</italic>-axis) is based on a distance matrix calculated from Spearman&#x2019;s rho. Sample names are based on location in treatment train, secondary wastewater (&#x201C;WW_2ndary&#x201D;), media from three parallel GAC columns (&#x201C;GAC_media&#x201D;), filtrate from three parallel GAC columns (GAC_filt), and bulk water from three parallel SDS (&#x201C;SDS&#x201D;). <bold>(B)</bold> iRep values for each MAG in each sample. Gray indicates no data where requirements for calculating iRep were not met. <bold>(C)</bold> Counts of ARGs by category based on ResFams. <bold>(D)</bold> Presence of selected pathways for C1-carbon utilization in each MAG, based on annotations of a subset of predicted proteins involved in those pathways. See <xref ref-type="supplementary-material" rid="SM4">Supplementary Table S5</xref> for data.</p></caption>
<graphic xlink:href="fmicb-10-00993-g006.tif"/>
</fig>
<p>As a group, GAC media samples shared the majority of unique RpsC sequences and MAGs with GAC filtrates. Individually, between 42 and 59% of GAC filtrate reads could be mapped to their respective GAC media assemblies (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S8</xref>). Between GAC filtrates and SDS samples, overall community structure differed; while some MAGs were shared, others were detected in only GAC filtrate or SDS (<xref ref-type="fig" rid="F5">Figure 5</xref>). Between these sample points, water underwent batch chlorination, storage, and simulated distribution, and there was a 10-fold drop in median intact cell counts (<xref ref-type="fig" rid="F2">Figure 2A</xref>). Thus, some of the ASVs shared between GAC filtrate and SDS may represent chlorine-inactivated cells whose DNA remained intact (<xref ref-type="fig" rid="F3">Figure 3B</xref>, <xref ref-type="fig" rid="F6">6A</xref> and <xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S5</xref>).</p>
</sec>
<sec><title>Potential Pathogens and Antibiotic Resistance Genes</title>
<p>To investigate whether the advanced treatment facility could be a source of opportunistic drinking water pathogens, we calculated the minimum, median, and maximum possible relative and absolute abundances of <italic>Legionella</italic> spp. and <italic>Mycobacterium</italic> spp. at each location in the treatment train (<xref ref-type="table" rid="T2">Table 2</xref>). Calculations were based on amplicon sequencing (including ASVs &#x003E;0.05% relative abundance in at least one sample) combined with total cell counts quantified via flow cytometry. The estimated maximum <italic>Legionella</italic> spp. in secondary wastewater and SDS was 4.1 &#x00D7; 10<sup>5</sup> cells &#x22C5; mL<sup>&#x2212;1</sup> and 1.6 &#x00D7; 10<sup>3</sup> cells &#x22C5; mL<sup>&#x2212;1</sup>, respectively (<xref ref-type="table" rid="T2">Table 2</xref> and <xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S9</xref>). Of the seven <italic>Legionella</italic> ASVs included in the calculations, four were from pre-NF/RO, two from post-NF/RO, and one shared between pre- and post-NF/RO samples (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S9</xref>). In the genus <italic>Mycobacterium</italic>, maximum possible concentrations in secondary wastewater and SDS were 1.7 &#x00D7; 10<sup>6</sup> cells &#x22C5; mL<sup>&#x2212;1</sup> and 3.7 &#x00D7; 10<sup>4</sup> cells &#x22C5; mL<sup>&#x2212;1</sup>, respectively (<xref ref-type="table" rid="T2">Table 2</xref>). Only one <italic>Mycobacterium</italic> sp. ASV was detected in both pre- and post-NF/RO samples. This was the same ASV that was highly abundant in MF filtrate samples and detected at very low abundances in every other location. The <italic>Legionella</italic> and <italic>Mycobacterium</italic> ASVs found in pre-and post-NF/RO were not identified in the analysis of persistent ASVs across the treatment train (see section &#x201C;Persistence of Bacteria Through Advanced Treatment&#x201D;) because they didn&#x2019;t meet the criteria for consideration. Thus, while they were detected, their presence in the data may be due to Illumina barcode bleed or sample cross-contamination. The reported maximum opportunistic pathogen concentrations were likely overestimated because they were calculated using maximum relative abundances and maximum cell counts ever observed at a given location across the duration of the study, regardless of whether these occurred at the same time. Furthermore, abundance calculations used total cell counts and thus also included non-viable cells (<xref ref-type="fig" rid="F2">Figure 2A</xref>) and assumed a single copy of the 16S rRNA gene per cell. <italic>Legionella</italic> spp. and <italic>Mycobacterium</italic> spp. have median 16S rRNA gene copy numbers of 3 and 1, respectively<sup><xref ref-type="fn" rid="fn06">6</xref></sup>.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Relative and absolute abundance (total cells per milliliter) estimates for genera containing opportunistic pathogens.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"></th>
<th valign="top" align="left"></th>
<th valign="top" align="center">WW_2ndary</th>
<th valign="top" align="center">Chloramine</th>
<th valign="top" align="center">MF</th>
<th valign="top" align="center">NF-RO</th>
<th valign="top" align="center">GAC</th>
<th valign="top" align="center">SDS</th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left"><bold>Total cell counts</bold></td>
<td valign="top" align="left">Min. cells/mL</td>
<td valign="top" align="center">6.8E+06</td>
<td valign="top" align="center">2.4E+06</td>
<td valign="top" align="center">1.4E+03</td>
<td valign="top" align="center">2.1E+01</td>
<td valign="top" align="center">4.6E+03</td>
<td valign="top" align="center">7.3E+01</td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left">Med. cells/mL</td>
<td valign="top" align="center">9.7E+06</td>
<td valign="top" align="center">7.6E+06</td>
<td valign="top" align="center">2.7E+03</td>
<td valign="top" align="center">1.2E+02</td>
<td valign="top" align="center">9.8E+03</td>
<td valign="top" align="center">5.6E+03</td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left">Max. cells/mL</td>
<td valign="top" align="center">7.1E+07</td>
<td valign="top" align="center">5.8E+07</td>
<td valign="top" align="center">5.9E+03</td>
<td valign="top" align="center">2.8E+03</td>
<td valign="top" align="center">2.1E+04</td>
<td valign="top" align="center">1.4E+05</td>
</tr>
<tr>
<td valign="top" align="left" colspan="8"><hr/></td>
</tr>
<tr>
<td valign="top" align="left"><bold>Legionella<sup>&#x2217;</sup></bold></td>
<td valign="top" align="left">Min. % abund</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left">Med. % abund</td>
<td valign="top" align="center">0.081</td>
<td valign="top" align="center">0.035</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0.16</td>
<td valign="top" align="center">0.3</td>
<td valign="top" align="center">0.036</td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left">Max. % abund</td>
<td valign="top" align="center">0.57</td>
<td valign="top" align="center">0.11</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0.32</td>
<td valign="top" align="center">1.3</td>
<td valign="top" align="center">1.1</td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left" colspan="7"><hr/></td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left">Min. cells/mL</td>
<td valign="top" align="center">0.0E+00</td>
<td valign="top" align="center">0.0E+00</td>
<td valign="top" align="center">0.0E+00</td>
<td valign="top" align="center">0.0E+00</td>
<td valign="top" align="center">0.0E+00</td>
<td valign="top" align="center">0.0E+00</td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left">Med. cells/mL</td>
<td valign="top" align="center">7.9E+03</td>
<td valign="top" align="center">2.6E+03</td>
<td valign="top" align="center">0.0E+00</td>
<td valign="top" align="center">1.9E-01</td>
<td valign="top" align="center">3.0E+01</td>
<td valign="top" align="center">2.0E+00</td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left">Max. cells/mL</td>
<td valign="top" align="center">4.1E+05</td>
<td valign="top" align="center">6.5E+04</td>
<td valign="top" align="center">0.0E+00</td>
<td valign="top" align="center">8.9E+00</td>
<td valign="top" align="center">2.7E+02</td>
<td valign="top" align="center">1.6E+03</td>
</tr>
<tr>
<td valign="top" align="left" colspan="8"><hr/></td>
</tr>
<tr>
<td valign="top" align="left"><bold>Mycobacterium<sup>&#x2217;</sup></bold></td>
<td valign="top" align="left">Min. % abund</td>
<td valign="top" align="center">0.27</td>
<td valign="top" align="center">0.38</td>
<td valign="top" align="center">68</td>
<td valign="top" align="center">0.029</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left">Med. % abund</td>
<td valign="top" align="center">0.88</td>
<td valign="top" align="center">1.9</td>
<td valign="top" align="center">93</td>
<td valign="top" align="center">0.044</td>
<td valign="top" align="center">0</td>
<td valign="top" align="center">0.074</td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left">Max. % abund</td>
<td valign="top" align="center">2.3</td>
<td valign="top" align="center">87</td>
<td valign="top" align="center">97</td>
<td valign="top" align="center">0.06</td>
<td valign="top" align="center">2.4</td>
<td valign="top" align="center">26</td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left" colspan="7"><hr/></td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left">Min. cells/mL</td>
<td valign="top" align="center">1.8E+04</td>
<td valign="top" align="center">8.9E+03</td>
<td valign="top" align="center">9.5E+02</td>
<td valign="top" align="center">6.2E-03</td>
<td valign="top" align="center">0.0E+00</td>
<td valign="top" align="center">0.0E+00</td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left">Med. cells/mL</td>
<td valign="top" align="center">8.5E+04</td>
<td valign="top" align="center">1.4E+05</td>
<td valign="top" align="center">2.5E+03</td>
<td valign="top" align="center">5.3E-02</td>
<td valign="top" align="center">0.0E+00</td>
<td valign="top" align="center">4.1E+00</td>
</tr>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="left">Max. cells/mL</td>
<td valign="top" align="center">1.7E+06</td>
<td valign="top" align="center">5.0E+07</td>
<td valign="top" align="center">5.7E+03</td>
<td valign="top" align="center">1.7E+00</td>
<td valign="top" align="center">5.2E+02</td>
<td valign="top" align="center">3.7E+04</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<attrib><italic><sup>&#x2217;</sup>Includes only ASVs present at &#x2265;0.05% relative abundance in at least one sample. Absolute abundance calculations assume 16S rRNA gene copy number of 1.</italic></attrib>
</table-wrap-foot>
</table-wrap>
<p>Mycobacteriaceae were also identified in metagenomic analyses, where they could be more accurately classified. A recent phylogenomic study split the genus <italic>Mycobacterium</italic> into five new genera, of which one, the <italic>Mycobacterium</italic> spp., contains the majority of slow-growing pathogens (<xref ref-type="bibr" rid="B21">Gupta et al., 2018</xref>) (not reflected in SILVA v128 used for amplicon analysis). We reconstructed two unique, near-complete Mycobacteriaceae MAGs, each detected by read-mapping in at least three samples. Using <xref ref-type="bibr" rid="B21">Gupta et al. (2018)</xref> as a starting point, we conducted a phylogenomic analysis, placing the two MAGs within the Mycobacteriaceae. One MAG was nearest neighbor to <italic>M. gordonae</italic> (within the slow-growing, pathogen-containing <italic>Tuberculosis-Simiae</italic> clade, genus <italic>Mycobacterium</italic>), and the other closest to <italic>M. smegmatis</italic> (within the rapid-growing <italic>Fortuitum-Vaccae</italic> clade, genus <italic>Mycolicibacterium</italic>) (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S10</xref>). Notably, the <italic>M. gordonae</italic>-related MAG was not detected by read-mapping in two of the SDS, and in the third SDS it was at low relative abundance (coverage of 0.24&#x00D7; per 10 million reads) (<xref ref-type="fig" rid="F6">Figure 6A</xref>).</p>
<p>In addition to opportunistic pathogens, we searched metagenomic assemblies for protein sequences from human adenoviruses and polyomaviruses (both dsDNA enteric viruses) that could indicate passage through the treatment train. Using custom built HMMs (see <xref ref-type="supplementary-material" rid="SM1">Supplementary Methods</xref>), no hits were identified for any of these proteins within any metagenome, including secondary wastewater samples. This result was not surprising, given that pore size of membranes used for sample collection (0.22 &#x03BC;m) was much larger than these viruses and the sample concentration method was not optimized for retention of viruses.</p>
<p>We examined whether ARGs from secondary wastewater had passed through advanced treatment or were sourced from bacteria within the treatment train. Putative ARGs were identified in assembled metagenomic data by searching against the ResFams database (<xref ref-type="bibr" rid="B20">Gibson et al., 2015</xref>). Of the 123 ARG families in ResFams, only 34 were found, and these were mechanistically classified as efflux pumps, enzymes catalyzing structural modifications to antibiotics, beta-lactamases, and regulatory proteins (<xref ref-type="fig" rid="F6">Figure 6C</xref> and <xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S11</xref>). Of the 1609 protein hits across 12 metagenomes, no putative ARG sequences from secondary wastewater samples clustered with sequences in post-NF/RO samples at &#x2265;99% amino acid identity. Putative ARGs were found at higher normalized coverages in GAC media, GAC filtrates, and SDS samples relative to wastewater, an effect primarily driven by ARG families associated with transport, and beta-lactamases of classes A and B (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S11</xref>). Many ARGs co-occurred in the same MAGs (<xref ref-type="fig" rid="F6">Figure 6C</xref>).</p>
</sec>
<sec><title>Microbial Growth in Advanced Treatment and Distribution</title>
<p>Given the increases in microbial cell counts after GAC filtration (<xref ref-type="fig" rid="F2">Figure 2A</xref>), we were interested in microbial growth in oligotrophic conditions after NF/RO. We estimated TOC that could have passed through a single GAC column over 205 days prior to sampling. With a maximum flow rate of 4,400 L per day through a GAC column, and the typical 50&#x2013;300 &#x03BC;g &#x22C5; L<sup>&#x2212;1</sup> of TOC after RO in advanced treatment trains (<xref ref-type="bibr" rid="B66">Zeng et al., 2016</xref>), we estimated the maximum possible amount of TOC adsorbed in one column was 45&#x2013;135 g. Chemical analyses have shown that aldehydes and other small carbon molecules are formed after ozonation of wastewater (<xref ref-type="bibr" rid="B63">Wert et al., 2007</xref>), and other studies have shown that small uncharged compounds may pass through RO membranes (<xref ref-type="bibr" rid="B3">Bellona et al., 2004</xref>). Other carbon sources for bacterial growth may have included carbon fixed during algal growth in the columns, and the GAC media itself.</p>
<p>Given these possible carbon sources, annotations of predicted proteins from the non-redundant MAGs were used to assess the capacity for C1-carbon compound utilization, including methanol and formaldehyde (<xref ref-type="fig" rid="F6">Figure 6D</xref>). Seven MAGs recovered from post-NF/RO samples encoded putative methanol or methanol/ethanol dehydrogenases. Additional MAGs encoded the capacity to convert formaldehyde to formate, which could then be a substrate for energy generation via formate dehydrogenase. Lastly, incorporation of C1-carbon compounds into biomass may occur via the serine cycle, which was encoded in some MAGs (<xref ref-type="fig" rid="F6">Figure 6D</xref>). It should be noted that the absence of specific pathways in MAGs does not necessarily indicate the inability of the corresponding organisms to grow on C1 compounds. These analyses were not exhaustive of all possible pathways, nor were all genomes complete.</p>
<p>Lastly, a proxy for bacterial growth rate, the index of replication (iRep), was calculated for each MAG in each sample. After filtering to remove MAG-by-sample combinations with low coverage, high genome fragmentation, and poorly fitted regressions (see <xref ref-type="bibr" rid="B6">Brown et al., 2016</xref>), iRep values were retained for 80 MAG-by-sample combinations (<xref ref-type="fig" rid="F6">Figure 6B</xref> and <xref ref-type="supplementary-material" rid="SM4">Supplementary Table S5</xref>). Many genomes had low iReps (1.1&#x2013;1.2) suggesting they were not actively replicating. The fastest replication rates (&#x003E;1.5) were observed for four genomes in the secondary wastewater samples (Rhodocyclaceae_3, Flavobacteriaceae_1, Proteobacterium_1, and Acidimicrobidae_1), two genomes in GAC media (Hyphomicrobium_1 and Bradyrhizobiaceae_1), one genome in GAC filtrate (Bradyrhizobiaceae_1), and one genome in an SDS (Curvibacter_1).</p>
</sec>
</sec>
<sec><title>Discussion</title>
<p>Based on amplicon sequencing and metagenomic analyses of microbial communities in a pilot advanced treatment facility for potable reuse, we found that: (1) bulk water bacterial diversity paralleled cell counts through treatment; (2) bacterial communities were significantly different by location in treatment; (3) few or no community members were shared between secondary wastewater and SDSs; (4) absolute abundances of potential opportunistic pathogens dropped substantially during treatment and neither opportunistic pathogens nor ARGs present in treated water were likely due to passage of bacteria through advanced treatment; and (5) microbial growth occurred after NF/RO and could have been due to metabolism of small carbon compounds.</p>
<p>Based on cell counts by flow cytometry and amplicon sequencing, advanced treated water at the pilot facility was not completely sterile, but its microbiota was significantly different from that of secondary wastewater. Changes in microbial communities across another advanced treatment train were also recently reported via a read-based metagenomic approach (<xref ref-type="bibr" rid="B54">Stamps et al., 2018</xref>). The present study and work by <xref ref-type="bibr" rid="B54">Stamps et al. (2018)</xref> complement a large body of work documenting the ability of advanced treatment to transform and removal chemical constituents as well as specific pathogen targets (enteric viruses and protozoan cysts) (<xref ref-type="bibr" rid="B57">Trussell et al., 2011</xref>). Together, these studies illustrate that the chemical and microbial identity of advanced treated water intended for potable reuse is fundamentally distinct from its wastewater source.</p>
<p>We looked for evidence of microbial persistence through treatment and found four ASVs detected before and after NF/RO. For all four, the ASV was differentially abundant either before or after NF/RO, meaning it was primarily removed through treatment or that it survived or grew in later treatment processes while other ASVs became less abundant. Alternative explanations besides persistence through treatment include Illumina barcode bleed and sample cross-contamination. In the future, taxa identified in this manner could be quantified from the original samples using qPCR, which would provide a means to rule out barcode bleed or contamination during library preparation. In a larger-scale study, separately sequenced technical replicates could be used to address cross-contamination. Some ASVs were detected only in post-NF/RO sampling locations (<xref ref-type="fig" rid="F5">Figure 5</xref>). Assuming they were not simply below the detection limit in pre-NF/RO samples or controls, these post-NF/RO sequences could represent organisms that grew in the oligotrophic, post-NF/RO environment, especially in GAC filter media. Sequences classified within the order Rhizobiales were well-represented in the GAC media and filtrate samples. Rhizobiales were previously identified in ultrapure water (ironically, in studies of sequencing contaminants) (<xref ref-type="bibr" rid="B26">Kulakov et al., 2002</xref>; <xref ref-type="bibr" rid="B1">Barton et al., 2006</xref>), and are suggested to be capable of oligotrophic metabolism (<xref ref-type="bibr" rid="B32">McAlister et al., 2002</xref>). The prevalence of Rhizobiales is also consistent with studies of GAC filter microbiota in laboratory-scale surface water treatment (<xref ref-type="bibr" rid="B58">Vignola et al., 2017</xref>) and in conventional drinking water treatment facilities (<xref ref-type="bibr" rid="B29">Lautenschlager et al., 2014</xref>; <xref ref-type="bibr" rid="B36">Oh et al., 2018</xref>).</p>
<p>The media samples harvested from the three different GAC columns shared community members (MAGs, RpsC sequences, and reads), likely due installation under the same environmental conditions (introduction of same GAC-colonizing organisms) and feeding with the same water. However, the media communities also exhibited differences from one another (<xref ref-type="fig" rid="F6">Figure 6</xref> and <xref ref-type="supplementary-material" rid="SM1">Supplementary Figures S5</xref>, <xref ref-type="supplementary-material" rid="SM1">S8</xref>), likely attributable to the different types of GAC and differing empty bed contact times. Based on read-mapping to assemblies, column 3, which had a greater empty bed contact time, shared less of its microbial community with columns 1 and 2 than those columns shared with each other (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S8</xref>). Column 3 was also the only column with a second unique algal species.</p>
<p>GAC media, GAC filtrates, and SDS shared some ASVs and MAGs (<xref ref-type="fig" rid="F5">Figure 5</xref>, <xref ref-type="fig" rid="F6">6A</xref>), consistent with findings from conventional drinking water systems in which &#x201C;leaky&#x201D; treatment filter colonizers seed filtrates (<xref ref-type="bibr" rid="B58">Vignola et al., 2017</xref>) and enter drinking water distribution systems (<xref ref-type="bibr" rid="B44">Pinto et al., 2012</xref>). The laboratory-scale batch chlorination applied to GAC filtrate in our study was likely much more aggressive than that of full-scale, continuous-flow treatment plants. This chlorination step is one likely reason for differences between GAC filtrate and SDS microbial communities, a difference that would have been more pronounced had we sequenced DNA from intact cells only (<xref ref-type="bibr" rid="B10">Chiao et al., 2014</xref>). SDS communities appeared to be affected by stagnation, which has been shown to change microbial communities in premise plumbing (<xref ref-type="bibr" rid="B30">Ling et al., 2018</xref>).</p>
<p>Our estimated concentration ranges of opportunistic pathogens (i.e., <italic>Legionella</italic> spp. and <italic>Mycobacterium</italic> spp.; <xref ref-type="table" rid="T2">Table 2</xref>) in GAC filtrate and SDS effluent are similar to previously reported values for simulated or full-scale drinking water systems (<xref ref-type="bibr" rid="B60">Wang et al., 2012</xref>, <xref ref-type="bibr" rid="B61">2013</xref>; <xref ref-type="bibr" rid="B59">Waak et al., 2018</xref>). <xref ref-type="bibr" rid="B60">Wang et al. (2012)</xref> observed average concentrations of 1.4 &#x00D7; 10<sup>4</sup> and 187 gene copies &#x22C5; mL<sup>&#x2212;1</sup> of <italic>Mycobacterium</italic> spp. and <italic>Legionella</italic> spp., respectively, in two chloraminated drinking water distribution systems. In a follow-up drinking water simulation using annular reactors, <xref ref-type="bibr" rid="B61">Wang et al. (2013)</xref> found <italic>Mycobacterium</italic> spp. and <italic>Legionella</italic> spp. concentrations to range from 10<sup>3.5</sup> to 10<sup>5</sup> and 10<sup>2</sup> to 10<sup>4.5</sup> gene copies &#x22C5; mL<sup>&#x2212;1</sup>. In the current study, the observed increase in <italic>Legionella</italic> spp. through GAC filtration is unsurprising, given previous reports on drinking water filters as potential harbors of opportunistic pathogens (<xref ref-type="bibr" rid="B64">Wullings and van der Kooij, 2006</xref>; <xref ref-type="bibr" rid="B58">Vignola et al., 2017</xref>). Finally, we note that not all organisms in the genera <italic>Legionella</italic> and <italic>Mycobacterium</italic> are pathogenic, pathogenicity cannot be inferred from amplicon data, and amplification of low-biomass samples may lead to biased data (<xref ref-type="bibr" rid="B24">Kennedy et al., 2014</xref>). In metagenomic data, extensive phylogenetic characterization was performed for Mycobacteriaceae genomes, but we did not convert this data to a metric of absolute abundance due to the fact that all 38 MAGs accounted for only a fraction of total reads (52&#x2013;55% for GAC media metagenomes).</p>
<p>In our study, no ARG sequences from secondary wastewater were found in post-NF/RO advanced treated water. Other studies have sought to use metagenomic sequencing and qPCR to trace ARGs from wastewater treatment into the environment or from raw water through drinking water treatment to potable water. <xref ref-type="bibr" rid="B65">Xi et al. (2009)</xref> observed 0.5&#x2013;3 log<sub>10</sub> removal of six ARGs across a conventional drinking water treatment plant, followed by a 0&#x2013;1 log<sub>10</sub> increase in ARG concentrations after distribution. Using metagenomics and qPCR, <xref ref-type="bibr" rid="B18">Garner et al. (2018)</xref> found no significant difference in ARG abundances before and after conventional drinking water treatment, but it is unclear if the exact same ARG sequences were present before and after and thus uncertain whether these ARGs passed through treatment. In another study, multiple ARGs and mobile genetic elements increased in relative abundance through chlorination, but subsequently decreased in relative abundance after pipeline distribution (<xref ref-type="bibr" rid="B52">Shi et al., 2013</xref>). Especially in studies of advanced treatment trains, deeper sequencing of the highly diverse secondary wastewater could capture more of the upstream microbial community and increase the likelihood of detecting any ARGs that reappear downstream. It should be noted that even when putative ARGs are predicted in sequence-based analyses, experimental validation is required to determine whether these genes confer antibiotic resistance. Here, the majority of ARGs identified in post-NF/RO metagenomes encode beta-lactamases and transporters, which could have biological functions besides or in addition to antibiotic resistance (e.g., catabolism of beta-lactamases) (<xref ref-type="bibr" rid="B11">Crofts et al., 2018</xref>).</p>
<p>A challenge of this study and of <xref ref-type="bibr" rid="B54">Stamps et al. (2018)</xref> was the low-biomass in samples taken after NF/RO. Prior to data decontamination, many of the low-biomass amplicon samples appeared similar to the negative controls (not shown), and several ASVs were shared between samples and controls. Although flow cytometry results for NF/RO permeate indicated cell counts above the limit of detection (<xref ref-type="fig" rid="F2">Figure 2A</xref>), only two samples from this location were successfully amplified and decontaminated. As the presence of reagent contaminants in amplicon data is directly dependent on input DNA concentration (<xref ref-type="bibr" rid="B49">Salter et al., 2014</xref>; <xref ref-type="bibr" rid="B12">de Goffau et al., 2018</xref>), filtration of larger sample volumes, as well as sequencing of more negative controls, may allow future studies to more comprehensively and conclusively profile microorganisms present in advanced treated water. Additionally, recovery of longer 16S rRNA gene sequences could provide greater resolution to distinguish between contaminants from reagents made with molecular-grade water and true sequences present in advanced treated water, which may be very similar but not identical. We did not perform metagenomic sequencing of negative controls due to low DNA concentrations in field and extraction blanks, but possible contaminants of metagenomes were several MAGs classified as <italic>Methylobacterium</italic>, <italic>Bradyrhizobium</italic>, and <italic>Melainabacteria</italic>. These MAGs were present at high coverages across all low-biomass samples (GAC media, GAC filtrate, and SDS) (<xref ref-type="fig" rid="F6">Figure 6A</xref>), and similar patterns for the same taxa were observed in amplicon sequencing data prior to decontamination.</p>
<p>We used multiple high-throughput sequencing and bioinformatic approaches for studying advanced treatment, which converged on similar results, and provided different types of information. Amplicon sequencing yielded greater depth than metagenomics, allowing detection of rarer bacterial populations. To complement this approach, assembly based metagenomics provided more accurate phylogenetic identification of abundant populations (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figures S6</xref>, <xref ref-type="supplementary-material" rid="SM1">S10</xref>), additional confidence about the presence/absence of specific populations represented by MAGs in a given sample, as well as predicted metabolisms and replication rates (<xref ref-type="fig" rid="F6">Figure 6</xref>). Amplicon and metagenomic sequencing also had different biases as observed in multiple analyses of the same mock community (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S1</xref>). Read-based metagenomics often disagreed with both amplicon and assembly based metagenomic data in terms of taxonomy (<xref ref-type="supplementary-material" rid="SM1">Supplementary Figure S4</xref>), which was not surprising, as it relied upon a fixed database. A strength of read-based analysis is that it includes all data, even reads from genomes that may not assemble well. However, read-mapping against a database using short reads cannot directly determine whether the exact same gene or genome sequence is present in multiple samples (such as across advanced treatment), and read mapping without <italic>de novo</italic> assembly provides limited information about novel genomes and genes from the environment. The utility of MAGs to address questions of microbial transfer across environments and into patients has been previously demonstrated (<xref ref-type="bibr" rid="B4">Brooks et al., 2017</xref>; <xref ref-type="bibr" rid="B38">Olm et al., 2019</xref>), and we predict that this high-resolution approach will be most useful in future studies of advanced water treatment, provided that higher-biomass sampling and deep sequencing can be achieved.</p>
<p>There are practical implications to the finding that new microbial growth occurs after advanced treatment (<xref ref-type="bibr" rid="B39">Park and Hu, 2010</xref>) and is not completely mitigated by disinfection or maintaining a disinfectant residual in the distribution network (<xref ref-type="bibr" rid="B35">Nescerecka et al., 2014</xref>). Such growth, including opportunistic pathogens and ARG-containing organisms, might be expected to occur where biofilms attach or nutrients accumulate (such as in GAC filters, chemical conditioning processes, decarbonation towers, or storage tanks). Thus, advanced treatment facility design should also consider the types of processes (and associated microorganisms) intentionally placed after NF/RO and AOP in treatment trains. Ultimately, a more comprehensive understanding of chemical and microbial water quality in advanced treatment will also aid in effective management of direct potable reuse distribution systems, considering corrosion, disinfectant by-products, and opportunistic pathogens.</p>
</sec>
<sec><title>Data Availability Statement</title>
<p>The datasets generated for this study can be found in the National Center for Biotechnology Information (<ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/">https://www.ncbi.nlm.nih.gov/</ext-link>) under Bioproject <ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="PRJNA490743">PRJNA490743</ext-link> with Sample IDs <ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="SAMN10056964">SAMN10056964</ext-link>&#x2013;<ext-link ext-link-type="DDBJ/EMBL/GenBank" xlink:href="SAMN10057130">SAMN10057130</ext-link>.</p>
</sec>
<sec><title>Author Contributions</title>
<p>SM and KN designed the study. SM performed the fieldwork. RK performed all bioinformatic analyses and wrote the manuscript with contributions from KN and SM. SM and RK jointly completed DNA extractions and amplicon library preparation.</p>
</sec>
<sec><title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<fn-group>
<fn fn-type="financial-disclosure">
<p><bold>Funding.</bold> Funding was provided by ReNUWIt NSF-ERC #1028968, and SM was supported by an Environmental Protection Agency Science to Achieve Results (EPA STAR) Graduate Fellowship. This work used the Vincent J. Coates Genomics Sequencing Laboratory at University of California, Berkeley, supported by NIH S10 OD018174 Instrumentation Grant.</p>
</fn>
</fn-group>
<ack>
<p>We are grateful to Prof. Roberto Rodriguez of the School of Public Health, University of Texas, and to the project partners, El Paso Water and Arcadis whose extensive support made this study possible. We thank Prof. Jill Banfield for computing resources and Emily Marron for helpful conversations about TOC in advanced treated water.</p>
</ack>
<sec sec-type="supplementary material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmicb.2019.00993/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmicb.2019.00993/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.PDF" id="SM1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_2.PDF" id="SM2" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_3.PDF" id="SM3" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_1.XLSX" id="SM4" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barton</surname> <given-names>H. A.</given-names></name> <name><surname>Taylor</surname> <given-names>N. M.</given-names></name> <name><surname>Lubbers</surname> <given-names>B. R.</given-names></name> <name><surname>Pemberton</surname> <given-names>A. C.</given-names></name></person-group> (<year>2006</year>). <article-title>DNA extraction from low-biomass carbonate rock: an improved method with reduced contamination and the low-biomass contaminant database.</article-title> <source><italic>J. Microbiol. Methods</italic></source> <volume>66</volume> <fpage>21</fpage>&#x2013;<lpage>31</lpage>. <pub-id pub-id-type="doi">10.1016/j.mimet.2005.10.005</pub-id> <pub-id pub-id-type="pmid">16305811</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bautista-de Los Santos</surname> <given-names>Q. M.</given-names></name> <name><surname>Schroeder</surname> <given-names>J. L.</given-names></name> <name><surname>Blakemore</surname> <given-names>O.</given-names></name> <name><surname>Moses</surname> <given-names>J.</given-names></name> <name><surname>Haffey</surname> <given-names>M.</given-names></name> <name><surname>Sloan</surname> <given-names>W.</given-names></name><etal/></person-group> (<year>2016</year>). <article-title>The impact of sampling, PCR, and sequencing replication on discerning changes in drinking water bacterial community over diurnal time-scales.</article-title> <source><italic>Water Res.</italic></source> <volume>90</volume> <fpage>216</fpage>&#x2013;<lpage>224</lpage>. <pub-id pub-id-type="doi">10.1016/j.watres.2015.12.010</pub-id> <pub-id pub-id-type="pmid">26734781</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bellona</surname> <given-names>C.</given-names></name> <name><surname>Drewes</surname> <given-names>J. E.</given-names></name> <name><surname>Xu</surname> <given-names>P.</given-names></name> <name><surname>Amy</surname> <given-names>G.</given-names></name></person-group> (<year>2004</year>). <article-title>Factors affecting the rejection of organic solutes during NF/RO treatment&#x2013;a literature review.</article-title> <source><italic>Water Res.</italic></source> <volume>38</volume> <fpage>2795</fpage>&#x2013;<lpage>2809</lpage>. <pub-id pub-id-type="doi">10.1016/j.watres.2004.03.034</pub-id> <pub-id pub-id-type="pmid">15223273</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brooks</surname> <given-names>B.</given-names></name> <name><surname>Olm</surname> <given-names>M. R.</given-names></name> <name><surname>Firek</surname> <given-names>B. A.</given-names></name> <name><surname>Baker</surname> <given-names>R.</given-names></name> <name><surname>Thomas</surname> <given-names>B. C.</given-names></name> <name><surname>Morowitz</surname> <given-names>M. J.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Strain-resolved analysis of hospital rooms and infants reveals overlap between the human and room microbiome.</article-title> <source><italic>Nat. Commun.</italic></source> <volume>8</volume>:<issue>1814</issue>. <pub-id pub-id-type="doi">10.1038/s41467-017-02018-w</pub-id> <pub-id pub-id-type="pmid">29180750</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>C. T.</given-names></name> <name><surname>Hug</surname> <given-names>L. A.</given-names></name> <name><surname>Thomas</surname> <given-names>B. C.</given-names></name> <name><surname>Sharon</surname> <given-names>I.</given-names></name> <name><surname>Castelle</surname> <given-names>C. J.</given-names></name> <name><surname>Singh</surname> <given-names>A.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Unusual biology across a group comprising more than 15% of domain acteria.</article-title> <source><italic>Nature</italic></source> <volume>523</volume> <fpage>208</fpage>&#x2013;<lpage>211</lpage>. <pub-id pub-id-type="doi">10.1038/nature14486</pub-id> <pub-id pub-id-type="pmid">26083755</pub-id></citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>C. T.</given-names></name> <name><surname>Olm</surname> <given-names>M. R.</given-names></name> <name><surname>Thomas</surname> <given-names>B. C.</given-names></name> <name><surname>Banfield</surname> <given-names>J. F.</given-names></name></person-group> (<year>2016</year>). <article-title>Measurement of bacterial replication rates in microbial communities.</article-title> <source><italic>Nat. biotechnol.</italic></source> <volume>34</volume> <fpage>1256</fpage>&#x2013;<lpage>1263</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.3704</pub-id> <pub-id pub-id-type="pmid">27819664</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><collab>California Code of Regulations</collab> (<year>2014</year>). <source><italic>22 CA ADC 60320.108. Pathogenic Microorganism Control.</italic></source> <publisher-loc>California</publisher-loc>: <publisher-name>California Code of Regulations</publisher-name>.</citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Callahan</surname> <given-names>B. J.</given-names></name> <name><surname>McMurdie</surname> <given-names>P. J.</given-names></name> <name><surname>Rosen</surname> <given-names>M. J.</given-names></name> <name><surname>Han</surname> <given-names>A. W.</given-names></name> <name><surname>Johnson</surname> <given-names>A. J. A.</given-names></name> <name><surname>Holmes</surname> <given-names>S. P.</given-names></name></person-group> (<year>2016</year>). <article-title>DADA2: High-resolution sample inference from Illumina amplicon data.</article-title> <source><italic>Nat. Methods</italic></source> <volume>13</volume> <fpage>581</fpage>&#x2013;<lpage>583</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.3869</pub-id> <pub-id pub-id-type="pmid">27214047</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Campbell</surname> <given-names>J. H.</given-names></name> <name><surname>O&#x2019;Donoghue</surname> <given-names>P.</given-names></name> <name><surname>Campbell</surname> <given-names>A. G.</given-names></name> <name><surname>Schwientek</surname> <given-names>P.</given-names></name> <name><surname>Sczyrba</surname> <given-names>A.</given-names></name> <name><surname>Woyke</surname> <given-names>T.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>UGA is an additional glycine codon in uncultured SR1 bacteria from the human microbiota.</article-title> <source><italic>Proc. Natl. Acad. Sci. U.S.A.</italic></source> <volume>110</volume> <fpage>5540</fpage>&#x2013;<lpage>5545</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1303090110</pub-id> <pub-id pub-id-type="pmid">23509275</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chiao</surname> <given-names>T.-H.</given-names></name> <name><surname>Clancy</surname> <given-names>T. M.</given-names></name> <name><surname>Pinto</surname> <given-names>A.</given-names></name> <name><surname>Xi</surname> <given-names>C.</given-names></name> <name><surname>Raskin</surname> <given-names>L.</given-names></name></person-group> (<year>2014</year>). <article-title>Differential resistance of drinking water bacterial populations to monochloramine disinfection.</article-title> <source><italic>Environ. Sci. Technol.</italic></source> <volume>48</volume> <fpage>4038</fpage>&#x2013;<lpage>4047</lpage>. <pub-id pub-id-type="doi">10.1021/es4055725</pub-id> <pub-id pub-id-type="pmid">24625288</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crofts</surname> <given-names>T. S.</given-names></name> <name><surname>Wang</surname> <given-names>B.</given-names></name> <name><surname>Spivak</surname> <given-names>A.</given-names></name> <name><surname>Gianoulis</surname> <given-names>T. A.</given-names></name> <name><surname>Forsberg</surname> <given-names>K. J.</given-names></name> <name><surname>Gibson</surname> <given-names>M. K.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Shared strategies for beta-lactam catabolism in the soil microbiome.</article-title> <source><italic>Nat. Chem. Biol.</italic></source> <volume>14</volume> <fpage>556</fpage>&#x2013;<lpage>564</lpage>. <pub-id pub-id-type="doi">10.1038/s41589-018-0052-1</pub-id> <pub-id pub-id-type="pmid">29713061</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Goffau</surname> <given-names>M. C.</given-names></name> <name><surname>Lager</surname> <given-names>S.</given-names></name> <name><surname>Salter</surname> <given-names>S. J.</given-names></name> <name><surname>Wagner</surname> <given-names>J.</given-names></name> <name><surname>Kronbichler</surname> <given-names>A.</given-names></name> <name><surname>Charnock-Jones</surname> <given-names>D. S.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Recognizing the reagent microbiome.</article-title> <source><italic>Nat. Microbiol.</italic></source> <volume>3</volume> <fpage>851</fpage>&#x2013;<lpage>853</lpage>. <pub-id pub-id-type="doi">10.1038/s41564-018-0202-y</pub-id> <pub-id pub-id-type="pmid">30046175</pub-id></citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eddy</surname> <given-names>S. R.</given-names></name></person-group> (<year>2009</year>). <article-title>A new generation of homology search tools based on probabilistic inference.</article-title> <source><italic>Genome Inform.</italic></source> <volume>23</volume> <fpage>205</fpage>&#x2013;<lpage>211</lpage>. <pub-id pub-id-type="pmid">20180275</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edgar</surname> <given-names>R. C.</given-names></name></person-group> (<year>2004</year>). <article-title>MUSCLE: multiple sequence alignment with high accuracy and high throughput.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>32</volume> <fpage>1792</fpage>&#x2013;<lpage>1797</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkh340</pub-id> <pub-id pub-id-type="pmid">15034147</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edgar</surname> <given-names>R. C.</given-names></name></person-group> (<year>2010</year>). <article-title>Search and clustering orders of magnitude faster than BLAST.</article-title> <source><italic>Bioinformatics</italic></source> <volume>26</volume> <fpage>2460</fpage>&#x2013;<lpage>2461</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btq461</pub-id> <pub-id pub-id-type="pmid">20709691</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edgar</surname> <given-names>R. C.</given-names></name></person-group> (<year>2016</year>). <article-title>UNCROSS: Filtering of high-frequency cross-talk in 16S amplicon reads.</article-title> <source><italic>bioRxiv</italic></source> [Preprint]. <pub-id pub-id-type="doi">10.1101/088666</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eren</surname> <given-names>A. M.</given-names></name> <name><surname>Esen</surname> <given-names>&#x00D6;C.</given-names></name> <name><surname>Quince</surname> <given-names>C.</given-names></name> <name><surname>Vineis</surname> <given-names>J. H.</given-names></name> <name><surname>Morrison</surname> <given-names>H. G.</given-names></name> <name><surname>Sogin</surname> <given-names>M. L.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Anvi&#x201C;o: an advanced analysis and visualization platform for &#x201D;omics data.</article-title> <source><italic>PeerJ</italic></source> <volume>3</volume>:<issue>e1319</issue>. <pub-id pub-id-type="doi">10.7717/peerj.1319</pub-id> <pub-id pub-id-type="pmid">26500826</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Garner</surname> <given-names>E.</given-names></name> <name><surname>Chen</surname> <given-names>C.</given-names></name> <name><surname>Xia</surname> <given-names>K.</given-names></name> <name><surname>Bowers</surname> <given-names>J.</given-names></name> <name><surname>Engelthaler</surname> <given-names>D. M.</given-names></name> <name><surname>McLain</surname> <given-names>J.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Metagenomic characterization of antibiotic resistance genes in full-scale reclaimed water distribution systems and corresponding potable systems.</article-title> <source><italic>Environ. Sci. Technol.</italic></source> <volume>52</volume> <fpage>6113</fpage>&#x2013;<lpage>6125</lpage>. <pub-id pub-id-type="doi">10.1021/acs.est.7b05419</pub-id> <pub-id pub-id-type="pmid">29741366</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gerrity</surname> <given-names>D.</given-names></name> <name><surname>Pecson</surname> <given-names>B.</given-names></name> <name><surname>Trussell</surname> <given-names>R. S.</given-names></name> <name><surname>Trussell</surname> <given-names>R. R.</given-names></name></person-group> (<year>2013</year>). <article-title>Potable reuse treatment trains throughout the world.</article-title> <source><italic>J. Water Supply Res. Technol.</italic></source> <volume>62</volume> <fpage>321</fpage>&#x2013;<lpage>338</lpage>. <pub-id pub-id-type="doi">10.2166/aqua.2013.041</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gibson</surname> <given-names>M. K.</given-names></name> <name><surname>Forsberg</surname> <given-names>K. J.</given-names></name> <name><surname>Dantas</surname> <given-names>G.</given-names></name></person-group> (<year>2015</year>). <article-title>Improved annotation of antibiotic resistance determinants reveals microbial resistomes cluster by ecology.</article-title> <source><italic>ISME J.</italic></source> <volume>9</volume> <fpage>207</fpage>&#x2013;<lpage>216</lpage>. <pub-id pub-id-type="doi">10.1038/ismej.2014.106</pub-id> <pub-id pub-id-type="pmid">25003965</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gupta</surname> <given-names>R. S.</given-names></name> <name><surname>Lo</surname> <given-names>B.</given-names></name> <name><surname>Son</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>Phylogenomics and comparative genomic studies robustly support division of the genus <italic>Mycobacterium</italic> into an emended genus <italic>Mycobacterium</italic> and four novel genera.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>9</volume>:<issue>67</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2018.00067</pub-id> <pub-id pub-id-type="pmid">31024493</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hyatt</surname> <given-names>D.</given-names></name> <name><surname>Chen</surname> <given-names>G.-L.</given-names></name> <name><surname>Locascio</surname> <given-names>P. F.</given-names></name> <name><surname>Land</surname> <given-names>M. L.</given-names></name> <name><surname>Larimer</surname> <given-names>F. W.</given-names></name> <name><surname>Hauser</surname> <given-names>L. J.</given-names></name></person-group> (<year>2010</year>). <article-title>Prodigal: prokaryotic gene recognition and translation initiation site identification.</article-title> <source><italic>BMC Bioinform.</italic></source> <volume>11</volume>:<issue>119</issue>. <pub-id pub-id-type="doi">10.1186/1471-2105-11-119</pub-id> <pub-id pub-id-type="pmid">20211023</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kanehisa</surname> <given-names>M.</given-names></name> <name><surname>Sato</surname> <given-names>Y.</given-names></name> <name><surname>Kawashima</surname> <given-names>M.</given-names></name> <name><surname>Furumichi</surname> <given-names>M.</given-names></name> <name><surname>Tanabe</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <article-title>KEGG as a reference resource for gene and protein annotation.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>44</volume> <fpage>D457</fpage>&#x2013;<lpage>D462</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkv1070</pub-id> <pub-id pub-id-type="pmid">26476454</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kennedy</surname> <given-names>K.</given-names></name> <name><surname>Hall</surname> <given-names>M. W.</given-names></name> <name><surname>Lynch</surname> <given-names>M. D. J.</given-names></name> <name><surname>Moreno-Hagelsieb</surname> <given-names>G.</given-names></name> <name><surname>Neufeld</surname> <given-names>J. D.</given-names></name></person-group> (<year>2014</year>). <article-title>Evaluating bias of illumina-based bacterial 16S rRNA gene profiles.</article-title> <source><italic>Appl. Environ. Microbiol.</italic></source> <volume>80</volume> <fpage>5717</fpage>&#x2013;<lpage>5722</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.01451-14</pub-id> <pub-id pub-id-type="pmid">25002428</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kozich</surname> <given-names>J. J.</given-names></name> <name><surname>Westcott</surname> <given-names>S. L.</given-names></name> <name><surname>Baxter</surname> <given-names>N. T.</given-names></name> <name><surname>Highlander</surname> <given-names>S. K.</given-names></name> <name><surname>Schloss</surname> <given-names>P. D.</given-names></name></person-group> (<year>2013</year>). <article-title>Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform.</article-title> <source><italic>Appl. Environ. Microbiol.</italic></source> <volume>79</volume> <fpage>5112</fpage>&#x2013;<lpage>5120</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.01043-13</pub-id> <pub-id pub-id-type="pmid">23793624</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kulakov</surname> <given-names>L. A.</given-names></name> <name><surname>McAlister</surname> <given-names>M. B.</given-names></name> <name><surname>Ogden</surname> <given-names>K. L.</given-names></name> <name><surname>Larkin</surname> <given-names>M. J.</given-names></name> <name><surname>O&#x2019;Hanlon</surname> <given-names>J. F.</given-names></name></person-group> (<year>2002</year>). <article-title>Analysis of bacteria contaminating ultrapure water in industrial systems.</article-title> <source><italic>Appl. Environ. Microbiol.</italic></source> <volume>68</volume> <fpage>1548</fpage>&#x2013;<lpage>1555</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.68.4.1548-1555.2002</pub-id> <pub-id pub-id-type="pmid">11916667</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Joshi</surname> <given-names>N. A.</given-names></name> <name><surname>Fass</surname> <given-names>J. N.</given-names></name></person-group> (<year>2011</year>). <source><italic>Sickle: A Sliding-Window, Adaptive, Quality-Based Trimming Tool for FastQ Files (Version 1.33).</italic></source> Available at: <ext-link ext-link-type="uri" xlink:href="https://github.com/najoshi/sickle">https://github.com/najoshi/sickle</ext-link></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lagesen</surname> <given-names>K.</given-names></name> <name><surname>Hallin</surname> <given-names>P.</given-names></name> <name><surname>Rodland</surname> <given-names>E. A.</given-names></name> <name><surname>Staerfeldt</surname> <given-names>H. H.</given-names></name> <name><surname>Rognes</surname> <given-names>T.</given-names></name> <name><surname>Ussery</surname> <given-names>D. W.</given-names></name></person-group> (<year>2007</year>). <article-title>RNAmmer: consistent and rapid annotation of ribosomal RNA genes.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>35</volume> <fpage>3100</fpage>&#x2013;<lpage>3108</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkm160</pub-id> <pub-id pub-id-type="pmid">17452365</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lautenschlager</surname> <given-names>K.</given-names></name> <name><surname>Hwang</surname> <given-names>C.</given-names></name> <name><surname>Ling</surname> <given-names>F.</given-names></name> <name><surname>Liu</surname> <given-names>W.-T.</given-names></name> <name><surname>Boon</surname> <given-names>N.</given-names></name> <name><surname>K&#x00F6;ster</surname> <given-names>O.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Abundance and composition of indigenous bacterial communities in a multi-step biofiltration-based drinking water treatment plant.</article-title> <source><italic>Water Res.</italic></source> <volume>62</volume> <fpage>40</fpage>&#x2013;<lpage>52</lpage>. <pub-id pub-id-type="doi">10.1016/j.watres.2014.05.035</pub-id> <pub-id pub-id-type="pmid">24937356</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ling</surname> <given-names>F.</given-names></name> <name><surname>Whitaker</surname> <given-names>R.</given-names></name> <name><surname>LeChevallier</surname> <given-names>M. W.</given-names></name> <name><surname>Liu</surname> <given-names>W.-T.</given-names></name></person-group> (<year>2018</year>). <article-title>Drinking water microbiome assembly induced by water stagnation.</article-title> <source><italic>ISME J.</italic></source> <volume>12</volume> <fpage>1520</fpage>&#x2013;<lpage>1531</lpage>. <pub-id pub-id-type="doi">10.1038/s41396-018-0101-5</pub-id> <pub-id pub-id-type="pmid">29588495</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>G.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>van der Mark</surname> <given-names>E.</given-names></name> <name><surname>Knezev</surname> <given-names>A.</given-names></name> <name><surname>Pinto</surname> <given-names>A.</given-names></name> <name><surname>van den Bogert</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Assessing the origin of bacteria in tap water and distribution system in an unchlorinated drinking water system by source tracker using microbial community fingerprints.</article-title> <source><italic>Water Res.</italic></source> <volume>138</volume> <fpage>86</fpage>&#x2013;<lpage>96</lpage>. <pub-id pub-id-type="doi">10.1016/j.watres.2018.03.043</pub-id> <pub-id pub-id-type="pmid">29573632</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McAlister</surname> <given-names>M. B.</given-names></name> <name><surname>Kulakov</surname> <given-names>L. A.</given-names></name> <name><surname>O&#x2019;Hanlon</surname> <given-names>J. F.</given-names></name> <name><surname>Larkin</surname> <given-names>M. J.</given-names></name> <name><surname>Ogden</surname> <given-names>K. L.</given-names></name></person-group> (<year>2002</year>). <article-title>Survival and nutritional requirements of three bacteria isolated from ultrapure water.</article-title> <source><italic>J. Ind. Microbiol. Biotechnol.</italic></source> <volume>29</volume> <fpage>75</fpage>&#x2013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.1038/sj.jim.7000273</pub-id> <pub-id pub-id-type="pmid">12161774</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McMurdie</surname> <given-names>P. J.</given-names></name> <name><surname>Holmes</surname> <given-names>S.</given-names></name></person-group> (<year>2013</year>). <article-title>phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data.</article-title> <source><italic>PLoS One</italic></source> <volume>8</volume>:<issue>e61217</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0061217</pub-id> <pub-id pub-id-type="pmid">23630581</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mitra</surname> <given-names>A.</given-names></name> <name><surname>Skrzypczak</surname> <given-names>M.</given-names></name> <name><surname>Ginalski</surname> <given-names>K.</given-names></name> <name><surname>Rowicka</surname> <given-names>M.</given-names></name></person-group> (<year>2015</year>). <article-title>Strategies for achieving high sequencing accuracy for low diversity samples and avoiding sample bleeding using illumina platform.</article-title> <source><italic>PLoS One</italic></source> <volume>10</volume>:<issue>e0120520</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0120520</pub-id> <pub-id pub-id-type="pmid">25860802</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nescerecka</surname> <given-names>A.</given-names></name> <name><surname>Rubulis</surname> <given-names>J.</given-names></name> <name><surname>Vital</surname> <given-names>M.</given-names></name> <name><surname>Juhna</surname> <given-names>T.</given-names></name> <name><surname>Hammes</surname> <given-names>F.</given-names></name></person-group> (<year>2014</year>). <article-title>Biological instability in a chlorinated drinking water distribution network.</article-title> <source><italic>PLoS One</italic></source> <volume>9</volume>:<issue>e96354</issue>. <pub-id pub-id-type="doi">10.1371/journal.pone.0096354</pub-id> <pub-id pub-id-type="pmid">24796923</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oh</surname> <given-names>S.</given-names></name> <name><surname>Hammes</surname> <given-names>F.</given-names></name> <name><surname>Liu</surname> <given-names>W.-T.</given-names></name></person-group> (<year>2018</year>). <article-title>Metagenomic characterization of biofilter microbial communities in a full-scale drinking water treatment plant.</article-title> <source><italic>Water Res.</italic></source> <volume>128</volume> <fpage>278</fpage>&#x2013;<lpage>285</lpage>. <pub-id pub-id-type="doi">10.1016/j.watres.2017.10.054</pub-id> <pub-id pub-id-type="pmid">29107912</pub-id></citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Olm</surname> <given-names>M. R.</given-names></name> <name><surname>Brown</surname> <given-names>C. T.</given-names></name> <name><surname>Brooks</surname> <given-names>B.</given-names></name> <name><surname>Banfield</surname> <given-names>J. F.</given-names></name></person-group> (<year>2017</year>). <article-title>dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication.</article-title> <source><italic>ISME J.</italic></source> <volume>11</volume> <fpage>2864</fpage>&#x2013;<lpage>2868</lpage>. <pub-id pub-id-type="doi">10.1038/ismej.2017.126</pub-id> <pub-id pub-id-type="pmid">28742071</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Olm</surname> <given-names>M. R.</given-names></name> <name><surname>West</surname> <given-names>P. T.</given-names></name> <name><surname>Brooks</surname> <given-names>B.</given-names></name> <name><surname>Firek</surname> <given-names>B. A.</given-names></name> <name><surname>Baker</surname> <given-names>R.</given-names></name> <name><surname>Morowitz</surname> <given-names>M. J.</given-names></name><etal/></person-group> (<year>2019</year>). <article-title>Genome-resolved metagenomics of eukaryotic populations during early colonization of premature infants and in hospital rooms.</article-title> <source><italic>Microbiome</italic></source> <volume>7</volume>:<issue>26</issue>. <pub-id pub-id-type="doi">10.1186/s40168-019-0638-1</pub-id> <pub-id pub-id-type="pmid">30770768</pub-id></citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Park</surname> <given-names>S.-K.</given-names></name> <name><surname>Hu</surname> <given-names>J. Y.</given-names></name></person-group> (<year>2010</year>). <article-title>Assessment of the extent of bacterial growth in reverse osmosis system for improving drinking water quality.</article-title> <source><italic>J. Environ. Sci. Health Part A</italic></source> <volume>45</volume> <fpage>968</fpage>&#x2013;<lpage>977</lpage>. <pub-id pub-id-type="doi">10.1080/10934521003772386</pub-id> <pub-id pub-id-type="pmid">20512722</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parks</surname> <given-names>D. H.</given-names></name> <name><surname>Imelfort</surname> <given-names>M.</given-names></name> <name><surname>Skennerton</surname> <given-names>C. T.</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P.</given-names></name> <name><surname>Tyson</surname> <given-names>G. W.</given-names></name></person-group> (<year>2015</year>). <article-title>CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.</article-title> <source><italic>Genome Res.</italic></source> <volume>25</volume> <fpage>1043</fpage>&#x2013;<lpage>1055</lpage>. <pub-id pub-id-type="doi">10.1101/gr.186072.114</pub-id> <pub-id pub-id-type="pmid">25977477</pub-id></citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pecson</surname> <given-names>B. M.</given-names></name> <name><surname>Triolo</surname> <given-names>S. C.</given-names></name> <name><surname>Olivieri</surname> <given-names>S.</given-names></name> <name><surname>Chen</surname> <given-names>E. C.</given-names></name> <name><surname>Pisarenko</surname> <given-names>A. N.</given-names></name> <name><surname>Yang</surname> <given-names>C.-C.</given-names></name><etal/></person-group> (<year>2017</year>). <article-title>Reliability of pathogen control in direct potable reuse: performance evaluation and QMRA of a full-scale 1 MGD advanced treatment train.</article-title> <source><italic>Water Res.</italic></source> <volume>122</volume> <fpage>258</fpage>&#x2013;<lpage>268</lpage>. <pub-id pub-id-type="doi">10.1016/j.watres.2017.06.014</pub-id> <pub-id pub-id-type="pmid">28609729</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peng</surname> <given-names>Y.</given-names></name> <name><surname>Leung</surname> <given-names>H. C. M.</given-names></name> <name><surname>Yiu</surname> <given-names>S. M.</given-names></name> <name><surname>Chin</surname> <given-names>F. Y. L.</given-names></name></person-group> (<year>2012</year>). <article-title>IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth.</article-title> <source><italic>Bioinformatics</italic></source> <volume>28</volume> <fpage>1420</fpage>&#x2013;<lpage>1428</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bts174</pub-id> <pub-id pub-id-type="pmid">22495754</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pinto</surname> <given-names>A. J.</given-names></name> <name><surname>Marcus</surname> <given-names>D. N.</given-names></name> <name><surname>Ijaz</surname> <given-names>U. Z.</given-names></name> <name><surname>Bautista-de lose Santos</surname> <given-names>Q. M.</given-names></name> <name><surname>Dick</surname> <given-names>G. J.</given-names></name> <name><surname>Raskin</surname> <given-names>L.</given-names></name></person-group> (<year>2016</year>). <article-title>Metagenomic evidence for the presence of comammox <italic>nitrospira</italic>-like bacteria in a drinking water system.</article-title> <source><italic>mSphere</italic></source> <volume>1</volume>:<issue>e00054</issue>-15. <pub-id pub-id-type="doi">10.1128/mSphere.00054-15</pub-id> <pub-id pub-id-type="pmid">27303675</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pinto</surname> <given-names>A. J.</given-names></name> <name><surname>Xi</surname> <given-names>C.</given-names></name> <name><surname>Raskin</surname> <given-names>L.</given-names></name></person-group> (<year>2012</year>). <article-title>Bacterial community structure in the drinking water microbiome is governed by filtration processes.</article-title> <source><italic>Environ. Sci. Technol.</italic></source> <volume>46</volume> <fpage>8851</fpage>&#x2013;<lpage>8859</lpage>. <pub-id pub-id-type="doi">10.1021/es302042t</pub-id> <pub-id pub-id-type="pmid">22793041</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Prest</surname> <given-names>E. I.</given-names></name> <name><surname>Hammes</surname> <given-names>F.</given-names></name> <name><surname>K&#x00F6;tzsch</surname> <given-names>S.</given-names></name> <name><surname>van Loosdrecht</surname> <given-names>M. C. M.</given-names></name> <name><surname>Vrouwenvelder</surname> <given-names>J. S.</given-names></name></person-group> (<year>2016a</year>). <article-title>A systematic approach for the assessment of bacterial growth-controlling factors linked to biological stability of drinking water in distribution systems.</article-title> <source><italic>Water Sci. Technol. Water Supply</italic></source> <volume>16</volume> <fpage>865</fpage>&#x2013;<lpage>880</lpage>. <pub-id pub-id-type="doi">10.2166/ws.2016.001</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Prest</surname> <given-names>E. I.</given-names></name> <name><surname>Hammes</surname> <given-names>F.</given-names></name> <name><surname>van Loosdrecht</surname> <given-names>M. C. M.</given-names></name> <name><surname>Vrouwenvelder</surname> <given-names>J. S.</given-names></name></person-group> (<year>2016b</year>). <article-title>Biological stability of drinking water: controlling factors, methods, and challenges.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>7</volume>:<issue>45</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2016.00045</pub-id> <pub-id pub-id-type="pmid">26870010</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quast</surname> <given-names>C.</given-names></name> <name><surname>Pruesse</surname> <given-names>E.</given-names></name> <name><surname>Yilmaz</surname> <given-names>P.</given-names></name> <name><surname>Gerken</surname> <given-names>J.</given-names></name> <name><surname>Schweer</surname> <given-names>T.</given-names></name> <name><surname>Yarza</surname> <given-names>P.</given-names></name><etal/></person-group> (<year>2013</year>). <article-title>The SILVA ribosomal RNA gene database project: improved data processing and web-based tools.</article-title> <source><italic>Nucleic Acids Res.</italic></source> <volume>41</volume> <fpage>D590</fpage>&#x2013;<lpage>D596</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gks1219</pub-id> <pub-id pub-id-type="pmid">23193283</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Raucher</surname> <given-names>R. S.</given-names></name> <name><surname>Tchobanoglous</surname> <given-names>G.</given-names></name></person-group> (<year>2014</year>). <source><italic>The Opportunities and Economics of Direct Potable Reuse.</italic></source> <publisher-loc>Arlington, VA</publisher-loc>: <publisher-name>WateReuse Research Foundation</publisher-name>.</citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Salter</surname> <given-names>S. J.</given-names></name> <name><surname>Cox</surname> <given-names>M. J.</given-names></name> <name><surname>Turek</surname> <given-names>E. M.</given-names></name> <name><surname>Calus</surname> <given-names>S. T.</given-names></name> <name><surname>Cookson</surname> <given-names>W. O.</given-names></name> <name><surname>Moffatt</surname> <given-names>M. F.</given-names></name><etal/></person-group> (<year>2014</year>). <article-title>Reagent and laboratory contamination can critically impact sequence-based microbiome analyses.</article-title> <source><italic>BMC Biol.</italic></source> <volume>12</volume>:<issue>87</issue>. <pub-id pub-id-type="doi">10.1186/s12915-014-0087-z</pub-id> <pub-id pub-id-type="pmid">25387460</pub-id></citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Salveson</surname> <given-names>A.</given-names></name> <name><surname>Sutherland</surname> <given-names>J.</given-names></name> <name><surname>Garvey</surname> <given-names>E.</given-names></name> <name><surname>Charbonnet</surname> <given-names>E.</given-names></name> <name><surname>Steinle-Darling</surname> <given-names>E.</given-names></name> <name><surname>Edwards</surname> <given-names>M.</given-names></name><etal/></person-group> (<year>2018</year>). <source><italic>Blending Requirements for Water From Direct Potable Reuse Treatment Facilities.</italic></source> <publisher-loc>Arlington, VA</publisher-loc>: <publisher-name>WateReuse Research Foundation</publisher-name>.</citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seemann</surname> <given-names>T.</given-names></name></person-group> (<year>2014</year>). <article-title>Prokka: rapid prokaryotic genome annotation.</article-title> <source><italic>Bioinformatics</italic></source> <volume>30</volume> <fpage>2068</fpage>&#x2013;<lpage>2069</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu153</pub-id> <pub-id pub-id-type="pmid">24642063</pub-id></citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shi</surname> <given-names>P.</given-names></name> <name><surname>Jia</surname> <given-names>S.</given-names></name> <name><surname>Zhang</surname> <given-names>X.-X.</given-names></name> <name><surname>Zhang</surname> <given-names>T.</given-names></name> <name><surname>Cheng</surname> <given-names>S.</given-names></name> <name><surname>Li</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <article-title>Metagenomic insights into chlorination effects on microbial antibiotic resistance in drinking water.</article-title> <source><italic>Water Res.</italic></source> <volume>47</volume> <fpage>111</fpage>&#x2013;<lpage>120</lpage>. <pub-id pub-id-type="doi">10.1016/j.watres.2012.09.046</pub-id> <pub-id pub-id-type="pmid">23084468</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stamatakis</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.</article-title> <source><italic>Bioinformatics</italic></source> <volume>30</volume> <fpage>1312</fpage>&#x2013;<lpage>1313</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btu033</pub-id> <pub-id pub-id-type="pmid">24451623</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stamps</surname> <given-names>B. W.</given-names></name> <name><surname>Leddy</surname> <given-names>M. B.</given-names></name> <name><surname>Plumlee</surname> <given-names>M. H.</given-names></name> <name><surname>Hasan</surname> <given-names>N. A.</given-names></name> <name><surname>Colwell</surname> <given-names>R. R.</given-names></name> <name><surname>Spear</surname> <given-names>J. R.</given-names></name></person-group> (<year>2018</year>). <article-title>Characterization of the microbiome at the world&#x2019;s largest potable water reuse facility.</article-title> <source><italic>Front. Microbiol.</italic></source> <volume>9</volume>:<issue>2435</issue>. <pub-id pub-id-type="doi">10.3389/fmicb.2018.02435</pub-id></citation></ref>
<ref id="B55"><citation citation-type="journal"><collab>State Water Resources Control Board</collab> (<year>2016</year>). <source><italic>Investigation on the Feasibility of Developing Uniform Water Recycling Criteria for Direct Potable Reuse.</italic></source> Available at: <ext-link ext-link-type="uri" xlink:href="https://www.waterboards.ca.gov/drinking_water/certlic/drinkingwater/documents/rw_dpr_criteria/final_report.pdf">https://www.waterboards.ca.gov/drinking_water/certlic/drinkingwater/documents/rw_dpr_criteria/final_report.pdf</ext-link> (accessed December 2016).</citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Truong</surname> <given-names>D. T.</given-names></name> <name><surname>Franzosa</surname> <given-names>E. A.</given-names></name> <name><surname>Tickle</surname> <given-names>T. L.</given-names></name> <name><surname>Scholz</surname> <given-names>M.</given-names></name> <name><surname>Weingart</surname> <given-names>G.</given-names></name> <name><surname>Pasolli</surname> <given-names>E.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>MetaPhlAn2 for enhanced metagenomic taxonomic profiling.</article-title> <source><italic>Nat. Methods</italic></source> <volume>12</volume> <fpage>902</fpage>&#x2013;<lpage>903</lpage>. <pub-id pub-id-type="doi">10.1038/nmeth.3589</pub-id> <pub-id pub-id-type="pmid">26418763</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Trussell</surname> <given-names>R. R.</given-names></name> <name><surname>Salveson</surname> <given-names>A.</given-names></name> <name><surname>Snyder</surname> <given-names>S. A.</given-names></name> <name><surname>Trussell</surname> <given-names>R. S.</given-names></name> <name><surname>Gerrity</surname> <given-names>D.</given-names></name> <name><surname>Pecson</surname> <given-names>B. M.</given-names></name></person-group> (<year>2011</year>). <source><italic>Potable Reuse: State of the Science Report and Equivalency Criteria for Treatment Trains.</italic></source> <publisher-loc>Arlington, VA</publisher-loc>: <publisher-name>WateReuse Research Foundation</publisher-name>, <fpage>1</fpage>&#x2013;<lpage>276</lpage>.</citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vignola</surname> <given-names>M.</given-names></name> <name><surname>Werner</surname> <given-names>D.</given-names></name> <name><surname>Wade</surname> <given-names>M. J.</given-names></name> <name><surname>Meynet</surname> <given-names>P.</given-names></name> <name><surname>Davenport</surname> <given-names>R. J.</given-names></name></person-group> (<year>2017</year>). <article-title>Medium shapes the microbial community of water filters with implications for effluent quality.</article-title> <source><italic>Water Res.</italic></source> <volume>129</volume> <fpage>499</fpage>&#x2013;<lpage>508</lpage>. <pub-id pub-id-type="doi">10.1016/j.watres.2017.09.042</pub-id> <pub-id pub-id-type="pmid">29195186</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Waak</surname> <given-names>M. B.</given-names></name> <name><surname>LaPara</surname> <given-names>T. M.</given-names></name> <name><surname>Hall&#x00E9;</surname> <given-names>C.</given-names></name> <name><surname>Hozalski</surname> <given-names>R. M.</given-names></name></person-group> (<year>2018</year>). <article-title>Occurrence of Legionella spp. in water-main biofilms from two drinking water distribution systems.</article-title> <source><italic>Environ. Sci. Technol.</italic></source> <volume>52</volume> <fpage>7630</fpage>&#x2013;<lpage>7639</lpage>. <pub-id pub-id-type="doi">10.1021/acs.est.8b01170</pub-id> <pub-id pub-id-type="pmid">29902377</pub-id></citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Edwards</surname> <given-names>M.</given-names></name> <name><surname>Falkinham</surname> <given-names>J. O.</given-names></name> <name><surname>Pruden</surname> <given-names>A.</given-names></name></person-group> (<year>2012</year>). <article-title>Molecular survey of the occurrence of <italic>Legionella</italic> spp., <italic>Mycobacterium</italic> spp., <italic>Pseudomonas aeruginosa</italic>, and amoeba hosts in two chloraminated drinking water distribution systems.</article-title> <source><italic>Appl. Environ. Microbiol.</italic></source> <volume>78</volume> <fpage>6285</fpage>&#x2013;<lpage>6294</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.01492-12</pub-id> <pub-id pub-id-type="pmid">22752174</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Pryor</surname> <given-names>M. A.</given-names></name> <name><surname>Edwards</surname> <given-names>M. A.</given-names></name> <name><surname>Falkinham</surname> <given-names>J. O.</given-names></name> <name><surname>Pruden</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <article-title>Effect of GAC pre-treatment and disinfectant on microbial community structure and opportunistic pathogen occurrence.</article-title> <source><italic>Water Res.</italic></source> <volume>47</volume> <fpage>5760</fpage>&#x2013;<lpage>5772</lpage>. <pub-id pub-id-type="doi">10.1016/j.watres.2013.06.052</pub-id> <pub-id pub-id-type="pmid">23906775</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Q.</given-names></name> <name><surname>Garrity</surname> <given-names>G. M.</given-names></name> <name><surname>Tiedje</surname> <given-names>J. M.</given-names></name> <name><surname>Cole</surname> <given-names>J. R.</given-names></name></person-group> (<year>2007</year>). <article-title>Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy.</article-title> <source><italic>Appl. Environ. Microbiol.</italic></source> <volume>73</volume> <fpage>5261</fpage>&#x2013;<lpage>5267</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.00062-07</pub-id> <pub-id pub-id-type="pmid">17586664</pub-id></citation></ref>
<ref id="B63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wert</surname> <given-names>E. C.</given-names></name> <name><surname>Rosario-Ortiz</surname> <given-names>F. L.</given-names></name> <name><surname>Drury</surname> <given-names>D. D.</given-names></name> <name><surname>Snyder</surname> <given-names>S. A.</given-names></name></person-group> (<year>2007</year>). <article-title>Formation of oxidation byproducts from ozonation of waste water.</article-title> <source><italic>Water Res.</italic></source> <volume>41</volume> <fpage>1481</fpage>&#x2013;<lpage>1490</lpage>. <pub-id pub-id-type="doi">10.1016/j.watres.2007.01.020</pub-id> <pub-id pub-id-type="pmid">17335867</pub-id></citation></ref>
<ref id="B64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wullings</surname> <given-names>B. A.</given-names></name> <name><surname>van der Kooij</surname> <given-names>D.</given-names></name></person-group> (<year>2006</year>). <article-title>Occurrence and genetic diversity of uncultured legionella spp. in drinking water treated at temperatures below 15 C.</article-title> <source><italic>Appl. Environ. Microbiol.</italic></source> <volume>72</volume> <fpage>157</fpage>&#x2013;<lpage>166</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.72.1.157-166.2006</pub-id> <pub-id pub-id-type="pmid">16391038</pub-id></citation></ref>
<ref id="B65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xi</surname> <given-names>C.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Marrs</surname> <given-names>C. F.</given-names></name> <name><surname>Ye</surname> <given-names>W.</given-names></name> <name><surname>Simon</surname> <given-names>C.</given-names></name> <name><surname>Foxman</surname> <given-names>B.</given-names></name><etal/></person-group> (<year>2009</year>). <article-title>Prevalence of antibiotic resistance in drinking water treatment and distribution systems.</article-title> <source><italic>Appl. Environ. Microbiol.</italic></source> <volume>75</volume> <fpage>5714</fpage>&#x2013;<lpage>5718</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.00382-09</pub-id> <pub-id pub-id-type="pmid">19581476</pub-id></citation></ref>
<ref id="B66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zeng</surname> <given-names>T.</given-names></name> <name><surname>Plewa</surname> <given-names>M. J.</given-names></name> <name><surname>Mitch</surname> <given-names>W. A.</given-names></name></person-group> (<year>2016</year>). <article-title>N-Nitrosamines and halogenated disinfection byproducts in US Full Advanced Treatment trains for potable reuse.</article-title> <source><italic>Water Res.</italic></source> <volume>101</volume> <fpage>176</fpage>&#x2013;<lpage>186</lpage>. <pub-id pub-id-type="doi">10.1016/j.watres.2016.03.062</pub-id> <pub-id pub-id-type="pmid">27262122</pub-id></citation></ref>
<ref id="B67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Oh</surname> <given-names>S.</given-names></name> <name><surname>Liu</surname> <given-names>W.-T.</given-names></name></person-group> (<year>2017</year>). <article-title>Impact of drinking water treatment and distribution on the microbiome continuum: an ecological disturbance&#x2019;s perspective.</article-title> <source><italic>Environ. Microbiol.</italic></source> <volume>19</volume> <fpage>3163</fpage>&#x2013;<lpage>3174</lpage>. <pub-id pub-id-type="doi">10.1111/1462-2920</pub-id> <pub-id pub-id-type="pmid">28654183</pub-id></citation></ref>
</ref-list>
<glossary>
<title>Abbreviations</title>
<def-list id="DL1">
<def-item>
<term>AOP</term>
<def>
<p>advanced oxidation process</p>
</def>
</def-item>
<def-item>
<term>ARG</term>
<def>
<p>antibiotic resistance gene</p>
</def>
</def-item>
<def-item>
<term>ASV</term>
<def>
<p>amplicon sequence variant</p>
</def>
</def-item>
<def-item>
<term>GAC</term>
<def>
<p>granular activated carbon</p>
</def>
</def-item>
<def-item>
<term>MAG</term>
<def>
<p>metagenome assembled genome</p>
</def>
</def-item>
<def-item>
<term>MF</term>
<def>
<p>microfiltration</p>
</def>
</def-item>
<def-item>
<term>NF</term>
<def>
<p>nanofiltration</p>
</def>
</def-item>
<def-item>
<term>RO</term>
<def>
<p>reverse osmosis</p>
</def>
</def-item>
<def-item>
<term>RpsC</term>
<def>
<p>ribosomal protein S3</p>
</def>
</def-item>
<def-item>
<term>SDS</term>
<def>
<p>simulated distribution system(s)</p>
</def>
</def-item>
<def-item>
<term>TOC</term>
<def>
<p>total organic carbon.</p>
</def>
</def-item>
</def-list>
</glossary>
<fn-group>
<fn id="fn01"><label>1</label><p><ext-link ext-link-type="uri" xlink:href="https://github.com/SchlossLab/MiSeq_WetLab_SOP/blob/master/MiSeq_WetLab_SOP.md">https://github.com/SchlossLab/MiSeq_WetLab_SOP/blob/master/MiSeq_WetLab_SOP.md</ext-link></p></fn>
<fn id="fn02"><label>2</label><p><ext-link ext-link-type="uri" xlink:href="https://www.bioinformatics.babraham.ac.uk/projects/fastqc/">https://www.bioinformatics.babraham.ac.uk/projects/fastqc/</ext-link></p></fn>
<fn id="fn03"><label>3</label><p><ext-link ext-link-type="uri" xlink:href="https://sourceforge.net/projects/bbmap/">https://sourceforge.net/projects/bbmap/</ext-link></p></fn>
<fn id="fn04"><label>4</label><p><ext-link ext-link-type="uri" xlink:href="https://github.com/christophertbrown/fix_assembly_errors">https://github.com/christophertbrown/fix_assembly_errors</ext-link></p></fn>
<fn id="fn05"><label>5</label><p><ext-link ext-link-type="uri" xlink:href="https://github.com/christophertbrown/iRep">https://github.com/christophertbrown/iRep</ext-link></p></fn>
<fn id="fn06"><label>6</label><p><ext-link ext-link-type="uri" xlink:href="https://rrndb.umms.med.umich.edu/">https://rrndb.umms.med.umich.edu/</ext-link> (accessed August 14, 2018)</p></fn>
</fn-group>
</back>
</article>