<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Microbiol.</journal-id>
<journal-title>Frontiers in Microbiology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Microbiol.</abbrev-journal-title>
<issn pub-type="epub">1664-302X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmicb.2021.764631</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Microbiology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Ancient Metabolisms of a Thermophilic Subseafloor Bacterium</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Smith</surname>
<given-names>Amy R.</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
<xref rid="aff3" ref-type="aff"><sup>3</sup></xref>
<xref rid="c001" ref-type="corresp"><sup>&#x002A;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/1238814/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Mueller</surname>
<given-names>Ryan</given-names>
</name>
<xref rid="aff3" ref-type="aff"><sup>3</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/519300/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Fisk</surname>
<given-names>Martin R.</given-names>
</name>
<xref rid="aff3" ref-type="aff"><sup>3</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/1477596/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Colwell</surname>
<given-names>Frederick S.</given-names>
</name>
<xref rid="aff3" ref-type="aff"><sup>3</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/31053/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Science, Mathematics, and Computing, Bard College at Simon&#x2019;s Rock</institution>, <addr-line>Great Barrington, MA</addr-line>, <country>United States</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Marine Chemistry and Geochemistry, Woods Hole Oceanographic Institution</institution>, <addr-line>Woods Hole, MA</addr-line>, <country>United States</country></aff>
<aff id="aff3"><sup>3</sup><institution>College of Earth, Ocean, and Atmospheric Sciences, Oregon State University</institution>, <addr-line>Corvallis, OR</addr-line>, <country>United States</country></aff>
<author-notes>
<fn id="fn1" fn-type="edited-by"><p>Edited by: Elizabeth Trembath-Reichert, Arizona State University, United States</p></fn>
<fn id="fn2" fn-type="edited-by"><p>Reviewed by: Mark Alexander Lever, ETH Z&#x00FC;rich, Switzerland; Peter R. Girguis, Harvard University, United States</p></fn>
<corresp id="c001">&#x002A;Correspondence: Amy R. Smith, <email>asmith@simons-rock.edu</email></corresp>
<fn id="fn3" fn-type="other"><p>This article was submitted to Extreme Microbiology, a section of the journal Frontiers in Microbiology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>01</day>
<month>12</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>12</volume>
<elocation-id>764631</elocation-id>
<history>
<date date-type="received">
<day>25</day>
<month>08</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>22</day>
<month>10</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2021 Smith, Mueller, Fisk and Colwell.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Smith, Mueller, Fisk and Colwell</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>The ancient origins of metabolism may be rooted deep in oceanic crust, and these early metabolisms may have persisted in the habitable thermal anoxic aquifer where conditions remain similar to those when they first appeared. The Wood&#x2013;Ljungdahl pathway for acetogenesis is a key early biosynthetic pathway with the potential to influence ocean chemistry and productivity, but its contemporary role in oceanic crust is not well established. Here, we describe the genome of a novel acetogen from a thermal suboceanic aquifer olivine biofilm in the basaltic crust of the Juan de Fuca Ridge (JdFR) whose genome suggests it may utilize an ancient chemosynthetic lifestyle. This organism encodes the genes for the complete canonical Wood&#x2013;Ljungdahl pathway, but is potentially unable to use sulfate and certain organic carbon sources such as lipids and carbohydrates to supplement its energy requirements, unlike other known acetogens. Instead, this organism may use peptides and amino acids for energy or as organic carbon sources. Additionally, genes involved in surface adhesion, the import of metallic cations found in Fe-bearing minerals, and use of molecular hydrogen, a product of serpentinization reactions between water and olivine, are prevalent within the genome. These adaptations are likely a reflection of local environmental micro-niches, where cells are adapted to life in biofilms using ancient chemosynthetic metabolisms dependent on H<sub>2</sub> and iron minerals. Since this organism is phylogenetically distinct from a related acetogenic group of Clostridiales, we propose it as a new species, <italic>Candidatus</italic> Acetocimmeria pyornia.</p>
</abstract>
<kwd-group>
<kwd>metabolism</kwd>
<kwd>carbon fixation</kwd>
<kwd>acetogenesis</kwd>
<kwd>bacteria</kwd>
<kwd>seafloor</kwd>
<kwd>hydrogen</kwd>
<kwd>amino acid</kwd>
<kwd>clostridia</kwd>
</kwd-group>
<contract-sponsor id="cn1">Alfred P. Sloan Foundation<named-content content-type="fundref-id">10.13039/100000879</named-content>
</contract-sponsor>
<counts>
<fig-count count="5"/>
<table-count count="4"/>
<equation-count count="0"/>
<ref-count count="51"/>
<page-count count="15"/>
<word-count count="10345"/>
</counts>
</article-meta>
</front>
<body>
<sec id="sec1" sec-type="intro">
<title>Introduction</title>
<p>The extrusive layer of igneous oceanic crust is one of Earth&#x2019;s largest microbial habitats, and, though estimates vary widely, this habitat could contain up to 75% of the total carbon biomass on Earth (~200Pg of carbon; Pg=10<sup>15</sup>g; <xref ref-type="bibr" rid="ref53">Whitman et al., 1998</xref>; <xref ref-type="bibr" rid="ref13">Heberling et al., 2010</xref>; <xref ref-type="bibr" rid="ref18">Kallmeyer et al., 2012</xref>). Water&#x2013;rock reactions common to the deep crust support active subsurface chemosynthetic microbial communities (<xref ref-type="bibr" rid="ref5">Edwards et al., 2011</xref>; <xref ref-type="bibr" rid="ref23">Lever et al., 2013</xref>) through the production of reduced compounds such as molecular hydrogen and labile organic carbon. These products are subsequently vented to the seafloor and may significantly increase secondary productivity and carbon cycling in the deep ocean (<xref ref-type="bibr" rid="ref27">Mason et al., 2009</xref>; <xref ref-type="bibr" rid="ref29">McCarthy et al., 2010</xref>). Due to the logistical difficulties of deep biosphere sampling and the lack of cultivation strategies for autochthonous microbes, little is known about the rates of chemosynthesis and the corresponding metabolic capabilities of organisms from this habitat. The Juan de Fuca Ridge (JdFR) flank and its permanent subsurface microbial observatories represent one of the few places in the world where access to these types of environments is available.</p>
<p>Recently, it was reported that the Wood&#x2013;Ljungdahl pathway for acetogenesis may be a significant chemosynthetic pathway within the oceanic crustal aquifer (<xref ref-type="bibr" rid="ref17">Jungbluth et al., 2017</xref>; <xref ref-type="bibr" rid="ref44">Smith et al., 2019</xref>). In deep oligotrophic environments like the JdFR aquifer (<xref ref-type="bibr" rid="ref54">Lin et al., 2012</xref>), annual photosynthetically derived carbon inputs are low; thus, chemosynthetic metabolisms such as acetogenesis are hypothesized to play an important role. Acetogenesis may have first appeared in the Earth&#x2019;s crust as one of the first carbon fixation pathways and remained the dominant form of primary production until the evolution of photosynthesis (<xref ref-type="bibr" rid="ref39">Ragsdale and Pierce, 2008</xref>; <xref ref-type="bibr" rid="ref31">Nitschke and Russell, 2013</xref>; <xref ref-type="bibr" rid="ref26">Martin, 2020</xref>). Thus, acetogenesis may remain a key metabolic strategy in deep crustal aquifers, where contemporary conditions can often mirror those of the primordial state on early Earth (hot reducing fluids reacting with igneous minerals in oceanic crust).</p>
<p>The Wood&#x2013;Ljungdahl pathway is a metabolically versatile pathway that can be used for biosynthesis by producing acetyl-CoA, or for energy conservation when completing the pathway and excreting acetate. Acetogens may use either mode of this metabolic pathway, switching between them depending on the needs of the organism (<xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>; <xref ref-type="bibr" rid="ref22">Lever, 2012</xref>). Additional energy could be gained from this pathway by utilizing sodium or proton pumps to create a proton motive force that drives subsequent ATP generation using ATP synthase (<xref ref-type="bibr" rid="ref41">Schuchmann and M&#x00FC;ller, 2014</xref>). This pathway has two branches, the carbonyl and the methyl branch, which both rely on molecular hydrogen (H<sub>2</sub>) to reduce and condense two molecules of CO<sub>2</sub> to form acetyl-CoA. Formate, CO, and methanol are also substrates that can be used. As a chemosynthetic pathway, the acetyl-CoA end-product is a central metabolic intermediate that can be used in multiple anabolic pathways, such as lipid, carbohydrate, and amino acid biosynthesis (<xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>; <xref ref-type="bibr" rid="ref38">Ragsdale, 2008</xref>).</p>
<p>Conditions in deep crustal aquifers are particularly favorable for organisms relying on the Wood&#x2013;Ljungdahl pathway for carbon fixation. Molecular hydrogen is the key reductant of this pathway, and its high abundance in deep crustal aquifers is a result of chemical reactions between water and iron-bearing minerals. Bicarbonate (HCO<sub>3</sub><sup>&#x2212;</sup>), the other key reactant of this pathway, is also known to be abundant in the oceanic crust (<xref ref-type="bibr" rid="ref47">Takai et al., 2004</xref>; <xref ref-type="bibr" rid="ref2">Brazelton et al., 2006</xref>; <xref ref-type="bibr" rid="ref50">Ver Eecke et al., 2012</xref>; <xref ref-type="bibr" rid="ref25">Lin et al., 2014</xref>). Although acetogenesis is less thermodynamically favorable than methanogenesis, which also uses H<sub>2</sub> and CO<sub>2</sub> (or HCO<sub>3</sub><sup>&#x2212;</sup>), acetogens have the advantage of requiring a smaller set of enzymes for biosynthesis and energy generation <italic>via</italic> the Wood&#x2013;Ljungdahl pathway. This lowers the overall energetic cost of metabolism and may allow acetogens to compete more effectively for resources with methanogens (<xref ref-type="bibr" rid="ref39">Ragsdale and Pierce, 2008</xref>; <xref ref-type="bibr" rid="ref22">Lever, 2012</xref>; <xref ref-type="bibr" rid="ref31">Nitschke and Russell, 2013</xref>).</p>
<p>Since the production of acetate yields little net energy, acetogens often utilize additional metabolisms such as sulfate reduction and heterotrophy to supplement their energy needs (<xref ref-type="bibr" rid="ref4">Drake et al., 1997</xref>; <xref ref-type="bibr" rid="ref39">Ragsdale and Pierce, 2008</xref>; <xref ref-type="bibr" rid="ref22">Lever, 2012</xref>; <xref ref-type="bibr" rid="ref49">Th&#x00F3;r Marteinsson et al., 2012</xref>; <xref ref-type="bibr" rid="ref41">Schuchmann and M&#x00FC;ller, 2014</xref>). <italic>Moorella thermoacetica</italic> (<xref ref-type="bibr" rid="ref9">Fontaine et al., 1942</xref>; <xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>), <italic>Candidatus</italic> Desulforudis audaxviator (<xref ref-type="bibr" rid="ref3">Chivian et al., 2008</xref>), and <italic>Desulfopertinax hafniense</italic> (<xref ref-type="bibr" rid="ref32">Nonaka et al., 2006</xref>) are three terrestrial acetogens who have had their complete genome sequenced, and each encodes additional energy-generating pathways uniquely suited to their needs. <italic>Moorella thermoacetica</italic> is a common acetogen that uses carbohydrates and a wide variety of other substrates for energy (<xref ref-type="bibr" rid="ref9">Fontaine et al., 1942</xref>; <xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>). <italic>Candidatus</italic> Desulforudis audaxviator is a deep subsurface microorganism that can use H<sub>2</sub> derived from radiolytic processes and is able to reduce sulfate and fix nitrogen (<xref ref-type="bibr" rid="ref3">Chivian et al., 2008</xref>). <italic>Desulfitobacterium hafniense</italic> is metabolically versatile and is able to use dehalogenation and a multitude of organic and inorganic electron acceptors and donors to generate energy (<xref ref-type="bibr" rid="ref32">Nonaka et al., 2006</xref>). These divergent strategies are likely dependent largely on the unique environmental conditions in which they thrive. In the marine subsurface, inorganic electron donors and acceptors are expected to comprise the primary redox couples, much like the deep terrestrial subsurface where <italic>Ca.</italic> D. audaxviator is known to live. However, peptides and amino acids have also been reported both in aquifer fluid and adsorbed to mineral surfaces, leaving open the possibility of organoheterotrophic processes in this environment (<xref ref-type="bibr" rid="ref24">Lin et al., 2015</xref>).</p>
<p>The following report compares the genome characteristics of a MAG obtained from an olivine biofilm in the deep subsurface of oceanic crust to the genomes of these three known acetogens from terrestrial environments. The purpose of this study was to determine the similarities and differences between deep marine and terrestrial acetogens to inform the adaptations required for life in a deep crustal aquifer. Based on the inferred ecology and metabolism of this organism, we propose this novel subseafloor acetogen as <italic>Candidatus</italic> Acetocimmeria pyornia (aceto=acetogen; Cimmeria=dweller of dark land beyond the ocean; pyornia, short for Pyornkrachzark, or &#x201C;Rock Biter,&#x201D; character from the 1984 film <italic>The NeverEnding Story</italic>).</p>
</sec>
<sec id="sec2" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="sec3">
<title><italic>In situ</italic> Colonization of Olivine in the JdFR Aquifer</title>
<p>Olivine and other igneous minerals and glasses were incubated in 3.5-mya oceanic crust for 4years to investigate the properties of surface-colonizing microbial communities from the JdFR aquifer. These ~2-mm-sized mineral grains were emplaced in flow cells at 275&#x2013;287m below seafloor (mbsf) in International Ocean Drilling Program Hole 1301A (47&#x00B0; 45.210' N, 127&#x00B0; 45.833' W; 2,667m below sea level) on the eastern flank of the JdFR (<xref ref-type="bibr" rid="ref7">Fisher et al., 2005</xref>; <xref ref-type="bibr" rid="ref45">Smith et al., 2011</xref>, <xref ref-type="bibr" rid="ref43">2016</xref>). Details of incubation and sample retrieval have been reported previously and are only briefly summarized here (<xref ref-type="bibr" rid="ref45">Smith et al., 2011</xref>, <xref ref-type="bibr" rid="ref43">2016</xref>). After the 4-year incubation in Hole 1301A, recovered minerals were frozen at &#x2212;40&#x00B0;C.</p>
</sec>
<sec id="sec4">
<title>Genomic DNA Extraction and Metagenome Sequencing</title>
<p>Extraction and amplification methods were published previously (<xref ref-type="bibr" rid="ref45">Smith et al., 2011</xref>, <xref ref-type="bibr" rid="ref43">2016</xref>). Briefly, genomic DNA was extracted from ~500mg olivine using a modified protocol for the FastDNA Spin Kit for Soil (MP Biomedicals Catalog #116560200) as described previously (<xref ref-type="bibr" rid="ref52">Wang and Edwards, 2009</xref>; <xref ref-type="bibr" rid="ref43">Smith et al., 2016</xref>). Genomic DNA extracted from the two most common olivine mineral phases (forsterite and Fo<sub>90</sub> olivine) was pooled to produce the olivine metagenome (a total of 50&#x2013;70ng of DNA). The olivine genomic DNA extract was sent to the Census of Deep Life sequencing facility at Marine Biological Laboratory in Woods Hole, MA, United States. Metagenomic sequencing was carried out on an Illumina Hi-Seq1000 instrument with 2&#x00D7;101bp paired-end sequencing and dedicated-read indexing. Sequence reads were de-multiplexed using the Consensus Assessment of Sequence and Variance (CASAVA) 1.8.2 (Illumina). A total of 84 million reads with average length of 108 bases were produced for the olivine metagenome, totaling 9.8 gigabases of sequence data.</p>
</sec>
<sec id="sec5">
<title>Sequence Quality Check, Assembly, and Binning</title>
<p>Sequence quality check, assembly, and binning protocols were reported previously (<xref ref-type="bibr" rid="ref44">Smith et al., 2019</xref>) and are summarized briefly again here. Raw sequence files were quality-filtered using String Graph Assembler (SGA)&#x2018;s preprocessing pipeline (<xref ref-type="bibr" rid="ref42">Simpson and Durbin, 2012</xref>; <xref ref-type="bibr" rid="ref44">Smith et al., 2019</xref>). High-quality mate pair sequence reads were assembled using the Iterative deBruin Graph Assembler&#x2014;Uneven Depth (IDBA-UD) program (<xref ref-type="bibr" rid="ref36">Peng et al., 2012</xref>) with the following specifications: minimum contig length of 450 bases and iterative kmer values from 45 to 69, stepwise by 4. The assembly with kmer length of 65 was chosen for use in analytical subsequent steps based on optimal values of n50 and maximum contig size.</p>
<p>VizBin (<xref ref-type="bibr" rid="ref21">Laczny et al., 2014</xref>), a Java-based genomic binning program, which uses nonlinear dimension reduction of genomic signatures (i.e., nucleotide frequency) to assign contigs to taxonomic bins, was used to separate contig groups from the full assembly into reduced genome bins based on sequence similarity (see <xref ref-type="bibr" rid="ref44">Smith et al., 2019</xref>). A total of 12 bins were detected in the olivine metagenome assembly, each of which was exported for manual bin reconstruction. The <italic>Ca. A. pyornia</italic> metagenome-assembled genome (MAG JdFRolivine-5) is described here. All genome MAGs from the olivine community were uploaded to the Metagenomics Analysis Server (MG-RAST) for initial analyses and comparison of genome features. CheckM (<xref ref-type="bibr" rid="ref35">Parks et al., 2015</xref>) was used to verify taxonomic purity (3.96% contamination) and completeness (91.58%) of MAGs using a total of 43 universal marker genes present in complete bacterial genomes. The genome is publicly available on NCBI&#x2019;s website with accession #SESX00000000, biosample SAMN10863484.</p>
</sec>
<sec id="sec6">
<title>Phylogenetic Tree Construction</title>
<p>A number of 16S rRNA genes were included within the JdFRolivine-5 MAG (subsequently defined as the <italic>Ca.</italic> A. pyornia MAG); therefore, we used several methods to determine phylogenetic relatedness of this genome&#x2019;s sequences to other references. First, we employed whole genome phylogenetic tree construction in The Department of Energy Systems Biology Knowledgebase (KBase; <xref ref-type="bibr" rid="ref1">Arkin et al., 2018</xref>) with Species Tree v.1.1.2. We used a set of 49 concatenated core universal marker genes to construct the genome tree. Input genomes are combined with closely related genomes from the RefSeq database from KBase, and a curated multiple sequence alignment of a subset of 49 COG domains is used to determine relatedness. The alignments are trimmed, concatenated, and then used to a construct phylogenetic tree with maximum likelihood phylogeny. We set a threshold of 20 nearby genomes to construct the tree and did not remove any genomes. We performed FastANI v.0.1.3 (<xref ref-type="bibr" rid="ref12">Goris et al., 2007</xref>; <xref ref-type="bibr" rid="ref14">Jain et al., 2018</xref>), or whole-genome Average Nucleotide Identity (ANI), in KBase to determine relatedness to these nearby genomes.</p>
<p>We also compiled phylogenetic results from PhyloPythia, CheckM, MG-RAST, Kyoto Encyclopedia of Genes and Genomes (KEGG), and the PyroTag sequencing of olivine community DNA (<xref ref-type="bibr" rid="ref43">Smith et al., 2016</xref>, <xref ref-type="bibr" rid="ref44">2019</xref>) to identify the relatedness of known species. In order to include more environmental sequences in our phylogenetic analysis, we chose the <italic>ffh</italic> gene sequence, another highly conserved, single copy phylogenetic marker gene (<xref ref-type="bibr" rid="ref46">Sunagawa et al., 2013</xref>) that can be used in lieu of the 16S rRNA gene, to construct a maximum likelihood tree based on the Tamura-Nei model in MEGA5 (<xref ref-type="bibr" rid="ref48">Tamura et al., 2011</xref>; <xref ref-type="supplementary-material" rid="SM4">Supplementary Figure 1</xref>). This tree also includes related sequences obtained from the same sample: JdFRolivine-7, JdFRolivine-9, and the organism reported here, <italic>Ca.</italic> Acetocimmeria pyornia (identified as MAG JdFRolivine-5; <xref ref-type="bibr" rid="ref44">Smith et al., 2019</xref>). The initial <italic>ffh</italic> trees were obtained through the Neighbor-Join (<xref ref-type="bibr" rid="ref40">Saitou and Nei, 1987</xref>) and BioNJ (<xref ref-type="bibr" rid="ref11">Gascuel, 1997</xref>) algorithms. These algorithms were applied to a matrix of pairwise distances that were estimated by maximum composite likelihood. Positions that represented gaps or missing data were eliminated from the analysis. A total of 264 positions were used to construct the phylogenetic tree.</p>
</sec>
<sec id="sec7">
<title>Genome Annotation and Metabolic Pathway Reconstruction</title>
<p>Gene sequences of MAGs obtained using Prodigal v2.6.0 were uploaded to the KEGG and annotated using BlastKOALA (<xref ref-type="bibr" rid="ref19">Kanehisa et al., 2016</xref>). The reconstructed metabolism for the <italic>Ca.</italic> A. pyornia MAG was used to assess the potential for carbon fixation (<xref rid="tab1" ref-type="table">Table 1</xref>), determine likely energy sources, and explore mineral or surface-based metabolic properties that may yield clues as to how this organism survives in JdFR aquifer biofilms. The resulting metabolic pathway reconstruction was compared against KEGG genome annotations of closely related acetogens <italic>Ca.</italic> Desulforudis audaxviator (<xref ref-type="bibr" rid="ref3">Chivian et al., 2008</xref>), <italic>Moorella thermoacetica</italic> (<xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>), and <italic>Desulfitobacterium hafniense</italic> (<xref ref-type="bibr" rid="ref32">Nonaka et al., 2006</xref>; <xref rid="tab2" ref-type="table">Tables 2</xref>&#x2013;<xref rid="tab4" ref-type="table">4</xref>; <xref ref-type="supplementary-material" rid="SM1">Supplementary Tables 1</xref>&#x2013;<xref ref-type="supplementary-material" rid="SM3">3</xref>).</p>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption><p>Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations of contig-65_203 containing the <italic>acs</italic> gene cluster of the Wood&#x2013;Ljungdahl pathway for carbon fixation.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Gene ID for Contig-65_203</th>
<th align="left" valign="top">Gene definition</th>
<th align="left" valign="top">KO best match</th>
<th align="left" valign="top">Second best match</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle">contig-65_203_1</td>
<td align="left" valign="middle"><italic>Unassigned</italic></td>
<td align="left" valign="middle"><italic>None</italic></td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_2</td>
<td align="left" valign="middle"><italic>Unassigned</italic></td>
<td align="left" valign="middle"><italic>None</italic></td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_3</td>
<td align="left" valign="middle">cooS; carbon-monoxide dehydrogenase catalytic subunit [EC:1.2.7.4]</td>
<td align="left" valign="middle">K00198</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_4</td>
<td align="left" valign="middle">acsB; acetyl-CoA synthase [EC:2.3.1.169]</td>
<td align="left" valign="middle">K14138</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_5</td>
<td align="left" valign="middle">cdhE; acetyl-CoA decarbonylase/synthase complex subunit gamma [EC:2.1.1.245]</td>
<td align="left" valign="middle">K00197</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_6</td>
<td align="left" valign="middle"><italic>Unassigned</italic></td>
<td align="left" valign="middle"><italic>None</italic></td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_7</td>
<td align="left" valign="middle">cooC; CO dehydrogenase maturation factor</td>
<td align="left" valign="middle">K07321</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_8</td>
<td align="left" valign="middle">cdhD; acetyl-CoA decarbonylase/synthase complex subunit delta [EC:2.1.1.245]</td>
<td align="left" valign="middle">K00194</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_9</td>
<td align="left" valign="middle">acsE; 5-methyltetrahydrofolate corrinoid/iron sulfur protein methyltransferase [EC:2.1.1.258]</td>
<td align="left" valign="middle">K15023</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_10</td>
<td align="left" valign="middle">hdrA; heterodisulfide reductase subunit A [EC:1.8.98.1]</td>
<td align="left" valign="middle">K03388</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_11</td>
<td align="left" valign="middle"><italic>Unassigned</italic></td>
<td align="left" valign="middle"><italic>None</italic></td>
<td align="left" valign="middle">K14127</td>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_12</td>
<td align="left" valign="middle"><italic>Unassigned</italic></td>
<td align="left" valign="middle"><italic>None</italic></td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_13</td>
<td align="left" valign="middle">metF; methylenetetrahydrofolate reductase (NADPH) [EC:1.5.1.20]</td>
<td align="left" valign="middle">K00297</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_14</td>
<td align="left" valign="middle"><italic>Unassigned</italic></td>
<td align="left" valign="middle"><italic>None</italic></td>
<td align="left" valign="middle">K17836</td>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_15</td>
<td align="left" valign="middle">E4.2.1.2AA; fumarate hydratase subunit alpha [EC:4.2.1.2]</td>
<td align="left" valign="middle">K01677</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_16</td>
<td align="left" valign="middle">E4.2.1.2AB; fumarate hydratase subunit beta [EC:4.2.1.2]</td>
<td align="left" valign="middle">K01678</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_17</td>
<td align="left" valign="middle">ME2; malate dehydrogenase (oxaloacetate-decarboxylating) [EC:1.1.1.38]</td>
<td align="left" valign="middle">K00027</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_18</td>
<td align="left" valign="middle">sdhC; succinate dehydrogenase/fumarate reductase, cytochrome b subunit</td>
<td align="left" valign="middle">K00241</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_19</td>
<td align="left" valign="middle">sdhD; succinate dehydrogenase/fumarate reductase, membrane anchor subunit</td>
<td align="left" valign="middle">K00242</td>
<td align="left" valign="middle">K00241</td>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_20</td>
<td align="left" valign="middle">sdhA; succinate dehydrogenase/fumarate reductase, flavoprotein subunit [EC:1.3.5.1 1.3.5.4]</td>
<td align="left" valign="middle">K00239</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_21</td>
<td align="left" valign="middle">sdhB; succinate dehydrogenase/fumarate reductase, iron-sulfur subunit [EC:1.3.5.1 1.3.5.4]</td>
<td align="left" valign="middle">K00240</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_22</td>
<td align="left" valign="middle">sucC; succinyl-CoA synthetase beta subunit [EC:6.2.1.5]</td>
<td align="left" valign="middle">K01903</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_23</td>
<td align="left" valign="middle">sucD; succinyl-CoA synthetase alpha subunit [EC:6.2.1.5]</td>
<td align="left" valign="middle">K01902</td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_24</td>
<td align="left" valign="middle"><italic>Unassigned</italic></td>
<td align="left" valign="middle"><italic>None</italic></td>
<td align="left" valign="middle">K15876</td>
</tr>
<tr>
<td align="left" valign="middle">contig-65_203_25</td>
<td align="left" valign="middle"><italic>Unassigned</italic></td>
<td align="left" valign="middle"><italic>None</italic></td>
<td align="left" valign="middle">K01181</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Contig-65_203 contains 25 genes, here numbered contig-65_203_1 &#x2013; contig-65_203_25. Some genes do not have matching orthologous proteins in the KEGG database, and the gene or function is therefore unassigned. Some of these matches do have &#x201C;second best&#x201D; matches, which may be used to infer the closest function. KO, KEGG Orthology.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption><p>Complete pathways of carbon fixation and energy metabolism in <italic>Candidatus</italic> A. pyornia (<italic>Ca.</italic> Apy) and three closely related acetogens as determined by the KEGG pathway module.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">KEGG Gene</th>
<th align="left" valign="top">Discrete pathway</th>
<th align="left" valign="top" colspan="4">Organism</th>
</tr>
<tr>
<th align="left" valign="top" colspan="2">Energy metabolism</th>
<th align="left" valign="top">Mta</th>
<th align="left" valign="top">Dau</th>
<th align="left" valign="top">Dsy</th>
<th align="left" valign="top"><italic>Ca.</italic> Apy</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle" colspan="6">Carbon fixation</td>
</tr>
<tr>
<td align="left" valign="middle">M00377</td>
<td align="left" valign="bottom">Reductive acetyl-CoA pathway (Wood&#x2013;Ljungdahl pathway)</td>
<td align="left" valign="middle">(+)</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00579</td>
<td align="left" valign="bottom">Phosphate acetyltransferase-acetate kinase pathway, acetyl-CoA&#x2265;acetate</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Nitrogen metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00175</td>
<td align="left" valign="bottom">Nitrogen fixation, nitrogen&#x2265;ammonia</td>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">M00530</td>
<td align="left" valign="bottom">Dissimilatory nitrate reduction, nitrate&#x2265;ammonia</td>
<td/>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Methane metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00422</td>
<td align="left" valign="bottom">Acetyl-CoA pathway, CO<sub>2</sub>&#x2265;acetyl-CoA</td>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Sulfur metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00596</td>
<td align="left" valign="bottom">Dissimilatory sulfate reduction, sulfate&#x2265;H<sub>2</sub>S</td>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Closely related known acetogens: Mta, <italic>Moorella thermoacetica</italic>, Dau, <italic>Ca. Desulforudis audaxviator</italic>, and Dsy, <italic>Desulfitobacterium hafniense</italic>. +, complete pathway, (+), known complete pathways not identified through the KEGG module. (&#x2212;), near-complete, with at least one gene.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="tab3">
<label>Table 3</label>
<caption><p>Complete pathways of carbohydrate and lipid metabolism in <italic>Ca.</italic> A. pyornia (<italic>Ca.</italic> Apy) and two closely related acetogens as determined by the KEGG pathway module.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">KEGG Gene</th>
<th align="left" valign="top">Discrete pathway</th>
<th align="left" valign="top" colspan="4">Organism</th>
</tr>
<tr>
<th align="left" valign="top" colspan="2">Energy metabolism</th>
<th align="left" valign="top">Mta</th>
<th align="left" valign="top">Dau</th>
<th align="left" valign="top">Dsy</th>
<th align="left" valign="top"><italic>Ca.</italic> Apy</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle" colspan="6">Carbohydrate and lipid metabolism</td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Central carbohydrate metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00001</td>
<td align="left" valign="bottom">Glycolysis (Embden&#x2013;Meyerhof pathway), glucose&#x2265;pyruvate</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00002</td>
<td align="left" valign="bottom">Glycolysis, core module involving three-carbon compounds</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00003</td>
<td align="left" valign="bottom">Gluconeogenesis, oxaloacetate&#x2265;fructose-6P</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00010</td>
<td align="left" valign="bottom">Citrate cycle, first carbon oxidation, oxaloacetate&#x2265;2-oxoglutarate</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
<td/>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00307</td>
<td align="left" valign="bottom">Pyruvate oxidation, pyruvate&#x2265;acetyl-CoA</td>
<td/>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">M00007</td>
<td align="left" valign="bottom">Pentose phosphate pathway, non-oxidative phase, fructose 6P&#x2265;ribose 5P</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00308</td>
<td align="left" valign="bottom">Semi-phosphorylative Entner&#x2013;Doudoroff pathway, gluconate&#x2265;glycerate-3P</td>
<td/>
<td/>
<td/>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00005</td>
<td align="left" valign="bottom">PRPP biosynthesis, ribose 5P&#x2265;PRPP</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Other carbohydrate metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00741</td>
<td align="left" valign="middle">Propanoyl-CoA metabolism, propanoyl-CoA&#x2265;succinyl-CoA</td>
<td/>
<td/>
<td/>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00549</td>
<td align="left" valign="middle">Nucleotide sugar biosynthesis, glucose&#x2265;UDP-glucose</td>
<td/>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Fatty acid metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00082</td>
<td align="left" valign="bottom">Fatty acid biosynthesis, initiation</td>
<td/>
<td/>
<td/>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00083</td>
<td align="left" valign="bottom">Fatty acid biosynthesis, elongation</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00086</td>
<td align="left" valign="bottom">beta-Oxidation, acyl-CoA synthesis</td>
<td/>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Lipid metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00093</td>
<td align="left" valign="bottom">Phosphatidylethanolamine (PE) biosynthesis, PA&#x2265;PS&#x2265;PE</td>
<td/>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Terpenoid backbone biosynthesis</td>
</tr>
<tr>
<td align="left" valign="middle">M00096</td>
<td align="left" valign="bottom">C5 isoprenoid biosynthesis, non-mevalonate pathway</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00364</td>
<td align="left" valign="bottom">C10-C20 isoprenoid biosynthesis, bacteria</td>
<td/>
<td/>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Closely related known acetogens: Mta, <italic>Moorella thermoacetica</italic>, Dau, <italic>Ca. Desulforudis audaxviator</italic>, and Dsy, <italic>Desulfitobacterium hafniense</italic>. +, complete pathway, (+), known complete pathways not identified through the KEGG module. (&#x2212;), near-complete, with at least one gene.</p>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="tab4">
<label>Table 4</label>
<caption><p>Complete pathways of nucleotide and amino acid metabolism in <italic>Ca.</italic> A. pyornia (<italic>Ca.</italic> Apy) and two closely related acetogens as determined by the KEGG pathway module.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">KEGG Gene</th>
<th align="left" valign="top">Discrete pathway</th>
<th align="left" valign="top" colspan="4">Organism</th>
</tr>
<tr>
<th align="left" valign="top" colspan="2">Energy metabolism</th>
<th align="left" valign="top">Mta</th>
<th align="left" valign="top">Dau</th>
<th align="left" valign="top">Dsy</th>
<th align="left" valign="top"><italic>Ca.</italic> Apy</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle" colspan="6">Nucleotide and amino acid metabolism</td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Purine metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00048</td>
<td align="left" valign="bottom">Inosine monophosphate biosynthesis, PRPP + glutamine&#x2265;IMP</td>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00049</td>
<td align="left" valign="bottom">Adenine ribonucleotide biosynthesis, IMP&#x2265;ADP,ATP</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00050</td>
<td align="left" valign="bottom">Guanine ribonucleotide biosynthesis IMP&#x2265;GDP,GTP</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Pyrimidine metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00051</td>
<td align="left" valign="bottom">Uridine monophosphate biosynthesis, glutamine (+PRPP)&#x2265;UMP</td>
<td/>
<td/>
<td/>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00052</td>
<td align="left" valign="bottom">Pyrimidine ribonucleotide biosynthesis, UMP&#x2265;UDP/UTP,CDP/CTP</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Serine and threonine metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00018</td>
<td align="left" valign="bottom">Threonine biosynthesis, aspartate&#x2265;homoserine&#x2265;threonine</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Cysteine and methionine metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00021</td>
<td align="left" valign="bottom">Cysteine biosynthesis, serine&#x2265;cysteine</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Branched-chain amino acid metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00019</td>
<td align="left" valign="bottom">Valine/isoleucine biosynthesis, pyruvate&#x2265;valine/2-oxobutanoate&#x2265;isoleucine</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00570</td>
<td align="left" valign="bottom">Isoleucine biosynthesis, threonine&#x2265;2-oxobutanoate&#x2265;isoleucine</td>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00432</td>
<td align="left" valign="bottom">Leucine biosynthesis, 2-oxoisovalerate&#x2265;2-oxoisocaproate</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Lysine metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00526</td>
<td align="left" valign="bottom">Lysine biosynthesis, DAP dehydrogenase pathway, aspartate&#x2265;lysine</td>
<td/>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00527</td>
<td align="left" valign="bottom">Lysine biosynthesis, DAP aminotransferase pathway, aspartate&#x2265;lysine</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Arginine and proline metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00015</td>
<td align="left" valign="bottom">Proline biosynthesis, glutamate&#x2265;proline</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00028</td>
<td align="left" valign="bottom">Ornithine biosynthesis, glutamate&#x2265;ornithine</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Histidine metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00045</td>
<td align="left" valign="bottom">Histidine degradation, histidine&#x2265;N-formiminoglutamate&#x2265;glutamate</td>
<td/>
<td/>
<td/>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00026</td>
<td align="left" valign="bottom">Histidine biosynthesis, PRPP&#x2265;histidine</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Aromatic amino acid metabolism</td>
</tr>
<tr>
<td align="left" valign="middle">M00022</td>
<td align="left" valign="bottom">Shikimate pathway, phosphoenolpyruvate+erythrose-4P&#x2265;chorismate</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00023</td>
<td align="left" valign="bottom">Tryptophan biosynthesis, chorismate&#x2265;tryptophan</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00024</td>
<td align="left" valign="bottom">Phenylalanine biosynthesis, chorismate&#x2265;phenylalanine</td>
<td/>
<td/>
<td/>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00025</td>
<td align="left" valign="bottom">Tyrosine biosynthesis, chorismate&#x2265;tyrosine</td>
<td/>
<td/>
<td/>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Cofactor and vitamin biosynthesis</td>
</tr>
<tr>
<td align="left" valign="middle">M00127</td>
<td align="left" valign="bottom">Thiamine biosynthesis, AIR&#x2265;thiamine-P/thiamine-2P</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00125</td>
<td align="left" valign="bottom">Riboflavin biosynthesis, GTP&#x2265;riboflavin/FMN/FAD</td>
<td/>
<td/>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00115</td>
<td align="left" valign="bottom">NAD biosynthesis, aspartate&#x2265;NAD</td>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
</tr>
<tr>
<td align="left" valign="middle">M00119</td>
<td align="left" valign="bottom">Pantothenate biosynthesis, valine/L-aspartate&#x2265;pantothenate</td>
<td/>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00120</td>
<td align="left" valign="bottom">Coenzyme A biosynthesis, pantothenate&#x2265;CoA</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00123</td>
<td align="left" valign="bottom">Biotin biosynthesis, pimeloyl-ACP/CoA&#x2265;biotin</td>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="middle">M00122</td>
<td align="left" valign="bottom">Cobalamin biosynthesis, cobinamide&#x2265;cobalamin</td>
<td/>
<td/>
<td/>
<td align="left" valign="middle">(&#x2212;)</td>
</tr>
<tr>
<td align="left" valign="middle">M00140</td>
<td align="left" valign="bottom">C1-unit interconversion, prokaryotes</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
<tr>
<td align="left" valign="middle">M00141</td>
<td align="left" valign="bottom">C1-unit interconversion, eukaryotes</td>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="middle" colspan="6">Polyamine biosynthesis</td>
</tr>
<tr>
<td align="left" valign="middle">M00133</td>
<td align="left" valign="bottom">Polyamine biosynthesis, arginine&#x2265;agmatine&#x2265;putrescine&#x2265;spermidine</td>
<td align="left" valign="middle"><bold>+</bold></td>
<td align="left" valign="middle"><bold>+</bold></td>
<td/>
<td align="left" valign="middle"><bold>+</bold></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Closely related known acetogens: Mta, <italic>Moorella thermoacetica</italic>, Dau, <italic>Ca. Desulforudis audaxviator</italic>, and Dsy, <italic>Desulfitobacterium hafniense</italic>. +, complete pathway, (+), known complete pathways not identified through the KEGG module. (&#x2212;), near-complete, with at least one gene.</p>
</table-wrap-foot>
</table-wrap>
<p>Genes not covered by pathway analysis were searched using the KEGG-annotated genome data obtained through BlastKOALA for all contigs. Genes necessary for completion of near-complete pathways or those that may be involved in alternative pathways were searched and identified. Information regarding other genes relating to biofilm formation, chemotaxis and flagella, pilus formation, metal transport or utilization, and others was gathered from the annotated genome.</p>
</sec>
</sec>
<sec id="sec8" sec-type="results">
<title>Results</title>
<sec id="sec9">
<title>Evolutionary Relatedness</title>
<p>The evolutionary relationships of <italic>Ca.</italic> A. pyornia indicate this organism represents a novel acetogen within the bacterial Class Clostridia (<xref rid="fig1" ref-type="fig">Figure 1</xref>; <xref ref-type="supplementary-material" rid="SM4">Supplementary Figure 1</xref>). <italic>Candidatus</italic> A. pyornia is placed within an &#x201C;acetogen&#x201D; cluster which includes <italic>M. thermoacetica</italic> (<xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>), <italic>Ca.</italic> D. audaxviator (<xref ref-type="bibr" rid="ref3">Chivian et al., 2008</xref>), and <italic>D. hafniense</italic> (<xref ref-type="bibr" rid="ref32">Nonaka et al., 2006</xref>; <xref ref-type="supplementary-material" rid="SM4">Supplementary Figure 1</xref>). FastANI analysis did not return results as there was &#x003C;80% whole genome nucleotide similarity between <italic>Ca.</italic> A. pyornia and all nearby available genomes. An ANI percentage of &#x2265;95% is considered the same species as the reference genome (<xref ref-type="bibr" rid="ref14">Jain et al., 2018</xref>); thus, <italic>Ca.</italic> A. pyornia is at least a new species and most likely a separate genus of Clostridia.</p>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption><p>Whole genome phylogenetic tree for <italic>Ca.</italic> Acetocimmeria pyornia and closest genomic relatives. Branches with no bootstrap numbers are 1.000. Branches whose bootstrap numbers are less than 1.000 are indicated on the branch.</p></caption>
<graphic xlink:href="fmicb-12-764631-g001.tif"/>
</fig>
</sec>
<sec id="sec10">
<title>Genome Attributes</title>
<p>The genome of <italic>Ca.</italic> A. pyornia was predicted to be 91.5% complete, with 3.96% contamination detected (<xref ref-type="bibr" rid="ref44">Smith et al., 2019</xref>). Of 2,725 predicted coding sequences, 1,446 genes were annotated to a given KEGG orthologous functional group (KO number). <italic>Candidatus</italic> A. pyornia&#x2019;s genome summary statistics (i.e., genome size and number of protein-encoding genes) were more similar to those of the closely related acetogens <italic>Ca.</italic> D. audaxviator and <italic>M. thermoacetica</italic> than to <italic>D. hafniense</italic>. <italic>Desulfopertinax hafniense</italic>&#x2019;s genome is roughly twice the size with double the amount of protein-encoding genes (<xref ref-type="bibr" rid="ref32">Nonaka et al., 2006</xref>). The G+C content, which can reflect environmental factors, was similar among all acetogens in this study (~50%) with the exception of <italic>Ca.</italic> D. audaxviator, whose G+C content is much higher (60.9%; <xref ref-type="bibr" rid="ref3">Chivian et al., 2008</xref>). When <italic>Ca.</italic> A. pyornia&#x2019;s genome attributes from BlastKOALA were compared to the seven other high-quality bacterial genomes (&#x003E;90% complete and &#x003C;5% contamination; <xref ref-type="bibr" rid="ref55">Bowers et al., 2017</xref>) from the same community, <italic>Ca.</italic> A. pyornia had the highest percentage of genes for amino acid metabolism and respiration and lowest percentage of carbohydrate metabolism genes. There were 21 protease genes and 15 peptidase genes identified in the <italic>Ca.</italic> A. pyornia genome.</p>
</sec>
<sec id="sec11">
<title>Wood&#x2013;Ljungdahl Pathway and the acs Gene Cluster</title>
<p><italic>Candidatus</italic> A. pyornia contained the complete Wood&#x2013;Ljungdahl pathway for acetogenesis (<xref rid="fig2" ref-type="fig">Figure 2</xref>). This included genes that code for enzymes and electron transfer agents needed to complete both branches of the pathway. This organism&#x2019;s genome also contained the complete phosphate acetyltransferase&#x2013;acetate kinase pathway to transform acetyl-CoA to acetate and recover the energy expended earlier in the pathway (<xref rid="fig2" ref-type="fig">Figure 2</xref>). Genes for the carbonyl branch of the Wood&#x2013;Ljungdahl pathway are contained within an <italic>acs</italic> gene cluster, which is commonly found in acetogens (<xref rid="fig3" ref-type="fig">Figure 3</xref>; <xref rid="tab1" ref-type="table">Table 1</xref>). The synteny of the <italic>Ca.</italic> A. pyornia <italic>acs</italic> gene cluster most closely resembled that of <italic>Ca.</italic> D. audaxviator, with only slight differences in the annotations of a 4Fe-4S ferredoxin gene, which is specifically denoted as heterodisulfide reductase A (<italic>hdrA</italic>) in <italic>Ca.</italic> A. pyornia. The <italic>acs</italic> gene clusters of <italic>M. thermoacetica</italic> and <italic>D. hafniense</italic> were less similar to the <italic>acs</italic> gene cluster of <italic>Ca.</italic> A. pyornia due to the addition or loss of genes within the cluster.</p>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption><p>The Wood&#x2013;Ljungdahl pathway for carbon fixation. The acetogenic Wood&#x2013;Ljungdahl pathway (or acetyl-CoA) pathway for carbon fixation uses molecular hydrogen and carbon dioxide to produce acetate, generating ATP. If needed, acetyl-CoA can be rerouted for biomass production. The enzymes in this pathway are as follows: <italic>a</italic>, formate dehydrogenase; <italic>b</italic>, formyl-H<sub>4</sub>folate synthase; <italic>c</italic>, formyl-H<sub>4</sub>folate cyclohydrolase; <italic>d</italic>, methylene-H<sub>4</sub>folate dehydrogenase; <italic>e</italic>, methylene-H<sub>4</sub>folate reductase; <italic>f</italic>, methyltransferase; <italic>g</italic>, carbon monoxide dehydrogenase; <italic>h</italic>, acetyl-CoA synthase; <italic>i</italic>, phosphotransacetylase; and <italic>j</italic>, acetate kinase. THF (tetrahydrofolate)=H<sub>4</sub>. The genome of <italic>Ca.</italic> A. pyornia also contains the complete phosphate acetyltransferase-acetate kinase pathway.</p></caption>
<graphic xlink:href="fmicb-12-764631-g002.tif"/>
</fig>
<fig position="float" id="fig3">
<label>Figure 3</label>
<caption><p>Arrangement of <italic>acs</italic> gene cluster of the Wood&#x2013;Ljungdahl pathway on contig-65_203 of <italic>Ca.</italic> Acetocimmeria pyornia as compared to closely related acetogens. Descriptions of abbreviated gene IDs and relevant Enzyme Commission (EC) numbers (in brackets) are: <italic>acsA/cooS</italic>, acetyl-CoA synthase alpha subunit/carbon-monoxide dehydrogenase catalytic subunit [EC:1.2.7.4]; <italic>acsB</italic>, acetyl-CoA synthase beta subunit [EC:2.3.1.169]; <italic>acsC/cdhE</italic>, acetyl-CoA decarbonylase/synthase complex subunit gamma [EC:2.1.1.245]; ferredoxin (RefSeq), no KEGG orthology (KO) assigned; <italic>acsF/cooC</italic>, CO dehydrogenase maturation factor; cas, cobyrinic acid a,c-diamide synthase (RefSeq), no KO assigned; <italic>acsD/cdhD</italic>, acetyl-CoA decarbonylase/synthase complex subunit delta [EC:2.1.1.245]; <italic>acsE</italic>, 5-methyltetrahydrofolate corrinoid/iron sulfur protein methyltransferase [EC:2.1.1.258]; <italic>hdrC</italic>, heterodisulfide reductase subunit C; hp, hypothetical protein; <italic>hdrA</italic>, heterodisulfide reductase subunit A [EC:1.8.98.1]; 4Fe-4S ferredoxin (RefSeq), no KO assigned; <italic>d</italic>, methyl-viologen-reducing hydrogenase subunit delta (RefSeq), no KO assigned; zfp, zinc-finger protein (RefSeq), no KO assigned; and <italic>metF</italic>, methylenetetrahydrofolate reductase (NADPH) [EC:1.5.1.20].</p></caption>
<graphic xlink:href="fmicb-12-764631-g003.tif"/>
</fig>
</sec>
<sec id="sec12">
<title>Incomplete TCA Cycle</title>
<p><italic>Candidatus</italic> A. pyornia encodes for an incomplete TCA cycle; the genes coding for the enzymes malate dehydrogenase and citrate synthase are missing from this MAG (<xref rid="fig4" ref-type="fig">Figure 4</xref>). Malate dehydrogenase transforms malate into oxaloacetate, and citrate synthase uses oxaloacetate and acetyl-CoA to produce citrate and begin the cycle again. However, if portions of the TCA cycle are run in reverse, key biosynthetic intermediates can still be produced from anaplerotic reactions. This genome bin does, however, contain the oxaloacetate-decarboxylating malate dehydrogenase, which is situated in a TCA cluster on the same contig immediately following the <italic>acs</italic> cluster from the Wood&#x2013;Ljungdahl pathway (<xref rid="tab1" ref-type="table">Table 1</xref>). This portion of the contig also encodes genes for fumarate hydratase, all four succinate dehydrogenase/fumarate reductase subunits, and succinyl-CoA synthetase; however, the oxaloacetate-decarboxylating malate dehydrogenase is likely making pyruvate as a part of anaplerosis, converting it to phosphoenolpyruvate (PEP) and on up the gluconeogenesis/glycolysis pathway to make carbohydrates in order to supplement what it cannot get from the environment. At least one other pathway that results in oxaloacetate formation was identified. The enzyme aspartate aminotransferase can produce oxaloacetate and L-glutamate from L-aspartate and &#x03B1;-ketoglutarate (<xref rid="fig4" ref-type="fig">Figure 4</xref>). Oxaloacetate can also be converted to pyruvate using oxaloacetate decarboxylase. The gene for citrate lyase beta subunit was also present; therefore it is possible that other genes not identified could be present in the genome, allowing <italic>Ca.</italic> A. pyornia to produce oxaloacetate from citrate in the reverse direction (<xref ref-type="bibr" rid="ref39">Ragsdale and Pierce, 2008</xref>). <italic>Candidatus</italic> A. pyornia contains at least two complete pathways for the production of acetate from acetyl-CoA (<xref rid="fig4" ref-type="fig">Figure 4</xref>) and can convert pyruvate to PEP and vice versa.</p>
<fig position="float" id="fig4">
<label>Figure 4</label>
<caption><p>Tricarboxylic Acid (TCA) cycle and related pathways in <italic>Ca.</italic> A. pyornia. <italic>Candidatus</italic> A. pyornia has an incomplete TCA cycle and is missing genes that code for two enzymes: citrate synthase and malate dehydrogenase (red arrows <italic>o</italic> and <italic>g</italic>). Oxaloacetate can still be produced from L-aspartate and 2-oxoglutarate, and oxaloacetate can be decarboxylated to pyruvate. Acetate can be produced through multiple pathways, including the phosphate acetyltransferase-acetate kinase pathway. The enzymes labeled <italic>a</italic> &#x2013; <italic>s</italic> are as follows: <italic>a</italic>: pyruvate, orthophosphate dikinase, <italic>b</italic>: pyruvate kinase, <italic>c</italic>: pyruvate dehydrogenase, <italic>d</italic>: acetyl-CoA synthase, <italic>e</italic>: putative phosphotransacetylase, <italic>f</italic>: acetate kinase, <italic>g</italic>: citrate synthase, <italic>h</italic> and <italic>i</italic>: aconitase, <italic>j</italic>: isocitrate dehydrogenase, <italic>k</italic>: &#x03B1;-ketoglutarate dehydrogenase, <italic>l</italic>: succinyl-CoA synthetase, <italic>m</italic>: succinate dehydrogenase, <italic>n</italic>: fumarase, <italic>o</italic>: malate dehydrogenase, <italic>p</italic>: aspartate aminotransferase, <italic>q</italic>: oxaloacetate decarboxylase, <italic>r</italic>: pyruvate synthase, <italic>s</italic>: ATP citrate lyase, and <italic>t</italic>: malate dehydrogenase (oxaloacetate-decarboxylating)/malic enzyme. PEP, phosphoenolpyruvate.</p></caption>
<graphic xlink:href="fmicb-12-764631-g004.tif"/>
</fig>
</sec>
<sec id="sec13">
<title>Oxidative Phosphorylation</title>
<p><italic>Candidatus</italic> A. pyornia contained all but one gene to construct a complete anaerobic electron transport chain (ETC) for oxidative phosphorylation. We found the near-complete gene set for Complex I, a complete Complex II, and a complete prokaryotic F-type ATPase (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table 1</xref>). Complex I, or NADH: quinone oxidoreductase (NADH dehydrogenase), is a multi-subunit complex consisting of 14 gene products, 13 of which are present in the <italic>Ca.</italic> A. pyornia genome [subunits A&#x2013;F and H&#x2013;N (no G), <xref ref-type="supplementary-material" rid="SM1">Supplementary Table 1</xref>]. We also found the complete pathway for succinate dehydrogenase, which consists of four genes that code for each of the four subunits of this enzyme, <italic>sdhABCD</italic> (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table 1</xref>). <italic>Moorella thermoacetica</italic>, <italic>Ca.</italic> D. audaxviator, and <italic>D. hafniense</italic> are all missing the complete gene set to make succinate dehydrogenase (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table 1</xref>), but all have at least one subunit in their genome. All four acetogens discussed in this manuscript have the F-type ATPase.</p>
</sec>
<sec id="sec14">
<title>Incomplete Sulfate Reduction Pathway</title>
<p>The <italic>Ca.</italic> A. pyornia genome does not contain genes that allow it to perform dissimilatory sulfate reduction (<xref rid="tab2" ref-type="table">Table 2</xref>). Dissimilatory sulfate reductase genes (<italic>dsrA</italic> and <italic>dsrB</italic>) were not present in the genome, although <italic>sat</italic> and <italic>aprA</italic>B genes, whose enzyme products catalyze the reversible transformation of sulfate to sulfite, were detected (<xref rid="tab2" ref-type="table">Table 2</xref>).</p>
</sec>
<sec id="sec15">
<title>Transporters</title>
<p>Genes encoding transporters for proteinaceous compounds (e.g., oligopeptide, peptide/nickel, and branched chain amino acid transporters), minerals, and metals were exclusively found within the <italic>Ca.</italic> A. pyornia genome (<xref ref-type="supplementary-material" rid="SM2">Supplementary Tables 2</xref>, <xref ref-type="supplementary-material" rid="SM3">3</xref>; <xref rid="fig5" ref-type="fig">Figure 5</xref>). When coupled with a cytosolic aminotransferase, the branched chain amino acid transport system allows for the uptake and utilization of leucine, isoleucine, and valine. <italic>Candidatus</italic> A. pyornia also encodes one of three enzymes that make up the branched-chain &#x03B1;-keto acid dehydrogenase complex, involved in the degradation of branched-chain amino acids to produce ATP. There were also complete pathways for iron, nickel, molybdate, and tungstate transporters, some of which are cofactors required for Wood&#x2013;Ljungdahl pathway and respiratory enzymes.</p>
<fig position="float" id="fig5">
<label>Figure 5</label>
<caption><p>Metabolic pathways of <italic>Ca.</italic> A. pyornia. This organism contains the complete Wood&#x2013;Ljungdahl pathway for carbon fixation, hydrogenases, and no carbohydrate or lipid importers. It is predicted that this organism is able to take advantage of hydrogen production on Fe<sup>2+</sup>-bearing minerals like olivine as they react with seawater. Hydrogen and bicarbonate can be funneled into the Wood&#x2013;Ljungdahl pathway to produce acetyl-CoA for biosynthesis and energy. This organism is missing many amino acid synthesis pathways and likely needs to import these for biosynthesis or energy. Individual transport or regulatory systems pathways (denoted by lower case letters here) are listed in <xref rid="tab2" ref-type="table">Tables 2</xref>&#x2013;<xref rid="tab4" ref-type="table">4</xref> and <xref ref-type="supplementary-material" rid="SM1">Supplementary Tables 1</xref>&#x2013;<xref ref-type="supplementary-material" rid="SM3">3</xref> under the headings that match the labels in this figure.</p></caption>
<graphic xlink:href="fmicb-12-764631-g005.tif"/>
</fig>
<p><italic>Candidatus</italic> A. pyornia appears unable to import lipids or carbohydrate molecules from the environment, since genes encoding complete transporters were missing from its genome. These include the saccharide, polyol, and lipid transport system and sugar-importing phosphotransferase (PTS) systems (<xref ref-type="supplementary-material" rid="SM2">Supplementary Table 2</xref>).</p>
<p>Genes involved in export of lipooligosaccharides, heme (used in membrane-bound succinate dehydrogenase), and other components related to the cell membrane or extracellular structures were detected (<xref ref-type="supplementary-material" rid="SM2">Supplementary Table 2</xref>). The Sec secretion system, involved in the secretion of proteases, toxins, and pilus proteins like adhesin, was present in all acetogens compared in this study. Pathways for export of some lipooligosaccharides, which are likely membrane components, and heme, which is an integral part of the membrane-bound succinate dehydrogenase complex, were also present in the genome.</p>
</sec>
<sec id="sec16">
<title>Central Carbohydrate Metabolism</title>
<p><italic>Candidatus</italic> A. pyornia may be capable of glycolysis <italic>via</italic> an alternative route to the canonical pathway. The glycolysis pathway is missing the step performed by fructose bisphosphate aldolase (<xref rid="tab3" ref-type="table">Table 3</xref>), which converts &#x03B1;-D fructose 1,6-bisphosphate to D-glyceraldehyde 3-phosphate; however, the reverse of this step is not reported missing in the gluconeogenesis pathway, as <italic>Ca.</italic> A. pyornia contains the fructose 1, 6-bisphosphate aldolase, phosphatase. The gluconeogenesis pathway was deemed incomplete since the conversion of oxaloacetate to phosphoenolpyruvate <italic>via</italic> PEP carboxykinase is missing, but this organism encodes the oxaloacetate decarboxylase and pyruvate orthophosphate&#x2013;dikinase enzymes, which provide an alternative route to gluconeogenesis (<xref rid="fig4" ref-type="fig">Figure 4</xref>). These enzymes enable the conversion of oxaloacetate to pyruvate and then to PEP, from which gluconeogenesis can then proceed. There are complete pathways for the glycolysis 3-C compound core module and the non-oxidative pentose phosphate pathway. The pentose phosphate pathway allows this organism to produce 5-carbon compounds for biosynthesis, and the gluconeogenesis pathway allows it to produce 6-carbon compounds. <italic>Moorella thermoacetica</italic>, <italic>Ca.</italic> D. audaxviator, and <italic>D. hafniense</italic> all have complete pathways for glycolysis and the pentose phosphate pathway (<xref rid="tab3" ref-type="table">Table 3</xref>).</p>
</sec>
</sec>
<sec id="sec17" sec-type="discussions">
<title>Discussion</title>
<sec id="sec18">
<title>Piecing Together the Genomic Puzzle</title>
<p>Novel genomic evidence from this study suggests that <italic>Ca.</italic> A. pyornia is a thermophilic acetogen from the bacterial order Clostridiales. Previously we found evidence that <italic>Ca.</italic> A. pyornia is a dominant member from a mineral biofilm collected from a deep subsurface oceanic crust sample (estimated to be ~15% of the community; <xref ref-type="bibr" rid="ref43">Smith et al., 2016</xref>). This organism has the genetic potential to act as the dominant primary producer of this simple microbial community (<xref ref-type="bibr" rid="ref44">Smith et al., 2019</xref>) and appears to be capable of chemosynthesis and energy generation using molecular hydrogen as a reductant to fix inorganic carbon in seawater. It may be supplementing its energy by utilizing amino acids or peptides commonly found in the mineral-attached biofilms, but it does not appear capable of sulfate reduction or importing carbohydrates, as has been observed for other acetogenic Clostridiales.</p>
<p>The full annotated MAG of <italic>Ca.</italic> A. pyornia is most similar to terrestrial acetogens <italic>M. thermoacetica</italic> and <italic>Ca.</italic> D. audaxviator; however, the genome of <italic>Ca.</italic> A. pyornia is not complete and can complicate pathway reconstruction efforts and hinder our understanding of its role in the subsurface biosphere. Further, this MAG was obtained from a flow cell loaded with crystals incubated in a subseafloor borehole rather than directly obtained from the pristine basaltic aquifer. The potential for contamination from surrounding fluids or <italic>via</italic> drilling processes adds to the uncertainty in this study. Nonetheless, the evolutionary relationships among acetogens in this clade provide strong evidence that this MAG represents a thermophilic organism adapted to a deep marine subsurface environment and is capable of performing acetogenesis using molecular hydrogen and CO<sub>2</sub>.</p>
</sec>
<sec id="sec19">
<title>Evolutionary Relationship to the Acetogen Firmicutes Clade</title>
<p>The phylogenetic relationships of <italic>Ca.</italic> A. pyornia to other acetogens suggest that this is a novel organism within a clade of related acetogens (<xref rid="fig1" ref-type="fig">Figure 1</xref>; <xref ref-type="supplementary-material" rid="SM4">Supplementary Figure 1</xref>). This group now contains both marine and terrestrial organisms; however, the marine acetogens from oceanic crust have only recently been described (<xref ref-type="bibr" rid="ref17">Jungbluth et al., 2017</xref>; <xref ref-type="bibr" rid="ref44">Smith et al., 2019</xref>). The terrestrial acetogen <italic>Ca.</italic> D. audaxviator was first described from deep continental crust (<xref ref-type="bibr" rid="ref3">Chivian et al., 2008</xref>), <italic>D. hafniense</italic> from contaminated soil (<xref ref-type="bibr" rid="ref32">Nonaka et al., 2006</xref>), and <italic>M. thermoacetica</italic> from horse manure (<xref ref-type="bibr" rid="ref9">Fontaine et al., 1942</xref>). A new marine acetogen related to <italic>Ca.</italic> D. audaxviator, <italic>Ca.</italic> Desulfopertinax cowenii, was recently described from the JdFR flank aquifer fluid community (<xref ref-type="bibr" rid="ref17">Jungbluth et al., 2017</xref>), and seven other putative acetogens were discovered from the JdFR olivine community, one of which is the <italic>Ca.</italic> A. pyornia MAG described here. <italic>Candidatus</italic> D. audaxviator and <italic>M. thermoacetica</italic> appear to be the closest genomic relatives to <italic>Ca.</italic> A. pyornia that are known acetogens; however, their genome similarity to <italic>Ca.</italic> A. pyornia does not exceed 80%. We are proposing <italic>Ca.</italic> A. pyornia represents at the very least a new genus within the Clostridiales.</p>
</sec>
<sec id="sec20">
<title>A Biofilm-Adapted Primary Producer?</title>
<p>The presence of this acetogen could impact carbon cycling and the health and function of the suboceanic aquifer ecosystem and the deep ocean by fixing carbon that makes its way back to the seafloor (<xref ref-type="bibr" rid="ref27">Mason et al., 2009</xref>; <xref ref-type="bibr" rid="ref29">McCarthy et al., 2010</xref>). <italic>Candidatus</italic> A. pyornia may be a prominent member of the attached community in Hole 1301A (and perhaps 1026B) as evidenced by its ubiquity in colonized rock and mineral biofilms (<xref ref-type="bibr" rid="ref33">Orcutt et al., 2011</xref>; <xref ref-type="bibr" rid="ref43">Smith et al., 2016</xref>) and because it encodes the full Wood&#x2013;Ljungdahl carbon fixation pathway (<xref ref-type="bibr" rid="ref44">Smith et al., 2019</xref>; <xref rid="fig2" ref-type="fig">Figure 2</xref>). Additionally, olivine-rich crust in this same region of the seafloor was previously revealed as a habitat for active methane- and sulfur-cycling microbes (<xref ref-type="bibr" rid="ref23">Lever et al., 2013</xref>). This organism&#x2019;s genome also encodes reductases and a ferredoxin that may be used to generate ATP <italic>via</italic> oxidative phosphorylation (<xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>). Since this organism has only been described from mineral biofilms and does not appear to be a dominant member of the aquifer fluid community, the organism this genome represents has indications that it benefits from a mineral biofilm lifestyle. A mineral biofilm can provide a wider range of substrates for metabolism either obtained from the mineral as it reacts with water or from other members of the community. Since H<sub>2</sub> can be produced by mineral surfaces as they react with water in the thermal aquifer (<xref ref-type="bibr" rid="ref28">Mayhew et al., 2013</xref>), this organism may be using H<sub>2</sub> to power the Wood&#x2013;Ljungdahl pathway. Charged mineral surfaces can also attract other charged molecules like certain amino acids from aquifer fluid (<xref ref-type="bibr" rid="ref24">Lin et al., 2015</xref>), effectively concentrating these nutrients near the mineral surface. There is also some indication that this organism may participate in a syntrophic relationship in the biofilm community, especially since <italic>Ca.</italic> A. pyornia is missing some complete biosynthetic pathways, such as those involved in amino acid biosynthesis, and is able to import small peptides (<xref rid="tab4" ref-type="table">Table 4</xref>; <xref ref-type="supplementary-material" rid="SM2">Supplementary Table 2</xref>).</p>
</sec>
<sec id="sec21">
<title><italic>acs</italic> Gene Cluster Comparison</title>
<p>The co-localization of <italic>acs</italic> genes within a gene clusters is a common attribute of acetogens and may allow this pathway to be horizontally transferred (<xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>). Previously, it was found that <italic>acs</italic> gene clusters come in several varieties (<xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>), and the gene arrangement of <italic>Ca.</italic> A. pyornia, along with the gene cluster that most closely resembles that of <italic>Ca.</italic> A. pyornia (<italic>Ca.</italic> D. audaxviator), may form a new class of <italic>acs</italic> gene cluster that is unlike those that were described before. The gene clusters of <italic>Ca.</italic> A. pyornia and <italic>Ca.</italic> D. audaxviator are near-identical, with the only difference being the annotation for the 4Fe-4S ferredoxin (green arrow, <xref rid="fig3" ref-type="fig">Figure 3</xref>). This gene was annotated as <italic>hdrA</italic>, also a 4Fe-4S ferredoxin, but these genes may represent isoforms of the same heterodisulfide reductase.</p>
<p>The <italic>acs</italic> gene clusters of <italic>M. thermoacetica</italic> and <italic>D. hafniense</italic> are similar to the <italic>Ca.</italic> A. pyornia and <italic>Ca.</italic> D. audaxviator gene clusters in that they share the same <italic>acs</italic> genes in the same arrangement (<xref rid="fig3" ref-type="fig">Figure 3</xref>); however, there are some marked differences with each genome. <italic>Moorella thermoacetica</italic>&#x2019;s genome contains another heterodisulfide reductase gene (<italic>hdrC</italic>) and a hypothetical protein (hp) after the <italic>acsE</italic> gene, and <italic>D. hafniense</italic> has a simplified gene cluster with no other associated genes after <italic>acsE</italic>. Both <italic>M. thermoacetica</italic> and <italic>D. hafniense</italic> also contain an <italic>acsF</italic> gene at the beginning of the <italic>acs</italic> cluster (<xref rid="fig3" ref-type="fig">Figure 3</xref>). <italic>Moorella thermoacetica</italic> also contains an unannotated isoform of the acsF gene within its cluster, the cobyrinic acid a,c-diamide synthase gene. Due to the similarity of <italic>Ca.</italic> A. pyornia&#x2019;s <italic>acs</italic> gene cluster with that of <italic>Ca.</italic> D. audaxviator&#x2019;s, it is likely that these organisms have a shared history relating to the Wood&#x2013;Ljungdahl pathway, and these gene clusters may form a new cluster arrangement apart from the one <italic>M. thermoacetica</italic> or <italic>D. hafniense</italic> occupies.</p>
</sec>
<sec id="sec22">
<title>Incomplete TCA Cycle</title>
<p><italic>Candidatus</italic> A. pyornia contains an incomplete TCA cycle and is missing citrate synthase and malate dehydrogenase. Although incomplete TCA cycles are common in Clostridia, such as the closely related acetogens discussed here (<xref ref-type="bibr" rid="ref32">Nonaka et al., 2006</xref>; <xref ref-type="bibr" rid="ref3">Chivian et al., 2008</xref>; <xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>), the missing enzymes vary. All of the bacteria from the olivine community to which <italic>Ca.</italic> A. pyornia belongs (eight microbes &#x2013; two &#x201C;Bacteria,&#x201D; six Clostridia) were also missing citrate synthase from their genomes, and the acetogens <italic>Ca.</italic> D. audaxviator and <italic>D. hafniense</italic> are also missing this enzyme. It is not known whether <italic>Ca.</italic> A. pyornia can make citrate using alternative pathways. <italic>Candidatus</italic> A. pyornia is also missing malate dehydrogenase, as is <italic>M. thermoacetica</italic> (<xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>); however, <italic>Ca.</italic> A. pyornia should still be able to make oxaloacetate through the aspartate aminotransferase reaction (<xref rid="fig4" ref-type="fig">Figure 4</xref>). It is possible this is a sign of genome reduction in the deep subsurface, but since it appears widespread in this biofilm community, this reduction would have had to have taken place long ago in an ancestral lineage. Alternatively, the ancestral organisms in this environment never evolved to have them and this absence of TCA cycle steps was the ancestral form.</p>
<p>The other closely related acetogens to <italic>Ca.</italic> A. pyornia are also missing some TCA cycle genes; however, these varied among genomes. As noted above, among those that shared the same missing enzymes as <italic>Ca.</italic> A. pyornia, <italic>Ca.</italic> D. audaxviator and <italic>D. hafniense</italic> are both missing citrate synthase (<xref ref-type="bibr" rid="ref32">Nonaka et al., 2006</xref>; <xref ref-type="bibr" rid="ref3">Chivian et al., 2008</xref>), and <italic>M. thermoacetica</italic> is missing malate dehydrogenase (<xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>). <italic>Candidatus</italic> D. audaxviator is missing the additional enzymes aconitase and &#x03B1;-ketoglutarate dehydrogenase, <italic>M. thermoacetica</italic> is also missing succinyl-CoA synthetase and fumarase, and <italic>D. hafniense</italic> is missing isocitrate dehydrogenase and succinate dehydrogenase. <italic>Candidatus</italic> D. audaxviator and <italic>D. hafniense</italic> do contain fumarate reductase that can act similarly to succinate dehydrogenase; thus, these organisms can still complete the reversible enzymatic step that forms fumarate from succinate, depending on the conditions present (<xref ref-type="bibr" rid="ref32">Nonaka et al., 2006</xref>; <xref ref-type="bibr" rid="ref3">Chivian et al., 2008</xref>). However, <italic>Ca.</italic> A. pyornia is the only one of this group that contains all four succinate dehydrogenase genes that code for different subunits (<xref ref-type="bibr" rid="ref44">Smith et al., 2019</xref>).</p>
</sec>
<sec id="sec23">
<title>Oxidative Phosphorylation</title>
<p><italic>Candidatus</italic> A. pyornia may use oxidative phosphorylation during anaerobic respiration with formate as a terminal electron acceptor. <italic>Candidatus</italic> A. pyornia contains the genes for succinate dehydrogenase (sdhABCD), and heme export (Complex II) has all but one gene coding for NADH dehydrogenase (Complex I) and has the complete gene set for production of the F-type ATPase (<xref ref-type="supplementary-material" rid="SM1">Supplementary Table 1</xref>). This genome also contains abundant hydrogenases, ferredoxins, and cytochromes that may be used to shuttle protons out of the cell to create a proton-motive force and drive oxidative phosphorylation (<xref ref-type="bibr" rid="ref51">Vignais and Billoud, 2007</xref>; <xref ref-type="bibr" rid="ref44">Smith et al., 2019</xref>). These electron transport proteins may be coupled to the Wood&#x2013;Ljungdahl pathway to provide additional energy for the cell, as has been proposed in <italic>M. thermoacetica</italic> (<xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>; <xref ref-type="bibr" rid="ref41">Schuchmann and M&#x00FC;ller, 2014</xref>). All genomes from the olivine community from which <italic>Ca.</italic> A. pyornia originated lacked at least one gene to complete NADH dehydrogenase, but most of the 11 high-quality genomes lacked more than one gene. <italic>Candidatus</italic> A. pyornia and two other genomes lacked only one gene, and one of these was missing the gene that codes for the same subunit as <italic>Ca.</italic> A. pyornia, <italic>nuoG</italic>. This organism is not an acetogen and is unrelated as it belongs to Archaea. Although these <italic>nuo</italic> genes were typically found in clusters of two or more throughout the olivine community genomes and were often associated with Wood&#x2013;Ljungdahl pathway genes, there did not appear to be a conserved operon with potential un-annotated or mis-annotated <italic>nuo</italic> genes. There is evidence that formate dehydrogenase and some NAD(P)<sup>+</sup>-dependent hydrogenases contain the <italic>nuoEFG</italic> module of NADH dehydrogenase (<xref ref-type="bibr" rid="ref10">Friedrich and Scheide, 2000</xref>), so perhaps <italic>Ca.</italic> A. pyornia is able to couple these enzymes together for respiration or use an alternative module to complete the NADH dehydrogenase complex. The NADH dehydrogenase complex has also been shown to be coupled with hydrogenases such as <italic>ech</italic>, and <italic>Ca.</italic> A. pyornia has this and a myriad of other hydrogenases that may be candidates for such coupling (<xref ref-type="bibr" rid="ref44">Smith et al., 2019</xref>). Thus, it remains unknown whether <italic>Ca.</italic> A. pyornia is able to synthesize a fully functional NADH dehydrogenase, but there may be a mechanism to allow it to complete anaerobic respiration. In contrast to the other acetogens discussed here, the deep subsurface <italic>Ca.</italic> D. audaxviator only has the <italic>nuoEFG</italic> complex, but <italic>M. thermoacetica</italic> and <italic>D. hafniense</italic> contain the complete gene set (<italic>nuoA</italic>&#x2013;<italic>nuoN</italic>) to synthesize this complex.</p>
</sec>
<sec id="sec24">
<title>Lack of Sulfate Reduction and Carbohydrate/Lipid Degradative Pathways</title>
<p><italic>Candidatus</italic> A. pyornia does not appear to use sulfate reduction, carbohydrates, or lipids for energy, although key genes may have been either unannotated or mis-annotated in our analysis. This may be indicated a loss of function due to competition with others for sulfate or a possible syntrophic relationship whereby less energy can be devoted to producing substrates that can be obtained from others nearby. Although the <italic>Ca.</italic> A. pyornia genome is high-quality and is nearly complete, we are missing 8.5% of its genome that may contain key metabolism genes. The closely related <italic>Ca.</italic> D. audaxviator is a sulfate reducer that uses nitrogen fixation and carbohydrate degradation to supplement its energy (<xref ref-type="bibr" rid="ref3">Chivian et al., 2008</xref>). <italic>Moorella thermoacetica</italic> also uses carbohydrates (<xref ref-type="bibr" rid="ref37">Pierce et al., 2008</xref>), and among other metabolic strategies, <italic>D. hafniense</italic> is known to dechlorinate compounds for energy (<xref ref-type="bibr" rid="ref32">Nonaka et al., 2006</xref>). Since <italic>Ca.</italic> A. pyornia is unable to use these substrates and energy sources for supplemental metabolism, it may be using other substrates that are not typical of the other acetogens.</p>
</sec>
<sec id="sec25">
<title>Transport and Degradation of Amino Acids and Oligopeptides</title>
<p>Although all acetogens considered in this study can import branched-chain amino acids, oligopeptides, and contain a peptide/nickel transporter, <italic>Ca.</italic> A. pyornia may rely on these compounds as a source of energy. <italic>Candidatus</italic> A. pyornia had more genes related to respiration and amino acid metabolism in its genome than any other organism in its community (4.86 and 9.17%, respectively), as well as a larger number of peptidases and proteases. This suggests that <italic>Ca.</italic> A. pyornia is either more dependent on amino acids and peptides as an additional source of energy than other members of its community or that it imports these compounds to spend less energy in producing them.</p>
<p>Dissolved amino acids are available in the JdFR aquifer, although their concentrations are less than typical submarine hydrothermal fluid and are instead roughly equivalent with concentrations in deep seawater (1&#x2013;13nM for dissolved free amino acids and 43&#x2013;89nM for dissolved hydrolyzable amino acids; <xref ref-type="bibr" rid="ref24">Lin et al., 2015</xref>). The lower amino acid concentrations in the JdFR basement fluids relative to seawater could indicate interaction with adsorptive mineral surfaces as has been suggested (<xref ref-type="bibr" rid="ref24">Lin et al., 2015</xref>), or it could be an indication of consumption by the microbial community. <italic>Candidatus</italic> A. pyornia could be one such organism contributing to this drawdown, and since it was detected here in mineral biofilms, it may be utilizing those amino acids adsorbing to the mineral surfaces as an energy or nitrogen source. For <italic>Ca.</italic> A. pyornia, it would be beneficial to use amino acid fermentation in this environment, especially since this organism does not appear to use or import other exogenous substrates, and using an uncommon metabolic strategy (<xref ref-type="bibr" rid="ref8">Fonknechten et al., 2010</xref>) may mean less competition.</p>
<p>Although the pathways to import and begin the degradation of branched-chain amino acids are present in the <italic>Ca.</italic> A. pyornia genome, the degradation pathway is incomplete. It is possible that the fate of these amino acids lies in Stickland fermentation; however, we were unable to identify enzymes that could be used to ferment amino acids using this strategy. Stickland fermentation couples the oxidation and deamination of one amino acid to the reduction and deamination of another (<xref ref-type="bibr" rid="ref30">Nisman, 1954</xref>; <xref ref-type="bibr" rid="ref8">Fonknechten et al., 2010</xref>), producing ATP during a subsequent enzymatic reaction involving acetylphosphate. <italic>Candidatus</italic> A. pyornia contains the enzymes alanine dehydrogenase and glycine dehydrogenase (<xref rid="sec32" ref-type="sec">Supplementary Table 5</xref>) to complete the deamination reaction; however, we could not identify an amino acid reductase that these could be coupled with. Glycine dehydrogenase is coupled with glycine hydroxymethyltransferase in the interconversion of glycine and serine during the production of a Wood&#x2013;Ljungdahl pathway intermediate 5,10-methylene THF (<xref ref-type="bibr" rid="ref8">Fonknechten et al., 2010</xref>). Alanine dehydrogenase will deaminate alanine and convert it to pyruvate, while producing NADH and H<sup>+</sup>. The NADH and H<sup>+</sup> are usually shuttled to an amino acid reductase to deaminate the other amino acid to acetylphosphate and then acetate. ATP is produced from each amino acid in the Stickland reaction when acetylphosphate is converted to acetate; therefore, it could be used as an alternative energy source for acetogens in environments like the suboceanic aquifer where carbohydrates and lipids are scarce.</p>
</sec>
<sec id="sec26">
<title>Central Carbohydrate Metabolism</title>
<p><italic>Candidatus</italic> A. pyornia may be able to build 5- and 6-carbon sugars for biosynthesis of saccharides, nucleotides, and energy and enzymatic cofactors <italic>via</italic> a modified gluconeogenesis pathway originating from pyruvate. The enzyme pyruvate, orthophosphate kinase allows for the conversion of pyruvate to phosphoenolpyruvate that is one of the steps of the Crassulacean acid metabolism (CAM) carbon fixation pathway (<xref rid="tab2" ref-type="table">Table 2</xref>; <xref rid="sec32" ref-type="sec">Supplementary Table 5</xref>; <xref rid="fig4" ref-type="fig">Figure 4</xref>). <italic>Candidatus</italic> A. pyornia is unable to complete glycolysis, which is unusual among the closely related acetogens (<xref rid="tab3" ref-type="table">Table 3</xref>); however, the lack of sugar importers along with the missing genes for the glycolysis pathway suggests that <italic>Ca.</italic> A. pyornia is not utilizing exogenous carbohydrates and must therefore synthesize these biomolecules from pyruvate using the modified gluconeogenesis pathway.</p>
</sec>
<sec id="sec27">
<title>Other Carbon Metabolisms</title>
<p><italic>Candidatus</italic> A. pyornia is able to synthesize nucleotides and some amino acids by using the non-oxidative pentose phosphate pathway (<xref rid="tab3" ref-type="table">Table 3</xref>); however, the pathway for lipid synthesis is incomplete. <italic>Candidatus</italic> A. pyornia is missing the gene necessary to make acetyl-CoA carboxylase (ACC), which forms malonyl-CoA from acetyl-CoA; however, this initiation of fatty acid synthesis is also incomplete in the other closely related acetogens (<xref rid="tab3" ref-type="table">Table 3</xref>). All of these acetogens are able to elongate fatty acids, so they may have an unknown pathway for malonyl-CoA synthesis.</p>
</sec>
</sec>
<sec id="sec28" sec-type="conclusions">
<title>Conclusion</title>
<p><italic>Candidatus</italic> A. pyornia represents a novel deep-sea thermophilic acetogen that may be utilizing amino acids instead of carbohydrates or sulfate reduction to supplement its carbon and energy metabolism. Its prevalence in this subseafloor-incubated olivine community is unusual in that its acetogenic metabolism is not predicted under thermodynamic considerations. This organism&#x2019;s products may be a source of carbon for others in the community, particularly since it is capable of chemoautotrophy and produces acetate as a waste product. <italic>Candidatus</italic> A. pyornia&#x2019;s genome indicates that this organism is uniquely suited to life on Fe-bearing minerals in the subseafloor, potentially utilizing molecular hydrogen and amino acids adsorbed to mineral surfaces. Its metabolic pathways include the complete Wood&#x2013;Ljungdahl pathway for acetogenesis, and it utilizes enzymes from other partial carbon fixation pathways including the reverse TCA cycle, the CAM light reaction in photosynthesis, and the reductive pentose phosphate cycle. The unusual metabolisms of <italic>Ca.</italic> A. pyornia have been previously undetected in the suboceanic crustal aquifer and may bring new light to the function of this vast ecosystem.</p>
</sec>
<sec id="sec29" sec-type="data-availability">
<title>Data Availability Statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article.</p>
</sec>
<sec id="sec30">
<title>Author Contributions</title>
<p>AS and RM analyzed the genome and metabolic pathways for A. pyornia. FC and MF proposed and guided the subseafloor sampling strategy. RM, FC, and MF provided advising and dissertation support for AS as committee members and advisors. AS wrote the manuscript. All authors provided feedback and edits to the manuscript. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="sec41" sec-type="funding-information">
<title>Funding</title>
<p>Metagenome sequencing was made possible by the Deep Carbon Observatory Census of Deep Life supported by the Alfred P. Sloan Foundation and was performed at the Marine Biological Laboratory (Woods Hole, MA, United States). This work was funded by NASA grant NNX08AO22G and a graduate fellowship from the NSF Center for Dark Energy Biosphere Investigations. The flow cells were funded under J0972A from the U.S. Science Support Program of Joint Oceanographic Institutions.</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="sec31" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack>
<p>We are grateful for the assistance of Mitch Sogin, Susan Huse, Joseph Vineis, Andrew Voorhis, Sharon Grim, and Hilary Morrison at MBL. This is an NSF Center for Dark Energy Biosphere Investigations (C-DEBI) contribution.</p>
</ack>
<sec id="sec32" sec-type="supplementary-material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link xlink:href="https://www.frontiersin.org/articles/10.3389/fmicb.2021.764631/full#supplementary-material" ext-link-type="uri">https://www.frontiersin.org/articles/10.3389/fmicb.2021.764631/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.DOCX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_2.DOCX" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_3.DOCX" id="SM3" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image_1.JPEG" id="SM4" mimetype="image/jpeg" xmlns:xlink="http://www.w3.org/1999/xlink"><label>Supplementary Figure 1</label><caption><p><italic>ffh</italic> gene phylogeny of <italic>Ca</italic>. Acetocimmeria pyornia and closest relatives. The evolutionary history of <italic>Ca</italic>. Acetocimmeria pyornia was inferred using the Maximum Likelihood method based on the Tamura-Nei model in MEGA5 (<xref ref-type="bibr" rid="ref48">Tamura et al., 2011</xref>), with the optimal tree for the <italic>ffh</italic> gene shown. The percentage of 500 replicate trees in which the associated taxa clustered together (bootstrap test) is shown next to the branches if greater than 50% (<xref ref-type="bibr" rid="ref6">Felsenstein, 1985</xref>). Branch lengths are measured in the number of substitutions per site. There were 264 positions in the final dataset. Organisms in green are known acetogens.</p></caption></supplementary-material>
</sec>
<ref-list>
<title>References</title>
<ref id="ref1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arkin</surname> <given-names>A. P.</given-names></name> <name><surname>Cottingham</surname> <given-names>R. W.</given-names></name> <name><surname>Henry</surname> <given-names>C. S.</given-names></name> <name><surname>Harris</surname> <given-names>N. L.</given-names></name> <name><surname>Stevens</surname> <given-names>R. L.</given-names></name> <name><surname>Maslov</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>KBase: The United States Department of Energy Systems Biology Knowledgebase</article-title>. <source>Nat. Biotechnol.</source> <volume>36</volume>:<fpage>566</fpage>. doi: <pub-id pub-id-type="doi">10.1038/nbt.4163</pub-id>, PMID: <pub-id pub-id-type="pmid">29979655</pub-id></citation></ref>
<ref id="ref55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bowers</surname> <given-names>R. M.</given-names></name> <name><surname>Kyrpides</surname> <given-names>N. C.</given-names></name> <name><surname>Stepanauskas</surname> <given-names>R.</given-names></name> <name><surname>Hamilton-Smith</surname> <given-names>M.</given-names></name> <name><surname>Reddy</surname> <given-names>T. B. K.</given-names></name> <name><surname>Schulz</surname> <given-names>F.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea</article-title>. <source>Nat. Biotech</source> <volume>35</volume>, <fpage>725</fpage>&#x2013;<lpage>731</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nbt.3893</pub-id></citation></ref>
<ref id="ref2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brazelton</surname> <given-names>W. J.</given-names></name> <name><surname>Schrenk</surname> <given-names>M. O.</given-names></name> <name><surname>Kelley</surname> <given-names>D. S.</given-names></name> <name><surname>Baross</surname> <given-names>J. A.</given-names></name></person-group> (<year>2006</year>). <article-title>Methane- and sulfur-metabolizing microbial communities dominate the lost City hydrothermal field ecosystem</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>72</volume>, <fpage>6257</fpage>&#x2013;<lpage>6270</lpage>. doi: <pub-id pub-id-type="doi">10.1128/AEM.00574-06</pub-id>, PMID: <pub-id pub-id-type="pmid">16957253</pub-id></citation></ref>
<ref id="ref3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chivian</surname> <given-names>D.</given-names></name> <name><surname>Brodie</surname> <given-names>E.</given-names></name> <name><surname>Alm</surname> <given-names>E.</given-names></name> <name><surname>Culley</surname> <given-names>D.</given-names></name></person-group> (<year>2008</year>). <article-title>Environmental genomics reveals a single-species ecosystem deep within earth</article-title>. <source>Science</source> <volume>322</volume>, <fpage>275</fpage>&#x2013;<lpage>278</lpage>. doi: <pub-id pub-id-type="doi">10.1126/science.1155495</pub-id>, PMID: <pub-id pub-id-type="pmid">18845759</pub-id></citation></ref>
<ref id="ref4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Drake</surname> <given-names>H. L.</given-names></name> <name><surname>Daniel</surname> <given-names>S. L.</given-names></name> <name><surname>K&#x00FC;sel</surname> <given-names>K.</given-names></name> <name><surname>Matthies</surname> <given-names>C.</given-names></name> <name><surname>Kuhner</surname> <given-names>C.</given-names></name> <name><surname>Braus-Stromeyer</surname> <given-names>S.</given-names></name></person-group> (<year>1997</year>). <article-title>Acetogenic bacteria: what are the in situ consequences of their diverse metabolic versatilities?</article-title> <source>Biofactors</source> <volume>6</volume>, <fpage>13</fpage>&#x2013;<lpage>24</lpage>. doi: <pub-id pub-id-type="doi">10.1002/biof.5520060103</pub-id>, PMID: <pub-id pub-id-type="pmid">9233536</pub-id></citation></ref>
<ref id="ref5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Edwards</surname> <given-names>K. J.</given-names></name> <name><surname>Wheat</surname> <given-names>C. G.</given-names></name> <name><surname>Sylvan</surname> <given-names>J. B.</given-names></name></person-group> (<year>2011</year>). <article-title>Under the sea: microbial life in volcanic oceanic crust</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>9</volume>, <fpage>703</fpage>&#x2013;<lpage>712</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nrmicro2647</pub-id>, PMID: <pub-id pub-id-type="pmid">21894169</pub-id></citation></ref>
<ref id="ref6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Felsenstein</surname> <given-names>J.</given-names></name></person-group> (<year>1985</year>). <article-title>Confidence limits on phylogenies: an approach using the bootstrap</article-title>. <source>Evolution</source> <volume>39</volume>, <fpage>783</fpage>&#x2013;<lpage>791</lpage>.</citation></ref>
<ref id="ref7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fisher</surname> <given-names>A. T.</given-names></name> <name><surname>Wheat</surname> <given-names>C. G.</given-names></name> <name><surname>Becker</surname> <given-names>K.</given-names></name> <name><surname>Davis</surname> <given-names>E. E.</given-names></name> <name><surname>Jannasch</surname> <given-names>H.</given-names></name> <name><surname>Schroeder</surname> <given-names>D.</given-names></name> <etal/></person-group>. (<year>2005</year>). <article-title>Scientific and technical design and deployment of long-term subseafloor observatories for hydrogeologic and related experiments, IODP expedition 301, eastern flank of Juan de Fuca ridge 1 and general design</article-title>. <source>Proc. Integr. Ocean Drill Progr.</source> <volume>301</volume>, <fpage>1</fpage>&#x2013;<lpage>39</lpage>. doi: <pub-id pub-id-type="doi">10.2204/iodp.proc.301.103.2005</pub-id></citation></ref>
<ref id="ref8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fonknechten</surname> <given-names>N.</given-names></name> <name><surname>Chaussonnerie</surname> <given-names>S.</given-names></name> <name><surname>Tricot</surname> <given-names>S.</given-names></name> <name><surname>Lajus</surname> <given-names>A.</given-names></name> <name><surname>Andreesen</surname> <given-names>J. R.</given-names></name> <name><surname>Perchat</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Clostridium sticklandii, a specialist in amino acid degradation:revisiting its metabolism through its genome sequence</article-title>. <source>BMC Genomics</source> <volume>11</volume>:<fpage>555</fpage>. doi: <pub-id pub-id-type="doi">10.1186/1471-2164-11-555</pub-id>, PMID: <pub-id pub-id-type="pmid">20937090</pub-id></citation></ref>
<ref id="ref9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fontaine</surname> <given-names>F. E.</given-names></name> <name><surname>Peterson</surname> <given-names>W. H.</given-names></name> <name><surname>McCoy</surname> <given-names>E.</given-names></name> <name><surname>Johnson</surname> <given-names>M. J.</given-names></name> <name><surname>Ritter</surname> <given-names>G. J.</given-names></name></person-group> (<year>1942</year>). <article-title>A new type of glucose fermentation by <italic>clostridium thermoaceticum</italic></article-title>. <source>J. Bacteriol.</source> <volume>43</volume>, <fpage>701</fpage>&#x2013;<lpage>715</lpage>. doi: <pub-id pub-id-type="doi">10.1128/jb.43.6.701-715.1942</pub-id>, PMID: <pub-id pub-id-type="pmid">16560531</pub-id></citation></ref>
<ref id="ref10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Friedrich</surname> <given-names>T.</given-names></name> <name><surname>Scheide</surname> <given-names>D.</given-names></name></person-group> (<year>2000</year>). <article-title>The respiratory complex I of bacteria, archaea and eukarya and its module</article-title>. <source>FEBS Lett.</source> <volume>479</volume>, <fpage>1</fpage>&#x2013;<lpage>5</lpage>. doi: <pub-id pub-id-type="doi">10.1016/S0014-5793(00)01867-6</pub-id>, PMID: <pub-id pub-id-type="pmid">10940377</pub-id></citation></ref>
<ref id="ref11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gascuel</surname> <given-names>O.</given-names></name></person-group> (<year>1997</year>). <article-title>BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data</article-title>. <source>Mol. Biol. Evol.</source> <volume>14</volume>, <fpage>685</fpage>&#x2013;<lpage>695</lpage>. doi: <pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a025808</pub-id>, PMID: <pub-id pub-id-type="pmid">9254330</pub-id></citation></ref>
<ref id="ref12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goris</surname> <given-names>J.</given-names></name> <name><surname>Konstantinidis</surname> <given-names>K. T.</given-names></name> <name><surname>Klappenbach</surname> <given-names>J. A.</given-names></name> <name><surname>Coenye</surname> <given-names>T.</given-names></name> <name><surname>Vandamme</surname> <given-names>P.</given-names></name> <name><surname>Tiedje</surname> <given-names>J. M.</given-names></name></person-group> (<year>2007</year>). <article-title>DNA-DNA hybridization values and their relationship to whole-genome sequence similarities</article-title>. <source>Int. J. Syst. Evol. Microbiol.</source> <volume>57</volume>, <fpage>81</fpage>&#x2013;<lpage>91</lpage>. doi: <pub-id pub-id-type="doi">10.1099/ijs.0.64483-0</pub-id>, PMID: <pub-id pub-id-type="pmid">17220447</pub-id></citation></ref>
<ref id="ref13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heberling</surname> <given-names>C.</given-names></name> <name><surname>Lowell</surname> <given-names>R. P.</given-names></name> <name><surname>Liu</surname> <given-names>L.</given-names></name> <name><surname>Fisk</surname> <given-names>M. R.</given-names></name></person-group> (<year>2010</year>). <article-title>Extent of the microbial biosphere in the oceanic crust</article-title>. <source>Geochem. Geophys. Geosyst.</source> <volume>11</volume>, <fpage>1</fpage>&#x2013;<lpage>15</lpage>. doi: <pub-id pub-id-type="doi">10.1029/2009GC002968</pub-id></citation></ref>
<ref id="ref14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jain</surname> <given-names>C.</given-names></name> <name><surname>Rodriguez-R</surname> <given-names>L. M.</given-names></name> <name><surname>Phillippy</surname> <given-names>A. M.</given-names></name> <name><surname>Konstantinidis</surname> <given-names>K. T.</given-names></name> <name><surname>Aluru</surname> <given-names>S.</given-names></name></person-group> (<year>2018</year>). <article-title>High-throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries</article-title>. <source>Nat. Commun.</source> <volume>9</volume>:<fpage>5114</fpage>. doi: <pub-id pub-id-type="doi">10.1038/s41467-018-07641-9</pub-id>, PMID: <pub-id pub-id-type="pmid">30504855</pub-id></citation></ref>
<ref id="ref17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jungbluth</surname> <given-names>S. P.</given-names></name> <name><surname>del Rio</surname> <given-names>T. G.</given-names></name> <name><surname>Tringe</surname> <given-names>S. G.</given-names></name> <name><surname>Stepanauskas</surname> <given-names>R.</given-names></name> <name><surname>Rapp&#x00E9;</surname> <given-names>M. S.</given-names></name></person-group> (<year>2017</year>). <article-title>Genomic comparisons of a bacterial lineage that inhabits both marine and terrestrial deep subsurface systems</article-title>. <source>PeerJ</source> <volume>5</volume>:<fpage>e3134</fpage>. doi: <pub-id pub-id-type="doi">10.7717/peerj.3134</pub-id>, PMID: <pub-id pub-id-type="pmid">28396823</pub-id></citation></ref>
<ref id="ref18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kallmeyer</surname> <given-names>J.</given-names></name> <name><surname>Pockalny</surname> <given-names>R.</given-names></name> <name><surname>Adhikari</surname> <given-names>R. R.</given-names></name> <name><surname>Smith</surname> <given-names>D. C.</given-names></name> <name><surname>D&#x2019;Hondt</surname> <given-names>S.</given-names></name></person-group> (<year>2012</year>). <article-title>Global distribution of microbial abundance and biomass in subseafloor sediment</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>109</volume>, <fpage>16213</fpage>&#x2013;<lpage>16216</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.1203849109</pub-id>, PMID: <pub-id pub-id-type="pmid">22927371</pub-id></citation></ref>
<ref id="ref19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kanehisa</surname> <given-names>M.</given-names></name> <name><surname>Sato</surname> <given-names>Y.</given-names></name> <name><surname>Morishima</surname> <given-names>K.</given-names></name></person-group> (<year>2016</year>). <article-title>BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences</article-title>. <source>J. Mol. Biol.</source> <volume>428</volume>, <fpage>726</fpage>&#x2013;<lpage>731</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.jmb.2015.11.006</pub-id>, PMID: <pub-id pub-id-type="pmid">26585406</pub-id></citation></ref>
<ref id="ref21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laczny</surname> <given-names>C. C.</given-names></name> <name><surname>Pinel</surname> <given-names>N.</given-names></name> <name><surname>Vlassis</surname> <given-names>N.</given-names></name> <name><surname>Wilmes</surname> <given-names>P.</given-names></name></person-group> (<year>2014</year>). <article-title>Alignment-free visualization of metagenomic data by nonlinear dimension reduction</article-title>. <source>Sci. Rep.</source> <volume>4</volume>:<fpage>4516</fpage>. doi: <pub-id pub-id-type="doi">10.1038/srep04516</pub-id>, PMID: <pub-id pub-id-type="pmid">24682077</pub-id></citation></ref>
<ref id="ref22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lever</surname> <given-names>M. A.</given-names></name></person-group> (<year>2012</year>). <article-title>Acetogenesis in the energy-starved deep biosphere-a paradox?</article-title> <source>Front. Microbiol.</source> <volume>2</volume>:<fpage>284</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fmicb.2011.00284</pub-id>, PMID: <pub-id pub-id-type="pmid">22347874</pub-id></citation></ref>
<ref id="ref23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lever</surname> <given-names>M.</given-names></name> <name><surname>Rouxel</surname> <given-names>O.</given-names></name> <name><surname>Alt</surname> <given-names>J.</given-names></name> <name><surname>Shimizu</surname> <given-names>N.</given-names></name></person-group> (<year>2013</year>). <article-title>Evidence for microbial carbon and sulfur cycling in deeply buried ridge flank basalt</article-title>. <source>Science</source> <volume>339</volume>, <fpage>1305</fpage>&#x2013;<lpage>1308</lpage>. doi: <pub-id pub-id-type="doi">10.1126/science.1229240</pub-id>, PMID: <pub-id pub-id-type="pmid">23493710</pub-id></citation></ref>
<ref id="ref24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>H.-T.</given-names></name> <name><surname>Amend</surname> <given-names>J. P.</given-names></name> <name><surname>LaRowe</surname> <given-names>D. E.</given-names></name> <name><surname>Bingham</surname> <given-names>J.-P.</given-names></name> <name><surname>Cowen</surname> <given-names>J. P.</given-names></name></person-group> (<year>2015</year>). <article-title>Dissolved amino acids in oceanic basaltic basement fluids</article-title>. <source>Geochim. Cosmochim. Acta</source> <volume>356</volume>, <fpage>155</fpage>&#x2013;<lpage>159</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.gca.2015.04.044</pub-id></citation></ref>
<ref id="ref54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>H.-T.</given-names></name> <name><surname>Cowen</surname> <given-names>J. P.</given-names></name> <name><surname>Olson</surname> <given-names>E. J.</given-names></name> <name><surname>Amend</surname> <given-names>J. P.</given-names></name> <name><surname>Lilley</surname> <given-names>M. D.</given-names></name></person-group> (<year>2012</year>). <article-title>Inorganic chemistry, gas compositions and dissolved organic carbon in fluids from sedimented young basaltic crust on the Juan de Fuca Ridge flanks</article-title>. <source>Geochim. Cosmochim. Acta</source> <volume>85</volume>, <fpage>213</fpage>&#x2013;<lpage>227</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.gca.2012.02.017</pub-id></citation></ref>
<ref id="ref25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>H.-T.</given-names></name> <name><surname>Cowen</surname> <given-names>J. P.</given-names></name> <name><surname>Olson</surname> <given-names>E. J.</given-names></name> <name><surname>Lilley</surname> <given-names>M. D.</given-names></name> <name><surname>Jungbluth</surname> <given-names>S. P.</given-names></name> <name><surname>Wilson</surname> <given-names>S. T.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Dissolved hydrogen and methane in the oceanic basaltic biosphere</article-title>. <source>Earth Planet. Sci. Lett.</source> <volume>405</volume>, <fpage>62</fpage>&#x2013;<lpage>73</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.epsl.2014.07.037</pub-id></citation></ref>
<ref id="ref26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martin</surname> <given-names>W. F.</given-names></name></person-group> (<year>2020</year>). <article-title>The acetyl CoA pathway and origins</article-title>. <source>Front. Microbiol.</source> <volume>11</volume>:<fpage>817</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fmicb.2020.00817</pub-id>, PMID: <pub-id pub-id-type="pmid">32655499</pub-id></citation></ref>
<ref id="ref27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mason</surname> <given-names>O. U.</given-names></name> <name><surname>Di Meo-Savoie</surname> <given-names>C. A.</given-names></name> <name><surname>Van Nostrand</surname> <given-names>J. D.</given-names></name> <name><surname>Zhou</surname> <given-names>J.</given-names></name> <name><surname>Fisk</surname> <given-names>M. R.</given-names></name> <name><surname>Giovannoni</surname> <given-names>S. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Prokaryotic diversity, distribution, and insights into their role in biogeochemical cycling in marine basalts</article-title>. <source>ISME J.</source> <volume>3</volume>, <fpage>231</fpage>&#x2013;<lpage>242</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ismej.2008.92</pub-id>, PMID: <pub-id pub-id-type="pmid">18843298</pub-id></citation></ref>
<ref id="ref28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mayhew</surname> <given-names>L. E.</given-names></name> <name><surname>Ellison</surname> <given-names>E. T.</given-names></name> <name><surname>McCollom</surname> <given-names>T. M.</given-names></name> <name><surname>Trainor</surname> <given-names>T. P.</given-names></name> <name><surname>Templeton</surname> <given-names>A. S.</given-names></name></person-group> (<year>2013</year>). <article-title>Hydrogen generation from low-temperature water&#x2013;rock reactions</article-title>. <source>Nat. Geosci.</source> <volume>6</volume>, <fpage>478</fpage>&#x2013;<lpage>484</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ngeo1825</pub-id></citation></ref>
<ref id="ref29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McCarthy</surname> <given-names>M. D.</given-names></name> <name><surname>Beaupr&#x00E9;</surname> <given-names>S. R.</given-names></name> <name><surname>Walker</surname> <given-names>B. D.</given-names></name> <name><surname>Voparil</surname> <given-names>I.</given-names></name> <name><surname>Guilderson</surname> <given-names>T. P.</given-names></name> <name><surname>Druffel</surname> <given-names>E. R. M.</given-names></name></person-group> (<year>2010</year>). <article-title>Chemosynthetic origin of 14C-depleted dissolved organic matter in a ridge-flank hydrothermal system</article-title>. <source>Nat. Geosci.</source> <volume>4</volume>, <fpage>32</fpage>&#x2013;<lpage>36</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ngeo1015</pub-id></citation></ref>
<ref id="ref30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nisman</surname> <given-names>B.</given-names></name></person-group> (<year>1954</year>). <article-title>The STICKLAND reaction</article-title>. <source>Bacteriol. Rev.</source> <volume>18</volume>, <fpage>16</fpage>&#x2013;<lpage>42</lpage>. doi: <pub-id pub-id-type="doi">10.1128/br.18.1.16-42.1954</pub-id>, PMID: <pub-id pub-id-type="pmid">13140081</pub-id></citation></ref>
<ref id="ref31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nitschke</surname> <given-names>W.</given-names></name> <name><surname>Russell</surname> <given-names>M. J.</given-names></name></person-group> (<year>2013</year>). <article-title>Beating the acetyl coenzyme A-pathway to the origin of life</article-title>. <source>Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci.</source> <volume>368</volume>:<fpage>20120258</fpage>. doi: <pub-id pub-id-type="doi">10.1098/rstb.2012.0258</pub-id>, PMID: <pub-id pub-id-type="pmid">23754811</pub-id></citation></ref>
<ref id="ref32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nonaka</surname> <given-names>H.</given-names></name> <name><surname>Keresztes</surname> <given-names>G.</given-names></name> <name><surname>Shinoda</surname> <given-names>Y.</given-names></name> <name><surname>Ikenaga</surname> <given-names>Y.</given-names></name> <name><surname>Abe</surname> <given-names>M.</given-names></name> <name><surname>Naito</surname> <given-names>K.</given-names></name> <etal/></person-group>. (<year>2006</year>). <article-title>Complete genome sequence of the dehalorespiring bacterium <italic>Desulfitobacterium hafniense</italic> Y51 and comparison with <italic>Dehalococcoides ethenogenes</italic> 195</article-title>. <source>J. Bacteriol.</source> <volume>188</volume>, <fpage>2262</fpage>&#x2013;<lpage>2274</lpage>. doi: <pub-id pub-id-type="doi">10.1128/JB.188.6.2262-2274.2006</pub-id>, PMID: <pub-id pub-id-type="pmid">16513756</pub-id></citation></ref>
<ref id="ref33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Orcutt</surname> <given-names>B. N.</given-names></name> <name><surname>Bach</surname> <given-names>W.</given-names></name> <name><surname>Becker</surname> <given-names>K.</given-names></name> <name><surname>Fisher</surname> <given-names>A. T.</given-names></name> <name><surname>Hentscher</surname> <given-names>M.</given-names></name> <name><surname>Toner</surname> <given-names>B. M.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Colonization of subsurface microbial observatories deployed in young ocean crust</article-title>. <source>ISME J.</source> <volume>5</volume>, <fpage>692</fpage>&#x2013;<lpage>703</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ismej.2010.157</pub-id>, PMID: <pub-id pub-id-type="pmid">21107442</pub-id></citation></ref>
<ref id="ref35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parks</surname> <given-names>D. H.</given-names></name> <name><surname>Imelfort</surname> <given-names>M.</given-names></name> <name><surname>Skennerton</surname> <given-names>C. T.</given-names></name> <name><surname>Hugenholtz</surname> <given-names>P.</given-names></name> <name><surname>Tyson</surname> <given-names>G. W.</given-names></name></person-group> (<year>2015</year>). <article-title>CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes</article-title>. <source>Genome Res.</source> <volume>25</volume>, <fpage>1043</fpage>&#x2013;<lpage>1055</lpage>. doi: <pub-id pub-id-type="doi">10.1101/gr.186072.114</pub-id>, PMID: <pub-id pub-id-type="pmid">25977477</pub-id></citation></ref>
<ref id="ref36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peng</surname> <given-names>Y.</given-names></name> <name><surname>Leung</surname> <given-names>H. C. M.</given-names></name> <name><surname>Yiu</surname> <given-names>S. M.</given-names></name> <name><surname>Chin</surname> <given-names>F. Y. L.</given-names></name></person-group> (<year>2012</year>). <article-title>IDBA-UD: A de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth</article-title>. <source>Bioinformatics</source> <volume>28</volume>, <fpage>1420</fpage>&#x2013;<lpage>1428</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/bts174</pub-id>, PMID: <pub-id pub-id-type="pmid">22495754</pub-id></citation></ref>
<ref id="ref37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pierce</surname> <given-names>E.</given-names></name> <name><surname>Xie</surname> <given-names>G.</given-names></name> <name><surname>Barabote</surname> <given-names>R.</given-names></name> <name><surname>Saunders</surname> <given-names>E.</given-names></name> <name><surname>Han</surname> <given-names>C.</given-names></name> <name><surname>Detter</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>The complete genome sequence of <italic>Moorella thermoacetica</italic> (f. <italic>clostridium thermoaceticum</italic>)</article-title>. <source>Environ. Microbiol.</source> <volume>10</volume>, <fpage>2550</fpage>&#x2013;<lpage>2573</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1462-2920.2008.01679.x</pub-id>, PMID: <pub-id pub-id-type="pmid">18631365</pub-id></citation></ref>
<ref id="ref38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ragsdale</surname> <given-names>S. W.</given-names></name></person-group> (<year>2008</year>). <article-title>Enzymology of the wood-Ljungdahl pathway of acetogenesis</article-title>. <source>Ann. N. Y. Acad. Sci.</source> <volume>1125</volume>, <fpage>129</fpage>&#x2013;<lpage>136</lpage>. doi: <pub-id pub-id-type="doi">10.1196/annals.1419.015</pub-id>, PMID: <pub-id pub-id-type="pmid">18378591</pub-id></citation></ref>
<ref id="ref39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ragsdale</surname> <given-names>S. W.</given-names></name> <name><surname>Pierce</surname> <given-names>E.</given-names></name></person-group> (<year>2008</year>). <article-title>Acetogenesis and the wood-Ljungdahl pathway of CO<sub>2</sub> fixation</article-title>. <source>Biochim. Biophys. Acta, Proteins Proteomics</source> <volume>1784</volume>, <fpage>1873</fpage>&#x2013;<lpage>1898</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.bbapap.2008.08.012</pub-id>, PMID: <pub-id pub-id-type="pmid">18801467</pub-id></citation></ref>
<ref id="ref40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saitou</surname> <given-names>N.</given-names></name> <name><surname>Nei</surname> <given-names>M.</given-names></name></person-group> (<year>1987</year>). <article-title>The neighbor-joining method: a new method for reconstructing phylogenetic trees</article-title>. <source>Mol. Biol. Evol.</source> <volume>4</volume>, <fpage>406</fpage>&#x2013;<lpage>425</lpage>. doi: <pub-id pub-id-type="doi">10.1093/oxfordjournals.molbev.a040454</pub-id>, PMID: <pub-id pub-id-type="pmid">3447015</pub-id></citation></ref>
<ref id="ref41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schuchmann</surname> <given-names>K.</given-names></name> <name><surname>M&#x00FC;ller</surname> <given-names>V.</given-names></name></person-group> (<year>2014</year>). <article-title>Autotrophy at the thermodynamic limit of life: a model for energy conservation in acetogenic bacteria</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>12</volume>, <fpage>809</fpage>&#x2013;<lpage>821</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nrmicro3365</pub-id>, PMID: <pub-id pub-id-type="pmid">25383604</pub-id></citation></ref>
<ref id="ref42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Simpson</surname> <given-names>J. T.</given-names></name> <name><surname>Durbin</surname> <given-names>R.</given-names></name></person-group> (<year>2012</year>). <article-title>Efficient de novo assembly of large genomes using compressed data structures</article-title>. <source>Genome Res.</source> <volume>22</volume>, <fpage>549</fpage>&#x2013;<lpage>556</lpage>. doi: <pub-id pub-id-type="doi">10.1101/gr.126953.111</pub-id>, PMID: <pub-id pub-id-type="pmid">22156294</pub-id></citation></ref>
<ref id="ref43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname> <given-names>A. R.</given-names></name> <name><surname>Fisk</surname> <given-names>M. R.</given-names></name> <name><surname>Thurber</surname> <given-names>A. R.</given-names></name> <name><surname>Flores</surname> <given-names>G. E.</given-names></name> <name><surname>Mason</surname> <given-names>O. U.</given-names></name> <name><surname>Popa</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Deep crustal communities of the Juan de Fuca ridge are governed by mineralogy</article-title>. <source>Geomicrobiol J.</source> <volume>451</volume>, <fpage>147</fpage>&#x2013;<lpage>156</lpage>. doi: <pub-id pub-id-type="doi">10.1080/01490451.2016.1155001</pub-id></citation></ref>
<ref id="ref44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname> <given-names>A. R.</given-names></name> <name><surname>Kieft</surname> <given-names>B.</given-names></name> <name><surname>Mueller</surname> <given-names>R.</given-names></name> <name><surname>Fisk</surname> <given-names>M. R.</given-names></name> <name><surname>Mason</surname> <given-names>O. U.</given-names></name> <name><surname>Popa</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2019</year>). <article-title>Carbon fixation and energy metabolisms of a subseafloor olivine biofilm</article-title>. <source>ISME J.</source> <volume>13</volume>, <fpage>1737</fpage>&#x2013;<lpage>1749</lpage>. doi: <pub-id pub-id-type="doi">10.1038/s41396-019-0385-0</pub-id>, PMID: <pub-id pub-id-type="pmid">30867546</pub-id></citation></ref>
<ref id="ref45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname> <given-names>A.</given-names></name> <name><surname>Popa</surname> <given-names>R.</given-names></name> <name><surname>Fisk</surname> <given-names>M.</given-names></name> <name><surname>Nielsen</surname> <given-names>M.</given-names></name> <name><surname>Wheat</surname> <given-names>C. G.</given-names></name> <name><surname>Jannasch</surname> <given-names>H. W.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>In situ enrichment of ocean crust microbes on igneous minerals and glasses using an osmotic flow-through device</article-title>. <source>Geochem. Geophys. Geosyst.</source> <volume>12</volume>, <fpage>1</fpage>&#x2013;<lpage>19</lpage>. doi: <pub-id pub-id-type="doi">10.1029/2010GC003424</pub-id></citation></ref>
<ref id="ref46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sunagawa</surname> <given-names>S.</given-names></name> <name><surname>Mende</surname> <given-names>D. R.</given-names></name> <name><surname>Zeller</surname> <given-names>G.</given-names></name> <name><surname>Izquierdo-Carrasco</surname> <given-names>F.</given-names></name> <name><surname>Berger</surname> <given-names>S.</given-names></name> <name><surname>Kultima</surname> <given-names>J. R.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Metagenomic species profiling using universal phylogenetic marker genes</article-title>. <source>Nat. Methods</source> <volume>10</volume>, <fpage>1196</fpage>&#x2013;<lpage>1199</lpage>. doi: <pub-id pub-id-type="doi">10.1038/nmeth.2693</pub-id>, PMID: <pub-id pub-id-type="pmid">24141494</pub-id></citation></ref>
<ref id="ref47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Takai</surname> <given-names>K.</given-names></name> <name><surname>Gamo</surname> <given-names>T.</given-names></name> <name><surname>Tsunogai</surname> <given-names>U.</given-names></name> <name><surname>Nakayama</surname> <given-names>N.</given-names></name> <name><surname>Hirayama</surname> <given-names>H.</given-names></name> <name><surname>Nealson</surname> <given-names>K. H.</given-names></name> <etal/></person-group>. (<year>2004</year>). <article-title>Geochemical and microbiological evidence for a hydrogen-based, hyperthermophilic subsurface lithoautotrophic microbial ecosystem (HyperSLiME) beneath an active deep-sea hydrothermal field</article-title>. <source>Extremophiles</source> <volume>8</volume>, <fpage>269</fpage>&#x2013;<lpage>282</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s00792-004-0386-3</pub-id>, PMID: <pub-id pub-id-type="pmid">15309563</pub-id></citation></ref>
<ref id="ref48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tamura</surname> <given-names>K.</given-names></name> <name><surname>Peterson</surname> <given-names>D.</given-names></name> <name><surname>Peterson</surname> <given-names>N.</given-names></name> <name><surname>Stecher</surname> <given-names>G.</given-names></name> <name><surname>Nei</surname> <given-names>M.</given-names></name> <name><surname>Kumar</surname> <given-names>S.</given-names></name></person-group> (<year>2011</year>). <article-title>MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods</article-title>. <source>Mol. Biol. Evol.</source> <volume>28</volume>, <fpage>2731</fpage>&#x2013;<lpage>2739</lpage>. doi: <pub-id pub-id-type="doi">10.1093/molbev/msr121</pub-id>, PMID: <pub-id pub-id-type="pmid">21546353</pub-id></citation></ref>
<ref id="ref49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Th&#x00F3;r Marteinsson</surname> <given-names>V.</given-names></name> <name><surname>R&#x00FA;narsson</surname> <given-names>A.</given-names></name> <name><surname>Stef&#x00E1;nsson</surname> <given-names>A.</given-names></name> <name><surname>Thorsteinsson</surname> <given-names>T.</given-names></name> <name><surname>J&#x00F3;hannesson</surname> <given-names>T.</given-names></name> <name><surname>Magn&#x00FA;sson</surname> <given-names>S. H.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Microbial communities in the subglacial waters of the Vatnaj&#x00F6;kull ice cap, Iceland</article-title>. <source>ISME J.</source> <volume>7</volume>, <fpage>427</fpage>&#x2013;<lpage>437</lpage>. doi: <pub-id pub-id-type="doi">10.1038/ismej.2012.97</pub-id>, PMID: <pub-id pub-id-type="pmid">22975882</pub-id></citation></ref>
<ref id="ref50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ver Eecke</surname> <given-names>H. C.</given-names></name> <name><surname>Butterfield</surname> <given-names>D. A.</given-names></name> <name><surname>Huber</surname> <given-names>J. A.</given-names></name> <name><surname>Lilley</surname> <given-names>M. D.</given-names></name> <name><surname>Olson</surname> <given-names>E. J.</given-names></name> <name><surname>Roe</surname> <given-names>K. K.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Hydrogen-limited growth of hyperthermophilic methanogens at deep-sea hydrothermal vents</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>109</volume>, <fpage>13674</fpage>&#x2013;<lpage>13679</lpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.1206632109</pub-id>, PMID: <pub-id pub-id-type="pmid">22869718</pub-id></citation></ref>
<ref id="ref51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vignais</surname> <given-names>P. M.</given-names></name> <name><surname>Billoud</surname> <given-names>B.</given-names></name></person-group> (<year>2007</year>). <article-title>Occurrence, classification, and biological function of hydrogenases: an overview</article-title>. <source>Chem. Rev.</source> <volume>107</volume>, <fpage>4206</fpage>&#x2013;<lpage>4272</lpage>. doi: <pub-id pub-id-type="doi">10.1021/cr050196r</pub-id>, PMID: <pub-id pub-id-type="pmid">17927159</pub-id></citation></ref>
<ref id="ref52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Edwards</surname> <given-names>K. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Bacterial and archaeal DNA extracted from inoculated experiments: implication for the optimization of DNA extraction from deep-sea basalts</article-title>. <source>Geomicrobiol J.</source> <volume>26</volume>, <fpage>463</fpage>&#x2013;<lpage>469</lpage>. doi: <pub-id pub-id-type="doi">10.1080/01490450903060814</pub-id></citation></ref>
<ref id="ref53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Whitman</surname> <given-names>W. B.</given-names></name> <name><surname>Coleman</surname> <given-names>D. C.</given-names></name> <name><surname>Wiebe</surname> <given-names>W. J.</given-names></name></person-group> (<year>1998</year>). <article-title>Prokaryotes: the unseen majority</article-title>. <source>Proc. Natl. Acad. Sci.</source> <volume>95</volume>:<fpage>6578</fpage>. doi: <pub-id pub-id-type="doi">10.1073/pnas.95.12.6578</pub-id>, PMID: <pub-id pub-id-type="pmid">9618454</pub-id></citation></ref></ref-list>
</back>
</article>