Introduction
Microalgae have garnered extensive interest as renewable fuel feedstocks due to their high production potential relative to terrestrial crops, and unique cultivation capacity on non-arable lands (Wijffels and Barbosa, 2010; Davis et al., 2011). The oleaginous chlorophyte Chlorella vulgaris represents a promising model microalgal system and production host, due to its ability to synthesize and accumulate large quantities of fuel intermediates in the form of storage lipids (Guarnieri et al., 2011, 2012; Gerken et al., 2013; Griffiths et al., 2014; Zuñiga et al., 2016). Recent omic analyses have identified transcriptional, post-transcriptional and -translational mechanisms governing lipid accumulation in this alga (Guarnieri et al., 2011, 2013), including active protein nitrosylation (Henard et al., 2017). Here we report the draft nuclear genome and annotation of C. vulgaris UTEX 395.
Materials and methods
Cultivation and genomic DNA isolation
For genomic DNA isolation C. vulgaris UTEX 395 was grown photoautotrophically to exponential phase in Bold's Basal Media, under constant illumination (200 μE m−2 s−1 white fluorescent light), and supplemented with 2% CO2/air, as described previously (Guarnieri et al., 2011, 2013). Genomic DNA was extracted following the protocol adapted from Varela-Alvarez et al. (2006).
Genome sequencing and assembly
Sequencing was performed using Illumina HiSeq 2000 technology with 108 cycles. 171,758,456 paired-end (SIPES) reads were trimmed to an error rate of <1:100, then trimmed until no ambiguous nucleotides remain; reads shorter than 20 nucleotides were discarded, retaining 168,611,711 reads, of which 165,874,962 remained as pairs. Resultant reads were assembled using a DeBruijn method; 113 scaffolds were generated at ≥1,000x depth of coverage, 24 of which were longer than 100 kb and 566 of which were 20–100 kb, ultimately generating a total assembly size of 37.34 Mb, with a 61.5% GC content. This represents the smallest nuclear genome size and lowest GC content reported to date for a sequenced Chlorella species (Supplemental Table 1).
Genome annotation
Transcript prediction was conducted using Maker (Cantarel et al., 2008; Campbell et al., 2014). Transcripts were six-frame translated into protein sequences and functionally annotated with EC, GO and InterProScan identifiers using two approaches. First, a bidirectional BLASTp against SwissProt sequences was carried out and paralogs were identified using BLASTclust. Secondly, InterProScan and PRIAM analyses with gene and genome-specific profiles were conducted.
To facilitate refined annotation and comprehensive pathway mapping of C. vulgaris, a draft nuclear genome sequence was generated and integrated with previously acquired de novo transcriptomic datasets (Guarnieri et al., 2011). 7,100 transcripts were predicted from the C. vulgaris genome, resulting in 6,056 annotated gene models. Genomic queries identified complete gene sets encoding fatty acid and triacylglyceride biosynthetic pathways. The nitrogen assimilation inventory includes genes for nitrate/nitrite transporters and reductases. The genome also encodes meiosis-associated DMC1 and Rad51 DNA recombinase homologs (Fanning et al., 2006; Broderick et al., 2010), offering a possibility that sexual mating may occur in this microalga. Genes for the synthesis of the global stress response alarmone, guanine tetraphosphate (ppGpp) (Takahashi et al., 2004; Tozawa and Nomura, 2011), were also identified. Combined, these genetic pathways will enable potential markerless strain-engineering strategies targeting lipid accumulation in the absence of stress induction, ultimately facilitating the development of robust, deployment-viable microalgae for cost-competitive biofuel production.
Direct link to deposited data and information to users
This whole-genome project has been deposited at DDBJ/EMBL/GenBank under the accession LDKB00000000. The version described in this paper is version LDKB01000000. Additional details can be found at http://www.nrel.gov/biomass/proj_microalgal_biofuels.html and http://chlorella.genomeprojectsolutions-databases.com.
Statements
Author contributions
The work was designed by EK, MB, KZ, and MG. MG directed wet lab analyses. JB directed genome assembly. JL and CH directed genome annotation and pathway mapping.
Acknowledgments
This work was supported by National Renewable Energy Laboratory, Laboratory-Directed Research and Development (LDRD) grants 06511103 and 06511301 and by the U.S. Department of Energy, Office of Science, Office of Biological & Environmental Research under Award Number DE-SC-0012658 and Office of Energy Efficiency and Renewable Energy (EERE) under Agreement No. 22000. The views and opinions of the authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof. Neither the United States Government nor any agency thereof, nor any of their employees, makes any warranty, expressed or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. We kindly thank Robert Stiles for assistance in genome assembly.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbioe.2018.00037/full#supplementary-material
References
1
BroderickS.RehmetK.ConcannonC.NasheuerH. P. (2010). Eukaryotic single-stranded DNA binding proteins: central factors in genome stability. Subcell. Biochem.50, 143–163. 10.1007/978-90-481-3471-7_8
2
CampbellM. S.HoltC.MooreB.YandellM. (2014). Genome annotation and curation using MAKER and MAKER-P. Curr. Protoc. Bioinformat.48, 4.11.11–14.11.39. 10.1002/0471250953.bi0411s48
3
CantarelB. L.KrofI.RobbS. M.ParraG.RossE.MooreB.et al. (2008). MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res.18, 188–196. 10.1101/gr.6743907
4
DavisR.AdenA.PienkosP. T. (2011). Techno-economic analysis of autotrophic microalgae for fuel production. Appl. Energy88, 3524–3531. 10.1016/j.apenergy.2011.04.018
5
FanningE.KlimovichV.NagerA. R. (2006). A dynamic model for replication protein A (RPA) function in DNA processing pathways. Nucleid. Acids Res.34, 4126–4137. 10.1093/nar/gkl550
6
GerkenH. G.DonohoeB.KnoshaugE. P. (2013). Enzymatic cell wall degradation of Chlorella vulgaris and other microalgae for biofuels production. Planta237, 239–253. 10.1007/s00425-012-1765-0
7
GriffithsM. J.van HilleR. P.HarrisonS. T. (2014). The effect of nitrogen limitation on lipid productivity and cell composition in Chlorella vulgaris. Appl. Microbiol. Biotechnol.98, 2345–2356. 10.1007/s00253-013-5442-4
8
GuarnieriM. T.LaruensL. M. L.KnoshaugE. P.ChouY. C.DonohoeB. S.PienkosP. T. (2012). Complex Systems Engineering: A Case Study for an Unsequenced Microalga.Hoboken, NJ: John Wiley & Sons, Inc.
9
GuarnieriM. T.NagA.SmolinskiS. L.DarzinsA.SeibertM.PienkosP. T. (2011). Examination of triacylglycerol biosynthetic pathways via de novo transcriptomic and proteomic analyses in an unsequenced microalga. PLoS ONE6:e25851. 10.1371/journal.pone.0025851
10
GuarnieriM. T.NagA.YangS.PienkosP. T. (2013). Proteomic analysis of Chlorella vulgaris: potential targets for enhanced lipid accumulation. J. Proteomics93, 245–253. 10.1016/j.jprot.2013.05.025
11
HenardC. A.GuarnieriM. T.KnoshaugE. P. (2017). The Chlorella vulgaris S-Nitrosoproteome under nitrogen-replete and -deplete conditions. Front. Bioeng. Biotechnol.4:100. 10.3389/fbioe.2016.00100
12
TakahashiK.KasaiK.OchiK. (2004). Identification of the bacterial alarmone guanosine 5′-diphosphate 3′-diphosphate (ppGpp) in plants. Proc. Natl. Acad. Sci. U.S.A.101, 4320–4324. 10.1073/pnas.0308555101
13
TozawaY.NomuraY. (2011). Signalling by the global regulatory molecule ppGpp in bacteria and choloplasts of land plants. Plant Biol.13, 699–709. 10.1111/j.1438-8677.2011.00484.x
14
Varela-AlvarezE.AndreakisN.Lago-LestonA.PearsonG. A.SerraoE. A.ProcacciniG.et al. (2006). Genomic DNA isolation from green and brown algae (Caulerpales and Fucales) for microsatellite library construction. J. Phycol.42, 741–745. 10.1111/j.1529-8817.2006.00218.x
15
WijffelsR. H.BarbosaM. J. (2010). An outlook on microalgal biofuels. Science329, 796–799. 10.1126/science.1189003
16
ZuñigaC.LiC. T.HuelsmanT.LeveringJ.ZielinskiD. C.McConnellB. O.et al. (2016). Genome-scale metabolic model for the green alga Chlorella vulgaris UTEX395 accurately predicts phenotypes under autotrophic, heterotrophic, and mixotrophic growth conditions. Plant Phys. 172, 589–602. 10.1104/pp.16.00593
Summary
Keywords
microalgae, biofuels, lipids, genome, Chlorella vulgaris
Citation
Guarnieri MT, Levering J, Henard CA, Boore JL, Betenbaugh MJ, Zengler K and Knoshaug EP (2018) Genome Sequence of the Oleaginous Green Alga, Chlorella vulgaris UTEX 395. Front. Bioeng. Biotechnol. 6:37. doi: 10.3389/fbioe.2018.00037
Received
04 October 2017
Accepted
19 March 2018
Published
05 April 2018
Volume
6 - 2018
Edited by
Qiang Wang, Institute of Hydrobiology (Chinese Academy of Sciences), China
Reviewed by
Jianhua Fan, East China University of Science and Technology, China; Yingchun Wang, Chinese Academy of Sciences, China
Updates
Copyright
© 2018 Guarnieri, Levering, Henard, Boore, Betenbaugh, Zengler and Knoshaug.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Michael T. Guarnieri michael.guarnieri@nrel.gov
This article was submitted to Bioenergy and Biofuels, a section of the journal Frontiers in Bioengineering and Biotechnology
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.