Front. Microbiol., 22 January 2016

Novel Virophages Discovered in a Freshwater Lake in China

Chaowen Gong1,2†, Weijia Zhang1,2†, Xuewen Zhou1,2, Hongming Wang1,2, Guowei Sun1,2, Jinzhou Xiao1,2, Yingjie Pan1,2, Shuling Yan1,3 and Yongjie Wang1,2*
  • 1College of Food Science and Technology, Shanghai Ocean University, Shanghai, China
  • 2Laboratory of Quality and Safety Risk Assessment for Aquatic Products on Storage and Preservation, Ministry of Agriculture, Shanghai, China
  • 3Institute of Biochemistry and Molecular Cell Biology, University of Goettingen, Goettingen, Germany

Virophages are small double-stranded DNA viruses that are parasites of giant DNA viruses that infect unicellular eukaryotes. Here we identify a novel group of virophages, named Dishui Lake virophages (DSLVs) that were discovered in Dishui Lake (DSL): an artificial freshwater lake in Shanghai, China. Based on PCR and metagenomic analysis, the complete genome of DSLV1 was found to be circular and 28,788 base pairs in length, with a G+C content 43.2%, and 28 predicted open reading frames (ORFs). Fifteen of the DSLV1 ORFs have sequence similarity to known virophages. Two DSLV1 ORFs exhibited sequence similarity to that of prasinoviruses (Phycodnaviridae) and chloroviruses (Phycodnaviridae), respectively, suggesting horizontal gene transfer occurred between these large algal DNA viruses and DSLV1. 46 other virophages-related contigs were also obtained, including six homologous major capsid protein (MCP) gene. Phylogenetic analysis of these MCPs showed that DSLVs are closely related to OLV (Organic Lake virophage) and YSLVs (Yellowstone Lake virophages), especially to YSLV3, except for YSLV7. These results indicate that freshwater ecotopes are the hotbed for discovering novel virophages as well as understanding their diversity and properties.


Virophages are small double stranded DNA (dsDNA) viruses that act as parasites of giant DNA viruses (La Scola et al., 2008; Fischer and Suttle, 2011; Yau et al., 2011; Gaia et al., 2014). Virophages are considered parasitic because they co-infect unicellular eukaryotes along with large DNA viruses, but are dependent on the large DNA virus to replicate which results in the deformation and abortion of the large DNA viruses (La Scola et al., 2008). The first virophage, Sputnik, was isolated in conjunction with the giant DNA virus mamavirus, a relative of mimivirus, from a water-cooling tower in Paris, France (La Scola et al., 2008). Several other virophages were subsequently identified, including Mavirus, a virophage infecting Cafeteria roenbergensis virus (CroV), and Zamilon, a close relative of Sputnik, which was isolated with the Mimiviridae related virus Mont1 (Fischer and Suttle, 2011; Gaia et al., 2014). In 2015, the viral family of Lavidaviridae was proposed for virophages, which currently contains two proposed genera: Sputnikvirus and Mavirus (Krupovic et al., 2016).

More recently, metagenomic analyses have been utilized to discover diverse and unique virophages from large datasets worldwide. Seven complete genomes of Yellowstone Lake virophages (YSLVs) were assembled from Yellowstone Lake metagenomic datasets (Zhou et al., 2013, 2015). The Organic Lake virophage (OLV) was identified from a hypersaline meromictic lake in Antarctica and was considered as an essential manipulator for the carbon flux and energy cycle through the ecosystem (Yau et al., 2011). A nearly complete genome of Ace Lake Mavirus (ALM) was assembled from Ace Lake metagenomic datasets (Zhou et al., 2013). Early this year, rumen virophages (RVPs), a new group of hybrid virophages of virophage-Polinton, were discovered in a sheep rumen metagenome (Yutin et al., 2015). Interestingly, eukaryotic Polinton transposons appeared to bear viral characteristics (Krupovic et al., 2014) and to be involved in the evolution of eukaryotic virus, transposon, and plasmid (Krupovic and Koonin, 2015). A cryoconite virophage was also assembled from dsDNA viromes from cryoconite hole ecosystems of Svalbard and the Greenland Ice Sheet (Bellas et al., 2015). To date, diverse virophages have been discovered in Europe (La Scola et al., 2008; Desnues et al., 2012; Gaia et al., 2014; Bellas et al., 2015), the Americas (Fischer and Suttle, 2011; Zhou et al., 2013, 2015; Campos et al., 2014), and even Antarctica (Yau et al., 2011; Zhou et al., 2013; Zablocki et al., 2014). However, reports of virophages are lacking in Asia, Africa, and Australia.

In this study, we use PCR and metagenomic analyses to shed light on the diversity and distribution of virophages in freshwater ecosystems in China by examining water samples from lakes and rivers in East China. From these data, a novel group of virophages were identified in Dishui Lake (DSL), Shanghai, China. They are more closely related to YSLVs, especially to YSLV3, than to other known virophages. The DSLVs were consistently detected in DSL and their closely related relatives were found in the neighboring freshwater environments of DSL. The diversity and distribution of the virophages observed in DSL highlight the importance of virophages in China's freshwater ecosystem.

Materials and Methods

Sampling and DNA Extraction

Surface water samples (depth < 1 m, 500–1000 mL/site) were collected in sterile glass bottles using an electromotion aspiring pump, and kept on ice during delivery to our laboratory. The longitude and latitude of each sampling site were recorded along with sampling time (Table 1, Figure 1). Water samples were immediately filtered through 0.22 μm membrane (GSWP, Merck Millipore) upon receipt. After drying at room temperature, the membrane filters were cut into small pieces, and DNA was extracted from the sample using QIAamp Fast DNA Stool Mini Kit according to the manufacturer's instructions. Final DNA concentrations for each sample were determined using a microplate reader (BioTek) and stored at −20°C before use.


Table 1. The sampling sites and time.


Figure 1. Map shows the sampling sites in East China. Sampling sites are indicated with names and dark dots. Dishui Lake is highlighted in blue. Numbers on horizontal and vertical axis represent longitude (Lng) and Latitude (Lat), respectively. The area of Shanghai is shaded in gray.

Primers Design

Eight pairs of primers were designed using the MCP genes of eight virophages (Ace Lake Mavirus, Mavirus, Sputnik, OLV, YSLV1, 2, 3, and 4; Table 2) and synthesized by Sangon Biotech. Primer specificity was checked in silico based on NCBI blast.


Table 2. Eight pairs of virophage MCP genes specific primers designed in this study.


PCR reactions (25 μL) contained 0.4 mM forward and reverse primers (Table 2), 12.5 μL Taq PCR master mix 2 × (Sangon Biotech), and 20–25 ng of genomic DNA. The PCR cycle conditions are given in Table 3. After agarose gel electrophoresis and ethidium bromide staining, PCR amplicons were visualized with a Gel Doc XR+ system (Bio-Rad). PCR products were purified from the gel using a universal DNA purification kit (Tiangen Biotech), then ligated into the pUCm-T vector (Sangon Biotech). Recombinant plasmids were transformed into TOP10 cells (Tiangen Biotech), which were streaked on agar plates containing 50 μg/mL ampicillin and grown overnight at 37°C. Three positive colonies were selected for Sanger sequencing (Sangon Biotech).


Table 3. Thermal cycling programs of PCR for virophages.

Long-Term Detection of DSL Virophages

Dishui Lake water samples (depth < 1 m, 500–1000 mL) were collected monthly from Oct. 2013 to Sep. 2014. Their DNA extraction, PCR by using the YSLV3 MCP gene specific primers and sequencing were the same as described above.

Metagenomic Sequencing

Up to 6 μg of genomic DNA extracted from Dishui Lake water samples was used to construct two metagenomic libraries that contained 430 bp and 3.0 kb inserts, respectively. Paired-end sequencing (2 × 251 bp) was performed using an Illumina Miseq sequencer (Shanghai Personal Biotechnology).

QC of Metagenomic Datasets

The quality of metagenomic datasets was determined using FastQC software (Andrews, 2010). The average quality of five neighboring nucleotides was larger than Q20, with read lengths of more than 50 bp. Following FastQC analysis, the NGS QC Toolkit (Patel and Jain, 2012) was used to filter the reads again, with the default parameters, to ensure high quality reads suitable for assembly.

The Assembly of DSL Virophage Genomes

Python scripts (Supplementary Material) were modified according to Metavir 2 (Roux et al., 2014) in order to perform sequence alignment for mining virophages related reads in DSL environmental metagenomic data sets. All reads in the DSL metagenomic datasets were firstly aligned onto 12 known virophage genomes using tblastx with E-value of 10−5. Subsequently, the YSLV3 genome was used as a query to perform tblastx with E-value < 10−10, which allowed one hit per read, against the DSL metagenomic datasets. Retrieved YSLV3 related reads were assembled de novo with minimum overlap length of 25 bp and minimum overlap identity of 80%. Each of these assembled contigs was used as a reference sequence to which all reads from DSL metagenomic datasets were aligned with minimum overlap length of 25 bp and minimum overlap identity of 90%. The reference assemblies were repeated until the assembled sequences stop extending. All assemblies were performed with the bioinformatics software Geneious Pro (Biomatters).

PCR Verification of DSLV1 Genome

Eight pairs of primers (Table 4) were designed to verify regions of the DSLV1 genome where the coverage of reads was low or ambiguous. PCR reactions (25 μL) contained 0.4 mM forward and reverse primers (Table 4), 12.5 μL Taq PCR master mix 2 × (Sangon Biotech), and 20–25 ng of genomic DNA. The PCR cycle conditions were as follows: initial denaturation at 94°C for 4 min, followed by 30 cycles of 94°C for 30 s, 54°C (primers 1–4 and 6–8) or 56°C (primer 5) for 30 s, and 72°C for 45 s. Sequences obtained by PCR and sequencing (Sangon Biotech) were then aligned with the assembled genomes to confirm the accuracy of the sequence assembly.


Table 4. Primers for verification of DSLV1 genome.

Analysis of DSLV1 Genome

Geneious Pro software was used to predict virophage open reading frames (ORFs) by stipulating a start codon of ATG and a minimum 150 bp ORF. The predicted ORFs were then searched for homologous genes in NCBI non-redundant (nr) database and a local database containing all known virophage translated ORFs, e.g., Sputnik, Zamilon, Mavirus, OLV, ALM, YLV1-7, using the blastx program. InterProScan 5 and NCBI Conserved Domain Search were used for functional annotation of the predicted ORFs (Marchler-Bauer et al., 2011; Jones et al., 2014).

Whole genome alignment of DSLV1 and YSLV3 was carried out using the Mauve program on the Geneious Pro platform (default parameters; Darling et al., 2004).

Phylogenetic Analysis

The three conserved protein-encoding genes: ATPase, MCP, and Pro from the DSLVs were subjected to phylogenetic analysis. Amino acid sequences were aligned using MUSCLE (Edgar, 2004), and phylogenetic trees were reconstructed using the JTT model with PhyML and 100 iterations (Guindon et al., 2005).

Detection of DSLVs in Neighboring Water Environments

Water was sampled from Dianshan Lake, Dazhi River, Yangtze River Estuary, Yangshan harbor, and Gouqi island (Table 1, Figure 1). Sampling, DNA extraction and PCR were the same as described above. The YSLV3 MCP gene specific primers were used for screening since they completely matched the target-sequences of DSLV1 MCP gene.


PCR Detection of Virophages

Samples from five different lakes (Xi Lake, Qiandao Lake, Xuanwu Lake, East Lake, and Dishui Lake) in China were analyzed for virophages by using the MCP gene specific primers of eight known virophages (Figure 1, Table 2). As a result, PCR products using these primers were not detected in 4 of the lake water samples: Xi Lake, Qiandao Lake, Xuanwu Lake, and East Lake. However, PCR using the YSLV3 MCP gene specific primers successfully amplified DNA from the Dishui Lake water samples. The length of the PCR products was almost identical to that of the theoretical target sequence of YSLV3 MCP gene (Table 2). The PCR products from Dishui Lake were purified and subjected to cloning and sequencing. The sequences were searched against the NCBI non-redundant database using the blastx algorithm. The best blastx hit was the MCP gene sequence of YSLV6 with an expect value of 8e−15 and 33% of identity. When searched against a local dataset containing all translated ORFs of all known virophages, the obtained sequences had the highest sequence similarity with the MCP gene sequence of YSLV3 (89% of identity, E-value: 4.3e−106).

Additional 12 Dishui Lake water samples collected monthly from Oct. 2013 to Sep. 2014 were virophage positive based on the PCR analysis by using the YSLV3 primers, and their sequences were all the same. It indicated that the virophages existed stably in Dishui Lake.

Metagenomic Sequencing

To ensure both the depth and quality of metagenomic sequencing, six independent runs of Illumina sequencing were performed, which correspondingly yielded six data sets (Table 5). In total, 98,347,716 reads were obtained and subjected to sequence quality control analysis. As a result, 74,429,646 high quality reads were obtained and used for subsequent analysis. The average read length was ~177 bp (Table 5).


Table 5. Information of the DSL metagenomic data sets.

Sequence Alignment Analysis of Metagenomic Datasets

To examine the abundance of virophages-related sequences in the DSL metagenomic datasets, sequence alignment analysis was performed by using 12 known virophage genomes. Reads that were similar to OLV and YSLVs (n = 3337) were much more abundant than those related to Mavirus, ALM, Sputnik, and Zamilon (n = 582) (Figure 2). Notably, YSLV3-related reads were most abundant (n = 1199, 30.6% of total reads, E-value: 1e−5), and 22.4% of reads were similar to the MCP gene of YSLV3 (Figure 2). Accordingly, the YSLV3-related reads were considered for subsequent genomic sequence assembly.


Figure 2. Sequence alignment analysis of DSL metagenomic data sets. Red dots represent the recruited reads that shared significant sequence similarity with the virophage genomes. The numbers on the X-axis indicate the position and length (in base pairs) of the virophage genomes. The Y axis shows the percentage of sequence identity shared between the recruited reads and virophage genomic sequences.

Assembly of DSLVs

To be more stringent, the YSLV3 genome was used as the query to search against DSL data sets with tblastx and E-value of 1e−10. 280 YSLV3-related reads retrieved from the DSL metagenomic datasets were assembled de novo, resulting in 47 contigs that ranged from 151 to 1560 bp in size (Table 6). All contigs exhibited sequence similarity to that of YSLV3 (Table 6). Therefore, these contigs were used as references for sequence assembly against the DSL metagenomic datasets. As a result, contig1 was the longest assembly of about 28 kb (Table 6) and contained a direct repeat of 189 bp at both ends, indicating the Dishui Lake virophage represented by contig1 contained a circular genome. Eight additional assembled contigs were longer than 5 kb, including four longer than 10 kb.


Table 6. Information of the assembled contigs.

Verification of DSLV Genome

To further verify the ambiguous or low coverage regions of the contig1 as well as to confirm the circular nature of the genome, PCR was performed by using contig1 sequence specific primers and the DSL DNA samples. The obtained PCR products were then cloned and sequenced. After sequence alignment analysis of the PCR amplicons and contig1 genomic sequence, the DSLV genome was determined to be circular, and its length was 28,788 bp with 43.2% of GC content (GenBank accession no. KT894027). We named this virophage DSLV1.

Genomic and Phylogenetic Analysis

The DSLV1 genome encodes 28 predicted ORFs, and more than half of the ORFs share higher sequence similarity with YSLV3 (33–70%) than to the other YSLVs, including five conversed virophage genes: putative DNA helicase (HEL), packaging ATPase (ATPase), cysteine protease (PRO), major capsid protein (MCP), and minor capsid protein (mCP) (Figure 3, Tables 7, 8). In addition, whole genome alignments revealed five highly conserved regions that are shared between DSLV1 and YSLV3 (Figure 4). These results suggest that DSLV1 is closely related to YSLV3.


Figure 3. Circular map of the complete genome of DSLV1. Homologous genes shared between DSLV1 and YSLV3 are labeled in light blue; DSLV1 and giant algal viruses in red; DSLV1 and cellular organisms in orange; DSLV1 and PgVV (Phaeocystis globosa virus 16T virophage) in green. ORFans are marked in light gray. The interior blue line represents %G+C skew throughout the genome. Three gene clusters that have conserved synteny with other virophages are highlighted by the hashed rectangles. Red asterisks indicate the five conserved virophage genes. HEL, ATPase, PRO, MCP, and mCP.


Table 7. Homologous genes present in virophages (modified from Zhou et al., 2013).


Table 8. ORFs and their homologs predicted in DSLV1.


Figure 4. Whole genome alignment of DSLV1 and YSLV3. Five conserved genomic regions, shared between DSLV1 and YSLV3, are displayed with rectangles of different sizes and colors.

Interestingly, the DSLV1 ORF2 shares 27% amino acid identity with the hypothetical protein BpV2_168 from the giant DNA virus Bathcoccus sp. RCC1105 virus BpV2. In addition, DSLV1 ORF25 shared 45% of amino acid identity with the hypothetical protein NY2A_B677R from the Paramecium bursaria Chlorella virusNY2A. Such evident genetic link between giant algal viruses and virophages might result from horizontal gene transfer (Table 8).

To better understand the relationship between DSLV1 and previously identified virophages, the concatenated amino acid sequences of three conserved genes of ATPase, PRO, and MCP were used to reconstruct phylogenetic linkages between all the virophages. DSLV1 clustered with YSLV3 (Figure 5A), indicating their closer relationship, as revealed based on the gene content and genome alignment analyses presented above, in comparison to other virophages.


Figure 5. Phylogenetic trees showing the relationship of DSLVs to other virophages. (A) A phylogenetic tree was reconstructed based on amino acid sequences of three conserved genes: MCP, Pro, and ATPase to compare DSLV1 to other known virophages. (B) Phylogenetic analysis of virophage MCP proteins, including all seven DSLV sequences identified in this study. Bootstrap values are indicated on each branch (100 iterations). DSLVs and YSLV3 are shaded in light blue. DSLVs and the closely related YSLVs and OLV are highlighted in blue. DSLVs are shown in bold. YSLV, Yellowstone Lake virophage; DSLV, Dishui Lake virophage; ALM, Ace Lake Mavirus; OLV, Organic Lake virophage. The accession numbers of the DSLV MCP gene sequences are as follows: KU245924 (MCP1), KU245925 (MCP2), KU245926 (MCP3), KU245927 (MCP4), KU245928 (MCP5), and KU245929 (MCP6).

ORFs obtained from the other 46 contigs isolated from Dishui Lake were also analyzed to better understand the diversity of virophages in the Dishui Lake region. In total, we identified six complete MCP genes and six nearly complete MCP genes. All the complete MCP ORFs were used for the phylogenetic tree construction. As shown in Figure 5B, all seven Dishui Lake virophages along with YSLV3 formed a monophyletic group with 100% bootstrap support.

Distribution of DSLVs in Neighboring Water Environments

To understand whether DSLVs were unique to DSL, water samples were taken from three neighboring freshwater of Dazhi River (inflow of Dishui Lake), Dianshan Lake and Yangtze River Estuary, and two coastal surface sea water of Yangshan harbor and Gouqi island, and screened by PCR for the presence of the DSLVs using the YSLV3 MCP gene specific primers. PCR products were obtained from all three freshwater samples, and when sequenced, these amplicons shared more than 90% of sequence similarity with that of DSLV1. In contrast, DSLVs were not detected in the two sea water samples.


In our previous study, virophages were found to be distributed worldwide, and freshwater environments appeared to contain the highest abundance of virophages (Zhou et al., 2013). In addition, most of the known virophages are distantly related to each other (Bellas et al., 2015; Yutin et al., 2015; Zhou et al., 2015). Therefore, the genetic diversity of virophages and their potential roles in different environments remain enigmatic.

To understand the diversity of virophages in freshwater lakes in China, we performed PCR on samples isolated from five different lakes using virophage MCP genes specific primers. Since the MCP genes of the available virophage genomes are not conserved enough to design degenerate primers, eight pairs of primers were designed individually according to each genome. The presence of virophage in lake surface water was detected in one lake, Dishui Lake, and the presence of these virophages were steadily detectable for a whole year. These results indicate that Dishui Lake virophages existed in the lake over a long time, as opposed to a transient phenomenon, and are worthy of further exploration. In addition, the inability to PCR amplify virophage DNA from the other lakes that surveyed here does not exclude the possibility of far distantly related virophage being present that were not detected using the methods in this study.

To shed more light on the diversity of virophages in Dishui Lake, subsequently, the metagenomic analysis was conducted using the DNA extracts that were virophage positive based on the PCR detection. Initially, the virophage-related MCP gene sequence obtained by PCR was used as a reference sequence for the assembly. However, the sequence assembly terminated after extending to 1.9 kb (data not shown). This suggests that the corresponding virophages were not the most abundant in Dishui Lake. Subsequently, the sequence alignment analysis of the DSL metagenomic data sets revealed that the YSLV3-related virophages were more abundant than virophages related to the other known virophages, such as Sputnik and Mavirus. The seven virophage MCP genes obtained from DSL were phylogenetically grouped with that of YSLV3 as well, which indeed supports the close relationship between DSLVs and YSLV3. This results also suggest that DSLVs and YSLV3 likely infect similar giant viruses and unicellular eukaryotes present in these two lakes, although there are significant differences between DSL and YSL with respect to origin, size, location, geography, physicochemical properties, and human activities (Zhang et al., 2014); http://www.nps.gov/yell/learn/nature/geology.htm). Coincidently, the RNV (Rio Negro virophage), associated with Samba virus, was detected in the Rio Negro River in Brazil (Campos et al., 2014), which shared 100% sequence similarity with the MCP gene of Sputnik that was discovered in a water-cooling tower in Paris, France (La Scola et al., 2008). Taken together, these results suggest that viruses move between different biomes, and their diversity could be high on a local scale but relatively limited globally (Breitbart and Rohwer, 2005; Ignacio-Espinoza et al., 2013).

Interestingly, DSLVs and their close relatives appear to distribute widely throughout the freshwater environments in Shanghai. In contrast, they were not detected in the two neighboring sea water samples, although we did observe some reads in the viral metagenomic data sets derived from Yangshan deep harbor and Gouqi island which were similar to Sputnik (Wang et al., unpublished). Accordingly, these results indicate that the giant virus and the eukaryotic hosts of virophages are likely different in the freshwater and sea water ecotopes.

Both the giant virus and protist hosts of the DSLVs we identified remain unknown. Some virophages share homologous genes with their associated giant viral hosts (Zhang et al., 2015). We identified homologs to both Bathycoccus sp. RCC1105 virus BpV2 and Paramecium bursaria Chlorella virus NY2A genes in the DSLV1 genome (ORF2 and 25 respectively). Both of these are algae-infecting large dsDNA viruses and respectively belong to the genera of Prasinovirus and Chlorovirus (King et al., 2011). In a separate study, putative virophage genes as well as giant virus inserts were detected in the nuclear genome of the unicellular alga Bigelowiella natans (Blanc et al., 2015). These data suggest that algae possibly serve as the prey of giant viruses as well as their parasitic virophages, and DSLVs may be the parasites of giant viruses that infect algae. Further, work needs to focus on such potential tripartite relationships in China's aquatic ecosystem.

In conclusion, novel virophages were discovered for the first time in an artificial freshwater lake in Shanghai, China, and were found to be widely distributed in the neighboring freshwater bodies. These results further confirm the global distribution and genetic diversity of virophages in freshwater environments.

Author Contributions

Conceived and designed the experiments: SY, YW Performed the experiments: CG, XZ Analyzed the data: CG, WZ, YW Contributed analysis tools: HW, GS, JX, YP Wrote the paper: CG, WZ, YW All authors have read and approved the final manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.


This work was supported partially by the National Natural Science Foundation of China (41376135, 31570112), Doctoral Fund of Ministry of Education of China (20133104110006), Innovation Program of Shanghai Municipal Education Commission (14ZZ144), China, and the Special Fund for the Development of Science and Technology of Shanghai Ocean University (2015).

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/article/10.3389/fmicb.2016.00005


Andrews, S. (2010). FastQC: A Quality Control Tool for High Throughput Sequence Data. Available online at: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/

Bellas, C. M., Anesio, A. M., and Barker, G. (2015). Analysis of virus genomes from glacial environments reveals novel virus groups with unusual host interactions. Front. Microbiol. 6:656. doi: 10.3389/fmicb.2015.00656

CrossRef Full Text | Google Scholar

Blanc, G., Gallot-Lavallée, L., and Maumus, F. (2015). Provirophages in the Bigelowiella genome bear testimony to past encounters with giant viruses. Proc. Natl. Acad. Sci. U.S.A. 112, E5318–E5326. doi: 10.1073/pnas.1506469112

PubMed Abstract | CrossRef Full Text | Google Scholar

Breitbart, M., and Rohwer, F. (2005). Here a virus, there a virus, everywhere the same virus? Trends Microbiol. 13, 278–284. doi: 10.1016/j.tim.2005.04.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Campos, R. K., Boratto, P. V., Assis, F. L., Aguiar, E. R., Silva, L. C., Albarnaz, J. D., et al. (2014). Samba virus: a novel mimivirus from a giant rain forest, the Brazilian Amazon. Virol. J. 11:95. doi: 10.1186/1743-422X-11-95

PubMed Abstract | CrossRef Full Text | Google Scholar

Darling, A. C., Mau, B., Blattner, F. R., and Perna, N. T. (2004). Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 14, 1394–1403. doi: 10.1101/gr.2289704

PubMed Abstract | CrossRef Full Text | Google Scholar

Desnues, C., La Scola, B., Yutin, N., Fournous, G., Robert, C., Azza, S., et al. (2012). Provirophages and transpovirons as the diverse mobilome of giant viruses. Proc. Natl. Acad. Sci. U.S.A. 109, 18078–18083. doi: 10.1073/pnas.1208835109

PubMed Abstract | CrossRef Full Text | Google Scholar

Edgar, R. C. (2004). MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797. doi: 10.1093/nar/gkh340

PubMed Abstract | CrossRef Full Text | Google Scholar

Fischer, M. G., and Suttle, C. A. (2011). A virophage at the origin of large DNA transposons. Science 332, 231–234. doi: 10.1126/science.1199412

PubMed Abstract | CrossRef Full Text | Google Scholar

Gaia, M., Benamar, S., Boughalmi, M., Pagnier, I., Croce, O., Colson, P., et al. (2014). Zamilon, a novel virophage with Mimiviridae host specificity. PLoS ONE 9:e94923. doi: 10.1371/journal.pone.0094923

PubMed Abstract | CrossRef Full Text | Google Scholar

Guindon, S., Lethiec, F., Duroux, P., and Gascuel, O. (2005). PHYML Online–a web server for fast maximum likelihood-based phylogenetic inference. Nucleic Acids Res. 33, W557–W559. doi: 10.1093/nar/gki352

CrossRef Full Text | Google Scholar

Ignacio-Espinoza, J. C., Solonenko, S. A., and Sullivan, M. B. (2013). The global virome: not as big as we thought? Curr. Opin. Virol. 3, 566–571. doi: 10.1016/j.coviro.2013.07.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Jones, P., Binns, D., Chang, H. Y., Fraser, M., Li, W., McAnulla, C., et al. (2014). InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240. doi: 10.1093/bioinformatics/btu031

PubMed Abstract | CrossRef Full Text | Google Scholar

King, A. M., Adams, M. J., and Lefkowitz, E. J. (2011). Virus Taxonomy: Classification and Nomenclature of Viruses. Ninth Report of the International Committee on Taxonomy of Viruses, Elsevier.

Google Scholar

Krupovic, M., Bamford, D. H., and Koonin, E. V. (2014). Conservation of major and minor jelly-roll capsid proteins in Polinton (Maverick) transposons suggests that they are bona fide viruses. Biol. Direct 9:6. doi: 10.1186/1745-6150-9-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Krupovic, M., and Koonin, E. V. (2015). Polintons: a hotbed of eukaryotic virus, transposon and plasmid evolution. Nat. Rev. Microbiol. 13, 105–115. doi: 10.1038/nrmicro3389

PubMed Abstract | CrossRef Full Text | Google Scholar

Krupovic, M., Kuhn, J. H., and Fischer, M. G. (2016). A classification system for virophages and satellite viruses. Arch. Virol. 161, 233–247. doi: 10.1007/s00705-015-2622-9

PubMed Abstract | CrossRef Full Text | Google Scholar

La Scola, B., Desnues, C., Pagnier, I., Robert, C., Barrassi, L., Fournous, G., et al. (2008). The virophage as a unique parasite of the giant mimivirus. Nature 455, 100–104. doi: 10.1038/nature07218

PubMed Abstract | CrossRef Full Text | Google Scholar

Marchler-Bauer, A., Lu, S., Anderson, J. B., Chitsaz, F., Derbyshire, M. K., Deweese-Scott, C., et al. (2011). CDD: a Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Res. 39, D225–D229. doi: 10.1093/nar/gkq1189

PubMed Abstract | CrossRef Full Text | Google Scholar

Patel, R. K., and Jain, M. (2012). NGS QC Toolkit: a toolkit for quality control of next generation sequencing data. PLoS ONE 7:e30619. doi: 10.1371/journal.pone.0030619

PubMed Abstract | CrossRef Full Text | Google Scholar

Roux, S., Tournayre, J., Mahul, A., Debroas, D., and Enault, F. (2014). Metavir 2: new tools for viral metagenome comparison and assembled virome analysis. BMC Bioinform. 15:76. doi: 10.1186/1471-2105-15-76

PubMed Abstract | CrossRef Full Text | Google Scholar

Yau, S., Lauro, F. M., DeMaere, M. Z., Brown, M. V., Thomas, T., Raftery, M. J., et al. (2011). Virophage control of antarctic algal host-virus dynamics. Proc. Natl. Acad. Sci. U.S.A. 108, 6163–6168. doi: 10.1073/pnas.1018221108

PubMed Abstract | CrossRef Full Text | Google Scholar

Yutin, N., Kapitonov, V. V., and Koonin, E. V. (2015). A new family of hybrid virophages from an animal gut metagenome. Biol. Direct. 10, 19. doi: 10.1186/s13062-015-0054-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Zablocki, O., van Zyl, L., Adriaenssens, E. M., Rubagotti, E., Tuffin, M., Cary, S. C., et al. (2014). High-level diversity of tailed phages, eukaryote-associated viruses, and virophage-like elements in the metaviromes of antarctic soils. Appl. Environ. Microbiol. 80, 6888–6897. doi: 10.1128/AEM.01525-14

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Q., Ding, C., Achal, V., Shan, D., Zhou, Y., Xu, Y., et al. (2014). Potential for nutrient removal by integrated remediation methods in a eutrophicated artificial lake - a case study in Dishui Lake, Lingang New City, China. Water Sci. Technol. 70, 2031–2039. doi: 10.2166/wst.2014.453

PubMed Abstract | CrossRef Full Text

Zhang, W., Zhou, J., Liu, T., Yu, Y., Pan, Y., Yan, S., et al. (2015). Four novel algal virus genomes discovered from Yellowstone Lake metagenomes. Sci. Rep. 5:15131. doi: 10.1038/srep15131

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, J., Sun, D., Childers, A., McDermott, T. R., Wang, Y., and Liles, M. R. (2015). Three novel virophage genomes discovered from yellowstone lake metagenomes. J. Virol. 89, 1278–1285. doi: 10.1128/JVI.03039-14

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, J., Zhang, W., Yan, S., Xiao, J., Zhang, Y., Li, B., et al. (2013). Diversity of virophages in metagenomic data sets. J. Virol. 87, 4225–4236. doi: 10.1128/JVI.03398-12

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: virophage, genome, metagenomics, freshwater lake, diversity

Citation: Gong C, Zhang W, Zhou X, Wang H, Sun G, Xiao J, Pan Y, Yan S and Wang Y (2016) Novel Virophages Discovered in a Freshwater Lake in China. Front. Microbiol. 7:5. doi: 10.3389/fmicb.2016.00005

Received: 02 November 2015; Accepted: 05 January 2016;
Published: 22 January 2016.

Edited by:

Bernard La Scola, Aix Marseille Université, France

Reviewed by:

Christelle Desnues, URMITE Centre National de la Recherche Scientifique UMR 7278, France
Matthias Fischer, Max Planck Society, Germany

Copyright © 2016 Gong, Zhang, Zhou, Wang, Sun, Xiao, Pan, Yan and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yongjie Wang, yjwang@shou.edu.cn

These authors have contributed equally to this work.