ORIGINAL RESEARCH article

Front. Microbiol., 22 September 2023

Sec. Virology

Volume 14 - 2023 | https://doi.org/10.3389/fmicb.2023.1248323

Comparison of de novo assembly using long-read shotgun metagenomic sequencing of viruses in fecal and serum samples from marine mammals

  • Department of Environmental Health Sciences, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LA, United States

Article metrics

View details

8

Citations

5,7k

Views

1,4k

Downloads

Abstract

Introduction:

Viral diseases of marine mammals are difficult to study, and this has led to a limited knowledge on emerging known and unknown viruses which are ongoing threats to animal health. Viruses are the leading cause of infectious disease-induced mass mortality events among marine mammals.

Methods:

In this study, we performed viral metagenomics in stool and serum samples from California sea lions (Zalophus californianus) and bottlenose dolphins (Tursiops truncates) using long-read nanopore sequencing. Two widely used long-read de novo assemblers, Canu and Metaflye, were evaluated to assemble viral metagenomic sequencing reads from marine mammals.

Results:

Both Metaflye and Canu assembled similar viral contigs of vertebrates, such as Parvoviridae, and Poxviridae. Metaflye assembled viral contigs that aligned with one viral family that was not reproduced by Canu, while Canu assembled viral contigs that aligned with seven viral families that was not reproduced by Metaflye. Only Canu assembled viral contigs from dolphin and sea lion fecal samples that matched both protein and nucleotide RefSeq viral databases using BLASTx and BLASTn for Anelloviridae, Parvoviridae and Circoviridae families. Viral contigs assembled with Canu aligned with torque teno viruses and anelloviruses from vertebrate hosts. Viruses associated with invertebrate hosts including densoviruses, Ambidensovirus, and various Circoviridae isolates were also aligned. Some of the invertebrate and vertebrate viruses reported here are known to potentially cause mortality events and/or disease in different seals, sea stars, fish, and bivalve species.

Discussion:

Canu performed better by producing the most viral contigs as compared to Metaflye with assemblies aligning to both protein and nucleotide databases. This study suggests that marine mammals can be used as important sentinels to surveil marine viruses that can potentially cause diseases in vertebrate and invertebrate hosts.

1. Introduction

Worldwide, viruses are reported to cause 72% of the infectious disease-induced mass mortality events (ID MME) in marine mammals from 1955 to 2018, specifically morbilliviruses and Influenza A viruses (Sanderson and Alexander, 2020). The U.S. National Oceanic and Atmospheric Administration (NOAA) reports that around 49% of marine mammal unusual mortality events (UME) from 1991 to 2021 are classified as undetermined (Onens et al., 2023). Marine mammals infected with viruses may be more susceptible to other oceanic algal toxins and harmful bacteria such as Vibrio and Klebsiella exacerbated by climate change (Bogomolni et al., 2016; Siebert et al., 2017; Sanderson and Alexander, 2020). For example, increased water temperatures enhance the survival of Vibrio parahaemolyticus in marine environments, which was documented to cause die offs of northern sea otters in Alaska due to septicemia and enteritis (Burek et al., 2008). Furthermore, Klebsiella pneumoniae was introduced into the New Zealand sea lion population in 1998 and is known to cause endemic pup mortality (Castinel et al., 2007; Roe et al., 2015). Harmful algal blooms that produce brevotoxins (Flewelling et al., 2005), domoic acid (Lefebvre et al., 1999; Gulland, 2000; Lefebvre et al., 2016), and saxitoxin (Lefebvre et al., 2016; Fire et al., 2020) are documented to cause mortality in marine mammals and are becoming more prevalent globally due to climate change (Hendrix et al., 2021).

The monitoring of viruses in ocean ecosystems can increase the likelihood of detecting emerging infectious diseases. Calicivirus is a prime example of a zoonotic virus, with ocean origin, that spilled over from sea lion to swine, and is known to cause vesicular disease in marine mammals (Neill et al., 1995; Smith et al., 1998). Furthermore, UME first responders were documented to contract sealpox from marine mammals infected with Parapoxvirus, with symptoms presented as contagious pustular dermatitis or lesion (Clark et al., 2005; Roess et al., 2011). In 1988, a phocine distemper Morbillivirus caused the biggest ID MME, with over 18,000 harbor seals (Phoca vitulina) stranded in Europe (Dietz et al., 1989). Morbilliviruses have caused more than half of the ID MME’s in marine mammals since 1955 (Sanderson and Alexander, 2020) with symptoms of skin lesions, pneumonia, brain infections and pup abortions (Pomeroy et al., 2005; Duignan et al., 2014). Although distemper viruses are not readily transmissible to humans, it was documented that canine distemper virus can adapt to use human cell receptors suggesting potential zoonotic spillover (Bieringer et al., 2013; Sakai et al., 2013). Influenza A virus (IAV) is the second leading cause of viral ID MME’s in marine mammals after being reported in harbor seal die offs since 1979, causing acute hemorrhagic pneumonia (Webster et al., 1981b; Sanderson and Alexander, 2020). Several cases of conjunctivitis caused by an IAV virus spillover event from seals to humans were documented in 1981 (Webster et al., 1981a). Therefore, there is a critical need to develop more comprehensive, rapid, and affordable detection methods for zoonotic viruses in marine environments to prevent the spread of emerging and endemic infectious diseases.

Viral diversity in marine environments is immense, and most viruses cannot be identified using traditional culturing techniques (Noble and Fuhrman, 1997; Munang'andu et al., 2017; Arya, 2020). Early metagenomics studies revealed that most of the viral diversity remains uncharacterized (Breitbart et al., 2002). Next generation sequencing (NGS) technologies combined with reliable bioinformatic pipelines are critical for the detection and characterization of novel and existing viral pathogens in marine mammals that could cross over to other vertebrate populations. Although the second generation short-read (200–400 bp) technologies, such as Illumina or Ion Torrent have high read throughput, the shorter DNA fragments can be difficult to assemble and annotate (Pop and Salzberg, 2008; Hu et al., 2021). The third generation nanopore sequencing technology is providing a new opportunity to develop more rapid, portable, and cost-effective genomic sequencing assays for viruses. For example, the nanopore-based sequencer MinION can produce read lengths of over 10 kB, which can overcome annotation of genomic repeat regions and structural variations that are difficult to assemble (Tørresen et al., 2019; Hu et al., 2021). Nanopore sequencing has been used to detect various viruses from diverse clinical samples (Quick et al., 2017; Hayashida et al., 2019; Cohen et al., 2020; Brown et al., 2021). This technology has been shown to sequence the full genome of four variants of herpes simplex viruses in single read, with read lengths ranging from 100 kb to 2.3 Mb (Saranathan et al., 2022). To our knowledge, the nanopore long-read sequencing technology has not been evaluated for non-targeted metagenomic sequencing of viruses in marine mammals.

De novo assemblers are programs that assemble shorter nucleotide sequences into longer fragments called contigs without a reference database. Many de novo assemblers that exist for long-read sequencing technologies have been evaluated only using bacteria (Chin et al., 2013; Zimin et al., 2013; Kamath et al., 2017; Koren et al., 2017; Kolmogorov et al., 2020), plants (Chin et al., 2016; Ruan and Li, 2020), and fungi (Chin et al., 2016) samples. Choosing the right de novo assembler is important for constructing error-free and artifact-less genome assemblies. De novo assembly algorithms include overlap-layout-consensus (OLC), de-Bruijn-graph (DBG), string-graph (SG) and hybrid approaches (Dida and Yi, 2021). Briefly, the OLC algorithm finds overlaps between reads, creates a read layout, then a consensus sequence is produced (Idury and Waterman, 1995; Li et al., 2012; Liao et al., 2019). DBG is an algorithm that chops reads up into short k-mers (substrings of length k), where overlapping edges (k−1) are found, resulting in an Eulerian (edges) or Hamiltonian (nodes) path to create a graph (Pevzner et al., 2001; Li et al., 2012) where contigs are constructed (Compeau et al., 2011). SG is a simplified OLC where sequence reads (nodes) and non-transitive edges produce suffix to prefix overlaps (Liao et al., 2019). Canu (Koren et al., 2017) is an upgraded long-read OLC assembler algorithm that integrates newer computational procedures to overcome noisy overlapping reads and decreases assembly time compared to the now unsupported Celera Assembler (Myers et al., 2000; Miller et al., 2008). Metaflye is a long-read DBG assembler algorithm that constructs repeat graphs from arbitrary paths called disjoints that are stringed together to construct contigs (Kolmogorov et al., 2019, 2020). Both Metaflye and Canu have been evaluated to construct genomes of plant (Arabidopsis thaliana), bacteria (Escherichia coli, Bacillus cereus, and Staphylococcus aureus), human, and yeast (Saccharomyces cerevisiae) (Dida and Yi, 2021). Metaflye outperformed Canu by generating larger contigs and higher N50 values, but is prone to more mis-assemblies and mismatches (Dida and Yi, 2021).

The objectives of this study were to (i) use metagenomics to characterize viruses in stool and serum samples from bottlenose dolphins (Tursiops truncates) and California sea lions (Zalophus californiansus) with a long-read nanopore sequencing approach; (ii) compare two de novo assemblers, Canu v2.2 and Metaflye v2.9.1, in generating viral contigs for annotation. The improved knowledge on marine viruses and successful protocol development will lead to a translational science that informs protection of animals and public health under a One Health framework.

2. Materials and methods

2.1. Sample collection

Five sea lion fecal, four sea lion serum, four dolphin fecal, and four dolphin serum samples were collected from the U.S. Navy’s dolphin and sea lion clinic facility in Point Loma, CA (Lat 32.746021, Long −117.237030) in 2018 and 2019, respectively. Samples were collected in 10 mL conical tubes and frozen at –80°C until analysis.

Samples from Navy animals were collected during their routine care and under the authority codified in U.S. Code, Title 10, Section 7524. Secretary of Navy Instruction 3900.41H directs that Navy marine mammals be provided the highest quality of care. The U.S. Navy Marine Mammal Program (MMP), Naval Information Warfare Center (NIWC) Pacific, houses, and cares for a population of bottlenose dolphins and California sea lions in San Diego Bay (CA, United States). NIWC Pacific is accredited by The Association for Assessment and Accreditation of Laboratory Animal Care (AAALAC) International and adheres to the national standards of the U.S. Public Health Service policy on the Humane Care and Use of Laboratory Animals and the Animal Welfare Act.

2.2. Samples processing for viral metagenomics

Animal fecal samples were weighed in 50 mL sterile conical tubes. Sterile phosphate-buffered saline (PBS) was added at 1 mL/g and vortexed for 5 min. Sterile PBS was also used as a negative control. The samples were centrifuged at 15,000g for 10 min, and the supernatant was filtered through 0.45 μm and 0.22 μm filters. Serum samples were filtered directly through 0.45 μm and 0.22 μm filters. In order to remove any free DNA, 1 mL aliquot of each sample was incubated with Ambion™ DNase I (RNase Free) (Thermo Fisher Scientific, United States) at 37°C for 1 h in a water bath at a final concentration of 0.1 U (unit) per μL. The DNase was deactivated by treating with 50 mM EDTA at 75°C in a heat block for 10 min.

2.3. Nucleic acid extraction

Viral nucleic acids were extracted using the Invitrogen™ PureLink™ Viral RNA/DNA Extraction kit according to manufacturer’s instructions (Thermo Fisher Scientific, United States). In this study, 200 μL of pre-treated samples were extracted and eluted into 50 μL sterile RNase-free water. Nucleic acid samples were stored in −80°C until further processing.

2.4. Random PCR (rPCR) assay

To generate sufficient material for sequencing, a random reverse transcription/amplification protocol was used to amplify both viral DNA and RNA. Briefly, 5 μL of the nucleic acid samples were mixed with 1uL of primer-A (5’-GTTTCCCAGTCACGATCNNNNNNNNN) (40 μM) (Wang et al., 2002, 2003), 4 μL nuclease-free water, and was heated to 65°C for 5 min and 22°C for 5 min. The 10 μL reaction was then added to 4 μL 5X First Strand Buffer [250 mM Tris–HCl (pH 8.3), 375 mM KCl, 15 mM MgCl2], 0.4 μL deoxynucleotide Solution Mix (dNTP) (10 mM) (New England Biolabs, United States), 0.6 μL nuclease-free water, 2 μL DTT (100 mM), 2uL SuperScript™ III Reverse Transcriptase (200 U/uL) (Thermo Fisher Scientific, United States), 1uL RNaseOUT™ (40 U/uL) (Thermo Fisher Scientific, United States) and was heated to 42°C for 60 min, 94°C for 2 min, and 10°C for 5 min. The 20 μL reaction was added to 2 μL 5X Sequenase™ Reaction Buffer, 7.7 μL nuclease-free water, 0.3 μL Sequenase™ Version 2.2 DNA Polymerase (13 U/μL) (Thermo Fisher Scientific, United States), and was heated to 10°C for 5 min, 37°C for 8 min with 1°C/s ramping, 94°C for 2 min, 10°C for 5 min (1.2 μL of 2.5 μL Sequenase™ Version 2.2 DNA Polymerase (13 U/uL) with 7.5 μL Sequenase™ Enzyme Dilution Buffer was added to each tube during this step), 37°C for 8 min with 1°C/s ramping, and 94°C for 8 min. 6 μL of the cDNA template was added to 8 μL MgCl2, 10 μL Buffer II (100 mM Tris–HCl, pH 8.3, 500 mM KCl), 1 μL dNTP (10 mM), 1 μL Primer-B (5’-GTTTCCCAGTCACGATC) (100 μM) (Wang et al., 2002, 2003), 1 μL AmpliTaq Gold™ DNA Polymerase with Gold Buffer and MgCL2 (Thermo Fisher Scientific, United States), and 73 μL nuclease-free water. The cDNA was amplified under the following conditions, 94°C for 15 min, 40 cycles of 94°C for 30 s, 40°C for 30 s, 50°C for 30 s, and 72°C for 1 min. The resulting PCR product was run on a 1% agarose gel and a smear between 500 bp to 1 kb was considered positive for sequencing. Positive samples were purified using the Wizard SV Gel and PCR Clean-Up System according to manufacturer’s instructions (Promega, Madison, WI, United States), where 100 μL cDNA was purified and eluted into 50 μL nuclease-free water. Samples were not normalized to a specific concentration prior to barcoding for sequencing.

2.5. Virome library preparation and nanopore sequencing

Amplified samples were checked for DNA concentration using Qubit® 4 dsDNA HS Assay kit according to manufacturer’s instructions (Thermo Fisher Scientific, United States), and 1 μL samples were tested for concentration (ng/μL). 50 μL of purified rPCR samples were barcoded using the Oxford Nanopore PCR Barcoding Kit (SQK-PBK004) Version: RPB_9059_v1_revN_14Aug2019 according to the manufacturer’s instructions (Oxford Nanopore, United Kingdom). All samples were sequenced on the flow cell MinION Mk1b with R9.4.1 flow cell chemistry. Due to the frequent software updates, different version of MinKNOW and Guppy basecaller were used (MinKNOW v20.10.3 and v22.05.5 with Guppy basecaller v4.4.1 and v6.1.7 for fecal samples; MinKNOW v21.02.1 with Guppy basecaller v4.3.4 for all serum samples). The libraries were analyzed in the MinKNOW software under the following parameters: PCR Barcoding kit (SQK-PBK004), Native Barcoding Expansion 1–12 (EXP-PBC001), fast basecalling, trim barcodes, mid-read barcode filtering and barcode trimming. Fastq files were concatenated after sequencing for data analysis and bioinformatics.

2.6. Data analysis and bioinformatics

The in silico pipeline analysis for nanopore sequencing data is shown in (Figure 1). Fastq files were uploaded to the Oxford Nanopore Technology (ONT) EPI2ME Software and analyzed using the “Whats in my pot?” WIMP (Humane + Viral) workflow to obtain quality control statistics on reads analyzed, total yield (Mb), average quality score, and average sequence length. Fastq sequencing files were analyzed in the Ubuntu 20.04 LTS 64-bit on Tulane Universities’ Cypress high performance computing (HPC) 124-node cluster with dual 10-core 2.8 GHz Intel Xeon E5-2680 v2 CPUs, 64 GB or RAM, and dual Xeon Phi 7120P coprocessor system. Fastq files were de novo assembled with Canu v2.2 and Metaflye v2.9.1 (Koren et al., 2017; Kolmogorov et al., 2020) pipeline and polished with Medaka v1.6.01 (Lee et al., 2021). Canu 2.2 program parameters for low coverage reads were genomeSize = 2 m maxinputCoverage = 10,000 corOutCoverage = 10,000 corMhapSensitivity = high corMinCoverage = 0 redMemory = 32 oeaMemory = 32 batMemory = 64 minInputCoverage = 0 stopOnLowCoverage = 0, and genomeSize = 2 m maxInputCoverage = 100 –nanopore fecal samplesDNA.fastq for high coverage data, respectively. Metaflye v2.9.1 program parameters were –meta –threads 20 –nano-raw. Medaka v1.6.0 program parameters were medaka_consensus -i ${BASECALLS} -d ${DRAFT} -o ${OUTDIR} -t ${NPROC} -m r941_min_high_g330. All contigs were annotated using Basic Local Alignment Search Tool (BLAST) against the National Center for Biotechnology Information (NCBI) GenBank protein database (viral RefSeq). BLASTx hits with e-values ≤10−4 were used for analysis. BLASTx output files were analyze to viral families using MEGAN-LR (long-read) Community Edition 6.24.1 with default parameters, min score 50.0, max expected 0.0001, min percent identity 10.0, top percent 10.0, min support percent 0.01, min support 1, min read length 0, LCA algorithm longReads, percent to cover 80.0, and read assignment mode readCount. Viral family contigs that were positive for viral families that infect invertebrate and vertebrate hosts were subject to manual BLASTn analysis for confirmation. Version 5 BLAST+ 2.10.0 was used for both BLASTx and BLASTn database queries. The Quality Assessment Tool for Genome Assemblies (QUAST) v5.2 MetaQUAST was used to analyze contig output files. To determine if Canu v2.22 assembles more viral family reads compared to MetaFlye v2.91,3 a one-tailed Wilcoxon rank-sum test (also known as the Mann–Whitney U test) for non-parametric data was used. Statistical analysis was performed using R version 4.2.3 (R Core Team, 2013).

Figure 1

The original nucleotide sequences described in this study have been deposited in the GenBank database under the Bioproject accession numbers PRJNA998092.

3. Results

3.1. Generation of viral metagenomic sequences

A long-read sequencing of randomly primed amplicons using the nanopore MinION generated a total of 1,698,981 sequencing reads, yielding 1,557 Mb after quality filtering of “passed” reads. The highest number of sequencing reads at 1,120,197 reads after basecalling, were produced from sea lion fecal samples, followed by dolphin fecal samples with 456,126 sequence reads after basecalling (Table 1). The average quality score of all the sequences ranged from 9 to 15, with the average sequence length (bases) ranging from 627 to 769 bases (Table 1).

Table 1

Marine mammalNumber of samplesSample typeReads analyzed (bases)Total yield (Mb)Average quality scoreAverage sequence length (bases)
Sea lion5Stool1,120,1971,1949769
Sea lion3Serum71,1214415627
Dolphin4Stool456,1262849721
Dolphin4Serum51,5373510684

EPI2ME quality control analysis of passed raw read fastq files generated from the MinKNOW software.

3.2. De novo assembly comparison

Canu v2.2 assembled a total of 333 contigs, with 118 viral contigs ranging from 1,029 to 13,513 nucleotides (nt) long for dolphin samples and 593 total contigs, with 224 viral contigs ranging from 1,026 to 8,114 nt for sea lion samples. In this study, viral contigs are the assembled sequences that aligned with the reference sequences from the NCBI Viral RefSeq database using BLASTx. Metaflye v2.9.1 assembled a total of 177 contigs, with 76 viral contigs ranging from 262 to 3,631 nt for dolphin samples and a total of 130 contigs, with 46 viral contigs ranging from 1,128 to 4,740 for sea lion samples (Table 2). Canu v2.2 produced the longest contig at 13,513 nt from dolphin fecal samples. Metaflye v2.9.1 produced higher N50 values compared to Canu v2.2 for both dolphin and sea lion fecal and serum samples. Canu v2.2 produced more contigs that could be annotated as viruses using the NCBI database compared to Metaflye v2.9.1. The mean viral contig size was higher for dolphin fecal samples, but lower in sea lion fecal samples with Canu v2.2 compared to Metaflye v2.9.1. The mean viral contig size was lower for dolphin serum samples, but higher for sea lion serum samples with Canu v2.2 compared to Metaflye v2.9.1. Metaflye v2.9.1 did not produce viral contigs for sea lion serum samples. Canu v2.2 produced lower percent viral contigs for dolphin fecal samples, but higher percent viral contigs for sea lion fecal samples, while Metaflye v2.9.1 produced higher percent viral contigs for dolphin serum. Overall, Canu v2.2 used lower central processing units (CPUs) compared to Metaflye v2.9.1 but took a longer time to assemble contigs compared to Metaflye v2.9.1 (Table 2).

Table 2

SamplesCanu v2.2Metaflye 2.9.1
Fecal samplesSerum samplesFecal samplesSerum samples
DolphinSea lionDolphinSea lionDolphinSea lionDolphinSea lion
Total contigs2905446310317512822
Largest contig (nt)13,5138,114191519673,6314,7402,5012,670
N50 (nt)1,1981,1941,1731,1642,3182,3022,4992,658
Total non-viral contigs16228053891018202
Total viral contigs1082131011744620
Viral contig size range (nt)1,029–13,5131,026–8,1141,055–17811,043–1,482262–3,6311,128–4,7402,214–2,5010
Mean viral contig size (nt)2,3461,6701,3461,2782,1852,3632,3580
Median viral contig size (nt)1,7441,3751,3171,2722,2982,3492,3580
% Viral contigs37%39%16%11%42%36%100%0%
GC (%)41.9641.1947.6947.1441.4141.1843.8655.28
Total viral BLASTx hits9821,37711351758207200
No. of CPUs1010101020402020
Run time (min)1,4601980270330601554

A comparison of the performance and annotation outputs for de novo assemblers Canu v2.2 and Metaflye v2.9.1.

BLASTx outputs with MEGAN were used to determine viral contigs, which are contigs that aligned against the NCBI RefSeq viral database.

3.3. The distribution of viral contigs

Using different de novo assemblers, contigs from diverse viral host distributions were observed. For dolphin fecal samples, both Metaflye v2.9.1 and Canu v2.2 assembled viral contigs that aligned with bacteriophages, invertebrate, and vertebrate viruses, but only Canu v2.2 produced contigs that aligned with algal viruses (Figure 2). Overall, Canu v2.2 produced viral contigs from more diverse hosts in fecal samples compared to Metaflye v2.9.1 (Figure 2). Vertebrate hosts were dominant in dolphin serum using Canu v2.2, while invertebrate hosts were dominant in dolphin serum using Metaflye v2.9.1. Contigs aligning with viruses from amoeba hosts were only detected in sea lion fecal and serum samples using Canu v2.2. Contigs aligning to plant viral hosts were only detected in sea lion fecal samples with Canu v2.2. Distribution of virus types between samples and de novo assemblers were similar. All serum samples had contigs that aligned 100% with dsDNA type viruses. Both dolphin and sea lion fecal samples had contigs that aligned with dsDNA, RNA and ssDNA virus types using either Canu v2.2 or Metaflye v2.9.1. Contigs that aligned with RNA viruses had slightly higher distribution in sea lion fecal samples compared to dolphin fecal samples, while ssDNA types had similar distribution (Figure 3).

Figure 2

Figure 3

Canu v2.2 assembled significantly higher numbers of viral contigs compared to Metaflye v2.9.1 for dolphin fecal samples, sea lion fecal samples and sea lion serum samples (p-value = <0.05) (Figure 4). Assembled viral contigs aligned with several vertebrate viral families such as Anelloviridae, Parvoviridae, Poxviridae, Smacoviridae using the protein BLASTx program (Figure 4). Invertebrate viruses detected in fecal samples of dolphin and sea lion aligned to the families of Baculoviridae, Circoviridae, Iridoviridae, and Parvoviridae. Baculoviridae viral reads only aligned with Canu v2.2 assembled viral contigs from dolphin fecal samples. Circoviridae, Parvoviridae, Riboviria (realm) viral reads were higher in sea lion fecal samples with Canu v2.2.

Figure 4

Bacteriophage families include Autographiviridae, Inoviridae, Myoviridae, Podoviridae, Salasmaviridae, Siphoviridae, and Zobellviridae were also detected in dolphin fecal samples using Canu v2.2 (Figure 4). In sea lion fecal samples, bacteriophage families included Microviridae and Tectiviridae were detected.

Although Canu v2.2 and Metaflye v2.9.1 assembled viral contigs from similar viral families, some discrepancies were found. Canu v2.2 produced viral contigs of seven viral families that were absent from the Metaflye v2.9.1 output (Figure 5). Overall, most of the viruses that were detected with both Canu v2.2 and Metaflye v2.9.1 were linear, non-enveloped dsDNA viruses (Supplementary Data_Sheet_1_virus_genome excel file).

Figure 5

3.4. Protein and nucleotide analysis

Viral contigs that were aligned using the protein (amino acid, aa) BLASTx program were further confirmed using the nucleotide BLASTn program. Canu v2.2 generated contigs of vertebrate and invertebrate viruses that were positive for both protein and nucleotide NCBI RefSeq viral databases, while Metaflye v2.9.1 only generated viral contigs that had positive viral families for protein, not nucleotide. No viral contigs from vertebrate and invertebrate hosts were identified in serum samples using both assemblers with both BLASTx and BLASTn searches. Dolphin fecal samples contained viral contigs that were found to be associated with seals, sea stars, and oysters, as confirmed by both protein and nucleotide databases. These viruses included torque teno midi virus (TTMDV), seal annellovirus, and sea star densoviruses (DNV) (Table 3). Sea lion fecal samples contained viral contigs that were also detected in oysters, fish, sea stars, crayfish, and clams. These viruses included sea star DNV, Cherax quadricarinatus DNV, Circoviridae species., and Ambidensovirus (AmDNV) (Table 4).

Table 3

Blast typeViral familySource taxonomySource nameContig size (nt)Viral assignmentNCBI accession #ID%e-valueBit score
BLASTxAnelloviridaeHomo sapiensHuman1,257Torque teno midi virus 12YP_009505786.1323.00E-0751
AnelloviridaePhoca vitulinaHarbor seal1,257Seal anellovirus 4YP_009115496.1334.00E-1154
AnelloviridaePhoca vitulinaHarbor seal1,257Seal anellovirus 4YP_009115496.1338.00E-0651
AnelloviridaeAilurus fulgensRed panda1,257Lesser panda anellovirusYP_009551687.1382.00E-1056
ParvoviridaeAsterias amurensisN.Pacific seastar1,625Sea star-associated densovirusYP_009507339.1591.00E-04223
ParvoviridaeAsterias amurensisN.Pacific seastar1,625Sea star-associated densovirusYP_009507339.1464.00E-1685
ParvoviridaeAsterias amurensisN.Pacific seastar2,929Sea star-associated densovirusYP_009507339.1461.00E-04602
BLASTnAnelloviridaeArctocephalus australisSo. American fur seal1,257Torque teno arctocephalus australis virus 1MW504281.1796.46E-148536
AnelloviridaeArctocephalus australisSo. American fur seal1,257Anellovirus fur seal/AAUST60/BR/2012MW504281.1842.71E-37180
ParvoviridaeCrassostrea ariakensisSuminoe oyster1,625AmbidensovirusKY548840.176.735.00E-140510
ParvoviridaeAsterias forbesiForbes sea star1,625Uncultured densovirusMN190158.182.232.00E-70345
ParvoviridaePisaster ochraceusPurple sea star2,929Uncultured densovirusMW073776.181.851.33E-63257
ParvoviridaeAsterias forbesiForbes sea star2,929Uncultured densovirusMN190158.179.298.02E-61248
ParvoviridaeAsterias amurensisN.Pacific seastar2,929Sea star-associated densovirusNC_038532.179.138.13E-51215
ParvoviridaeAsterias forbesiForbes sea star2,929Sea star-associated densovirusKY785181.179.794.89E-48206
ParvoviridaeAsterias forbesiForbes sea star2,929Sea star-associated densovirusKY785180.179.442.28E-46200
ParvoviridaeYangtze RiverN/A2,929Parvoviridae sp. IsolateMW348572.178.373.86E-34159
ParvoviridaeYangtze RiverN/A2,929Parvoviridae sp. IsolateMW348571.183.851.41E-23124
ParvoviridaePycnopodia helianthoidesSunflower sea star2,929Uncultured densovirusMT733051.179.258.13E-51215

Canu v2.2 assembled vertebrate and invertebrate viral contigs from pooled dolphin fecal samples (n = 4) that shared the same viral families between BLASTx and BLASTn.

The contigs with the same size (nucleotide, nt) were the same contigs, but resulted in multiple alignments from different taxonomic sources.

Table 4

Blast typeViral familySource taxonomySource nameContig size (nt)Viral assignmentNCBI accession #ID%e-valueBit score
BLASTxParvoviridaeAsterias amurensisN.Pacific seastar2,303Sea star-associated densovirusYP_009507339.1511.00E-04286
ParvoviridaeCherax quadricarinatusAustralian red claw crayfish2,303Cherax quadricarinatus densovirusYP_009134734.146.33.00E-34144
ParvoviridaeAsterias amurensisN.Pacific seastar2,309Sea star-associated densovirusYP_009507339.145.61.00E-04347
ParvoviridaeCherax quadricarinatusAustralian red claw crayfish2,309Cherax quadricarinatus densovirusYP_009134734.140.51.00E-04345
ParvoviridaeAsterias amurensisN.Pacific seastar2,309Sea star-associated densovirusYP_009507340.183.91.00E-04251
ParvoviridaeCherax quadricarinatusAustralian red claw crayfish2,309Cherax quadricarinatus densovirusYP_009134732.179.61.00E-04234
CircoviridaePaphies subtriangulataClam1,254Avon-Heathcote Estuary associated circular virus 28YP_009126886.140.16.00E-16119
CircoviridaePaphies subtriangulataClam1,254Avon-Heathcote Estuary associated circular virus 28YP_009126887.1351.00E-2981
ParvoviridaeCherax quadricarinatusAustralian red claw crayfish1,480Cherax quadricarinatus densovirusYP_009134732.144.21.00E-04294
ParvoviridaeAsterias amurensisN.Pacific seastar1,480Sea star-associated densovirusYP_009507340.145.61.00E-04291
ParvoviridaeCherax quadricarinatusAustralian red claw crayfish1,480Cherax quadricarinatus densovirusYP_009134732.167.81.00E-30128
ParvoviridaeAsterias amurensisN.Pacific seastar1,480Sea star-associated densovirusYP_009507340.171.61.00E-04124
ParvoviridaeCherax quadricarinatusAustralian red claw crayfish1,480Cherax quadricarinatus densovirusYP_009134732.153.81.00E-0490
ParvoviridaeAsterias amurensisN.Pacific seastar1,480Sea star-associated densovirusYP_009507340.153.91.00E-0486
ParvoviridaeCherax quadricarinatusAustralian red claw crayfish1,174Cherax quadricarinatus densovirusYP_009134731.135.31.00E-04184
ParvoviridaeCherax quadricarinatusAustralian red claw crayfish1,408Cherax quadricarinatus densovirusYP_009134732.159.91.00E-04463
ParvoviridaeAsterias amurensisN.Pacific seastar1,408Sea star-associated densovirusYP_009507340.156.51.00E-04451
ParvoviridaeAsterias amurensisN.Pacific seastar1,034Sea star-associated densovirusYP_009507339.153.31.00E-04240
ParvoviridaeCherax quadricarinatusAustralian red claw crayfish1,093Cherax quadricarinatus densovirusYP_009134732.162.21.00E-04325
ParvoviridaeAsterias amurensisN.Pacific seastar1,093Sea star-associated densovirusYP_009507340.159.31.00E-04318
ParvoviridaeCherax quadricarinatusAustralian red claw crayfish1,093Cherax quadricarinatus densovirusYP_009134732.177.21.00E-1994
ParvoviridaeAsterias amurensisN.Pacific seastar1,093Sea star-associated densovirusYP_009507340.178.93.00E-1992
ParvoviridaeSolenopsis invictaAnts1,093Solenopsis invicta densovirusYP_008766862.122.56.00E-1167
CircoviridaeOcean WaterN/A1,312Circoviridae 2 LDMD-2013YP_009109630.133.31.00E-04194
BLASTnParvoviridaeCrassostrea ariakensisSuminoe oyster2,303AmbidensovirusKY548840.176.0522.02E-140512
ParvoviridaeCrassostrea ariakensisSuminoe oyster2,309AmbidensovirusKY548840.186.8745.75E-126464
CircoviridaeLutjanus campechanusRed snapper1,254Circoviridae sp. isolateMH616634.192.5111.00E-04649
CircoviridaeOncorhynchus mykissRainbow trout1,254Circoviridae sp. isolateMH617160.192.1888.01E-177632
CircoviridaeLutjanus campechanusRed snapper1,254Circoviridae sp. isolateMH616871.192.2551.73E-173621
ParvoviridaeCygnus olorMute swan1,480AmbidensovirusMW588057.183.8985.48E-20111
ParvoviridaeAstropecten polyacanthusComb sea star1,480Uncultured densovirusMT733013.179.7625.60E-0562.1
ParvoviridaeCrassostrea ariakensisSuminoe oyster1,174AmbidensovirusKY548840.176.5091.74E-123455
ParvoviridaeCrassostrea ariakensisSuminoe oyster1,408AmbidensovirusKY548840.187.8861.00E-041,439
ParvoviridaeYangtze RiverN/A1,408Parvoviridae sp. IsolateMW348572.178.6831.00E-04749
ParvoviridaePisaster ochraceusPurple sea star1,408Densovirinae sp. isolateMW073782.174.8633.08E-32152
ParvoviridaePisaster ochraceusPurple sea star1,034Uncultured densovirusMT733042.177.1812.02E-107401
ParvoviridaePycnopodia helianthoidesSunflower sea star1,034Uncultured densovirusMT733032.175.4782.04E-97368
ParvoviridaeCrassostrea ariakensisSuminoe oyster1,093AmbidensovirusKY548840.187.9291.00E-041,061
ParvoviridaeYangtze RiverN/A1,093Parvoviridae sp. IsolateMW348572.179.6423.23E-175627
ParvoviridaePyloric caecaStarfish1,093Uncultured densovirusMN190158.173.8991.31E-89342
ParvoviridaePycnopodia helianthoidesSunflower sea star1,093Uncultured densovirusMT733031.173.0896.19E-78303
ParvoviridaePisaster ochraceusPurple sea star1,093Uncultured densovirusMT733037.173.0112.88E-76298
ParvoviridaePyloric caecaStarfish1,093Uncultured densovirusMT733024.172.9822.88E-76298
ParvoviridaePisaster ochraceusPurple sea star1,093Uncultured densovirusMT733041.175.3625.25E-1491.6
CircoviridaeLutjanus campechanusRed snapper1,312Circoviridae sp. isolateMH617401.197.1621.00E-041784
CircoviridaeLutjanus campechanusRed snapper1,312Circoviridae sp. isolateMH617399.178.8087.89E-1699

Canu v2.2 assembled vertebrate and invertebrate viral contigs from pooled sea lion fecal samples (n = 5) that shared the same viral families between BLASTx and BLASTn.

The contigs with the same size (nucleotide, nt) were the same contigs, but resulted in multiple alignments from different taxonomic sources.

4. Discussion

4.1. Viral metagenomics using nanopore sequencing and de novo sequence assembly

Assembling reliable metagenomic sequencing data is critical to characterizing viral diversity in marine environments. De novo assembly is important because it allows researchers to construct genomes without the need for a reference genome or when reference genomes are not available. De novo assembly can also discover novel genes and genetic variants of viruses (Allen et al., 2011; Chiu, 2013; Carbo et al., 2020). Several nanopore-based sequencing bioinformatic tools have been developed to handle the long-read sequencing reads, but many lack performance testing on unknown viral metagenomic sequencing dataset (Li, 2016; Kamath et al., 2017; Koren et al., 2017; Wang et al., 2018; Kolmogorov et al., 2020; Ruan and Li, 2020). This work aimed at filling this gap by using viral metagenomics data generated from marine mammal fecal and serum specimens with the nanopore sequencing platform.

Long-read sequencing technologies, such as the MinION nanopore sequencer, have several advantages over short-read sequencing technologies, e.g., it is portable, it does not require large imaging equipment to detect DNA nucleotides, lower cost, it can be powered through a Universal Serial Bus (USB) port, and it can be used in the field (Kono and Arakawa, 2019). In addition, nanopore sequencing can sequence longer stretches of DNA (>500 bp) (Adewale, 2020), can pick up long repetitive sequences (Kono and Arakawa, 2019), does not require fragmentation, and can directly sequence RNA molecules (Garalde et al., 2018). Although short-read sequencing can produce more reads with shorter lengths (<300 bp) (Hu et al., 2021), long-read sequencing is capable of sequencing full viral genomes, thus making assembly less error prone (Kono and Arakawa, 2019). Our results showed variation in results between the two de novo assemblers, Canu and Metaflye. Although Metaflye was a faster assembler that generated higher N50, it produced less viral alignments against the NCBI RefSeq viral protein database using BLASTx as compared to Canu. A study comparing Canu and Metaflye using sequences of bacteria, mammal, plant and fungi revealed that Metaflye outcompeted Canu with larger N50 values, but high error rates when assembling full genomes against reference sequences (Dida and Yi, 2021). Other studies show Canu and Metaflye performing similarly (Jung et al., 2020; Wang et al., 2021). Recent genome assembly pipelines created for viruses are using Canu for preprocessing of reads before reference alignment (Roach et al., 2022; Yu et al., 2023), suggesting that this assembler could produce quality assemblies for metagenomics analysis. De novo algorithms that generate large contigs and high N50s are considered good quality for genome assembly, but sometimes the results are inaccurate and produce more mismatches (Dida and Yi, 2021), which could be why Metaflye produced mismatches between viral contigs aligned against protein and nucleotide sequence databases for invertebrate and vertebrate viral families.

Canu was the only assembler that produced contigs that matched the same viral families between the protein and nucleotide databases from marine mammal fecal samples. This suggests that Canu could assemble contigs that are more accurate for viral annotation from environmental samples as compared to Metaflye. Canu is an OLC algorithm based de novo assembler, and is documented to have less error rates compared to DBG assemblers for long-reads (Kono and Arakawa, 2019). Vertebrate and invertebrate viruses in fecal samples of Parvoviridae, Anelloviridae, and Circoviridae families were confirmed with Canu assembler for both protein and nucleotide BLAST searches. In this study, there was a discrepancy between protein BLASTx and nucleotide BLASTn results for serum samples. For example, all the invertebrate and vertebrate viral contigs in serum samples identified using BLASTx against the protein database aligned with bacteriophages when using BLASTn against the nucleotide database. This could be that the lowest amount of sequence reads were generated in serum samples and both Canu and Metaflye are more accurate with higher sequencing read counts. De novo assembly could be less accurate with low sequencing depth due to insufficient genome coverage and limited redundancy for genome regions. De novo assemblers rely on read overlapping for OLC (Koren et al., 2017) or DBG (Kolmogorov et al., 2020) graph construction for contig or scaffold assembly. A low sequencing depth may result in reduced overlapping information making it harder for the algorithms to accurately detect sequencing overlaps. In addition, low read count may produce shorter contigs resulting in a fragmented assembly that may not accurately represent true structure of the sequenced genome regions. As seen in this study, the shortest mean nucleotide contig lengths were observed in serum samples and this could cause discrepancy in results between the two BLAST search programs. A future study could include a mock viral community to establish a baseline for sequencing reads prior to de novo assembly to better understand the relationship between sequencing depth and de novo assembly for viral metagenomics.

4.2. The detection of invertebrate and vertebrate viruses in fecal and serum samples of marine mammals using metagenomics

In this study, several viral contigs from dolphin and sea lion fecal samples aligned with viruses that were also isolated from human, mammal, seals, sea stars, bivalves, fish, birds and crayfish. These viruses include annelloviruses (AV), torque teno viruses (TTV), circoviruses and densoviruses. Multiple taxonomic sources aligned with the same viral contigs, which indicates that some viruses may infect multiple hosts (Tables 34). These results suggest that marine mammals can be used as important sentinel species (Bossart, 2006, 2011) to monitor marine environments for viruses that may spillover to different organisms.

First discovered in a serum samples from Hepatitis B (HBV) and Hepatitis C (HCV) patients (Khudair et al., 2019), TTV are single-stranded circular DNA (ssDNA) viruses, with a ~ 3.8 kb genome size and are currently classified into the Anelloviridae family under the genus Alphatorquevirus (Lolomadze and Rebrikov, 2020). TTV are known to be a diverse viral group with over 20 genotypes and 40% viral genome heterogeneity with frequent recombination in the N22 region of the open reading frame (ORF) 1 gene (Manni et al., 2002; Hino and Miyata, 2007; Hsiao et al., 2021). The TTV-like mini virus (TTMV) was added to the TTV group in 2000 (Takahashi et al., 2000) and has a genome size of ~2.9 kb (Hino and Miyata, 2007). Torque teno midi virus (TTMDV) is generally considered a non-pathogenic virus, and is commonly found in the virome of human blood (Cebriá-Mendoza et al., 2021). TTV have also been proposed as a fecal viral indicator for monitoring water quality (Griffin et al., 2008; Hamza et al., 2011; Haramoto et al., 2018; Tavakoli Nick et al., 2019). Viral contigs detected in dolphin fecal samples aligned with TTV’s from human (32% aa identity), South American fur seal (Arctocephalus australis) (79%–84% nt identities), and harbor seal (Phoca vitulina) (33% aa identity) sources. The South American fur seal torque teno arctecephalus australis virus 1 that aligned at 79% nt identity with the dolphin viral contig was detected in 2018 from seals found dead on the Rio Grande do Sul State shore in Brazil (Canova et al., 2021). The TTV that aligned with human could be indicative of a multi-host virus and possible sources from human wastewater or stormwater runoff. TTV are known to be highly resistant to wastewater treatment processes and are frequently detected in wastewater influent samples (Carducci et al., 2006; Haramoto et al., 2008; Plummer et al., 2014; Tavakoli Nick et al., 2019).

The Circoviridae family are circular ssDNA viruses with genome sizes of 1.7–2.1 kb and are composed of two genera Circovirus and Cyclovirus (Breitbart et al., 2017). Viral contigs from sea lion fecal samples aligned with viruses from the Circoviridae family of clams (Paphies subtriangulata) (35%–40.1% aa identity), red snappers (Lutjanus campechanus) (79%–97% nt identity) and rainbow trout (Oncorhynchus mykiss) (92% nt identity) sources. The Avon-Heathcote Estuary associated circular virus 28 from clams that aligned with a sea lion viral contig is from a group of single-stranded DNA (ssDNA) viruses encoding a replication-associated protein (Rep) (CRESS) viruses (Dayaram et al., 2015). Circoviridae species have unknown implications of disease but can cause infections in fish. In fish, circoviruses have been associated with skin and fin infections, called cauliflower disease (Doszpoly et al., 2014).

The Parvoviridae family are non-enveloped ssDNA viruses with linear genomes of 4–6 kb, and are split up into two subfamilies Parvovirinae and Densovirinae (Cotmore et al., 2019). The subfamily, Densovirinae (commonly referred to as densoviruses) infect insects and invertebrates, notably decapod crustaceans (shrimp and crayfish) (Jackson et al., 2020) Viral contigs from dolphin and sea lion fecal samples aligned with sea star associated densovirus (SSaDV). SSaDV is associated with sea star wasting disease (SSWD) that caused mass mortality events in >20 species of asteroids since 2013 from Alaska to Southern California (Hewson et al., 2014, 2018). SSaDV from dolphin fecal samples aligned with SSaDV from the North Pacific sea star (46%–59% aa identity; 79% nt identitiy). Viral contigs from sea lion fecal samples aligned with diverse densoviruses from comb sea star (Astropcten polyacanthus) (80% nt identity), purple sea star (Piesaster ochraceus) (77% nt identity), sunflower sea star (75% nt identity), and starfish (Pyloric caeca) (74% nt identity) sources. Sea stars are considered keystone species that play a role in regulating other species, such as bivalves, snails and other invertebrates in marine ecosystems (Paine, 1966; Collinge et al., 2008; Menge and Sanford, 2013). Viral contigs from sea lion fecal samples aligned with Cherax quadricarinatus densovirus (CqDV) from the Australian red claw crayfish (Cherax quadricarinatus) (35.3%–79.6% aa identity). CqDV has been linked to mortalities in red claw crayfish, which is a significant threat to the aquaculture industry (Saoud et al., 2013; Bochow et al., 2015). In addition, crayfish has an important ecological role in maintaining water quality (Chen et al., 2022), economic importance by providing food and jobs (McClain and Romaire, 2004), cultural importance with festivals (Gutierrez, 1998), and scientific importance for studying neurobiology, behavior and genetics (Huber et al., 2011; Jiang et al., 2014; Bacqué-Cazenave et al., 2017; D’Agnese et al., 2020). Viral contigs from dolphin fecal samples aligned with AmDNV from the Suminoe oyster (Crassostrea ariakensis) (76.7% nt identity). AmDNV was first isolated in 2017 from the Suminoe oysters (Crassostrea ariakensis) and shared similar amino acid identities with SSaDV (76–89% nt identity). But AmDNV has not been documented to cause mortality in oysters, instead most likely originated from the surrounding ocean ecosystem (Kang et al., 2017). Viral contigs from sea lion fecal samples aligned with AmDNV from Suminoe oysters (76%–88% nt identity) from Wuxi City, Jiangsu province, China (Kang et al., 2017). Another viral contig from sea lion fecal samples aligned with a AmDNV from a mute swan (Cygnus olor) (84% nt identity) and is not associated with disease in this bird species and more research is needed to understand the potential impact on bird health.

4.3. Study limitations

The use of filtration and DNase treatment could potentially reduce viral concentrations and diversity due to losses during these sample processing steps. DNase is an enzyme that breaks down extracellular “naked” or “free” DNA that are present in fecal and serum samples. In theory, intact viral particles should be resistant to DNase treatment through their protective protein coat such as a capsid and/or envelope, but some DNA viruses may be sensitive to DNase treatments. Studies have shown decreasing viral particles between samples with and without DNase treatment (Bettarel et al., 2000; Briese et al., 2015). Future studies could incorporate the use of biotinylated oligonucleotide probes (targeted viral enrichment) to capture viruses from complex matrices after sequencing library preparation to avoid upstream enzymatic treatments (Briese et al., 2015; Martínez-Puchol et al., 2020, 2022; Bonny et al., 2021). Although several studies use filtration (Allen et al., 2011; Fontenele et al., 2019; Garcia-Heredia et al., 2021; Patterson et al., 2021; Crum et al., 2023; LaRocca et al., 2023) to reduce background host and bacterial nucleic acids, it too can cause losses of viruses in filtration.

Although the third generation nanopore sequencing produces longer sequencing reads, it can be prone to higher error rates and low-quality scores, compared to short-read sequencing. The use of polishing after de novo assembly for error correction of substitutions, insertions and deletions has proved to increase assembly accuracy (Huang et al., 2021; Lee et al., 2021; Liu et al., 2022). Medaka polisher has been shown to decrease error rates after assemblies with MetaFlye or Canu (Goldsmith et al., 2020, 2021; Brancaccio et al., 2021; Wick et al., 2021). Future studies could include different combinations or rounds of polishing with Medaka or a combination of multiple types of polishers to enhance genome assembly accuracy. Furthermore, deeper long-read nanopore sequencing flow-cells such as the ONT PromethION (Yahara et al., 2021) with the updated Dorado4 (Pugh, 2023) base-caller and flow-cell chemistries that can reach quality scores of ~20 could be used in future studies to increase sequencing reads. In this study, different versions of the MinKNOW and Guppy basecaller software were used, due to frequent updates of the software packages by ONT. The different software updates would not affect the results of this study, because the same sequence files were used to compare Canu and MetaFlye assemblies. However, this may produce varying basecalling and/or read depth results when sequencing of viruses in environmental samples.

5. Conclusion

In this study, long-read metagenomics sequencing using a portable nanopore MinION sequencer allowed the detection of viruses in fecal and serum samples of marine mammals that are known to cause diseases in vertebrate and invertebrate hosts. This suggests that marine mammals could be used as sentinel species of ocean and human health by monitoring for emerging pathogens under a One Health framework. This sequencing method coupled with the sequence assembler, Canu identified Parvoviridae, Annelloviridae and Circoviridae that were confirmed with the NCBI viral protein (BLASTx) and nucleotide (BLASTn) Refseq databases. The data analysis approach presented here will be useful for virus surveillance using a long-read metagenomics sequencing.

Funding

This research was funded by the Office of Naval Research (ONR), U.S. Department of the Navy under grant no. N00014-20-1-2117.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: NCBI—PRJNA998092.

Ethics statement

Samples from Navy animals were collected during their routine care and under the authority codified in U.S. Code, Title 10, Section 7524. Secretary of Navy Instruction 3900.41H directs that Navy marine mammals be provided the highest quality of care. The U.S. Navy Marine Mammal Program (MMP), Naval Information Warfare Center (NIWC) Pacific, houses, and cares for a population of bottlenose dolphins and California sea lions in San Diego Bay (CA, United States). NIWC Pacific is accredited by The Association for Assessment and Accreditation of Laboratory Animal Care (AAALAC) International and adheres to the national standards of the U.S. Public Health Service policy on the Humane Care and Use of Laboratory Animals and the Animal Welfare Act. The study was conducted in accordance with the local legislation and institutional requirements.

Author contributions

KV: conceptualization, methodology, validation, formal analysis, investigation, data curation, writing—original draft, visualization. TA: conceptualization, resources, writing—review and editing, supervision, project administration, funding acquisition. All authors contributed to the article and approved the submitted version.

Acknowledgments

This research was supported in part using high performance computing (HPC) resources and services provided by Information Technology at Tulane University, New Orleans, LA. We would like to thank Carl Baribault for his bioinformatics expertise throughout this project and troubleshooting different in silico pipelines.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2023.1248323/full#supplementary-material

References

  • 1

    AdewaleB. A. (2020). Will long-read sequencing technologies replace short-read sequencing technologies in the next 10 years?Afr. J. Lab. Med.9, 15. doi: 10.4102/ajlm.v9i1.1340

  • 2

    AllenL. Z.IshoeyT.NovotnyM. A.McLeanJ. S.LaskenR. S.WilliamsonS. J. (2011). Single virus genomics: a new tool for virus discovery. PLoS One6:e17722. doi: 10.1371/journal.pone.0017722

  • 3

    AryaP., Metagenomics based approach to reveal the secrets of unculturable microbial diversity from aquatic environment, in MandalS.DeBhattP., eds., Recent advancements in microbial diversity. (2020), Amsterdam: Elsevier. p. 537559.

  • 4

    Bacqué-CazenaveJ.CattaertD.DelbecqueJ. P.FossatP. (2017). Social harassment induces anxiety-like behaviour in crayfish. Sci. Rep.7:39935. doi: 10.1038/srep39935

  • 5

    BettarelY.Sime-NgandoT.AmblardC.LaveranH. (2000). A comparison of methods for counting viruses in aquatic systems. Appl. Environ. Microbiol.66, 22832289. doi: 10.1128/AEM.66.6.2283-2289.2000

  • 6

    BieringerM.HanJ. W.KendlS.KhosraviM.PlattetP.Schneider-SchauliesJ. (2013). Experimental adaptation of wild-type canine distemper virus (CDV) to the human entry receptor CD150. PLoS One8:e57488. doi: 10.1371/journal.pone.0057488

  • 7

    BochowS.CondonK.EllimanJ.OwensL. (2015). First complete genome of an Ambidensovirus; Cherax quadricarinatus densovirus, from freshwater crayfish Cherax quadricarinatus. Mar. Genomics24, 305312. doi: 10.1016/j.margen.2015.07.009

  • 8

    BogomolniA. L.BassA. L.FireS.JasperseL.LevinM.NielsenO.et al. (2016). Saxitoxin increases phocine distemper virus replication upon in-vitro infection in harbor seal immune cells. Harmful Algae51, 8996. doi: 10.1016/j.hal.2015.10.013

  • 9

    BonnyP.SchaefferJ.BesnardA.DesdouitsM.NgangJ. J. E.le GuyaderF. S. (2021). Human and animal RNA virus diversity detected by metagenomics in Cameroonian clams. Front. Microbiol.12:770385. doi: 10.3389/fmicb.2021.770385

  • 10

    BossartG. D. (2006). Marine mammals as sentinel species for oceans and human health. Oceanography19, 134137. doi: 10.5670/oceanog.2006.77

  • 11

    BossartG. D. (2011). Marine mammals as sentinel species for oceans and human health. Vet. Pathol.48, 676690. doi: 10.1177/0300985810388525

  • 12

    BrancaccioR. N.RobitailleA.DuttaS.RollisonD. E.TommasinoM.GheitT. (2021). MinION nanopore sequencing and assembly of a complete human papillomavirus genome. J. Virol. Methods294:114180. doi: 10.1016/j.jviromet.2021.114180

  • 13

    BreitbartM.DelwartE.RosarioK.SegalésJ.VarsaniA.ICTV Report Consortium (2017). ICTV virus taxonomy profile: circoviridae. J. Gen. Virol.98, 19971998. doi: 10.1099/jgv.0.000871

  • 14

    BreitbartM.SalamonP.AndresenB.MahaffyJ. M.SegallA. M.MeadD.et al. (2002). Genomic analysis of uncultured marine viral communities. Proc. Natl. Acad. Sci.99, 1425014255. doi: 10.1073/pnas.202488399

  • 15

    BrieseT.KapoorA.MishraN.JainK.KumarA.JabadoO. J.et al. (2015). Virome capture sequencing enables sensitive viral diagnosis and comprehensive virome analysis. MBio6, e01491e01415. doi: 10.1128/mBio.01491-15

  • 16

    BrownE.FreimanisG.ShawA. E.HortonD. L.GubbinsS.KingD. (2021). Characterising foot-and-mouth disease virus in clinical samples using nanopore sequencing. Front. Vet. Sci.8:656256. doi: 10.3389/fvets.2021.656256

  • 17

    BurekK. A.GullandF. M.O'HaraT. M. (2008). Effects of climate change on Arctic marine mammal health. Ecol. Appl.18, S126S134. doi: 10.1890/06-0553.1

  • 18

    CanovaR.BudaszewskiR. F.WeberM. N.da SilvaM. S.PuhlD. E.BattistiL. O.et al. (2021). Spleen and lung virome analysis of South American fur seals (Arctocephalus australis) collected on the southern Brazilian coast. Infect. Genet. Evol.92:104862. doi: 10.1016/j.meegid.2021.104862

  • 19

    CarboE. C.SidorovI. A.Zevenhoven-DobbeJ. C.SnijderE. J.ClaasE. C.LarosJ. F. J.et al. (2020). Coronavirus discovery by metagenomic sequencing: a tool for pandemic preparedness. J. Clin. Virol.131:104594. doi: 10.1016/j.jcv.2020.104594

  • 20

    CarducciA.VeraniM.BattistiniR.PizziF.RoviniE.AndreoliE.et al. (2006). Epidemiological surveillance of human enteric viruses by monitoring of different environmental matrices. Water Sci. Technol.54, 239244. doi: 10.2166/wst.2006.475

  • 21

    CastinelA.DuignanP. J.PomroyW. E.López-VillalobosN.GibbsN. J.ChilversB. L.et al. (2007). Neonatal mortality in New Zealand sea lions (Phocarctos hookeri) at Sandy Bay, Enderby Island, Auckland islands from 1998 to 2005. J. Wildl. Dis.43, 461474. doi: 10.7589/0090-3558-43.3.461

  • 22

    Cebriá-MendozaM.BrachoM. A.ArbonaC.LarreaL.DíazW.SanjuánR.et al. (2021). Exploring the diversity of the human blood virome. Viruses13:2322. doi: 10.3390/v13112322

  • 23

    ChenL.XuJ.WanW.XuZ.HuR.ZhangY.et al. (2022). The microbiome structure of a rice-crayfish integrated breeding model and its association with crayfish growth and water quality. Microbiol. Spect.10, e02204e02221. doi: 10.1128/spectrum.02204-21

  • 24

    ChinC.-S.AlexanderD. H.MarksP.KlammerA. A.DrakeJ.HeinerC.et al. (2013). Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods10, 563569. doi: 10.1038/nmeth.2474

  • 25

    ChinC.-S.PelusoP.SedlazeckF. J.NattestadM.ConcepcionG. T.ClumA.et al. (2016). Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Methods13, 10501054. doi: 10.1038/nmeth.4035

  • 26

    ChiuC. Y. (2013). Viral pathogen discovery. Curr. Opin. Microbiol.16, 468478. doi: 10.1016/j.mib.2013.05.001

  • 27

    ClarkC.McIntyreP. G.EvansA.McInnesC. J.Lewis-JonesS. (2005). Human sealpox resulting from a seal bite: confirmation that sealpox virus is zoonotic. Br. J. Dermatol.152, 791793. doi: 10.1111/j.1365-2133.2005.06451.x

  • 28

    CohenJ. M.SauerE. L.SantiagoO.SpencerS.RohrJ. R. (2020). Divergent impacts of warming weather on wildlife disease risk across climates. Science370:eabb1702. doi: 10.1126/science.abb1702

  • 29

    CollingeS. K.RayC.CullyJ. (2008). Effects of disease on keystone species, dominant species, and their communities. eds. OstfeldR.KeesingF.EvinerV.. Infect. Dis. Ecol., 129144. doi: 10.1515/9781400837885.129

  • 30

    CompeauP. E.PevznerP. A.TeslerG. (2011). How to apply de Bruijn graphs to genome assembly. Nat. Biotechnol.29, 987991. doi: 10.1038/nbt.2023

  • 31

    CotmoreS. F.Agbandje-McKennaM.CanutiM.ChioriniJ. A.Eis-HubingerA. M.HughesJ.et al. (2019). ICTV virus taxonomy profile: Parvoviridae. J. Gen. Virol.100, 367368. doi: 10.1099/jgv.0.001212

  • 32

    CrumE.MerchantZ.EneA.Miller-EnsmingerT.JohnsonG.WolfeA. J.et al. (2023). Coliphages of the human urinary microbiota. PLoS One18:e0283930. doi: 10.1371/journal.pone.0283930

  • 33

    D’AgneseE.LambournD.RiceJ.DuffieldD.HugginsJ.SprakerT.et al. (2020). Reemergence of Guadalupe fur seals in the U.S. Pacific northwest: the epidemiology of stranding events during 2005–2016. Mar. Mamm. Sci.36, 828845. doi: 10.1111/mms.12678

  • 34

    DayaramA.GoldstienS.Argüello-AstorgaG. R.Zawar-RezaP.GomezC.HardingJ. S.et al. (2015). Diverse small circular DNA viruses circulating amongst estuarine molluscs. Infect. Genet. Evol.31, 284295. doi: 10.1016/j.meegid.2015.02.010

  • 35

    DidaF.YiG. (2021). Empirical evaluation of methods for de novo genome assembly. PeerJ Comput. Sci.7:e636. doi: 10.7717/peerj-cs.636

  • 36

    DietzR.Heide-JørgensenM. P.HärkönenT. (1989). Mass deaths of harbor seals (Phoca vitulina) in Europe. Ambio (Sweden)18, 258264.

  • 37

    DoszpolyA.TarjánZ. L.GlávitsR.MüllerT.BenkőM. (2014). Full genome sequence of a novel circo-like virus detected in an adult European eel Anguilla anguilla showing signs of cauliflower disease. Dis. Aquat. Org.109, 107115. doi: 10.3354/dao02730

  • 38

    DuignanP. J.van BressemM. F.BakerJ.BarbieriM.ColegroveK.de GuiseS.et al. (2014). Phocine distemper virus: current knowledge and future directions. Viruses6, 50935134. doi: 10.3390/v6125093

  • 39

    FireS. E.BrowningJ. A.DurdenW. N.StolenM. K. (2020). Comparison of during-bloom and inter-bloom brevetoxin and saxitoxin concentrations in Indian river lagoon bottlenose dolphins, 2002–2011. Aquat. Toxicol.218:105371. doi: 10.1016/j.aquatox.2019.105371

  • 40

    FlewellingL. J.NaarJ. P.AbbottJ. P.BadenD. G.BarrosN. B.BossartG. D.et al. (2005). Red tides and marine mammal mortalities. Nature435, 755756. doi: 10.1038/nature435755a

  • 41

    FonteneleR. S.LacorteC.LamasN. S.SchmidlinK.VarsaniA.RibeiroS. G. (2019). Single stranded DNA viruses associated with capybara faeces sampled in Brazil. Viruses11:710. doi: 10.3390/v11080710

  • 42

    GaraldeD. R.SnellE. A.JachimowiczD.SiposB.LloydJ. H.BruceM.et al. (2018). Highly parallel direct RNA sequencing on an array of nanopores. Nat. Methods15, 201206. doi: 10.1038/nmeth.4577

  • 43

    Garcia-HerediaI.BhattacharjeeA. S.FornasO.GomezM. L.MartínezJ. M.Martinez-GarciaM. (2021). Benchmarking of single-virus genomics: a new tool for uncovering the virosphere. Environ. Microbiol.23, 15841593. doi: 10.1111/1462-2920.15375

  • 44

    GoldsmithC.CohenD.DuboiA.MartinezM.-G.PetitjeanK.CorluA.et al. (2020). Long read sequencing and de novo assembly of hepatitis B virus identifies 5mCpG in CpG islands. bioRxiv. doi: 10.1101/2020.05.29.122259

  • 45

    GoldsmithC.CohenD.DuboisA.MartinezM. G.PetitjeanK.CorluA.et al. (2021). Cas9-targeted nanopore sequencing reveals epigenetic heterogeneity after de novo assembly of native full-length hepatitis B virus genomes. Microbial. Genomics7:000507. doi: 10.1099/mgen.0.000507

  • 46

    GriffinJ. S.PlummerJ. D.LongS. C. (2008). Torque Teno virus: an improved indicator for viral pathogens in drinking waters. Virol. J.5:112. doi: 10.1186/1743-422X-5-112

  • 47

    GullandF. M., (2000) Domoic acid toxicity in California Sea lions (Zalophus Californianus) stranded along the Central California coast, may-October 1998: Report to the National Marine Fisheries Service Working Group on unusual marine mammal mortality events. Vol. 1. US Department of Commerce, National Oceanic and Atmospheric Administration

  • 48

    GutierrezC. P. (1998). “Cajuns and crawfish” in The taste of American place: a reader on regional and ethnic foods. eds. ShortridgeB. G.ShortridgeJ. R. (Lanham, MD: Rowman & Littlefield), 139144.

  • 49

    HamzaI. A.JurzikL.ÜberlaK.WilhelmM. (2011). Evaluation of pepper mild mottle virus, human picobirnavirus and torque Teno virus as indicators of fecal contamination in river water. Water Res.45, 13581368. doi: 10.1016/j.watres.2010.10.021

  • 50

    HaramotoE.KatayamaH.OhgakiS. (2008). Quantification and genotyping of torque Teno virus at a wastewater treatment plant in Japan. Appl. Environ. Microbiol.74, 74347436. doi: 10.1128/AEM.01605-08

  • 51

    HaramotoE.KitajimaM.HataA.TorreyJ. R.MasagoY.SanoD.et al. (2018). A review on recent progress in the detection methods and prevalence of human enteric viruses in water. Water Res.135, 168186. doi: 10.1016/j.watres.2018.02.004

  • 52

    HayashidaK.OrbaY.SequeiraP. C.SugimotoC.HallW. W.EshitaY.et al. (2019). Field diagnosis and genotyping of chikungunya virus using a dried reverse transcription loop-mediated isothermal amplification (LAMP) assay and MinION sequencing. PLoS Negl. Trop. Dis.13:e0007480. doi: 10.1371/journal.pntd.0007480

  • 53

    HendrixA. M.LefebvreK. A.QuakenbushL.BryanA.StimmelmayrR.SheffieldG.et al. (2021). Ice seals as sentinels for algal toxin presence in the Pacific Arctic and subarctic marine ecosystems. Mar. Mamm. Sci.37, 12921308. doi: 10.1111/mms.12822

  • 54

    HewsonI.BistolasK. S. I.Quijano CardéE. M.ButtonJ. B.FosterP. J.FlanzenbaumJ. M.et al. (2018). Investigating the complex association between viral ecology, environment, and Northeast Pacific Sea star wasting. Front. Mar. Sci.5:77. doi: 10.3389/fmars.2018.00077

  • 55

    HewsonI.ButtonJ. B.GudenkaufB. M.MinerB.NewtonA. L.GaydosJ. K.et al. (2014). Densovirus associated with sea-star wasting disease and mass mortality. Proc. Natl. Acad. Sci.111, 1727817283. doi: 10.1073/pnas.1416625111

  • 56

    HinoS.MiyataH. (2007). Torque teno virus (TTV): current status. Rev. Med. Virol.17, 4557. doi: 10.1002/rmv.524

  • 57

    HsiaoK.-L.WangL. Y.ChengJ. C.ChengY. J.LinC. L.LiuH. F. (2021). Detection and genetic characterization of the novel torque Teno virus group 6 in Taiwanese general population. R. Soc. Open Sci.8:210938. doi: 10.1098/rsos.210938

  • 58

    HuT.ChitnisN.MonosD.DinhA. (2021). Next-generation sequencing technologies: an overview. Hum. Immunol.82, 801811. doi: 10.1016/j.humimm.2021.02.012

  • 59

    HuangY.-T.LiuP.-Y.ShihP.-W. (2021). Homopolish: a method for the removal of systematic errors in nanopore sequencing by homologous polishing. Genome Biol.22, 117. doi: 10.1186/s13059-021-02282-6

  • 60

    HuberR.PankseppJ. B.NathanielT.AlcaroA.PankseppJ. (2011). Drug-sensitive reward in crayfish: an invertebrate model system for the study of SEEKING, reward, addiction, and withdrawal. Neurosci. Biobehav. Rev.35, 18471853. doi: 10.1016/j.neubiorev.2010.12.008

  • 61

    IduryR. M.WatermanM. S. (1995). A new algorithm for DNA sequence assembly. J. Comput. Biol.2, 291306. doi: 10.1089/cmb.1995.2.291

  • 62

    JacksonE. W.WilhelmR. C.JohnsonM. R.LutzH. L.DanforthI.GaydosJ. K.et al. (2020). Diversity of sea star-associated densoviruses and transcribed endogenous viral elements of densovirus origin. J. Virol.95, e01594e01520. doi: 10.1128/JVI.01594-20

  • 63

    JiangH.XingZ.LuW.QianZ.YuH.LiJ. (2014). Transcriptome analysis of red swamp crawfish Procambarus clarkii reveals genes involved in gonadal development. PLoS One9:e105122. doi: 10.1371/journal.pone.0105122

  • 64

    JungH.JeonM. S.HodgettM.WaterhouseP.EyunS. I. (2020). Comparative evaluation of genome assemblers from long-read sequencing for plants and crops. J. Agric. Food Chem.68, 76707677. doi: 10.1021/acs.jafc.0c01647

  • 65

    KamathG. M.ShomoronyI.XiaF.CourtadeT. A.TseD. N. (2017). HINGE: long-read assembly achieves optimal repeat resolution. Genome Res.27, 747756. doi: 10.1101/gr.216465.116

  • 66

    KangY.-J.HuangW.ZhaoA. L.LaiD. D.ShaoL.ShenY. Q.et al. (2017). Densoviruses in oyster Crassostrea ariakensis. Arch. Virol.162, 21532157. doi: 10.1007/s00705-017-3343-z

  • 67

    KhudairE. A.Al-ShuwaikhA. M.FarhanN. M. (2019). Detection of TTV antigen in patients with hepatitis HBV and HCV. Iraqi J. Med. Sci.17, 4349. doi: 10.22578/IJMS.17.1.7

  • 68

    KolmogorovM.BickhartD. M.BehsazB.GurevichA.RaykoM.ShinS. B.et al. (2020). metaFlye: scalable long-read metagenome assembly using repeat graphs. Nat. Methods17, 11031110. doi: 10.1038/s41592-020-00971-x

  • 69

    KolmogorovM.YuanJ.LinY.PevznerP. A. (2019). Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol.37, 540546. doi: 10.1038/s41587-019-0072-8

  • 70

    KonoN.ArakawaK. (2019). Nanopore sequencing: review of potential applications in functional genomics. Develop. Growth Differ.61, 316326. doi: 10.1111/dgd.12608

  • 71

    KorenS.WalenzB. P.BerlinK.MillerJ. R.BergmanN. H.PhillippyA. M. (2017). Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res.27, 722736. doi: 10.1101/gr.215087.116

  • 72

    LaRoccaC. J.JacobsenK. L.InokoK.ZakharkinS. O.YamamotoM.DavydovaJ. (2023). Viral shedding in mice following intravenous adenovirus injection: impact on biosafety classification. Viruses15:1495. doi: 10.3390/v15071495

  • 73

    LeeJ. Y.KongM.OhJ.LimJ. S.ChungS. H.KimJ. M.et al. (2021). Comparative evaluation of nanopore polishing tools for microbial genome assembly and polishing strategies for downstream analysis. Sci. Rep.11:20740. doi: 10.1038/s41598-021-00178-w

  • 74

    LefebvreK. A.PowellC. L.BusmanM.DoucetteG. J.MoellerP. D. R.SilverJ. B.et al. (1999). Detection of domoic acid in northern anchovies and California sea lions associated with an unusual mortality event. Nat. Toxins7, 8592. doi: 10.1002/(SICI)1522-7189(199905/06)7:3<85::AID-NT39>3.0.CO;2-Q

  • 75

    LefebvreK. A.QuakenbushL.FrameE.HuntingtonK. B.SheffieldG.StimmelmayrR.et al. (2016). Prevalence of algal toxins in Alaskan marine mammals foraging in a changing arctic and subarctic environment. Harmful Algae55, 1324. doi: 10.1016/j.hal.2016.01.007

  • 76

    LiH. (2016). Minimap and miniasm: fast mapping and de novo assembly for noisy long sequences. Bioinformatics32, 21032110. doi: 10.1093/bioinformatics/btw152

  • 77

    LiZ.ChenY.MuD.YuanJ.ShiY.ZhangH.et al. (2012). Comparison of the two major classes of assembly algorithms: overlap–layout–consensus and de-Bruijn-graph. Brief. Funct. Genomics11, 2537. doi: 10.1093/bfgp/elr035

  • 78

    LiaoX.LiM.ZouY.WuF. X.Yi-PanWangJ. (2019). Current challenges and solutions of de novo assembly. Quant. Biol.7, 90109. doi: 10.1007/s40484-019-0166-9

  • 79

    LiuL.YangY.DengY.ZhangT. (2022). Nanopore long-read-only metagenomics enables complete and high-quality genome reconstruction from mock and complex metagenomes. Microbiome10:209. doi: 10.1186/s40168-022-01415-8

  • 80

    LolomadzeE. A.RebrikovD. V. (2020). Constant companion: clinical and developmental aspects of torque Teno virus infections. Arch. Virol.165, 27492757. doi: 10.1007/s00705-020-04841-x

  • 81

    ManniF.RotolaA.CaselliE.BertorelleG.di LucaD. (2002). Detecting recombination in TT virus: a phylogenetic approach. J. Mol. Evol.55, 563572. doi: 10.1007/s00239-002-2352-y

  • 82

    Martínez-PucholS.CardonaL.DragoM.GazoM.Bofill-MasS. (2022). Viral metagenomics reveals persistent as well as dietary acquired viruses in Antarctic fur seals. Sci. Rep.12:18207. doi: 10.1038/s41598-022-23114-y

  • 83

    Martínez-PucholS.RusiñolM.Fernández-CassiX.TimonedaN.ItarteM.AndrésC.et al. (2020). Characterisation of the sewage virome: comparison of NGS tools and occurrence of significant pathogens. Sci. Total Environ.713:136604. doi: 10.1016/j.scitotenv.2020.136604

  • 84

    McClainW. R.RomaireR. P. (2004). Crawfish culture: a Louisiana aquaculture success story. World Aquac.35, 3135.

  • 85

    MengeB. A.SanfordE. (2013). “Ecological role of sea stars from populations” in Starfish: Biology and ecology of the Asteroidea. ed. LawrenceJ. M. (Asteroidea: JHU Press), 67.

  • 86

    MillerJ. R.DelcherA. L.KorenS.VenterE.WalenzB. P.BrownleyA.et al. (2008). Aggressive assembly of pyrosequencing reads with mates. Bioinformatics24, 28182824. doi: 10.1093/bioinformatics/btn548

  • 87

    Munang'anduH.MugimbaK. K.ByarugabaD. K.MutolokiS.EvensenØ. (2017). Current advances on virus discovery and diagnostic role of viral metagenomics in aquatic organisms. Front. Microbiol.8:406. doi: 10.3389/fmicb.2017.00406

  • 88

    MyersE. W.SuttonG. G.DelcherA. L.DewI. M.FasuloD. P.FlaniganM. J.et al. (2000). A whole-genome assembly of drosophila. Science287, 21962204. doi: 10.1126/science.287.5461.2196

  • 89

    NeillJ. D.MeyerR. F.SealB. S. (1995). Genetic relatedness of the caliciviruses: San Miguel sea lion and vesicular exanthema of swine viruses constitute a single genotype within the Caliciviridae. J. Virol.69, 44844488. doi: 10.1128/jvi.69.7.4484-4488.1995

  • 90

    NobleR. T.FuhrmanJ. (1997). Virus decay and its causes in coastal waters. Appl. Environ. Microbiol.63, 7783. doi: 10.1128/aem.63.1.77-83.1997

  • 91

    OnensP.WilkinS.FauquierD.SpradlinT.ManleyS.GreigD.et al. (2023). 2019 Report of marine mammal Strandings in the United States: National Overview.

  • 92

    PaineR. T. (1966). Food web complexity and species diversity. Am. Nat.100, 6575. doi: 10.1086/282400

  • 93

    PattersonQ. M.KrabergerS.MartinD. P.SheroM. R.BeltranR. S.KirkhamA. L.et al. (2021). Circoviruses and cycloviruses identified in Weddell seal fecal samples from McMurdo Sound, Antarctica. Infect. Genet. Evol.95:105070. doi: 10.1016/j.meegid.2021.105070

  • 94

    PevznerP. A.TangH.WatermanM. S. (2001). An Eulerian path approach to DNA fragment assembly. Proc. Natl. Acad. Sci.98, 97489753. doi: 10.1073/pnas.171285098

  • 95

    PlummerJ. D.LongS. C.LiuZ.CharestA. A. (2014). Torque Teno virus occurrence and relationship to bacterial and viral indicators in feces, wastewaters, and waters in the United States. Environ. Eng. Sci.31, 671680. doi: 10.1089/ees.2014.0091

  • 96

    PomeroyP.HammondJ. A.HallA. J.LonerganM.DuckC. D.SmithV. J.et al. (2005). Morbillivirus neutralising antibodies in Scottish grey seals Halichoerus grypus: assessing the effects of the 1988 and 2002 PDV epizootics. Mar. Ecol. Prog. Ser.287, 241250. doi: 10.3354/meps287241

  • 97

    PopM.SalzbergS. L. (2008). Bioinformatics challenges of new sequencing technology. Trends Genet.24, 142149. doi: 10.1016/j.tig.2007.12.006

  • 98

    PughJ. (2023). The current state of nanopore sequencing. Methods Mol. Biol.2632, 314. doi: 10.1007/978-1-0716-2996-3_1

  • 99

    QuickJ.GrubaughN. D.PullanS. T.ClaroI. M.SmithA. D.GangavarapuK.et al. (2017). Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples. Nat. Protoc.12, 12611276. doi: 10.1038/nprot.2017.066

  • 100

    R Core Team. R: A language and environment for statistical computing (2013).

  • 101

    RoachM. J.BeecroftS. J.MihindukulasuriyaK. A.WangL.ParedesA.Henry-CocksK.et al. (2022). Hecatomb: an end-to-end research platform for viral metagenomics. bio Rxiv:2022.05.15.492003. doi: 10.1101/2022.05.15.492003

  • 102

    RoeW. D.RogersL.PinpimaiK.DittmerK.MarshallJ.ChilversB. L. (2015). Septicaemia and meningitis caused by infection of New Zealand sea lion pups with a hypermucoviscous strain of Klebsiella pneumoniae. Vet. Microbiol.176, 301308. doi: 10.1016/j.vetmic.2015.01.019

  • 103

    RoessA. A.LevineR. S.BarthL.MonroeB. P.CarrollD. S.DamonI. K.et al. (2011). Sealpox virus in marine mammal rehabilitation facilities, North America, 2007-2009. Emerg. Infect. Dis.17, 22032208. doi: 10.3201/eid1712.101945

  • 104

    RuanJ.LiH. (2020). Fast and accurate long-read assembly with wtdbg2. Nat. Methods17, 155158. doi: 10.1038/s41592-019-0669-3

  • 105

    SakaiK.YoshikawaT.SekiF.FukushiS.TaharaM.NagataN.et al. (2013). Canine distemper virus associated with a lethal outbreak in monkeys can readily adapt to use human receptors. J. Virol.87, 71707175. doi: 10.1128/JVI.03479-12

  • 106

    SandersonC. E.AlexanderK. A. (2020). Unchartered waters: climate change likely to intensify infectious disease outbreaks causing mass mortality events in marine mammals. Glob. Chang. Biol.26, 42844301. doi: 10.1111/gcb.15163

  • 107

    SaoudI. P.GhanawiJ.ThompsonK. R.WebsterC. D. (2013). A review of the culture and diseases of redclaw crayfish Cherax quadricarinatus (von martens 1868). J. World Aquacult. Soc.44, 129. doi: 10.1111/jwas.12011

  • 108

    SaranathanR.AsareE.LeungL.de OliveiraA. P.KaugarsK. E.MulhollandC. V.et al. (2022). Capturing structural variants of herpes simplex virus genome in full length by Oxford nanopore sequencing. Microbiol. Spectr.10:e0228522. doi: 10.1128/spectrum.02285-22

  • 109

    SiebertU.RademakerM.UlrichS. A.WohlseinP.RonnenbergK.Prenger-BerninghoffE. (2017). Bacterial microbiota in harbor seals (Phoca vitulina) from the North Sea of Schelswig-Holstein, Germany, around the time of morbillivirus and influenza epidemics. J. Wildl. Dis.53, 201214. doi: 10.7589/2015-11-320

  • 110

    SmithA. W.SkillingD. E.CherryN.MeadJ. H.MatsonD. O. (1998). Calicivirus emergence from ocean reservoirs: zoonotic and interspecies movements. Emerg. Infect. Dis.4, 1320. doi: 10.3201/eid0401.980103

  • 111

    TakahashiK.IwasaY.HijikataM.MishiroS. (2000). Identification of a new human DNA virus (TTV-like mini virus, TLMV) intermediately related to TT virus and chicken anemia virus. Arch. Virol.145, 979993. doi: 10.1007/s007050050689

  • 112

    Tavakoli NickS.MohebbiS. R.HosseiniS. M.MirjalaliH.AlebouyehM. (2019). Occurrence and molecular characterization of torque Teno virus (TTV) in a wastewater treatment plant in Tehran. J. Water Health17, 971977. doi: 10.2166/wh.2019.137

  • 113

    TørresenO. K.StarB.MierP.Andrade-NavarroM. A.BatemanA.JarnotP.et al. (2019). Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases. Nucleic Acids Res.47, 1099411006. doi: 10.1093/nar/gkz841

  • 114

    WangJ.ChenK.RenQ.ZhangY.LiuJ.WangG.et al. (2021). Systematic comparison of the performances of de novo genome assemblers for oxford nanopore technology reads from piroplasm. Front. Cell. Infect. Microbiol.11:696669. doi: 10.3389/fcimb.2021.696669

  • 115

    WangD.CoscoyL.ZylberbergM.AvilaP. C.BousheyH. A.GanemD.et al. (2002). Microarray-based detection and genotyping of viral pathogens. Proc. Natl. Acad. Sci. U. S. A.99, 1568715692. doi: 10.1073/pnas.242579699

  • 116

    WangH.MarcišauskasS.SánchezB. J.DomenzainI.HermanssonD.AgrenR.et al. (2018). RAVEN 2.0: a versatile toolbox for metabolic network reconstruction and a case study on Streptomyces coelicolor. PLoS Comput. Biol.14:e1006541. doi: 10.1371/journal.pcbi.1006541

  • 117

    WangD.UrismanA.LiuY. T.SpringerM.KsiazekT. G.ErdmanD. D.et al. (2003). Viral discovery and sequence recovery using DNA microarrays. PLoS Biol.1:e2:E2. doi: 10.1371/journal.pbio.0000002

  • 118

    WebsterR. G.GeraciJ.PeturssonG.SkirnissonK. (1981a). Conjunctivitis in human beings caused by influenza a virus of seals. N. Engl. J. Med.304:911. doi: 10.1056/NEJM198104093041515

  • 119

    WebsterR. G.HinshawV. S.BeanW. J.van WykeK. L.GeraciJ. R.St. AubinD. J.et al. (1981b). Characterization of an influenza a virus from seals. Virology113, 712724. doi: 10.1016/0042-6822(81)90200-2

  • 120

    WickR. R.JuddL. M.CerdeiraL. T.HawkeyJ.MéricG.VezinaB.et al. (2021). Trycycler: consensus long-read assemblies for bacterial genomes. Genome Biol.22, 117. doi: 10.1186/s13059-021-02483-z

  • 121

    YaharaK.SuzukiM.HirabayashiA.SudaW.HattoriM.SuzukiY.et al. (2021). Long-read metagenomics using PromethION uncovers oral bacteriophages and their interaction with host bacteria. Nat. Commun.12:27. doi: 10.1038/s41467-020-20199-9

  • 122

    YuR.CaiD.SunY. (2023). AccuVIR: an ACCUrate VIRal genome assembly tool for third-generation sequencing data. Bioinformatics39:btac827. doi: 10.1093/bioinformatics/btac827

  • 123

    ZiminA. V.MarçaisG.PuiuD.RobertsM.SalzbergS. L.YorkeJ. A. (2013). The MaSuRCA genome assembler. Bioinformatics29, 26692677. doi: 10.1093/bioinformatics/btt476

Summary

Keywords

nanopore sequencing, metagenomics, viruses, marine mammals, bioinformatics

Citation

Vigil K and Aw TG (2023) Comparison of de novo assembly using long-read shotgun metagenomic sequencing of viruses in fecal and serum samples from marine mammals. Front. Microbiol. 14:1248323. doi: 10.3389/fmicb.2023.1248323

Received

27 June 2023

Accepted

04 September 2023

Published

22 September 2023

Volume

14 - 2023

Edited by

Sara Louise Cosby, Queen's University Belfast, United Kingdom

Reviewed by

Massimiliano Orsini, Experimental Zooprophylactic Institute of the Venezie (IZSVe), Italy; Ryan Devaney, Queen's University Belfast, United Kingdom

Updates

Copyright

*Correspondence: Tiong Gim Aw,

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics