- 1Department of Microbiology, School of Medicine, Jiangsu University, Zhenjiang, Jiangsu, China
- 2Qinghai Institute for Endemic Disease Prevention and Control, Xining, China
- 3Department of Clinical Laboratory, Wuxi Blood Center, Wuxi, Jiangsu, China
Introduction: This study aimed to characterize the viral diversity in the blood of marmots in the Qinghai-Tibet Plateau region, to assess their role as potential viral reservoirs and evaluate the potential implications for wildlife and human health.
Methodology: Seventy marmot blood samples were collected from Yushu and Guoluo Prefectures in Qinghai Province. Viral communities were comprehensively analyzed using high-throughput sequencing and bioinformatics techniques.
Results: Analysis identified a wide range of viral families, including Anelloviridae, Flaviviridae, Parvoviridae, and Polyomaviridae, and revealed multiple novel viral sequences. Notably, we documented the first evidence of Anelloviridae in marmot serum; phylogenetic analysis indicated these sequences cluster with those from marmot feces and tissues, suggesting a natural host relationship. A critical finding was the detection of Tick-borne encephalitis virus, with a sequence highly similar to human-derived strains, implying potential involvement of marmots in the transmission cycle. Furthermore, identification of a novel polyomavirus was supported by prediction of all main large tumor antigen functional domains and motifs, including a putative nuclear localization signal between its LXCXE motif and origin-binding domain, typical of mammalian-infecting polyomaviruses. Comparative analysis revealed significant regional differences in viral diversity between sampling areas, potentially linked to local ecological factors.
Discussion: This study significantly expands the known viral diversity in marmots and underscores their role as potential zoonotic reservoirs. However, the functional and pathogenic implications of these viruses require further experimental validation. These findings highlight the importance of ongoing wildlife surveillance for understanding viral ecology and mitigating emerging public health risks.
1 Introduction
Marmots (genus Marmota) are large ground squirrels that are found primarily in temperate regions of North America, Europe and Asia (Arnold, 2019). These rodents perform vital ecosystem roles by improving soil aeration while serving as key prey for top predators including snow leopards (Panthera uncia) and wolves (Canis lupus), thus sustaining trophic dynamics across Central Asia and China’s Qinghai–Tibet Plateau (Lyngdoh et al., 2014; Karimov et al., 2018). Particularly, the Himalayan marmot, as an important rodent species of the Tibetan Plateau and surrounding areas, is not only a major reservoir for bacterial diseases such as plague but also harbors a variety of viral pathogens, including bloodborne viruses such as hepatitis virus (Millman et al., 1984; Ben-Ari et al., 2012; Ma et al., 2024). These viruses may affect their health and pose risks to other wildlife and even humans. Specifically, marmots may serve as reservoirs for several zoonotic viruses, making the monitoring of their viral load and health status crucial for wildlife conservation and public health (Luis et al., 2013).
Marmots are exposed to several bloodborne viruses, which may be transmitted via vector organisms such as ticks, fleas, and other ectoparasites (Islam et al., 2021). Studying these viruses in marmots is important for understanding the potential risks they pose to both wildlife populations and human health, particularly in regions where marmots live in close proximity to human settlements (Xi et al., 2022). Research has shown that marmots can harbor several families of viruses, including flaviviruses (Mlera and Bloom, 2018), Hepatitis B (Liu et al., 2020; Michalak, 2020), and even coronaviruses (Moreno et al., 2024). Understanding these viruses in marmots is key to preventing potential outbreaks and mitigating health risks.
Tick-borne encephalitis virus (TBEV) is a flavivirus transmitted by ticks and has been found in various wild rodents, including marmots (Lindquist and Vapalahti, 2008). The detection of the Himalayan subtype of Tick-borne encephalitis virus in marmots suggests that they may serve as amplifying hosts in the arbovirus transmission chain (Dai et al., 2018). This virus is transmitted through tick bites and is highly pathogenic to the human central nervous system (Bogovic and Strle, 2015). The detection of TBEV in marmots highlights their potential role in maintaining the virus in natural foci. Moreover, the presence of the Himalayan subtype of TBEV in marmots suggests that these animals may act as a bridge for the virus to spread across different regions. This is particularly concerning given the virus’s high pathogenicity in humans, causing severe neurological disorders such as encephalitis and meningitis. Understanding the role of marmots in the blood-mediated transmission of TBEV is crucial for developing effective surveillance and control strategies to mitigate the risk of human infection.
Detecting blood-borne viruses in marmots typically requires a combination of molecular and serological methods. Commonly used detection techniques include serological assays (Svoboda et al., 2014). Additionally, next-generation sequencing (NGS) methods enable comprehensive high-throughput analysis of viral diversity in marmot blood samples, serving as an effective tool for discovering novel viruses or detecting low-abundance viruses (Zhu et al., 2020; Hilt and Ferrieri, 2022). These comprehensive sampling and processing methods aim to enhance our understanding of viral dynamics in wildlife, reduce the risk of viral spillover, and provide critical support for public health and ecological conservation efforts.
Understanding the viral ecology of marmots is crucial for better risk assessment, particularly in regions where marmot habitats overlap with human activities (He et al., 2021). Monitoring blood-borne viruses in marmots helps track viral transmission dynamics, especially in areas where marmots serve as reservoir hosts for tick-borne viruses such as TBEV (Kwasnik et al., 2023; Pustijanac et al., 2023). To address these knowledge gaps, we conducted an analysis of marmot blood samples from Yushu Prefecture and Guoluo Prefecture in Qinghai Province. This study aims to enhance our understanding of viral dynamics in wildlife, reduce the risk of viral spillover, and provide support for public health and ecological conservation efforts.
2 Methods
2.1 Sample collection
We collected blood samples from 50 marmots in Yushu Prefecture and 20 marmots in Guoluo Prefecture, Qinghai Province. These marmots (Marmota himalayana) were captured in their natural habitats using humane live traps. In accordance with ethical guidelines, all marmots were anesthetized, and blood samples were obtained via cardiac puncture. After centrifugation, the supernatant was discarded, and the samples were stored at −80 °C for further analysis. Marmots in this study were captured using ketamine hydrochloride as the anesthetic and were subsequently euthanized. All animal procedures in this study were conducted in accordance with the guidelines of the Ethics Committee of Jiangsu University and were approved (Approval No. JSDX20231007002).
All samples were resuspended in Dulbecco’s phosphate buffered saline (DPBS), homogenized, and subjected to three freeze–thaw cycles. The supernatants were collected after centrifugation at 15,000 × g for 10 min at 4 °C.
2.2 Library construction
Each supernatant (500 μL) was filtered through a 0.45-μm syringe filter (Millipore) to remove large cell-sized particles and subsequently centrifuged at 12,000 × g for 5 min to collect the filtrate enriched in viral particles. To degrade unprotected nucleic acids, the filtrate was incubated with a dual-enzyme cocktail of DNase I and RNase A at 37 °C for 60 min (Zhang et al., 2014, 2016; Zhao et al., 2022). Detailed reagent specifications are provided in Table 1. Total nucleic acids (RNA and DNA) protected within viral capsids were then extracted using either the FastPure Viral DNA/RNA Mini Kit (Vazyme) or the QiAamp Viral RNA Mini Kit, following the manufacturers’ protocols. Reverse transcription was performed on the extracted nucleic acids using SuperScript III or SuperScript IV reverse transcriptase (Invitrogen) with random hexamer primers. For single-stranded DNA (ssDNA) viruses, ssDNA was converted to double-stranded DNA (dsDNA) using the Klenow fragment polymerase (New England BioLabs).
The resulting nucleic acids were prepared for sequencing using the Illumina Nextera XT DNA Sample Preparation Kit, with dual barcoding applied to distinguish between samples. A total of 70 pools were constructed, depending on the study design, and sequenced on the Illumina NovaSeq 6000 platform, generating 250-bp paired-end reads (Liu et al., 2016).
2.3 Bioinformatics analysis
The 250-bp paired-end reads generated for each pool were debarcoded using Illumina’s vendor software. An in-house analysis pipeline running on a 32-node Linux cluster was used to process the data. Reads were considered duplicates if bases 5 to 55 were identical and only one random copy of duplicates was kept. Clonal reads were removed and low sequencing quality tails were trimmed using Phred quality score 30 as the threshold. Adaptors were trimmed using the default parameters of VecScreen which is NCBI BLASTn with specialized parameters designed for adapter removal. The cleaned reads were then subjected to a DIAMOND BLASTx search against an in-house non-virus non-redundant (NVNR) protein database (Buchfink et al., 2015), which was compiled from NCBI nr fasta files by excluding viral taxonomic annotations, to eliminate false positive viral hits. Taxonomic classification of the DIAMOND results was parsed using MEGAN, with the lowest common ancestor (LCA) assignment performed under default settings. To further identify viral sequences, the cleaned reads were de novo assembled using Geneious Prime v2019.0 (Biomatters Ltd.) with default parameters. The resulting contigs and singlet sequences were compared against a viral proteome database using BLASTx with an E-value threshold of <10−5 to classify virus types and filter out non-viral sequences. The viral proteome database was constructed using the NCBI virus reference proteome (available at http://ftp.ncbi.nih.gov/refseq/release/viral/) and viral protein sequences extracted from the NCBI nr FASTA files, categorized based on annotation taxonomy within the Virus Kingdom (Skewes-Cox et al., 2014; Deng et al., 2015).
To identify more distant viral protein similarities, contigs without significant BLASTx hits were further analyzed against viral protein families in the vFam database using HMMER3 (Johnson et al., 2010; Finn et al., 2011). Open reading frames (ORFs) within the viral genomes were predicted by integrating the BLASTx search results with Geneious Prime software. Protein domains were annotated using the NCBI Conserved Domain Search tool with an E-value threshold of <10−5. This comprehensive approach ensured the accurate identification and characterization of viral sequences while minimizing false positives and enhancing the detection of remote viral homologs. Protein sequences were analyzed with cNLS Mapper1 to identify putative cNLSs (Kosugi et al., 2009). cNLS Mapper (last updated on 2012/11/7) was used with uniform threshold settings: monopartite NLS ≥3, bipartite NLS ≥3. Structural models were predicted using the AlphaFold3 Server and the ColabFold v1.5.5: AlphaFold2 using MMseqs2 (Mirdita et al., 2022; Abramson et al., 2024). Conduct domain and motif analysis using ProSite2 and ELM.3
2.4 Viral sequences acquisition
To generate high-quality viral genomes or genomic segments, de novo assembly and reference mapping were conducted in Geneious Prime v2019.0 using the assembled contigs and unassembled reads with known taxonomic assignments obtained from the previous analysis stage. Additionally, Geneious Prime v2019.0 was utilized for genome annotation and ORF prediction, enabling comprehensive characterization of the viral sequences.
2.5 Statistical analysis
Statistical analyses were performed using R v4.2.1 and MEGAN v6.22.2. All reads in the quality-controlled data with sequence lengths greater than 50 bp were aligned to the viral proteome database using BLASTx (method as described above). The BLASTx results were then imported into the MEGAN software to generate rarefaction curves, visualizing differences in viral community composition (Huson et al., 2007). To visualize the viral community structure and richness, the R packages pheatmap and vegan were employed. Differences in viral communities were graphically represented using the ggplot2 package in R. Viral community alpha diversity was analyzed at the family level, with statistical significance assessed using the Kruskal–Wallis test, followed by pairwise Wilcoxon rank-sum tests for group comparisons. A p-value threshold of <0.05 was applied to determine statistical significance.
2.6 Phylogenetic analysis
Phylogenetic analyses were performed using the predicted protein sequences of the viruses identified in this study, along with approximately the top 50 protein sequences retrieved by BLASTx, as well as reference protein sequences from diverse host species or different viral genera obtained from the NCBI GenBank database (Shan et al., 2022). Related protein sequences were aligned using MUSCLE in MEGA version 10.1.8 with default settings (Kumar et al., 2018). Sites containing more than 50% gaps were temporarily removed from alignments. Bayesian inference trees were constructed using MrBayes version 3.2.7 (Ronquist et al., 2012), employing two concurrent runs of Markov chain Monte Carlo (MCMC) sampling. For protein-based phylogenetic analysis, the mixed amino acid model was specified using the command “prset aamodelpr = mixed.” The runs were terminated when the standard deviation of split frequencies fell below 0.01, and the first 25% of the trees were discarded as burn-in. To corroborate the Bayesian inference trees, maximum-likelihood trees were also generated using MEGA software (Kumar et al., 2018). The resulting phylogenetic trees were visualized and annotated using FigTree v1.4.4,4 Adobe Illustrator 2022 v26.0.1, and iTOL v6 (Letunic and Bork, 2021).
3 Results
3.1 Overview of marmot blood virome
The analysis of species richness in all marmot blood sediment samples, based on species-level rarefaction curves and accumulation curves, revealed the diversity characteristics of the viral community. Detailed library statistics are provided in Supplementary Table S1. In most of the 70 libraries, the number of observed virus families stabilized when the viral sequence count reached a specific threshold, indicating that the sequencing depth in the vast majority of the libraries was sufficient to represent the distribution characteristics of virus families in the blood samples of marmots. Furthermore, the sequencing data were deemed reasonable and reliable (Figure 1A). Although there is significant heterogeneity in viral taxonomic composition among different samples, the viral species accumulation curve has approached saturation, suggesting that the viral community in marmots has been adequately sampled with good representativeness (Figure 1B). The accumulation curve further suggests that there are likely over 800 distinct viral species present across the 70 libraries.
Figure 1. Viral diversity across 70 sequencing libraries. (A) Rarefaction curve of viral species generated with MEGAN v6.22.2 (log-transformed y-axis). (B) Species accumulation curve for the marmot blood virome. Error bars represent the data range; the shaded blue band indicates the 95% confidence interval.
Taxonomic analysis identified 18 specific and functionally complete viral genome sequences obtained through de novo assembly and reference alignment, including four known DNA viruses and one RNA virus capable of infecting vertebrates. These viruses were classified as follows: Anelloviridae (n = 14, one of which exhibited a complete circular genome), Microviridae (n = 1), Parvoviridae (n = 1), Polyomaviridae (n = 1), and Flaviviridae (n = 1).
3.2 Viral diversity and comparison in the viral communities
The heatmap (Figure 2) illustrates the relative abundance or distribution of different viral families in two regions (Guoluo and Yushu). The 70 libraries include various viral families, such as Lipothrixviridae, Anelloviridae, Herpesviridae, and Flaviviridae, encompassing both DNA and RNA viruses, and spanning diverse host ranges (e.g., vertebrates, invertebrates, etc.). The distribution of these viral families was compared between the two regions, Guoluo and Yushu. In terms of DNA viruses, Herpesviridae is widely distributed in both Yushu and Guoluo, with higher abundance in Yushu. Polyomaviridae is primarily distributed in Yushu, with lower abundance in Guoluo. Parvoviridae, Circoviridae, and Poxviridae are present in both regions, although their overall abundance is relatively low. Regarding RNA viruses, Flaviviridae exhibits extremely high abundance in certain Yushu samples, while Coronaviridae is predominantly distributed in Yushu, with no detection in Guoluo.
Figure 2. Heatmap of viral family abundance across sequencing pools. Read counts for each viral family are normalized and displayed on a log10 scale. Viral taxonomic classifications (types, families, and groups) are color-coded according to the legend.
In the Yushu region, analysis of viral family relative abundance revealed that Siphoviridae and Myoviridae dominated the community composition, exhibiting the highest proportions (Figure 3A). Host-based classification further confirmed that bacteriophage-associated viruses constituted the predominant fraction of the virome in Yushu, exceeding their relative contribution in Guoluo. In contrast, the viral community in Guoluo was predominantly characterized by Retroviridae, with its abundance significantly surpassing that of Myoviridae. At the species level, the two regions shared 290 viral species, while the proportion of species unique to Yushu (57.4%, 391/681) markedly exceeded that of Guoluo (28.0%, 113/403). Collectively, these patterns indicate that the Yushu virome is characterized by greater taxonomic richness and a stronger bacteriophage dominance than that of Guoluo (Figure 3B).
Figure 3. Taxonomic analysis of viral reads at the family and species levels. (A) Relative abundance of viral families, showing taxonomic classification and proportional distribution. (B) Shared and unique viral species between the two sample groups, visualized by Venn diagram.
To systematically compare the viral community differences, the Shannon index and Simpson index were used to assess alpha diversity and further analyze the differences in viral composition between Yushu and Guoluo. The results showed significant differences in viral communities between the two regions (p < 0.01), as shown in Figures 4A,B. Specifically, Yushu exhibited significantly higher alpha diversity compared to Guoluo. Principal coordinate analysis (PcoA) based on Bray–Curtis (Figure 4C) and Jaccard distances (Figure 4D) revealed statistically significant differences in the viral community composition at the family level between the two regions based on beta diversity (p = 0.01).
Figure 4. Diversity of viral communities between Yushu and Guoluo. (A,B) Comparison of viral alpha diversity, normalized using MEGAN and measured using the Shannon index based on viral abundance at the family level. The p-value was calculated using the Wilcoxon test. Horizontal bars within the boxes represent medians, while the tops and bottoms of the boxes indicate the 75th and 25th percentiles, respectively. (C,D) Principal coordinate analysis (PCoA) based on Bray–Curtis and Jaccard distances, illustrating viral beta diversity normalized using MEGAN at the family level. A p-value <0.05 was considered statistically significant.
3.3 Identification of Anelloviridae
Anelloviridae is a family of single-stranded circular DNA viruses with a genome length of approximately 2.8–3.9 kb. The virus is non-enveloped, and its capsid exhibits an icosahedral symmetrical structure (Biagini, 2009). The genome contains several open reading frames (ORFs), with ORF1 encoding the capsid protein, and ORF2/ORF3 involved in viral replication and regulation.
Anelloviridae viruses are known to infect vertebrates, including humans, other mammals, and birds (Yan et al., 2024). In humans, the virus is primarily transmitted through blood products, vertical transmission from mother to child, and close contact (e.g., saliva, feces) (Kaczorowska and van der Hoek, 2020). Most infections are asymptomatic, but in immunocompromised individuals (such as organ transplant recipients and HIV-infected individuals), increased viral load may be associated with risks such as graft rejection, hepatitis, or cancer, although causality has not been conclusively established (Spandole et al., 2015).
Through sequence assembly and functional annotation of high-throughput sequencing data, this study identified a total of 14 Anelloviridae viral sequences, one of which (Qblood059_12572) exhibited a complete circular genome structure. To investigate their phylogenetic relationships, the ORF1 genes of the 14 sequences were aligned with reference sequences downloaded from the GenBank database and a Bayesian phylogenetic tree was constructed (Figure 5). The results showed that all 14 sequences clustered with known viral sequences isolated from marmot (Marmota spp.) feces or tissues. BLASTx comparison analysis revealed that the amino acid similarity between these sequences and the reference sequences ranged from 69 to 94%, and they were all classified as unclassified Anelloviridae. Notably, some sequences exhibited unique clustering patterns in the phylogenetic tree. Qblood066_12162, Qblood058_16427, Qblood058_7160, Qblood061_6062, and Qblood061_1838 formed an independent branch, Qblood061_4301, Qblood061_1623, and Qblood066_7159 formed another independent branch, and Qblood066_3280 and Qblood058_16065 also showed independent clustering. These branches may represent new genera or species within the Anelloviridae family. Qblood058_12692 formed a separate branch in the phylogenetic tree, suggesting it may belong to a new genus-level taxon.
Figure 5. Phylogenetic analysis of Anelloviridae based on ORF1 protein sequences. The sequence identified in this study is highlighted in pink. Relevant taxonomic and feature annotations are provided in the figure legend.
Based on phylogenetic analysis of the ORF1 sequences, all marmot anelloviruses identified in this study were not assigned to any known genus and are currently designated as unclassified species within the family Anelloviridae. Phylogenetic analysis further indicated that the host range of Anelloviridae is broad, encompassing various vertebrate species. However, as a potential host, the diversity of Anelloviridae viruses within marmots has yet to be fully explored, suggesting that this species could serve as an important model for studies on the evolution and host adaptation of Anelloviridae viruses.
Anelloviridae capsids exhibit broad heterogeneity in amino acid length due to variable projection domains, despite a conserved jelly-roll fold (Butkovic et al., 2023). Intriguingly, while all Anelloviridae capsids encode an N-terminal arginine-rich motif (ARM) that functions as a non-classical nuclear localization signal (NLS), only those with large projection domains have evolved a unique classical nuclear localization signal (cNLS) at their C-terminus (Petersen et al., 2025). “NLSs” refers broadly to all types of nuclear localization signals, while “cNLS” specifically denotes classical nuclear localization signals, which are transported into the nucleus via the Importin α/β1 pathway (Hoad et al., 2026). To further investigate this issue, we structurally predicted the three-dimensional structure of the ORF1 protein encoded by one of the newly identified anelloviruses (Qblood059_12572) and utilized the cNLS Mapper to identify putative cNLSs. The analysis predicted a typical jelly-roll fold in the central region of Qblood059_12572 ORF1 (residues 59–342), accompanied by a small projection domain (Figure 6A). On the other hand, the N- and C-termini of the protein are predicted to be largely unstructured and contain multiple putative NLSs with low scores (Figure 6B), indicating that they are likely non-functional as cNLSs. Therefore, Qblood059_12572 encodes an ORF1 protein with a jelly-roll fold and a very short projection domain, and lacks a C-terminal NLS, consistent with recent reports (Butkovic et al., 2023; Petersen et al., 2025).
Figure 6. Prediction of NLSs in the Anelloviridae ORF1 protein. (A) AlphaFold3 model of ORF1. The top ranked prediction is shown; coloured by pLDDT score of estimated confidence: very high (pLDDT >90) in dark blue, confident (90 >pLDDT >70) in light blue, low (70 >pLDDT >50) in yellow, and very low (pLDDT <50) in orange. (B) Mapping of putative NLSs by cNLS Mapper analysis. Top: Schematic representation of the ORF1 protein sequence (single-letter amino acid code), with predicted NLS motifs highlighted in red. Bottom: Detailed view of each predicted NLS, indicating its position, sequence, and corresponding cNLS Mapper score.
3.4 Phylogenetic analysis of Tick-borne encephalitis virus
The Flaviviridae family comprises single-stranded, positive-sense RNA viruses (+ssRNA) with a genome length of approximately 10–11 kilobases (kb). These viruses are enveloped, and their capsid proteins exhibit icosahedral symmetry (Pierson and Diamond, 2020). Tick-borne encephalitis virus (TBEV), a member of the Orthoflavivirus genus within the Flaviviridae family, is primarily transmitted through the bite of hard ticks, such as Ixodes ricinus and Ixodes persulcatus (Yoshii, 2019). TBEV is predominantly distributed across forested and grassland regions of Europe, Northern Asia, and Eastern Asia, with significant epidemiological prevalence in Russia, Central Europe, and Northern Europe (Ruzek et al., 2019).
In this study, a viral sequence (Qblood061_6473) with a length of 6,550 bp was identified, though its polyprotein-coding region was incomplete. BLASTx alignment analysis revealed a 94.31% amino acid similarity to a reference sequence isolated from human brain tissue (GenBank Accession No. KC414090). Phylogenetic tree analysis demonstrated that Qblood061_6473 clustered with members of the Orthoflavivirus genus, which are known to infect a diverse range of hosts (Figure 7). It is noteworthy that this is the first time a viral sequence belonging to the Orthoflavivirus genus has been identified in the blood of marmots, suggesting that marmots may be potential hosts for TBEV.
Figure 7. Phylogenetic relationship of Tick-borne encephalitis virus. Phylogenetic tree based on RdRp amino acid sequences of the Flaviridae viruses. The pink name indicates the sequence obtained in this study. See legend for relevant labeling.
3.5 Phylogenetic analysis of Parvoviridae
The family Parvoviridae comprises small, non-enveloped viruses with a single-stranded DNA genome, approximately 4–6 kb in length, encapsulated within an icosahedral capsid (Decaro and Buonavoglia, 2012). This viral family exhibits a broad host range, infecting mammals, birds, and insects, and is associated with a variety of diseases (Dai et al., 2022). For instance, Canine parvovirus causes severe enteric disease in dogs (Nandi et al., 2010), Porcine parvovirus is linked to reproductive failure and abortion in swine (Streck and Truyen, 2020), and Human parvovirus B19 is implicated in conditions such as erythema infectiosum (fifth disease) in children and arthritis in adults (Dittmer et al., 2024).
In this study, a novel viral sequence, designated as Qblood060_22586, was identified through high-throughput sequencing and bioinformatics analysis. The viral genome has a total length of 4,599 bp and contains complete replication-associated protein (rep) and capsid protein (cap) genes, indicating a fully intact viral genome structure. BLASTx alignment analysis revealed that the Rep protein exhibits high similarity to viruses within the genus Dependoparvovirus. The highest similarity (88.63%) was observed with a Dependoparvovirus sequence derived from marmot tissue (GenBank Accession No. PP098970), and it clustered with this sequence in phylogenetic analysis (Figure 8). Additionally, the Rep protein showed 57.36% similarity to a Dependoparvovirus sequence from caprine. These results suggest that the virus Qblood060_22586 likely belongs to the genus Dependoparvovirus, with marmots being a potential host.
Figure 8. Phylogenetic relationship of Parvoviridae. Phylogenetic tree based on rep protein sequences. The pink name indicates the sequence obtained in this study. Taxonomy and hosts are annotated with corresponding colors, as indicated in the color legend.
3.6 Phylogenetic analysis of Polyomaviridae
Polyomaviridae is a small family of double-stranded circular DNA viruses, typically characterized by non-enveloped viral particles with a diameter of about 40–50 nanometers. Members of the family Polyomaviridae transcribe distinct early and late precursor mRNAs. The early transcript undergoes alternative splicing to generate the small, middle (when present), and large T antigens, while the late transcript encodes the structural proteins (Wiley et al., 1993). The Polyomaviridae family includes more than a dozen human-infecting species, notably John Cunningham polyomavirus (JCPyV), BK polyomavirus (BKPyV), Merkel cell polyomavirus (MCPyV), and Trichodysplasia spinulosa polyomavirus (TSPyV), which are clearly linked to human disease. Several avian polyomaviruses also exhibit strong pathogenic potential (DeCaprio and Garcea, 2013). Polyomaviridae can cause various diseases, including tumors and immunosuppression (Loutfy et al., 2017). The host range is broad, including humans, monkeys, mice, birds, and others. In this study, a novel polyomavirus, tentatively designated as Qblood079_1103, was identified using high-throughput sequencing combined with comprehensive bioinformatic analysis. The complete viral genome spans 5,188 bp and encodes a full-length capsid protein VP1, confirming its structural integrity as a member of the Polyomaviridae family. Comparative analysis using BLASTx revealed that the VP1 protein shares significant sequence homology with unclassified polyomaviruses, demonstrating the highest identity (86.72%) with a reference polyomavirus strain isolated from the skin tissue of a marmot (GenBank Accession No. BK066788). Phylogenetic reconstruction based on VP1 protein sequences placed the newly identified virus within a monophyletic clade alongside the marmot-derived polyomavirus, indicating a close evolutionary relationship (Figure 9A). Concurrently, we also constructed a phylogenetic tree for the large T antigen (LTA) protein (Figure 9B). The results closely parallel those obtained with the VP1 protein. The LTA is a multifunctional viral protein that orchestrates viral DNA replication and host cell cycle modulation. It contains several conserved domains, including the J domain, HPDKGG motif, LXCXE motif (for RB binding), the origin-binding domain (OBD), and the SF3 helicase domain (Topalis et al., 2013). These activities depend on its nuclear localization. Accordingly, a cNLS was first identified in SV40 LTA between the LXCXE motif and the OBD (Kalderon et al., 1984).
Figure 9. Phylogenetic and structural analysis of a novel polyomavirus. (A) Phylogenetic tree inferred from VP1 protein sequences of Polyomaviridae. The sequence obtained in this study is highlighted in pink. Taxonomy and hosts are annotated with corresponding colors, as indicated in the legend. (B) Phylogenetic tree based on LTA protein sequences. (C) Predicted nuclear localization signal (NLS) positions within the LTA domain. Key functional motifs are annotated. (D) A schematic representation of the novel virus LTA domains, using the SV40 LTA aa positions as a reference.
Intriguingly, comparative analyses show that all mammalian-infecting polyomaviruses encode an LTA bearing an NLS at this conserved position (Cross et al., 2024), whereas non-mammalian polyomaviruses encode NLSs elsewhere (Hoad et al., 2026). Therefore, identification of LTA functional domains and mapping the NLS in the newly identified virus may provide valuable insights into its host origin and evolutionary relationships. We therefore identified the principal functional motifs and domains within the LTA from Qblood079_1103. Our analysis indicates that this protein retains all major LTA functional regions and contains a putative bipartite cNLS located between the LXCXE and OBD motifs, supporting its classification as a mammalian polyomavirus (Figures 9C,D). Additionally, an additional NLS is located downstream of the SF3 helicase domain (Figures 9C,D).
4 Discussion
This study aimed to elucidate the viral diversity in the blood of marmots in the Qinghai–Tibet Plateau region using high-throughput sequencing and bioinformatics analysis, with the goal of assessing their role as potential viral reservoirs and exploring the potential implications of these viruses for wildlife and human health. To this end, we collected 70 marmot blood samples from Yushu and Guoluo Prefectures in Qinghai Province and conducted a comprehensive analysis of the viral communities in these samples using advanced molecular biology and bioinformatics techniques. Our research not only expands the understanding of the marmot virome but also provides critical data for understanding viral transmission dynamics in wildlife and their potential public health risks.
Through high-throughput sequencing and bioinformatics tools, we identified a wide range of viral families, including Anelloviridae, Flaviviridae, Parvoviridae, and Polyomaviridae, among others (Figure 2). Our findings not only enhance the current understanding of viral ecology in marmots but also highlight the potential role of these rodents as reservoirs for zoonotic viruses, which has significant implications for wildlife conservation and public health.
The observed variation in the relative proportions of bacteriophages and vertebrate-associated viruses between Yushu and Guoluo likely reflects differences in local ecological conditions and host availability (Figure 3). The higher dominance of bacteriophages in Yushu suggests a viral community structure more strongly shaped by bacterial hosts, potentially associated with differences in microbial community composition or environmental factors. In contrast, the relatively increased proportion of vertebrate-associated viruses in Guoluo may indicate stronger inputs from vertebrate hosts or host-derived materials, possibly linked to animal density or human–animal interactions. Overall, these findings highlight spatial heterogeneity in viral community structure and emphasize the influence of local host ecology on shaping virome composition.
The identification of Anelloviridae in marmot blood samples is particularly noteworthy. While Anelloviridae viruses are known to infect diverse vertebrates including humans (De Souza et al., 2018), this study provides the first documented evidence of their presence in marmot serum. Phylogenetic analysis revealed that the Anelloviridae sequences identified in this study clustered with known sequences from marmot feces and tissues (Figure 5), suggesting that marmots may serve as natural hosts for these viruses. The high genetic diversity observed among the Anelloviridae sequences further underscores the potential role of marmots in the evolution and host adaptation of these viruses. Future studies should explore the pathogenicity of these viruses in marmots and their potential for cross-species transmission. This study confirms that the ORF1 protein encoded by Qblood059_12572 possesses a typical jelly-roll fold and an extremely short projection domain, while lacking a functional C-terminal nuclear localization signal (Figure 6). These structural characteristics are fully consistent with recent descriptions of anelloviruses with small projection domains (Petersen et al., 2025). It is worth noting that the predicted C-terminal region exhibits low confidence (pLDDT) in the AlphaFold model. Therefore, inferences about the absence of a functional nuclear localization signal in this region based on the current model should be made with caution and require experimental validation.
The detection of Tick-borne encephalitis virus (TBEV) in marmot blood samples is another significant finding. TBEV, a member of the Flaviviridae family, is a highly pathogenic virus that can cause severe neurological disorders in humans (Blom et al., 2018). The identification of a TBEV sequence with high similarity to a human-derived reference sequence suggests that marmots may play a role in the transmission cycle of this virus (Figure 7). This finding aligns with previous studies that have implicated marmots as potential hosts for TBEV, particularly in regions where they coexist with human populations (Dai et al., 2018). Although the TBEV sequences we identified show high similarity to human pathogenic strains, the incomplete genome and lack of functional data mean we cannot assess their actual infectivity and transmission capacity. Therefore, considering marmots as part of the transmission cycle of this virus remains a hypothesis requiring further validation.
In addition to Anelloviridae and Flaviviridae, we also identified novel sequences belonging to the Parvoviridae and Polyomaviridae families. The Parvoviridae sequence, designated as Qblood060_22586, exhibited high similarity to a Dependoparvovirus sequence derived from marmot tissue, suggesting that marmots may be a natural host for this virus. Similarly, the Polyomaviridae sequence, Qblood079_1103, showed close phylogenetic relationships with a marmot-derived polyomavirus (GenBank Accession No. BK066788), further supporting the idea that marmots harbor a diverse array of viruses. These findings contribute to the growing body of evidence that marmots are important reservoirs for a variety of viral pathogens. Meanwhile, we also analyzed the presence of functional domains within the LTA protein (Figure 9), which might help clarify the relationship between the newly identified isolate and other species. For the novel polyomavirus identified in this study, both the predicted nuclear localization function of its LTA protein and its actual host range remain unknown. Future studies should clone the LTA gene of this virus and express it in mammalian cells, using immunofluorescence to directly observe its subcellular localization, thereby providing experimental evidence for its function.
It is critical to note that the all main LTA functional domains could be predicted in the LTA encoded by Qblood079_1103, including a strong putative bipartite cNLS located between the LXCXE motif and the OBD, similarly to all LTA from mammalian-infecting polyomaviruses studied so far (Cross et al., 2024). Intriguingly, a bipartite NLS was also predicted downstream of the SF3 helicase domain, in a similar position to NLSs recently characterized in the LTA from fish infecting polyomaviruses (Hoad et al., 2026). In the present study, the predicted NLS is located downstream of the SF3 helicase domain. This observation aligns with the hypothesis that the position and composition of NLS motifs in large T-like proteins are evolutionarily plastic, capable of shifting among the N-terminal, central, and C-terminal regions.
The comparative analysis of viral communities between Yushu and Guoluo revealed significant differences in viral diversity and abundance. Yushu exhibited a higher diversity of viral families, including Herpesviridae, Polyomaviridae, and Flaviviridae, compared to Guoluo. This regional variation in viral communities may be influenced by ecological factors such as habitat type, climate, and human activity. The higher viral diversity in Yushu could also reflect differences in marmot population density or the presence of additional reservoir species. Further studies are needed to elucidate the factors driving these regional differences and their implications for viral transmission dynamics. It should be noted that the observed regional variations in the virome, while potentially associated with habitat types, may also be influenced by other unmeasured confounding factors, such as host age structure, population density, immune status, and differences in ectoparasite communities. Future studies should systematically collect data on these covariates to more comprehensively elucidate the key drivers underlying the geographical variation of the virome.
One limitation of this study is the reliance on blood samples, which may not capture the full spectrum of viruses present in marmots. Future research should include other sample types, such as feces, urine, and tissue samples, to provide a more comprehensive picture of the marmot virome. It is important to emphasize that the viral sequences detected in this study only demonstrate the presence of viral nucleic acids. Confirming active replication and pathogenicity of these viruses in marmots requires further validation through future studies involving virus isolation, targeted serological surveys (e.g., IgM detection), or experimental infection approaches. Importantly, due to the limited availability of samples, we were unable to perform experimental validation of the identified viral sequences in this study. While these high-quality viral contigs provide valuable clues for future investigations, their biological relevance and authenticity require confirmation through independent experimental approaches. Further studies are needed to determine the potential impact of these viruses on marmot health and their capacity for cross-species transmission.
In conclusion, this study provides valuable insights into the viral diversity of marmots in the Qinghai–Tibet Plateau region. The identification of novel viruses and the detection of known pathogens such as TBEV highlight that marmots may serve as potential reservoirs for zoonotic viruses. These findings have important implications for wildlife conservation and public health, particularly in regions where marmots live in close proximity to human populations. Enhanced surveillance and further research are needed to better understand the role of marmots in viral transmission and to develop effective strategies for mitigating the risk of zoonotic disease outbreaks.
Data availability statement
The viral metagenomic data utilized to corroborate the findings of this study has been submitted and deposited at the National Genomics Data Center. The quality-filtered sequencing data have been deposited in the Sequence Read Archive (SRA) under BioProject accession PRJCA035278 and BioSample accessions SAMC4593464-SAMC4593533. All newly identified genes have been registered with and assigned sequence identifiers (C_AA105711.1-C_AA105727.1) by the National Genomics Data Center. These data are publicly available without any access restrictions.
Ethics statement
The animal study was approved by Qinghai Institute for Endemic Disease Prevention and Control. The study was conducted in accordance with the local legislation and institutional requirements.
Author contributions
HaisW: Writing – original draft, Writing – review & editing. XJ: Writing – original draft, Writing – review & editing. YX: Writing – original draft, Writing – review & editing. SN: Writing – review & editing. HailW: Writing – review & editing. WX: Writing – review & editing. WP: Writing – review & editing. SW: Methodology, Writing – review & editing. WZ: Methodology, Writing – review & editing.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This research was financially supported by National Key Research and Development Programs of China (Nos. 2023YFD1801300 and 2022YFC2603801) and the National Natural Science Foundation of China (No. 82341106) to WZ.
Acknowledgments
The authors thank the Qinghai Institute for Endemic Disease Prevention and Control, Jiangsu University and Wuxi Blood Center.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that Generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2026.1668126/full#supplementary-material
Abbreviations
DPBS, Dulbecco’s phosphate buffered saline; ORF, Open reading frame; TBEV, Tick-borne encephalitis virus; LTA, Large T antigen; cNLS, Classical nuclear localization signal.
Footnotes
References
Abramson, J., Adler, J., Dunger, J., Evans, R., Green, T., Pritzel, A., et al. (2024). Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500. doi: 10.1038/s41586-024-07487-w,
Ben-Ari, T., Neerinckx, S., Agier, L., Cazelles, B., Xu, L., Zhang, Z., et al. (2012). Identification of Chinese plague foci from long-term epidemiological data. Proc. Natl. Acad. Sci. U.S.A. 109, 8196–8201. doi: 10.1073/pnas.1110585109,
Biagini, P. 2009 “Classification of TTV and related viruses (anelloviruses),” TT viruses. Current topics in microbiology and immunology E.-M. VilliersDe and H. Z. Hausen Berlin Springer 21–33
Blom, K., Cuapio, A., Sandberg, J. T., Varnaite, R., Michaëlsson, J., Björkström, N. K., et al. (2018). Cell-mediated immune responses and immunopathogenesis of human Tick-borne encephalitis virus-infection. Front. Immunol. 9:2174. doi: 10.3389/fimmu.2018.02174,
Bogovic, P., and Strle, F. (2015). Tick-borne encephalitis: a review of epidemiology, clinical characteristics, and management. World J. Clin. Cases 3, 430–441. doi: 10.12998/wjcc.v3.i5.430,
Buchfink, B., Xie, C., and Huson, D. H. (2015). Fast and sensitive protein alignment using DIAMOND. Nat. Methods 12, 59–60. doi: 10.1038/nmeth.3176,
Butkovic, A., Kraberger, S., Smeele, Z., Martin, D. P., Schmidlin, K., Fontenele, R. S., et al. (2023). Evolution of anelloviruses from a circovirus-like ancestor through gradual augmentation of the jelly-roll capsid protein. Virus Evol. 9:vead035. doi: 10.1093/ve/vead035,
Cross, E. M., Akbari, N., Ghassabian, H., Hoad, M., Pavan, S., Ariawan, D., et al. (2024). A functional and structural comparative analysis of large tumor antigens reveals evolution of different importin α-dependent nuclear localization signals. Protein Sci. 33:e4876. doi: 10.1002/pro.4876,
Dai, X., Shang, G., Lu, S., Yang, J., and Xu, J. (2018). A new subtype of eastern tick-borne encephalitis virus discovered in Qinghai–Tibet Plateau, China. Emerg. Microbes Infect. 7:74. doi: 10.1038/s41426-018-0081-6,
Dai, Z., Wang, H., Wu, H., Zhang, Q., Ji, L., Wang, X., et al. (2022). Parvovirus dark matter in the cloaca of wild birds. GigaScience 12:giad001. doi: 10.1093/gigascience/giad001,
De Souza, W. M., Fumagalli, M. J., De Araujo, J., Sabino-Santos, G., Maia, F. G. M., Romeiro, M. F., et al. (2018). Discovery of novel anelloviruses in small mammals expands the host range and diversity of the Anelloviridae. Virology 514, 9–17. doi: 10.1016/j.virol.2017.11.001,
DeCaprio, J. A., and Garcea, R. L. (2013). A cornucopia of human polyomaviruses. Nat. Rev. Microbiol. 11, 264–276. doi: 10.1038/nrmicro2992,
Decaro, N., and Buonavoglia, C. (2012). Canine parvovirus—a review of epidemiological and diagnostic aspects, with emphasis on type 2c. Vet. Microbiol. 155, 1–12. doi: 10.1016/j.vetmic.2011.09.007,
Deng, X., Naccache, S. N., Ng, T., Federman, S., Li, L., Chiu, C. Y., et al. (2015). An ensemble strategy that significantly improves de novo assembly of microbial genomes from metagenomic next-generation sequencing data. Nucleic Acids Res. 43, e46–e46. doi: 10.1093/nar/gkv002,
Dittmer, F. P., Guimarães, C. D. M., Peixoto, A. B., Pontes, K. F. M., Bonasoni, M. P., Tonni, G., et al. (2024). Parvovirus B19 infection and pregnancy: review of the current knowledge. J. Pers. Med. 14:139. doi: 10.3390/jpm14020139,
Finn, R. D., Clements, J., and Eddy, S. R. (2011). HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 39, W29–W37. doi: 10.1093/nar/gkr367,
He, Z., Wei, B., Zhang, Y., Liu, J., Xi, J., Ciren, D., et al. (2021). Distribution and characteristics of human plague cases and Yersinia pestis isolates from 4 Marmota plague foci, China, 1950–2019. Emerg. Infect. Dis. 27, 2544–2553. doi: 10.3201/eid2710.202239,
Hilt, E. E., and Ferrieri, P. (2022). Next generation and other sequencing technologies in diagnostic microbiology and infectious diseases. Genes 13:1566. doi: 10.3390/genes13091566,
Hoad, M., Pavan, S., Nematollahzadeh, S., Tietz, O., Reeman, J., Forwood, J. K., et al. (2026). Importin α/β1 dependent nuclear import of black sea bass polyomavirus large tumor antigen is mediated by a classical NLS located downstream of the SF3 helicase domain. Biochim. Biophys. Acta Mol. Cell Res. 1873:120074. doi: 10.1016/j.bbamcr.2025.120074,
Huson, D. H., Auch, A. F., Qi, J., and Schuster, S. C. (2007). MEGAN analysis of metagenomic data. Genome Res. 17, 377–386. doi: 10.1101/gr.5969107,
Islam, M. M., Farag, E., Eltom, K., Hassan, M. M., Bansal, D., Schaffner, F., et al. (2021). Rodent Ectoparasites in the Middle East: a systematic review and meta-analysis. Pathogens 10:139. doi: 10.3390/pathogens10020139,
Johnson, L. S., Eddy, S. R., and Portugaly, E. (2010). Hidden Markov model speed heuristic and iterative HMM search procedure. BMC Bioinformatics 11:431. doi: 10.1186/1471-2105-11-431,
Kaczorowska, J., and van der Hoek, L. (2020). Human anelloviruses: diverse, omnipresent and commensal members of the virome. FEMS Microbiol. Rev. 44, 305–313. doi: 10.1093/femsre/fuaa007,
Kalderon, D., Richardson, W. D., Markham, A. F., and Smith, A. E. (1984). Sequence requirements for nuclear location of simian virus 40 large-T antigen. Nature 311, 33–38. doi: 10.1038/311033a0,
Karimov, K., Kachel, S. M., and Hackländer, K. (2018). Responses of snow leopards, wolves and wild ungulates to livestock grazing in the Zorkul Strictly Protected Area, Tajikistan. PLoS One 13:e0208329. doi: 10.1371/journal.pone.0208329,
Kosugi, S., Hasebe, M., Tomita, M., and Yanagawa, H. (2009). Systematic identification of cell cycle-dependent yeast nucleocytoplasmic shuttling proteins by prediction of composite motifs. Proc. Natl. Acad. Sci. U S A 106, 10171–10176. doi: 10.1073/pnas.0900604106,
Kumar, S., Stecher, G., Li, M., Knyaz, C., and Tamura, K. (2018). MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549. doi: 10.1093/molbev/msy096,
Kwasnik, M., Rola, J., and Rozek, W. (2023). Tick-borne encephalitis—review of the current status. J. Clin. Med. 12:6603. doi: 10.3390/jcm12206603,
Letunic, I., and Bork, P. (2021). Interactive Tree of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296. doi: 10.1093/nar/gkab301,
Lindquist, L., and Vapalahti, O. (2008). Tick-borne encephalitis. Lancet 371, 1861–1871. doi: 10.1016/S0140-6736(08)60800-4
Liu, B., Bai, L., Fu, Y., Zhao, S., Liu, H., Wang, R., et al. (2020). Genetic and molecular features for hepadnavirus and plague infections in the Himalayan marmot. Genome 63, 307–317. doi: 10.1139/gen-2019-0161,
Liu, Z., Mi, B., Jiang, Z., Fei, B., Cai, Z., and Liu, X. (2016). Improved bulk density of bamboo pellets as biomass for energy production. Renew. Energy 86, 1–7. doi: 10.1016/j.renene.2015.08.011
Loutfy, S. A., Moneer, M. M., Salem, S. E., El-Moniem Abada, E. A., El-Moniem Ahmed, E. A., Ibrahim, L. H., et al. (2017). Polyomavirus infections and its clinical relevance in cancer patients: a prospective study. J. Infect. Public Health 10, 22–30. doi: 10.1016/j.jiph.2016.01.008,
Luis, A. D., Hayman, D. T. S., O’Shea, T. J., Cryan, P. M., Gilbert, A. T., Pulliam, J. R. C., et al. (2013). A comparison of bats and rodents as reservoirs of zoonotic viruses: are bats special? Proc. Biol. Sci. 280:20122753. doi: 10.1098/rspb.2012.2753
Lyngdoh, S., Shrotriya, S., Goyal, S. P., Clements, H., Hayward, M. W., and Habib, B. (2014). Prey preferences of the snow leopard (Panthera uncia): regional diet specificity holds global significance for conservation. PLoS One 9:e88349. doi: 10.1371/journal.pone.0088349,
Ma, Y., Liu, P., Li, Z., Yue, Y., Zhao, Y., He, J., et al. (2024). High genetic diversity of the himalayan marmot relative to plague outbreaks in the Qinghai–Tibet Plateau, China. BMC Genomics 25:262. doi: 10.1186/s12864-024-10171-y,
Michalak, T. I. (2020). Diverse virus and host-dependent mechanisms influence the systemic and intrahepatic immune responses in the woodchuck model of hepatitis B. Front. Immunol. 11:853. doi: 10.3389/fimmu.2020.00853,
Millman, I., Southam, L., Halbherr, T., Simmons, H., and Kang, C. M. (1984). ColabFold: making protein folding accessible to all.. Nat. Methods 19, 679–682. doi: 10.1038/s41592-022-01488-1,
Mirdita, M., Schütze, K., Moriwaki, Y., Heo, L., Ovchinnikov, S., and Steinegger, M. (2022). Tick-borne encephalitis virus: a comprehensive review of transmission, pathogenesis, epidemiology, clinical manifestations, diagnosis, and prevention. Microorganisms 11:1634. doi: 10.3390/microorganisms11071634,
Mlera, L., and Bloom, M. E. (2018). The role of mammalian reservoir hosts in tick-borne Flavivirus biology. Front. Cell. Infect. Microbiol. 8:298. doi: 10.3389/fcimb.2018.00298,
Moreno, A., Canziani, S., Lelli, D., Castelli, A., Bianchi, A., Bertoletti, I., et al. (2024). Molecular and serological detection of bovine coronaviruses in marmots (Marmota marmota) in the alpine region. Viruses 16:591. doi: 10.3390/v16040591,
Nandi, S., Chidri, S., Kumar, M., and Chauhan, R. S. (2010). Occurrence of canine parvovirus type 2c in the dogs with haemorrhagic enteritis in India. Res. Vet. Sci. 88, 169–171. doi: 10.1016/j.rvsc.2009.05.018,
Petersen, G. F., Pavan, S., Ariawan, D., Tietz, O., Nematollahzadeh, S., Sarker, S., et al. (2025). Nuclear trafficking of Anelloviridae capsid protein ORF1 reflects modular evolution of subcellular targeting signals. Virus Evol 11:veaf069. doi: 10.1093/ve/veaf069,
Pierson, T. C., and Diamond, M. S. (2020). The continued threat of emerging flaviviruses. Nat. Microbiol. 5, 796–812. doi: 10.1038/s41564-020-0714-0,
Pustijanac, E., Buršić, M., Talapko, J., Škrlec, I., Meštrović, T., and Lišnjić, D. (2023). Tick-borne encephalitis virus: a comprehensive review of transmission, pathogenesis, epidemiology, clinical manifestations, diagnosis, and prevention. Microorganisms 11:1634. doi: 10.3390/microorganisms11071634,
Ronquist, F., Teslenko, M., van der Mark, P., Ayres, D. L., Darling, A., Höhna, S., et al. (2012). MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst. Biol. 61, 539–542. doi: 10.1093/sysbio/sys029,
Ruzek, D., Avšič Županc, T., Borde, J., Chrdle, A., Eyer, L., Karganova, G., et al. (2019). Tick-borne encephalitis in Europe and Russia: review of pathogenesis, clinical features, therapy, and vaccines. Antivir. Res. 164, 23–51. doi: 10.1016/j.antiviral.2019.01.014,
Shan, T., Yang, S., Wang, H., Wang, H., Zhang, J., Gong, G., et al. (2022). Virome in the cloaca of wild and breeding birds revealed a diversity of significant viruses. Microbiome 10:60. doi: 10.1186/s40168-022-01246-7,
Skewes-Cox, P., Sharpton, T. J., Pollard, K. S., and DeRisi, J. L. (2014). Profile hidden Markov models for the detection of viruses within metagenomic sequence data. PLoS One 9:e105067. doi: 10.1371/journal.pone.0105067,
Spandole, S., Cimponeriu, D., Berca, L. M., and Mihăescu, G. (2015). Human anelloviruses: an update of molecular, epidemiological and clinical aspects. Arch. Virol. 160, 893–908. doi: 10.1007/s00705-015-2363-9,
Streck, A. F., and Truyen, U. (2020). Porcine parvovirus. Curr. Issues Mol. Biol. 37, 33–46. doi: 10.21775/cimb.037.033,
Svoboda, P., Dobler, G., Markotić, A., Kurolt, I.-C., Speck, S., Habuš, J., et al. (2014). Survey for hantaviruses, Tick-borne encephalitis virus, and Rickettsia spp. in small rodents in Croatia. Vector Borne Zoonotic Dis. 14, 523–530. doi: 10.1089/vbz.2013.1457,
Topalis, D., Andrei, G., and Snoeck, R. (2013). The large tumor antigen: a “Swiss Army knife” protein possessing the functions required for the polyomavirus life cycle. Antivir. Res. 97, 122–136. doi: 10.1016/j.antiviral.2012.11.007,
Wiley, S. R., Kraus, R. J., Zuo, F., Murray, E. E., Loritz, K., and Mertz, J. E. (1993). SV40 early-to-late switch involves titration of cellular transcriptional repressors. Genes Dev. 7, 2206–2219. doi: 10.1101/gad.7.11.2206,
Xi, J., Duan, R., He, Z., Meng, L., Xu, D., Chen, Y., et al. (2022). First case report of human plague caused by excavation, skinning, and eating of a hibernating marmot (Marmota himalayana). Front. Public Health 10:910872. doi: 10.3389/fpubh.2022.910872,
Yan, T., Wang, Z., Li, R., Zhang, D., Song, Y., and Cheng, Z. (2024). Gyrovirus: current status and challenge. Front. Microbiol. 15:1449814. doi: 10.3389/fmicb.2024.1449814,
Yoshii, K. (2019). Epidemiology and pathological mechanisms of Tick-borne encephalitis. J. Vet. Med. Sci. 81, 343–347. doi: 10.1292/jvms.18-0373,
Zhang, W., Li, L., Deng, X., Blümel, J., Nübling, C. M., Hunfeld, A., et al. (2016). Viral nucleic acids in human plasma pools. Transfusion 56, 2248–2255. doi: 10.1111/trf.13692,
Zhang, W., Li, L., Deng, X., Kapusinszky, B., Pesavento, P. A., and Delwart, E. (2014). Faecal virome of cats in an animal shelter. J. Gen. Virol. 95, 2553–2564. doi: 10.1099/vir.0.069674-0,
Zhao, M., Yue, C., Yang, Z., Li, Y., Zhang, D., Zhang, J., et al. (2022). Viral metagenomics unveiled extensive communications of viruses within giant pandas and their associated organisms in the same ecosystem. Sci. Total Environ. 820:153317. doi: 10.1016/j.scitotenv.2022.153317,
Keywords: blood, Qinghai–Tibet Plateau, viral diversity, wild marmots, zoonotic potential
Citation: Wu H, Jiang X, Xi Y, Ning S, Wu H, Xin W, Peng W, Wang S and Zhang W (2026) Characterization of viral diversity in wild marmot blood from the Qinghai–Tibet Plateau. Front. Microbiol. 17:1668126. doi: 10.3389/fmicb.2026.1668126
Edited by:
Mohammed Rohaim, Cairo University, EgyptReviewed by:
Gualtiero Alvisi, University of Padua, ItalyYi Sun, Beijing Institute of Microbiology and Epidemiology, China
Copyright © 2026 Wu, Jiang, Xi, Ning, Wu, Xin, Peng, Wang and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Shengjun Wang, c2p3anNAdWpzLmVkdS5jbg==; Wen Zhang, ejAyMTZ3ZW5AeWFob28uY29t
†These authors have contributed equally to this work
Haisheng Wu1,2†