Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Microbiol., 06 February 2026

Sec. Virology

Volume 17 - 2026 | https://doi.org/10.3389/fmicb.2026.1668126

Characterization of viral diversity in wild marmot blood from the Qinghai–Tibet Plateau

Haisheng Wu,&#x;Haisheng Wu1,2Xiaojie Jiang&#x;Xiaojie Jiang3Yuan Xi&#x;Yuan Xi3Songyi NingSongyi Ning1Hailian WuHailian Wu2Wenyuan XinWenyuan Xin2Wenxuan PengWenxuan Peng2Shengjun Wang
Shengjun Wang1*Wen Zhang
Wen Zhang1*
  • 1Department of Microbiology, School of Medicine, Jiangsu University, Zhenjiang, Jiangsu, China
  • 2Qinghai Institute for Endemic Disease Prevention and Control, Xining, China
  • 3Department of Clinical Laboratory, Wuxi Blood Center, Wuxi, Jiangsu, China

Introduction: This study aimed to characterize the viral diversity in the blood of marmots in the Qinghai-Tibet Plateau region, to assess their role as potential viral reservoirs and evaluate the potential implications for wildlife and human health.

Methodology: Seventy marmot blood samples were collected from Yushu and Guoluo Prefectures in Qinghai Province. Viral communities were comprehensively analyzed using high-throughput sequencing and bioinformatics techniques.

Results: Analysis identified a wide range of viral families, including Anelloviridae, Flaviviridae, Parvoviridae, and Polyomaviridae, and revealed multiple novel viral sequences. Notably, we documented the first evidence of Anelloviridae in marmot serum; phylogenetic analysis indicated these sequences cluster with those from marmot feces and tissues, suggesting a natural host relationship. A critical finding was the detection of Tick-borne encephalitis virus, with a sequence highly similar to human-derived strains, implying potential involvement of marmots in the transmission cycle. Furthermore, identification of a novel polyomavirus was supported by prediction of all main large tumor antigen functional domains and motifs, including a putative nuclear localization signal between its LXCXE motif and origin-binding domain, typical of mammalian-infecting polyomaviruses. Comparative analysis revealed significant regional differences in viral diversity between sampling areas, potentially linked to local ecological factors.

Discussion: This study significantly expands the known viral diversity in marmots and underscores their role as potential zoonotic reservoirs. However, the functional and pathogenic implications of these viruses require further experimental validation. These findings highlight the importance of ongoing wildlife surveillance for understanding viral ecology and mitigating emerging public health risks.

1 Introduction

Marmots (genus Marmota) are large ground squirrels that are found primarily in temperate regions of North America, Europe and Asia (Arnold, 2019). These rodents perform vital ecosystem roles by improving soil aeration while serving as key prey for top predators including snow leopards (Panthera uncia) and wolves (Canis lupus), thus sustaining trophic dynamics across Central Asia and China’s Qinghai–Tibet Plateau (Lyngdoh et al., 2014; Karimov et al., 2018). Particularly, the Himalayan marmot, as an important rodent species of the Tibetan Plateau and surrounding areas, is not only a major reservoir for bacterial diseases such as plague but also harbors a variety of viral pathogens, including bloodborne viruses such as hepatitis virus (Millman et al., 1984; Ben-Ari et al., 2012; Ma et al., 2024). These viruses may affect their health and pose risks to other wildlife and even humans. Specifically, marmots may serve as reservoirs for several zoonotic viruses, making the monitoring of their viral load and health status crucial for wildlife conservation and public health (Luis et al., 2013).

Marmots are exposed to several bloodborne viruses, which may be transmitted via vector organisms such as ticks, fleas, and other ectoparasites (Islam et al., 2021). Studying these viruses in marmots is important for understanding the potential risks they pose to both wildlife populations and human health, particularly in regions where marmots live in close proximity to human settlements (Xi et al., 2022). Research has shown that marmots can harbor several families of viruses, including flaviviruses (Mlera and Bloom, 2018), Hepatitis B (Liu et al., 2020; Michalak, 2020), and even coronaviruses (Moreno et al., 2024). Understanding these viruses in marmots is key to preventing potential outbreaks and mitigating health risks.

Tick-borne encephalitis virus (TBEV) is a flavivirus transmitted by ticks and has been found in various wild rodents, including marmots (Lindquist and Vapalahti, 2008). The detection of the Himalayan subtype of Tick-borne encephalitis virus in marmots suggests that they may serve as amplifying hosts in the arbovirus transmission chain (Dai et al., 2018). This virus is transmitted through tick bites and is highly pathogenic to the human central nervous system (Bogovic and Strle, 2015). The detection of TBEV in marmots highlights their potential role in maintaining the virus in natural foci. Moreover, the presence of the Himalayan subtype of TBEV in marmots suggests that these animals may act as a bridge for the virus to spread across different regions. This is particularly concerning given the virus’s high pathogenicity in humans, causing severe neurological disorders such as encephalitis and meningitis. Understanding the role of marmots in the blood-mediated transmission of TBEV is crucial for developing effective surveillance and control strategies to mitigate the risk of human infection.

Detecting blood-borne viruses in marmots typically requires a combination of molecular and serological methods. Commonly used detection techniques include serological assays (Svoboda et al., 2014). Additionally, next-generation sequencing (NGS) methods enable comprehensive high-throughput analysis of viral diversity in marmot blood samples, serving as an effective tool for discovering novel viruses or detecting low-abundance viruses (Zhu et al., 2020; Hilt and Ferrieri, 2022). These comprehensive sampling and processing methods aim to enhance our understanding of viral dynamics in wildlife, reduce the risk of viral spillover, and provide critical support for public health and ecological conservation efforts.

Understanding the viral ecology of marmots is crucial for better risk assessment, particularly in regions where marmot habitats overlap with human activities (He et al., 2021). Monitoring blood-borne viruses in marmots helps track viral transmission dynamics, especially in areas where marmots serve as reservoir hosts for tick-borne viruses such as TBEV (Kwasnik et al., 2023; Pustijanac et al., 2023). To address these knowledge gaps, we conducted an analysis of marmot blood samples from Yushu Prefecture and Guoluo Prefecture in Qinghai Province. This study aims to enhance our understanding of viral dynamics in wildlife, reduce the risk of viral spillover, and provide support for public health and ecological conservation efforts.

2 Methods

2.1 Sample collection

We collected blood samples from 50 marmots in Yushu Prefecture and 20 marmots in Guoluo Prefecture, Qinghai Province. These marmots (Marmota himalayana) were captured in their natural habitats using humane live traps. In accordance with ethical guidelines, all marmots were anesthetized, and blood samples were obtained via cardiac puncture. After centrifugation, the supernatant was discarded, and the samples were stored at −80 °C for further analysis. Marmots in this study were captured using ketamine hydrochloride as the anesthetic and were subsequently euthanized. All animal procedures in this study were conducted in accordance with the guidelines of the Ethics Committee of Jiangsu University and were approved (Approval No. JSDX20231007002).

All samples were resuspended in Dulbecco’s phosphate buffered saline (DPBS), homogenized, and subjected to three freeze–thaw cycles. The supernatants were collected after centrifugation at 15,000 × g for 10 min at 4 °C.

2.2 Library construction

Each supernatant (500 μL) was filtered through a 0.45-μm syringe filter (Millipore) to remove large cell-sized particles and subsequently centrifuged at 12,000 × g for 5 min to collect the filtrate enriched in viral particles. To degrade unprotected nucleic acids, the filtrate was incubated with a dual-enzyme cocktail of DNase I and RNase A at 37 °C for 60 min (Zhang et al., 2014, 2016; Zhao et al., 2022). Detailed reagent specifications are provided in Table 1. Total nucleic acids (RNA and DNA) protected within viral capsids were then extracted using either the FastPure Viral DNA/RNA Mini Kit (Vazyme) or the QiAamp Viral RNA Mini Kit, following the manufacturers’ protocols. Reverse transcription was performed on the extracted nucleic acids using SuperScript III or SuperScript IV reverse transcriptase (Invitrogen) with random hexamer primers. For single-stranded DNA (ssDNA) viruses, ssDNA was converted to double-stranded DNA (dsDNA) using the Klenow fragment polymerase (New England BioLabs).

Table 1
www.frontiersin.org

Table 1. Nuclease digestion system.

The resulting nucleic acids were prepared for sequencing using the Illumina Nextera XT DNA Sample Preparation Kit, with dual barcoding applied to distinguish between samples. A total of 70 pools were constructed, depending on the study design, and sequenced on the Illumina NovaSeq 6000 platform, generating 250-bp paired-end reads (Liu et al., 2016).

2.3 Bioinformatics analysis

The 250-bp paired-end reads generated for each pool were debarcoded using Illumina’s vendor software. An in-house analysis pipeline running on a 32-node Linux cluster was used to process the data. Reads were considered duplicates if bases 5 to 55 were identical and only one random copy of duplicates was kept. Clonal reads were removed and low sequencing quality tails were trimmed using Phred quality score 30 as the threshold. Adaptors were trimmed using the default parameters of VecScreen which is NCBI BLASTn with specialized parameters designed for adapter removal. The cleaned reads were then subjected to a DIAMOND BLASTx search against an in-house non-virus non-redundant (NVNR) protein database (Buchfink et al., 2015), which was compiled from NCBI nr fasta files by excluding viral taxonomic annotations, to eliminate false positive viral hits. Taxonomic classification of the DIAMOND results was parsed using MEGAN, with the lowest common ancestor (LCA) assignment performed under default settings. To further identify viral sequences, the cleaned reads were de novo assembled using Geneious Prime v2019.0 (Biomatters Ltd.) with default parameters. The resulting contigs and singlet sequences were compared against a viral proteome database using BLASTx with an E-value threshold of <10−5 to classify virus types and filter out non-viral sequences. The viral proteome database was constructed using the NCBI virus reference proteome (available at http://ftp.ncbi.nih.gov/refseq/release/viral/) and viral protein sequences extracted from the NCBI nr FASTA files, categorized based on annotation taxonomy within the Virus Kingdom (Skewes-Cox et al., 2014; Deng et al., 2015).

To identify more distant viral protein similarities, contigs without significant BLASTx hits were further analyzed against viral protein families in the vFam database using HMMER3 (Johnson et al., 2010; Finn et al., 2011). Open reading frames (ORFs) within the viral genomes were predicted by integrating the BLASTx search results with Geneious Prime software. Protein domains were annotated using the NCBI Conserved Domain Search tool with an E-value threshold of <10−5. This comprehensive approach ensured the accurate identification and characterization of viral sequences while minimizing false positives and enhancing the detection of remote viral homologs. Protein sequences were analyzed with cNLS Mapper1 to identify putative cNLSs (Kosugi et al., 2009). cNLS Mapper (last updated on 2012/11/7) was used with uniform threshold settings: monopartite NLS ≥3, bipartite NLS ≥3. Structural models were predicted using the AlphaFold3 Server and the ColabFold v1.5.5: AlphaFold2 using MMseqs2 (Mirdita et al., 2022; Abramson et al., 2024). Conduct domain and motif analysis using ProSite2 and ELM.3

2.4 Viral sequences acquisition

To generate high-quality viral genomes or genomic segments, de novo assembly and reference mapping were conducted in Geneious Prime v2019.0 using the assembled contigs and unassembled reads with known taxonomic assignments obtained from the previous analysis stage. Additionally, Geneious Prime v2019.0 was utilized for genome annotation and ORF prediction, enabling comprehensive characterization of the viral sequences.

2.5 Statistical analysis

Statistical analyses were performed using R v4.2.1 and MEGAN v6.22.2. All reads in the quality-controlled data with sequence lengths greater than 50 bp were aligned to the viral proteome database using BLASTx (method as described above). The BLASTx results were then imported into the MEGAN software to generate rarefaction curves, visualizing differences in viral community composition (Huson et al., 2007). To visualize the viral community structure and richness, the R packages pheatmap and vegan were employed. Differences in viral communities were graphically represented using the ggplot2 package in R. Viral community alpha diversity was analyzed at the family level, with statistical significance assessed using the Kruskal–Wallis test, followed by pairwise Wilcoxon rank-sum tests for group comparisons. A p-value threshold of <0.05 was applied to determine statistical significance.

2.6 Phylogenetic analysis

Phylogenetic analyses were performed using the predicted protein sequences of the viruses identified in this study, along with approximately the top 50 protein sequences retrieved by BLASTx, as well as reference protein sequences from diverse host species or different viral genera obtained from the NCBI GenBank database (Shan et al., 2022). Related protein sequences were aligned using MUSCLE in MEGA version 10.1.8 with default settings (Kumar et al., 2018). Sites containing more than 50% gaps were temporarily removed from alignments. Bayesian inference trees were constructed using MrBayes version 3.2.7 (Ronquist et al., 2012), employing two concurrent runs of Markov chain Monte Carlo (MCMC) sampling. For protein-based phylogenetic analysis, the mixed amino acid model was specified using the command “prset aamodelpr = mixed.” The runs were terminated when the standard deviation of split frequencies fell below 0.01, and the first 25% of the trees were discarded as burn-in. To corroborate the Bayesian inference trees, maximum-likelihood trees were also generated using MEGA software (Kumar et al., 2018). The resulting phylogenetic trees were visualized and annotated using FigTree v1.4.4,4 Adobe Illustrator 2022 v26.0.1, and iTOL v6 (Letunic and Bork, 2021).

3 Results

3.1 Overview of marmot blood virome

The analysis of species richness in all marmot blood sediment samples, based on species-level rarefaction curves and accumulation curves, revealed the diversity characteristics of the viral community. Detailed library statistics are provided in Supplementary Table S1. In most of the 70 libraries, the number of observed virus families stabilized when the viral sequence count reached a specific threshold, indicating that the sequencing depth in the vast majority of the libraries was sufficient to represent the distribution characteristics of virus families in the blood samples of marmots. Furthermore, the sequencing data were deemed reasonable and reliable (Figure 1A). Although there is significant heterogeneity in viral taxonomic composition among different samples, the viral species accumulation curve has approached saturation, suggesting that the viral community in marmots has been adequately sampled with good representativeness (Figure 1B). The accumulation curve further suggests that there are likely over 800 distinct viral species present across the 70 libraries.

Figure 1
Panel A presents a multi-line species rarefaction curve showing the number of leaves in a taxonomy tree versus the number of reads sampled from leaves, with curves leveling off. Panel B displays a species accumulation curve with increasing numbers of species observed as the number of samples increases, including error bars and a shaded confidence interval.

Figure 1. Viral diversity across 70 sequencing libraries. (A) Rarefaction curve of viral species generated with MEGAN v6.22.2 (log-transformed y-axis). (B) Species accumulation curve for the marmot blood virome. Error bars represent the data range; the shaded blue band indicates the 95% confidence interval.

Taxonomic analysis identified 18 specific and functionally complete viral genome sequences obtained through de novo assembly and reference alignment, including four known DNA viruses and one RNA virus capable of infecting vertebrates. These viruses were classified as follows: Anelloviridae (n = 14, one of which exhibited a complete circular genome), Microviridae (n = 1), Parvoviridae (n = 1), Polyomaviridae (n = 1), and Flaviviridae (n = 1).

3.2 Viral diversity and comparison in the viral communities

The heatmap (Figure 2) illustrates the relative abundance or distribution of different viral families in two regions (Guoluo and Yushu). The 70 libraries include various viral families, such as Lipothrixviridae, Anelloviridae, Herpesviridae, and Flaviviridae, encompassing both DNA and RNA viruses, and spanning diverse host ranges (e.g., vertebrates, invertebrates, etc.). The distribution of these viral families was compared between the two regions, Guoluo and Yushu. In terms of DNA viruses, Herpesviridae is widely distributed in both Yushu and Guoluo, with higher abundance in Yushu. Polyomaviridae is primarily distributed in Yushu, with lower abundance in Guoluo. Parvoviridae, Circoviridae, and Poxviridae are present in both regions, although their overall abundance is relatively low. Regarding RNA viruses, Flaviviridae exhibits extremely high abundance in certain Yushu samples, while Coronaviridae is predominantly distributed in Yushu, with no detection in Guoluo.

Figure 2
Heatmap displaying families along the horizontal axis and multiple individual samples vertically, with cells colored in shades of yellow to purple to indicate the abundance or presence of families per sample. A color scale and group legend appear on the lower right.

Figure 2. Heatmap of viral family abundance across sequencing pools. Read counts for each viral family are normalized and displayed on a log10 scale. Viral taxonomic classifications (types, families, and groups) are color-coded according to the legend.

In the Yushu region, analysis of viral family relative abundance revealed that Siphoviridae and Myoviridae dominated the community composition, exhibiting the highest proportions (Figure 3A). Host-based classification further confirmed that bacteriophage-associated viruses constituted the predominant fraction of the virome in Yushu, exceeding their relative contribution in Guoluo. In contrast, the viral community in Guoluo was predominantly characterized by Retroviridae, with its abundance significantly surpassing that of Myoviridae. At the species level, the two regions shared 290 viral species, while the proportion of species unique to Yushu (57.4%, 391/681) markedly exceeded that of Guoluo (28.0%, 113/403). Collectively, these patterns indicate that the Yushu virome is characterized by greater taxonomic richness and a stronger bacteriophage dominance than that of Guoluo (Figure 3B).

Figure 3
Panel A is a stacked bar chart comparing the relative abundance of viral families between Yushu and Guoluo, with Siphoviridae and Myoviridae most prominent. Panel B contains a Venn diagram and bar chart showing 391 unique to Yushu, 113 unique to Guoluo, and 290 shared, with list sizes of 681 and 403 respectively.

Figure 3. Taxonomic analysis of viral reads at the family and species levels. (A) Relative abundance of viral families, showing taxonomic classification and proportional distribution. (B) Shared and unique viral species between the two sample groups, visualized by Venn diagram.

To systematically compare the viral community differences, the Shannon index and Simpson index were used to assess alpha diversity and further analyze the differences in viral composition between Yushu and Guoluo. The results showed significant differences in viral communities between the two regions (p < 0.01), as shown in Figures 4A,B. Specifically, Yushu exhibited significantly higher alpha diversity compared to Guoluo. Principal coordinate analysis (PcoA) based on Bray–Curtis (Figure 4C) and Jaccard distances (Figure 4D) revealed statistically significant differences in the viral community composition at the family level between the two regions based on beta diversity (p = 0.01).

Figure 4
Four-panel scientific figure compares two groups, Guoluo (red) and Yushu (teal). Panel A shows a boxplot of Simpson index by group, with higher values and significance for Yushu. Panel B presents Shannon index boxplots, again higher in Yushu with significant difference. Panel C displays a PCoA scatter plot with confidence ellipses, showing group separation on axes labeled PCoA 1 and PCoA 2, percent variance explained and statistical results noted. Panel D offers another PCoA scatter plot with similar group separation and statistical annotation.

Figure 4. Diversity of viral communities between Yushu and Guoluo. (A,B) Comparison of viral alpha diversity, normalized using MEGAN and measured using the Shannon index based on viral abundance at the family level. The p-value was calculated using the Wilcoxon test. Horizontal bars within the boxes represent medians, while the tops and bottoms of the boxes indicate the 75th and 25th percentiles, respectively. (C,D) Principal coordinate analysis (PCoA) based on Bray–Curtis and Jaccard distances, illustrating viral beta diversity normalized using MEGAN at the family level. A p-value <0.05 was considered statistically significant.

3.3 Identification of Anelloviridae

Anelloviridae is a family of single-stranded circular DNA viruses with a genome length of approximately 2.8–3.9 kb. The virus is non-enveloped, and its capsid exhibits an icosahedral symmetrical structure (Biagini, 2009). The genome contains several open reading frames (ORFs), with ORF1 encoding the capsid protein, and ORF2/ORF3 involved in viral replication and regulation.

Anelloviridae viruses are known to infect vertebrates, including humans, other mammals, and birds (Yan et al., 2024). In humans, the virus is primarily transmitted through blood products, vertical transmission from mother to child, and close contact (e.g., saliva, feces) (Kaczorowska and van der Hoek, 2020). Most infections are asymptomatic, but in immunocompromised individuals (such as organ transplant recipients and HIV-infected individuals), increased viral load may be associated with risks such as graft rejection, hepatitis, or cancer, although causality has not been conclusively established (Spandole et al., 2015).

Through sequence assembly and functional annotation of high-throughput sequencing data, this study identified a total of 14 Anelloviridae viral sequences, one of which (Qblood059_12572) exhibited a complete circular genome structure. To investigate their phylogenetic relationships, the ORF1 genes of the 14 sequences were aligned with reference sequences downloaded from the GenBank database and a Bayesian phylogenetic tree was constructed (Figure 5). The results showed that all 14 sequences clustered with known viral sequences isolated from marmot (Marmota spp.) feces or tissues. BLASTx comparison analysis revealed that the amino acid similarity between these sequences and the reference sequences ranged from 69 to 94%, and they were all classified as unclassified Anelloviridae. Notably, some sequences exhibited unique clustering patterns in the phylogenetic tree. Qblood066_12162, Qblood058_16427, Qblood058_7160, Qblood061_6062, and Qblood061_1838 formed an independent branch, Qblood061_4301, Qblood061_1623, and Qblood066_7159 formed another independent branch, and Qblood066_3280 and Qblood058_16065 also showed independent clustering. These branches may represent new genera or species within the Anelloviridae family. Qblood058_12692 formed a separate branch in the phylogenetic tree, suggesting it may belong to a new genus-level taxon.

Figure 5
Circular phylogenetic tree diagram displaying the relationships among Anelloviridae and Gyrovirus taxa, highlighted by purple and yellow shading. Colored dots indicate hosts: blue for Aves, purple for Marmot, yellow for Lagomorpha, and red for Rodentia. Bootstrap probability is represented by variable circle sizes at branch nodes. Taxonomy and host legends are included for reference.

Figure 5. Phylogenetic analysis of Anelloviridae based on ORF1 protein sequences. The sequence identified in this study is highlighted in pink. Relevant taxonomic and feature annotations are provided in the figure legend.

Based on phylogenetic analysis of the ORF1 sequences, all marmot anelloviruses identified in this study were not assigned to any known genus and are currently designated as unclassified species within the family Anelloviridae. Phylogenetic analysis further indicated that the host range of Anelloviridae is broad, encompassing various vertebrate species. However, as a potential host, the diversity of Anelloviridae viruses within marmots has yet to be fully explored, suggesting that this species could serve as an important model for studies on the evolution and host adaptation of Anelloviridae viruses.

Anelloviridae capsids exhibit broad heterogeneity in amino acid length due to variable projection domains, despite a conserved jelly-roll fold (Butkovic et al., 2023). Intriguingly, while all Anelloviridae capsids encode an N-terminal arginine-rich motif (ARM) that functions as a non-classical nuclear localization signal (NLS), only those with large projection domains have evolved a unique classical nuclear localization signal (cNLS) at their C-terminus (Petersen et al., 2025). “NLSs” refers broadly to all types of nuclear localization signals, while “cNLS” specifically denotes classical nuclear localization signals, which are transported into the nucleus via the Importin α/β1 pathway (Hoad et al., 2026). To further investigate this issue, we structurally predicted the three-dimensional structure of the ORF1 protein encoded by one of the newly identified anelloviruses (Qblood059_12572) and utilized the cNLS Mapper to identify putative cNLSs. The analysis predicted a typical jelly-roll fold in the central region of Qblood059_12572 ORF1 (residues 59–342), accompanied by a small projection domain (Figure 6A). On the other hand, the N- and C-termini of the protein are predicted to be largely unstructured and contain multiple putative NLSs with low scores (Figure 6B), indicating that they are likely non-functional as cNLSs. Therefore, Qblood059_12572 encodes an ORF1 protein with a jelly-roll fold and a very short projection domain, and lacks a C-terminal NLS, consistent with recent reports (Butkovic et al., 2023; Petersen et al., 2025).

Figure 6
Panel A shows a ribbon diagram of a protein structure with color gradients indicating confidence levels from very high (dark blue) to very low (orange) based on pLDDT scores, with N- and C-terminal nuclear localization signals (NLSn and NLSc) labeled. Panel B displays results from the cNLS Mapper, listing predicted nuclear localization signals in a protein sequence with identified monopartite and bipartite NLS motifs, their positions, and scores in table format.

Figure 6. Prediction of NLSs in the Anelloviridae ORF1 protein. (A) AlphaFold3 model of ORF1. The top ranked prediction is shown; coloured by pLDDT score of estimated confidence: very high (pLDDT >90) in dark blue, confident (90 >pLDDT >70) in light blue, low (70 >pLDDT >50) in yellow, and very low (pLDDT <50) in orange. (B) Mapping of putative NLSs by cNLS Mapper analysis. Top: Schematic representation of the ORF1 protein sequence (single-letter amino acid code), with predicted NLS motifs highlighted in red. Bottom: Detailed view of each predicted NLS, indicating its position, sequence, and corresponding cNLS Mapper score.

3.4 Phylogenetic analysis of Tick-borne encephalitis virus

The Flaviviridae family comprises single-stranded, positive-sense RNA viruses (+ssRNA) with a genome length of approximately 10–11 kilobases (kb). These viruses are enveloped, and their capsid proteins exhibit icosahedral symmetry (Pierson and Diamond, 2020). Tick-borne encephalitis virus (TBEV), a member of the Orthoflavivirus genus within the Flaviviridae family, is primarily transmitted through the bite of hard ticks, such as Ixodes ricinus and Ixodes persulcatus (Yoshii, 2019). TBEV is predominantly distributed across forested and grassland regions of Europe, Northern Asia, and Eastern Asia, with significant epidemiological prevalence in Russia, Central Europe, and Northern Europe (Ruzek et al., 2019).

In this study, a viral sequence (Qblood061_6473) with a length of 6,550 bp was identified, though its polyprotein-coding region was incomplete. BLASTx alignment analysis revealed a 94.31% amino acid similarity to a reference sequence isolated from human brain tissue (GenBank Accession No. KC414090). Phylogenetic tree analysis demonstrated that Qblood061_6473 clustered with members of the Orthoflavivirus genus, which are known to infect a diverse range of hosts (Figure 7). It is noteworthy that this is the first time a viral sequence belonging to the Orthoflavivirus genus has been identified in the blood of marmots, suggesting that marmots may be potential hosts for TBEV.

Figure 7
Circular phylogenetic tree visualization displays viral taxa with colored host classifications, including Ixodidae, Tick, Homo sapiens, Rodentia, Canine, and Passeriformes. Arcs indicate Orthoflavivirus encephalitidis and Hepacivirus hominis taxonomy. Node probabilities are shown by varying pink dot sizes, with a probability scale from seventy to one hundred percent.

Figure 7. Phylogenetic relationship of Tick-borne encephalitis virus. Phylogenetic tree based on RdRp amino acid sequences of the Flaviridae viruses. The pink name indicates the sequence obtained in this study. See legend for relevant labeling.

3.5 Phylogenetic analysis of Parvoviridae

The family Parvoviridae comprises small, non-enveloped viruses with a single-stranded DNA genome, approximately 4–6 kb in length, encapsulated within an icosahedral capsid (Decaro and Buonavoglia, 2012). This viral family exhibits a broad host range, infecting mammals, birds, and insects, and is associated with a variety of diseases (Dai et al., 2022). For instance, Canine parvovirus causes severe enteric disease in dogs (Nandi et al., 2010), Porcine parvovirus is linked to reproductive failure and abortion in swine (Streck and Truyen, 2020), and Human parvovirus B19 is implicated in conditions such as erythema infectiosum (fifth disease) in children and arthritis in adults (Dittmer et al., 2024).

In this study, a novel viral sequence, designated as Qblood060_22586, was identified through high-throughput sequencing and bioinformatics analysis. The viral genome has a total length of 4,599 bp and contains complete replication-associated protein (rep) and capsid protein (cap) genes, indicating a fully intact viral genome structure. BLASTx alignment analysis revealed that the Rep protein exhibits high similarity to viruses within the genus Dependoparvovirus. The highest similarity (88.63%) was observed with a Dependoparvovirus sequence derived from marmot tissue (GenBank Accession No. PP098970), and it clustered with this sequence in phylogenetic analysis (Figure 8). Additionally, the Rep protein showed 57.36% similarity to a Dependoparvovirus sequence from caprine. These results suggest that the virus Qblood060_22586 likely belongs to the genus Dependoparvovirus, with marmots being a potential host.

Figure 8
Circular phylogenetic tree chart comparing Miniambidensovirus (yellow) and Dependoparvovirus (purple) by host species, including Homo sapiens (blue), rodent (purple), bat (yellow), and marmot (red), with annotated probability values.

Figure 8. Phylogenetic relationship of Parvoviridae. Phylogenetic tree based on rep protein sequences. The pink name indicates the sequence obtained in this study. Taxonomy and hosts are annotated with corresponding colors, as indicated in the color legend.

3.6 Phylogenetic analysis of Polyomaviridae

Polyomaviridae is a small family of double-stranded circular DNA viruses, typically characterized by non-enveloped viral particles with a diameter of about 40–50 nanometers. Members of the family Polyomaviridae transcribe distinct early and late precursor mRNAs. The early transcript undergoes alternative splicing to generate the small, middle (when present), and large T antigens, while the late transcript encodes the structural proteins (Wiley et al., 1993). The Polyomaviridae family includes more than a dozen human-infecting species, notably John Cunningham polyomavirus (JCPyV), BK polyomavirus (BKPyV), Merkel cell polyomavirus (MCPyV), and Trichodysplasia spinulosa polyomavirus (TSPyV), which are clearly linked to human disease. Several avian polyomaviruses also exhibit strong pathogenic potential (DeCaprio and Garcea, 2013). Polyomaviridae can cause various diseases, including tumors and immunosuppression (Loutfy et al., 2017). The host range is broad, including humans, monkeys, mice, birds, and others. In this study, a novel polyomavirus, tentatively designated as Qblood079_1103, was identified using high-throughput sequencing combined with comprehensive bioinformatic analysis. The complete viral genome spans 5,188 bp and encodes a full-length capsid protein VP1, confirming its structural integrity as a member of the Polyomaviridae family. Comparative analysis using BLASTx revealed that the VP1 protein shares significant sequence homology with unclassified polyomaviruses, demonstrating the highest identity (86.72%) with a reference polyomavirus strain isolated from the skin tissue of a marmot (GenBank Accession No. BK066788). Phylogenetic reconstruction based on VP1 protein sequences placed the newly identified virus within a monophyletic clade alongside the marmot-derived polyomavirus, indicating a close evolutionary relationship (Figure 9A). Concurrently, we also constructed a phylogenetic tree for the large T antigen (LTA) protein (Figure 9B). The results closely parallel those obtained with the VP1 protein. The LTA is a multifunctional viral protein that orchestrates viral DNA replication and host cell cycle modulation. It contains several conserved domains, including the J domain, HPDKGG motif, LXCXE motif (for RB binding), the origin-binding domain (OBD), and the SF3 helicase domain (Topalis et al., 2013). These activities depend on its nuclear localization. Accordingly, a cNLS was first identified in SV40 LTA between the LXCXE motif and the OBD (Kalderon et al., 1984).

Figure 9
Panel A shows a circular phylogenetic tree comparing Alphapolyomavirus and unclassified Polyomaviridae, with branches colored by host species and node supports indicated by red circles. Panel B displays a similar phylogenetic tree with slight differences in sequence labels and branch clustering. Panel C presents the predicted nuclear localization signals (NLS) within a protein sequence, highlighting identified monopartite and bipartite NLS motifs and their scores. Panel D illustrates a schematic protein domain map with labeled functional regions, including JX2C, NLS, OBD, zf_LTAG_D1, SF3 helicase, and a second NLS.

Figure 9. Phylogenetic and structural analysis of a novel polyomavirus. (A) Phylogenetic tree inferred from VP1 protein sequences of Polyomaviridae. The sequence obtained in this study is highlighted in pink. Taxonomy and hosts are annotated with corresponding colors, as indicated in the legend. (B) Phylogenetic tree based on LTA protein sequences. (C) Predicted nuclear localization signal (NLS) positions within the LTA domain. Key functional motifs are annotated. (D) A schematic representation of the novel virus LTA domains, using the SV40 LTA aa positions as a reference.

Intriguingly, comparative analyses show that all mammalian-infecting polyomaviruses encode an LTA bearing an NLS at this conserved position (Cross et al., 2024), whereas non-mammalian polyomaviruses encode NLSs elsewhere (Hoad et al., 2026). Therefore, identification of LTA functional domains and mapping the NLS in the newly identified virus may provide valuable insights into its host origin and evolutionary relationships. We therefore identified the principal functional motifs and domains within the LTA from Qblood079_1103. Our analysis indicates that this protein retains all major LTA functional regions and contains a putative bipartite cNLS located between the LXCXE and OBD motifs, supporting its classification as a mammalian polyomavirus (Figures 9C,D). Additionally, an additional NLS is located downstream of the SF3 helicase domain (Figures 9C,D).

4 Discussion

This study aimed to elucidate the viral diversity in the blood of marmots in the Qinghai–Tibet Plateau region using high-throughput sequencing and bioinformatics analysis, with the goal of assessing their role as potential viral reservoirs and exploring the potential implications of these viruses for wildlife and human health. To this end, we collected 70 marmot blood samples from Yushu and Guoluo Prefectures in Qinghai Province and conducted a comprehensive analysis of the viral communities in these samples using advanced molecular biology and bioinformatics techniques. Our research not only expands the understanding of the marmot virome but also provides critical data for understanding viral transmission dynamics in wildlife and their potential public health risks.

Through high-throughput sequencing and bioinformatics tools, we identified a wide range of viral families, including Anelloviridae, Flaviviridae, Parvoviridae, and Polyomaviridae, among others (Figure 2). Our findings not only enhance the current understanding of viral ecology in marmots but also highlight the potential role of these rodents as reservoirs for zoonotic viruses, which has significant implications for wildlife conservation and public health.

The observed variation in the relative proportions of bacteriophages and vertebrate-associated viruses between Yushu and Guoluo likely reflects differences in local ecological conditions and host availability (Figure 3). The higher dominance of bacteriophages in Yushu suggests a viral community structure more strongly shaped by bacterial hosts, potentially associated with differences in microbial community composition or environmental factors. In contrast, the relatively increased proportion of vertebrate-associated viruses in Guoluo may indicate stronger inputs from vertebrate hosts or host-derived materials, possibly linked to animal density or human–animal interactions. Overall, these findings highlight spatial heterogeneity in viral community structure and emphasize the influence of local host ecology on shaping virome composition.

The identification of Anelloviridae in marmot blood samples is particularly noteworthy. While Anelloviridae viruses are known to infect diverse vertebrates including humans (De Souza et al., 2018), this study provides the first documented evidence of their presence in marmot serum. Phylogenetic analysis revealed that the Anelloviridae sequences identified in this study clustered with known sequences from marmot feces and tissues (Figure 5), suggesting that marmots may serve as natural hosts for these viruses. The high genetic diversity observed among the Anelloviridae sequences further underscores the potential role of marmots in the evolution and host adaptation of these viruses. Future studies should explore the pathogenicity of these viruses in marmots and their potential for cross-species transmission. This study confirms that the ORF1 protein encoded by Qblood059_12572 possesses a typical jelly-roll fold and an extremely short projection domain, while lacking a functional C-terminal nuclear localization signal (Figure 6). These structural characteristics are fully consistent with recent descriptions of anelloviruses with small projection domains (Petersen et al., 2025). It is worth noting that the predicted C-terminal region exhibits low confidence (pLDDT) in the AlphaFold model. Therefore, inferences about the absence of a functional nuclear localization signal in this region based on the current model should be made with caution and require experimental validation.

The detection of Tick-borne encephalitis virus (TBEV) in marmot blood samples is another significant finding. TBEV, a member of the Flaviviridae family, is a highly pathogenic virus that can cause severe neurological disorders in humans (Blom et al., 2018). The identification of a TBEV sequence with high similarity to a human-derived reference sequence suggests that marmots may play a role in the transmission cycle of this virus (Figure 7). This finding aligns with previous studies that have implicated marmots as potential hosts for TBEV, particularly in regions where they coexist with human populations (Dai et al., 2018). Although the TBEV sequences we identified show high similarity to human pathogenic strains, the incomplete genome and lack of functional data mean we cannot assess their actual infectivity and transmission capacity. Therefore, considering marmots as part of the transmission cycle of this virus remains a hypothesis requiring further validation.

In addition to Anelloviridae and Flaviviridae, we also identified novel sequences belonging to the Parvoviridae and Polyomaviridae families. The Parvoviridae sequence, designated as Qblood060_22586, exhibited high similarity to a Dependoparvovirus sequence derived from marmot tissue, suggesting that marmots may be a natural host for this virus. Similarly, the Polyomaviridae sequence, Qblood079_1103, showed close phylogenetic relationships with a marmot-derived polyomavirus (GenBank Accession No. BK066788), further supporting the idea that marmots harbor a diverse array of viruses. These findings contribute to the growing body of evidence that marmots are important reservoirs for a variety of viral pathogens. Meanwhile, we also analyzed the presence of functional domains within the LTA protein (Figure 9), which might help clarify the relationship between the newly identified isolate and other species. For the novel polyomavirus identified in this study, both the predicted nuclear localization function of its LTA protein and its actual host range remain unknown. Future studies should clone the LTA gene of this virus and express it in mammalian cells, using immunofluorescence to directly observe its subcellular localization, thereby providing experimental evidence for its function.

It is critical to note that the all main LTA functional domains could be predicted in the LTA encoded by Qblood079_1103, including a strong putative bipartite cNLS located between the LXCXE motif and the OBD, similarly to all LTA from mammalian-infecting polyomaviruses studied so far (Cross et al., 2024). Intriguingly, a bipartite NLS was also predicted downstream of the SF3 helicase domain, in a similar position to NLSs recently characterized in the LTA from fish infecting polyomaviruses (Hoad et al., 2026). In the present study, the predicted NLS is located downstream of the SF3 helicase domain. This observation aligns with the hypothesis that the position and composition of NLS motifs in large T-like proteins are evolutionarily plastic, capable of shifting among the N-terminal, central, and C-terminal regions.

The comparative analysis of viral communities between Yushu and Guoluo revealed significant differences in viral diversity and abundance. Yushu exhibited a higher diversity of viral families, including Herpesviridae, Polyomaviridae, and Flaviviridae, compared to Guoluo. This regional variation in viral communities may be influenced by ecological factors such as habitat type, climate, and human activity. The higher viral diversity in Yushu could also reflect differences in marmot population density or the presence of additional reservoir species. Further studies are needed to elucidate the factors driving these regional differences and their implications for viral transmission dynamics. It should be noted that the observed regional variations in the virome, while potentially associated with habitat types, may also be influenced by other unmeasured confounding factors, such as host age structure, population density, immune status, and differences in ectoparasite communities. Future studies should systematically collect data on these covariates to more comprehensively elucidate the key drivers underlying the geographical variation of the virome.

One limitation of this study is the reliance on blood samples, which may not capture the full spectrum of viruses present in marmots. Future research should include other sample types, such as feces, urine, and tissue samples, to provide a more comprehensive picture of the marmot virome. It is important to emphasize that the viral sequences detected in this study only demonstrate the presence of viral nucleic acids. Confirming active replication and pathogenicity of these viruses in marmots requires further validation through future studies involving virus isolation, targeted serological surveys (e.g., IgM detection), or experimental infection approaches. Importantly, due to the limited availability of samples, we were unable to perform experimental validation of the identified viral sequences in this study. While these high-quality viral contigs provide valuable clues for future investigations, their biological relevance and authenticity require confirmation through independent experimental approaches. Further studies are needed to determine the potential impact of these viruses on marmot health and their capacity for cross-species transmission.

In conclusion, this study provides valuable insights into the viral diversity of marmots in the Qinghai–Tibet Plateau region. The identification of novel viruses and the detection of known pathogens such as TBEV highlight that marmots may serve as potential reservoirs for zoonotic viruses. These findings have important implications for wildlife conservation and public health, particularly in regions where marmots live in close proximity to human populations. Enhanced surveillance and further research are needed to better understand the role of marmots in viral transmission and to develop effective strategies for mitigating the risk of zoonotic disease outbreaks.

Data availability statement

The viral metagenomic data utilized to corroborate the findings of this study has been submitted and deposited at the National Genomics Data Center. The quality-filtered sequencing data have been deposited in the Sequence Read Archive (SRA) under BioProject accession PRJCA035278 and BioSample accessions SAMC4593464-SAMC4593533. All newly identified genes have been registered with and assigned sequence identifiers (C_AA105711.1-C_AA105727.1) by the National Genomics Data Center. These data are publicly available without any access restrictions.

Ethics statement

The animal study was approved by Qinghai Institute for Endemic Disease Prevention and Control. The study was conducted in accordance with the local legislation and institutional requirements.

Author contributions

HaisW: Writing – original draft, Writing – review & editing. XJ: Writing – original draft, Writing – review & editing. YX: Writing – original draft, Writing – review & editing. SN: Writing – review & editing. HailW: Writing – review & editing. WX: Writing – review & editing. WP: Writing – review & editing. SW: Methodology, Writing – review & editing. WZ: Methodology, Writing – review & editing.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This research was financially supported by National Key Research and Development Programs of China (Nos. 2023YFD1801300 and 2022YFC2603801) and the National Natural Science Foundation of China (No. 82341106) to WZ.

Acknowledgments

The authors thank the Qinghai Institute for Endemic Disease Prevention and Control, Jiangsu University and Wuxi Blood Center.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that Generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2026.1668126/full#supplementary-material

Abbreviations

DPBS, Dulbecco’s phosphate buffered saline; ORF, Open reading frame; TBEV, Tick-borne encephalitis virus; LTA, Large T antigen; cNLS, Classical nuclear localization signal.

Footnotes

References

Abramson, J., Adler, J., Dunger, J., Evans, R., Green, T., Pritzel, A., et al. (2024). Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500. doi: 10.1038/s41586-024-07487-w,

PubMed Abstract | Crossref Full Text | Google Scholar

Arnold, W. (2019). Marmots. Curr. Biol. 29, R779–R780. doi: 10.1016/j.cub.2019.06.008,

PubMed Abstract | Crossref Full Text | Google Scholar

Ben-Ari, T., Neerinckx, S., Agier, L., Cazelles, B., Xu, L., Zhang, Z., et al. (2012). Identification of Chinese plague foci from long-term epidemiological data. Proc. Natl. Acad. Sci. U.S.A. 109, 8196–8201. doi: 10.1073/pnas.1110585109,

PubMed Abstract | Crossref Full Text | Google Scholar

Biagini, P. 2009 “Classification of TTV and related viruses (anelloviruses),” TT viruses. Current topics in microbiology and immunology E.-M. VilliersDe and H. Z. Hausen Berlin Springer 21–33

Google Scholar

Blom, K., Cuapio, A., Sandberg, J. T., Varnaite, R., Michaëlsson, J., Björkström, N. K., et al. (2018). Cell-mediated immune responses and immunopathogenesis of human Tick-borne encephalitis virus-infection. Front. Immunol. 9:2174. doi: 10.3389/fimmu.2018.02174,

PubMed Abstract | Crossref Full Text | Google Scholar

Bogovic, P., and Strle, F. (2015). Tick-borne encephalitis: a review of epidemiology, clinical characteristics, and management. World J. Clin. Cases 3, 430–441. doi: 10.12998/wjcc.v3.i5.430,

PubMed Abstract | Crossref Full Text | Google Scholar

Buchfink, B., Xie, C., and Huson, D. H. (2015). Fast and sensitive protein alignment using DIAMOND. Nat. Methods 12, 59–60. doi: 10.1038/nmeth.3176,

PubMed Abstract | Crossref Full Text | Google Scholar

Butkovic, A., Kraberger, S., Smeele, Z., Martin, D. P., Schmidlin, K., Fontenele, R. S., et al. (2023). Evolution of anelloviruses from a circovirus-like ancestor through gradual augmentation of the jelly-roll capsid protein. Virus Evol. 9:vead035. doi: 10.1093/ve/vead035,

PubMed Abstract | Crossref Full Text | Google Scholar

Cross, E. M., Akbari, N., Ghassabian, H., Hoad, M., Pavan, S., Ariawan, D., et al. (2024). A functional and structural comparative analysis of large tumor antigens reveals evolution of different importin α-dependent nuclear localization signals. Protein Sci. 33:e4876. doi: 10.1002/pro.4876,

PubMed Abstract | Crossref Full Text | Google Scholar

Dai, X., Shang, G., Lu, S., Yang, J., and Xu, J. (2018). A new subtype of eastern tick-borne encephalitis virus discovered in Qinghai–Tibet Plateau, China. Emerg. Microbes Infect. 7:74. doi: 10.1038/s41426-018-0081-6,

PubMed Abstract | Crossref Full Text | Google Scholar

Dai, Z., Wang, H., Wu, H., Zhang, Q., Ji, L., Wang, X., et al. (2022). Parvovirus dark matter in the cloaca of wild birds. GigaScience 12:giad001. doi: 10.1093/gigascience/giad001,

PubMed Abstract | Crossref Full Text | Google Scholar

De Souza, W. M., Fumagalli, M. J., De Araujo, J., Sabino-Santos, G., Maia, F. G. M., Romeiro, M. F., et al. (2018). Discovery of novel anelloviruses in small mammals expands the host range and diversity of the Anelloviridae. Virology 514, 9–17. doi: 10.1016/j.virol.2017.11.001,

PubMed Abstract | Crossref Full Text | Google Scholar

DeCaprio, J. A., and Garcea, R. L. (2013). A cornucopia of human polyomaviruses. Nat. Rev. Microbiol. 11, 264–276. doi: 10.1038/nrmicro2992,

PubMed Abstract | Crossref Full Text | Google Scholar

Decaro, N., and Buonavoglia, C. (2012). Canine parvovirus—a review of epidemiological and diagnostic aspects, with emphasis on type 2c. Vet. Microbiol. 155, 1–12. doi: 10.1016/j.vetmic.2011.09.007,

PubMed Abstract | Crossref Full Text | Google Scholar

Deng, X., Naccache, S. N., Ng, T., Federman, S., Li, L., Chiu, C. Y., et al. (2015). An ensemble strategy that significantly improves de novo assembly of microbial genomes from metagenomic next-generation sequencing data. Nucleic Acids Res. 43, e46–e46. doi: 10.1093/nar/gkv002,

PubMed Abstract | Crossref Full Text | Google Scholar

Dittmer, F. P., Guimarães, C. D. M., Peixoto, A. B., Pontes, K. F. M., Bonasoni, M. P., Tonni, G., et al. (2024). Parvovirus B19 infection and pregnancy: review of the current knowledge. J. Pers. Med. 14:139. doi: 10.3390/jpm14020139,

PubMed Abstract | Crossref Full Text | Google Scholar

Finn, R. D., Clements, J., and Eddy, S. R. (2011). HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 39, W29–W37. doi: 10.1093/nar/gkr367,

PubMed Abstract | Crossref Full Text | Google Scholar

He, Z., Wei, B., Zhang, Y., Liu, J., Xi, J., Ciren, D., et al. (2021). Distribution and characteristics of human plague cases and Yersinia pestis isolates from 4 Marmota plague foci, China, 1950–2019. Emerg. Infect. Dis. 27, 2544–2553. doi: 10.3201/eid2710.202239,

PubMed Abstract | Crossref Full Text | Google Scholar

Hilt, E. E., and Ferrieri, P. (2022). Next generation and other sequencing technologies in diagnostic microbiology and infectious diseases. Genes 13:1566. doi: 10.3390/genes13091566,

PubMed Abstract | Crossref Full Text | Google Scholar

Hoad, M., Pavan, S., Nematollahzadeh, S., Tietz, O., Reeman, J., Forwood, J. K., et al. (2026). Importin α/β1 dependent nuclear import of black sea bass polyomavirus large tumor antigen is mediated by a classical NLS located downstream of the SF3 helicase domain. Biochim. Biophys. Acta Mol. Cell Res. 1873:120074. doi: 10.1016/j.bbamcr.2025.120074,

PubMed Abstract | Crossref Full Text | Google Scholar

Huson, D. H., Auch, A. F., Qi, J., and Schuster, S. C. (2007). MEGAN analysis of metagenomic data. Genome Res. 17, 377–386. doi: 10.1101/gr.5969107,

PubMed Abstract | Crossref Full Text | Google Scholar

Islam, M. M., Farag, E., Eltom, K., Hassan, M. M., Bansal, D., Schaffner, F., et al. (2021). Rodent Ectoparasites in the Middle East: a systematic review and meta-analysis. Pathogens 10:139. doi: 10.3390/pathogens10020139,

PubMed Abstract | Crossref Full Text | Google Scholar

Johnson, L. S., Eddy, S. R., and Portugaly, E. (2010). Hidden Markov model speed heuristic and iterative HMM search procedure. BMC Bioinformatics 11:431. doi: 10.1186/1471-2105-11-431,

PubMed Abstract | Crossref Full Text | Google Scholar

Kaczorowska, J., and van der Hoek, L. (2020). Human anelloviruses: diverse, omnipresent and commensal members of the virome. FEMS Microbiol. Rev. 44, 305–313. doi: 10.1093/femsre/fuaa007,

PubMed Abstract | Crossref Full Text | Google Scholar

Kalderon, D., Richardson, W. D., Markham, A. F., and Smith, A. E. (1984). Sequence requirements for nuclear location of simian virus 40 large-T antigen. Nature 311, 33–38. doi: 10.1038/311033a0,

PubMed Abstract | Crossref Full Text | Google Scholar

Karimov, K., Kachel, S. M., and Hackländer, K. (2018). Responses of snow leopards, wolves and wild ungulates to livestock grazing in the Zorkul Strictly Protected Area, Tajikistan. PLoS One 13:e0208329. doi: 10.1371/journal.pone.0208329,

PubMed Abstract | Crossref Full Text | Google Scholar

Kosugi, S., Hasebe, M., Tomita, M., and Yanagawa, H. (2009). Systematic identification of cell cycle-dependent yeast nucleocytoplasmic shuttling proteins by prediction of composite motifs. Proc. Natl. Acad. Sci. U S A 106, 10171–10176. doi: 10.1073/pnas.0900604106,

PubMed Abstract | Crossref Full Text | Google Scholar

Kumar, S., Stecher, G., Li, M., Knyaz, C., and Tamura, K. (2018). MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549. doi: 10.1093/molbev/msy096,

PubMed Abstract | Crossref Full Text | Google Scholar

Kwasnik, M., Rola, J., and Rozek, W. (2023). Tick-borne encephalitis—review of the current status. J. Clin. Med. 12:6603. doi: 10.3390/jcm12206603,

PubMed Abstract | Crossref Full Text | Google Scholar

Letunic, I., and Bork, P. (2021). Interactive Tree of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296. doi: 10.1093/nar/gkab301,

PubMed Abstract | Crossref Full Text | Google Scholar

Lindquist, L., and Vapalahti, O. (2008). Tick-borne encephalitis. Lancet 371, 1861–1871. doi: 10.1016/S0140-6736(08)60800-4

Crossref Full Text | Google Scholar

Liu, B., Bai, L., Fu, Y., Zhao, S., Liu, H., Wang, R., et al. (2020). Genetic and molecular features for hepadnavirus and plague infections in the Himalayan marmot. Genome 63, 307–317. doi: 10.1139/gen-2019-0161,

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, Z., Mi, B., Jiang, Z., Fei, B., Cai, Z., and Liu, X. (2016). Improved bulk density of bamboo pellets as biomass for energy production. Renew. Energy 86, 1–7. doi: 10.1016/j.renene.2015.08.011

Crossref Full Text | Google Scholar

Loutfy, S. A., Moneer, M. M., Salem, S. E., El-Moniem Abada, E. A., El-Moniem Ahmed, E. A., Ibrahim, L. H., et al. (2017). Polyomavirus infections and its clinical relevance in cancer patients: a prospective study. J. Infect. Public Health 10, 22–30. doi: 10.1016/j.jiph.2016.01.008,

PubMed Abstract | Crossref Full Text | Google Scholar

Luis, A. D., Hayman, D. T. S., O’Shea, T. J., Cryan, P. M., Gilbert, A. T., Pulliam, J. R. C., et al. (2013). A comparison of bats and rodents as reservoirs of zoonotic viruses: are bats special? Proc. Biol. Sci. 280:20122753. doi: 10.1098/rspb.2012.2753

Crossref Full Text | Google Scholar

Lyngdoh, S., Shrotriya, S., Goyal, S. P., Clements, H., Hayward, M. W., and Habib, B. (2014). Prey preferences of the snow leopard (Panthera uncia): regional diet specificity holds global significance for conservation. PLoS One 9:e88349. doi: 10.1371/journal.pone.0088349,

PubMed Abstract | Crossref Full Text | Google Scholar

Ma, Y., Liu, P., Li, Z., Yue, Y., Zhao, Y., He, J., et al. (2024). High genetic diversity of the himalayan marmot relative to plague outbreaks in the Qinghai–Tibet Plateau, China. BMC Genomics 25:262. doi: 10.1186/s12864-024-10171-y,

PubMed Abstract | Crossref Full Text | Google Scholar

Michalak, T. I. (2020). Diverse virus and host-dependent mechanisms influence the systemic and intrahepatic immune responses in the woodchuck model of hepatitis B. Front. Immunol. 11:853. doi: 10.3389/fimmu.2020.00853,

PubMed Abstract | Crossref Full Text | Google Scholar

Millman, I., Southam, L., Halbherr, T., Simmons, H., and Kang, C. M. (1984). ColabFold: making protein folding accessible to all.. Nat. Methods 19, 679–682. doi: 10.1038/s41592-022-01488-1,

PubMed Abstract | Crossref Full Text | Google Scholar

Mirdita, M., Schütze, K., Moriwaki, Y., Heo, L., Ovchinnikov, S., and Steinegger, M. (2022). Tick-borne encephalitis virus: a comprehensive review of transmission, pathogenesis, epidemiology, clinical manifestations, diagnosis, and prevention. Microorganisms 11:1634. doi: 10.3390/microorganisms11071634,

PubMed Abstract | Crossref Full Text | Google Scholar

Mlera, L., and Bloom, M. E. (2018). The role of mammalian reservoir hosts in tick-borne Flavivirus biology. Front. Cell. Infect. Microbiol. 8:298. doi: 10.3389/fcimb.2018.00298,

PubMed Abstract | Crossref Full Text | Google Scholar

Moreno, A., Canziani, S., Lelli, D., Castelli, A., Bianchi, A., Bertoletti, I., et al. (2024). Molecular and serological detection of bovine coronaviruses in marmots (Marmota marmota) in the alpine region. Viruses 16:591. doi: 10.3390/v16040591,

PubMed Abstract | Crossref Full Text | Google Scholar

Nandi, S., Chidri, S., Kumar, M., and Chauhan, R. S. (2010). Occurrence of canine parvovirus type 2c in the dogs with haemorrhagic enteritis in India. Res. Vet. Sci. 88, 169–171. doi: 10.1016/j.rvsc.2009.05.018,

PubMed Abstract | Crossref Full Text | Google Scholar

Petersen, G. F., Pavan, S., Ariawan, D., Tietz, O., Nematollahzadeh, S., Sarker, S., et al. (2025). Nuclear trafficking of Anelloviridae capsid protein ORF1 reflects modular evolution of subcellular targeting signals. Virus Evol 11:veaf069. doi: 10.1093/ve/veaf069,

PubMed Abstract | Crossref Full Text | Google Scholar

Pierson, T. C., and Diamond, M. S. (2020). The continued threat of emerging flaviviruses. Nat. Microbiol. 5, 796–812. doi: 10.1038/s41564-020-0714-0,

PubMed Abstract | Crossref Full Text | Google Scholar

Pustijanac, E., Buršić, M., Talapko, J., Škrlec, I., Meštrović, T., and Lišnjić, D. (2023). Tick-borne encephalitis virus: a comprehensive review of transmission, pathogenesis, epidemiology, clinical manifestations, diagnosis, and prevention. Microorganisms 11:1634. doi: 10.3390/microorganisms11071634,

PubMed Abstract | Crossref Full Text | Google Scholar

Ronquist, F., Teslenko, M., van der Mark, P., Ayres, D. L., Darling, A., Höhna, S., et al. (2012). MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst. Biol. 61, 539–542. doi: 10.1093/sysbio/sys029,

PubMed Abstract | Crossref Full Text | Google Scholar

Ruzek, D., Avšič Županc, T., Borde, J., Chrdle, A., Eyer, L., Karganova, G., et al. (2019). Tick-borne encephalitis in Europe and Russia: review of pathogenesis, clinical features, therapy, and vaccines. Antivir. Res. 164, 23–51. doi: 10.1016/j.antiviral.2019.01.014,

PubMed Abstract | Crossref Full Text | Google Scholar

Shan, T., Yang, S., Wang, H., Wang, H., Zhang, J., Gong, G., et al. (2022). Virome in the cloaca of wild and breeding birds revealed a diversity of significant viruses. Microbiome 10:60. doi: 10.1186/s40168-022-01246-7,

PubMed Abstract | Crossref Full Text | Google Scholar

Skewes-Cox, P., Sharpton, T. J., Pollard, K. S., and DeRisi, J. L. (2014). Profile hidden Markov models for the detection of viruses within metagenomic sequence data. PLoS One 9:e105067. doi: 10.1371/journal.pone.0105067,

PubMed Abstract | Crossref Full Text | Google Scholar

Spandole, S., Cimponeriu, D., Berca, L. M., and Mihăescu, G. (2015). Human anelloviruses: an update of molecular, epidemiological and clinical aspects. Arch. Virol. 160, 893–908. doi: 10.1007/s00705-015-2363-9,

PubMed Abstract | Crossref Full Text | Google Scholar

Streck, A. F., and Truyen, U. (2020). Porcine parvovirus. Curr. Issues Mol. Biol. 37, 33–46. doi: 10.21775/cimb.037.033,

PubMed Abstract | Crossref Full Text | Google Scholar

Svoboda, P., Dobler, G., Markotić, A., Kurolt, I.-C., Speck, S., Habuš, J., et al. (2014). Survey for hantaviruses, Tick-borne encephalitis virus, and Rickettsia spp. in small rodents in Croatia. Vector Borne Zoonotic Dis. 14, 523–530. doi: 10.1089/vbz.2013.1457,

PubMed Abstract | Crossref Full Text | Google Scholar

Topalis, D., Andrei, G., and Snoeck, R. (2013). The large tumor antigen: a “Swiss Army knife” protein possessing the functions required for the polyomavirus life cycle. Antivir. Res. 97, 122–136. doi: 10.1016/j.antiviral.2012.11.007,

PubMed Abstract | Crossref Full Text | Google Scholar

Wiley, S. R., Kraus, R. J., Zuo, F., Murray, E. E., Loritz, K., and Mertz, J. E. (1993). SV40 early-to-late switch involves titration of cellular transcriptional repressors. Genes Dev. 7, 2206–2219. doi: 10.1101/gad.7.11.2206,

PubMed Abstract | Crossref Full Text | Google Scholar

Xi, J., Duan, R., He, Z., Meng, L., Xu, D., Chen, Y., et al. (2022). First case report of human plague caused by excavation, skinning, and eating of a hibernating marmot (Marmota himalayana). Front. Public Health 10:910872. doi: 10.3389/fpubh.2022.910872,

PubMed Abstract | Crossref Full Text | Google Scholar

Yan, T., Wang, Z., Li, R., Zhang, D., Song, Y., and Cheng, Z. (2024). Gyrovirus: current status and challenge. Front. Microbiol. 15:1449814. doi: 10.3389/fmicb.2024.1449814,

PubMed Abstract | Crossref Full Text | Google Scholar

Yoshii, K. (2019). Epidemiology and pathological mechanisms of Tick-borne encephalitis. J. Vet. Med. Sci. 81, 343–347. doi: 10.1292/jvms.18-0373,

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, W., Li, L., Deng, X., Blümel, J., Nübling, C. M., Hunfeld, A., et al. (2016). Viral nucleic acids in human plasma pools. Transfusion 56, 2248–2255. doi: 10.1111/trf.13692,

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, W., Li, L., Deng, X., Kapusinszky, B., Pesavento, P. A., and Delwart, E. (2014). Faecal virome of cats in an animal shelter. J. Gen. Virol. 95, 2553–2564. doi: 10.1099/vir.0.069674-0,

PubMed Abstract | Crossref Full Text | Google Scholar

Zhao, M., Yue, C., Yang, Z., Li, Y., Zhang, D., Zhang, J., et al. (2022). Viral metagenomics unveiled extensive communications of viruses within giant pandas and their associated organisms in the same ecosystem. Sci. Total Environ. 820:153317. doi: 10.1016/j.scitotenv.2022.153317,

PubMed Abstract | Crossref Full Text | Google Scholar

Zhu, W., Yang, J., Lu, S., Lan, R., Jin, D., Luo, X., et al. (2020). Beta- and novel Delta-coronaviruses are identified from wild animals in the Qinghai–Tibetan Plateau, China. Virol. Sin. 36, 402–411. doi: 10.1007/s12250-020-00325-z,

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: blood, Qinghai–Tibet Plateau, viral diversity, wild marmots, zoonotic potential

Citation: Wu H, Jiang X, Xi Y, Ning S, Wu H, Xin W, Peng W, Wang S and Zhang W (2026) Characterization of viral diversity in wild marmot blood from the Qinghai–Tibet Plateau. Front. Microbiol. 17:1668126. doi: 10.3389/fmicb.2026.1668126

Received: 17 July 2025; Revised: 24 December 2025; Accepted: 26 January 2026;
Published: 06 February 2026.

Edited by:

Mohammed Rohaim, Cairo University, Egypt

Reviewed by:

Gualtiero Alvisi, University of Padua, Italy
Yi Sun, Beijing Institute of Microbiology and Epidemiology, China

Copyright © 2026 Wu, Jiang, Xi, Ning, Wu, Xin, Peng, Wang and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Shengjun Wang, c2p3anNAdWpzLmVkdS5jbg==; Wen Zhang, ejAyMTZ3ZW5AeWFob28uY29t

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.