ORIGINAL RESEARCH article

Front. Vet. Sci., 22 April 2025

Sec. Animal Nutrition and Metabolism

Volume 12 - 2025 | https://doi.org/10.3389/fvets.2025.1582122

This article is part of the Research TopicAdvancements in Synthetic Microbiomes for Enhancing Animal HealthView all 9 articles

Characterization of the diversity, genomic features, host bacteria, and distribution of crAss-like phages in the pig gut microbiome

  • National Key Laboratory of Pig Genetic Improvement and Germplasm Innovation, Jiangxi Agricultural University, Nanchang, China

Phages play an important role in shaping the gut microbiome. CrAss-like phages, which are key members of the gut virome, show high abundance in the human gut and have attracted increasing interest. However, few studies have been found in pigs, and the distribution of crAss-like phages across broader pig populations remains unknown. Here, we obtained 1,251 pig crAss-like phage genomes from 403 metagenomes publicly available and a pig gut virome dataset constructed by ourselves. These crAss-like phage genomes were further clustered into 533 virus operational taxonomic units (vOTUs). Phylogenetic analysis revealed that crAss-like phages in pig guts were distributed across four well-known family-level clusters (Alpha, Beta, Zeta, and Delta) but were absent in the Gamma and Epsilon clusters. Genomic structure analysis identified 149 pig crAss-like phage vOTUs that utilize alternative genetic codes. Gene blocks encoding replication and assembly proteins varied across crAss-like phage clusters. Approximately 64.73% of crAss-like phage genes lacked functional annotations, highlighting a gap in understanding their functional potential. Numerous anti-CRISPR protein genes were identified in crAss-like phage genomes, and CAZymes encoded by these phages were primarily lysozymes. Host prediction indicated that bacterial hosts of pig crAss-like phages primarily belonged to Prevotella, Parabacteroides, and UBA4372. We observed that interactions between crAss-like phages and Prevotella copri might have a possible effect on fat deposition in pigs. Finally, all detected vOTUs exhibited low prevalence across pig populations, suggesting heterogeneity in crAss-like phage compositions. This study provides key resources and novel insights for investigating crAss-like phage-bacteria interactions and benefits research on the effects of crAss-like phages on pig health and production traits.

1 Introduction

The virome has been considered as “dark matter” in the microbiome due to the lack of comprehensive study. Recent years, with the development of metagenomic sequencing technologies and bioinformatic tools, it has become one of the hot research topics in the microecosystem area (13). Various evidences have indicated that phages can shape the gut microbiome through a variety of ways, such as lysing host bacteria, integrating into host bacteria as proviruses, and metabolic reprogramming via auxiliary metabolic genes (AMGs) (4, 5). Among the growing datasets of viruses, crAss-like phages are one of the most notable new viral members that are extremely widespread with high abundance in the human gut (6).

Dutilh et al. (7) assembled the first crAss-like phage genome in human fecal metagenomes with a circular genome of ~97 kb in length. Since then, a growing number of crAss-like phages have been identified through different studies (810). Guerin et al. (10) classified 249 crAssphage genomes into four clusters of Alphacrassvirinae, Betacrassvirinae, Gammacrassvirinae, and Deltacrassvirinae according to the percentage of shared homologous proteins. Subsequently, the International Committee on Taxonomy of Viruses (ICTV) officially defined the order Crassvirales which include four families and ten subfamilies based on the phylogenetic tree of conserved viral genes (11). Yutin et al. (9) discovered two additional potential subclusters of crAss-like phages at the family level, which were defined as Zeta and Epsilon groups. These crAss-like phages have been detected not only in the human guts, but also in the guts of many other animals. For instance, Edwards et al. (8) found a series of nearly complete genomes of distant relatives of crAss-like phages in non-human primates. Additionally, Shkoporov et al. (12) and Li et al. (13) also reported Crassvirales members in the gut microbiome of rhesus monkeys, pigs, and cats.

The majority of crAss-like phages were predicted to infect bacteria in the phylum Bacteroidetes by a variety of bioinformatic methods, such as CRISPR spacer matching and co-abundance analysis (14). In addition, 17 crAss-like phages were found to link to bacteria in different genera of Bacteroidetes by the metagenomic Hi-C approach (15). To date, the isolation of crAss-like phages in vitro cultures has proven to be challenging (16). However, the study in the cultures of Bacteroides intestinalis found that crAss-like phage and bacteria could multiply in parallel due to a dynamic equilibrium between phage sensitivity and resistance through rapid phase variation of alternate capsular polysaccharides (17). In addition, a carrier state existed for crAss-like phages by delayed release or progeny from infected bacterial cells (17). These observations have implied that crAss-like phages persist in the gut in a “benign” form (8). Other studies have suggested that crAss-like phages are a part of the normal human virome. Different genera of crAss-like phages might prefer certain lifestyles and diets in humans (14). It has also been reported that crAss-like phages may be associated with a variety of diseases, such as inflammatory bowel disease (IBD) (18) and metabolic syndrome (19). However, the phylogenetic composition, genomic structure, and bacterial hosts of crAss phages in the pig gut microbiome remain largely unknown. In addition, their distribution across diverse pig populations has not yet been fully characterized.

In this study, we performed a massive study of pig gut crAss-like phages by combining (1) 380 crAss-like phages newly identified from a wide range of 403 pig metagenomes, and (2) 871 crAss-like phages previously identified in a pig gut virome dataset constructed by ourselves (Figure 1A). All these pig crAss-like phage genomes were clustered into 533 vOTUs. By constructing a phylogenetic tree based on highly conserved proteins (TerL), we found that crAss-like phages identified in pig guts were widely distributed in the clusters of Alpha, Beta, Zeta, and Delta, but were completely absent in the Gamma and Epsilon clusters. We derived and annotated 57,171 hypothetical proteins from 525 pig crAss-like phage vOTUs with taxonomic information, and found that these phages encoded numerous lysozymes and anti-CRISPR proteins. Host bacteria of crAss-like phages were also comprehensively analyzed. Based on this, we further suggested that pig crAss-like phages might interact with Prevotella copri affecting fat deposition in pigs. We also investigated the prevalence of pig crAss-like phages in global samples.

Figure 1
www.frontiersin.org

Figure 1. Summary of the pipeline for clustering pig crAss-like phage OTUs and the percentages of samples detected crAss-like phage genomes in each dataset. (A) An overview of the pipeline for clustering pig crAss-like phage OTUs. (B) The pie charts representing the percentage of samples in each dataset that assembled crAss-like phage genomes. The number at the top of each pie chart represents the number of samples included in each dataset.

2 Materials and methods

2.1 The datasets of pig metagenomes and crAss phage genomes used in this study

2.1.1 Pig metagenomic sequencing datasets

We collected 403 metagenomes for the identification of pig crAss-like phages from the following nine studies including PRJEB11755 (20) (n = 200), PRJNA526405 (21) (n = 42), PRJEB62878 (22) (n = 27), PRJEB23112 (23) (n = 16), PRJNA629856 (24) (n = 27), PRJNA857725 (25) (n = 32), PRJNA788462 (26) (n = 12), PRJEB32496 (27) (2020; n = 36), PRJNA647157 (28) (n = 11).

2.1.2 Pig crAss-like phage genomes

A total of 871 pig crAss-like phage genomes were identified from the gut metagenomes of 44 pigs, including Bamaxiang pigs, Large White pigs, and wild boars from North and South of China.

2.1.3 Human crAss-like phage genomes used in this study

A total of 1,446 human crAss-like phage genomes were collected from three studies and used in this study, including 249 crAss-like phage genomes identified by Guerin et al. (10), 596 crAss-like phage genomes from Yutin et al. (9) and 601 genomes from Gulyaeva et al. (18).

2.2 Contig assembly and identification of crAss-like phages from metagenomic sequencing data

Raw sequence reads from metagenomic sequencing data were processed to remove adaptor and low-quality sequences using fastp (v0.20.1) (29). Host contamination was further removed by mapping clean sequence reads to the pig reference genome (Sscrofa11.1) using bowtie2 (v2.4.2) (30). Clean sequence reads were assembled using MEGAHIT (v1.2.9) (31). Contigs with the length ≥ 3 kb were used for virus prediction by VirSorter2 (32), DeepVirFinder (33), and VIBRANT (34). Contigs that met one of the following criteria were retained for subsequent analysis: (1) VirSorter2 score ≥0.9; (2) DeepVirFinder score ≥ 0.9 and p < 0.01; (3) VirSorter2 score > 0.7, DeepVirFinder score > 0.7 and p < 0.05; and (4) be positive in VIBRANT prediction. Finally, we used CheckV (35) (v0.8.1) to trim bacterial contaminations in the contigs.

We used two approaches to detect crAss-like phages. First, two genetic signature proteins of p-CrAssphages (polymerase: UGP_018 and terminase: UGP_092) were queried against the protein sequences of viral genomes using blastp from the blast+ package (v2.5.0) (36). The proteins with an E-value < 1 × 10−5 and a query alignment length ≥ 350 bp were considered as the hits. The viruses were considered as putative crAss-like phages if its genome contained either polymerase or terminase hit and had the length greater than 70 kb. Second, identifying three conserved structural proteins (terminase large subunit, portal protein, and major capsid protein) of crAss phages in viral genomes using the hmmsearch (−E 0.001) in the HMMER package (v3.3.2) (37). We referenced the hmm profiles previously constructed by Yutin et al. (9), Since previous study has indicated the presence of alternative genetic codons in crAss phages, we also used the transeq (−clean) in EMBOSS programme (38) for translation analysis of viral sequences with six different types of open reading frames. The virus contigs were considered as candidate crAss-like phages if at least one protein sequence hit three conserved structural proteins. Finally, viral contigs identified by both methods were combined and considered as the final set of crAss-like phage genomes.

2.3 Clustering of crAss-like phage genomes

To construct crAss-like phage vOTUs, we combined 380 newly identified and 871 previously obtained pig crAss-like phage genomes. vOTUs were clustered at the threshold of 95% average nucleotide identity (ANI) and 85% coverage that referred to the MIUViG standard proposed by Roux et al. (39) using a custom script in the CheckV repository (35, 39). Using the same methods and thresholds, all 1,446 human crAss-like phage genomes were also clustered into vOTUs. The longest crAss-like phage genome sequence in each vOTU was chosen as the representative genomes of vOTUs.

To construct the genus-level viral clusters (VCs), we clustered 533 pig crAss-like phage vOTUs and 337 human crAss-like phage vOTUs using the vConTACT v2.0 (--rel-mode “Diamond” --db “ProkaryoticViralRefSeq201-Merged” --pcs-mode MCL - - -vcs-mode ClusterONE) based on gene sharing networks (40).

2.4 Phylogeny reconstruction and the assignment of genomes to crAss-like phage clusters

TerL protein sequences of 533 pig crAss-like phage vOTUs and 337 human crAss-like phage vOTUs were detected using the TerL hmm profiles provided by Yutin et al. (9). Multiple sequence alignments of TerL were performed using the MUSCLE (41) with default parameters. Phylogenetic trees were constructed using the iqtree (-B 1000) (42). The phylogenetic trees were then midpoint-rooted and visualized using the iToL (43).

Reference taxonomies of human crAss-like phage vOTUs obtained from three previous studies in humans (9, 10, 18) and expanded to pig crAss-like phage vOTUs in the phylogenetic tree. In brief, the most recent common ancestor (MRCA) of human crAss-like phage vOTUs belonging to the same taxonomy on the phylogenetic tree was determined using phangorn 2.11.1 and ape 5.6-2 in R packages. All descendants of that MRCA were assigned to the classified groups. One human crAss phage vOTU (QWCE01000387) had no reference taxonomy and did not belong to one of the six crAss phage clusters on the phylogenetic tree, so it was removed from subsequent analyses. In addition, eight pig crAss-like phage vOTUs were not annotated to any taxonomies and removed from subsequent analyses because it did not belong to the MRCA of any reference taxonomies.

2.5 Determination of genetic codes and tRNA scanning

The genetic codes of crAss-like phage genomes were determined using the prodigal-gv, a modified version of the Prodigal software (44). This software could automatically detect the recoding of the stop codon TAG. Detection of tRNAs in the genome was performed using the tRNA-scan-SE (2.0.12) (45) with the parameters: -G -X 35.

2.6 Comparison of genomic structures among different crAss-like phage clusters and functional annotation of crAss-like phage genomes

Two approaches were used to annotate the functional capacities of predicted proteins. First, protein sequences of crAss-like phage genes were aligned to the profiles of highly conserved crAss-like phage proteins constructed by Yutin et al. (9) using the hmmsearch in the HMMER package (37) with e-values ≤1e−3. And then, the unannotated proteins were further annotated using the geNomad (44) with default parameters. Proteins that could not be annotated by either method were defined as unknown proteins.

Identification of Carbohydrate-Active Enzymes (CAZymes) was performed using online dbCAN31 (46). AcrPred (47) and PaCRISPR (48) were used to identify anti-CRISPR proteins (Acr) in a catalog of unknown proteins. Only those proteins identified as putative Acrs by both software were retained. Putative Acr proteins with > 200 amino acids in length were removed from further analysis because of possible false positives. The plots of genomic structure comparison were generated using ViPTree2 (49) with default parameters.

2.7 Prediction of bacterial hosts for crAss-like phages based on CRISPR spacers

Bacterial hosts of crAss-like phages were predicted by matching the bacterial CRISPR-spacer sequences to crAss-like phage genomes using blastn (−task blastn-short) (36) allowing a maximum of one mismatch across the whole spacer region sequence. CRISPR spacer sequences were predicted from two MAG datasets including the Unified Human Gastrointestinal Genome (UHGG) catalog (50) and MAGs of pig gut microbiome from our previous study using MinCED (51).

2.8 Determining the abundances of vOTU using metagenomic sequencing data

Clean reads of metagenomic sequencing data were mapped to the genomes of 525 pig crAss-like phage vOTUs using BWA MEM (v0.7.17-r1188) (52). The output was converted to BAM format via Samtools (v1.15.1) (53). The abundance of crAss-like phage vOTUs in a metagenomic sequencing data (one sample) was estimated by CoverM (v0.6.1)3 with options “contig --min-read-aligned-percent 95 --min-covered-fraction 75 --methods rpkm” and normalized the genome size and sequencing depth with RPKM (reads per kilobase per million mapped reads). Clean reads with less than 90% identity to crAss-like phage genomes were excluded from the analysis. The abundance of a vOTU in a sample was considered zero if the coverage of its genome by clean reads was less than 75% in that sample.

2.9 Statistical analysis

The comparisons of genome size (using complete genomes), the number of genes between marker genes (block), and the number of Acr in crAss-like phage genomes among different clusters were performed by the Wilcoxon test (pairwise comparison). The number of crAss-like phage vOTUs detected in different studies was compared by Kruskal-Wallis (multiple group comparison). Association analysis of crAss-like phage vOTUs with pig fat deposition were analyzed using the MaAsLin2 (54) in R package.

3 Results

3.1 Identification of crAss-like phages from geographically diverse pig metagenomes

To comprehensively characterize the diversity of crAss-like phages in pig guts, we collected 403 public pig metagenomes from nine studies across five continents (2028). Using the methods described previously (see Method Details), we assembled 380 crAss-like phages from 54% of 403 tested samples (33.33–81.82% across the nine studies). The number of crAss-like phages identified in each sample ranged from one to nine (Figure 1B). To generate vOTUs of pig crAss-like phages with comprehensive representation, we also combined 871 crAss-like phages previously identified from pig gut metagenomes. These 1,251 pig crAss-like phages had an average genome size of 100.97 kb (Supplementary Table S1). At the thresholds of 85% coverage and 95% average nucleotide identity (ANI) (39), a total of 533 pig crAss-like phage vOTUs were obtained (Supplementary Table S1). More than half of these 533 vOTUs (286, 53.7%) were singleton, and the crAss-like phage vOTU containing the largest number of 51 genomes was from the metagenome of a Bamaxiang pig (BMXV030000188) (Supplementary Figure S1A; Supplementary Table S1). In order to compare human and pig derived crAss-like phages, we also collected 1,446 crAss-like phage genomes from three human studies (9, 10, 18) (Figure 1A). A total of 337 human crAss-like phage vOTUs were generated under the same threshold. The vOTU NL_crAss000703 contained the largest number of 239 crAss-like phage genomes (Supplementary Figure S1B; Supplementary Table S1). The longest available genomes of these vOTUs were selected as representative genomes and used for subsequent analyses.

All 533 pig and 337 human crAss-like phage vOTUs were combined and further clustered into 136 genus-level VCs (viral clusters) using vConTACT (v.2.0) (40). There were 31 VCs that were comprised of both human and pig crAss-like phage vOTUs, 33 VCs containing only human crAss-like phage vOTUs, and 72 VCs containing only pig crAss-like phage vOTUs (Supplementary Figure S1C; Supplementary Table S2).

3.2 Phylogenomics and genomic features of human and pig crAss-like phages

To comprehensively explore the evolutionary relationship between pig-derived crAss-like phages and human-derived crAss-like phages, a phylogenetic tree based on TerL (terminase large subunit) genes was constructed with 533 pig and 337 human crAss-like phage vOTUs (Figure 2A). The phylogenetic structure divided these crAss-like phage vOTUs into six distinct evolutionary branches which corresponded to alpha cluster (Intestiviridae), beta cluster (Steigviridae), gamma cluster (Crevaviridae), delta cluster (Suoliviridae), and two potential new crAss-like families (epsilon and zeta clusters) that were reported (9). Pig crAss-like phage vOTUs were widely distributed in these evolutionary branches, and 525 out of 533 pig crAss-like phage vOTUs could be classified into four crAss-like families referring to the classification of human crAss-like phage vOTUs in the same cluster using the most recent common ancestor (MRCA) approach (18). There were 135, 233, 110, and 47 pig crAss-like phage vOTUs being classified into alpha, beta, delta, and zeta crAss-like families, respectively (Supplementary Figure S2A; Supplementary Table S3). However, we did not find any pig crAss-like phage vOTUs clustered into the gamma and epsilon evolutionary branches. Just like the observation in human crAss phages (Supplementary Figure S2B), pig crAss-like phage vOTU in the zeta cluster had a significantly larger genome size (an average of 171,298 bp) (Figure 2B). Additionally, we compared the genome sizes of pig and human crAss phages within the same cluster. We found that human crAss-like phages had the larger genome size than those identified in pigs in the alpha cluster, while the reverse tendency was observed for crAss-like phages in the beta cluster (Supplementary Figure S2C).

Figure 2
www.frontiersin.org

Figure 2. Reconstruction of the phylogenetic tree of crAss-like phage genomes and the comparison of genomic features among different crAss-like phage clusters. (A) A TerL-based phylogenetic tree constructed with 533 pig crAss-like phage vOTUs and 337 human crAss-like phage vOTUs. Differently colored branches represent different classifications of human crAss-like phage vOTUs at the family-level, and no colored branches represent pig crAss-like phage vOTUs. Yellow and blue colors in the outer cycle indicate the identification of suppressor tRNA or not. The different colors in the inner cycle show the utilization of alternative genetic codes. (B) The boxplots showing the comparison of contig lengths (genome sizes) among pig crAss-like phages from different clusters. The boxplots show medians (bold lines), the upper and lower quartiles. The comparison was performed by non-parametric Wilcoxon test. (C) The stacked barplots represent the number of pig crAss-like phages using different genetic codes in each crAss-like phage cluster. (D) The boxplots showing the comparisons of gene numbers between IHF_54 and MCP (left), between MCP and portal (middle), and between TerL and primase (right) in crAss-like phage genomes among different clusters. The number of genes between TerL and primase in the Epsilon cluster could not be determined because the primase gene was not detected in the genomes of Epsilon crAss-like phages. The boxplots show medians (bold lines), the upper and lower quartiles. The comparison was performed by non-parametric Wilcoxon test. **** p < 0.0001, *** p < 0.001, ** p < 0.01, * p < 0.05.

Previous study has indicated that alternative genetic codes are widely present in human crAss-like phages (9). Using the prodigal-gv (v2.9.0), we identified a total of 149 pig crAss-like phage vOTUs using alternative genetic codes. Therein, the stop codon TAG was recoded for glutamine (genetic code 15), and the stop codon TGA was recoded for tryptophan (genetic code 4) (Figure 2A; Supplementary Table S3). Specifically, similar to the report in human crAss-like phages (9), we found no alternative genetic codes in pig crAss-like phages in the alpha cluster. Thirty-seven out of 233 pig crAss-like phage vOTUs in the beta cluster could use the genetic code 15. However, we did not find this in human crAss-like phages in the beta cluster. There were 78 pig crAss-like phage vOTUs accounting for 71% of crAss-like phage vOTUs in the delta cluster that used the genetic code 15. Furthermore, 19 and 15 out of 47 pig crAss-like phage vOTUs in the zeta cluster could use the genetic code 15 and code 4, respectively (Figure 2C; Supplementary Figure S2D). One of the common mechanisms of the stop codon reassignment is the utilization of repressor tRNAs, so we further examined the presence of repressor tRNAs in pig crAss-like phages. The results indicated that a higher percentage of pig crAss-like phages using the alternative genetic codes (26.67–94.59%) were identified with repressor tRNAs than those using the standard genetic code (0–15.38%) (Supplementary Table S3). Especially, repressive tRNAs were identified in 35 out of 37 pig crAss-like phages using the genetic code 15 in the beta cluster. Interestingly, several crAss-like phages using the standard genetic code (two, eight, and two pig crAss-like phages in the alpha, beta, and zeta cluster, respectively) also had repressive tRNAs.

To compare the differences in the genome structure of crAss-like phages between humans and pigs among six different clusters, we selected five human and five pig crAss-like phage genomes which were considered as complete genomes by CheckV (v0.8.1) from each of four clusters as representative genomes. Additionally, five human crAss-like phage genomes were selected from the gamma and epsilon clusters. We found that the positions of marker genes in the genome structures of crAss-like phages from humans and pigs in the same clusters were basically similar (Supplementary Figures S3–S7), suggesting conservation of the genome structure for crAss-like phages from humans and pigs in the same cluster. Previous studies have demonstrated that the genomes of crAss-like phages contain three large gene blocks responsible for assembly, replication, and transcription (9, 55). The gene blocks between IHF 54 and MCP, and between MCP and portal were responsible for the phage assembly, while the gene block between TerL and primase was related to replication. We compared the number of genes in each gene block among six crAss-like phage clusters. CrAss-like phages from the alpha, delta, epsilon, and gamma clusters had significantly more genes in the block between IHF54 and MCP compared to crAss-like phages from the beta and zeta clusters, but had significantly lower number of genes in the block between MCP and portal (Figure 2D). This implies differences in gene blocks encoding proteins for phage assembly among crAss-like phages from different clusters. We also found that crAss-like phages in the zeta cluster possessed significantly more genes in the block between TerL and primase (Figure 2D), which validated the previous report that indicated a complex block of replication-related genes in crAss-like phage genomes from the zeta cluster (9). The gene block for transcription did not conserve, so we did not compare its gene numbers among the six crAss-phage clusters.

3.3 Potential functional capacity of pig crAss-like phages

To gain insights into the potential function capacities of pig crAss-like phages, we predicted 57,171 hypothetical proteins from 525 crAss-like phage vOTUs with family-level taxonomic information using prodigal-gv (V2.9.0). Functional annotation of hypothetical proteins was performed using the HMMsearch based on HMM profiles from Yutin et al. (9) and geNomad (44) with default parameters. Overall, 64.73% of crAss-like phage genes were not assigned to any functional items, indicating the deficiency to the understanding of functional potentials of pig crAss-like phages (Figure 3A; Supplementary Table S4). The remaining 35.27% of annotated genes mainly belonged to the functional items that conserved in crAss-like phages (9) (Figure 3B; Supplementary Table S4). For examples, HNH, one of the endonucleases that has been demonstrated to insert into some of phage genes as an intron or intronic peptide (9), contained the largest number of annotated genes. Major capsid protein (MCP), the major structural component of virus particles, had the second largest number of annotated genes (Figure 3B).

Figure 3
www.frontiersin.org

Figure 3. Functional capacities of pig crAss-like phages. (A) The pie chart representing the proportion of 57,171 genes from 525 pig crAss-like phage vOTUs that could be functionally annotated. (B) The barplot indicating the functional items with the number of annotated genes in the top 50. (C) The barplot showing CAZymes encoded by pig crAss-like phage genomes. The number of annotated genes for each item is shown on the right of bars. (D) Prevalence (%) of each CAZyme type detected in this study in different crAss-like phage clusters. (E) Comparison of the numbers of Acr detected in different clusters of pig crAss-like phages. The boxplots show medians (bold lines), the upper and lower quartiles. The comparison was performed by non-parametric Wilcoxon test. **** p < 0.0001, *** p < 0.001, ** p < 0.01, * p < 0.05.

We further used dbCAN3 to predict carbohydrate-active enzymes (CAZymes) in pig crAss-like phage genomes. A total of 112 genes encoding 10 types of CAZymes were identified from 103 out of 525 crAss-like phage vOTUs, including six types of Glycoside Hydrolases (GH73, GH108, GH24, GH23, GH25, and GH16) and four types of Glycosyl Transferases (GT2, GT6, GT32, and GT11) (Figure 3C; Supplementary Table S4). GH73 contained the largest number of annotated genes (n = 42) and was identified in 13.7 and 21.2% of pig crAss-like phage genomes in the beta and zeta clusters, respectively. However, it was completely absent in the alpha and delta clusters (Figure 3D). GH108, which had the second-largest number of annotated genes (n = 31), was detected in 21.5 and 1.8% of pig crAss-like phage genomes in the alpha and delta clusters, respectively. But it was not found in the beta and zeta clusters (Figure 3D). Most strikingly, except GH16, all types of Glycoside Hydrolases encoded by crAss-like phage genes were annotated as lysozyme which can lyse bacteria by degrading peptidoglycan in the cell wall (56).

Phages have evolved a variety of anti-defense mechanisms in the “arms race” with host bacteria, such as anti-CRISPR proteins (Acr) (57). Most Acrs are short and highly variable. It was also a main characteristic that matched the large number of proteins encoded by crAss phage genes lacking annotation information. Using PaCRISPR (48) and AcrPred (47), 6,605 genes encoding Acrs were predicted from 523 pig crAss-like phage genomes, with an average of 12.6 Acrs-encoding genes per genome. CrAss-like phage genomes in the zeta cluster had a significantly higher number of Acrs than those in the other three clusters (Figure 3E; Supplementary Table S4). We speculated that this might be related to their large genome sizes and rapid sequence evolution of crAss-like phages in the zeta cluster (9).

3.4 Host bacteria of pig crAss-like phages and the possible effect of the interaction between crAss-like phages and bacterial species on fat deposition in pigs

To explore the interactions between crAss-like phages and bacterial hosts, we extracted CRISPR spacer regions from bacterial genomes in the Unified Human Gastrointestinal Genome (UHGG) database (50) and MAGs assembled in our previous study. About 51% (n = 267) of 525 pig crAss-like phage vOTUs were identified bacterial hosts. Bacteroidota included the greatest number of pig crAss-like phage hosts (n = 229). There were 21 pig crAss-like phages whose bacterial hosts were detected in both Bacteroidota and Firmicutes A. At the genus level, Prevotella (n = 78), Parabacteroides (n = 17), and UBA4372 (n = 13) contained the large numbers of bacterial hosts of pig crAss-like phages (Figure 4A; Supplementary Table S5). These phylum and genus were the predominant bacterial taxa in the pig gut microbiota (58, 59). In addition, 66.67% (n = 178) of pig crAss-like phage vOTUs with host information had their hosts in only one bacterial genus, whereas the other 33.33% of crAss-like phages (n = 89) were detected their hosts across multiple genera (≥ 2) (Figure 4A; Supplementary Table S5).

Figure 4
www.frontiersin.org

Figure 4. Host distribution of pig crAss-like phages and the effect of bacterial host of Prevotella on pig fat deposition. (A) The Sankey diagram representing the taxonomic distribution of bacterial hosts of pig crAss-like phage vOTUs. The colors of the rectangles represent different taxonomic levels of bacterial hosts. The length of the rectangles indicates the number of pig crAss-like phage vOTUs. The abbreviations of taxonomic names for some predominant host bacteria are shown in boxes. The abbreviations for other host bacterial taxa are only shown in Supplementary Table S5. (B) The heatmap (left) indicates the vOTUs associated with pig fat deposition. The fatness-associated vOTUs were identified by MaAsLin2 analysis. The purple boxes show the vOTUs negatively associated with fat deposition, and the red boxes indicate the vOUTs positively associated with high fatness. The color gradients exhibit the strength of the associations calculated by −log (qval) * sign (coeff). The bacterial hosts of vOTUs were identified through matching to CRISPR spacers. The fatness-associated vOTUs and its bacterial hosts were connected with lines. Prevotella and Prevotella copri are specially indicated with red and blue. (C) In comparison of the relative abundance of Prevotella and Prevotella spp. between high fatness pigs and lean pigs. The comparison was performed by non-parametric Wilcoxon test.

To compare the distribution of bacterial hosts between pig and human crAss-like phages, we also predicted potential bacterial hosts for human crAss-like phages. About 84.52% of 336 human crAss-like phage vOTUs (n = 284) were identified bacterial hosts. The taxa of bacterial hosts for human crAss-like phages were similar to that for pig crAss-like phages at all the phylum, class, order, and family levels. At the genus level, the largest number of bacterial hosts of human crAss-like phages belonged to Bacteroides (n = 59), followed by Prevotella (n = 39), and Parabacteroides (n = 23). However, as mentioned above, only 3 pig crAss-like phages were identified bacterial hosts belonging to Bacteroides, indicating the differentiation of bacterial hosts between pig and human crAss-like phages at the high resolution of taxonomic levels (Supplementary Figure S8A).

The interaction between phages and their bacterial hosts plays an important role in animal health or shaping animal phenotypes (60). As one of the important hosts of crAss-like phages described above, Prevotella has been confirmed to be associated with pig fat deposition in our previous study (61). Here we further investigated whether the interaction of crAss-like phages with Prevotella was associated with pig fat deposition. Sixteen fecal metagenomic sequencing data from previous study, including eight data from high fatness pigs and eight data from lean pigs (62) were used in this study. The abundances of pig crAss-like phage vOTUs were calculated by mapping clean metagenomic sequencing reads to the genome sets of pig crAss-like phage vOTUs constructed in this study. A total of 18 pig crAss-like phage vOTUs were identified to have higher abundances in high-fatness pigs, while three vOTUs showed higher abundances in lean pigs by employing the linear mixed model implemented in MaAsLin2 (Figure 4B; Supplementary Table S6). We further analyzed the host bacteria of these 21 fatness-associated crAss-like phage vOTUs and found that 14 out of the 18 vOTUs enriched in high-fatness pigs had detectable bacterial hosts. Interestingly, the host bacteria for 10 out of these 14 high fatness-associated vOTUs belonged to Prevotella (Figure 4B; Supplementary Table S6). Besides Prevotella, other bacterial genera were also identified as the host bacteria for eight high fatness-associated vOTUs. For example, five vOTUs should targeted bacteria across Prevotella, Phocaeicola, Prevotellamassilia, and UBA4372 in Bacteroidaceae (Figure 4B; Supplementary Table S6). Our previous study has demonstrated that 12 Prevotella spp. (especially Prevotella copri) and three Bacteroides spp. were significantly associated with pig fat deposition, with higher abundances in fatness pigs (low lean meat percentage) than in lean pigs (62). Prevotella copri was a core bacterial species in the pig gut microbiome with high abundance. Here, Prevotella copri was confirmed to significantly enrich in fatness pigs (p = 0.01) and was identified as the bacterial hosts of three crAss-like phages which also had higher abundance in fatness pigs (Figures 4B,C). Furthermore, Prevotella was also enriched in high-fatness pigs, although it did not achieve a significance level (p = 0.13) (Figure 4C). This result was consistent with the mutually beneficial and multiply in parallel between crass-like phages and their bacterial hosts (17). This suggested that crAss-like phages in the pig gut might influence fat deposition by interactions with bacterial species in Prevotella and Bacteroidaceae. Unfortunately, we did not detect bacterial hosts for three low fatness-associated vOTUs.

3.5 Global distribution of pig crAss-like phages

We first calculated the number of crAss-like phage vOTUs detected in each of the 403 samples from nine previous studies to assess the diversity of crAss-like phages in tested samples. A vOTU was considered to be detected in a sample if >75% of its representative genome sequence was covered by metagenomic sequencing reads from that sample. Samples from Lim et al. (27) (n = 36) had the largest average number of crAss-like phage vOTUs per sample detected (34.4 crAss-like phages/sample on average, ranging from 8 to 135), followed by the Xiao et al. (20) (30.9 crAss-like phages/sample). The least number of crAss-like phage vOTUs per sample was detected in Holman et al. (25) with an average of 2.8 vOTUs/sample (Figure 5A; Supplementary Table S7).

Figure 5
www.frontiersin.org

Figure 5. The distribution of pig crAss-like phages in geographically diverse pig populations. (A) The boxplots representing the number of pig crAss-like phage vOTUs identified in samples of each pig population. The boxplots show the medians (bold lines), the upper and lower quartiles. The comparison was performed by the non-parametric Kruskal test. (B) The TerL-based phylogenetic tree constructed with 56 pig crAss-like phage vOTUs with >10% prevalence. The heatmap on the right represents the prevalence of vOTU in all samples and in each pig population. (C) The heatmap representing the prevalence of four pig crAss-like phage clusters in each of 10 pig populations.

We then determined the prevalence of pig crAss-like phage vOTUs in all tested samples to identify core crAss-like phages in the pig gut. The results showed that all detected vOTUs had low prevalence. There were 56 crAss-like phage vOTUs with >10% prevalence (alpha = 18, beta = 19, delta = 13, and zeta = 6) (Figure 5B). Among them, only three crAss-like phage vOTUs showed >30% prevalence, with the highest at 30.8% (Figure 5B). The vOTU (SRR8960468m146280) with the highest prevalence belonged to alpha cluster and could target to the bacteria in Eubacterium and Prevotella. Its extensive bacterial hosts across genera likely contribute to its detection in a wide range of samples (63). However, this vOTU was not detected in samples from the studies of Holman et al. (25) and Zhou et al. (28) (Figure 5B). We further calculated the prevalence of all pig crAss-like phage vOTUs at the pig population (study) level. A vOTU was considered detected in a pig population if it was found in at least one sample of that pig population. The results showed that 119 crAss-like phage vOTUs were detected in only one pig population. Seven crAss-like phage vOTUs were detected in the largest number of eight pig populations (Supplementary Table S7).

Finally, we analyzed the distribution of four pig crAss-like phage clusters (alpha, beta, delta, and zeta) in different pig populations. A crAss-like phage cluster was considered present in a sample if any vOTUs belonging to this cluster were detected in that sample. The alpha cluster of crAss-like phages was detected in more than 80% of samples from six populations, whereas the beta, delta, and zeta clusters were identified in 80% of tested samples in only two, three, and two populations, respectively (Figure 5C). This discrepancy might be related to the fact that the alpha cluster of crAss-like phages showed the most diversity among crAss-like phages (9).

4 Discussion

CrAss-like phages were one of the most notable new viral members that have been widely identified in the human gut. However, limited information is available regarding their diversity, functional capacity, and genomic structure in pigs. In this study, we obtained 1,251 pig crAss-like phage genomes by assembling 380 genomes from pig metagenome data across a wide range of geographic sources and integrating 871 crAss-like phage genomes identified previously. For comparison, we also collected 1,446 human crAss phage genomes. Based on these datasets, we systematically analyzed the diversity, genomic structure, functional capacities, host bacteria, and biogeographic distribution of pig crAss-like phages. The results not only provided an important resource of crAss-like phage genomes, improved our understanding on this type of viruses, but also facilitated further studies about the interaction between crAss-like phages and bacteria, and its effect on pig health and production traits.

The phylogenetic analysis showed that, compared to human crAss-like phages, pig crAss-like phages were widely distributed in four crAss-like phage clusters (alpha, beta, zeta, and delta) reported previously, but were not found in the gamma and epsilon clusters. This might be due to the differences in the composition of gut microbiota between humans and pigs, or different living conditions for humans and pigs. It was also worth noting that this may result from the fact that our methods used for identifying crAss phages were mainly developed based on the information of known crAss phage clusters from human studies, which might be less sensitive in detecting other related clusters from pig gut microbiome. In the phylogenetic tree (Figure 2A), crAss-like phages from humans and pigs within the same cluster form a stable monophyletic group; furthermore, those within the same family exhibit a consistent gene order. This indicated that they might share a common origin or be capable of cross-species transmission between humans and pigs. Interestingly, the study by Edwards et al. (8) found that crAss-like phage strains from gorillas’ guts became closely related to humans following contact with humans.

Gene prediction using the correct genetic codes is important in the process of functional annotation, and incorrect utilization of genetic codes may lead to low predicted coding density and truncated gene products (64). Previous studies have suggested that the utilization of alternative genetic codes is a counter-defense strategy of the phages (9). Here, we found extensive utilization of alternative genetic codes in pig crAss-like phage genomes. This was consistent with the previous observations in human crAss-like phages and other types of phages (9, 65). The recoding of the stop codons could be achieved by using repressive tRNAs, which should possibly come from bacteria or point mutations in crAss phage genomes (9). We found that none of the pig and human crAss-like phages from the alpha cluster in this study used alternative genetic codes, but repressive tRNAs were identified in two pig crAss-like phage vOTU genomes. More crAss-like phage genomes from the alpha cluster should be analyzed to determine whether it was an inherent feature of the alpha crAss-like phages that they did not use alternative genetic codes.

Previous studies on crAss-like phages in humans and animals have primarily focused on conserved genes within genomes, such as key genes involved in virion assembly, replication, and transcription (9, 18). In contrast, we provide a comprehensive annotation of all functional genes in pig crAss-like phage genomes, including CAZymes and anti-CRISPR proteins. We found that except GH16, all glycoside hydrolases identified in crAss-like phage genomes were annotated as lysozyme. Previous studies have demonstrated that lysozymes encoded by phage genomes can dissolve bacteria by degrading peptidoglycan in the cell wall (56). Fujimoto et al. (66) identified an antibacterial enzyme GH25 (glycoside hydrolase family 25) LysA-like from the prophage of the highly pathogenic Enterococcus faecalis, which showed the potential to lyse the biofilm formed by Enterococcus faecalis in the gut. We also found that different types of CAZymes seem to be preferentially presented in different clusters of crAss-like phages. For example, GH73 was identified in the beta and zeta crAss-like phage genomes, but completely absent in the alpha and delta clusters. The reverse distribution pattern was observed for GH108. This implied the different lysozymes involved in the infection of crAss phages to bacteria.

The roles of crAss-like phages in health and diseases have been extensively studied. But so far, many reports have not been repeated and confirmed (8, 18). Here we found three pig crAss-like phages whose host bacteria were Prevotella copri which was confirmed to be causative bacterial species leading to excessive fat deposition in pigs (62, 67). Gulyaeva et al. (18) also identified Prevotella copri as a likely host of human delta27 crAss-like phages based on CRISPR analysis and abundance correlation. Concordance with the mutually beneficial symbiotic relationship between crAss-like phages and its bacterial hosts (17), all three crAss-like phages and Prevotella copri had higher abundance in high fatness pigs. Shkoporov et al. (17) reported a successful partnership between crAss001 and Bacteroides intestinalis, which ensured mutual persistence in complex environments. It indicated that when facing the constant selection pressures exerted by crAss-like phages, the phenotypes of Bacteroides intestinalis could be diversified through phase variation and point mutations. This improved the overall fitness of Bacteroides intestinalis and ensured its competitiveness in a constantly changing environment like the gastrointestinal tract. In addition, the persistent crAss-like phages could also potentially be involved in horizontal gene transfer that provided protection against competing bacterial strains, and gave superinfection immunity against cognate phages (17). Whether the mechanism of the interaction between crAss-like phages and Prevotella copri was similar to that between crAss-like phages and Bacteroides intestinalis or not would need to be investigated in future studies.

Finally, we found that the prevalence of crAss-like phages in pig guts was low. This suggested the heterogeneity in the composition of gut crAss-like phages. Previous study has indicated that the composition of phages in the gut was related to the composition of gut bacteria (68). We guessed that the heterogeneity in the composition of gut crAss-like phages should be caused by differential compositions of gut microbiota in different pig populations from worldwide farms.

In conclusion, our study provided 1,251 pig crAss-like phage genomes which were clustered into 533 vOTUs. This expanded the diversity of crAss-like phages. We first performed a comprehensive and systematic investigation of crAss-like phages in the pig gut, including uncovering the utilization of alternative genetic codes, annotating functional capacities of crAss-like phage genes, identifying host bacteria, and describing their distribution in broad pig populations. Although the limited sample size was used in this study, and the interactions between crAss-like phages and bacteria were inferred based on the genomic information without confirmation by experiments, this study gave deep insights into crAss-like phages in the pig gut. It also provides key resources and new knowledge for the subsequent investigation of the interactions between the virome and bacteriome in the pig gut, as well as for studies exploring how crAss-like phages affect pig health and production traits.

Data availability statement

The phage genome data have been deposited in the Zenodo repository (https://doi.org/10.5281/zenodo.13901307) without any access restrictions.

Ethics statement

The animal study was approved by Animal Care and Use Committee of Jiangxi Agricultural University. The study was conducted in accordance with the local legislation and institutional requirements.

Author contributions

YW: Conceptualization, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. CW: Writing – review & editing, Investigation. ZC: Investigation, Writing – review & editing. MZ: Writing – review & editing, Visualization. LH: Writing – review & editing, Conceptualization, Funding acquisition, Project administration, Supervision. CC: Conceptualization, Funding acquisition, Project administration, Supervision, Writing – review & editing, Formal analysis, Methodology.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by the National Natural Science Foundation of China (32272831).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fvets.2025.1582122/full#supplementary-material

SUPPLEMENTARY FIGURE S1 | The distribution of viral contig numbers contained in each vOTU. (A) Viral contig numbers contained in each vOTU of pig crAss-like phages. (B) Viral contig numbers contained in each vOTU of human crAss-like phages. (C) Venn diagram indicating the numbers of viral clusters (VCs) commonly identified in both humans and pigs, and specifically identified in humans or pigs. The number of vOTUs contained in common or specific VCs are shown in parentheses.

SUPPLEMENTARY FIGURE S2 | In comparison of viral contig lengths of vOTUs among six crAss-like phage clusters and between humans and pigs, and the utilization of alternative genetic codes in six clusters of human crAss-like phages. (A) The pie chart representing the percentage of different clades of pig crAss-like phage vOTUs. (B) Comparison of viral contig lengths of vOTUs among six crAss-like phage clusters in humans. (C) Comparison of viral contig lengths of vOTUs in six crAss-like phage clades between humans and pigs. For both (B,C), the boxplots show the medians (bold lines), and the upper and lower quartiles. The comparisons were performed by the non-parametric Wilcoxon test. Since the gamma and epsilon clusters were not identified in pig crAss-like phages, these two clusters only show the length distribution of human crAss-like phages. (D) The stacked barplot representing the number of human crAss-like phages using alternative genetic codes in each crAss-like phage.

SUPPLEMENTARY FIGURES S3 | Genomic structure plots for three gene blocks responsible for phage assembly, replication, and transcription with complete crAss-like phage genomes in the alpha cluster from pig (five) and human (five). Grey arrows represent ORFs of unknown function, light blue arrows represent ORFs of other functions, and the arrows with other colors represent marker genes as indicated in the legend at the upper of figures. The colored bands between two genomic structure plots shows the percentage of sequence identity between two genomes. The sequence identity between two genomes is also displayed in the dot plot on the left.

SUPPLEMENTARY FIGURES S4 | Genomic structure plots for three gene blocks responsible for phage assembly, replication, and transcription with complete crAss-like phage genomes in the beta cluster from pig (five) and human (five).

SUPPLEMENTARY FIGURES S5 | Genomic structure plots for three gene blocks responsible for phage assembly, replication, and transcription with complete crAss-like phage genomes in the delta cluster from pig (five) and human (five).

SUPPLEMENTARY FIGURES S6 | Genomic structure plots for three gene blocks responsible for phage assembly, replication, and transcription with complete crAss-like phage genomes in the gramma (Figure S6A), and epsilon (Figure S6B) cluster from human.

SUPPLEMENTARY FIGURES S7 | Genomic structure plots for three gene blocks responsible for phage assembly, replication, and transcription with complete crAss-like phage genomes in the zeta cluster from pig (five) and human (five).

SUPPLEMENTARY FIGURE S8 | The Sankey diagram representing the taxonomic distribution of bacterial hosts of human crAss-like phage vOTUs. The colors of the rectangles represent different taxonomic levels of bacterial hosts. The length of the rectangles indicates the number of pig crAss-like phage vOTUs.

Footnotes

References

1. Cao, Z, Sugimura, N, Burgermeister, E, Ebert, MP, Zuo, T, and Lan, P. The gut Virome: a new microbiome component in health and disease. EBioMedicine. (2022) 81:104113. doi: 10.1016/j.ebiom.2022.104113

PubMed Abstract | Crossref Full Text | Google Scholar

2. Santiago-Rodriguez, TM, and Hollister, EB. Unraveling the viral dark matter through viral metagenomics. Front Immunol. (2022) 13:1005107. doi: 10.3389/fimmu.2022.1005107

PubMed Abstract | Crossref Full Text | Google Scholar

3. Shkoporov, AN, and Hill, C. Bacteriophages of the human gut: the "known unknown" of the microbiome. Cell Host Microbe. (2019) 25:195–209. doi: 10.1016/j.chom.2019.01.017

Crossref Full Text | Google Scholar

4. Chevallereau, A, Pons, BJ, van Houte, S, and Westra, ER. Interactions between bacterial and phage communities in natural environments. Nat Rev Microbiol. (2022) 20:49–62. doi: 10.1038/s41579-021-00602-y

Crossref Full Text | Google Scholar

5. Kieft, K, Breister, AM, Huss, P, Linz, AM, Zanetakos, E, Zhou, Z, et al. Virus-associated organosulfur metabolism in human and environmental systems. Cell Rep. (2021) 36:109471. doi: 10.1016/j.celrep.2021.109471

Crossref Full Text | Google Scholar

6. Koonin, EV, and Yutin, N. The crass-like phage group: how metagenomics reshaped the human Virome. Trends Microbiol. (2020) 28:349–59. doi: 10.1016/j.tim.2020.01.010

Crossref Full Text | Google Scholar

7. Dutilh, BE, Cassman, N, McNair, K, Sanchez, SE, Silva, GG, Boling, L, et al. A highly abundant bacteriophage discovered in the unknown sequences of human Faecal metagenomes. Nat Commun. (2014) 5:4498. doi: 10.1038/ncomms5498

PubMed Abstract | Crossref Full Text | Google Scholar

8. Edwards, RA, Vega, AA, Norman, HM, Ohaeri, M, Levi, K, Dinsdale, EA, et al. Global Phylogeography and ancient evolution of the widespread human gut virus Crassphage. Nat Microbiol. (2019) 4:1727–36. doi: 10.1038/s41564-019-0494-6

Crossref Full Text | Google Scholar

9. Yutin, N, Benler, S, Shmakov, SA, Wolf, YI, Tolstoy, I, Rayko, M, et al. Analysis of metagenome-assembled viral genomes from the human gut reveals diverse putative crass-like phages with unique genomic features. Nat Commun. (2021) 12:1044. doi: 10.1038/s41467-021-21350-w

Crossref Full Text | Google Scholar

10. Guerin, E, Shkoporov, A, Stockdale, SR, Clooney, AG, Ryan, FJ, Sutton, TDS, et al. Biology and taxonomy of crass-like bacteriophages, the Most abundant virus in the human gut. Cell Host Microbe. (2018) 24:653–64 e6. doi: 10.1016/j.chom.2018.10.002

Crossref Full Text | Google Scholar

11. Turner, D, Shkoporov, AN, Lood, C, Millard, AD, Dutilh, BE, Alfenas-Zerbini, P, et al. Abolishment of morphology-based taxa and change to binomial species names: 2022 taxonomy update of the Ictv bacterial viruses subcommittee. Arch Virol. (2023) 168:74. doi: 10.1007/s00705-022-05694-2

Crossref Full Text | Google Scholar

12. Shkoporov, AN, Stockdale, SR, Lavelle, A, Kondova, I, Heuston, C, Upadrasta, A, et al. Viral biogeography of the mammalian gut and parenchymal organs. Nat Microbiol. (2022) 7:1301–11. doi: 10.1038/s41564-022-01178-w

Crossref Full Text | Google Scholar

13. Li, Y, Gordon, E, Shean, RC, Idle, A, Deng, X, Greninger, AL, et al. Crassphage and its bacterial host in cat feces. Sci Rep. (2021) 11:815. doi: 10.1038/s41598-020-80076-9

Crossref Full Text | Google Scholar

14. Smith, L, Goldobina, E, Govi, B, and Shkoporov, AN. Bacteriophages of the order Crassvirales: what do we currently know about this keystone component of the human gut Virome? Biomol Ther. (2023) 13:584. doi: 10.3390/biom13040584

Crossref Full Text | Google Scholar

15. Marbouty, M, Cournac, A, Flot, JF, Marie-Nelly, H, Mozziconacci, J, and Koszul, R. Metagenomic chromosome conformation capture (Meta3c) unveils the diversity of chromosome Organization in Microorganisms. eLife. (2014) 3:e03318. doi: 10.7554/eLife.03318

PubMed Abstract | Crossref Full Text | Google Scholar

16. Ramos-Barbero, MD, Gomez-Gomez, C, Sala-Comorera, L, Rodriguez-Rubio, L, Morales-Cortes, S, Mendoza-Barbera, E, et al. Characterization of crass-like phage isolates highlights Crassvirales genetic heterogeneity and worldwide distribution. Nat Commun. (2023) 14:4295. doi: 10.1038/s41467-023-40098-z

Crossref Full Text | Google Scholar

17. Shkoporov, AN, Khokhlova, EV, Stephens, N, Hueston, C, Seymour, S, Hryckowian, AJ, et al. Long-term persistence of crass-like phage Crass001 is associated with phase variation in Bacteroides Intestinalis. BMC Biol. (2021) 19:163. doi: 10.1186/s12915-021-01084-3

Crossref Full Text | Google Scholar

18. Gulyaeva, A, Garmaeva, S, Ruigrok, R, Wang, D, Riksen, NP, Netea, MG, et al. Discovery, diversity, and functional associations of crass-like phages in human gut metagenomes from four Dutch cohorts. Cell Rep. (2022) 38:110204. doi: 10.1016/j.celrep.2021.110204

Crossref Full Text | Google Scholar

19. Cervantes-Echeverria, M, Gallardo-Becerra, L, Cornejo-Granados, F, and Ochoa-Leyva, A. The two-faced role of Crassphage subfamilies in obesity and metabolic syndrome: between good and evil. Genes (Basel). (2023) 14:139. doi: 10.3390/genes14010139

PubMed Abstract | Crossref Full Text | Google Scholar

20. Xiao, L, Estelle, J, Kiilerich, P, Ramayo-Caldas, Y, Xia, Z, Feng, Q, et al. A reference gene catalogue of the pig gut microbiome. Nat Microbiol. (2016) 1:16161. doi: 10.1038/nmicrobiol.2016.161

Crossref Full Text | Google Scholar

21. Gaio, D, DeMaere, MZ, Anantanawat, K, Eamens, GJ, Liu, M, Zingali, T, et al. A large-scale metagenomic survey dataset of the post-weaning piglet gut lumen. Gigascience. (2021) 10:giab039. doi: 10.1093/gigascience/giab039

Crossref Full Text | Google Scholar

22. Jensen, EEB, Sedor, V, Eshun, E, Njage, P, Otani, S, and Aarestrup, FM. The resistomes of rural and urban pigs and poultry in Ghana. mSystems. (2023) 8:e0062923. doi: 10.1128/msystems.00629-23

Crossref Full Text | Google Scholar

23. Joyce, A, McCarthy, CGP, Murphy, S, and Walsh, F. Antibiotic Resistomes of healthy pig Faecal metagenomes. Microb Genom. (2019) 5:e000272. doi: 10.1099/mgen.0.000272

Crossref Full Text | Google Scholar

24. Holman, DB, Gzyl, KE, Mou, KT, and Allen, HK. Weaning age and its effect on the development of the swine gut microbiome and Resistome. mSystems. (2021) 6:e0068221. doi: 10.1128/mSystems.00682-21

Crossref Full Text | Google Scholar

25. Holman, DB, Gzyl, KE, and Kommadath, A. The gut microbiome and Resistome of conventionally vs. pasture-raised pigs. Microb Genom. (2023) 9:mgen001061. doi: 10.1099/mgen.0.001061

Crossref Full Text | Google Scholar

26. Gaire, TN, Odland, C, Zhang, B, Ray, T, Doster, E, Nerem, J, et al. The impacts of viral infection and subsequent antimicrobials on the microbiome-Resistome of growing pigs. Microbiome. (2022) 10:118. doi: 10.1186/s40168-022-01312-0

Crossref Full Text | Google Scholar

27. Lim, SK, Kim, D, Moon, DC, Cho, Y, and Rho, M. Antibiotic Resistomes discovered in the gut microbiomes of Korean swine and cattle. Gigascience. (2020) 9:giaa043. doi: 10.1093/gigascience/giaa043

Crossref Full Text | Google Scholar

28. Zhou, S, Luo, R, Gong, G, Wang, Y, Gesang, Z, Wang, K, et al. Characterization of metagenome-assembled genomes and carbohydrate-degrading genes in the gut microbiota of Tibetan pig. Front Microbiol. (2020) 11:595066. doi: 10.3389/fmicb.2020.595066

PubMed Abstract | Crossref Full Text | Google Scholar

29. Chen, S, Zhou, Y, Chen, Y, and Gu, J. Fastp: an ultra-fast all-in-one Fastq preprocessor. Bioinformatics. (2018) 34:i884–90. doi: 10.1093/bioinformatics/bty560

Crossref Full Text | Google Scholar

30. Langmead, B, and Salzberg, SL. Fast gapped-read alignment with bowtie 2. Nat Methods. (2012) 9:357–9. doi: 10.1038/nmeth.1923

Crossref Full Text | Google Scholar

31. Li, D, Luo, R, Liu, CM, Leung, CM, Ting, HF, Sadakane, K, et al. Megahit V1.0: a fast and scalable metagenome assembler driven by advanced methodologies and community practices. Methods. (2016) 102:3–11. doi: 10.1016/j.ymeth.2016.02.020

Crossref Full Text | Google Scholar

32. Guo, J, Bolduc, B, Zayed, AA, Varsani, A, Dominguez-Huerta, G, Delmont, TO, et al. Virsorter2: a multi-classifier, expert-guided approach to detect diverse DNA and Rna viruses. Microbiome. (2021) 9:37. doi: 10.1186/s40168-020-00990-y

Crossref Full Text | Google Scholar

33. Ren, J, Song, K, Deng, C, Ahlgren, NA, Fuhrman, JA, Li, Y, et al. Identifying viruses from metagenomic data using deep learning. Quant Biol. (2020) 8:64–77. doi: 10.1007/s40484-019-0187-4

Crossref Full Text | Google Scholar

34. Kieft, K, Zhou, Z, and Anantharaman, K. Vibrant: automated recovery, annotation and curation of microbial viruses, and evaluation of viral community function from genomic sequences. Microbiome. (2020) 8:90. doi: 10.1186/s40168-020-00867-0

Crossref Full Text | Google Scholar

35. Nayfach, S, Camargo, AP, Schulz, F, Eloe-Fadrosh, E, Roux, S, and Kyrpides, NC. Checkv assesses the quality and completeness of metagenome-assembled viral genomes. Nat Biotechnol. (2021) 39:578–85. doi: 10.1038/s41587-020-00774-7

Crossref Full Text | Google Scholar

36. Camacho, C, Coulouris, G, Avagyan, V, Ma, N, Papadopoulos, J, Bealer, K, et al. Blast+: architecture and applications. BMC Bioinformatics. (2009) 10:421. doi: 10.1186/1471-2105-10-421

Crossref Full Text | Google Scholar

37. Eddy, SR. Accelerated profile hmm searches. PLoS Comput Biol. (2011) 7:e1002195. doi: 10.1371/journal.pcbi.1002195

Crossref Full Text | Google Scholar

38. Rice, P, Longden, I, and Bleasby, A. Emboss: the European molecular biology open software suite. Trends Genet. (2000) 16:276–7. doi: 10.1016/s0168-9525(00)02024-2

Crossref Full Text | Google Scholar

39. Roux, S, Adriaenssens, EM, Dutilh, BE, Koonin, EV, Kropinski, AM, Krupovic, M, et al. Minimum information about an uncultivated virus genome (Miuvig). Nat Biotechnol. (2019) 37:29–37. doi: 10.1038/nbt.4306

Crossref Full Text | Google Scholar

40. Bin Jang, H, Bolduc, B, Zablocki, O, Kuhn, JH, Roux, S, Adriaenssens, EM, et al. Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks. Nat Biotechnol. (2019) 37:632–9. doi: 10.1038/s41587-019-0100-8

Crossref Full Text | Google Scholar

41. Edgar, RC. Muscle: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics. (2004) 5:113. doi: 10.1186/1471-2105-5-113

Crossref Full Text | Google Scholar

42. Nguyen, LT, Schmidt, HA, von Haeseler, A, and Minh, BQ. Iq-tree: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. (2015) 32:268–74. doi: 10.1093/molbev/msu300

Crossref Full Text | Google Scholar

43. Letunic, I, and Bork, P. Interactive tree of life (Itol) V4: recent updates and new developments. Nucleic Acids Res. (2019) 47:W256–9. doi: 10.1093/nar/gkz239

Crossref Full Text | Google Scholar

44. Camargo, AP, Roux, S, Schulz, F, Babinski, M, Xu, Y, Hu, B, et al. Identification of Mobile genetic elements with Genomad. Nat Biotechnol. (2024) 42:1303–12. doi: 10.1038/s41587-023-01953-y

Crossref Full Text | Google Scholar

45. Lowe, TM, and Chan, PP. Trnascan-se on-line: integrating search and context for analysis of transfer Rna genes. Nucleic Acids Res. (2016) 44:W54–7. doi: 10.1093/nar/gkw413

Crossref Full Text | Google Scholar

46. Zheng, J, Ge, Q, Yan, Y, Zhang, X, Huang, L, and Yin, Y. Dbcan3: automated carbohydrate-active enzyme and substrate annotation. Nucleic Acids Res. (2023) 51:W115–21. doi: 10.1093/nar/gkad328

Crossref Full Text | Google Scholar

47. Dao, FY, Liu, ML, Su, W, Lv, H, Zhang, ZY, Lin, H, et al. Acrpred: a hybrid optimization with enumerated machine learning algorithm to predict anti-Crispr proteins. Int J Biol Macromol. (2023) 228:706–14. doi: 10.1016/j.ijbiomac.2022.12.250

Crossref Full Text | Google Scholar

48. Wang, J, Dai, W, Li, J, Xie, R, Dunstan, RA, Stubenrauch, C, et al. Pacrispr: a server for predicting and visualizing anti-Crispr proteins. Nucleic Acids Res. (2020) 48:W348–57. doi: 10.1093/nar/gkaa432

Crossref Full Text | Google Scholar

49. Nishimura, Y, Yoshida, T, Kuronishi, M, Uehara, H, Ogata, H, and Goto, S. Viptree: the viral proteomic tree server. Bioinformatics. (2017) 33:2379–80. doi: 10.1093/bioinformatics/btx157

Crossref Full Text | Google Scholar

50. Almeida, A, Nayfach, S, Boland, M, Strozzi, F, Beracochea, M, Shi, ZJ, et al. A unified catalog of 204,938 reference genomes from the human gut microbiome. Nat Biotechnol. (2021) 39:105–14. doi: 10.1038/s41587-020-0603-3

Crossref Full Text | Google Scholar

51. Bland, C, Ramsey, TL, Sabree, F, Lowe, M, Brown, K, Kyrpides, NC, et al. Crispr recognition tool (Crt): a tool for automatic detection of clustered regularly interspaced palindromic repeats. BMC Bioinformatics. (2007) 8:209. doi: 10.1186/1471-2105-8-209

Crossref Full Text | Google Scholar

52. Li, H, and Durbin, R. Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics. (2009) 25:1754–60. doi: 10.1093/bioinformatics/btp324

Crossref Full Text | Google Scholar

53. Li, H, Handsaker, B, Wysoker, A, Fennell, T, Ruan, J, Homer, N, et al. The sequence alignment/map format and Samtools. Bioinformatics. (2009) 25:2078–9. doi: 10.1093/bioinformatics/btp352

Crossref Full Text | Google Scholar

54. Mallick, H, Rahnavard, A, McIver, LJ, Ma, S, Zhang, Y, Nguyen, LH, et al. Multivariable association discovery in population-scale Meta-omics studies. PLoS Comput Biol. (2021) 17:e1009442. doi: 10.1371/journal.pcbi.1009442

Crossref Full Text | Google Scholar

55. Yutin, N, Makarova, KS, Gussow, AB, Krupovic, M, Segall, A, Edwards, RA, et al. Discovery of an expansive bacteriophage family that includes the Most abundant viruses from the human gut. Nat Microbiol. (2018) 3:38–46. doi: 10.1038/s41564-017-0053-y

Crossref Full Text | Google Scholar

56. Stojkovic, EA, and Rothman-Denes, LB. Coliphage N4 N-Acetylmuramidase defines a new family of Murein hydrolases. J Mol Biol. (2007) 366:406–19. doi: 10.1016/j.jmb.2006.11.028

Crossref Full Text | Google Scholar

57. Bondy-Denomy, J, Pawluk, A, Maxwell, KL, and Davidson, AR. Bacteriophage genes that inactivate the Crispr/Cas bacterial immune system. Nature. (2013) 493:429–32. doi: 10.1038/nature11723

Crossref Full Text | Google Scholar

58. Chen, C, Zhou, Y, Fu, H, Xiong, X, Fang, S, Jiang, H, et al. Expanded catalog of microbial genes and metagenome-assembled genomes from the pig gut microbiome. Nat Commun. (2021) 12:1106. doi: 10.1038/s41467-021-21295-0

Crossref Full Text | Google Scholar

59. Xiao, Y, Zou, H, Li, J, Song, T, Lv, W, Wang, W, et al. Impact of quorum sensing signaling molecules in gram-negative Bacteria on host cells: current understanding and future perspectives. Gut Microbes. (2022) 14:2039048. doi: 10.1080/19490976.2022.2039048

PubMed Abstract | Crossref Full Text | Google Scholar

60. Mangalea, MR, Paez-Espino, D, Kieft, K, Chatterjee, A, Chriswell, ME, Seifert, JA, et al. Individuals at risk for rheumatoid Arthritis Harbor differential intestinal bacteriophage communities with distinct metabolic potential. Cell Host Microbe. (2021) 29:726–39.e5. doi: 10.1016/j.chom.2021.03.020

Crossref Full Text | Google Scholar

61. Gomes, AC, Hoffmann, C, and Mota, JF. The human gut microbiota: metabolism and perspective in obesity. Gut Microbes. (2018) 9:308–25. doi: 10.1080/19490976.2018.1465157

Crossref Full Text | Google Scholar

62. Chen, C, Fang, S, Wei, H, He, M, Fu, H, Xiong, X, et al. Prevotella Copri increases fat accumulation in pigs fed with formula diets. Microbiome. (2021) 9:175. doi: 10.1186/s40168-021-01110-0

Crossref Full Text | Google Scholar

63. Camarillo-Guerrero, LF, Almeida, A, Rangel-Pineros, G, Finn, RD, and Lawley, TD. Massive expansion of human gut bacteriophage diversity. Cell. (2021) 184:1098–109 e9. doi: 10.1016/j.cell.2021.01.029

Crossref Full Text | Google Scholar

64. Peters, SL, Borges, AL, Giannone, RJ, Morowitz, MJ, Banfield, JF, and Hettich, RL. Experimental validation that human microbiome phages use alternative genetic coding. Nat Commun. (2022) 13:5710. doi: 10.1038/s41467-022-32979-6

Crossref Full Text | Google Scholar

65. Devoto, AE, Santini, JM, Olm, MR, Anantharaman, K, Munk, P, Tung, J, et al. Megaphages infect Prevotella and variants are widespread in gut microbiomes. Nat Microbiol. (2019) 4:693–700. doi: 10.1038/s41564-018-0338-9

Crossref Full Text | Google Scholar

66. Fujimoto, K, Hayashi, T, Yamamoto, M, Sato, N, Shimohigoshi, M, Miyaoka, D, et al. An Enterococcal phage-derived enzyme suppresses graft-versus-host disease. Nature. (2024) 632:174–81. doi: 10.1038/s41586-024-07667-8

PubMed Abstract | Crossref Full Text | Google Scholar

67. Ma, L, Tao, S, Song, T, Lyu, W, Li, Y, Wang, W, et al. Clostridium Butyricum and carbohydrate active enzymes contribute to the reduced fat deposition in pigs. iMeta. (2024) 3:e160. doi: 10.1002/imt2.160

PubMed Abstract | Crossref Full Text | Google Scholar

68. Shkoporov, AN, Clooney, AG, Sutton, TD, Ryan, FJ, Daly, KM, Nolan, JA, et al. The human gut Virome is highly diverse, stable, and individual specific. Cell Host Microbe. (2019) 26:527–541.e5. doi: 10.1016/j.chom.2019.09.009

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: crAss-like phage, pig, gut, genomic features, host bacteria, Prevotella copri

Citation: Wang Y, Wei C, Chen Z, Zhou M, Huang L and Chen C (2025) Characterization of the diversity, genomic features, host bacteria, and distribution of crAss-like phages in the pig gut microbiome. Front. Vet. Sci. 12:1582122. doi: 10.3389/fvets.2025.1582122

Received: 24 February 2025; Accepted: 02 April 2025;
Published: 22 April 2025.

Edited by:

Jianmin Chai, Foshan University, China

Reviewed by:

Yingping Xiao, Zhejiang Academy of Agricultural Sciences, China
Liuyang Zhao, Chongqing Medical University, China

Copyright © 2025 Wang, Wei, Chen, Zhou, Huang and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Lusheng Huang, bHVzaGVuZ2h1YW5nQGhvdG1haWwuY29t; Congying Chen, Y2hlbmNvbmd5aW5nQGp4YXUuZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.