Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Microbiol., 28 January 2026

Sec. Phage Biology

Volume 17 - 2026 | https://doi.org/10.3389/fmicb.2026.1720092

This article is part of the Research TopicAdvances in Phage Applications: Deciphering Phage Biological and Ecological Mechanisms through MetagenomicsView all 9 articles

Genomic landscape of complete Acinetobacter phages: clustering, core-shell genes, and synteny insights

  • 1Interdisciplinary Graduate Programme, Institute for Digital Molecular Analytics and Science, Nanyang Technological University, Singapore, Singapore
  • 2Institute for Digital Molecular Analytics and Science, Nanyang Technological University, Singapore, Singapore
  • 3Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore

Introduction: Acinetobacter baumannii is a major clinical threat due to its multidrug resistance and frequent involvement in nosocomial outbreaks. Bacteriophage therapy offers a targeted alternative, but its success depends on a deep understanding of phage genomics and proteomics. This study aims to cluster the current database of Acinetobacter phages, identify phage clusters with potential for therapeutic applications and highlight proteins that may be valuable for phage engineering.

Methods: A total of 250 publicly available complete Acinetobacter phage genomes were downloaded from NCBI database. The genomes were grouped into clusters using PhamClust. A phylogenetic tree using the terminase large subunit and the portal protein was charted. Gene synteny analysis was conducted using Clinker, while the protein family output from PhaMMseqs was utilized to investigate both broadly conserved and cluster-specific phams.

Results: Acinetobacter phages were classified into 12 clusters, including the newly identified cluster 10. While the terminase large subunit and portal protein proved useful for cluster-level grouping, they were insufficient for resolving finer phylogenetic relationships. Three conserved enzymes, endolysins, a DNA helicase, and an HNH homing endonuclease, were found across multiple clusters, alongside a diverse array of cluster-specific phams.

Discussion: Cluster 10 was found to contain three phages with exceptionally broad host ranges, highlighting its strong potential for therapeutic development. Additionally, the three conserved enzymes shared across multiple clusters, especially the endolysins, may serve as valuable tools for phage engineering due to their broad conservation. The cluster-specific phams offer biological insights and may also form the basis for developing cluster-specific primers or molecular diagnostics. Collectively, these findings deepen our understanding of Acinetobacter phage diversity and point toward new avenues for advancing phage-based therapies and diagnostics.

1 Introduction

1.1 Global context of Acinetobacter baumannii and phage therapy

Acinetobacter baumannii is a gram-negative bacteria that has emerged as a formidable nosocomial threat worldwide, frequently causing diverse infections such as pneumonia, septicemia, meningitis, urinary tract and wound infections among critically ill patients in hospital settings (Castillo-Ramírez, 2023; Moubareck and Halat, 2020). This pathogen presents a significant global burden, with up to 1.4 million cases reported annually (Cain and Hamidian, 2023). The economic burden of A. baumannii infections is substantial, with per-infection costs ranging from tens to over a hundred thousand U. S. dollars, leading to annual national expenditures in the hundreds of millions to billions in the United States of America alone (Nelson et al., 2016).

A. baumannii is classified among the ESKAPE pathogens namely, Enterococcus faecium, Staphylococcus aureus, Klebsiella pneumoniae, Pseudomonas aeruginosa, and Enterobacter species which are well-known for acquiring antibiotics resistance rendering them powerless (Rice, 2008). Given that carbapenems are among humanity’s last-line antibiotics (Bradley et al., 1999), the urgency to combat the carbapenem resistant A. baumannii strains is heightened. In 2024, carbapenem-resistant A. baumannii was formally assigned a critical priority level by the WHO for the development of new control, treatment, and prevention measures (WHO Bacterial Priority Pathogens List, 2024). The extensive drug resistance of A. baumannii stems from multiple resistance determinants, including enzymatic inactivation, target modification, and efflux pumps, whose rapid spread is driven by mobile genetic elements (MGEs) such as plasmids and resistance islands (Scoffone et al., 2025).

Phage therapy offers a compelling strategy against extensively drug-resistant Acinetobacter baumannii (XDRAB) because bacteriophages operate independently of antibiotic resistance mechanisms (Qu et al., 2024). A primary advantage is Phage-Antibiotic Synergy (PAS), demonstrated when the lytic Phage Indie combined with ceftazidime achieved superior bactericidal effects in vitro. This synergy was effective in resensitizing resistant A. baumannii strains to the antibiotic, overcoming initial phage resistance, and allowing for treatment at lower doses. Furthermore, phages selected for therapy, such as phage Indie, can be safety-characterized to ensure the absence of lysogenic, virulence, or antibiotic resistance genes (Orozco-Ochoa et al., 2025). In addition, combined inhaled phage therapy has been associated with patient stabilization and showed no severe adverse reactions. Limitations of phage therapy includes the critical issue of phage resistance, observed in clinical cases where the phage lost significant lytic activity during extended treatment (Qu et al., 2024). Additionally, the effectiveness of combination therapy is also highly specific. While Phage Indie showed synergy with ceftazidime, it had an antagonistic effect with piperacillin-tazobactam (Orozco-Ochoa et al., 2025). Furthermore, inhaled phages pose biodistribution and safety concerns: phage DNA was detected in the bloodstream and appeared to accumulate in the intestines, resulting in significant alterations in gut microbiota composition. Additional clinical trials are necessary to fully assess the safety and implications of phage dissemination and its impact on intestinal microecology.

Despite the limitations, phage therapy and research is currently undergoing “a well-deserved rebirth” amid escalating resistance, re-emerged as a precision treatment, leveraging viruses that can destroy bacteria no longer susceptible to conventional antibiotics (Altamirano et al., 2021; Maasilta, 2025; Lin et al., 2017).

1.2 Prior work on phage clustering and core genomes

To unlock the full potential of phage therapy, reducing risks and reaping the greatest benefits, understanding phage genomics is indispensable. This could expand our toolkit for phage engineering and understanding which phage genes may be undesirable for therapy could help improve safety profiles. Our environment holds an immense reservoir of phages, researchers are constantly discovering and sequencing novel ones (Ranveer et al., 2024).

Earlier work on A. baumannii phage genomics demonstrated that the 37 complete Acinetobacter phage genomes segregate into six discrete clusters and two genomic singletons based on whole-genome dot-plots and average nucleotide identity (Turner et al., 2018). Proteomic clustering using OrthoMCL grouped 4,067 encoded proteins into 737 orthologous groups and 974 orphan proteins, with 56.6% of proteins annotated as hypothetical (Turner et al., 2018). However, only 42 Acinetobacter phage genomes were deposited in NCBI GenBank at the time, of which 37 were analyzed, meaning true diversity was likely under-sampled and cluster boundaries provisional. Isolates overwhelmingly belonged to Caudovirales, with only one ssRNA Levivirus and no filamentous Inoviridae, suggesting an incomplete view of Acinetobacter phage families. Furthermore, over half of the 4,067 encoded proteins lacked functional annotation, reflecting “genetic dark matter” in phage genomes that can only be resolved with more isolates and experimental validation.

Building on Turner et al., Oliveira and co-workers expanded the dataset to 139 publicly available Acinetobacter phage genomes as of 21 January 2021, re-annotating all sequences with Prokka against a custom Caudovirales database (Oliveira et al., 2022). They combined nucleotide and protein clustering with network and pan-genome analyses to delineate Acinetobacter phage taxonomy and evolutionary relationships.

Their study expanded from 37 to 139 genomes, enhancing cluster resolution and discovery of 5 proposed rare subfamilies (Oliveira et al., 2022). The use of integrated vConTACT2 and INPHARED pipelines allowed detection of 8 subfamily-level clusters and 46 genera, including eight singletons, which improves taxonomic granularity beyond pairwise ANI and OrthoMCL alone.

As the number of sequenced phages expands, a more sensitive, scalable clustering framework is needed to reveal truly novel lineages and test the stability of existing clusters against deeper sampling. Moreover, past Acinetobacter phage studies did not systematically mine these datasets for universally conserved proteins, prime candidates for phage engineering or for cluster-specific phamilies that could illuminate the distinct infection and replication strategies of each lineage. Addressing these gaps can be essential for translating comparative genomics into actionable tools for therapeutic development against Acinetobacter infections.

1.3 Scope and aims

In this study, we leveraged an expanded dataset of complete Acinetobacter phage genomes nearly double that of previous comparative analyses to generate a comprehensive, protein-family-based framework for phage classification and functional insight. We aim to systematically characterize and cluster Acinetobacter phages and uncover any previously unrecognized lineages; to quantify the evolutionary relationships among clusters; to pinpoint conserved “core” proteins and machineries within each group; and to identify phamilies that bridge multiple clusters, illuminating shared functional modules and potential targets for broad-spectrum phage engineering.

2 Materials and methods

All available protein sequences for complete Acinetobacter phages were retrieved from the NCBI Virus portal on 11 February, 2025. First, the NCBI Virus database was queried for “complete” nucleotide entries, and the host filter was set to Taxonomy ID 469 (Acinetobacter spp.). Protein sequences were downloaded in FASTA format, yielding a single multi-FASTA file containing 30,824 protein records derived from 250 distinct phage genomes.

To enable genome-by-genome analyses, this annotated multi-FASTA file was parsed and processed with a custom Python script (Supplementary material): it parsed each record’s header to identify its source genome accession, grouped sequences accordingly, and wrote one FASTA file per phage genome. These per-genome FASTA files served as the input for subsequent clustering and comparative analyses via PhaMMseqs version 1.0.4 (Gauthier et al., 2022) and PhamClust version 1.3.3 (Gauthier and Hatfull, 2023).

2.1 Clustering of phage genomes using PhamClust

We leveraged PhamClust, a scalable, gene-family-based clustering tool to delineate relationships among 250 Acinetobacter phage genomes. PhamClust integrates PhaMMseqs for phamilies (“phams”) assembly and computes a Proteomic Equivalence Quotient (PEQ) between every genome pair by averaging pairwise amino-acid identity across shared phams for genome-wide similarity (Gauthier and Hatfull, 2023).

We first generated phamilies of related proteins using PhaMMseqs version 1.0.4 (Gauthier et al., 2022; Gauthier and Hatfull, 2024). Multi-FASTA files each containing all proteins from a single phage genome were processed through MMseqs2’s sequence–sequence clustering through default settings (30% minimum identity, 80% coverage) to group proteins into preliminary clusters (phams). Profile-sequence alignments were performed to refine pham boundaries, ensuring that divergent homologs were correctly assigned. Resulting phams were exported as separate FASTA alignments and a genome pham presence/absence TSV, which served as input for PhamClust version 1.3.3. Phamilies were manually annotated by examining the top five gene annotations within each pham submitted by the depositors and assigning a consensus function based on the majority annotation. In instances where the majority was labeled as hypothetical proteins, the non-hypothetical annotations, if present, were selected to provide a more comprehensive functional representation. In addition, PhaMMseqs generated a pangenome summary table classifying phams into core, soft-core, shell, and cloud gene categories.

Next, using the pham presence/absence TSV and PEQ matrix, PhamClust executed a two-step clustering strategy using default parameters (Gauthier and Hatfull, 2023). First, genomes were grouped at a PEQ cutoff of 25% to establish broad clusters; second, each cluster was recursively subdivided at a higher PEQ threshold (50%) to resolve subclusters.

2.2 Phylogenetic analysis

To validate our proteome-based clusters, we reconstructed phylogenetic trees using two hallmark genes, the terminase large subunit and the portal protein. Both genes are universally present as single-copy, essential components of Caudovirales and exhibit sufficient sequence conservation to resolve deep and shallow evolutionary relationships (de Jonge et al., 2024).

Amino-acid sequences for each marker were extracted using PhaBOX version 2.0 and concatenated before alignment using Clustal Omega within the Jalview version 2.11.4.1 workbench (Sievers and Higgins, 2014; Waterhouse et al., 2009). Alignment using Clustal Omega was executed with default settings (35% identity threshold; 80% coverage) and subsequently used to compute a Newick-formatted neighbor joining phylogenetic tree for each marker. The phylogeny tree was visualized and annotated in Interactive Tree Of Life (iTOL) version 6.9.1 web server (Letunic and Bork, 2021, 2024).

2.3 PhaBOX

To contextualize our proteomic clusters with functional and taxonomic information, we applied the end-to-end workflow of the PhaBOX v2 web server (Shang et al., 2023) to all 250 Acinetobacter phage genomes.

PhaBOX utilized PhaTYP for lifestyle prediction and PhaGCN for taxonomic classification. PhaTYP employs a machine learning model and is able to achieve high accuracy for predicting lifestyles (Shang et al., 2022). PhaGCN integrates a graph convolutional network combining sequence-based features and gene-sharing networks to predict taxonomy per ICTV standards (Shang, 2021).

2.4 Gene synteny analyses using clinker

To explore genomic organization and infer functional modules related to phage infection and lysis, we employed Clinker (Gilchrist and Chooi, 2021) to generate comparative gene-synteny maps both between and within our PhamClust-defined clusters.

For between-cluster synteny comparisons, we randomly selected one representative genome from each PhamClust derived cluster and ran Clinker with its default settings to identify conserved gene blocks across distantly related phages. For within-cluster analysis of Cluster 10 the same default setting was run for all 3 respective phages.

All bioinformatics tools were run using default parameters unless otherwise stated.

3 Results

3.1 Acinetobacter phages are highly diverse with 12 proteomic clusters

A total of 250 Acinetobacter phages were clustered based on shared protein families into 12 major clusters ordered from largest to smallest, followed by 14 singletons (Figure 1).

Figure 1
Pairwise similarity heatmap comparing 250 Acinetobacter phage genomes/proteomes. Each cell represents the similarity between two phages (green = higher similarity, red = lower similarity).

Figure 1. Proteomic Equivalence Quotient (PEQ) heatmap of 250 Acinetobacter phage genomes clustered by PhamClust. Pairwise PEQ values (0–100%) are displayed as a color gradient from red (no similarity) to yellow (slight similarity) to green (high similarity). Phage genomes are ordered along both axes by their PhamClust cluster assignment, with the 12 major clusters shown from largest (upper-left block) to smallest (lower-right blocks), followed by 14 singleton genomes. The ICTV-predicted genus and genome length are annotated along the right side of the heatmap.

Overall, the clear “block-diagonal” pattern observed in Figure 1 confirms that PhamClust robustly partitions the 250 Acinetobacter phages into biologically meaningful groups, while the extensive red regions attest to the extraordinary proteomic diversity across the phage population.

Next, to assess how tightly phages group within each cluster and how distinct each cluster is from its neighbors, we examined the pairwise PEQ values to quantify both intra-cluster cohesion and inter-cluster relatedness. The PhamClust heatmap (Figure 1) and cluster summary (Supplementary Table S1) reveal 12 discrete proteomic clusters, ranging from 92 phages in Cluster 1 down to three phages in Clusters 10–12, plus 14 singletons. Within-cluster PEQ values exceed 60–70%, indicating strong core pham conservation, while inter-cluster values typically fall below 10%. The presence of partial shell-gene sharing and hints at evolutionary proximity or horizontal gene flow. Clusters 4, 9 and 2 singletons share a small set of proteins, and Clusters 5, 6, and 8 exhibit overlaps with PEQ values exceeding 30% revealing inter cluster homology. Cluster 4 exhibits the least intra-cluster cohesion containing the most proteomically diverse phages.

Then, we characterized each cluster by several key parameters, the average genome length, the predominant lifecycle strategy (lytic versus temperate), and any existing ICTV-recognized genera assignments. The summaries of each cluster’s membership, mean genome length, relative standard error (RSE), PhaBOX-derived lifestyle breakdown, and ICTV genus annotations were noted (Supplementary Table S1). PhaBOX’s outputs can be found in Supplementary Table S2.

The phage genome lengths cover a wide range from 34,601 bp to 166,600 bp. All 12 clusters exhibit exceptionally low relative standard errors (RSE < 4%), indicating highly consistent average genome lengths within each group. Clusters 4 and 9, which contain mixtures of virulent and temperate phages, also showed the highest RSE values of 3.05 and 2.10%, respectively. Additionally, based on PEQ values (Figure 1), Cluster 4 has the greatest proteomic variability within the cluster with hints of subclusters. We also observed that two distinct groups of established ICTV genera clustered together. The first group comprised of the genera Daemvirus, Friunavirus, and Pettyvirus, while the second included Hadassahvirus, Lazarusvirus, and Zedzedvirus. Clusters 6 and 8 to 12 lack current ICTV genus assignments and represent novel genera.

Our work substantially extends the comparative framework established by Oliveira et al. (2022) through a nearly twofold increase in the number of phages in most clusters. In addition to the previously identified clusters, we resolved an entirely new cluster, known as Cluster 10 in this study, as well as 14 singleton genomes.

3.2 Concatenated terminase large-subunit and portal-protein sequences delineate phylogeny clustering

To evaluate whether rapid, marker-gene phylogenies can stand in for comprehensive proteome clustering, and to gauge inter-cluster relatedness, we compared phylogenetic trees built from the terminase large-subunit and portal proteins against our PhamClust results.

The concatenated terminase large-subunit and portal-protein phylogeny (Figure 2) exhibits near-perfect concordance with our PhamClust proteome clusters validating the specificity of the clusters. Additionally, tree topology reveals clusters that are phylogenetically close. Clusters 7 and 11 and Clusters 5, 6, and 8 emerge from a common branch in the phylogenetic tree, indicating they could potentially share a substantial subset of phamilies.

Figure 2
Circular phylogenetic tree of 250 Acinetobacter phages with tip labels showing individual phage names. An outer colored ring segments the tree into 12 clusters (legend at left), allowing rapid visual identification of the 12 clusters. Many tips are additionally annotated with genus-level assignments when applicable.

Figure 2. Neighbor joining phylogenetic tree of 250 Acinetobacter phage genomes based on concatenated terminase large-subunit and portal-protein sequences. Each tip is color-coded by PhamClust cluster, cluster labels are annotated as shown in the legend box on the left.

3.3 Gene synteny analysis between clusters

Next, we sought to validate the phylogenetic relationships inferred from marker-gene trees and investigate the conserved proteins within the related clusters. We performed a between-cluster synteny comparison using Clinker to potentially reveal shared mechanistic features shared between phylogenetically close lineages (Figure 3). For each of the 12 PhamClust clusters, a single representative genome was chosen at random. These 12 genomes were then analyzed collectively to detect homologous proteins and visualize conserved gene neighborhoods.

Figure 3
Genome map with horizontal aligned clusters composed of arrows in various orientations. Shared genes are color-coded. A scale bar shows identity percentage of the respective shared genes with a darker shade representing.

Figure 3. Between-cluster gene synteny of representative Acinetobacter phage genomes from 12 clusters. One randomly selected genome per PhamClust cluster was compared using Clinker to visualize shared homologous proteins across clusters. Genomes were ordered vertically by their overall syntenic similarity. Arrows represent predicted open reading frames, ribbon connectors link homologous proteins, with shading intensity proportional to amino-acid identity scaling from light gray (30% identity) to black (100% identity).

Aside from the closely related cluster pairs, most clusters share almost no amino-acid identity with one another. In the between-cluster map, most genes are connected only by light-gray ribbons indicating the 30% identity threshold or not linked at all, reflecting minimal detectable homology.

Clusters 11 and 12 uniquely share only two carbohydrate-active enzymes, an acetylxylan esterase and a Concanavalin A–like lectin/glucanase while the remainder of their proteomes are distinct. Similarly, Clusters 7 and 11 only share 4 proteins, large terminase subunit, portal protein and a major capsid protein as well as a hypothetical protein contrary to what the phylogenetic tree in Figure 2 suggests. Although no insertion sequence elements or transposase genes were identified, the presence of synteny between these shared genes indicates potential recombination between the respective clusters that could be driven by co-infections.

Clusters 5, 6, and 8 share extensive syntenic blocks encompassing both functional and structural modules and emerge from a common branch from the phylogenetic tree in Figure 2. This indicates that these three clusters could have evolved from a single common ancestor and undergone divergent evolution across the rest of their genomes. These clusters represent some of the three larger average genome sizes in our dataset, 102,985 bp (Cluster 5), 97,586 bp (Cluster 6), and 93,754 bp (Cluster 8), all with strictly lytic life cycles (Supplementary Table S2). Cluster 5 corresponds to the established genus Saclayvirus, whereas Clusters 6 and 8 lack current genus assignments. Their proximity to Saclayvirus suggests that they may share similar host-interaction strategies and genomic mechanisms, providing a starting point for future functional and taxonomic investigations.

Clusters 5 and 6 retains a set of core structural and replication genes: both encode a tail-fiber protein, a head decoration protein, and DNA polymerase in conserved gene order contexts. This shared repertoire suggests that these two lineages not only assemble virions with similar host-recognition and capsid-stabilization modules but also encode a homologous polymerase machinery for genome replication. Clusters 5 and 8 conserve a distinct set of proteins: a capsid maturation protease, tail fiber protein, endolysin, ribonuclease H like domain protein, ribonucleotide reduction large subunit, and a putative shock protein A. The presence of both replication (ribonucleotide reductase) and stress response (shock protein) enzymes suggests these phages may engage host machinery under distinct environmental conditions. All three clusters (5, 6, and 8) retain the core virion-assembly toolkit, the terminase large subunit, portal protein, tail-assembly chaperone, and HNH homing endonuclease, underscoring a common ancestral replication-and-packaging cassette.

Next, Clusters 4 and 9 also share a substantial proportion of their proteomes, potentially also evolving from a recent common ancestor, including methane monooxygenase, endolysin, an ATP-binding protein, CII protein, ribosomal protein, a carbohydrate-binding domain–containing protein, a transcriptional regulator, and several hypothetical proteins. Cluster 4 belongs to the known genus Vieuvirus. The shared protein repertoire suggests Cluster 9 may possess similar lysogenic control mechanisms, host receptor-binding strategies, and DNA replication processes.

3.4 Genome architecture and core proteome of novel Cluster 10

Given the identification of a novel cluster, designated Cluster 10, we sought to further examine its genomic architecture and gene synteny.

Cluster 10, consisting of three broadly lytic siphoviruses as described by their respective publications, vB_AbaS_TCUP2199 (ON323491), EAb13 (OQ717042), and Mystique (PQ438181), exhibits well conserved shared modules across their genomic architecture (Figure 4). We found 81 core proteins shared across all three genomes that encoded 98, 126, and 154 predicted open reading frames (ORFs) respectively (Alseth et al., 2025; Mardiana et al., 2022; Margulieux et al., 2023). The majority of these conserved ORFs are annotated as hypothetical, reflecting the high proportion of novel genes in this cluster. This “genetic dark matter” is consistent with Cluster 10’s distant phylogenetic relationships with the other clusters (Figures 2, 3). Therefore, we expected many proteins found in these phages to be novel without much shared amino acid identity as compared to those encoded by those in more studied clusters.

Figure 4
Illustration of a genomic comparison among three bacteriophages: EAb13, vB_AbaS_TCUP2199, and Mystique. It features color-coded genes for functions like DNA helicase, exonuclease, and tail fiber protein. The arrangement shows gene identity percentages in shades of gray, with arrows indicating gene orientation. A scale bar of 2.5 kilobases is included at the bottom.

Figure 4. Between-cluster gene synteny map for Cluster 10 representatives. Three genomes, Acinetobacter phage vB_AbaS_TCUP2199 (ON323491:1–79572), EAb13 (OQ717042:1–82411), and Mystique (PQ438181:1–77401), were visualized with Clinker. Ribbons connect homologous proteins between genomes, and ribbon shading reflects amino-acid identity from light gray (30% identity) to black (100% identity).

Among the core proteins with functional annotations, we observed several enzymes central to nucleotide metabolism, such as a ribonucleotide reductase (large and small subunits), a primase, a DNA helicase, two exonucleases, and a thymidylate synthase enzyme. Structural components were also included, such as portal protein, large terminase subunit, major head protein, tail length tape-measure protein, tail assembly protein, two distinct tail-fiber proteins and a central tail-fiber J protein. In addition, an endolysin was identified as a core protein.

3.5 Proteomic conservation across clusters

Given the vast diversity of protein families observed across our Acinetobacter phage dataset, we next asked whether any truly universal “core” proteins exist. From our dataset of 30,824 protein sequences drawn from 250 complete Acinetobacter phage genomes, PhaMMseqs assembled a total of 4,133 gene phamilies (“phams”) spanning structural, functional, and uncharacterized proteins. The annotation of these phams can be found in Supplementary Table S3. Among these, 379 phams (9.2%) correspond to canonical virion-assembly components that were identified by keyword matches to capsid, tail, portal, baseplate, spike, sheath, tube, tape measure, connector, scaffold, decoration, vertex, hub, wedge, pin, or neck. A total of 842 phams (20.4%) carry putative enzymatic or regulatory functions such as DNA replication, transcriptional regulation and lysis. The nearly two-fold excess of functional over structural phams reflects the broad diversity of accessory proteins phages employ for host takeover and environmental adaptation. Of the total gene families identified, 2,912 phams (70.5%) were annotated as hypothetical proteins, highlighting the extent of unexplored “genetic dark matter” within phage proteome.

Remarkably, not a single gene family qualified as “core” (found in ≥99% of genomes) or “soft-core” (95–99%), indicating there is no universally conserved phage protein across all the Acinetobacter phages. Instead, only 96 phams (2.32%) fell into the “shell” category, defined as being present in 15–95% of genomes, representing conserved modules shared by multiple phages. The overwhelming majority (4,037; 97.7%) of phams resided in the “cloud,” occurring in fewer than 15% of genomes and were confined to single clusters or individual isolates.

Most phams are confined to a single cluster and rarely shared between 2 to 3 clusters. Identifying phams that could be found in four or more clusters could be prime candidates for illuminating key enzymatic and structural modules (Table 1). Such broadly conserved phams likely perform essential biological functions in Acinetobacter phages, having been repeatedly selected by evolution for efficiency or necessity. They could also uncover promising targets for phage engineering and effective antimicrobial agents.

Table 1
www.frontiersin.org

Table 1. Phamilies conserved across four or more PhamClust clusters of Acinetobacter phages.

Interestingly, three distinct endolysin phams are broadly distributed: pham_1 endolysin and pham_48 endolysin appear in five clusters, cluster 1, 2, 5, 7, 8 and clusters 2, 3, 4, 5, 6, respectively, and pham_60 endolysin in four clusters, cluster 2, 4, 7, 9. Pham_44, a DNA helicase, and Pham_106, an HNH homing endonuclease, each recur in four phylogenetically distinct clusters—Pham_44 in Clusters 2, 4, 9, and 12, and Pham_106 in Clusters 2, 5, 6, and 8. Three phamilies remain uncharacterized, two occurring in four clusters (pham_344, pham_651) and one in five clusters (pham_39).

In summary, the conservation of these identified endolysins, DNA helicase, HNH homing endonuclease, and three hypothetical proteins across multiple, phylogenetically distinct Acinetobacter phage clusters could serve as potential entry points for phage modification and phage-derived antimicrobials.

3.6 Cluster specific gene phamilies

To uncover the unique molecular signatures that distinguish each PhamClust group, we identified cluster-specific phamilies that were highly prevalent (at least 90%) in the genomes within their respective clusters. By focusing on these cluster-specific phams, we aim to elucidate cluster-defining traits, such as specialized adsorption apparatus, niche-adapted enzymes, or regulatory factors. We examined only the first five identified clusters given that the rest of the clusters have a relatively smaller sample size of less than ten members, hence analyzing for cluster specific phamilies within them would not give us an accurate view of the cluster (Supplementary Table S4).

Cluster 1 comprise phages from Daemvirus, Friunavirus, Pettyvirus. They retain a full complement of DNA processing and structural phams. The universal presence of tail tubular protein A (pham_8) (UniProt, 2025), a hallmark structural component of short, noncontractile tails (Podoviridae | ICTV, 2025), correlates with these phages’ Podoviridae-like morphology. Likewise, the co-occurrence of multiple DNA-processing enzymes-5′-3′ exonucleases (pham_4, pham_7), endonuclease VII (pham_13), DNA helicase (pham_14), and a dedicated RNA polymerase (pham_9) reflects a conserved middle-stage replication program potentially unique to Cluster 1, underscoring its self-sufficient DNA replication machinery.

Cluster 2 phages form a distinct Obolenskvirus genus defined by a unique complement of contractile-tail assembly proteins, including baseplate hub (pham_72), tail sheath (pham_76), tail tube initiator (pham_78), tail tube protein (pham_91), tail completion factor (pham_79) and a unique tail-fiber protein (pham_94). These specialized tail assembly proteins could be adapted for host specificity, targeting a specific subset of A. baumannii hosts (Taslem Mourosi et al., 2022). Complementing this structural arsenal, Cluster 2 carries its own transcriptional machinery (pham_74 RNAP and pham_95 regulator) enabling immediate, host independent early gene expression and precise temporal control over the infection cycle. The universal retention of head related components, a major head protein (pham_85) and prohead core protease (pham_87) underscores a self-contained capsid assembly pathway preserved across the cluster.

Cluster 3 phages comprise phages from Hadassahvirus, Lazarusvirus and Zedzedvirus genera with some of the largest genomes in our dataset (approximately 166 kb in size). They encode multiple baseplate and tail fiber components including hub subunits (pham_125, pham_134), wedge modules (pham_130, pham_137), a hinge connector for long tail fibers (pham_127), and a tail sheath (pham_133) coupled to a tail tube (pham_126) indicative of a cluster specific adsorption device for host recognition and DNA injection. Alongside this structural core, they carry a self-contained capsid assembly pathway (pham_138 prohead core protein) and dedicated lytic machinery (pham_136 putative murein transglycosylase) for its specialized cell-wall degradation mechanism (Bieńkowska-Szewczyk and Taylor, 1980). Uniquely, Cluster 3 harbors metabolic and genome-processing modules, such as an aerobic NDP reductase (pham_131) to boost dNTP pools, a DNA helicase loader (pham_128) for replication initiation, and an endonuclease V (pham_135) for DNA repair or recombination.

Strikingly, the only conserved core pham of Cluster 4, Vieuvirus, is the tail-assembly chaperone (pham_361). Unlike other clusters with extensive suites of fixed structural or replication modules, Cluster 4 lacks conserved machinery.

Cluster 5 phages define the Saclayvirus genus and are distinguished by a self-contained nucleotide-biosynthesis toolkit, dual thymidylate synthases (pham_358, pham_494), a MazG-like pyrophosphatase (pham_492), ribonucleotide reductase (pham_497), dihydrofolate reductase (pham_518) and thymidylate kinase (pham_576) together with complementary replication enzymes (DNA helicase pham_480, dNMP kinase pham_491, GTP-binding protein pham_513). The self-contained nucleotide-biosynthesis and replication enzymes toolkit of Cluster 5 likely accelerate genome replication under nucleotide-limited conditions or stressful conditions (Yuan and Gao, 2017a). The unique tail apparatus is equally elaborate with a baseplate hub (pham_478), tape measure protein (pham_490) and dedicated tail assembly chaperone (pham_484), three distinct tail fibers (pham_470, pham_495, pham_530) and a short fiber module (pham_585). Given that Saclayvirus phages can infect A. baumannii strains with multiple distinct capsule types, this unique ensemble of tail proteins likely underlies their broad-host-range capability (Qin, 2025).

4 Discussion

4.1 Phages infecting Acinetobacter are deeply diverse and divergent

Phages infecting Acinetobacter likely originate from distinct evolutionary lineages that are deeply diverse and divergent, and possible polyphyletic in nature with inter-cluster proteomic similarity rarely exceeding 10% (Figure 1). Given the increasing number of clusters identified with greater sampling depth, we hypothesize that we have yet to reach rarefaction of phages and there are many more diverse cluster types to be discovered. Our lab has discovered novel Acinetobacter phages with no genomic similarity to the 12 clusters in this study, further supporting our hypothesis (unpublished data).

The low prevalence of shell pham in our database (conserved modules shared by multiple phages) as well as the lack of any core genes, further reflects the modular and mosaic nature of phage genomes, that could be driven by high rates of horizontal gene transfer, recombination, and functional specialization, which rapidly shuffle gene repertoires across lineages and ecological niches (Hatfull, 2008). Even among the relatively closer groups, such as Clusters 4 and 9 or Clusters 5, 6, and 8, the observed homology is subtle, indicating long-standing separation and minimal exchange of conserved modules.

This absence of universally conserved core genes is not unique to Acinetobacter phages but reflects a broader characteristic of phage genomics. Similar findings have been reported in jumbo phages (Magar et al., 2024) and Streptococcus thermophilus phages (Szymczak et al., 2019), where no single gene was conserved across all genomes. These observations collectively highlight the mosaic architecture and extensive genomic diversity that underpin phage genomics.

4.2 Strong within-cluster protein homology and conserved genomic lengths except cluster 4

Although Acinetobacter phage clusters are highly divergent from one another, they typically exhibit strong within-cluster protein homology and conserved genome length. Notably, Cluster 4 corresponding to Vieuvirus shares less conservation within its cluster. Their elevated variability likely reflects greater genomic plasticity and recombination found within temperate phages, consistent with Lai et al. (2024) showing higher rates of genetic exchange with bacterial chromosomes compared to virulent phages. The relative looseness of this cluster and the absence of conserved, cluster-specific proteins suggest that Vieuvirus, as currently defined by ICTV, may represent a highly variable lineage, or could potentially comprise multiple distinct genera rather than a single cohesive genus.

4.3 Cluster 3 presents as near-jumbo phages

Cluster 3 exhibits genomic and structural features characteristic of jumbo phages, suggesting a “jumbo-like” classification despite its genome size of ~166 kb falling below the conventional 200 kb threshold (Yuan and Gao, 2017a). The combination of specialized baseplate structures, self-contained replication machinery, and numerous accessory components aligns with the mechanisms of jumbo phages. The jumbo-like cluster harbors a distinctive set of accessory phamilies absent from other clusters, highlighting its unique functional repertoire. As jumbo phages are thought to arise through the stepwise acquisition of novel genetic material that expands both genome size and functional capacity (Yuan and Gao, 2017b), Cluster 3 may represent an intermediate lineage on a trajectory toward jumbo phage status over evolutionary time.

4.4 Novel lineage with broad spectrum lytic activity—Cluster 10

Cluster 10 constitutes a novel lineage with exceptionally broad host range likely due to its conserved multiple tail-fiber and tail-associated proteins, the key determinants of receptor binding and host specificity. Acinetobacter phage Mystique lysed 94/103 A. baumannii strains, Acinetobacter phage vB_AbaS_TCUP2199 infected 189/208 clinical isolates, and Acinetobacter phage EAb13 was active against 91/105 multidrug-resistant strains. Both vB_AbaS_TCUP2199 and Mystique phages were shown previously to be strictly lytic, consistent with our PhaBOX lifestyle predictions (Alseth et al., 2025; Mardiana et al., 2022; Margulieux et al., 2023). High-resolution structure is available only for Mystique revealing a siphovirus-like architecture. Cluster 10 phages conserving this repertoire of tail related proteins are potential models for rational engineering of broad-host-range therapeutics against drug-resistant A. baumannii. Next steps could include defining these tail proteins, solving their structures, and testing domain-swap variants to map sequence-function rules for adsorption across diverse clinical strains.

4.5 Tail associated proteins are highly diverse but remain cluster-specific

In all 12 clusters, the most variable gene families across clusters encode tail related proteins and each cluster maintains its own unique repertoire of tail related proteins. Especially in the context of Cluster 3 and 5, our two largest genome clusters, they contain multiple conserved fiber families, alongside diverse tail chaperones, sheath, tube, and assembly factors. The observed variation in tail proteins suggests that not all tail components are involved in host recognition, supporting the notion that primarily the receptor-binding proteins mediate host specificity (Le et al., 2013). Future studies could investigate these cluster-specific tail proteins, as their high degree of conservation within clusters may serve as useful markers for distinguishing phage genera or clusters.

4.6 Horizontal transfer between clusters: two carbohydrate-active enzymes in Clusters 11 and 12 for phage-host binding

Among the highly divergent Clusters 11 and 12, we identified two carbohydrate-active enzymes, Acetylxylan esterase and a Concanavalin A-like lectin/glucanase superfamily protein, whose functions in capsule recognition and degradation may provide novel insights into the mechanisms of viral attachment to bacterial hosts (Peters et al., 2024; Pickard et al., 2010). This convergent adaptation to Acinetobacter’s carbohydrate-rich surface raises the possibility of overlapping host ranges, though neither original study assessed cross-cluster host susceptibility (Badawy et al., 2020; Pehde et al., 2021). Targeted host-range assays of representative Cluster 11 and 12 phages will be essential to test this hypothesis potentially shedding light on Acinetobacter phage-host binding and entry.

4.7 Gaps in taxonomical annotations

Taxonomical annotations using PhaBOX revealed substantial gaps in current phage taxonomy, with six of the twelve clusters lacking ICTV-recognized genus assignments. While PhaBOX classified the remaining clusters into established genera, these unassigned lineages underscore the limitations of existing frameworks and the need to expand reference databases to capture Acinetobacter phage diversity. For the ICTV assigned genera, the proteomic-based clustering approach was largely concordant with their taxonomy. However, this approach provides greater resolution, revealing subcluster structure within established genera. For example, Cluster 4, currently classified as Vieuvirus, displays heterogeneous clustering patterns and evidence of potential subclusters, suggesting that Vieuvirus may comprise multiple distinct genera. This approach can also capture clusters with only a few members that have not yet been assigned a genus by ICTV. By comparing these small clusters to related or homologous clusters, we can gain insights into their potential evolutionary lineage, functional modules, and shared mechanisms, highlighting relationships that are not apparent from current ICTV classifications like what we did for Clusters 4 and 9 as well as Cluster 5, 6, and 8.

4.8 Markers are not discriminatory for subcluster groupings

The use of concatenated terminase large subunit and portal protein sequences closely recapitulates full-proteome cluster groupings, validating their potential as accurate phylogenetic markers (Figure 2). This suggests a simpler, faster route to Acinetobacter phage classification analogous to 16S rRNA typing in bacteria enabling rapid clustering and characterization without full-genome analyses (Janda and Abbott, 2007). However, while the terminase large subunit and portal protein are effective for assigning phages to their respective clusters, their phylogenetic signal alone cannot accurately resolve how closely related those clusters are. As seen with Clusters 7 and 11, which appear as sister clades yet share only four genes by synteny, as well as Cluster 4 and 9 which the phylogenetic tree fails to identify as more related clusters, whole-proteome analyses are required to capture true evolutionary relatedness between clusters.

4.9 Applications: candidates for phage engineering

Our analysis also identified three conserved enzymes, endolysins, a DNA helicase, and an HNH homing endonuclease, present across multiple, phylogenetically distinct Acinetobacter phage clusters indicates strong evolutionary conservation, underscoring their potential as robust, naturally optimized candidates for phage engineering. As the findings reported here are derived solely from computational analyses, additional downstream experimental validation is required. Nevertheless, this framework serves to prioritize and refine a focused set of phage lineages, gene families, and functional modules that represent particularly promising candidates for targeted wet-lab investigation.

Endolysins target conserved peptidoglycan linkages, enabling broad lytic activity and making them strong candidates for lysin-based antimicrobials (Abdelrahman et al., 2021). The DNA helicase appears well adapted for efficient phage replication (Lo and Gao, 2021; Brok-Volchanskaya et al., 2008), while the HNH endonuclease likely enhances genome maturation, packaging, and recombination (Zhang et al., 2017). Their cross-cluster conservation suggests strong evolutionary selection for these functions, marking them as “nature-approved” modules for engineering phages with improved replication, adaptability, and lytic potency against diverse A. baumannii strains. Future work should benchmark pham_1, pham_48, and pham_60 endolysins head-to-head for bactericidal potency and breadth across diverse Acinetobacter strain panels to test whether their cross-cluster conservation corresponds to broad and robust killing activity.

5 Conclusion

This study has utilized a proteome-based framework for Acinetobacter phage diversity, resolving 12 clusters and 14 singletons while highlighting key translational opportunities. Broadly conserved enzymes, particularly endolysins, emerged as robust candidates for phage-derived antimicrobials. Cluster 10 represents a novel, broad-host-range lineage with strong therapeutic promise, driven by its conserved tail-fiber repertoire. At the same time, half of the clusters remain outside current ICTV taxonomy, underscoring the urgent need to expand phage classification systems to keep pace with new discoveries.

Together, these findings provide potential leads for phage and phage-based therapies while underscoring our current gaps in phage genomics and taxonomy.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: https://figshare.com/, https://doi.org/10.6084/m9.figshare.30295204.v1.

Author contributions

SP: Software, Writing – original draft, Investigation, Formal analysis, Data curation, Visualization, Conceptualization, Methodology, Writing – review & editing. SW: Software, Investigation, Resources, Writing – review & editing, Formal analysis, Project administration, Visualization, Data curation, Validation. EY: Supervision, Visualization, Validation, Investigation, Funding acquisition, Writing – review & editing, Project administration, Formal analysis.

Funding

The author(s) declared that financial support was received for this work and/or its publication. This research was funded by Research Center for Excellence IDMxS. SP is funded by NTU Research Scholarship.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that Generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2026.1720092/full#supplementary-material

References

Abdelrahman, F., Easwaran, M., Daramola, O. I., Ragab, S., Lynch, S., Oduselu, T. J., et al. (2021). Phage-encoded endolysins. Antibiotics 10:124. doi: 10.3390/antibiotics10020124,

PubMed Abstract | Crossref Full Text | Google Scholar

Alseth, E. O., Roush, C., Irby, I., Kopylov, M., Bobe, D., Diggs, M. W., et al. (2025). Mystique, a broad host range Acinetobacter phage, reveals the impact of culturing conditions on phage isolation and infectivity. PLoS Pathog. 21:e1012986. doi: 10.1371/journal.ppat.1012986,

PubMed Abstract | Crossref Full Text | Google Scholar

Altamirano, F. G., Forsyth, J. H., Patwa, R., Kostoulias, X., Trim, M., Subedi, D., et al. (2021). Bacteriophage-resistant Acinetobacter baumannii are resensitized to antimicrobials. Nat. Microbiol. 6, 157–161. doi: 10.1038/s41564-020-00830-7

Crossref Full Text | Google Scholar

Badawy, S., Pajunen, M. I., Haiko, J., Baka, Z. A. M., Abou-Dobara, M. I., el-Sayed, A. K. A., et al. (2020). Identification and functional analysis of temperate Siphoviridae bacteriophages of Acinetobacter Baumannii. Viruses 12:604. doi: 10.3390/v12060604,

PubMed Abstract | Crossref Full Text | Google Scholar

Bieńkowska-Szewczyk, K., and Taylor, A. (1980). Murein Transglycosylase from phage λ lysate purification and properties. Biochim. Biophys. Acta 615, 489–496. doi: 10.1016/0005-2744(80)90515-X,

PubMed Abstract | Crossref Full Text | Google Scholar

Bradley, J., Garau, J., Lode, H., Rolston, K., Wilson, S., and Quinn, J. (1999). Carbapenems in clinical practice: a guide to their use in serious infection. Int. J. Antimicrob. Agents 11, 93–100. doi: 10.1016/s0924-8579(98)00094-6,

PubMed Abstract | Crossref Full Text | Google Scholar

Brok-Volchanskaya, V. S., Kadyrov, F. A., Sivogrivov, D. E., Kolosov, P. M., Sokolov, A. S., Shlyapnikov, M. G., et al. (2008). Phage T4 SegB protein is a homing endonuclease required for the preferred inheritance of T4 tRNA gene region occurring in co-infection with a related phage. Nucleic Acids Res. 36, 2094–2105. doi: 10.1093/nar/gkn053,

PubMed Abstract | Crossref Full Text | Google Scholar

Cain, A. K., and Hamidian, M. (2023). Portrait of a killer: uncovering resistance mechanisms and global spread of Acinetobacter Baumannii. PLoS Pathog. 19:e1011520. doi: 10.1371/journal.ppat.1011520,

PubMed Abstract | Crossref Full Text | Google Scholar

Castillo-Ramírez, S. (2023). Genomic epidemiology of Acinetobacter baumannii goes global. MBio 14, 02520–02523. doi: 10.1128/mbio.02520-23

Crossref Full Text | Google Scholar

de Jonge, P. A., van den Born, B.-J. H., Zwinderman, A. H., Nieuwdorp, M., Dutilh, B. E., and Herrema, H. (2024). Phylogeny and disease associations of a widespread and ancient intestinal bacteriophage lineage. Nat. Commun. 15:6346. doi: 10.1038/s41467-024-50777-0,

PubMed Abstract | Crossref Full Text | Google Scholar

Gauthier, C. H., Cresawn, S. G., and Hatfull, G. F. (2022). PhaMMseqs: a new pipeline for constructing phage gene phamilies using MMseqs2. G3 (Bethesda) 12:233. doi: 10.1093/g3journal/jkac233,

PubMed Abstract | Crossref Full Text | Google Scholar

Gauthier, C. H., and Hatfull, G. F. (2023). PhamClust: a phage genome clustering tool using proteomic equivalence quotient for comparative genomics. mSystems 8:00443–23. doi: 10.1128/msystems.00443-23,

PubMed Abstract | Crossref Full Text | Google Scholar

Gauthier, C. H., and Hatfull, G. F. (2024). A Bioinformatic ecosystem for bacteriophage genomics: PhaMMSeqs, Phamerator, pdm_utils, PhagesDB, DEPhT, and PhamClust. Viruses 16:1278. doi: 10.3390/v16081278,

PubMed Abstract | Crossref Full Text | Google Scholar

Gilchrist, C. L. M., and Chooi, Y.-H. (2021). Clinker & clustermap.Js: automatic generation of gene cluster comparison figures. Bioinformatics 37, 2473–2475. doi: 10.1093/bioinformatics/btab007,

PubMed Abstract | Crossref Full Text | Google Scholar

Hatfull, G.F. (2008). Bacteriophage genomics. Curr. Opin. Microbiol. 11, 447–453. doi: 10.1016/j.mib.2008.09.004,

PubMed Abstract | Crossref Full Text | Google Scholar

Janda, J. M., and Abbott, S. L. (2007). 16S rRNA gene sequencing for bacterial identification in the diagnostic laboratory: pluses, perils, and pitfalls. J. Clin. Microbiol. 45, 2761–2764. doi: 10.1128/JCM.01228-07,

PubMed Abstract | Crossref Full Text | Google Scholar

Lai, S., Wang, H., Bork, P., Chen, W.-H., and Zhao, X.-M. (2024). Long-read sequencing reveals extensive gut phageome structural variations driven by genetic exchange with bacterial hosts. Sci. Adv. 10:eadn3316. doi: 10.1126/sciadv.adn3316,

PubMed Abstract | Crossref Full Text | Google Scholar

Le, S., He, X., Tan, Y., Huang, G., Zhang, L., Lux, R., et al. (2013). Mapping the tail fiber as the receptor binding protein responsible for differential host specificity of Pseudomonas aeruginosa bacteriophages PaP1 and JG004. PLoS One 8:e68562. doi: 10.1371/journal.pone.0068562,

PubMed Abstract | Crossref Full Text | Google Scholar

Letunic, I., and Bork, P. (2021). Interactive tree of life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296. doi: 10.1093/nar/gkab301,

PubMed Abstract | Crossref Full Text | Google Scholar

Letunic, I., and Bork, P. (2024). Interactive tree of life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. 52, W78–W82. doi: 10.1093/nar/gkae268,

PubMed Abstract | Crossref Full Text | Google Scholar

Lin, D. M., Koskella, B., and Lin, H. C. (2017). Phage therapy: an alternative to antibiotics in the age of multi-drug resistance. World J. Gastrointest. Pharmacol. Ther. 8, 162–173. doi: 10.4292/wjgpt.v8.i3.162,

PubMed Abstract | Crossref Full Text | Google Scholar

Lo, C.-Y., and Gao, Y. (2021). DNA helicase–polymerase coupling in bacteriophage DNA replication. Viruses 13:1739. doi: 10.3390/v13091739,

PubMed Abstract | Crossref Full Text | Google Scholar

Maasilta, I. 2025 Bacteriophages and their use in combating antimicrobial resistance World Health Organization Regional Office for Europe. Available online at: https://www.who.int/europe/news-room/fact-sheets/item/bacteriophages-and-their-use-in-combating-antimicrobial-resistance (Accessed March 8, 2025).

Google Scholar

Magar, S., Kolte, V., Sharma, G., and Govindarajan, S. (2024). Exploring Pangenomic diversity and CRISPR-Cas evasion potential in jumbo phages: a comparative genomics study. Microbiol. Spectr. 12, e04200–e04223. doi: 10.1128/spectrum.04200-23,

PubMed Abstract | Crossref Full Text | Google Scholar

Mardiana, M., Teh, S.-H., Lin, L.-C., and Lin, N.-T. (2022). Isolation and characterization of a novel Siphoviridae phage, vB_AbaS_TCUP2199, infecting multidrug-resistant Acinetobacter Baumannii. Viruses 14:6. doi: 10.3390/v14061240,

PubMed Abstract | Crossref Full Text | Google Scholar

Margulieux, K. R., Bird, J. T., Kevorkian, R. T., Ellison, D. W., Nikolich, M. P., and Mzhavia, N. (2023). Complete genome sequence of the broad host range Acinetobacter baumannii phage EAb13. Microbiol. Resour. Announc. 12:e00341-23. doi: 10.1128/MRA.00341-23

Crossref Full Text | Google Scholar

Moubareck, C. A., and Halat, D. H. (2020). Insights into Acinetobacter baumannii: a review of microbiological, virulence, and resistance traits in a threatening nosocomial pathogen. Antibiotics 9:119. doi: 10.3390/antibiotics9030119

Crossref Full Text | Google Scholar

Nelson, R. E., Schweizer, M. L., Perencevich, E. N., Nelson, S. D., Khader, K., Chiang, H. Y., et al. (2016). Costs and mortality associated with multidrug-resistant healthcare-associated Acinetobacter infections. Infect. Control Hosp. Epidemiol. 37, 1212–1218. doi: 10.1017/ice.2016.145,

PubMed Abstract | Crossref Full Text | Google Scholar

Oliveira, H., Domingues, R., Evans, B., Sutton, J. M., Adriaenssens, E. M., and Turner, D. (2022). Genomic diversity of bacteriophages infecting the genus Acinetobacter. Viruses 14:181. doi: 10.3390/v14020181,

PubMed Abstract | Crossref Full Text | Google Scholar

Orozco-Ochoa, A. K., González-Gómez, J. P., Quiñones, B., del Castro- Campo, N., Valdez-Torres, J. B., and Chaidez-Quiroz, C. (2025). Bacteriophage indie resensitizes multidrug-resistant Acinetobacter baumannii to antibiotics in vitro. Sci. Rep. 15:11578. doi: 10.1038/s41598-025-96669-1,

PubMed Abstract | Crossref Full Text | Google Scholar

Pehde, B. M., Niewohner, D., Keomanivong, F. E., and Carruthers, M. D. (2021). Genome sequence and characterization of AcinetobacterPhage DMU1. PHAGE 2, 50–56. doi: 10.1089/phage.2020.0043,

PubMed Abstract | Crossref Full Text | Google Scholar

Peters, D. L., Gaudreault, F., and Chen, W. (2024). Functional domains of Acinetobacter bacteriophage tail fibers. Front. Microbiol. 15:1230997. doi: 10.3389/fmicb.2024.1230997,

PubMed Abstract | Crossref Full Text | Google Scholar

Pickard, D., Toribio, A. L., Petty, N. K., van Tonder, A., Yu, L., Goulding, D., et al. (2010). A conserved acetyl esterase domain targets diverse bacteriophages to the vi capsular receptor of Salmonella enterica serovar Typhi. J. Bacteriol. 192, 5746–5754. doi: 10.1128/jb.00659-10,

PubMed Abstract | Crossref Full Text | Google Scholar

Podoviridae | ICTV (2025). Available online at: https://ictv.global/report_9th/dsDNA/Podoviridae (Accessed June 17, 2025).

Google Scholar

Qin, J. (2025). Genome sequence of the broad-host-range phage Phi1_092033 against Acinetobacter baumannii. Microbiol. Resour. Announc. 14:e0006225. doi: 10.1128/mra.00062-25,

PubMed Abstract | Crossref Full Text | Google Scholar

Qu, J., Zou, J., Zhang, J., Qu, J., and Lu, H. (2024). Phage therapy for extensively drug resistant Acinetobacter baumannii infection: case report and in vivo evaluation of the distribution of phage and the impact on gut microbiome. Front. Med. 11:1432703. doi: 10.3389/fmed.2024.1432703,

PubMed Abstract | Crossref Full Text | Google Scholar

Ranveer, S. A., Dasriya, V., Ahmad, M. F., Dhillon, H. S., Samtiya, M., Shama, E., et al. (2024). Positive and negative aspects of bacteriophages and their immense role in the food chain. Npj Sci. Food 8:1. doi: 10.1038/s41538-023-00245-8

Crossref Full Text | Google Scholar

Rice, L. B. (2008). Federal funding for the study of antimicrobial resistance in nosocomial pathogens: no ESKAPE. J. Infect. Dis. 197, 1079–1081. doi: 10.1086/533452,

PubMed Abstract | Crossref Full Text | Google Scholar

Scoffone, V. C., Trespidi, G., Barbieri, G., Arshad, A., Israyilova, A., and Buroni, S. (2025). The evolution of antimicrobial resistance in Acinetobacter Baumannii and new strategies to fight it. Antibiotics 14:85. doi: 10.3390/antibiotics14010085,

PubMed Abstract | Crossref Full Text | Google Scholar

Shang, J. 2021. ‘PhaGCN: phage taxonomic classification with graph convolutional networks’. Bioinformatics. Available online at: https://github.com/KennthShang/PhaGCN (Accessed March 8, 2025).

Google Scholar

Shang, J., Peng, C., Liao, H., Tang, X., and Sun, Y. (2023). PhaBOX: a web server for identifying and characterizing phage contigs in metagenomic data. Bioinform. Adv. 3:vbad101. doi: 10.1093/bioinformatics/btad101

Crossref Full Text | Google Scholar

Shang, J., Peng, C., Liao, H., Tang, X., Sun, Y., and Yan, X. (2022). PhaTYP: predicting the lifestyle for bacteriophages using BERT’. arxiv:2206.09693. doi: 10.48550/arXiv.2206.09693

Crossref Full Text | Google Scholar

Sievers, F., and Higgins, D. G. (2014). Clustal omega, accurate alignment of very large numbers of sequences. Methods Mol. Biol. 1079, 105–116. doi: 10.1007/978-1-62703-646-7_6,

PubMed Abstract | Crossref Full Text | Google Scholar

Szymczak, P., Rau, M. H., Monteiro, J. M., Pinho, M. G., Filipe, S. R., Vogensen, F. K., et al. (2019). A comparative genomics approach for identifying host-range determinants in Streptococcus Thermophilus bacteriophages. Sci. Rep. 9:7991. doi: 10.1038/s41598-019-44481-z,

PubMed Abstract | Crossref Full Text | Google Scholar

Taslem Mourosi, J., Awe, A., Guo, W., Batra, H., Ganesh, H., Wu, X., et al. (2022). Understanding bacteriophage tail fiber interaction with host surface receptor: the key “blueprint” for reprogramming phage host range. Int. J. Mol. Sci. 23:12146. doi: 10.3390/ijms232012146,

PubMed Abstract | Crossref Full Text | Google Scholar

Turner, D., Ackermann, H. W., Kropinski, A. M., Lavigne, R., Sutton, J. M., and Reynolds, D. M. (2018). Comparative analysis of 37 Acinetobacter bacteriophages reveals genomic clustering and core–shell genes. Viruses 10:5. doi: 10.3390/v10010005,

PubMed Abstract | Crossref Full Text | Google Scholar

UniProt (2025). UniProt. Available online at: https://www.uniprot.org/uniprotkb/P03746/entry (Accessed March 8, 2025).

Google Scholar

Waterhouse, A. M., Procter, J. B., Martin, D. M. A., Clamp, M., and Barton, G. J. (2009). Jalview version 2 – a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189–1191. doi: 10.1093/bioinformatics/btp033,

PubMed Abstract | Crossref Full Text | Google Scholar

WHO Bacterial Priority Pathogens List. Bacterial Pathogens of Public Health Importance to Guide Research, Development and Strategies to Prevent and Control Antimicrobial Resistance. 2024. Available online at: https://www.who.int/publications/i/item/9789240093461 (Accessed March 8, 2025).

Google Scholar

Yuan, Y., and Gao, M. (2017). Jumbo bacteriophages: an overview. Front. Microbiol. 8:403. doi: 10.3389/fmicb.2017.00403,

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, L., Xu, D., Huang, Y., Zhu, X., Rui, M., Wan, T., et al. (2017). Structural and functional characterization of deep-sea thermophilic bacteriophage GVE2 HNH endonuclease. Sci. Rep. 7:42542. doi: 10.1038/srep42542,

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: Acinetobacter, bacteriophage, comparative genomics, endolysins, phage engineering, phage therapy, proteomics

Citation: Pang SJL, Wee SK and Yap EPH (2026) Genomic landscape of complete Acinetobacter phages: clustering, core-shell genes, and synteny insights. Front. Microbiol. 17:1720092. doi: 10.3389/fmicb.2026.1720092

Received: 07 October 2025; Revised: 06 January 2026; Accepted: 08 January 2026;
Published: 28 January 2026.

Edited by:

Alicja Wegrzyn, University of Gdansk, Poland

Reviewed by:

Jetsi Mancilla Rojano, National Autonomous University of Mexico, Mexico
Marcin Górniak, University of Gdansk, Poland

Copyright © 2026 Pang, Wee and Yap. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Eric Peng Huat Yap, ZXJpY3lhcEBudHUuZWR1LnNn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.