PhageWeb – Web Interface for Rapid Identification and Characterization of Prophages in Bacterial Genomes

Sousa, Ailton Lopes de; Maués, Dener; Lobato, Amália; Franco, Edian F.; Pinheiro, Kenny; Araújo, Fabrício; Pantoja, Yan; Costa da Silva, Artur Luiz da; Morais, Jefferson; Ramos, Rommel T. J.

doi:10.3389/fgene.2018.00644

ORIGINAL RESEARCH article

Front. Genet., 18 December 2018

Sec. Computational Genomics

Volume 9 - 2018 | https://doi.org/10.3389/fgene.2018.00644

1. Institute of Biological Sciences, Federal University of Para, Belém, Brazil
2. Institute of Exact and Natural Sciences, Federal University of Para, Belém, Brazil

Abstract

This study developed a computational tool with a graphical interface and a web service that identifies phage regions through homology search and gene clustering. It uses G+C content variation and predicted tRNA sites as evidence to reinforce the presence of prophages in indeterminate regions. It also performs functional characterization of the prophage regions by integrating data from biological databases. The performance of PhageWeb was compared with that of other available tools (PHASTER, Prophinder, and PhiSpy) using Sensitivity (Sn) and Positive Predictive Value (PPV) tests. More than 80 manually annotated genomes were used as the reference for the tests. In the PhageWeb analysis, Sn was 86.1% and PPV was approximately 87%, while the second best tool presented Sn and PPV values of 83.3 and 86.5%, respectively. These numbers indicate greater precision in the regions identified by PhageWeb when compared with the other prediction tools submitted to the same tests. Additionally, PhageWeb was much faster than the other computational alternatives, decreasing the processing time to approximately one-ninth of the time required by the second best software. PhageWeb is freely available at http://computationalbiology.ufpa.br/phageweb.

Introduction

Phages are the most abundant organisms on earth (Rohwer, 2003). They inhabit various environments and are able to infect various bacterial species. Phages are also an important factor in bacterial evolution through horizontal gene transfer (Ochman et al., 2000) because they allow the insertion of extrinsic genetic material that can provide new characteristics to their hosts, such as antibiotic resistance, virulence factors, operons or even genomic islands (Bernheim and Sorek, 2018). These characteristics are present in cases of diphtheria (Brüssow et al., 2004), cholera (Kim et al., 2010), and food poisoning by enterohaemorrhagic Escherichia coli (Tozzoli et al., 2014). Moreover, phages have biotechnological applications such as cloning by phage display (Winter et al., 1994), diagnosis of infections by phagotyping (Haq et al., 2012; Schofield et al., 2012), vehicles for vaccine delivery (Jafari and Abediankenari, 2015) and phage therapy as an alternative to antibiotics (Levin and Bull, 2004). Phages also play an ecological role, helping to recycle nutrients and increasing photosynthesis in the oceans (Mann et al., 2003; Sullivan et al., 2003). These organisms have two life cycles: lytic and lysogenic. During the lysogenic cycle, after successful integration into the bacterial genome, phages can perform incision and excision, or remain dormant in the genome. In this integrated state they are called prophages. Depending on the size of the region and the success of the insertion, the prophage may remain complete or become cryptic through decay (Canchaya et al., 2003; Brüssow et al., 2004), in which case the remains of its genetic material can still provide the host with genes that benefit its survival.

Prophages can be considered a cluster of phage-like genes (Zhou et al., 2011). Computational approaches such as clustering algorithms are used to determine whether these genes are close enough to each other to constitute a prophage region (Lima-Mendez et al., 2008; Zhou et al., 2011). Moreover, an important factor for the identification of prophages is the integration of phages into specific insertion sites, such as the tRNA genes of the bacterial genome (Delesalle et al., 2016). Thus, insertions in these genes indicate extrinsic genetic material, although phages do not use these sites exclusively. In addition, G+C content has been used as a feature to confirm horizontal gene transfer (HGT), the presence of genomic islands and, more generally, to identify mobile genetic elements (Langille et al., 2010). In such regions, the G+C content may be quite distinct from that of the rest of the organism’s genome, and this feature is commonly used to confirm HGT in silico (Eng et al., 2011).

Many bacterial genomes available in public databases contain phage DNA integrated into their chromosome, and in some cases phage DNA can make up 10–20% of the bacterial genome (Casjens, 2003). Due to the reduced cost of sequencing complete bacterial genomes and the high cost of detecting prophages by bench methodologies (Metzker, 2010), new in silico tools have been developed for prophage detection in sequenced genomes (Lima-Mendez et al., 2008; Zhou et al., 2011; Akhter et al., 2012) and for prediction of phage DNA sequences in metagenomic data (Amgarten et al., 2018). These computational tools generally identify sets of protein-coding genes according to their similarity to known phage genes. However, some of these tools present hindrances, such as the absence of a graphical interface, slow processing and the lack of a broader methodology for finding prophages in bacterial genomes (Srividhya et al., 2007).

Thus, this work presents PhageWeb, a tool to identify prophages in bacterial genomes that considers the similarity of gene sequences against a phage database and uses additional indicators: alteration of G+C content and the presence of tRNA genes flanking the region, which can be used as evidence of an insertion site (Campbell, 2003). These parameters allow each region to be analyzed through functional characterization with fast processing.

Materials and Methods

Pipeline

PhageWeb receives bacterial genomic sequences in GenBank or EMBL format, or the NCBI accession number of the bacterial genome, as input for analysis. It then uses the DIAMOND tool (Buchfink et al., 2015) to identify phage-homologous regions in the bacterial genome based on its own database (updated by the application itself), generating a data table that is integrated into the pipeline. The user can change two parameters to refine the analysis: MinPts (the minimum number of phage proteins in a region) and the alignment identity against the phage database. Once the input data have been submitted, the homology search and gene clustering steps select candidate prophage regions. Next, G+C content and tRNA sites are identified and the predicted sequences are characterized. Finally, an optional phage gene conservation analysis is performed to indicate the possible integrity of the predicted regions, based on the percentage of genes in the region that belong to a given phage. For example, if 80% or more of the genes in a region identified by PhageWeb belong to a given phage, the region is considered potentially conserved; if the value is less than 80%, the region is considered not conserved. The percentage threshold can optionally be set by the user at the beginning of each analysis. The PhageWeb pipeline is shown in Figure 1.
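The conservation check described above can be illustrated with a minimal sketch. It is not PhageWeb's implementation: the function name, the input layout (one best-hit phage name per gene) and the choice to count genes without a hit in the denominator are assumptions made for illustration. Only the 80% example threshold comes from the text.

```python
from collections import Counter

def conservation_status(best_hit_phages, threshold=0.80):
    """Classify a region as potentially conserved or not conserved.

    best_hit_phages: one entry per gene in the region, naming the phage of
    the gene's best database hit, or None when the gene has no phage hit
    (hypothetical input layout).
    Returns (dominant_phage, fraction, is_conserved).
    """
    counts = Counter(p for p in best_hit_phages if p is not None)
    if not counts:
        return None, 0.0, False
    dominant_phage, n_genes = counts.most_common(1)[0]
    # Assumption: genes without any phage hit still count in the denominator.
    fraction = n_genes / len(best_hit_phages)
    return dominant_phage, fraction, fraction >= threshold

# Hypothetical region: 8 of 10 genes hit the same phage.
region = ["phiA"] * 8 + ["phiB", None]
print(conservation_status(region))
```

Exposing the threshold as a parameter mirrors the user-adjustable percentage mentioned in the text.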

FIGURE 1

Graphical Elements

The interactive graphics for prophage regions in this application were built with AngularPlasmid¹, a JavaScript DNA plasmid visualization component developed using Google’s AngularJS framework. AngularPlasmid provides an implementation that creates plasmid maps that are easy to use on the web. Instead of requiring client-side JavaScript coding or server-side programming languages, AngularPlasmid provides easy-to-use HTML markup, making map generation as easy as creating a web page.

Phage Database

The PhageWeb database consists of a collection of prophage sequences reported in several public databases. Two data sources were used: the genome database of the National Center for Biotechnology Information (NCBI)² and the European Bioinformatics Institute (EBI) database³. The latter has an interactive environment for collecting and sharing information related to phage genomics. The identified sequences were stored in a MySQL database and incorporated into the application. All nucleotide sequences (FASTA and annotated files), as well as the database, are available in the tool, which is updated weekly.

Controlled Dataset

Eighty-four complete bacterial genomes with predicted regions and manually annotated prophages (Casjens, 2003) were collected to verify and quantify the processing time, accuracy and performance of PhageWeb relative to other software.

Criteria for Identification of Prophage Regions

Clustering Algorithm

The controlled dataset (Casjens, 2003) was used to identify prophage regions by clustering known phage sequences, based on the genome coordinates of the homologous genes (Zhou et al., 2011). Three density-based clustering algorithms (DBSCAN, OPTICS, and HDBSCAN) were evaluated for identifying candidate prophages, and the best one was chosen for implementation in PhageWeb. To evaluate the performance of the algorithms, four cluster evaluation metrics were used: Silhouette (Rousseeuw, 1987), Dunn (Dunn, 1974), Davies–Bouldin (Davies and Bouldin, 1979), and the Density-Based Clustering Validation index, DBCV (Moulavi et al., 2014).
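To make the idea of density-based clustering on genome coordinates concrete, the following is a simplified one-dimensional DBSCAN sketch in pure Python. It is an illustration only, not the HDBSCAN procedure used in PhageWeb: the function name, the `eps` distance and the example coordinates are hypothetical, and `min_pts` merely plays the role of the MinPts parameter mentioned in the pipeline description.

```python
from bisect import bisect_left, bisect_right

def dbscan_1d(positions, eps, min_pts):
    """Cluster gene coordinates; returns one label per position (-1 = noise)."""
    order = sorted(range(len(positions)), key=lambda i: positions[i])
    xs = [positions[i] for i in order]
    # Core point: at least min_pts points (itself included) within eps.
    is_core = [
        bisect_right(xs, x + eps) - bisect_left(xs, x - eps) >= min_pts
        for x in xs
    ]
    sorted_labels = [-1] * len(xs)
    cluster, last_core = -1, None
    for k, x in enumerate(xs):
        if not is_core[k]:
            continue
        # Start a new cluster when the previous core point is farther than eps.
        if last_core is None or x - xs[last_core] > eps:
            cluster += 1
        sorted_labels[k] = cluster
        last_core = k
    # Border points join the cluster of a core point within eps, if any.
    core_idx = [k for k in range(len(xs)) if is_core[k]]
    core_xs = [xs[k] for k in core_idx]
    for k, x in enumerate(xs):
        if is_core[k]:
            continue
        j = bisect_left(core_xs, x)
        for c in (j - 1, j):
            if 0 <= c < len(core_xs) and abs(core_xs[c] - x) <= eps:
                sorted_labels[k] = sorted_labels[core_idx[c]]
                break
    labels = [-1] * len(xs)
    for k, i in enumerate(order):
        labels[i] = sorted_labels[k]
    return labels

# Hypothetical start coordinates of phage-like genes.
hits = [1000, 2000, 3000, 50000, 100000, 101000, 102500]
print(dbscan_1d(hits, eps=2000, min_pts=3))
```

HDBSCAN differs from this sketch in that it does not need a single fixed `eps`, which is one reason it can handle regions of varying gene density.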

G+C Content

To increase the precision of prophage identification, a method based on DNA composition (Eng et al., 2011) was used, in which a sliding window of 1000 bp moves through the entire target genome. The sliding window divides the genome into several smaller sets (regions), and each region can be evaluated according to its G+C content (Lu and Leong, 2016). Previous studies (Eng et al., 2011) proposed evaluating HGT by the G+C content of the genes inserted in these regions. Accordingly, PhageWeb classifies a specific region as a prophage if at least 80% of its genes show a G+C percentage above the mean plus one standard deviation or below the mean minus one standard deviation.
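The G+C criterion can be sketched as follows. This is a minimal illustration under stated assumptions, not the tool's code: the function names are hypothetical, the windows are taken as non-overlapping (the step size is not given in the text), and the mean and standard deviation are assumed to be computed over the window values using the population standard deviation.

```python
from statistics import mean, pstdev

def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)

def window_gc(genome, window=1000):
    """G+C percentage of consecutive, non-overlapping windows (assumed step)."""
    return [
        gc_percent(genome[i:i + window])
        for i in range(0, len(genome) - window + 1, window)
    ]

def is_gc_atypical(gene_gc_values, genome_mean, genome_sd, min_fraction=0.8):
    """True if at least min_fraction of genes fall outside mean +/- one SD."""
    if not gene_gc_values:
        return False
    outside = sum(
        1 for g in gene_gc_values
        if g > genome_mean + genome_sd or g < genome_mean - genome_sd
    )
    return outside / len(gene_gc_values) >= min_fraction

# Toy genome: AT-rich flanks around a GC-rich block (hypothetical data).
genome = "AT" * 500 + "GC" * 500 + "AT" * 500
values = window_gc(genome)
print(values, is_gc_atypical([95, 90, 85, 88, 40], mean(values), pstdev(values)))
```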

tRNA Regions

Phages generally integrate into specific insertion sites, among them the tRNA genes of the bacterial genome (Campbell, 2003; Delesalle et al., 2016). These sites can be used as an indication of the insertion of external genetic material, although phages do not use only these sites as targets for integration.
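As a rough illustration of how flanking tRNA genes could be used as supporting evidence, the sketch below reports tRNA positions that lie close to either boundary of a candidate region. The function name, the 2000 bp distance cutoff and the tRNA positions are all hypothetical; the text does not state how PhageWeb defines "flanking". The region coordinates are taken from Region 1 of the reference annotation in Table 4.

```python
def flanking_trnas(region_start, region_end, trna_positions, max_distance=2000):
    """Return tRNA positions within max_distance bp of either region boundary."""
    return [
        pos for pos in trna_positions
        if min(abs(pos - region_start), abs(pos - region_end)) <= max_distance
    ]

# Candidate region with three hypothetical tRNA positions.
print(flanking_trnas(35516, 49727, [34000, 60000, 49800]))
```

A non-empty result would count as supporting evidence only, since phages do not integrate exclusively at tRNA genes.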

Web Services

The functional characterization of the prophage regions is performed by integrating the results of the PhageWeb identification step with public databases such as UniProt, NCBI, InterPro, KEGG, Pfam and Gene Ontology, accessed through the UniProt public API web service. After the integration, the results are processed and displayed in charts and tables to simplify their analysis and interpretation.

Software

PhageWeb was developed as a graphical interface for the rapid identification and characterization of prophages in bacterial genomes, using PHP combined with the Python and Perl programming languages, together with the Bootstrap framework. The tool implements an algorithm that combines similarity searches with density-based clustering to identify prophage regions in bacterial genomes. The software is available at http://www.computationalbiology.ufpa.br/phageweb and is compatible with Mozilla Firefox 55.0.3, Opera 38.0.2 and Google Chrome 61.0. Additionally, an Application Programming Interface (API) was created to allow external execution and thereby facilitate the integration of the application with other software. The API and usage instructions are available at https://github.com/phagewebufpa/API.

Tools Comparison

Three tools available to predict phage sequences in genomes were evaluated: Prophinder (Lima-Mendez et al., 2008), PHASTER (Arndt et al., 2016), and PhiSpy (Akhter et al., 2012).

Prophinder is one of the first web tools for prophage detection. It uses BLAST to find coding sequences (CDS) that are similar to those in the ACLAME database. Based on the annotation of the ACLAME database, Prophinder selects the genes with the best correspondence to a potential prophage. PHASTER is also a web tool developed to identify phages inside bacterial genomes. Like Prophinder, it uses homology search for prediction. PHASTER is an upgraded version of the PHAST program (Zhou et al., 2011) and accepts DNA sequence data as well as annotated data in GenBank format as input. In general, PHASTER stands out for its ability to provide quality annotations with the prophage’s characteristics and to distinguish between intact and incomplete prophages. PhiSpy, however, differs from the others in its ability to identify prophage regions that do not have any similarity to known target genes: its predictions are not based solely on homology search. The PhiSpy phage detection algorithm was developed based on seven phage-distinguishing characteristics: protein length, transcription strand direction, AT skew, GC skew, the abundance of unique phage words, insertion point and similarity of phage proteins. PHASTER, Prophinder, and PhiSpy were run with default parameter values. To compare the performance of the computational tools, Sensitivity and Positive Predictive Value were used as evaluation metrics.

Sensitivity and Positive Predictive Value

The performance of PhageWeb against other platforms was evaluated using Sensitivity (Sn), the proportion of true elements (here, reference prophages) that a test classifies as positive, and the Positive Predictive Value (PPV), the proportion of positive predictions that are true positives. Sn is obtained by: (reference prophages detected)/(total reference prophages), and PPV is obtained by: (reference prophages detected)/(reference prophages detected + non-reference prophages detected). The alignment identity setting can be adjusted by the PhageWeb user; the performance tests, however, were run with the alignment identity set at 80%.
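The two formulas above translate directly into code. The sketch below is illustrative; the function names are hypothetical and the counts in the example are invented for demonstration, not taken from the study.

```python
def sensitivity(reference_detected, total_reference):
    """Sn = reference prophages detected / total reference prophages."""
    if total_reference == 0:
        raise ValueError("total_reference must be positive")
    return reference_detected / total_reference

def positive_predictive_value(reference_detected, non_reference_detected):
    """PPV = reference detected / (reference detected + non-reference detected)."""
    total_predicted = reference_detected + non_reference_detected
    if total_predicted == 0:
        raise ValueError("no predictions were made")
    return reference_detected / total_predicted

# Hypothetical counts: 43 of 50 reference prophages found, plus 7 extra calls.
print(f"Sn  = {sensitivity(43, 50):.1%}")
print(f"PPV = {positive_predictive_value(43, 7):.1%}")
```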

Results

Clustering

The reference dataset contained previously identified and annotated prophage regions, with several prophage regions in each genome. Using density-based clustering algorithms (Zhou et al., 2011), we determined the number of candidate regions recovered relative to the reference data. HDBSCAN gave the best performance in cluster identification, followed by OPTICS, and also gave the best results in the cluster evaluation metrics. The four metrics used to evaluate the algorithms were Silhouette (Rousseeuw, 1987), Dunn (Dunn, 1974), Davies–Bouldin (Davies and Bouldin, 1979), and the Density-Based Clustering Validation index, DBCV (Moulavi et al., 2014). Table 1 shows the number of clusters identified by each algorithm and the average value of each of the four cluster evaluation metrics. HDBSCAN was selected for use in our tool because of its superior performance in identifying prophages in the genome.

Table 1

Algorithm	Clusters	Silhouette	DBCV	Dunn	Davies–Bouldin (DB)
DBSCAN	151	0.47	-0.73323973	0.0006	0.553
OPTICS	168	0.54	-0.677653797	0.003	0.51
HDBSCAN	186	0.86	0.285253761	0.087	1.2

Performance evaluation of clustering algorithms in the identification of prophage regions, based on the metrics Silhouette, Dunn, Davies–Bouldin (DB), and Density-Based Clustering Validation index (DBCV).

Silhouette: a method for interpreting and validating the consistency of data within clusters. Dunn: a metric for evaluating clustering algorithms whose purpose is to identify compact clusters, with small variation among cluster members. Davies–Bouldin: a metric that validates how well the clustering was done using quantities and characteristics inherent to the data set. DBCV: a relative validation index for density-based, arbitrarily shaped clusters. The highlighted results (underscores) represent the mean value of the algorithm with the best performance in the identification and formation of prophage clusters according to the metrics.
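As an illustration of the first metric in the legend above, the Silhouette coefficient can be computed for one-dimensional coordinates in a few lines. This is a sketch for intuition only: the function name is hypothetical, noise points (label -1) are excluded by assumption, and the example values are invented. For each point, a is the mean distance to the other members of its cluster, b is the smallest mean distance to any other cluster, and the score is (b - a)/max(a, b).

```python
def silhouette_1d(values, labels):
    """Mean Silhouette coefficient for 1-D points; label -1 (noise) is ignored."""
    clusters = {}
    for value, label in zip(values, labels):
        if label != -1:
            clusters.setdefault(label, []).append(value)
    if len(clusters) < 2:
        raise ValueError("Silhouette needs at least two clusters")
    scores = []
    for label, members in clusters.items():
        for idx, v in enumerate(members):
            if len(members) == 1:
                scores.append(0.0)  # convention for singleton clusters
                continue
            # a: mean distance to the other members of the same cluster
            a = sum(abs(v - o) for j, o in enumerate(members) if j != idx)
            a /= len(members) - 1
            # b: smallest mean distance to the members of any other cluster
            b = min(
                sum(abs(v - o) for o in other) / len(other)
                for other_label, other in clusters.items()
                if other_label != label
            )
            denom = max(a, b)
            scores.append((b - a) / denom if denom else 0.0)
    return sum(scores) / len(scores)

# Two compact, well-separated groups score close to 1.
print(round(silhouette_1d([0, 1, 10, 11], [0, 0, 1, 1]), 3))
```

Values near 1 indicate compact, well-separated clusters, while values near 0 or below indicate overlapping or poorly assigned clusters.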

Performance Evaluation

The comparison between PHASTER, Prophinder, PhiSpy, and PhageWeb showed that PhageWeb had the highest Sensitivity (Sn) and the second best Positive Predictive Value (PPV) among the tools. For the analyzed dataset, PhageWeb reached 86.1% sensitivity and 87.3% PPV. Based on the mean runtime for each analyzed genome, it is estimated that PhageWeb reduced the processing time for prophage prediction to approximately one-ninth of the time required by the other tools (Table 3). The Sn and PPV results for the dataset are compared in Table 2.

Table 2

Metric	PHASTER	Prophinder	PhiSpy	PhageWeb
Sn	83.33%	81.02%	52.78%	86.11%
PPV	86.54%	77.43%	88.37%	87.32%

Comparative analysis of values obtained for Sn (Sensitivity) and PPV (Positive Predictive Value) between computational tools.

The complete data for this analysis can be found in the Supplementary Information section.

Table 3

Resource	PHASTER	Prophinder	PhiSpy	PhageWeb
Using graphical interface	Yes	Yes	No	Yes
Homology analyses	Yes	Yes	Yes	Yes
Analyses of tRNA sites	Yes	No	No	Yes
G+C content analysis	No	No	No	Yes
Results exportation	Yes	Yes	No	Yes
Circular genome view	Yes	No	No	Yes
Characterization of sequences	Yes	No	No	Yes
Alignment details	Yes	No	No	Yes
Support for biological databases integration	No	No	No	Yes
Output types	Text, graphics	Text, graphics	Text only	Text, graphics
Run time (seconds)	∼365	∼1890	∼5547	∼22

Comparison of functionalities and features of phage prediction tools.

Considering the features and performance of the phage identification tools, PhageWeb offers features similar to those of the others while allowing a more complete analysis, with alignment details and functional characterization of the sequences, the use of G+C content and tRNA regions as evidence to improve the reliability of the results, and a shorter execution time. Runtime values were obtained experimentally from the bacterial genomes in the dataset. A comparative analysis of the resources available in these tools is shown in Table 3. The tests used to collect this information followed the same standard of analysis for all the tools: the same input data, and only features shared by all the tools.

In addition, the prophage predictions obtained for the genome of Lactococcus lactis subsp. lactis Il1403 (NC_002662) are presented as an example. Table 4 shows the coordinates (start and end) of the prophage regions in the reference genome, along with the results from the prediction tools. A graphical representation of this analysis, produced with the BRIG software (Alikhan et al., 2011), is shown in Figure 2.

Table 4

Prophage	Reference coordinates	PHASTER	Prophinder	PhiSpy	PhageWeb
Region 1	35516-49727	28461-56371	35516-49727	28818-56368	35516-72698
Region 2	447236-483244	443651-484066	451007-483244	447083-484064	447236-483552
Region 3	502723-513742	502338-520485	502723-511542	-	502723-517314
Region 4	1036642-1071558	1033815-1079175	1036642-1071558	1036482-1113152	1036642-1159446
Region 5	1414112-1456949	1414112-1457046	1439215-1446438	1415361-1457456	1415811-1456949
Region 6	2013685-2025635	1997701-2028023	2011426-2025635	-	2013685-2024681
-	False positives	-	-	633126-658623	-

Prophage regions identified by the computational tools for the genome of Lactococcus lactis subsp. lactis Il1403 (NC_002662), compared with the manually curated annotation of this strain.
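One way to read the coordinates in Table 4 is to measure how much of each reference region a prediction covers and how much of the prediction lies inside the reference. The sketch below is only an illustrative reading aid, not part of the evaluation reported in the article; the function name is hypothetical, and coordinates are treated as 1-based and inclusive by assumption. The example intervals are taken from Table 4.

```python
def overlap_stats(reference, predicted):
    """Return (overlap_bp, fraction_of_reference, fraction_of_prediction)."""
    ref_start, ref_end = reference
    pred_start, pred_end = predicted
    # Inclusive coordinates: a region from 10 to 12 spans 3 bp.
    overlap = max(0, min(ref_end, pred_end) - max(ref_start, pred_start) + 1)
    ref_len = ref_end - ref_start + 1
    pred_len = pred_end - pred_start + 1
    return overlap, overlap / ref_len, overlap / pred_len

# Region 1: reference vs. PhageWeb prediction.
print(overlap_stats((35516, 49727), (35516, 72698)))
# Region 5: reference vs. Prophinder prediction.
print(overlap_stats((1414112, 1456949), (1439215, 1446438)))
```

A prediction that fully contains the reference region covers it completely but may include extra flanking sequence, which the third value makes visible.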

FIGURE 2

Conclusion

Despite the efficiency of existing tools for phage analysis in bacterial genomes, PhageWeb presents an efficient alternative for the identification of prophages. It has high accuracy in the prediction of these organisms and compares well in features and simplicity of use. It also has a graphical interface that allows better interaction and flexibility to manipulate and export the resulting data. In addition, the possibility of performing other analyses, such as GO and metabolic pathways, in the same environment simplifies the data analysis process and considerably reduces the effort required to interact with biological databases.

Statements

Data availability statement

The datasets analyzed for this study can be found in the PhageWeb – Dataset (http://computationalbiology.ufpa.br/phageweb/dataset/).

Author contributions

RR and AS conceived the idea of the program and, together with DM, KP, EF, FA, and YP, developed the computational tool. AL, AC, and JM evaluated the biological and computational information, and defined the databases to be integrated and the functions to be included. All authors reviewed the manuscript.

Funding

This work has been supported by CNPq (Conselho Nacional de Pesquisa Científica) grants #421528/2016-8 and #304711/2015-2, CAPES (Coordenação de Aperfeiçoamento de Pessoal de Nível Superior), and PROPESP/UFPA (Pró-Reitoria de Pesquisa e Pós Graduação/Universidade Federal do Pará).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2018.00644/full#supplementary-material

Footnotes

1.^http://angularplasmid.vixis.com

2.^http://www.ncbi.nlm.nih.gov

3.^http://www.ebi.ac.uk/genomes/phage.html

References

1. Akhter, S., Aziz, R. K., and Edwards, R. A. (2012). PhiSpy: a novel algorithm for finding prophages in bacterial genomes that combines similarity- and composition-based strategies. Nucleic Acids Res. 40:e126. doi: 10.1093/nar/gks406
2. Alikhan, N. F., Petty, N. K., Ben Zakour, N. L., and Beatson, S. A. (2011). BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons. BMC Genomics 12:402. doi: 10.1186/1471-2164-12-402
3. Amgarten, D., Braga, L. P. P., da Silva, A. M., and Setubal, J. C. (2018). MARVEL, a tool for prediction of bacteriophage sequences in metagenomic bins. Front. Genet. 9:304. doi: 10.3389/fgene.2018.00304
4. Arndt, D., Grant, J. R., Marcu, A., Sajed, T., Pon, A., Liang, Y., et al. (2016). PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 44, W16–W21. doi: 10.1093/nar/gkw387
5. Bernheim, A., and Sorek, R. (2018). Viruses cooperate to defeat bacteria. Nature 559, 482–484. doi: 10.1038/d41586-018-05762-1
6. Brüssow, H., Canchaya, C., and Hardt, W. D. (2004). Phages and the evolution of bacterial pathogens: from genomic rearrangements to lysogenic conversion. Microbiol. Mol. Biol. Rev. 68, 560–602. doi: 10.1128/MMBR.68.3.560-602.2004
7. Buchfink, B., Xie, C., and Huson, D. H. (2015). Fast and sensitive protein alignment using DIAMOND. Nat. Methods 12, 59–60. doi: 10.1038/nmeth.3176
8. Campbell, A. (2003). Prophage insertion sites. Res. Microbiol. 154, 277–282. doi: 10.1016/S0923-2508(03)00071-8
9. Canchaya, C., Proux, C., Fournous, G., Bruttin, A., and Brüssow, H. (2003). Prophage genomics. Microbiol. Mol. Biol. Rev. 67, 238–276. doi: 10.1128/MMBR.67.2.238-276.2003
10. Casjens, S. (2003). Prophages and bacterial genomics: what have we learned so far? molecular microbiology. Nucleic Acids Res. 49, 277–300.
11. Davies, D., and Bouldin, D. (1979). A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. 1, 224–227. doi: 10.1109/TPAMI.1979.4766909
12. Delesalle, V. A., Tanke, N. T., Vill, A. C., and Krukonis, G. P. (2016). Testing hypotheses for the presence of tRNA genes in mycobacteriophage genomes. Bacteriophage 3:121. doi: 10.1080/21597081.2016.1219441
13. Dunn, J. (1974). Well-separated clusters and optimal fuzzy partitions. J. Cybern. 4, 95–104. doi: 10.1080/01969727408546059
14. Eng, C., Thibessard, A., Danielsen, M., Rasmussen, T. B., Mari, J. F., and Leblond, P. (2011). In silico prediction of horizontal gene transfer in Streptococcus thermophilus. Arch. Microbiol. 193, 287–297. doi: 10.1007/s00203-010-0671-8
15. Haq, I. U., Chaudhry, W. N., Akhtar, M. N., Andleeb, S., and Qadri, I. (2012). Bacteriophages and their implications on future biotechnology: a review. Virol. J. 9:9. doi: 10.1186/1743-422X-9-9
16. Jafari, N., and Abediankenari, S. (2015). Phage particles as vaccine delivery vehicles: concepts, applications and prospects. Asian Pac. J. Cancer Prev. 16, 8019–8029. doi: 10.7314/APJCP.2015.16.18.8019
17. Kim, E. J., Lee, C. H., Nair, G. B., and Kim, D. W. (2010). Whole-genome sequence comparisons reveal the evolution of Vibrio cholerae O1. Trends Microbiol. 23, 479–489. doi: 10.1016/j.tim.2015.03.010
18. Langille, M. G., Hsiao, W. W., and Brinkman, F. S. (2010). Detecting genomic islands using bioinformatics approaches. Nat. Rev. Microbiol. 8:373. doi: 10.1038/nrmicro2350
19. Levin, B. R., and Bull, J. J. (2004). Population and evolutionary dynamics of phage therapy. Nat. Rev. Microbiol. 2, 166–173. doi: 10.1038/nrmicro822
20. Lima-Mendez, G., Van Helden, J., Toussaint, A., and Leplae, R. (2008). Prophinder: a computational tool for prophage prediction in prokaryotic genomes. Bioinformatics 24, 863–865. doi: 10.1093/bioinformatics/btn043
21. Lu, B., and Leong, H. W. (2016). Computational methods for predicting genomic islands in microbial genomes. Comput. Struct. Biotechnol. J. 14, 200–206. doi: 10.1016/j.csbj.2016.05.001
22. Mann, N. H., Cook, A., Millard, A., Bailey, S., and Clokie, M. (2003). Bacterial photosynthesis genes in a virus. Nature 423, 741–741. doi: 10.1038/424741a
23. Metzker, M. L. (2010). Sequencing technologies–the next generation. Nat. Rev. Genet. 11:31. doi: 10.1038/nrg2626
24. Moulavi, D., Jaskowiak, P. A., Campello, R. J. G. B., Zimek, A., and Sander, J. (2014). “Density-based clustering validation,” in Proceedings of the 2014 SIAM International Conference on Data Mining (Philadelphia, PA: Society for Industrial and Applied Mathematics). doi: 10.1137/1.9781611973440.96
25. Ochman, H., Lawrence, J. G., and Groisman, E. A. (2000). Lateral gene transfer and the nature of bacterial innovation. Nature 405:299. doi: 10.1038/35012500
26. Rohwer, F. (2003). Global phage diversity. Cell 113, 53–65. doi: 10.1016/S0092-8674(03)00276-9
27. Rousseeuw, P. J. (1987). Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65. doi: 10.1016/0377-0427(87)90125-7
28. Schofield, D. A., Sharp, N. J., and Westwater, C. (2012). Phage-based platforms for the clinical detection of human bacterial pathogens. Bacteriophage 2, 105–121. doi: 10.4161/bact.19274
29. Srividhya, K. V., Alaguraj, V., Poornima, G., Kumar, D., Singh, G. P., Raghavenderan, L., et al. (2007). Identification of prophages in bacterial genomes by dinucleotide relative abundance difference. PLoS One 2:1193. doi: 10.1371/journal.pone.0001193
30. Sullivan, M. B., Waterbury, J. B., and Chisholm, S. W. (2003). Cyanophages infecting the oceanic cyanobacterium Prochlorococcus. Nature 423, 1047–1051. doi: 10.1038/nature01929
31. Tozzoli, R., Grande, L., Michelacci, V., Ranieri, P., Maugliani, A., Caprioli, A., et al. (2014). Shiga toxin-converting phages and the emergence of new pathogenic Escherichia coli: a world in motion. Front. Cell. Infect. Microbiol. 4:80. doi: 10.3389/fcimb.2014.00080
32. Winter, G., Griffiths, A. D., Hawkins, R. E., and Hoogenboom, H. R. (1994). Making antibodies by phage display technology. Annu. Rev. Immunol. 12, 433–455. doi: 10.1146/annurev.iy.12.040194.002245
33. Zhou, Y., Liang, Y., Lynch, K. H., Dennis, J. J., and Wishart, D. H. (2011). PHAST: a fast phage search tool. Nucleic Acids Res. 39, 347–352. doi: 10.1093/nar/gkr485

Summary

Keywords

phage, prophage, clustering, web interface, web service, characterization, bacterial genome

Citation

Sousa AL, Maués D, Lobato A, Franco EF, Pinheiro K, Araújo F, Pantoja Y, Costa da Silva AL, Morais J and Ramos RTJ (2018) PhageWeb – Web Interface for Rapid Identification and Characterization of Prophages in Bacterial Genomes. Front. Genet. 9:644. doi: 10.3389/fgene.2018.00644

Received

21 August 2018

Accepted

27 November 2018

Published

18 December 2018

Volume

9 - 2018

Edited by

Helder Nakaya, University of São Paulo, Brazil

Reviewed by

Luciane Schons da Fonseca, Massachusetts Institute of Technology, United States; Yu Xue, Huazhong University of Science and Technology, China

This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Rommel T. J. Ramos, rommelthiago@gmail.com

This article was submitted to Bioinformatics and Computational Biology, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
