<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Bioinform.</journal-id>
<journal-title>Frontiers in Bioinformatics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Bioinform.</abbrev-journal-title>
<issn pub-type="epub">2673-7647</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">932319</article-id>
<article-id pub-id-type="doi">10.3389/fbinf.2022.932319</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Bioinformatics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Bacteriophage Genetic Edition Using LSTM</article-title>
<alt-title alt-title-type="left-running-head">Ataee et al.</alt-title>
<alt-title alt-title-type="right-running-head">Bacteriophage Genetic Edition Using LSTM</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Ataee</surname>
<given-names>Shabnam</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1766275/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Brochet</surname>
<given-names>Xavier</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1832150/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Pe&#xf1;a-Reyes</surname>
<given-names>Carlos Andr&#xe9;s</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1675763/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Institute of Information and Communication Technology (IICT)</institution>, <institution>School of Management and Engineering Vaud (HEIG-VD)</institution>, <addr-line>Yverdon-les-Bains</addr-line>, <country>Switzerland</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>HES-SO University of Applied Sciences and Arts Western Switzerland</institution>, <addr-line>Del&#x00E9;mont</addr-line>, <country>Switzerland</country>
</aff>
<aff id="aff3">
<sup>3</sup>
<institution>CI4CB&#x2014;Computational Intelligence for Computational Biology</institution>, <institution>SIB&#x2014;Swiss Institute of Bioinformatics</institution>, <addr-line>Lausanne</addr-line>, <country>Switzerland</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1491704/overview">Andrea Tangherloni</ext-link>, University of Bergamo, Italy</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1829851/overview">Daniele Maria Papetti</ext-link>, University of Milano-Bicocca, Italy</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1838057/overview">Simone Giovanni Riva</ext-link>, University of Oxford, United Kingdom</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Carlos Andr&#xe9;s Pe&#xf1;a-Reyes, <email>carlos.pena@heig-vd.ch</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Integrative Bioinformatics, a section of the journal Frontiers in Bioinformatics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>13</day>
<month>07</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>2</volume>
<elocation-id>932319</elocation-id>
<history>
<date date-type="received">
<day>29</day>
<month>04</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>06</day>
<month>06</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Ataee, Brochet and Pe&#xf1;a-Reyes.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Ataee, Brochet and Pe&#xf1;a-Reyes</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Bacteriophages are gaining increasing interest as antimicrobial tools, largely due to the emergence of multi-antibiotic&#x2013;resistant bacteria. Although their huge diversity and virulence make them particularly attractive for targeting a wide range of bacterial pathogens, it is difficult to select suitable phages due to their high specificity which limits their host range. In addition, other challenges remain such as structural fragility under certain environmental conditions, immunogenicity of phage therapy, or development of bacterial resistance. The use of genetically engineered phages may reduce characteristics that hinder prophylactic and therapeutic applications of phages. Nowadays, there is no systematic method to modify a given phage genome conferring its sought characteristics. We explore the use of artificial intelligence for this purpose as it has the potential to both guide and accelerate genome modification to generate phage variants with unique properties that overcome the limitations of natural phages. We propose an original architecture composed of two deep learning&#x2013;driven components: a phage&#x2013;bacterium interaction predictor and a phage genome-sequence generator. The former is a multi-branch 1-D convolutional neural network (1D-CNN) that analyses phage and bacterial genomes to predict interactions. The latter is a recurrent neural network, more particularly a long short-term memory (LSTM), that performs genomic modifications to a phage to offer substantial host range improvement. For this component, we developed two different architectures composed of one or two stacked LSTM layers with 256 neurons each. These generators are used to modify, more precisely to rewrite, the genome sequence of 42 selected phages, while the predictor is used to estimate the host range of the modified bacteriophages across 46 strains of <italic>Pseudomonas aeruginosa</italic>. The proposed generators, trained with an average accuracy of 96.1%, are able to improve the host range for an average of 18 phages among the 42 under study, increasing both their average host range, by 73.0 and 103.7%, and the maximum host ranges from 21 to 24 and 29, respectively. These promising results showed that the use of deep learning methodologies allows genetic modification of phages to extend, for instance, their host range, confirming the potential of these approaches to guide bacteriophage engineering.</p>
</abstract>
<kwd-group>
<kwd>phage therapy</kwd>
<kwd>genome sequences</kwd>
<kwd>deep learning</kwd>
<kwd>1D-CNN</kwd>
<kwd>LSTM</kwd>
<kwd>generative model</kwd>
<kwd>genetically engineered phages</kwd>
</kwd-group>
<contract-sponsor id="cn001">Schweizerischer Nationalfonds Zur F&#xf6;rderung der Wissenschaftlichen Forschung<named-content content-type="fundref-id">10.13039/501100001711</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Phage therapy (PT) is a therapeutic approach to treat patients with bacterial infections. It is based on the use of viruses, called bacteriophages (or phages), to infect and kill pathogenic bacteria throughout their lifecycle (<xref ref-type="bibr" rid="B38">Matsuzaki et al., 2005</xref>). Present in all ecosystems, bacteriophages are viruses that naturally and specifically infect bacteria and are, therefore, unable to infect eukaryotic cells. This therapy was developed more than a century ago with the discovery of phages by researchers Frederick Twort (<xref ref-type="bibr" rid="B55">Twort, 1915</xref>) and F&#xe9;lix d&#x2019;Herelle (1917) (<xref ref-type="bibr" rid="B16">D&#x27;Herelle, 2011</xref>). After promising successes, it was abandoned in favor of antibiotic therapy. However, in recent years, there has been a renewed interest in PT due to the emergence of nosocomial infections with antimicrobial-resistant (AMR) bacteria and the lack of new effective antibiotics. AMR is now considered by the World Health Organization as one of the greatest threats to global health, food security, and development [antibiotic resistance (<xref ref-type="bibr" rid="B59">WHO, 2021</xref>)]. This problem has been concretely measured both by the Centers for Disease Control (CDC), which reported more than 2.8 million antibiotic-resistant infections each year in the United States, causing more than 35,000 deaths (<xref ref-type="bibr" rid="B12">CDC, 2019</xref>) and by the United Kingdom, which commissioned a report on AMR in 2016 projecting to cause 10 million deaths per year by 2050. The concept of PT is to correctly match a bacterium with one or more phages capable of infecting and killing it.</p>
<p>After adsorption of the phage onto the target bacterium, the phage transfers its genome (viral nucleic acid, either DNA or RNA) into the bacterial cytoplasm. During the lytic cycle, the DNA is then transcribed, translated, and copied, to be assembled into viral particles (hijacking the bacterial replication machinery). In general, when a critical mass of viral particles is reached, bacterial lysis is actively triggered via lytic proteins that disrupt the bacterial wall, allowing the release of new viruses and leading to the death of the bacterium (<xref ref-type="bibr" rid="B18">Delbr&#xfc;ck, 1940</xref>; <xref ref-type="bibr" rid="B57">Weinbauer, 2004</xref>). The advantages of PT are numerous and linked to the very nature of phages. These viruses, used as therapeutic agents, are able to regulate themselves, at sites of infection and once the bacteria are killed, phages do not replicate and can be rapidly eliminated by the immune system or other mechanisms.</p>
<p>It is estimated that there are about 10<sup>31</sup> phages on Earth (<xref ref-type="bibr" rid="B58">Weitz et al., 2013</xref>; <xref ref-type="bibr" rid="B36">Mann, 2005</xref>) and they are in constant co-evolution with bacteria, which makes them a potentially inexhaustible source in nature. It is therefore theoretically possible to isolate new phages for most types of bacteria. This is important because new variants of pathogenic bacteria are appearing, leading to complicated therapeutic situations, especially with the emergence of multi-antibiotic&#x2013;resistant bacteria. Moreover, the mechanisms of action of phages seem to be independent of those of antibiotics, and they do not provide selective pressure likely to increase antibiotic resistance. Finally, their narrow specificity against bacterial strains allows them to have a negligible impact on the patient&#x2019;s microbiota (commensal flora). The host range of a phage is the spectrum or number of strains of bacterial species that a given specific phage can infect. Phages exhibit a narrow host range, and each phage can only infect a small number of bacteria, and therefore the use of a single phage has a low probability of being able to treat infections caused by several bacteria (<xref ref-type="bibr" rid="B39">Matsuzaki et al., 2014</xref>; <xref ref-type="bibr" rid="B42">Nilsson, 2014</xref>; <xref ref-type="bibr" rid="B37">Mapes et al., 2016</xref>). It is therefore essential to precisely identify the bacteria responsible for the infection before implementing the PT and to use a combination of several phages (phage cocktail) (<xref ref-type="bibr" rid="B38">Matsuzaki et al., 2005</xref>), which increases the number of targeted bacterial strains, that is, the spectrum of action of the treatment and reduces the rate of evolution of resistance to phages (<xref ref-type="bibr" rid="B21">Filippov et al., 2011</xref>; <xref ref-type="bibr" rid="B22">Gu et al., 2019</xref>; <xref ref-type="bibr" rid="B45">Ramirez et al., 2020</xref>).</p>
<p>However, the selection of adequate phages (discovery, isolation, and characterization) is time-consuming and requires laborious regulatory approval (<xref ref-type="bibr" rid="B54">Socher et al., 2011</xref>), which makes it one of the main limiting steps of PT. One forward-thinking modernization of phage therapy involves genetically modifying phages to overcome the limited efficiency of natural phages. In the last few years, the main genetic modifications applied to phages generally include i) mutations in genes (<xref ref-type="bibr" rid="B60">Yehl et al., 2019</xref>), ii) the partial or full replacement of genes (<xref ref-type="bibr" rid="B34">Mahichi et al., 2009</xref>; <xref ref-type="bibr" rid="B33">Lin et al., 2012</xref>; <xref ref-type="bibr" rid="B20">Dunne et al., 2019</xref>), and iii) the insertion of foreign genes (<xref ref-type="bibr" rid="B4">Bikard et al., 2014</xref>; <xref ref-type="bibr" rid="B43">Pei and Lamas-Samanamud, 2014</xref>; <xref ref-type="bibr" rid="B61">Yosef et al., 2015</xref>; <xref ref-type="bibr" rid="B27">Lam et al., 2021</xref>) using molecular techniques such as homologous recombination (HR) and genome rebooting with the aim of mainly improving the host range or enhancing the antibacterial effect of phages. For more details on the methodologies used by genetic engineering, we suggest recent reviews (<xref ref-type="bibr" rid="B14">Chen et al., 2019</xref>; <xref ref-type="bibr" rid="B23">Guo et al., 2021</xref>; <xref ref-type="bibr" rid="B29">Lenneman et al., 2021</xref>). All this research led to the first success in phage genetic engineering in 2019. In the context of the treatment of a 15-year-old lung transplant recipient (<xref ref-type="bibr" rid="B17">Dedrick et al., 2019</xref>), some genes were removed from phages to increase their activity. Such genetically engineered (GE) phages can therefore provide substantial advantages over natural phages in terms of host range, immune system recognition, and environmental stability. Phage engineering could provide a rapid strategy to generate phages with unique properties, and thus accelerate the development of PT, provided that a sound methodology is developed to suggest appropriate modifications to be applied to phage genomes. Currently, there is no such systematic method to guide the design of genetically engineered phages. Phages exhibit unparalleled genetic diversity, which makes it extremely complicated to factor in all possible variables for creating GE phages exhibiting expected properties. New technologies are needed to accelerate the design&#x2013;build&#x2013;test cycle for engineering phages and to make it possible to translate proof-of-concept academic work more efficiently into real-world use.</p>
<p>In this context, we explore herein the application of artificial intelligence (AI) which has the potential to both guide and accelerate genome modifications to generate phage variants with unique properties that overcome the limitations of natural phages. Until now, the application of AI to phage biology mainly concerns automated recovery, prediction, and classification of bacteriophages (<xref ref-type="bibr" rid="B46">Ren et al., 2017</xref>; <xref ref-type="bibr" rid="B2">Amgarten et al., 2018</xref>; <xref ref-type="bibr" rid="B15">Chibani et al., 2019</xref>; <xref ref-type="bibr" rid="B26">Kieft et al., 2020</xref>; <xref ref-type="bibr" rid="B51">Shang et al., 2021</xref>) predicting phage&#x2013;bacteria interactions or host prediction at the genus, species, and/or strain levels (<xref ref-type="bibr" rid="B28">Leite et al., 2018</xref>; <xref ref-type="bibr" rid="B3">Ataee et al., 2020</xref>; <xref ref-type="bibr" rid="B5">Boeckaerts et al., 2021</xref>; <xref ref-type="bibr" rid="B32">Li et al., 2021</xref>; <xref ref-type="bibr" rid="B62">Zhou et al., 2022</xref>), bacteriophage lifecycle (<xref ref-type="bibr" rid="B40">McNair et al., 2012</xref>; <xref ref-type="bibr" rid="B56">Tynecki et al., 2020</xref>), or the identification of viral sequences (<xref ref-type="bibr" rid="B48">Seguritan et al., 2012</xref>; <xref ref-type="bibr" rid="B35">Manavalan et al., 2018</xref>; <xref ref-type="bibr" rid="B11">Cantu et al., 2020</xref>; <xref ref-type="bibr" rid="B41">Meng et al., 2020</xref>).</p>
<p>We propose a novel approach, dubbed PERPHECT (for Deep Generative Networks for Bacteriophage Genetic Edition), aiming to genetically engineer bacteriophages to enhance the activity of resulting phages (<xref ref-type="bibr" rid="B3">Ataee et al., 2020</xref>) (<xref ref-type="bibr" rid="B3">Ataee et al., 2020</xref>). The case study will focus on increasing the host range of phages. To achieve this, we combine state-of-the-art techniques from deep learning: a phagi&#x2013;bacterium interaction predictor, the PERPHECT predictor, and a phage genome-sequence generator, the PERPHECT generator. Therefore, to evaluate the ability of our generator, we compare the host range of phages determined experimentally with their predicted host range computed by counting the number of bacterial strains predicted as positive by the PERPHECT predictor after rewriting their genomes. A key point of the PERPHECT architecture is that its two fundamental components are loosely coupled. With this property, different methods and approaches could be used to implement either or both the predictor and the generator.</p>
<p>In this article, we explore and evaluate the adequacy of a deep learning model to implement the generator component. For this purpose, we take advantage of LSTMs, a special type of recurrent neural network (<xref ref-type="bibr" rid="B50">Servan-Schreiber et al., 1988</xref>; <xref ref-type="bibr" rid="B49">Servan-Schreiber et al., 1991</xref>), to process long genomic sequences to capture high-level structures contained within them. LSTMs (<xref ref-type="bibr" rid="B24">Hochreiter and Schmidhuber, 1997</xref>; <xref ref-type="bibr" rid="B52">Sherstinsky, 2020</xref>) are extremely powerful deep learning models used to capture long-range dependencies since they are made of memory units allowing to save important features. LSTMs are particularly used in natural language processing (NLP), in applications such as paraphrase detection (<xref ref-type="bibr" rid="B54">Socher et al., 2011</xref>), speech recognition (<xref ref-type="bibr" rid="B31">Li et al., 2015</xref>), language modeling (<xref ref-type="bibr" rid="B53">Siami-Namini et al., 2018</xref>), text generation (<xref ref-type="bibr" rid="B47">Santhanam, 2018</xref>), but also for genome modeling (<xref ref-type="bibr" rid="B30">Li, 2019</xref>), or temporal data analysis (<xref ref-type="bibr" rid="B1">Abdel-Nasser and Mahmoud, 2017</xref>; <xref ref-type="bibr" rid="B19">Deng et al., 2020</xref>). The PERPHECT architecture paves the way for the use of deep learning methodologies to genetically modify phages and extend, for example, their host range, thus confirming the potential of these approaches to guide bacteriophage engineering.</p>
</sec>
<sec id="s2">
<title>2 Methods</title>
<p>As already mentioned, the PERPHECT architecture is formed of two fundamental components (schematically represented in <xref ref-type="fig" rid="F1">Figure 1</xref>):<list list-type="simple">
<list-item>
<p>&#x2022; The PERPHECT predictor, used to predict potential interactions between bacteria and phages based solely on their genomic information, is a deep learning model, composed of a multi-branch 1-D CNN (<xref ref-type="bibr" rid="B9">Brownlee, 2018</xref>) (<xref ref-type="fig" rid="F2">Figure 2</xref>). The proposed predictor was trained, validated, and tested on a dataset composed of 7,720 interactions between 227 bacteria and 3,208 phages. Its evaluation results showed performance figures as follows: 85% accuracy, 85% recall, 72% precision, and 78% f1-score on the test set. This predictor is explained in more detail (<xref ref-type="bibr" rid="B3">Ataee et al., 2020</xref>).</p>
</list-item>
<list-item>
<p>&#x2022; The PERPHECT generator modifies existing phage genome sequences to improve their predicted host range. In this article, we concentrate on this component, presenting a novel model based on LSTM (<xref ref-type="bibr" rid="B8">Brownlee, 2017a</xref>), used to learn the context from the input sequence in order to make predictions. The model is conceived so that it is able to complete a genome sequence starting from a seed of <italic>n</italic> nucleotides and then predicting, iteratively one-by-one, the next nucleotides in the sequence.</p>
</list-item>
</list>
</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>PERPHECT model architecture. PERPHECT architecture is formed of two fundamental components: i) The PERPHECT predictor used to predict potential interactions between bacteria and phages based solely on their genomic information and composed of a multi-branch 1-D CNN and ii) the PERPHECT generator used to modify existing phage genome sequences to improve their predicted host range and composed of LSTMs.</p>
</caption>
<graphic xlink:href="fbinf-02-932319-g001.tif"/>
</fig>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Phage&#x2013;bacterium interaction predictor (<xref ref-type="bibr" rid="B3">Ataee, Rodriguez, Brochet, &#x26; Pena, 2020</xref>). The predictor model is composed of a stack of 1-D CNNs. The predictor architecture has a non-linear network topology. The two inputs (bacteria genome sequences and phages genome sequences) are processed separately by two parallel convolutional branches whose outputs are then merged and passed through two sub-sequential dense layers. A dropout layer is also used to reduce overfitting and to improve the generalization of the proposed deep neural network.</p>
</caption>
<graphic xlink:href="fbinf-02-932319-g002.tif"/>
</fig>
<sec id="s2-1">
<title>2.1 Data</title>
<p>Among hundreds of bacterial genomes available in our dataset, we selected 46, belonging to the <italic>Pseudomonas aeruginosa</italic> species, as a target for phage infection. It is the species of the <italic>Pseudomonas</italic> genus that most often causes infections in humans. Unfortunately, many <italic>Pseudomonas</italic> infections are becoming difficult to treat as they are increasingly showing antibiotic resistance. Among all the interactions in our dataset, there are 42 phages able to infect at least one of the 46 <italic>Pseudomonas aeruginosa</italic> strains. We selected these phages as the, potentially, best candidates for genome modification. The genome sequences of 46 bacteria and 42 bacteriophages under study as well as the actual interaction between any bacterium and any phage under study are extracted from experimental results obtained by one of our partners during a previous project (<xref ref-type="bibr" rid="B28">Leite et al., 2018</xref>). Consequently, the new, specific dataset is composed of genome sequences for all 46 bacterial strains and 42 phages under study, as well as all the 1,932 interaction values, among which there are 277 positive interactions. In this dataset, our PERPHECT predictor exhibits a good performance when predicting interactions between the selected bacterial strains and their phages <inline-formula id="inf1">
<mml:math id="m1">
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi mathvariant="bold-italic">accuracy</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>89</mml:mn>
<mml:mo>%</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi mathvariant="bold-italic">recall</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>59</mml:mn>
<mml:mo>%</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi mathvariant="bold-italic">specificity</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>95</mml:mn>
<mml:mo>%</mml:mo>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi mathvariant="bold-italic">precision</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>68</mml:mn>
<mml:mo>%</mml:mo>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<p>On this basis, the goal of the <italic>PERPHECT Generator</italic> is to modify, or more precisely to rewrite, the genomes of the phages under study in such a way that, after modification, they can infect as many strains as possible from the <italic>Pseudomonas aeruginosa</italic> species. In other words, we want to modify phages to maximize their <italic>host range</italic>. <xref ref-type="fig" rid="F3">Figure 3</xref> shows the distribution of the host range of the selected phages across the 46 bacterial hosts available in the dataset. Note that the host range takes on values between 1 and 22, but only 7 of the 42 phages can infect more than 10 bacterial strains.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Distribution of host ranges of phages under study. Distribution of the host range of the selected phages across the 46 bacterial hosts available in the dataset. Note that the host range takes on values between 1 and 22, but only 7 of the 42 phages can infect more than 10 bacterial strains.</p>
</caption>
<graphic xlink:href="fbinf-02-932319-g003.tif"/>
</fig>
<p>Training the generator model requires providing input&#x2013;output sub-sequences, where a given input subsequence of length <italic>n</italic> will serve to predict one output symbol, which is the next nucleotide in the sequence. Going through the whole genome sequence of a given phage of length <italic>N</italic> will, thus, generate (<italic>N - n</italic>) input&#x2013;output sub-sequences of length (<italic>n&#x2b;1</italic>) symbols, as illustrated in <xref ref-type="fig" rid="F4">Figure 4</xref>. The value <italic>n</italic>, called <italic>seed length</italic>, is defined as the number of nucleotide symbols that needs to be passed to the generator model to predict the next nucleotide. This process is applied to each of the 42 phages in our dataset and repeated for different seed lengths. We use four different values of seed length (200, 500, 700, and 1,000) to build generator models.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Creation of training data for the generator model. The training data are composed of input&#x2013;output sub-sequences, where a given input subsequence of length <italic>n</italic> will serve to predict one output symbol, which is the next nucleotide in the sequence. Going through the whole genome sequence of a given phage of length N will, thus, generate (N - n) input&#x2013;output sub-sequences of length (n&#x2b;1) symbols. The value n, called seed length, is defined as the number of nucleotide symbols that needs to be passed to the generator model to predict the next nucleotide.</p>
</caption>
<graphic xlink:href="fbinf-02-932319-g004.tif"/>
</fig>
<p>From an information point of view, genomic sequences contain four different symbols, representing DNA&#x2019;s nucleotides (i.e., &#x201c;A,&#x201d; &#x201c;C,&#x201d; &#x201c;G,&#x201d; and &#x201c;T&#x201d;). Actual sequences may contain degenerate base symbols which represent more than one potential nucleotide at a given position. As such symbols are very rare in the phage sequences of our dataset, we decided not to keep them as separate symbols but to replace them randomly with one of their possible representations according to the <italic>IUPAC degenerate base symbols</italic> table (<xref ref-type="bibr" rid="B25">Nomenclature, 1970</xref>). Finally, the sequences of symbols are <italic>one-hot encoded</italic> to provide an input representation for the neural network. Under such encoding, nucleotides are represented by 4&#xa0;bits, each bit corresponding to one of the symbols (<xref ref-type="bibr" rid="B10">Brownlee, 2017b</xref>).</p>
</sec>
<sec id="s2-2">
<title>2.2 The Model Architecture</title>
<p>The proposed generator model is composed of an input layer, one or two hidden layers, and an output layer, as shown in <xref ref-type="fig" rid="F5">Figure 5</xref>. The <italic>input layer</italic> takes sequences with <italic>n</italic> positions (where <italic>n</italic> &#x3d; <italic>seed length</italic>) of four features, each counting for the one-hot encoded input sequences. Each <italic>hidden layer</italic> is an LSTM with 256 neurons. The <italic>output layer</italic> is a fully connected layer with <italic>softmax</italic> activation function to ensure the outputs are interpreted as the probability of membership for each class (i.e., the type of nucleotide). For this multi-class classification problem, we use commonly used parameters including <italic>cross-entropy</italic> as a loss function (<xref ref-type="bibr" rid="B6">Brownlee, 2020</xref>) and an <italic>Adam optimization algorithm</italic> (<xref ref-type="bibr" rid="B7">Brownlee, 2021</xref>). To avoid underfitting and overfitting during training, the number of epochs is considered as a <italic>hyperparameter</italic> and tuned using <italic>grid search</italic> while training a few models. The number of epochs is then fixed to the best number of epochs (130 epochs) to avoid underfitting/overfitting, and a <italic>python script</italic> is written to train all different models. Since in our experiments, we trained the proposed deep learning models hundreds of times (e.g., model A is trained 126 times: three times based on the genome sequence of each of the 42 phages under study), we did not use grid search to train all those models. The mean training accuracy and loss of the last three epochs are reported for each trained model. The trained models with a mean accuracy of the last three epochs higher than 70% are also saved. As shown in <xref ref-type="fig" rid="F5">Figure 5</xref>, we propose two different model architectures. They differ in the number of hidden layers but also in the number of phages used to train them. The experiments were developed and run using <italic>Python 3.8.5</italic>, <italic>Keras 2.4.3</italic>, and <italic>TensorFlow-GPU 2.2.0</italic>.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Different generator model architectures. Two different model architectures are proposed. They are composed of an input layer, one or two hidden layers, and an output layer. The input layer takes sequences with n positions (where n &#x3d; seed length) of four features, each counting for the one-hot encoded input sequences. Each hidden layer is an LSTM with 256 neurons. Architecture A is composed of one hidden layer, whereas the architecture B is composed of two hidden layers. The output layer is a fully connected layer with a softmax activation function to ensure the outputs are interpreted as the probability of membership for each class (i.e., the type of nucleotide).</p>
</caption>
<graphic xlink:href="fbinf-02-932319-g005.tif"/>
</fig>
<sec id="s2-2-1">
<title>2.2.1 Model Architecture A: A Single LSTM Layer</title>
<p>Model A has a single LSTM with 256 neurons as a hidden layer, and it is trained using genome sub-sequences from only one phage. This phage is selected among the 42 phages under study based on two criteria: i) It should be able to train the generator model with high accuracy and ii) it should have a good enough host range before modification. We trained the generator model A with each bacteriophage separately and repeated the process three times because of the stochastic nature of LSTM models. Then, among those phages whose trained model exhibits a mean accuracy higher than 55%, we selected the phage with the highest host range. Finally, amid the three models of the selected phage, the one with the highest accuracy is chosen as the generator model. As already mentioned, the whole process is repeated four times for the different seed length values: 200, 500, 700, and 1,000.</p>
</sec>
<sec id="s2-2-2">
<title>2.2.2 Model Architecture B: Two Stacked LSTM Layers</title>
<p>Model B has a more complex architecture as its hidden layer is composed of two stacked LSTMs, each with 256 neurons. Thanks to this complexity, it may be used to learn simultaneously from more than one phage genome sequence. Model <italic>A</italic>, with a single LSTM layer, was unable to learn on two or more phage genomes, and considering three or more LSTM layers will imply much more parameters, increasing the risk of overfitting the training data. Next arises the challenge of how to choose a set of two or more phage genomes for training. For this purpose, we apply two criteria: i) each of the selected phage genomes should be able, alone, to train the generator model with high accuracy and ii) their combined host range (i.e., the union of their host sets) should be good enough. To do so, we rank the phages according to their mean accuracy, obtained when learning model <italic>A</italic> and selecting the top phage. Then, we add the next top phage if and only if doing so increases the combined host range. This procedure is repeated for the top 10 candidate phages and guarantees obtaining the highest combined host range with the minimum possible number of phages. Each time a phage is aggregated to the list, we use the selected phages to train model <italic>B</italic>, and because of the stochastic nature of the models, we trained it three times. Finally, from the configuration with the highest mean accuracy, we selected the most accurate model as the final generator to be used to modify genome sequences. This process is repeated for the four different seed length values.</p>
</sec>
</sec>
<sec id="s2-3">
<title>2.3 Modifying Phage Genome Sequences</title>
<p>In this phase, the trained model (either model <italic>A</italic> or model <italic>B</italic>) is used to rewrite the genome sequence of all the 42 phages following the iterative process shown in <xref ref-type="fig" rid="F6">Figure 6</xref>. For each phage, we need to provide the sequence of the first <italic>n</italic> symbols (where <italic>n</italic> &#x3d; <italic>seed length</italic>) as input to the model to start the generation process. The input sequence is processed by the model to generate the next character in the sequence. Then, at each iteration, the last <italic>n</italic> symbols of this growing sequence are passed to the model to generate the next symbol. The process continues until the desired sequence length is attained. In this article, the phage genome sequences are completely rewritten, conserving their original length (comprised between 50,000 and 150,000 nucleotides). Note that as the generator model is trained to very closely reproduce the phage sequences used for training, the generated sequences are expected to be very similar to the original phages.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Iterative generation of a genomic sequence. Rewriting phase of the phage genomic sequence by a trained model (A or B). The sequence of the first n symbols (<italic>n</italic> &#x3d; seed length) is used as input to the model to start the generation process. The input sequence is processed by the model to generate the next character in the sequence. Then, at each iteration, the last n symbols of this growing sequence are passed to the model to generate the next symbol. The process continues until the desired sequence length is attained.</p>
</caption>
<graphic xlink:href="fbinf-02-932319-g006.tif"/>
</fig>
</sec>
</sec>
<sec id="s3">
<title>3 Results</title>
<p>The results of the phage genomic modification using the generator models <italic>A (single LSTM)</italic> and <italic>B (two-stacked LSTMs)</italic> are shown in <xref ref-type="table" rid="T1">Table 1</xref>. For both models and for the four seed lengths used, the following results are presented: the number and ID of the phages used to train the model together with the training accuracy, the number of phages whose host range has been extended, the percentage of host range improvement, the percentage of the phage genome that is modified and, finally, the maximum host range obtained with the corresponding generator. The proposed generators <italic>A</italic> and <italic>B</italic> are trained with an average accuracy of 94.3 and 98.7%, respectively, and are able to improve the host range for, respectively, an average of 18 and 17 phages among the 42 under study. The average host range of phages modified with model <italic>A</italic> is improved by 73%, while with model <italic>B</italic>, it is improved by 103.7%.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Summary of evaluation results for models A and B with four different seed length values. For both models and for the four seed lengths used, the following results are presented: the number and ID of the phages used to train the model together with the training accuracy, the number of phages whose host range has been extended, the percentage of host range improvement, the percentage of the phage genome that is modified, and finally, the maximum host range obtained with the corresponding generator.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Architecture</th>
<th align="center">Seed length</th>
<th align="center">No. of training phages</th>
<th align="center">Training phage IDs</th>
<th align="center">Training accuracy</th>
<th align="center">No. of phages with host range improvement (total &#x3d; 42)</th>
<th align="center">Average host range improvement (%)</th>
<th align="center">Maximum modified host range obtained</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="6" align="left">A</td>
<td rowspan="2" align="char" char=".">200</td>
<td align="left">1</td>
<td align="center">[5,294]</td>
<td align="char" char=".">0.89</td>
<td align="char" char=".">13</td>
<td align="char" char=".">75.5</td>
<td align="char" char=".">21</td>
</tr>
<tr>
<td align="left">1</td>
<td align="center">[5,296]</td>
<td align="char" char=".">
<bold>0.97</bold>
</td>
<td align="char" char=".">13</td>
<td align="char" char=".">57.1</td>
<td align="char" char=".">
<bold>24</bold>
</td>
</tr>
<tr>
<td rowspan="2" align="char" char=".">500</td>
<td align="left">1</td>
<td align="center">[5,309]</td>
<td align="char" char=".">0.95</td>
<td align="char" char=".">20</td>
<td align="char" char=".">
<bold>100</bold>
</td>
<td align="char" char=".">22</td>
</tr>
<tr>
<td align="left">1</td>
<td align="center">[5,284]</td>
<td align="char" char=".">0.92</td>
<td align="char" char=".">19</td>
<td align="char" char=".">61</td>
<td align="char" char=".">21</td>
</tr>
<tr>
<td align="char" char=".">700</td>
<td align="left">1</td>
<td align="center">[5,286]</td>
<td align="char" char=".">
<bold>0.97</bold>
</td>
<td align="char" char=".">20</td>
<td align="char" char=".">63.1</td>
<td align="char" char=".">21</td>
</tr>
<tr>
<td align="char" char=".">1,000</td>
<td align="left">1</td>
<td align="center">[5,294]</td>
<td align="char" char=".">0.96</td>
<td align="char" char=".">
<bold>22</bold>
</td>
<td align="char" char=".">81.2</td>
<td align="char" char=".">22</td>
</tr>
<tr>
<td rowspan="8" align="left">B</td>
<td rowspan="2" align="char" char=".">200</td>
<td align="left">Max: 3</td>
<td rowspan="2" align="center">[5,294, 5,296, 5,311]</td>
<td rowspan="2" align="char" char=".">0.98</td>
<td rowspan="2" align="char" char=".">10</td>
<td rowspan="2" align="char" char=".">53.5</td>
<td rowspan="2" align="char" char=".">22</td>
</tr>
<tr>
<td align="left">Selected: 3</td>
</tr>
<tr>
<td rowspan="2" align="char" char=".">500</td>
<td align="left">Max: 5</td>
<td rowspan="2" align="center">[5,309, 5,284]</td>
<td rowspan="2" align="char" char=".">0.99</td>
<td rowspan="2" align="char" char=".">9</td>
<td rowspan="2" align="char" char=".">61.3</td>
<td rowspan="2" align="char" char=".">24</td>
</tr>
<tr>
<td align="left">Selected: 2</td>
</tr>
<tr>
<td rowspan="2" align="char" char=".">700</td>
<td align="left">Max: 6</td>
<td rowspan="2" align="center">[5,286, 5,296, 5,319, 5,318, 5,323, 5,291]</td>
<td rowspan="2" align="char" char=".">0.98</td>
<td rowspan="2" align="char" char=".">24</td>
<td rowspan="2" align="char" char=".">100</td>
<td rowspan="2" align="char" char=".">23</td>
</tr>
<tr>
<td align="left">Selected: 6</td>
</tr>
<tr>
<td rowspan="2" align="char" char=".">1,000</td>
<td align="left">Max: 5</td>
<td rowspan="2" align="center">[5,294, 5,317, 5,291]</td>
<td rowspan="2" align="char" char=".">
<bold>0.995</bold>
</td>
<td rowspan="2" align="char" char=".">
<bold>26</bold>
</td>
<td rowspan="2" align="char" char=".">
<bold>200</bold>
</td>
<td rowspan="2" align="char" char=".">
<bold>29</bold>
</td>
</tr>
<tr>
<td align="left">Selected: 3</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>The best obtained results are shown in bold.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>A more detailed view of the effect of the PERPHECT generator on the host range of all the 42 phages is illustrated in <xref ref-type="fig" rid="F7">Figures 7</xref>&#x2013;<xref ref-type="fig" rid="F9">9</xref> for three configurations of interest (Note: The <xref ref-type="sec" rid="s10">Supplementary Figures S1&#x2013;S8</xref> present these results for all the configurations explored). In these graphics, green dots represent phages whose host range is improved after modification. <xref ref-type="fig" rid="F7">Figure 7</xref> shows the configuration exhibiting a maximum host range of 24, the highest for model <italic>A</italic>. It is obtained with <italic>n</italic> &#x3d; 200. <xref ref-type="fig" rid="F8">Figure 8</xref> presents the behavior of model <italic>A</italic> with a seed of 1,000 nucleotides. This configuration allows model <italic>A</italic> to improve the host range for 22 of the 42 phages under study, although the maximum host range is only 22. Finally, <xref ref-type="fig" rid="F9">Figure 9</xref> illustrates the behavior of the best configuration overall: model <italic>B</italic> (<italic>Two stacked LSTMs</italic>) with a seed of 1000 nucleotides which was trained on sequences from three phages. As shown in <xref ref-type="table" rid="T1">Table 1</xref>, it improves the host range of 26 of the 42 phages under study. This generator is also able to improve the maximum host range from 21 to 29 (a 38% improvement) with six of the phages exceeding 20 predicted interactions.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p> Impact of applying model A with a 200-nucleotide seed on phage sequences. Two models are trained based on the genome sequence of either phage 5294 <bold>(A)</bold> or phage 5296 <bold>(B)</bold> with a seed length of 200. These trained models are then used to modify the genome sequence of all 42 phages under study. The plots show the host range of these phages before and after modification using each trained model. The green spots show phages whose host range is improved after modification. The host ranges of 13 phages are improved after modifying with these two models.</p>
</caption>
<graphic xlink:href="fbinf-02-932319-g007.tif"/>
</fig>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption>
<p>Impact of applying model A with a 1000-nucleotide seed on phage sequences. The model is trained based on the genome sequence of phage 5294 with a seed length of 1000. The trained model is then used to modify the genome sequence of all 42 phages under study. The plot shows the host range of these phages before and after modification. The green spots show phages whose host range is improved after modification. The host ranges of 22 phages are improved after modifying with this model.</p>
</caption>
<graphic xlink:href="fbinf-02-932319-g008.tif"/>
</fig>
<fig id="F9" position="float">
<label>FIGURE 9</label>
<caption>
<p>Impact of applying model B with a 1000-nucleotide seed on phage sequences. The model is trained based on the genome sequence of three phages with IDs 5294, 5317, and 5291 with a seed length of 1000. The trained model is then used to modify the genome sequence of all 42 phages under study. The plot shows the host range of these phages before and after modification. The green spots show phages whose host range is improved after modification. The host ranges of 26 phages are improved after modifying with this model.</p>
</caption>
<graphic xlink:href="fbinf-02-932319-g009.tif"/>
</fig>
<p>In general, one can observe from these results that there is no, or little, correlation between the host ranges before and after the modification of the phages. Indeed, the largest host ranges after modification are not necessarily obtained with phages exhibiting the largest host range before modification. From the results, one can associate a higher model complexity (i.e., a stacked LSTM model, trained on several phages, and using longer seeds) with better performance.</p>
</sec>
<sec id="s4">
<title>4 Concluding Remarks</title>
<p>In this article, we proposed an innovative approach composed of two fundamental deep learning components: a phage&#x2013;bacterium interaction predictor and a phage genome-sequence generator. For the latter component, we developed two different artificial RNN models composed of either one or two stacked LSTM layers with 256 neurons each, while the genome sequence of either one or several phages participated, respectively, in the process of training the models. The proposed generators are used to engineer the genome sequences of 42 selected phages. The phage&#x2013;bacterium interaction predictor, composed of a multi-branch 1D-CNN, is then used to estimate the host range of the modified bacteriophages across 46 strains of the <italic>Pseudomonas aeruginosa</italic> species under study.</p>
<p>As shown by the evaluation results presented in <xref ref-type="table" rid="T1">Table 1</xref> and <xref ref-type="fig" rid="F7">Figures 7</xref>&#x2013;<xref ref-type="fig" rid="F9">9</xref>, both generator architectures are able to significantly improve the host range of the phage panel under study. For model <italic>A</italic>, based on a single LSTM hidden layer, the seed length used does not seem to have a strong effect on its performance and the most limiting issue appears to be its relatively limited learning capabilities. For model <italic>B</italic>, with two stacked LSTM hidden layers and trained on several phage genomes, the effect of the seed length is more apparent as the host range improvement for seeds of 700 and 1,000 nucleotides is markedly better with respect to the number of improved phages and the average improvement. Nevertheless, the effect on the maximum host range is substantially better only for the longest seed attaining 29. Exploring even the longest seeds could not provide additional advantages as the training accuracy of this model is already 99.5%. These encouraging performances give a green light to further usage of deep learning models, especially LSTM models, in guiding genetic editing of phages to improve their antibacterial power.</p>
<p>Our proposed approach is the first step toward a systematic method to guide the search for genetically engineered phages. Nevertheless, our approach is currently addressing one optimization goal, that is, maximizing phage interaction with target bacteria, while other fundamental criteria, such as minimizing noise, repeats, and non-informative code and maintaining the biological coherence of the organism are yet to be addressed. Some of them, such as noises, repeats, and even unnecessary code, could be tackled by including them in the loss function during the optimization process. Nevertheless, reinforcing the biological coherence of the resulting sequence would require more elaborated strategies. For example, the use of XAI (explainable AI) methods could allow to systematically detect phage-relevant hotspots/motifs in the predictor model. On this base, only essential code segments, selected on the basis of sequence similarity and alignment, will be allowed to integrate existing phage genomes.</p>
</sec>
</body>
<back>
<sec id="s5" sec-type="data-availability">
<title>Data Availability Statement</title>
<p>Full data will be released as part of a separate study. However, the source codes for data analysis and ML/DL models can be found here: zenodo, <ext-link ext-link-type="uri" xlink:href="http://10.5281/zenodo.6448195">10.5281/zenodo.6448195</ext-link>. Further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>SA is the data scientist who developed, implemented, and tested the methodologies described in the paper. XB is the bio-informatician who recommended and suggested the biologic aspect of the project. CP-R is the data-analyst expert who brings knowledge and recommendations for the field of AI. XB and CP-R came up with the main idea for the project. All authors read and approved the final manuscript.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>This project is funded by the Swiss National Science Foundation (FNS) (Grant number: CRSK-2_190298). Publication costs were funded by the Swiss National Science Foundation (FNS) (Grant number: CRSK-2_190298).</p>
</sec>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ack>
<p>The authors thank Gr&#xe9;gory Resch (Head of the laboratory of bacteriophages, Lausanne University Hospital) for the experimental data used in this project.</p>
</ack>
<sec id="s10">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fbinf.2022.932319/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fbinf.2022.932319/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.PDF" id="SM1" mimetype="application/PDF" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abdel-Nasser</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Mahmoud</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Accurate Photovoltaic Power Forecasting Models Using Deep LSTM-RNN</article-title>. <source>Neural Comput. Applic</source> <volume>31</volume>, <fpage>2727</fpage>&#x2013;<lpage>2740</lpage>. <pub-id pub-id-type="doi">10.1007/s00521-017-3225-z</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Amgarten</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Braga</surname>
<given-names>L. P. P.</given-names>
</name>
<name>
<surname>da Silva</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Setubal</surname>
<given-names>J. C.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>MARVEL, a Tool for Prediction of Bacteriophage Sequences in Metagenomic Bins</article-title>. <source>Front. Genet.</source> <volume>9</volume>, <fpage>304</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2018.00304</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Ataee</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Rodriguez</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Brochet</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Pena</surname>
<given-names>C. A.</given-names>
</name>
</person-group> (<year>2020</year>). &#x201c;<article-title>Towards BacterioPhage Genetic Edition: Deep Learning Prediction of Phage-Bacterium Interactions</article-title>,&#x201d; in <conf-name>2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)</conf-name>, <conf-date>December 16&#x2013;19, 2020</conf-date> (<publisher-loc>Seoul, Korea (South)</publisher-loc>. </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bikard</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Euler</surname>
<given-names>C. W.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Nussenzweig</surname>
<given-names>P. M.</given-names>
</name>
<name>
<surname>Goldberg</surname>
<given-names>G. W.</given-names>
</name>
<name>
<surname>Duportet</surname>
<given-names>X.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Exploiting CRISPR-Cas Nucleases to Produce Sequence-specific Antimicrobials</article-title>. <source>Nat. Biotechnol.</source> <volume>32</volume>, <fpage>1146</fpage>&#x2013;<lpage>1150</lpage>. <pub-id pub-id-type="doi">10.1038/nbt.3043</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Boeckaerts</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Stock</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Criel</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Gerstmans</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>De Baets</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Briers</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Predicting Bacteriophage Hosts Based on Sequences of Annotated Receptor-Binding Proteins</article-title>. <source>Sci. Rep.</source> <volume>11</volume> (<issue>1467</issue>), <fpage>1467</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-021-81063-4</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Brownlee</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>A Gentle Introduction to Cross-Entropy for Machine Learning</article-title>. <comment>[Online] Available at: <ext-link ext-link-type="uri" xlink:href="https://machinelearningmastery.com/cross-entropy-for-machine-learning/">https://machinelearningmastery.com/cross-entropy-for-machine-learning/</ext-link>
</comment>(<comment>Accessed 06, 2022)</comment>. </citation>
</ref>
<ref id="B7">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Brownlee</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Gentle Introduction to the Adam Optimization Algorithm for Deep Learning</article-title>. <comment>[Online] Available at: <ext-link ext-link-type="uri" xlink:href="https://machinelearningmastery.com/adam-optimization-algorithm-for-deep-learning/">https://machinelearningmastery.com/adam-optimization-algorithm-for-deep-learning/</ext-link>
</comment>(<comment>Accessed 06, 2022)</comment>. </citation>
</ref>
<ref id="B8">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Brownlee</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2017a</year>). <article-title>How to Develop a Character-Based Neural Language Model in Keras</article-title>. <comment>[Online] Available at: <ext-link ext-link-type="uri" xlink:href="https://machinelearningmastery.com/develop-character-based-neural-language-model-keras/">https://machinelearningmastery.com/develop-character-based-neural-language-model-keras/</ext-link>
</comment>(<comment>Accessed 06, 2022)</comment>. </citation>
</ref>
<ref id="B9">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Brownlee</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>How to Develop Convolutional Neural Network Models for Time Series Forecasting</article-title>. <comment>[Online] Available at: <ext-link ext-link-type="uri" xlink:href="https://machinelearningmastery.com/how-to-develop-convolutional-neural-network-models-for-time-series-forecasting/">https://machinelearningmastery.com/how-to-develop-convolutional-neural-network-models-for-time-series-forecasting/</ext-link>(Accessed 06, 2022)</comment>. </citation>
</ref>
<ref id="B10">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Brownlee</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2017b</year>). <article-title>Why One-Hot Encode Data in Machine Learning?</article-title> <comment>[Online] Available at: <ext-link ext-link-type="uri" xlink:href="https://machinelearningmastery.com/why-one-hot-encode-data-in-machine-learning/">https://machinelearningmastery.com/why-one-hot-encode-data-in-machine-learning/</ext-link>
</comment>(<comment>Accessed 06, 2022)</comment>. </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cantu</surname>
<given-names>V. A.</given-names>
</name>
<name>
<surname>Salamon</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Seguritan</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Redfield</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Salamon</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Edwards</surname>
<given-names>R. A.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>PhANNs, a Fast and Accurate Tool and Web Server to Classify Phage Structural Proteins</article-title>. <source>PLoS Comput. Biol.</source> <volume>16</volume> (<issue>11</issue>), <fpage>e1007845</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1007845</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="book">
<collab>CDC</collab> (<year>2019</year>). <source>
<italic>Antibiotic Resistance Threats In the United States,</italic> s.L.</source>. <publisher-loc>Atlanta, GA</publisher-loc>: <publisher-name>U.S. Department of Health and Human Services</publisher-name>. </citation>
</ref>
<ref id="B13">
<citation citation-type="web">
<collab>CERN and Open AIRE</collab> (<year>2013</year>). <article-title>Zenodo Open Repository</article-title>. <comment>[Online] Available at: <ext-link ext-link-type="uri" xlink:href="https://zenodo.org">https://zenodo.org</ext-link> (Accessed 06, 2022)</comment>. </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Batra</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Rao</surname>
<given-names>V. B.</given-names>
</name>
<name>
<surname>Tao</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Genetic Engineering of Bacteriophages against Infectious Diseases</article-title>. <source>Front. Microbiol.</source> <volume>10</volume> (<issue>954</issue>), <fpage>954</fpage>. <pub-id pub-id-type="doi">10.3389/fmicb.2019.00954</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Chibani</surname>
<given-names>C. M.</given-names>
</name>
<name>
<surname>Meiecke</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Farr</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Dietrich</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Liesegang</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2019</year>). <source>ClassiPhages 2.0: Sequence-Based Classification of Phages Using Artificial Neural Networks</source>. <publisher-name>Cold Spring Harbor Laboratory</publisher-name>. <pub-id pub-id-type="doi">10.1101/558171</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>D&#x27;Herelle</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>On an Invisible Microbe Antagonistic to Dysentery Bacilli. Note by M. F. d&#x2019;Herelle, Presented by M. Roux</article-title>. <source>Bacteriophage</source> <volume>1651</volume>, <fpage>3733.</fpage>. <pub-id pub-id-type="doi">10.4161/bact.1.1.14941</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dedrick</surname>
<given-names>R. M.</given-names>
</name>
<name>
<surname>Guerrero-Bustamante</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>Garlena</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Russell</surname>
<given-names>D. A.</given-names>
</name>
<name>
<surname>Ford</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Harris</surname>
<given-names>K.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Engineered Bacteriophages for Treatment of a Patient with a Disseminated Drug-Resistant Mycobacterium Abscessus</article-title>. <source>Nat. Med.</source> <volume>25</volume>, <fpage>730</fpage>&#x2013;<lpage>733</lpage>. <pub-id pub-id-type="doi">10.1038/s41591-019-0437-z</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Delbr&#xfc;ck</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>1940</year>). <article-title>The Growth of Bacteriophage and Lysis of the Host</article-title>. <source>General Physiol.</source> <volume>23</volume> (<issue>5</issue>), <fpage>643</fpage>&#x2013;<lpage>660</lpage>. </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Deng</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Shao</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Wind Power Forecasting Methods Based on Deep Learning: A Survey</article-title>. <source>Tech Sci. Press</source> <volume>122</volume> (<issue>1</issue>), <fpage>273</fpage>&#x2013;<lpage>301</lpage>. <pub-id pub-id-type="doi">10.32604/cmes.2020.08768</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dunne</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Rupf</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Tala</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Qabrati</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Ernst</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Reprogramming Bacteriophage Host Range through Structure-Guided Design of Chimeric Receptor Binding Proteins</article-title>. <source>Cell Rep.</source> <volume>29</volume> (<issue>5</issue>), <fpage>1336</fpage>&#x2013;<lpage>e4</lpage>. <pub-id pub-id-type="doi">10.1016/j.celrep.2019.09.062</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Filippov</surname>
<given-names>A. A.</given-names>
</name>
<name>
<surname>Sergueev</surname>
<given-names>K. V.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>X. Z.</given-names>
</name>
<name>
<surname>Gnade</surname>
<given-names>B. T.</given-names>
</name>
<name>
<surname>Mueller</surname>
<given-names>A. J.</given-names>
</name>
<etal/>
</person-group> (<year>2011</year>). <article-title>Bacteriophage-resistant Mutants in <italic>Yersinia pestis</italic>: Identification of Phage Receptors and Attenuation for Mice</article-title>. <source>PLoS One</source> <volume>6</volume> (<issue>9</issue>), <fpage>e25486</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0025486</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>G.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Identification of Novel Bacteriophage vB_EcoP-EG1 with Lytic Activity against Planktonic and Biofilm Forms of Uropathogenic <italic>Escherichia coli</italic>
</article-title>. <source>Appl. Microbiol. Biotechnol.</source> <volume>103</volume> (<issue>1</issue>), <fpage>315</fpage>&#x2013;<lpage>326</lpage>. <pub-id pub-id-type="doi">10.1007/s00253-018-9471-x</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guo</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Fan</surname>
<given-names>F.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Genetic and Chemical Engineering of Phages for Controlling Multidrug-Resistant Bacteria</article-title>. <source>Antibiot. (Basel)</source> <volume>10</volume> (<issue>2</issue>), <fpage>202</fpage>. <pub-id pub-id-type="doi">10.3390/antibiotics10020202</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hochreiter</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Schmidhuber</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Long Short-Term Memory</article-title>. <source>Neural comput.</source> <volume>9</volume> (<issue>8</issue>), <fpage>1735</fpage>&#x2013;<lpage>1780</lpage>. <pub-id pub-id-type="doi">10.1162/neco.1997.9.8.1735</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Iupac-Iub Comm. on Biochem. Nomenclature</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>1970</year>). <article-title>Abbreviations and Symbols for Nucleic Acids, Polynucleotides, and Their Constituents</article-title>. <source>Biochemistry</source> <volume>9</volume>, <fpage>4022</fpage>&#x2013;<lpage>4027</lpage>. <pub-id pub-id-type="doi">10.1021/bi00822a023</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kieft</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Anantharaman</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>VIBRANT: Automated Recovery, Annotation and Curation of Microbial Viruses, and Evaluation of Viral Community Function from Genomic Sequences</article-title>. <source>Microbiome</source> <volume>8</volume> (<issue>90</issue>), <fpage>90</fpage>. <pub-id pub-id-type="doi">10.1186/s40168-020-00867-0</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lam</surname>
<given-names>K. N.</given-names>
</name>
<name>
<surname>Spanogiannopoulos</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Soto-Perez</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Alexander</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Nalley</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Bisanz</surname>
<given-names>J. E.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Phage-delivered CRISPR-Cas9 for Strain-specific Depletion and Genomic Deletions in the Gut Microbiome</article-title>. <source>Cell Rep.</source> <volume>37</volume> (<issue>5</issue>), <fpage>109930</fpage>. <pub-id pub-id-type="doi">10.1016/j.celrep.2021.109930</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Leite</surname>
<given-names>D. M. C.</given-names>
</name>
<name>
<surname>Brochet</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Resch</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Que</surname>
<given-names>Y. A.</given-names>
</name>
<name>
<surname>Neves</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Pe&#xf1;a-Reyes</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Computational Prediction of Inter-species Relationships through Omics Data Analysis and Machine Learning</article-title>. <source>BMC Bioinforma.</source> <volume>19</volume> (<issue>420</issue>), <fpage>420</fpage>. <pub-id pub-id-type="doi">10.1186/s12859-018-2388-7</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lenneman</surname>
<given-names>B. R.</given-names>
</name>
<name>
<surname>Fernbach</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Loessner</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>T. K.</given-names>
</name>
<name>
<surname>Kilcher</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Enhancing Phage Therapy through Synthetic Biology and Genome Engineering</article-title>. <source>Curr. Opin. Biotechnol.</source> <volume>68</volume>, <fpage>151</fpage>&#x2013;<lpage>159</lpage>. <pub-id pub-id-type="doi">10.1016/j.copbio.2020.11.003</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Identifying Centromeric Satellites with Dna-Brnn</article-title>. <source>Bioinformatics</source> <volume>35</volume> (<issue>21</issue>), <fpage>4408</fpage>&#x2013;<lpage>4410</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btz264</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Mohamed</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Zweig</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Gong</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2015</year>). &#x201c;<article-title>LSTM Time and Frequency Recurrence for Automatic Speech Recognition</article-title>,&#x201d; in <conf-name>IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)</conf-name>. <pub-id pub-id-type="doi">10.1109/asru.2015.7404793</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>A Deep Learning-Based Method for Identification of Bacteriophage-Host Interaction</article-title>. <source>IEEE/ACM Trans. Comput. Biol. Bioinform</source> <volume>18</volume> (<issue>5</issue>), <fpage>1801</fpage>&#x2013;<lpage>1810</lpage>. <pub-id pub-id-type="doi">10.1109/TCBB.2020.3017386</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname>
<given-names>T. Y.</given-names>
</name>
<name>
<surname>Lo</surname>
<given-names>Y. H.</given-names>
</name>
<name>
<surname>Tseng</surname>
<given-names>P. W.</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>S. F.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>Y. T.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>T. S.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>A T3 and T7 Recombinant Phage Acquires Efficient Adsorption and a Broader Host Range</article-title>. <source>PLoS One</source> <volume>7</volume> (<issue>2</issue>), <fpage>e30954</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0030954</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mahichi</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Synnott</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Yamamichi</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Osada</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Tanji</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Site-specific Recombination of T2 Phage Using IP008 Long Tail Fiber Genes Provides a Targeted Method for Expanding Host Range while Retaining Lytic Activity</article-title>. <source>FEMS Microbiol. Lett.</source> <volume>295</volume> (<issue>2</issue>), <fpage>211</fpage>&#x2013;<lpage>217</lpage>. <pub-id pub-id-type="doi">10.1111/j.1574-6968.2009.01588.x</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Manavalan</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Shin</surname>
<given-names>T. H.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>PVP-SVM: Sequence-Based Prediction of Phage Virion Proteins Using a Support Vector Machine</article-title>. <source>Front. Microbiol.</source> <volume>9</volume>, <fpage>476</fpage>. <pub-id pub-id-type="doi">10.3389/fmicb.2018.00476</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mann</surname>
<given-names>N. H.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>The Third Age of Phage</article-title>. <source>PLoS Biol.</source> <volume>3</volume> (<issue>5</issue>), <fpage>e182</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pbio.0030182</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mapes</surname>
<given-names>A. C.</given-names>
</name>
<name>
<surname>Trautner</surname>
<given-names>B. W.</given-names>
</name>
<name>
<surname>Liao</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Ramig</surname>
<given-names>R. F.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Development of Expanded Host Range Phage Active on Biofilms of Multi-Drug Resistant <italic>Pseudomonas aeruginosa</italic>
</article-title>. <source>Bacteriophage</source> <volume>6</volume> (<issue>1</issue>), <fpage>e1096995</fpage>. <pub-id pub-id-type="doi">10.1080/21597081.2015.1096995</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Matsuzaki</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Rashel</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Uchiyama</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Sakurai</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ujihara</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Kuroda</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2005</year>). <article-title>Bacteriophage Therapy: a Revitalized Therapy against Bacterial Infectious Diseases</article-title>. <source>J. Infect. Chemother.</source> <volume>11</volume>, <fpage>211</fpage>&#x2013;<lpage>219</lpage>. <pub-id pub-id-type="doi">10.1007/s10156-005-0408-9</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Matsuzaki</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Uchiyama</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Takemura-Uchiyama</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Daibata</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Perspective: The Age of the Phage</article-title>. <source>Nature</source> <volume>509</volume>, <fpage>S9</fpage>. <pub-id pub-id-type="doi">10.1038/509S9a</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McNair</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Bailey</surname>
<given-names>B. A.</given-names>
</name>
<name>
<surname>Edwards</surname>
<given-names>R. A.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>PHACTS, a Computational Approach to Classifying the Lifestyle of Phages</article-title>. <source>Bioinformatics</source> <volume>28</volume> (<issue>5</issue>), <fpage>614</fpage>&#x2013;<lpage>618</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/bts014</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Meng</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Ye</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Zou</surname>
<given-names>Q.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Review and Comparative Analysis of Machine Learning-Based Phage Virion Protein Identification Methods</article-title>. <source>Biochim. Biophys. Acta Proteins Proteom</source> <volume>1868</volume> (<issue>6</issue>), <fpage>140406</fpage>. <pub-id pub-id-type="doi">10.1016/j.bbapap.2020.140406</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nilsson</surname>
<given-names>A. S.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Phage Therapy-Cconstraints and Possibilities</article-title>. <source>Ups. J. Med. Sci.</source> <volume>119</volume> (<issue>2</issue>), <fpage>192</fpage>&#x2013;<lpage>198</lpage>. <pub-id pub-id-type="doi">10.3109/03009734.2014.902878</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pei</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Lamas-Samanamud</surname>
<given-names>G. R.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Inhibition of Biofilm Formation by T7 Bacteriophages Producing Quorum-quenching Enzymes</article-title>. <source>Appl. Environ. Microbiol.</source> <volume>80</volume> (<issue>17</issue>), <fpage>5340</fpage>&#x2013;<lpage>5348</lpage>. <pub-id pub-id-type="doi">10.1128/AEM.01434-14</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ramirez</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Neuman</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Ramirez</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Bacteriophages as Promising Agents for the Biological Control of Moko Disease (Ralstonia Solanacearum) of Banana</article-title>. <source>Biol. Control</source> <volume>149</volume>, <fpage>104238</fpage>. <pub-id pub-id-type="doi">10.1016/j.biocontrol.2020.104238</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ren</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Ahlgren</surname>
<given-names>N. A.</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>Y. Y.</given-names>
</name>
<name>
<surname>Fuhrman</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>VirFinder: a Novel K-Mer Based Tool for Identifying Viral Sequences from Assembled Metagenomic Data</article-title>. <source>Microbiome</source> <volume>5</volume> (<issue>69</issue>), <fpage>69</fpage>. <pub-id pub-id-type="doi">10.1186/s40168-017-0283-5</pub-id> </citation>
</ref>
<ref id="B47">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Santhanam</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Context Based Text-Generation Using LSTM Networks (Version 1)</article-title>. <source>arXiv</source>. <pub-id pub-id-type="doi">10.48550/ARXIV.2005.00048</pub-id> </citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Seguritan</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Alves</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Arnoult</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Raymond</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Lorimer</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Burgin</surname>
<given-names>A. B.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Artificial Neural Networks Trained to Detect Viral and Phage Structural Proteins</article-title>. <source>PLoS Comput. Biol.</source> <volume>8</volume> (<issue>8</issue>), <fpage>e1002657</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1002657</pub-id> </citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Servan-Schreiber</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Cleeremans</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Mcclelland</surname>
<given-names>J. L.</given-names>
</name>
</person-group> (<year>1991</year>). <article-title>Graded State Machines: The Representation of Temporal Contingencies in Simple Recurrent Networks</article-title>. <source>Mach. Learn</source> <volume>7</volume>, <fpage>161</fpage>&#x2013;<lpage>193</lpage>. <pub-id pub-id-type="doi">10.1007/bf00114843</pub-id> </citation>
</ref>
<ref id="B50">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Servan-Schreiber</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Cleeremans</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>McClelland</surname>
<given-names>J. L.</given-names>
</name>
</person-group> (<year>1988</year>). &#x201c;<article-title>Learning Sequential Structure in Simple Recurrent Networks</article-title>,&#x201d; in <conf-name>Proceedings of the 1st International Conference on Neural Information Processing Systems (NIPS&#x2019;88)</conf-name>, <conf-loc>Denver, CO</conf-loc>, <conf-date>January 1, 1988</conf-date> (<publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>), <fpage>643</fpage>&#x2013;<lpage>652</lpage>. </citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Bacteriophage Classification for Assembled Contigs Using Graph Convolutional Network</article-title>. <source>Bioinformatics</source> <volume>37</volume> (<issue>1</issue>), <fpage>i25</fpage>&#x2013;<lpage>i33</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btab293</pub-id> </citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sherstinsky</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network</article-title>. <source>Phys. D. Nonlinear Phenom.</source> <volume>404</volume>, <fpage>132306</fpage>. <pub-id pub-id-type="doi">10.1016/j.physd.2019.132306</pub-id> </citation>
</ref>
<ref id="B53">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Siami-Namini</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Tavakoli</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Siami Namin</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2018</year>). &#x201c;<article-title>A Comparison of ARIMA and LSTM in Forecasting Time Series</article-title>,&#x201d; in <conf-name>17th IEEE International Conference on Machine Learning and Applications (ICMLA)</conf-name>, <conf-loc>Orlando, FL</conf-loc>, <conf-date>December 17&#x2013;20, 2018</conf-date>. </citation>
</ref>
<ref id="B54">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Socher</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>E. H.</given-names>
</name>
<name>
<surname>Pennington</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2011</year>). &#x201c;<article-title>Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection</article-title>,&#x201d; in <conf-name>Proceedings of the 24th International Conference on Neural Information Processing Systems</conf-name>, <conf-loc>Granada</conf-loc>, <conf-date>December 12&#x2013;15, 2011</conf-date> (<publisher-loc>Red Hook, NY, USA</publisher-loc>. <comment>NIPS&#x27;11)</comment>. </citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Twort</surname>
<given-names>F. W.</given-names>
</name>
</person-group> (<year>1915</year>). <article-title>An Investigation on the Nature of Ultra-microscopic Viruses</article-title>. <source>Lancet</source> <volume>186</volume> (<issue>4814</issue>), <fpage>1241</fpage>&#x2013;<lpage>1243</lpage>. <pub-id pub-id-type="doi">10.1016/s0140-6736(01)20383-3</pub-id> </citation>
</ref>
<ref id="B56">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Tynecki</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Guzis&#x0144;ki</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Kazimierczak</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Jadczuk</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Dastych</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Onisko</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2020</year>). <source>PhageAI - Bacteriophage Life Cycle Recognition with Machine Learning and Natural Language Processing</source>. <publisher-name>Cold Spring Harbor Laboratory</publisher-name>, <fpage>198606</fpage>. <pub-id pub-id-type="doi">10.1101/2020.07.11.198606</pub-id> </citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Weinbauer</surname>
<given-names>M. G.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Ecology of Prokaryotic Viruses</article-title>. <source>FEMS Microbiol. Rev.</source> <volume>28</volume> (<issue>2</issue>), <fpage>127</fpage>&#x2013;<lpage>181</lpage>. <pub-id pub-id-type="doi">10.1016/j.femsre.2003.08.001</pub-id> </citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Weitz</surname>
<given-names>J. S.</given-names>
</name>
<name>
<surname>Poist</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Meyer</surname>
<given-names>J. R.</given-names>
</name>
<name>
<surname>Hochberg</surname>
<given-names>M. E.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Phage-Bacteria Infection Networks</article-title>. <source>Trends Microbiol.</source> <volume>21</volume> (<issue>2</issue>), <fpage>82</fpage>&#x2013;<lpage>91</lpage>. <pub-id pub-id-type="doi">10.1016/j.tim.2012.11.003</pub-id> </citation>
</ref>
<ref id="B59">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Who</surname>
</name>
</person-group> (<year>2021</year>). <article-title>Antimicrobial Resistance</article-title>. <comment>[Online] Available at: <ext-link ext-link-type="uri" xlink:href="https://www.who.int/news-room/fact-sheets/detail/antimicrobial-resistance">https://www.who.int/news-room/fact-sheets/detail/antimicrobial-resistance</ext-link> (Accessed 06, 2022)</comment>. </citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yehl</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Lemire</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>A. C.</given-names>
</name>
<name>
<surname>Ando</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Mimee</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Torres</surname>
<given-names>M. T.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Engineering Phage Host-Range and Suppressing Bacterial Resistance through Phage Tail Fiber Mutagenesis</article-title>. <source>Cell</source> <volume>179</volume> (<issue>2</issue>), <fpage>459</fpage>&#x2013;<lpage>e9</lpage>. <pub-id pub-id-type="doi">10.1016/j.cell.2019.09.015</pub-id> </citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yosef</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Manor</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kiro</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Qimron</surname>
<given-names>U.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Temperate and Lytic Bacteriophages Programmed to Sensitize and Kill Antibiotic-Resistant Bacteria</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>112</volume> (<issue>23</issue>), <fpage>7267</fpage>&#x2013;<lpage>7272</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1500107112</pub-id> </citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname>
<given-names>F.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>PHISDetector: a Tool to Detect Diverse In Silico Phage-Host Interaction Signals for Virome Studies</article-title>. <source>Genomics, Proteomics Bioinforma.</source>. <pub-id pub-id-type="doi">10.1016/j.gpb.2022.02.003</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>