Original Research ARTICLE
A Computational Pipeline for the Extraction of Actionable Biological Information from NGS-Phage Display Experiments
- 1Institut National de la Santé et de la Recherche Médicale (INSERM), France
- 2Metabolic Engineering and Bioinformatics Program, Institute of Biology, Medicinal Chemistry and Bioetchnology, National Hellenic Research Foundation, Greece
- 3e-NIOS, Greece
Phage Display is a powerful method for the identification of peptide binding to targets of variable complexities and tissues, from unique molecules to the internal surfaces of vessels of living organisms. Particularly for in vivo screenings, the resulting repertoires can be very complex and difficult to study with traditional approaches. Next Generation Sequencing (NGS) opened the possibility to acquire high resolution overviews of such repertoires and thus facilitates the identification of binders of interest. Additionally, the ever-increasing amount of available genome/proteome information became satisfactory regarding the identification of putative mimicked proteins, due to the large scale on which partial sequence homology is assessed. However, the subsequent production of massive data stresses the need for high-performance computational approaches in order to perform standardized and insightful molecular network analysis. Systems-level analysis is essential for efficient resolution of the underlying molecular complexity and the extraction of actionable interpretation, in terms of systemic biological processes and pathways that are systematically perturbed.
In this work we introduce PepSimili, an integrated workflow tool, which performs mapping of massive peptide repertoires on whole proteomes and delivers a streamlined, systems-level biological interpretation. The tool employs modules for modeling and filtering of background noise due to random mappings and amplifies the biologically meaningful signal through coupling with BioInfoMiner, a systems interpretation tool that employs graph-theoretic methods for prioritization of systemic processes and corresponding driver genes. The current implementation exploits the Galaxy environment and is available online. A case study using public data is presented, with and without a control selection.
Keywords: phage display, Galaxy Platform, Enrichment analysis, Network analysis, gene ontology, Reactome, Biological interpretation
Received: 23 Nov 2018;
Accepted: 28 Aug 2019.
Copyright: © 2019 Vekris, Pilalis, Chatziioannou and Petry. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Prof. Aristotle Chatziioannou, National Hellenic Research Foundation, Metabolic Engineering and Bioinformatics Program, Institute of Biology, Medicinal Chemistry and Bioetchnology, Athens, Greece, email@example.com