AUTHOR=Fiuza Tayna S. , Lima João P. M. S. , de Souza Gustavo A. TITLE=EpitoCore: Mining Conserved Epitope Vaccine Candidates in the Core Proteome of Multiple Bacteria Strains JOURNAL=Frontiers in Immunology VOLUME=Volume 11 - 2020 YEAR=2020 URL=https://www.frontiersin.org/journals/immunology/articles/10.3389/fimmu.2020.00816 DOI=10.3389/fimmu.2020.00816 ISSN=1664-3224 ABSTRACT=In reverse vaccinology approaches, complete proteomes of bacteria are submitted to multiple computational prediction steps in order to filter proteins that are possible vaccine candidates. Most available tools perform such analysis only in a single strain, or a very limited number of strains. But the vast amount of genomic data had shown that most bacteria contain pangenomes, i.e. their genomic information contains core, conserved genes, and random accessory genes specific to each strain. Therefore, in reverse vaccinology methods it is of the utmost importance to define core proteins and core epitopes. EpitoCore is a decision-tree pipeline developed to fulfill that need. It provides surfaceome prediction of proteins from related strains, defines clusters of core proteins within those, calculate the immunogenicity of such clusters, predicts epitopes for a given set of MHC alleles defined by the user, and then reports if epitopes are located extracellularly and if they are conserved among the core homologues. Pipeline performance is illustrated by mining peptide vaccine candidates in Mycobacterium avium hominissuis strains. From a total proteome of approximately 4,800 proteins per strain, EpitoCore mined 103 highly immunogenic core homologues located at cell surface, many of those related to virulence and drug resistance. Conserved epitopes identified among these homologues allows the users to define sets of peptides with potential to immunize the largest coverage of tested HLA alleles using peptide-based vaccines. Therefore, EpitoCore is able to provide automated identification of conserved epitopes in bacterial pangenomic datasets.