MINI REVIEW article
Vaccines Meet Big Data: State-of-the-Art and Future Prospects. From the Classical 3Is (“Isolate–Inactivate–Inject”) Vaccinology 1.0 to Vaccinology 3.0, Vaccinomics, and Beyond: A Historical Overview
- 1Department of Health Sciences (DISSAL), School of Public Health, University of Genoa, Genoa, Italy
- 2Department of Experimental Medicine, Unit of Public Health, School of Specialization in Hygiene and Preventive Medicine, University of Perugia, Perugia, Italy
- 3Unit of Public Health, Department of Pharmaceutical Science, University of Perugia, Perugia, Italy
- 4Local Health Unit (LHU) ASL3 Genovese, Genoa, Italy
- 5Department of Medicine and Surgery, Pathology University Milan Bicocca, San Gerardo Hospital, Monza, Italy
- 6Medical Faculty, University of Parma, Parma, Italy
- 7Section of History of Medicine and Ethics, Department of Health Sciences, University of Genoa, Genoa, Italy
- 8Health Management and Economics Research Center, Iran University of Medical Sciences, Tehran, Iran
Vaccines are public health interventions aimed at preventing infections-related mortality, morbidity, and disability. While vaccines have been successfully designed for those infectious diseases preventable by preexisting neutralizing specific antibodies, for other communicable diseases, additional immunological mechanisms should be elicited to achieve a full protection. “New vaccines” are particularly urgent in the nowadays society, in which economic growth, globalization, and immigration are leading to the emergence/reemergence of old and new infectious agents at the animal–human interface. Conventional vaccinology (the so-called “vaccinology 1.0”) was officially born in 1796 thanks to the contribution of Edward Jenner. Entering the twenty-first century, vaccinology has shifted from a classical discipline in which serendipity and the Pasteurian principle of the three Is (isolate, inactivate, and inject) played a major role to a science, characterized by a rational design and plan (“vaccinology 3.0”). This shift has been possible thanks to Big Data, characterized by different dimensions, such as high volume, velocity, and variety of data. Big Data sources include new cutting-edge, high-throughput technologies, electronic registries, social media, and social networks, among others. The current mini-review aims at exploring the potential roles as well as pitfalls and challenges of Big Data in shaping the future vaccinology, moving toward a tailored and personalized vaccine design and administration.
Introduction: From the Classical 3Is “Isolate–Inactivate–Inject” Vaccinology 1.0 to Vaccinology 3.0, Vaccinomics, and Beyond
Vaccines are public health interventions aimed at preventing infections-related mortality, morbidity, and disability. As such, they represent a milestone of hygiene and preventive medicine (1). Since their implementation, they have managed to bring several health and economic benefits, both in developed and developing countries, significantly reducing the burden generated by infectious diseases (2). They have contributed to the eradication of smallpox and to the control of others infectious agents, such as polio. According to the estimates of the Global Alliance for Vaccines and Immunization (Gavi), they have contributed to avert up to 23.3 million projected deaths from 2011 to 2020, especially in Africa, Southeast Asia, and in the Eastern Mediterranean (3). Furthermore, they positively impact on perceived quality of life (3) and reduce inequity worldwide (1, 4).
While vaccines have been successfully designed for those infectious diseases preventable by preexisting neutralizing specific antibodies, for other communicable diseases, additional immunological mechanisms should be elicited to achieve a full protection. These additional mechanisms include the stimulation of effector and memory T lymphocytes, besides the release of antibodies by helper T cells-induced B cells (5). A better understanding of immune networks, their sophisticated tuning, and interactions is, as such, fundamental, in those vaccines against HIV/AIDS, malaria or tuberculosis, eluding classical vaccine development, which require new strategies and approaches (6).
“New vaccines” are particularly urgent in the nowadays society, in which economic growth, globalization, and immigration are leading to the emergence/reemergence of old and new infectious agents at the animal–human interface (7, 8).
Conventional vaccinology (the so-called “vaccinology 1.0”) was officially born in 1796 thanks to the contribution of Edward Jenner (1749–1823) and the pioneering discoveries of the New England Puritan minister Cotton Mather (1663–1728), and Lady Mary Wortley Montague (1689–1762), partially anticipated by Chinese and Indians different centuries before. The vaccine typical of vaccinology 1.0 is given by the rabies vaccine, the first human vaccine manufactured in 1885 in the laboratory (9). Other “first generation” vaccines are bacillus Calmette–Guérin (BCG), plague, pertussis, polio, and smallpox vaccines (9).
Entering the twenty-first century, vaccinology has shifted from a discipline in which serendipity and the Pasteurian principle of the three Is (isolate, inactivate, and inject) played a major role to a science, characterized by a rational design and plan (10).
If vaccinology 1.0 mainly consisted in isolating infectious agents, cultivating and inactivating them (as a whole or partially), and injecting the obtained product, vaccinology 2.0 utilizes purified microbial cell components. Example of “second generation” vaccines includes vaccines against tetanus, diphtheria, anthrax, pneumonia, influenza, hepatitis B, and Lyme disease (9). The transition from vaccinology 1.0 to vaccinology 2.0 has been made possible by several technological advancements, including genetic and protein engineering, recombinant DNA (11), polysaccharide and carbohydrate chemistry, combinatorial chemistry (12), among others.
Vaccinology 3.0 starts from the microbial genomic sequences (reverse vaccinology 1.0) or from the repertoire of protective human antibodies (reverse vaccinology 2.0) (13, 14). This shift has been possible thanks to omics data, which represent one type of Big Data, characterized by different aspects, such as enormous volume, velocity, and high variety of data (15).
High-throughput technologies-enabled omics disciplines [such as genomics and post-genomics specialties (16, 17), including transcriptomics, proteomics, metabolomics, cytomics, immunomics, secretomics, surfomics, or interactomics], briefly overviewed in Table 1, are able to produce a wealth of data and information, at a large-scale. Recently, these approaches have converged in what is termed vaccinomics, that is, to say the performance of large-scale, hypothesis-free, data-driven and holistic investigations. Poland and collaborators have defined vaccinomics as the “integration of immunogenetics and immunogenomics with systems biology and immune profiling” (18).
Table 1. The different genomic/post-genomic specialties and their potential role in the field of vaccinology.
New cutting-edge technologies include next-generation sequencing (NGS) techniques [RNASeq (19) and large-scale B- and T-cell receptor sequencing (20, 21)], mass cytometry (CyTOF) (22), and peptide/protein arrays (23). Data produced by molecular biology and NGS as well as by bioinformatics (24) can be used to perform mechanistic reductionist studies but can be also exploited to comprehensively capture immune dynamics and interactions (25), carrying out, for instance, network analysis or systems biology (the so-called “systems vaccinology”). Novel bioinformatics tools and new approaches are needed to better integrate the enormous wealth of data originated from omics experiments, making the shift from single-omics to multi-omics possible.
Furthermore, the actual era is characterized by the widespread diffusion of the new information and communication technologies (26): electronic health or eHealth refers to their exploitation as “a means to expand, to assist, or to enhance human activities, rather than as a substitute for them” (27). As omics experiments, eHealth generates as well an enormous wealth of data. Researchers have found that, usually, digital activities correlate with offline behaviors and other variables, such as vaccination knowledge and perception of own risk: for example, Betsch and Wicker (28), investigating a sample of 310 medical students found that explicitly surfing the Internet for vaccination risks-related websites led to fewer public health websites than generically searching for immunization practices.
Vaccinology has now entered a new phase, characterized by new challenges: within this new framework, Big Data hold promises and opportunities, which will be overviewed in the following paragraphs (Table 2; Figure 1).
Figure 1. The roles of Big Data in vaccination: from vaccine discovery and development (omics technologies and wet-lab approaches) to vaccination campaign and vaccine safety monitoring (electronic registries, social media/networks, and digital epidemiology).
Vaccine Discovery and Design: The Role of Big Data
Computational vaccinology (29, 30) and immunoinformatics (31), utilizing algorithms, enable experimental immunology to save time, focusing only on prescreened vaccine candidate antigens and, thus, avoiding cost, time-consuming, and labor intensive steps.
Different in silico tools exist, to aid and assist researchers in vaccine discovery and design (32, 33). Databases of vaccine candidates, such as MalVac (34), or Vaxar (35), software tools such as Vaxjo (36), VIOLIN (37–39), NERVE (40), Vaxign (41), Vacceed (42), Jenner-predict server (43), EpiToolKit (44), iVax (45), or VaxiJen (46), have been specifically implemented for vaccinology to enable prediction of vaccine antigens or adjuvants.
A successful example of rationally designed web-based vaccine is the vaccine against Neisseria meningitidis, commercially available with the trade name of Bexsero. For the selection of surface antigens, Masignani and collaborators (47) performed genome mining, using computational tools and algorithms, such as PSORT (48), PSI-BLAST (49), and FindPatterns to predict proteins with transmembrane domains, leader peptides, lipo-boxes and outer membrane anchoring motifs. At the end, 570 proteins were selected and GNA1870, a new surface-exposed lipoprotein inducing high levels of bactericidal antibodies, was discovered.
Reverse vaccinology technique is being applied also to other microorganisms, including Leptospira (50, 51), Streptococcus pneumoniae (52, 53), malaria (54), Schistosoma (55), Echinococcus granulosus (56), Rickettsia prowazekii (57), Mycobacterium (58), Acinetobacter baumannii (59), Escherichia coli (60), Staphylococcus aureus (61, 62), Corynebacterium pseudotuberculosis (63), Herpes simplex (64), Vibrio cholerae (65), and Cryptosporidium (65, 66), among others.
In the field of veterinary vaccinology, the reverse vaccinology approach is being applied, for instance, to organisms like cattle neosporosis (67, 68), Rhipicephalus microplus (69–71), Ehrlichia ruminantium (72, 73), and bovine herpesvirus 4 (74).
These computational approaches, using massive data mining techniques, rely on brute force (the so-called “test-all-to-lose-nothing” approach). Altindis and collaborators (75) have recently attempted to refine this framework, based on the idea that protective antigens share specific structural/functional features, termed as “protective signatures” or “immunosignatures,” differing from other pathogen components, in terms of immunological properties. Instead of focusing on protein localization, as in previous investigations, Altindis and coworkers concentrated their computational analyses on protein biological role and function. In this sense, their approach, termed as “protectome,” is protein localization unbiased, in that it leads to the identification of surface-exposed and secreted or cytoplasmic protective antigens.
Big Data and Vaccine Production and Delivery
After production, to properly preserve, store, handle, ship, and deliver vaccine supplies, it is fundamental to maintain cold chain from the manufacturer to the point of use, keeping temperatures within a precise range of values, and avoiding temperature excursions or fluctuations. Vaccines need, indeed, to be stored within a safe zone, namely, between 2 and 8°C (76–78), otherwise their quality is compromised, and their potency cannot be restored. According to the World Health Organization (WHO) and the United Nations Children’s Fund, approximately two percent of health-care facilities in low- and middle-income countries are equipped with proper functional technology for maintaining cold chain.
To solve these issues, Merck and Microsoft have, for example, established a collaboration, in which Merck exploits Microsoft R Server for Hadoop for analyzing, monitoring, and predicting variables that could affect the cold chain, including origin, destination, and delivery route as well as external weather and logistics providers, utilizing special thermal-protection containers equipped with temperature-recording sensors and temperature-sensitive vaccine vial monitors.
Nexleaf has produced ColdTrace (currently, ColdTrace version 5), which has already been implemented in more than 7,000 health-care facilities worldwide, and has recently established a new partnership with www.Google.org and Gavi.
The benefits provided by these technologies are the fact they are low-cost and particularly useful in developing countries, which often rely on stem thermometers or 30-day temperate loggers.
Big Data and Vaccine Campaigns
Other major sources of Big Data are immunization registries and surveillance systems such as SmiNet-2 (79), or SurvNet@RKI (80). These enormous databases are precious databanks, which can be mined to capture data concerning vaccination coverage rate and its determinants.
Non-conventional data sources or novel data streams, such as Internet search data and tools monitoring web queries, like Google Trends (GT) (81), social media (YouTube, Facebook, Google Plus, Twitter, Pinterest, Instagram, and so on), or news source scraping like HealthMap (82), provide researchers and public health workers with real-time information concerning public reaction to epidemic outbreaks. Novel data streams can track different vaccine-preventable infectious diseases, such as influenza (83–85), pertussis (86, 87), or measles (88), among others. As such, they can be exploited to predict epidemiological figures as well as monitor the effect of vaccine campaigns.
Big Data and Vaccine Efficacy/Effectiveness
Big Data enable also to individuate molecular signatures and predictors of the outcomes of vaccination, being correlates of vaccine efficacy/effectiveness in different populations (89). Haks and collaborators (89), for instance, utilized transcriptomics to quantitatively assess the immunogenetic signature of immunization response. Dunachie and coworkers (90) explored the differentially expressed genes induced by a malaria candidate vaccine and found that most genes conferring immunological protection belonged to the interferon-gamma and to the proteasome/antigen presentation pathways, differently from genes associated with hemopoietic stem cells, regulatory monocytes, and the myeloid lineage modules.
Novel data streams, such as mobile/smartphone applications, can be utilized in the monitoring and management of vaccine-related data (91).
Big Data and Vaccine Side Effects
Vaccine adverse events and reactions are very rare. As such, most studies are statistically underpowered to capture the rate of rare/very rare side effects. Meta-analytical approaches and data mining have emerged as useful strategies with this regard. As claimed by Chandler (92), the classical paradigm of the actual pharmacovigilance/vaccine vigilance system based on three stage-approach (namely, signal detection, development of a causality hypothesis, and testing of the causality hypothesis) is plagued by some limitations, in that “routine vaccine pharmacovigilance practice is not sufficient to understand suspected harms that are poorly defined and whose pathophysiology are not completely understood. Furthermore, estimations of risk at the population level fail to acknowledge that vaccines may cause harm in subgroups with individual-level risk factors” for adverse events following immunization. As such new approaches are needed to capture new side effects and, also in this case, Big Data could play a major role.
Berendsen and coworkers (95) exploited Big Data, to explore BCG-related “non-specific effects,” that is to say effects induced by the vaccination on health beyond its target disease. In particular, they evaluated the effect of timing of BCG on stunting in Sub-Saharan African children under 5 years, analyzing cross-sectional data for 368,450 subjects from 33 controls. Authors found that BCG vaccination did not affect stunting, with timing of BCG vaccination being statistically significant. Similar patterns could be detected for diphtheria–tetanus–pertussis and measles vaccinations.
Vaccine ontology (96, 97), a class of biomedical ontologies, that is to say a consensus-based computer and human interpretable set of terms and relations indicating specific biomedical entities, is another valuable approach. It enables support integrative adverse events-related data collection and analysis, utilizing a normalization strategy more effective than other controlled terminologies. These include the Medical Dictionary for Regulatory Activities, the Common Terminology Criteria for Adverse Events, and the WHO Adverse Reactions Terminology, among others. Using Ontology-Based Vaccine Adverse Event representation, Xie and He (96) explored the adverse events related to Flublok, a recombinant hemagglutinin influenza vaccine.
Novel data streams can be used to see how often people Google for vaccination and for vaccination-related adverse events. Bragazzi and collaborators (98) utilized GT for monitoring the interest toward preventable infections and related vaccines. Authors found that, generally speaking, vaccine was not a popular topic, with the valuable exception of the vaccine against Human Papillomavirus, with vaccines-related queries being approximately one third of the volumes regarding preventable infections. Users tended to search information about possible vaccine-related side effects.
Big Data and Vaccine Literacy/Vaccine Hesitancy
Big Data enable to track and monitor interest toward vaccination practices (99). The increasing phenomenon of vaccine hesitancy (an umbrella term that includes indecision, uncertainty, delay and reluctance) is multifactorial, and closely linked to social contexts, with different determinants, ranging from geographical area, to political situation, complacency, convenience and confidence in vaccines. Novel data streams, providing a snapshot of perceptions of vaccination in a given place and at a specific time, could be used to assess lay-people’s perceptions of vaccination, enabling health-care workers to actively engage citizens and to plan ad hoc communication strategies and plans to contain vaccine hesitancy and to promote vaccine literacy (100).
Shah and colleagues (101) compared time series of rotavirus-related Internet searches as captured by GT with rotavirus laboratory reports from the United States and United Kingdom and with hospitalizations for acute gastroenteritis in the United States and Mexico, before and after national vaccine introductions. Authors found a strong positive correlation between web queries and laboratory reports in the United States (R2 = 0.79) and United Kingdom (R2 = 0.60) and between the Internet searches and acute gastroenteritis hospitalizations in the United States (R2 = 0.87) and Mexico (R2 = 0.69). Correlations were stronger in the prevaccine period and after vaccine introduction, the mean Internet queries decreased by 40–70% in the United States and Mexico, with a loss of seasonal variation in the United Kingdom.
Bakker and coworkers (102) exploited GT to monitor the interest toward chicken pox, over an 11-year period, from 36 countries. Authors found seasonal peaks with striking latitudinal variation in information seeking behavior. Authors concluded that novel data streams are able to track the global burden of childhood disease as well as to investigate effects of immunization at population level.
Goldlust and collaborators (103) investigated the use of large-scale medical claims data for local surveillance of under-immunization for childhood infections in the United States, developing a statistical framework for integrating disparate data sources on surveillance of vaccination behavior. In this way, authors were able to identify the determinants of vaccine hesitancy behavior. Within the “Vaccine Confidence Project,” Larson and colleagues (104) extensively analyzed data from 10,380 reports (from 144 countries) and found that 7,171 (69%) contained positive or neutral content whereas 3,209 (31%) contained negative content (related to vaccine programs and disease outbreaks, vaccine-related beliefs, awareness, and perceptions; vaccine safety; and vaccine delivery programs).
Within the ambitious “Project Tycho” (freely accessible at www.tycho.pitt.edu) launched by the University of Pittsburgh, United States (105, 106), authors have digitized all weekly surveillance reports of notifiable diseases for United States cities and states published in the period between 1888 and 2011. This data set consists of 87,950,807 reported individual cases and has been used to derive a quantitative history of disease dynamics and transmission in the United States. Pattern analysis has documented, in a statistically robust way, a significant reduction of infections-generated burden, underlining the positive effect of vaccination programs (105). This use of big data emphasizes the dimension of “veracity,” through which is possible to contrast vaccine-related “fake news” and “post-modern, post-factual truths,” disseminated by the anti-vaccination movements (107).
Conclusion: State-of-the-Art, Current Challenges, and Future Prospects
Big Data have contributed and are expected to continue contributing toward facilitating the discovery, development, production, and delivery of rationally designed vaccines. Further, enabling to identify predictive biomolecular signatures of response to vaccination, vaccination will shift from the classical “one-size-fits-all” paradigm to a personalized approach. Moreover, Big Data can be used to track the success of vaccination campaigns, in term of vaccination coverage rate, as well as the rare/very rate vaccine-related adverse events, for which “classical epidemiological studies” would be statistically underpowered.
However, a number of pitfalls and challenges should be properly recognized to be addressed by future research: Big Data and Big Data sources, as previously overviewed, are highly heterogeneous and should be effectively integrated and harmonized together. Moreover, some algorithms underlying novel data streams need to be refined in that, sometimes, do not exactly predict epidemic outbreaks (108), even though some scholars have shown that, in principle, is possible to correct them to achieve higher predictive power (109). Further, efforts should be done to preserve and protect privacy, confidentiality, and identity. The emerging field of “Big Data Ethics” is trying to address all these issues (110, 111). Currently, we are only witnessing the very beginning of the ongoing “Big Data revolution.”
NB, MM, and MB conceived the study. NB, VG, MV, RR, AN, AH, MM, and MB drafted and revised the manuscript; read and approved the last version.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The authors would like to thank Ms. Valeria Parisi, University of Milan, for help in preparing the figure.
1. Andre FE, Booy R, Bock HL, Clemens J, Datta SK, John TJ, et al. Vaccination greatly reduces disease, disability, death and inequity worldwide. Bull World Health Organ (2008) 86(2):140–6. doi:10.2471/BLT.07.040089
2. Lee LA, Franzel L, Atwell J, Datta SD, Friberg IK, Goldie SJ, et al. The estimated mortality impact of vaccinations forecast to be administered during 2011-2020 in 73 countries supported by the GAVI Alliance. Vaccine (2013) 31(Suppl 2):B61–72. doi:10.1016/j.vaccine.2012.11.035
3. Lee BY, Bartsch SM, Brown ST, Cooley P, Wheaton WD, Zimmerman RK. Quantifying the economic value and quality of life impact of earlier influenza vaccination. Med Care (2015) 53(3):218–29. doi:10.1097/MLR.0000000000000302
5. Hagan T, Nakaya HI, Subramaniam S, Pulendran B. Systems vaccinology: enabling rational vaccine design with systems biological approaches. Vaccine (2015) 33:5294–301. doi:10.1016/j.vaccine.2015.03.072
13. Burton DR. What are the most powerful immunogen design vaccine strategies? Reverse vaccinology 2.0 shows great promise. Cold Spring Harb Perspect Biol (2017) 9(11). doi:10.1101/cshperspect.a030262
17. Dwivedi P, Alam SI, Tomar RS. Secretome, surfome and immunome: emerging approaches for the discovery of new vaccine candidates against bacterial infections. World J Microbiol Biotechnol (2016) 32(9):155. doi:10.1007/s11274-016-2107-3
18. Poland GA, Ovsyannikova IG, Kennedy RB, Haralambieva IH, Jacobson RM. Vaccinomics and a new paradigm for the development of preventive vaccines against viral infections. OMICS (2011) 15(9):625–36. doi:10.1089/omi.2011.0032
25. Chattopadhyay PK, Roederer M. A mine is a terrible thing to waste: high content, single cell technologies for comprehensive immune analysis. Am J Transplant (2015) 15(5):1155–61. doi:10.1111/ajt.13193
26. Bragazzi NL. From P0 to P6 medicine, a model of highly participatory, narrative, interactive, and “augmented” medicine: some considerations on Salvatore Iaconesi’s clinical story. Patient Prefer Adherence (2013) 7:353–9. doi:10.2147/PPA.S38578
28. Betsch C, Wicker S. E-health use, vaccination knowledge and perception of own risk: drivers of vaccination uptake in medical students. Vaccine (2012) 30(6):1143–8. doi:10.1016/j.vaccine.2011.12.021
29. Khalili S, Jahangiri A, Borna H, Ahmadi Zanoos K, Amani J. Computational vaccinology and epitope vaccine design by immunoinformatics. Acta Microbiol Immunol Hung (2014) 61(3):285–307. doi:10.1556/AMicr.61.2014.3.4
30. Söllner J, Heinzel A, Summer G, Fechete R, Stipkovits L, Szathmary S, et al. Concept and application of a computational vaccinology workflow. Immunome Res (2010) 6(Suppl 2):S7. doi:10.1186/1745-7580-6-S2-S7
31. Hegde NR, Gauthami S, Sampath Kumar HM, Bayry J. The use of databases, data mining and immunoinformatics in vaccinology: where are we? Expert Opin Drug Discov (2018) 13(2):117–30. doi:10.1080/17460441.2018.1413088
35. Todd T, Dunn N, Xiang Z, He Y. Vaxar: a web-based database of laboratory animal responses to vaccinations and its application in the meta-analysis of different animal responses to tuberculosis vaccinations. Comp Med (2016) 66(2):119–28.
36. Sayers S, Ulysse G, Xiang Z, He Y. Vaxjo: a web-based vaccine adjuvant database and its application for analysis of vaccine adjuvants and their uses in vaccine development. J Biomed Biotechnol (2012) 2012:831486. doi:10.1155/2012/831486
37. Xiang Z, Todd T, Ku KP, Kovacic BL, Larson CB, Chen F, et al. VIOLIN: vaccine investigation and online information network. Nucleic Acids Res (2008) 36(Database issue):D923–8. doi:10.1093/nar/gkm1039
39. He Y, Racz R, Sayers S, Lin Y, Todd T, Hur J, et al. Updates on the web-based VIOLIN vaccine database and analysis system. Nucleic Acids Res (2014) 42(Database issue):D1124–32. doi:10.1093/nar/gkt1133
41. He Y, Xiang Z, Mobley HL. Vaxign: the first web-based vaccine design program for reverse vaccinology and applications for vaccine development. J Biomed Biotechnol (2010) 2010:297505. doi:10.1155/2010/297505
42. Goodswen SJ, Kennedy PJ, Ellis JT. Vacceed: a high-throughput in silico vaccine candidate discovery pipeline for eukaryotic pathogens based on reverse vaccinology. Bioinformatics (2014) 30(16):2381–3. doi:10.1093/bioinformatics/btu300
43. Jaiswal V, Chanumolu SK, Gupta A, Chauhan RS, Rout C. Jenner-predict server: prediction of protein vaccine candidates (PVCs) in bacteria based on host-pathogen interactions. BMC Bioinformatics (2013) 14:211. doi:10.1186/1471-2105-14-211
45. Moise L, Gutierrez A, Kibria F, Martin R, Tassone R, Liu R, et al. iVAX: an integrated toolkit for the selection and optimization of antigens and the design of epitope-driven vaccines. Hum Vaccin Immunother (2015) 11(9):2312–21. doi:10.1080/21645515.2015.1061159
47. Masignani V, Comanducci M, Giuliani MM, Bambini S, Adu-Bobie J, Arico B, et al. Vaccination against Neisseria meningitidis using three variants of the lipoprotein GNA1870. J Exp Med (2003) 197(6):789–99. doi:10.1084/jem.20021911
49. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 25(17):3389–402. doi:10.1093/nar/25.17.3389
50. Yang HL, Zhu YZ, Qin JH, He P, Jiang XC, Zhao GP, et al. In silico and microarray-based genomic approaches to identifying potential vaccine candidates against Leptospira interrogans. BMC Genomics (2006) 7:293. doi:10.1186/1471-2164-7-293
51. Grassmann AA, Kremer FS, Dos Santos JC, Souza JD, Pinto LDS, McBride AJA. Discovery of novel leptospirosis vaccine candidates using reverse and structural vaccinology. Front Immunol (2017) 8:463. doi:10.3389/fimmu.2017.00463
53. Talukdar S, Zutshi S, Prashanth KS, Saikia KK, Kumar P. Identification of potential vaccine candidates against Streptococcus pneumoniae by reverse vaccinology approach. Appl Biochem Biotechnol (2014) 172(6):3026–41. doi:10.1007/s12010-014-0749-x
55. Merrifield M, Hotez PJ, Beaumier CM, Gillespie P, Strych U, Hayward T, et al. Advancing a vaccine to prevent human schistosomiasis. Vaccine (2016) 34(26):2988–91. doi:10.1016/j.vaccine.2016.03.079
56. Gan W, Zhao G, Xu H, Wu W, Du W, Huang J, et al. Reverse vaccinology approach identify an Echinococcus granulosus tegumental membrane protein enolase as vaccine candidate. Parasitol Res (2010) 106(4):873–82. doi:10.1007/s00436-010-1729-x
57. Caro-Gomez E, Gazi M, Goez Y, Valbuena G. Discovery of novel cross-protective Rickettsia prowazekii T-cell antigens using a combined reverse vaccinology and in vivo screening approach. Vaccine (2014) 32(39):4968–76. doi:10.1016/j.vaccine.2014.06.089
58. Pandey K, Sharma M, Saarav I, Singh S, Dutta P, Bhardwaj A, et al. Analysis of the DosR regulon genes to select cytotoxic T lymphocyte epitope specific vaccine candidates using a reverse vaccinology approach. Int J Mycobacteriol (2016) 5(1):34–43. doi:10.1016/j.ijmyco.2015.10.005
59. Hassan A, Naz A, Obaid A, Paracha RZ, Naz K, Awan FM, et al. Pangenome and immuno-proteomics analysis of Acinetobacter baumannii strains revealed the core peptide vaccine targets. BMC Genomics (2016) 17(1):732. doi:10.1186/s12864-016-2951-4
61. Holtfreter S, Kolata J, Stentzel S, Bauerfeind S, Schmidt F, Sundaramoorthy N, et al. Omics approaches for the study of adaptive immunity to Staphylococcus aureus and the selection of vaccine candidates. Proteomes (2016) 4(1):1–25. doi:10.3390/proteomes4010011
62. Oprea M, Antohe F. Reverse-vaccinology strategy for designing T-cell epitope candidates for Staphylococcus aureus endocarditis vaccine. Biologicals (2013) 41(3):148–53. doi:10.1016/j.biologicals.2013.03.001
63. Soares SC, Trost E, Ramos RT, Carneiro AR, Santos AR, Pinto AC, et al. Genome sequence of Corynebacterium pseudotuberculosis biovar equi strain 258 and prediction of antigenic targets to improve biotechnological vaccine production. J Biotechnol (2013) 167(2):135–41. doi:10.1016/j.jbiotec.2012.11.003
64. Xiang Z, He Y. Genome-wide prediction of vaccine targets for human herpes simplex viruses using Vaxign reverse vaccinology. BMC Bioinformatics (2013) 14(Suppl 4):S2. doi:10.1186/1471-2105-14-S4-S2
65. Barh D, Barve N, Gupta K, Chandra S, Jain N, Tiwari S, et al. Exoproteome and secretome derived broad spectrum novel drug and vaccine candidates in Vibrio cholerae targeted by Piper betel derived compounds. PLoS One (2013) 8(1):e52773. doi:10.1371/journal.pone.0052773
66. Ifeonu OO, Simon R, Tennant SM, Sheoran AS, Daly MC, Felix V, et al. Cryptosporidium hominis gene catalog: a resource for the selection of novel Cryptosporidium vaccine candidates. Database (Oxford) (2016) 2016:1–13. doi:10.1093/database/baw137
68. Goodswen SJ, Kennedy PJ, Ellis JT. On the application of reverse vaccinology to parasitic diseases: a perspective on feature selection and ranking of vaccine candidates. Int J Parasitol (2017) 47(12):779–90. doi:10.1016/j.ijpara.2017.08.004
70. Aguirre Ade A, Lobo FP, Cunha RC, Garcia MV, Andreotti R. Design of the ATAQ peptide and its evaluation as an immunogen to develop a Rhipicephalus vaccine. Vet Parasitol (2016) 221:30–8. doi:10.1016/j.vetpar.2016.02.032
71. Maritz-Olivier C, van Zyl W, Stutzer C. A systematic, functional genomics, and reverse vaccinology approach to the identification of vaccine candidates in the cattle tick, Rhipicephalus microplus. Ticks Tick Borne Dis (2012) 3(3):179–87. doi:10.1016/j.ttbdis.2012.01.003
72. Liebenberg J, Pretorius A, Faber FE, Collins NE, Allsopp BA, van Kleef M. Identification of Ehrlichia ruminantium proteins that activate cellular immune responses using a reverse vaccinology strategy. Vet Immunol Immunopathol (2012) 145(1–2):340–9. doi:10.1016/j.vetimm.2011.12.003
73. Sebatjane SI, Pretorius A, Liebenberg J, Steyn H, Van Kleef M. In vitro and in vivo evaluation of five low molecular weight proteins of Ehrlichia ruminantium as potential vaccine components. Vet Immunol Immunopathol (2010) 137(3–4):217–25. doi:10.1016/j.vetimm.2010.05.011
75. Altindis E, Cozzi R, Di Palo B, Necchi F, Mishra RP, Fontana MR, et al. Protectome analysis: a new selective bioinformatics tool for bacterial vaccine candidate discovery. Mol Cell Proteomics (2015) 14(2):418–29. doi:10.1074/mcp.M114.039362
79. Rolfhamre P, Jansson A, Arneborn M, Ekdahl K. SmiNet-2: description of an internet-based surveillance system for communicable diseases in Sweden. Euro Surveill (2006) 11(5):103–7. doi:10.2807/esm.11.05.00626-en
80. Faensen D, Claus H, Benzler J, Ammon A, Pfoch T, Breuer T, et al. SurvNet@RKI – a multistate electronic reporting system for communicable diseases. Euro Surveill (2006) 11(4):100–3. doi:10.2807/esm.11.04.00614-en
81. Nuti SV, Wayda B, Ranasinghe I, Wang S, Dreyer RP, Chen SI, et al. The use of Google Trends in health care research: a systematic review. PLoS One (2014) 9(10):e109583. doi:10.1371/journal.pone.0109583
82. Althouse BM, Scarpino SV, Meyers LA, Ayers JW, Bargsten M, Baumbach J, et al. Enhancing disease surveillance with novel data streams: challenges and opportunities. EPJ Data Sci (2015) 4:17. doi:10.1140/epjds/s13688-015-0054-0
84. Samaras L, García-Barriocanal E, Sicilia MA. Syndromic surveillance models using web data: the case of influenza in Greece and Italy using Google Trends. JMIR Public Health Surveill (2017) 3(4):e90. doi:10.2196/publichealth.8015
87. Pollett S, Wood N, Boscardin WJ, Bengtsson H, Schwarcz S, Harriman K, et al. Validating the use of Google Trends to enhance pertussis surveillance in California. PLoS Curr (2015) 7. doi:10.1371/currents.outbreaks.7119696b3e7523faa4543faac87c56c2
89. Haks MC, Bottazzi B, Cecchinato V, De Gregorio C, Del Giudice G, Kaufmann SHE, et al. Molecular signatures of immunity and immunogenicity in infection and vaccination. Front Immunol (2017) 8:1563. doi:10.3389/fimmu.2017.01563
90. Dunachie S, Berthoud T, Hill AV, Fletcher HA. Transcriptional changes induced by candidate malaria vaccines and correlation with protection against malaria in a human challenge model. Vaccine (2015) 33(40):5321–31. doi:10.1016/j.vaccine.2015.07.087
91. Atkinson KM, El-Khatib Z, Barnum G, Bell C, Turcotte MC, Murphy MSQ, et al. Using mobile apps to communicate vaccination records: a city-wide evaluation with a national immunization app, maternal child registry and public health authorities. Healthc Q (2017) 20(3):41–6. doi:10.12927/hcq.2017.25289
97. He Y, Ong E, Xie J. Integrative representations and analyses of vaccine-induced intended protective immunity and unintended adverse events using ontology-based and theory-guided approaches. Glob Vaccines Immunol (2016) 1(2):37–9. doi:10.15761/GVI.1000110
98. Bragazzi NL, Barberis I, Rosselli R, Gianfredi V, Nucci D, Moretti M, et al. How often people Google for vaccination: qualitative and quantitative insights from a systematic search of the web-based activities using Google Trends. Hum Vaccin Immunother (2017) 13(2):464–9. doi:10.1080/21645515.2017.1264742
99. Amicizia D, Domnich A, Gasparini R, Bragazzi NL, Lai PL, Panatto D. An overview of current and potential use of information and communication technologies for immunization promotion among adolescents. Hum Vaccin Immunother (2013) 9(12):2634–42. doi:10.4161/hv.26010
101. Shah MP, Lopman BA, Tate JE, Harris J, Esparza-Aguilar M, Sanchez-Uribe E, et al. Use of internet search data to monitor rotavirus vaccine impact in the United States, United Kingdom, and Mexico. J Pediatric Infect Dis Soc (2018) 7(1):56–63. doi:10.1093/jpids/pix004
102. Bakker KM, Martinez-Bakker ME, Helm B, Stevenson TJ. Digital epidemiology reveals global childhood disease seasonality and the effects of immunization. Proc Natl Acad Sci U S A (2016) 113(24):6689–94. doi:10.1073/pnas.1523941113
104. Larson HJ, Smith DM, Paterson P, Cumming M, Eckersberger E, Freifeld CC, et al. Measuring vaccine confidence: analysis of data obtained by a media surveillance system used to analyse public concerns about vaccines. Lancet Infect Dis (2013) 13(7):606–13. doi:10.1016/S1473-3099(13)70108-7
105. van Panhuis WG, Grefenstette J, Jung SY, Chok NS, Cross A, Eng H, et al. Contagious diseases in the United States from 1888 to the present. N Engl J Med (2013) 369(22):2152–8. doi:10.1056/NEJMms1215400
109. Santillana M, Zhang DW, Althouse BM, Ayers JW. What can digital disease detection learn from (an external revision to) Google Flu Trends? Am J Prev Med (2014) 47(3):341–7. doi:10.1016/j.amepre.2014.05.020
Keywords: vaccine, Big Data, omics disciplines, Web 2.0, eHealth, history of vaccinology
Citation: Bragazzi NL, Gianfredi V, Villarini M, Rosselli R, Nasr A, Hussein A, Martini M and Behzadifar M (2018) Vaccines Meet Big Data: State-of-the-Art and Future Prospects. From the Classical 3Is (“Isolate–Inactivate–Inject”) Vaccinology 1.0 to Vaccinology 3.0, Vaccinomics, and Beyond: A Historical Overview. Front. Public Health 6:62. doi: 10.3389/fpubh.2018.00062
Received: 17 December 2017; Accepted: 16 February 2018;
Published: 05 March 2018
Edited by:Enrico Capobianco, University of Miami, United States
Reviewed by:Pier Luigi Lopalco, University of Pisa, Italy
Antonio Mora, Guangzhou Medical University, China
Copyright: © 2018 Bragazzi, Gianfredi, Villarini, Rosselli, Nasr, Hussein, Martini and Behzadifar. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Nicola Luigi Bragazzi, email@example.com
†These authors have contributed equally to this work.