VaxiJen Dataset of Bacterial Immunogens: An Update

Page: [398 - 400] Pages: 3

  • * (Excluding Mailing and Handling)

Abstract

Background: Identifying immunogenic proteins is the first stage in vaccine design and development. VaxiJen is the most widely used and highly cited server for immunogenicity prediction. As the developers of VaxiJen, we are obliged to update and improve it regularly. Here, we present an updated dataset of bacterial immunogens containing 317 experimentally proven immunogenic proteins of bacterial origin, of which 60% have been reported during the last 10 years.

Methods: PubMed was searched for papers containing data for novel immunogenic proteins tested on humans till March 2017. Corresponding protein sequences were collected from NCBI and UniProtKB. The set was curated manually for multiple protein fragments, isoforms, and duplicates.

Results: The final curated dataset consists of 306 immunogenic proteins tested on humans derived from 47 bacterial microorganisms. Certain proteins have several isoforms. All were considered, and the total protein sequences in the set are 317. The updated set contains 206 new immunogens, compared to the previous VaxiJen bacterial dataset. The average number of immunogens per species is 6.7. The set also contains 12 fusion proteins and 41 peptide fragments and epitopes. The dataset includes the names of bacterial microorganisms, protein names, and protein sequences in FASTA format.

Conclusion: Currently, the updated VaxiJen bacterial dataset is the best known manually-curated compilation of bacterial immunogens. It is freely available at http://www.ddg-pharmfac.net/vaxi jen/dataset. It can easily be downloaded, searched, and processed. When combined with an appropriate negative dataset, this update could also serve as a training set, allowing enhanced prediction of the potential immunogenicity of unknown protein sequences.

Keywords: Immunogenicity prediction, dataset, bacterial immunogen, VaxiJen, FASTA, epitopes.

Graphical Abstract

[1]
Flower, D.R. Bioinformatics for Vaccinology; Wiley-Blackwell: Chichester, UK, 2008.
[2]
Arnon, R. Overview of vaccine strategies.In: Vaccine Design. Innovative Approaches and Novel Strategies; Rappuolo, R.; Bagnoli, F., Eds.; Caister Academic Press: Norfolk, UK, 2011.
[3]
Pizza, M.; Scarlato, V.; Masignani, V.; Giuliani, M.M.; Aricò, B.; Comanducci, M.; Jennings, G.T.; Baldi, L.; Bartolini, E.; Capecchi, B.; Galeotti, C.L.; Luzzi, E.; Manetti, R.; Marchetti, E.; Mora, M.; Nuti, S.; Ratti, G.; Santini, L.; Savino, S.; Scarselli, M.; Storni, E.; Zuo, P.; Broeker, M.; Hundt, E.; Knapp, B.; Blair, E.; Mason, T.; Tettelin, H.; Hood, D.W.; Jeffries, A.C.; Saunders, N.J.; Granoff, D.M.; Venter, J.C.; Moxon, E.R.; Grandi, G.; Rappuoli, R. Identification of vaccine candidates against serogroup B meningococcus by whole-genome sequencing. Science, 2000, 287(5459), 1816-1820.
[4]
Doytchinova, I.A.; Flower, D.R. Identifying candidate subunit vaccines using an alignment-independent method based on principal amino acid properties. Vaccine, 2007, 25(5), 856-866.
[5]
Doytchinova, I.A.; Flower, D.R. VaxiJen: A server for prediction of protective antigens, tumour antigens and subunit vaccines. BMC Bioinformatics, 2007, 8, 4.
[6]
Doytchinova, I.A.; Flower, D.R. Bioinformatic approach for identifying parasite and fungal candidate subunit vaccines. Open Vaccine J., 2008, 1, 22-26.
[7]
Zaharieva, N.; Dimitrov, I.; Flower, D.R.; Doytchinova, I. Immunogenicity prediction by vaxijen: A ten year overview. J. Proteomics Bioinform., 2017, 10(11), 298-310.
[8]
Boje, S.; Olsen, A.W.; Erneholm, K.; Agerholm, J.S.; Jungersen, G.; Andersen, P.; Follmann, F. A multi-subunit chlamydia vaccine inducing neutralizing antibodies and strong IFN-gamma (+) CMI responses protects against a genital infection in minipigs. Immunol. Cell Biol., 2016, 94(2), 185-195.
[9]
Wang, X.; Zhang, J.; Liang, J.; Zhang, Y.; Teng, X.; Yuan, X.; Fan, X. Protection against Mycobacterium tuberculosis infection offered by a new multistage subunit vaccine correlates with increased number of IFN-gamma+ IL-2+ CD4+ and IFN-gamma+ CD8+ T cells. PLoS One, 2015, 10(3)e0122560
[10]
Ansari, H.R.; Flower, D.R.; Raghava, G.P. AntigenDB: an immunoinformatics database of pathogen antigens. Nucleic Acids Res., 2010, 38(Database issue), D847-D853.
[11]
Chattopadhyay, A.K.; Nasiev, D.; Flower, D.R. A statistical physics perspective on alignment-independent protein sequence comparison. Bioinformatics, 2015, 31(15), 2469-2474.