Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Bioinform.

Sec. Protein Bioinformatics

Volume 5 - 2025 | doi: 10.3389/fbinf.2025.1669237

This article is part of the Research TopicBioinformatics tools and approaches for prediction and assessment of protein allergenicity and toxicity potentialView all 3 articles

Fish Isoallergens and Variants: Database Compilation, In Silico Allergenicity Prediction Challenges, and Epitope-Based Threshold Optimization

Provisionally accepted
  • 1Biomolecular Sequence To Function Division (BSFD), Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
  • 2Tropical Futures Institute, James Cook University, Singapore, Singapore
  • 3Centre for Sustainable Tropical Fisheries and Aquaculture, College of Science and Engineering, James Cook University, Townsville, Queensland, Australia
  • 4Molecular Allergy Research Laboratory, College of Science and Engineering, James Cook University, Townsville, Queensland, Australia
  • 5Biomolecular Structure To Mechanism Division (BSMD), Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
  • 6Monell Chemical Senses Center, Philadelphia, United States
  • 7Singapore Phenome Center, Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
  • 8School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
  • 9Singapore Immunology Network (SIgN), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
  • 10Yong Loo Lin School of Medicine and Department of Biological Sciences, National University of Singapore (NUS), Singapore, Singapore

The final, formatted version of the article will be published soon.

Fish is a major food allergy trigger with a complex variety of allergenic protein isoforms and vast species diversity with variable allergenicity. This is the first study to systematically compile fish isoallergen and variant entries associated with ingestion-related allergic reactions, supported by evidence from in vitro IgE-binding assays and complete amino acid sequences, sourced from four major allergen databases: World Health Organisation and International Union of Immunological Societies (WHO/IUIS), AllergenOnline, Comprehensive Protein Allergen Resource (COMPARE), and Allergome. A comprehensive dataset of 79 unique fish isoallergen and variant entries from 34 fish species was identified, with 25 entries common across all four databases. Challenges in allergenicity predictions were evaluated and the sensitivity of five widely used in silico tools (AllerCatPro 2.0, AlgPred 2.0, pLM4Alg, AllergenFP v.1.0, and AllerTop v.2.0) was assessed, with AllerCatPro 2.0 achieving the highest sensitivity (97.5%). Furthermore, epitope mapping and phylogenetic analyses were performed for the major fish allergen parvalbumin, incorporating experimentally validated B-cell epitope data from the Immune Epitope Database (IEDB) and evolutionary relationships. A phylogenetic tree was constructed, integrating epitope data to optimize protein family-specific thresholds for differentiating allergenic from less/non-allergenic parvalbumins. Our innovative approach established a threshold of ≥4 IEDB-mapped epitopes allowing up to two mismatches captured 52 out of 54 parvalbumin sequences (96%) in the dataset, effectively distinguishing between these parvalbumin classes. This study enhances our understanding of fish allergy by systematically compiling fish isoallergens and their variants, and by integrating B-cell epitope data. The optimized thresholds improve the performance of allergenicity prediction tools, and the approach can be applied to other protein families in future.

Keywords: fish allergy, isoallergens, Food allergens, parvalbumin, B-cell epitope mapping, phylogenetic tree, protein allergenicity prediction, in silico prediction tools

Received: 19 Jul 2025; Accepted: 02 Oct 2025.

Copyright: © 2025 Limviphuvadh, Ruethers, Nguyen, Jerry, Smith, Wang, Miao, Andiappan, Lopata and Maurer-Stroh. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Vachiranee Limviphuvadh, vachiraneel@bii.a-star.edu.sg
Sebastian Maurer-Stroh, sebastianms@bii.a-star.edu.sg

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.