ORIGINAL RESEARCH article
Front. Immunol.
Sec. Antigen Presenting Cell Biology
Volume 16 - 2025 | doi: 10.3389/fimmu.2025.1616113
NetMHCpan-4.2: Improved prediction of CD8+ epitopes by use of transfer learning and structural features
Provisionally accepted- 1Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
- 2Center for Vaccine Innovation, La Jolla Institute for Immunology (LJI), San Diego, California, United States
- 3Department of Medicine, Division of Infectious Diseases and Global Public Health, University of California, San Diego, United States
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Identification of CD8+ T cell epitopes is crucial for advancing vaccine development and immunotherapy strategies. Traditional methods for predicting T cell epitopes primarily focus on MHC presentation, leveraging immunopeptidome data. Recent advancements however suggest significant performance improvements through transfer learning and refinement using epitope data. To further investigate this, we here develop an enhanced MHC class I (MHC-I) antigen presentation predictor by integrating newly curated binding affinity and eluted ligand datasets, expanding MHC allele coverage, and incorporating novel input features related to the structural constraints of the MHC-I peptide-binding cleft. We next apply transfer learning using experimentally validated pathogen-and cancer-derived epitopes from public databases to refine our prediction method, ensuring comprehensive data partitioning to prevent performance overestimation. Our findings indicate that fine-tuning on epitope data only yields a minor accuracy boost. Moreover, the transferability between cancer and pathogen-derived epitopes is limited, suggesting distinct properties between these data types. In conclusion, while transfer learning can enhance T cell epitope prediction, the performance gains are modest and data type specific. Our final NetMHCpan-4.2 model is publicly accessible at https://services.healthtech.dtu.dk/services/NetMHCpan-4.2, providing a valuable resource for immunological research and therapeutic development.
Keywords: HLA class I, MHC class I, Antigen Presentation, immunogenicity prediction, machine learning, immunoinformatics
Received: 22 Apr 2025; Accepted: 25 Jul 2025.
Copyright: © 2025 Nilsson, Greenbaum, Peters and Nielsen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Jonas Birkelund Nilsson, Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.