Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Immunol.

Sec. Antigen Presenting Cell Biology

Volume 16 - 2025 | doi: 10.3389/fimmu.2025.1616113

NetMHCpan-4.2: Improved prediction of CD8+ epitopes by use of transfer learning and structural features

Provisionally accepted
  • 1Department of Health Technology, Technical University of Denmark, Lyngby, Denmark
  • 2Center for Vaccine Innovation, La Jolla Institute for Immunology (LJI), San Diego, California, United States
  • 3Department of Medicine, Division of Infectious Diseases and Global Public Health, University of California, San Diego, United States

The final, formatted version of the article will be published soon.

Identification of CD8+ T cell epitopes is crucial for advancing vaccine development and immunotherapy strategies. Traditional methods for predicting T cell epitopes primarily focus on MHC presentation, leveraging immunopeptidome data. Recent advancements however suggest significant performance improvements through transfer learning and refinement using epitope data. To further investigate this, we here develop an enhanced MHC class I (MHC-I) antigen presentation predictor by integrating newly curated binding affinity and eluted ligand datasets, expanding MHC allele coverage, and incorporating novel input features related to the structural constraints of the MHC-I peptide-binding cleft. We next apply transfer learning using experimentally validated pathogen-and cancer-derived epitopes from public databases to refine our prediction method, ensuring comprehensive data partitioning to prevent performance overestimation. Our findings indicate that fine-tuning on epitope data only yields a minor accuracy boost. Moreover, the transferability between cancer and pathogen-derived epitopes is limited, suggesting distinct properties between these data types. In conclusion, while transfer learning can enhance T cell epitope prediction, the performance gains are modest and data type specific. Our final NetMHCpan-4.2 model is publicly accessible at https://services.healthtech.dtu.dk/services/NetMHCpan-4.2, providing a valuable resource for immunological research and therapeutic development.

Keywords: HLA class I, MHC class I, Antigen Presentation, immunogenicity prediction, machine learning, immunoinformatics

Received: 22 Apr 2025; Accepted: 25 Jul 2025.

Copyright: © 2025 Nilsson, Greenbaum, Peters and Nielsen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Jonas Birkelund Nilsson, Department of Health Technology, Technical University of Denmark, Lyngby, Denmark

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.