AUTHOR=Zhao Tianyi , Cheng Liang , Zang Tianyi , Hu Yang TITLE=Peptide-Major Histocompatibility Complex Class I Binding Prediction Based on Deep Learning With Novel Feature JOURNAL=Frontiers in Genetics VOLUME=Volume 10 - 2019 YEAR=2019 URL=https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2019.01191 DOI=10.3389/fgene.2019.01191 ISSN=1664-8021 ABSTRACT=Peptide-based vaccine development needs accurate pre-diction of the binding affinity between major histocom-patibility complex I (MHC I) proteins and their peptide ligands. Nowadays more and more machine learning methods have been developed to predict binding affinity and some of them have become the popular tools. How-ever most of them are designed by the shallow neural networks. As Bengio said that deep neural networks can learn better fits with less data than shallow neural net-works. In our case, some of the alleles only have dozens of peptides data. In addition, we transform each peptide into a characteristic matrix and input it into the model. As we know when dealing with the problem that the input is a matrix, CNN can find the most critical features by itself. Obviously, compared with the traditional neural network model, CNN is more suitable for predicting binding affin-ity. Different from the previous studies which are based on BLOSUM, we used novel feature to do the prediction. Since we consider that the order of the sequence, hy-dropathy index, polarity and the length of the peptide could affect the binding affinity and the properties of these amino acids are key factors for their binding to MHC, we extracted these information from each peptide. In order to make full use of the data we have obtained, we have integrated different length of peptides into 15mer based on the binding mode of peptide to MHC I. In order to demonstrate that our method is reliable to pre-dict pep-tide-MHC binding, we compared our method with several popular methods. The experiments show the superiority of our method.