
ORIGINAL RESEARCH article

Front. Physiol., 21 April 2021
Sec. Systems Biology

A Transfer Learning-Based Approach for Lysine Propionylation Prediction

Ang Li†, Yingwei Deng*†, Yan Tan and Min Chen*
  • School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, China

Lysine propionylation is a newly discovered posttranslational modification (PTM) that plays a key role in cellular processes. Although proteomics techniques are capable of detecting propionylation, large-scale detection remains challenging. To bridge this gap, we present a transfer learning-based method for computationally predicting propionylation sites. A recurrent neural network-based deep learning model was first trained on malonylation data and then fine-tuned on propionylation data. The trained model served as a feature extractor, translating protein sequences into numerical vectors, and a support vector machine was used as the final classifier. The proposed method reached a Matthews correlation coefficient (MCC) of 0.6615 in 10-fold cross-validation and 0.3174 in the independent test, outperforming state-of-the-art methods. Enrichment analysis indicated that propionylation is associated with the GO terms GO:0016620, GO:0051287, GO:0003735, GO:0006096, and GO:0005737, and with metabolism. We developed a user-friendly online tool for predicting propionylation sites, which is available at http://47.113.117.61/.

Introduction

No machine is more sophisticated than the cell, which harbors many intricate mechanisms, including transcription, gene splicing, translation, and posttranslational modification (PTM), that together constitute life. As a key mechanism, PTM not only increases the diversity of protein structures and functions but also makes regulation more sophisticated. Many studies have indicated that aberrant PTMs are implicated in human diseases, including cancer (Martin et al., 2011; Nakamura et al., 2015; Junqueira et al., 2019). Propionylation, one of more than 400 types of PTM, was first discovered in histones in 2007 (Chen et al., 2007) and later in nonhistone proteins (Cheng et al., 2009). Propionylation is a dynamic process in which a propionyl group is conjugated to substrate proteins by certain acetyltransferases and can be removed by Sirt1 and Sirt2 (Chen et al., 2007; Leemhuis et al., 2008; Zhang et al., 2008; Cheng et al., 2009). Although lysine propionylation is known to play a regulatory role in metabolism (Yang et al., 2019) and to be a mark of active chromatin (Kebede et al., 2017), many of its functions remain unknown.

Identifying propionylation sites is crucial to further exploring the functions of propionylated proteins. Mass spectrometry has been developed to detect propionylation sites over the past decades and has achieved a great deal (Chen et al., 2007). However, the technique is time consuming and labor intensive. An alternative is computational methods, which learn a model from known data and then make predictions for unknown data, a process similar to human learning. Over the past 30 years, more than 100 computational methods or tools have been developed for predicting PTM sites (Huang and Zeng, 2016; Zhou et al., 2016; Ai et al., 2017; Wei et al., 2017, 2019; Xiang et al., 2017; Chen et al., 2018; de Brevern et al., 2018; Ning et al., 2018, 2019; Xie et al., 2018; Huang et al., 2019, 2020; Luo et al., 2019; Malebary et al., 2019; Wang et al., 2019; Lv et al., 2020; Qian et al., 2020; Thapa et al., 2020). For example, Malebary et al. (2019) proposed a computational model for lysine crotonylation prediction that integrates various position- and composition-relative features along with statistical moments, reaching an average accuracy of 0.9917 on the experimental dataset. Chen et al. (2018) presented a computational tool named ProAcePred to predict prokaryotic lysine acetylation sites by extracting sequence-based, physicochemical property, and evolutionary information features. Wei et al. (2017, 2019) used sequence-based information to build computational models for predicting phosphorylation sites and protein methylation sites, respectively. Although propionylation is a newly discovered PTM, two computational methods have already been developed to detect propionylation sites. One is a biased support vector machine (SVM) model (Ju and He, 2017) that incorporates four different sequence features into Chou's pseudo-amino acid composition. The other is PropSeek (Wang et al., 2017), also an SVM model, which exploits evolutionary information, sequence-derived information, predicted structural information, and functional annotations. Advances in deep learning techniques could accelerate the development of propionylation prediction; a well-known example is AlphaFold, a deep learning-based method that accurately determines a protein's 3D shape from its amino-acid sequence (Callaway, 2020), addressing one of biology's grandest challenges better than any other technique to date. In this paper, we therefore attempted to build a deep learning model to predict propionylation sites. However, the accumulated propionylation data are too scarce to train a deep learning model well. Lysine propionylation exhibits in situ crosstalk with lysine malonylation: Wang et al. (2017) statistically compared 1,471 propionylation sites in 605 proteins with a dataset of 1,745 malonylation sites in 595 proteins and found that 600 (40.8%) of the 1,471 propionylation sites overlap with malonylation sites. Moreover, far more malonylation sites than propionylation sites are available. Inspired by this, we propose a transfer learning method for predicting propionylation sites. We first construct a recurrent neural network (RNN)-based deep learning model and train it on the malonylation data; the model is then fine-tuned on the propionylation data and serves as a feature extractor. Finally, an SVM-based classifier is trained to discriminate propionylation from nonpropionylation.

Data

All lysine propionylation sites were collected from the Protein Lysine Modifications Database (PLMD) (Xu et al., 2017) and the UniProt database (UniProt Consortium, 2018). The PLMD is dedicated to collecting lysine modifications, currently hosting 284,780 modification events in 53,501 proteins across 20 types of lysine modification such as ubiquitination, methylation, and sumoylation. UniProt is a comprehensive database of protein sequences and functional annotations. We first downloaded 192 proteins containing 413 propionyllysine sites from the PLMD (http://plmd.biocuckoo.org/download.php) and then retrieved 18 propionylated proteins from UniProt. After merging the two protein datasets and removing duplicates, we obtained 207 unique proteins. Protein functions, including modifications, depend to some extent on homology. To reduce the influence of homology on the proposed method, we applied the sequence clustering software CD-HIT (Li and Godzik, 2006) with a sequence identity threshold of 0.7. Finally, we obtained 189 proteins as experimental data, in which the sequence similarity between any two proteins was less than 0.7. We randomly selected 4/5 of the 189 proteins (151 proteins, containing 304 sites) as positive training samples and the remaining 38 proteins (containing 104 sites) as positive testing samples. Lysine sites vastly outnumber lysine propionylation sites, so positive and negative samples were unbalanced; such imbalance would bias the trained model toward the negative class. Therefore, we randomly selected lysine sites that do not undergo PTM from these proteins as negative samples, at a positive-to-negative ratio of 1:1. The training set consisted of 304 positive and 304 negative lysine sites, and the testing set of 104 positive and 104 negative lysine sites. All positive and negative sites are listed in the Supplementary Material.

We also downloaded 3,429 malonylated proteins containing 9,584 malonylation sites. Similarly, we randomly chose an equal number of lysine sites that did not undergo malonylation as negative samples. The malonylation set therefore contained 9,584 malonylation sites and 9,584 nonmalonylation lysine sites.
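The following is a minimal sketch of this negative-sampling step, assuming protein sequences are held as plain Python strings and site positions are 0-based indices; the function name and defaults are illustrative, not the authors' code.

```python
import random

def sample_negative_sites(sequence, positive_positions, ratio=1, seed=42):
    """Randomly select unmodified lysine positions as negative samples.

    `sequence` is a protein sequence string and `positive_positions` the
    0-based indices of known modified lysines. Sampling at ratio 1 gives
    the 1:1 positive-to-negative balance used in the paper.
    """
    rng = random.Random(seed)
    positives = set(positive_positions)
    candidates = [i for i, aa in enumerate(sequence)
                  if aa == "K" and i not in positives]
    n_neg = min(len(candidates), ratio * len(positives))
    return rng.sample(candidates, n_neg)
```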

Materials and Methods

As shown in Figure 1, the proposed method consisted of three main steps: feature encoding, classifier training, and propionylation prediction, implemented as eight modules: segmenting sequences, constructing a deep RNN model, training the deep RNN model, extracting features, constructing the SVM model, optimizing the window size and the hyperparameters of the SVM model, training the SVM model, and predicting propionylation with the trained SVM model. We used the malonylation dataset to train the RNN model and then fine-tuned the trained model on the training set of propionylation data. Propionylation sequences were fed into the fine-tuned deep RNN model, and the outputs of its second-to-last layer were taken as features of the propionylation sequences. The subsequent workflow was the same as in a common machine learning method.


Figure 1. The workflow of the proposed method.

Segmenting Sequences

As shown in Figure 2A, protein sequences were segmented into peptides in which a lysine was the center and n residues lay downstream and upstream of it, respectively. If fewer than n residues were available downstream or upstream, the missing positions were padded with the character X, as shown in Figure 2B. Each peptide was thus a fixed-size window of 2 × n + 1 residues. We obtained 816 peptides for the propionylation dataset and 9,584 + 9,584 = 19,168 peptides for the malonylation dataset described above.
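A minimal sketch of this segmentation step is shown below, assuming sequences are plain Python strings; the function name and the toy example are illustrative.

```python
def segment_sequence(sequence, n=14):
    """Extract (2n + 1)-residue windows centered on every lysine (K).

    Positions without enough flanking residues are padded with 'X',
    as in Figure 2B. With n = 14 this yields the window size 29
    selected in the Results section.
    """
    peptides = []
    padded = "X" * n + sequence + "X" * n  # pad both ends once
    for i, aa in enumerate(sequence):
        if aa == "K":
            # index i + n is the center of the window in `padded`
            peptides.append(padded[i:i + 2 * n + 1])
    return peptides

# Example: 9-residue windows for a short toy sequence
print(segment_sequence("MKTAYIAKQRQISFVK", n=4))
```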


Figure 2. Illustration of segmenting protein sequences. (A) Normal segmentation; (B) segmentation when the number of flanking residues is less than 8.

Deep RNN Model

As shown in Figure 3, the deep RNN model was made up of one embedding layer, two long short-term memory (LSTM) layers, one gated recurrent unit (GRU) layer, one dropout layer, one flatten layer, one fully connected layer, and one output layer. The embedding layer translated integer indices of amino-acid characters into embedding vectors; in natural language processing, an embedding layer is commonly regarded as a bridge from text to numerical vectors. The LSTM (Hochreiter and Schmidhuber, 1997) is a type of RNN (Pearlmutter, 1989; Giles et al., 1994). An RNN shares network weights across steps, so the output at the current step depends not only on the current input but also on outputs at previous steps. Owing to its effectiveness and efficiency, the RNN has been widely applied in sequence and time-series analysis. However, a plain RNN cannot remember information about inputs far away from the current one; the LSTM is one of the better solutions to this problem. A typical LSTM includes three gates: a forget gate, an input gate, and an output gate. The forget gate selectively discards past information, and the input gate selectively remembers current information. All three gates adopt the sigmoid activation function, whose output ranges from 0 to 1: an output of 0 means no information is passed, and 1 means all information is passed. The LSTM also includes a candidate memory cell that fuses current and past memories. The GRU is a variant of the LSTM. Compared with the LSTM, the GRU includes only two gates, a reset gate and an update gate, and drops the candidate memory cell. The reset gate determines which past information to forget, and the update gate drops some past information and adds some new information. The GRU performs fewer operations than the LSTM and is therefore faster to compute. To capture bidirectional semantic information, we used bidirectional LSTM and bidirectional GRU layers.


Figure 3. The RNN-based deep learning model.

Deep learning models are prone to overfitting and to time-consuming training. Hinton et al. (2012) proposed the dropout operation as a solution to prevent neural networks from overfitting. Dropout randomly drops neurons at a specified rate during training, leaving their weights un-updated, while all neurons are used during testing. Since its creation, dropout has become a prevalent trick in deep learning models (Srivastava et al., 2014).

The flatten layer served as a bridge between the recurrent layers and the fully connected layer; its only purpose was to reshape the input so that it could feed the subsequent fully connected layer. The fully connected layer corresponds to the hidden layer of a multilayer perceptron, and the number of neurons in the output layer corresponds to the number of class labels.
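Below is a minimal Keras sketch of this architecture together with the pretrain/fine-tune/extract procedure described above; the layer widths, dropout rate, learning rates, epoch counts, and single sigmoid output are illustrative assumptions rather than the exact settings used in the paper.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

VOCAB = 22    # 20 amino acids + padding character 'X' + reserved index 0
WINDOW = 29   # peptide window size (2n + 1 with n = 14)

def build_model():
    # Embedding -> BiLSTM -> BiLSTM -> BiGRU -> Dropout -> Flatten -> Dense -> output
    return models.Sequential([
        layers.Input(shape=(WINDOW,)),
        layers.Embedding(VOCAB, 32),
        layers.Bidirectional(layers.LSTM(64, return_sequences=True)),
        layers.Bidirectional(layers.LSTM(64, return_sequences=True)),
        layers.Bidirectional(layers.GRU(32, return_sequences=True)),
        layers.Dropout(0.5),
        layers.Flatten(),
        layers.Dense(64, activation="relu"),   # second-to-last layer = features
        layers.Dense(1, activation="sigmoid"),
    ])

model = build_model()
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# Step 1: pre-train on the larger malonylation dataset.
# model.fit(X_mal, y_mal, epochs=20, batch_size=64)

# Step 2: fine-tune on the propionylation training set,
# typically with a smaller learning rate.
# model.compile(optimizer=tf.keras.optimizers.Adam(1e-4),
#               loss="binary_crossentropy")
# model.fit(X_prop, y_prop, epochs=10, batch_size=32)

# Step 3: expose the second-to-last layer as a feature extractor;
# its outputs become the input vectors for the SVM classifier.
extractor = models.Model(inputs=model.input, outputs=model.layers[-2].output)
# features = extractor.predict(X_prop)
```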

Support Vector Machine

The SVM, proposed by Vapnik and colleagues (Boser et al., 1992; Cortes and Vapnik, 1995; Vapnik, 1998), is a statistical learning algorithm. Owing to its solid mathematical foundation, the SVM has been applied to a wide range of fields, from handwritten digit recognition (Matic et al., 1993), text categorization (Joachims, 1999), and face detection (Osuna et al., 1997) to protein/gene structure or function prediction (Caragea et al., 2007; Plewczynski et al., 2008; Li et al., 2009; Pugalenthi et al., 2010; Li et al., 2011; Sun et al., 2015; Ning et al., 2018). Take, for example, a binary classification problem with n training samples {(xi, yi) | i = 1, 2, …, n}, where yi ∈ {1, −1}. The SVM aims to find a hyperplane f(x) = w·x + b that separates samples with label 1 from those with label −1; that is, positive samples satisfy f(x) > 0 and negative ones satisfy f(x) < 0. In fact, many hyperplanes may meet this requirement; the SVM seeks the one that maximizes the separating margin. This is modeled as minimizing the following objective:

$$L(\mathbf{w}) = \frac{1}{2}\,\mathbf{w}^{T}\mathbf{w}, \qquad (1)$$

subject to the constraints:

$$y_i\,(\mathbf{w}\cdot\mathbf{x}_i + b) \geq 1, \quad i = 1, 2, 3, \ldots, n. \qquad (2)$$

In the real world, training samples are not always completely separable by any hyperplane; some samples inevitably fall on the wrong side. To address this, the SVM introduces slack variables ξi, and the objective function (1) is rewritten as

$$L(\mathbf{w}, \boldsymbol{\xi}) = \frac{1}{2}\,\mathbf{w}^{T}\mathbf{w} + C\sum_{i=1}^{n}\xi_i, \qquad (3)$$

where C is the penalty factor, a user-specified hyperparameter, while constraint (2) is rewritten as

$$y_i\,(\mathbf{w}\cdot\mathbf{x}_i + b) \geq 1 - \xi_i, \quad i = 1, 2, 3, \ldots, n, \quad \xi_i \geq 0. \qquad (4)$$

The objective function is composed of the structural risk (the first term in Eq. 3) and the empirical risk (the second term in Eq. 3); the penalty factor controls the trade-off between the two. Another strength of the SVM is the kernel trick. Samples that are not separable in a low-dimensional space may become separable after mapping into a higher-dimensional space. The SVM uses a mapping function φ(x) to transform such samples from the low-dimensional into a high-dimensional space and then finds a hyperplane to separate them there, expressed as

$$F(\mathbf{x}) = \mathbf{w}^{T}\varphi(\mathbf{x}) + b, \qquad (5)$$

where φ(x) is the mapping function, defined implicitly through a kernel function $K(\mathbf{x}_i,\mathbf{x}_j)=\varphi(\mathbf{x}_i)^{T}\varphi(\mathbf{x}_j)$. More than ten kernel functions exist, such as the linear kernel $K(\mathbf{x}_i,\mathbf{x}_j)=\mathbf{x}_i^{T}\mathbf{x}_j$, the polynomial kernel $K(\mathbf{x}_i,\mathbf{x}_j)=(a\,\mathbf{x}_i^{T}\mathbf{x}_j+c)^{d}$, and the Gaussian kernel $K(\mathbf{x}_i,\mathbf{x}_j)=\exp\!\left(-\frac{\lVert\mathbf{x}_i-\mathbf{x}_j\rVert^{2}}{2\sigma^{2}}\right)$. The corresponding constraint is updated to

$$y_i\,(\mathbf{w}^{T}\varphi(\mathbf{x}_i) + b) \geq 1 - \xi_i, \quad i = 1, 2, 3, \ldots, n, \quad \xi_i \geq 0. \qquad (6)$$

The SVM problem can be solved via duality theory and the Lagrange optimization algorithm; readers may refer to the relevant scientific literature.
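In practice, an off-the-shelf SVM implementation suffices; the snippet below is a sketch using scikit-learn on random stand-in features, where the C value and the RBF (Gaussian) kernel choice are only placeholders for the tuned values reported later.

```python
import numpy as np
from sklearn.svm import SVC

# Random stand-ins for the RNN-extracted feature vectors.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 64))
y = rng.integers(0, 2, size=200)

# Soft-margin SVM with a Gaussian (RBF) kernel; C controls the trade-off
# between the structural risk (margin) and the empirical risk (slack).
clf = SVC(C=1.0, kernel="rbf", gamma="scale")
clf.fit(X, y)
print(clf.predict(X[:5]))
```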

Cross-Validation and Metrics

For regression or classification problems, there are generally four types of validation: hold-out validation, k-fold cross-validation, leave-one-out, and independent test. In hold-out validation, the training set is split into two parts, one for training and another for validation. In k-fold cross-validation, the training set is divided into k parts, and each part is tested by the model trained on the other k − 1 parts. Leave-one-out is the extreme case of cross-validation in which k equals the number of samples. In an independent test, the model is evaluated on a dataset held out entirely from training. We used 10-fold cross-validation and an independent test to examine the proposed method.

To quantitatively compare the performance of methods, the following metrics were used: sensitivity (SN), specificity (SP), accuracy (ACC), and Matthews correlation coefficient (MCC), computed as

$$\mathrm{SN} = \frac{TP}{TP+FN}$$
$$\mathrm{SP} = \frac{TN}{TN+FP}$$
$$\mathrm{ACC} = \frac{TP+TN}{TP+FN+FP+TN}$$
$$\mathrm{MCC} = \frac{TP \times TN - FP \times FN}{\sqrt{(TP+FN)(TP+FP)(TN+FN)(TN+FP)}}$$

In the equations above, TP is the number of true positive samples, TN the number of true negative samples, FN the number of false negative samples, and FP the number of false positive samples. SN, SP, and ACC range from 0 to 1, where 0 means completely wrong and 1 completely correct; for example, SN = 0 implies that all positive samples were predicted as negative. MCC ranges from −1 to 1, where 1 means perfect prediction, 0 random prediction, and −1 prediction completely opposite to the truth.

The receiver operating characteristic (ROC) curve, which plots the true positive rate against the false positive rate at various thresholds, was used to depict performance. The area under the ROC curve (AUC) quantitatively assesses performance; the AUC ranges from 0 to 1, with 0.5 meaning random guessing and 1 perfect performance.
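All five metrics can be computed directly from a classifier's predictions; a minimal sketch, assuming binary 0/1 labels, predicted labels, and continuous decision scores:

```python
from sklearn.metrics import confusion_matrix, matthews_corrcoef, roc_auc_score

def evaluate(y_true, y_pred, y_score):
    # Confusion matrix in sklearn's order: tn, fp, fn, tp.
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
    return {
        "SN": tp / (tp + fn),                    # sensitivity
        "SP": tn / (tn + fp),                    # specificity
        "ACC": (tp + tn) / (tp + tn + fp + fn),  # accuracy
        "MCC": matthews_corrcoef(y_true, y_pred),
        "AUC": roc_auc_score(y_true, y_score),   # area under the ROC curve
    }
```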

Results

Parameter Optimization

The peptide window size was generally chosen from the interval [21, 41]. We conducted 10-fold cross-validation over the training set to search for a better window size; the performances under various window sizes are listed in Table 1. A window size of 29 achieved the best performance, so we set the window size to 29 in the subsequent experiments. We also optimized the hyperparameters of the SVM classifier, i.e., C, kernel, and gamma, searching the combination space of C = [0.5, 1, 1.5, 2, 2.5, 3, 10, 100, 1,000], kernel = [“linear,” “poly,” “rbf”], and gamma = [“scale,” “auto”]. Table 2 shows the best 15 combinations. The best performance was SN = 0.8454, SP = 0.8158, ACC = 0.8306, and MCC = 0.6615, slightly better than before hyperparameter tuning, obtained with C = 1, kernel = rbf, and gamma = scale. The predictive performance on the testing set was an SN of 0.6731, an SP of 0.6442, an ACC of 0.6587, and an MCC of 0.3174.
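The hyperparameter search above maps directly onto a grid search; a sketch with scikit-learn, assuming X_train and y_train hold the extracted features and labels:

```python
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

param_grid = {
    "C": [0.5, 1, 1.5, 2, 2.5, 3, 10, 100, 1000],
    "kernel": ["linear", "poly", "rbf"],
    "gamma": ["scale", "auto"],
}

# Exhaustive search with 10-fold cross-validation, scored by MCC.
search = GridSearchCV(SVC(), param_grid, cv=10, scoring="matthews_corrcoef")
# search.fit(X_train, y_train)
# print(search.best_params_)  # best found here: C=1, kernel='rbf', gamma='scale'
```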


Table 1. Performance under various window sizes in 10-fold cross-validation.


Table 2. The best 15 combinations in the search space.

Comparison With Other Methods

To the best of our knowledge, there have been two computational methods for propionylation prediction: PropPred (Ju and He, 2017) and PropSeek (Wang et al., 2017). However, to date, both web servers have stopped working. The reported performance of PropPred, with 250 optimal features and a window size of 25 residues, in 10-fold cross-validation was an SN of 0.7003, an SP of 0.7561, an ACC of 0.7502, and an MCC of 0.3085, inferior to that of the proposed method. The performance of PropPred on the testing set was an SN of 0.6604, an SP of 0.7504, an ACC of 0.7431, and an MCC of 0.2495, inferior to that of the proposed method in terms of SN and MCC. It must be pointed out that the training and testing sets used by the two methods were different. To perform a fair comparison, we reimplemented PropPred with the 250 optimal features and a window size of 25 residues. Its performances in 10-fold cross-validation on the training set and in the independent test on the testing set are listed in Table 3. Obviously, the proposed method outperformed PropPred. We also compared the presented method with the deep RNN model alone, which achieved an SN of 0.5962, an SP of 0.6731, an ACC of 0.6346, and an MCC of 0.2700 on the testing set; the presented method outperformed it as well.


Table 3. Performances of the PropPred method.

Figure 4A shows the 10-fold cross-validation performance of the presented method and PropPred. Although the AUC of the presented method was inferior to that of PropPred, its best operating point, closest to the upper-left corner, was better than that of PropPred. In the independent test (Figure 4B), the presented method outperformed both PropPred and the deep RNN method. The presented method thus retains the advantage of deep learning while avoiding hand-crafted feature design.


Figure 4. Receiver operating characteristic curves of (A) 10-fold cross validation and (B) independent test.

Functional Analysis

We used the DAVID web application (Huang da et al., 2009), which includes a comprehensive set of functional annotation tools for uncovering the biological meaning behind studied genes, for functional analysis. First, we used the gene functional classification tool in DAVID to cluster the 183 proteins from Thermus thermophilus HB8. As shown in Table 4, only 29 proteins clustered into four functionally similar groups, while the other proteins showed no functional similarity. Leucyl-tRNA synthetase (leuS) and histidyl-tRNA synthetase (hisS) appeared simultaneously in two groups. We also used the functional annotation tool in DAVID to perform GO and KEGG pathway enrichment. Because 183 of the 207 proteins were from Thermus thermophilus HB8, the genes of Thermus thermophilus HB8 were used as the background. At an EASE score of at most 0.01, the enriched GO molecular function terms were GO:0016620 (oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor), GO:0051287 (NAD binding), and GO:0003735 (structural constituent of ribosome). The enriched GO biological process and cellular component terms were GO:0006096 (glycolytic process) and GO:0005737 (cytoplasm), respectively, as shown in Table 5. The enriched pathways are listed in Table 6. Of the nine enriched pathways, four were related to metabolism and two to biosynthesis, implying that propionylation is involved in metabolism; indeed, several studies have reported such involvement (Okanishi et al., 2014, 2017; Yang et al., 2019).


Table 4. Function groups of proteins.


Table 5. Significantly enriched GO terms.


Table 6. Significant KEGG pathways.

Conclusion

We presented a transfer learning-based method and an online web server1 for computationally predicting propionylation. The method takes advantage of the crosstalk between propionylation and malonylation and avoids artificially designing features. Statistical enrichment analysis implied that propionylation is associated with metabolism.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding authors.

Author Contributions

AL and YD: conceptualization, funding acquisition, and writing – original draft. YT: data curation, formal analysis, and software. AL, MC, and YD: methodology, validation, writing – review, and editing. All authors contributed to the article and approved the submitted version.

Funding

This work was supported in part by the Natural Science Foundation of Hunan Province, China, under Grant 2019JJ40064, and by the Scientific Research Project of the Education Department of Hunan Province under Grants 19B142, 19A125, and 18A253.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2021.658633/full#supplementary-material

Footnotes

  1. ^ http://47.113.117.61/

References

Ai, H., Wu, R., Zhang, L., Wu, X., Ma, J., Hu, H., et al. (2017). pSuc-PseRat: predicting lysine succinylation in proteins by exploiting the ratios of sequence coupling and properties. J. Comput. Biol. 24, 1050–1059. doi: 10.1089/cmb.2016.0206


Boser, B. E., Guyon, I. M., and Vapnik, V. N. (1992). “A training algorithm for optimal margin classifiers,” in Proceedings of the 5th Annual Workshop on Computational Learning Theory, (New York, NY: ACM), 144–152.


Callaway, E. (2020). ‘It will change everything’: DeepMind’s AI makes gigantic leap in solving protein structures. Nature 588, 203–204. doi: 10.1038/d41586-020-03348-4


Caragea, C., Sinapov, J., Silvescu, A., Dobbs, D., and Honavar, V. (2007). Glycosylation site prediction using ensembles of Support Vector Machine classifiers. BMC Bioinformatics 8:438. doi: 10.1186/1471-2105-8-438


Chen, G., Cao, M., Luo, K., Wang, L., Wen, P., and Shi, S. (2018). ProAcePred: prokaryote lysine acetylation sites prediction based on elastic net feature optimization. Bioinformatics 34, 3999–4006. doi: 10.1093/bioinformatics/bty444


Chen, Y., Sprung, R., Tang, Y., Ball, H., Sangras, B., Kim, S. C., et al. (2007). Lysine propionylation and butyrylation are novel post-translational modifications in histones. Mol. Cell. Proteomics 6, 812–819. doi: 10.1074/mcp.m700021-mcp200


Cheng, Z., Tang, Y., Chen, Y., Kim, S., Liu, H., Li, S. S. C., et al. (2009). Molecular characterization of propionyllysines in non-histone proteins. Mol. Cell. Proteomics 8, 45–52. doi: 10.1074/mcp.m800224-mcp200


Cortes, C., and Vapnik, V. (1995). Support-vector networks. Mach. Learn. 20, 273–297.


de Brevern, A. G., Hasan, M. M., and Kurata, H. (2018). GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features. Plos One 13:e0200283. doi: 10.1371/journal.pone.0200283


Giles, C. L., Kuhn, G. M., and Williams, R. J. (1994). Dynamic recurrent neural networks: theory and applications. IEEE Trans. Neural Netw. 5, 153–156. doi: 10.1109/tnn.1994.8753425


Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R. R. (2012). Improving neural networks by preventing co-adaptation of feature detectors. arXiv [preprint] arXiv:1207.0580


Hochreiter, S., and Schmidhuber, J. (1997). Long short-term memory. Neural Comput. 9, 1735–1780.


Huang da, W., Sherman, B. T., and Lempicki, R. A. (2009). Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44–57. doi: 10.1038/nprot.2008.211


Huang, G., and Zeng, W. (2016). A discrete hidden markov model for detecting histone crotonyllysine sites. Match Commun. Math. Comput. Chem 75, 717–730.


Huang, G., Zheng, Y., Wu, Y.-Q., and Yu, Z.-G. (2020). An information entropy-based approach for computationally identifying histone lysine butyrylation. Front. Genet. 10:1325. doi: 10.3389/fgene.2019.01325


Huang, K.-Y., Hsu, J. B.-K., and Lee, T.-Y. (2019). Characterization and identification of lysine succinylation sites based on deep learning method. Sci. Rep. 9:16175.


Joachims, T. (1999). “Transductive inference for text classification using support vector machines,” in Paper Presented at International Conference on Machine Learning; 6/27/1999, Bled.


Ju, Z., and He, J. J. (2017). Prediction of lysine propionylation sites using biased SVM and incorporating four different sequence features into Chou’s PseAAC. J. Mol. Graph. Model. 76, 356–363. doi: 10.1016/j.jmgm.2017.07.022


Junqueira, S. C., Centeno, E. G. Z., Wilkinson, K. A., and Cimarosti, H. (2019). Post-translational modifications of Parkinson’s disease-related proteins: phosphorylation, SUMOylation and ubiquitination. Biochim. Biophys. Acta 1865, 2001–2007. doi: 10.1016/j.bbadis.2018.10.025


Kebede, A. F., Nieborak, A., Shahidian, L. Z., Le Gras, S., Richter, F., Gómez, D. A., et al. (2017). Histone propionylation is a mark of active chromatin. Nat. Struct. Mol. Biol. 24, 1048–1056. doi: 10.1038/nsmb.3490


Leemhuis, H., Packman, L. C., Nightingale, K. P., and Hollfelder, F. (2008). The human histone acetyltransferase P/CAF is a promiscuous histone propionyltransferase. Chembiochem 9, 499–503. doi: 10.1002/cbic.200700556


Li, S., Li, H., Li, M., Shyr, Y., Xie, L., and Li, Y. (2009). Improved prediction of lysine acetylation by support vector machines. Protein Pept. Lett. 16, 977–983. doi: 10.2174/092986609788923338


Li, W., and Godzik, A. (2006). Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659. doi: 10.1093/bioinformatics/btl158


Li, Y. X., Shao, Y. H., Jing, L., and Deng, N. Y. (2011). An efficient support vector machine approach for identifying protein S-nitrosylation sites. Protein Pept. Lett. 18, 573–587. doi: 10.2174/092986611795222731


Luo, F., Wang, M., Liu, Y., Zhao, X. M., and Li, A. (2019). DeepPhos: prediction of protein phosphorylation sites with deep learning. Bioinformatics 35, 2766–2773. doi: 10.1093/bioinformatics/bty1051


Lv, H., Dao, F. Y., Guan, Z. X., Yang, H., Li, Y. W., and Lin, H. (2020). Deep-Kcr: accurate detection of lysine crotonylation sites using deep learning method. Brief Bioinform. bbaa255. doi: 10.1093/bib/bbaa255


Malebary, S. J., Rehman, M. S. U., and Khan, Y. D. (2019). iCrotoK-PseAAC: identify lysine crotonylation sites by blending position relative statistical features according to the Chou’s 5-step rule. PLoS One 14:e0223993. doi: 10.1371/journal.pone.0223993


Martin, L., Latypova, X., and Terro, F. (2011). Post-translational modifications of tau protein: implications for Alzheimer’s disease. Neurochem. Int. 58, 458–471. doi: 10.1016/j.neuint.2010.12.023


Matic, N., Guyon, I., Denker, J., and Vapnik, V. (1993). “Writer-adaptation for on-line handwritten character recognition,” in Paper Presented at the 2nd International Conference on Document Analysis and Recognition; 10/20/1993, Tsukuba.


Nakamura, T., Prikhodko, O. A., Pirie, E., Nagar, S., Akhtar, M. W., Oh, C. K., et al. (2015). Aberrant protein S-nitrosylation contributes to the pathophysiology of neurodegenerative diseases. Neurobiol. Dis. 84, 99–108. doi: 10.1016/j.nbd.2015.03.017


Ning, Q., Yu, M., Ji, J., Ma, Z., and Zhao, X. (2019). Analysis and prediction of human acetylation using a cascade classifier based on support vector machine. BMC Bioinformatics 20:346. doi: 10.1186/s12859-019-2938-7


Ning, Q., Zhao, X., Bao, L., Ma, Z., and Zhao, X. (2018). Detecting Succinylation sites from protein sequences using ensemble support vector machine. BMC Bioinformatics 19:237. doi: 10.1186/s12859-018-2249-4


Okanishi, H., Kim, K., Masui, R., and Kuramitsu, S. (2014). Lysine propionylation is a prevalent post-translational modification in Thermus thermophilus. Mol. Cell. Proteomics 13, 2382–2398. doi: 10.1074/mcp.m113.035659


Okanishi, H., Kim, K., Masui, R., and Kuramitsu, S. (2017). Proteome-wide identification of lysine propionylation in thermophilic and mesophilic bacteria: Geobacillus kaustophilus, Thermus thermophilus, Escherichia coli, Bacillus subtilis, and Rhodothermus marinus. Extremophiles 21, 283–296. doi: 10.1007/s00792-016-0901-3


Osuna, E., Freund, R., and Girosit, F. (1997). “Training support vector machines: an application to face detection,” in Paper Presented at Computer Vision and Pattern Recognition; 6/17/1997, Los Alamitos.


Pearlmutter, B. A. (1989). Learning state space trajectories in recurrent neural networks. Neural Comput. 1, 263–269. doi: 10.1162/neco.1989.1.2.263


Plewczynski, D., Tkacz, A., Wyrwicz, L. S., Rychlewski, L., and Ginalski, K. (2008). AutoMotif Server for prediction of phosphorylation sites in proteins using support vector machine: 2007 update. J. Mol. Model. 14, 69–76. doi: 10.1007/s00894-007-0250-3


Pugalenthi, G., Kandaswamy, K. K., Suganthan, P. N., Sowdhamini, R., Martinetz, T., and Kolatkar, P. R. (2010). SMpred: a support vector machine approach to identify structural motifs in protein structure without using evolutionary information. J. Biomol. Struct. Dyn. 28, 405–414. doi: 10.1080/07391102.2010.10507369


Qian, Y., Ye, S., Zhang, Y., and Zhang, J. (2020). SUMO-Forest: a Cascade Forest based method for the prediction of SUMOylation sites on imbalanced data. Gene 741, 144536. doi: 10.1016/j.gene.2020.144536


Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R. (2014). Dropout: a simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 15, 1929–1958.


Sun, L., Liu, H., Zhang, L., and Meng, J. (2015). lncRScan-SVM: a tool for predicting long non-coding RNAs using support vector machine. PLoS One 10:e0139654. doi: 10.1371/journal.pone.0139654


Thapa, N., Chaudhari, M., McManus, S., Roy, K., Newman, R. H., Saigo, H., et al. (2020). DeepSuccinylSite: a deep learning based approach for protein succinylation site prediction. BMC Bioinformatics 21(Suppl 3):63. doi: 10.1186/s12859-020-3342-z


UniProt Consortium, T. (2018). UniProt: the universal protein knowledgebase. Nucleic Acids Res. 46:2699. doi: 10.1093/nar/gky092


Vapnik, V. N., and Vapnik, V. (1998). Statistical Learning Theory. New York, NY: Wiley.


Wang, D., Liang, Y., and Xu, D. (2019). Capsule network for protein post-translational modification site prediction. Bioinformatics 35, 2386–2394. doi: 10.1093/bioinformatics/bty977


Wang, L. N., Shi, S. P., Wen, P. P., Zhou, Z. Y., and Qiu, J. D. (2017). Computing prediction and functional analysis of prokaryotic propionylation. J. Chem. Inf. Model. 57, 2896–2904. doi: 10.1021/acs.jcim.7b00482


Wei, L., Xing, P., Shi, G., Ji, Z., and Zou, Q. (2019). Fast prediction of protein methylation sites using a sequence-based feature selection technique. IEEE ACM Trans. Comput. Biol. Bioinform. 16, 1264–1273. doi: 10.1109/tcbb.2017.2670558


Wei, L., Xing, P., Tang, J., and Zou, Q. (2017). PhosPred-RF: a novel sequence-based predictor for phosphorylation sites using sequential information only. IEEE Trans. Nanobiosci. 16, 240–247. doi: 10.1109/tnb.2017.2661756


Xiang, Q., Feng, K., Liao, B., Liu, Y., and Huang, G. (2017). Prediction of lysine malonylation sites based on pseudo amino acid. Comb. Chem. High Throughput Screen. 20, 622–628.


Xie, Y., Luo, X., Li, Y., Chen, L., Ma, W., Huang, J., et al. (2018). DeepNitro: prediction of protein nitration and nitrosylation sites by deep learning. Genomics Proteomics Bioinformatics 16, 294–306. doi: 10.1016/j.gpb.2018.04.007


Xu, H., Zhou, J., Lin, S., Deng, W., Zhang, Y., and Xue, Y. (2017). PLMD: an updated data resource of protein lysine modifications. J. Genet. Genomics 44, 243–250. doi: 10.1016/j.jgg.2017.03.007


Yang, M., Huang, H., and Ge, F. (2019). Lysine propionylation is a widespread post-translational modification involved in regulation of photosynthesis and metabolism in Cyanobacteria. Int J Mol Sci 20, 4792. doi: 10.3390/ijms20194792


Zhang, K., Chen, Y., Zhang, Z., and Zhao, Y. (2008). Identification and verification of lysine propionylation and butyrylation in Yeast core histones using PTMap software. J. Proteome Res. 8, 900–906. doi: 10.1021/pr8005155


Zhou, Y., Huang, T., Huang, G., Zhang, N., Kong, X., and Cai, Y.-D. (2016). Prediction of protein N-formylation and comparison with N-acetylation based on a feature selection method. Neurocomputing 217, 53–62. doi: 10.1016/j.neucom.2015.10.148


Keywords: propionylation, malonylation, deep learning, transfer learning, recurrent neural network, long short-term memory, support vector machine

Citation: Li A, Deng Y, Tan Y and Chen M (2021) A Transfer Learning-Based Approach for Lysine Propionylation Prediction. Front. Physiol. 12:658633. doi: 10.3389/fphys.2021.658633

Received: 27 January 2021; Accepted: 15 March 2021;
Published: 21 April 2021.

Edited by:

Yu Xue, Huazhong University of Science and Technology, China

Reviewed by:

Han Cheng, Zhengzhou University, China
Jian-Ding Qiu, Nanchang University, China
Yan Xu, University of Science and Technology Beijing, China

Copyright © 2021 Li, Deng, Tan and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yingwei Deng, dengyingwei@hnit.edu.cn; Min Chen, chenmin@hnit.edu.cn

†These authors share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.