Abstract
As one of the most prevailing post-translational modifications, phosphorylation is vital in regulating almost every cellular behavior. In this paper, we propose a new computational method that can effectively identify phosphorylation sites by using optimally chosen properties. The highlight of our method is that the optimal combination of features was selected from a set of 165 novel structural neighborhood properties by a random forest feature selection method. And then an ensemble learning method based on support vector machine was used to build the prediction model. Experimental results obtained from cross validation and independent test suggested that our method achieved a significant improvement on the prediction quality. Promising results were obtained after being compared with the state-of-the-art approaches using independent dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Steen, H., Jebanathirajah, J.A., Rush, J., Morrice, N., Kirschner, M.W.: Phosphorylation Analysis by Mass Spectrometry Myths, Facts, and the Consequences for Qualitative and Quantitative Measurements. Molecular & Cellular Proteomics 5(1), 172–181 (2006)
Delom, F., Chevet, E.: Phosphoprotein Analysis: From Proteins to Proteomes. Proteome Science 4(1), 15 (2006)
Pinna, L.A., Ruzzene, M.: How Do Protein Kinases Recognize Their Substrates?: Biochimica et Biophysica Acta (BBA)-Molecular Cell Research 1314(3), 191–225 (1996)
Iakoucheva, L.M., Radivojac, P., Brown, C.J., O’Connor, T.R., Sikes, J.G., Obradovic, Z., Dunker, A.K.: The Importance of Intrinsic Disorder for Protein Phosphorylation. Nucleic Acids Research 32(3), 1037–1049 (2004)
Blom, N., Gammeltoft, S., Brunak, S.: Sequence and Structure-Based Prediction of Eukaryotic Protein Phosphorylation Sites. Journal of Molecular Biology 294(5), 1351–1362 (1999)
Xue, Y., Ren, J., Gao, X., Jin, C., Wen, L., Yao, X.: Gps 2.0, a Tool to Predict Kinase-Specific Phosphorylation Sites in Hierarchy. Molecular & Cellular Proteomics 7(9), 1598–1608 (2008)
Xue, Y., Li, A., Wang, L., Feng, H., Yao, X.: Ppsp: Prediction of Pk-Specific Phosphorylation Site with Bayesian Decision Theory. BMC Bioinformatics 7(1), 163 (2006)
Wong, Y.-H., Lee, T.-Y., Liang, H.-K., Huang, C.-M., Wang, T.-Y., Yang, Y.-H., Chu, C.-H., Huang, H.-D., Ko, M.-T., Hwang, J.-K.: Kinasephos 2.0: A Web Server for Identifying Protein Kinase-Specific Phosphorylation Sites Based on Sequences and Coupling Patterns. Nucleic Acids Research 35(suppl. 2), W588–W594 (2007)
Zanzoni, A., Ausiello, G., Via, A., Gherardini, P.F., Helmer-Citterich, M.: Phospho3d: A Database of Three-Dimensional Structures of Protein Phosphorylation Sites. Nucleic Acids Research 35(suppl. 1), D229–D231 (2007)
Zanzoni, A., Carbajo, D., Diella, F., Gherardini, P.F., Tramontano, A., Helmer-Citterich, M., Via, A.: Phospho3d 2.0: An Enhanced Database of Three-Dimensional Structures of Phosphorylation Sites. Nucleic Acids Research 39(suppl. 1), D268–D271 (2011)
Huang, Y., Niu, B., Gao, Y., Fu, L., Li, W.: Cd-Hit Suite: A Web Server for Clustering and Comparing Biological Sequences. Bioinformatics 26(5), 680–682 (2010)
Deng, L., Guan, J., Wei, X., Yi, Y., Zhang, Q.C., Zhou, S.: Boosting Prediction Performance of Protein–Protein Interaction Hot Spots by Using Structural Neighborhood Properties. Journal of Computational Biology 20(11), 878–891 (2013)
Liaw, A., Wiener, M.: Classification and Regression by Randomforest. R News 2(3), 18–22 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Gao, Y., Hao, W., Chen, Z., Deng, L. (2014). Structure-Based Prediction of Protein Phosphorylation Sites Using an Ensemble Approach. In: Huang, DS., Han, K., Gromiha, M. (eds) Intelligent Computing in Bioinformatics. ICIC 2014. Lecture Notes in Computer Science(), vol 8590. Springer, Cham. https://doi.org/10.1007/978-3-319-09330-7_15
Download citation
DOI: https://doi.org/10.1007/978-3-319-09330-7_15
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09329-1
Online ISBN: 978-3-319-09330-7
eBook Packages: Computer ScienceComputer Science (R0)