An ensemble method approach to investigate kinase-specific phosphorylation sites

Sutapa Datta, Subhasis MukhopadhyayDepartment of Biophysics, Molecular Biology and Bioinformatics and Distributed Information Centre for Bioinformatics, University of Calcutta, West Bengal, IndiaAbstract: Protein phosphorylation is one of the most significant and well-studied post-translational mod...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Datta S, Mukhopadhyay S
Formato: article
Lenguaje:EN
Publicado: Dove Medical Press 2014
Materias:
Acceso en línea:https://doaj.org/article/a4669c7b0e7a46cda659c9982fd8f0b6
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
Descripción
Sumario:Sutapa Datta, Subhasis MukhopadhyayDepartment of Biophysics, Molecular Biology and Bioinformatics and Distributed Information Centre for Bioinformatics, University of Calcutta, West Bengal, IndiaAbstract: Protein phosphorylation is one of the most significant and well-studied post-translational modifications, and it plays an important role in various cellular processes. It has made a considerable impact in understanding the protein functions which are involved in revealing signal transductions and various diseases. The identification of kinase-specific phosphorylation sites has an important role in elucidating the mechanism of phosphorylation; however, experimental techniques for identifying phosphorylation sites are labor intensive and expensive. An exponentially increasing number of protein sequences generated by various laboratories across the globe require computer-aided procedures for reliably and quickly identifying the phosphorylation sites, opening a new horizon for in silico analysis. In this regard, we have introduced a novel ensemble method where we have selected three classifiers (least square support vector machine, multilayer perceptron, and k-Nearest Neighbor) and three different feature encoding parameters (dipeptide composition, physicochemical properties of amino acids, and protein–protein similarity score). Each of these classifiers is trained on each of the three different parameter systems. The final results of the ensemble method are obtained by fusing the results of all the classifiers by a weighted voting algorithm. Extensive experiments reveal that our proposed method can successfully predict phosphorylation sites in a kinase-specific manner and performs significantly better when compared with other existing phosphorylation site prediction methods.Keywords: post-translational modification, cell signaling, phosphate