An ensemble method approach to investigate kinase-specific phosphorylation sites

Sutapa Datta, Subhasis MukhopadhyayDepartment of Biophysics, Molecular Biology and Bioinformatics and Distributed Information Centre for Bioinformatics, University of Calcutta, West Bengal, IndiaAbstract: Protein phosphorylation is one of the most significant and well-studied post-translational mod...

Description complète

Enregistré dans:
Détails bibliographiques
Auteurs principaux: Datta S, Mukhopadhyay S
Format: article
Langue:EN
Publié: Dove Medical Press 2014
Sujets:
Accès en ligne:https://doaj.org/article/a4669c7b0e7a46cda659c9982fd8f0b6
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
Description
Résumé:Sutapa Datta, Subhasis MukhopadhyayDepartment of Biophysics, Molecular Biology and Bioinformatics and Distributed Information Centre for Bioinformatics, University of Calcutta, West Bengal, IndiaAbstract: Protein phosphorylation is one of the most significant and well-studied post-translational modifications, and it plays an important role in various cellular processes. It has made a considerable impact in understanding the protein functions which are involved in revealing signal transductions and various diseases. The identification of kinase-specific phosphorylation sites has an important role in elucidating the mechanism of phosphorylation; however, experimental techniques for identifying phosphorylation sites are labor intensive and expensive. An exponentially increasing number of protein sequences generated by various laboratories across the globe require computer-aided procedures for reliably and quickly identifying the phosphorylation sites, opening a new horizon for in silico analysis. In this regard, we have introduced a novel ensemble method where we have selected three classifiers (least square support vector machine, multilayer perceptron, and k-Nearest Neighbor) and three different feature encoding parameters (dipeptide composition, physicochemical properties of amino acids, and protein–protein similarity score). Each of these classifiers is trained on each of the three different parameter systems. The final results of the ensemble method are obtained by fusing the results of all the classifiers by a weighted voting algorithm. Extensive experiments reveal that our proposed method can successfully predict phosphorylation sites in a kinase-specific manner and performs significantly better when compared with other existing phosphorylation site prediction methods.Keywords: post-translational modification, cell signaling, phosphate