Prediction of deleterious non-synonymous SNPs based on protein interaction network and hybrid properties.

Non-synonymous SNPs (nsSNPs), also known as Single Amino acid Polymorphisms (SAPs) account for the majority of human inherited diseases. It is important to distinguish the deleterious SAPs from neutral ones. Most traditional computational methods to classify SAPs are based on sequential or structura...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Tao Huang, Ping Wang, Zhi-Qiang Ye, Heng Xu, Zhisong He, Kai-Yan Feng, Lele Hu, Weiren Cui, Kai Wang, Xiao Dong, Lu Xie, Xiangyin Kong, Yu-Dong Cai, Yixue Li
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2010
Materias:
R
Q
Acceso en línea:https://doaj.org/article/9d419a84d9284c4ba304fe00b716ff45
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:9d419a84d9284c4ba304fe00b716ff45
record_format dspace
spelling oai:doaj.org-article:9d419a84d9284c4ba304fe00b716ff452021-11-18T06:36:32ZPrediction of deleterious non-synonymous SNPs based on protein interaction network and hybrid properties.1932-620310.1371/journal.pone.0011900https://doaj.org/article/9d419a84d9284c4ba304fe00b716ff452010-07-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/20689580/?tool=EBIhttps://doaj.org/toc/1932-6203Non-synonymous SNPs (nsSNPs), also known as Single Amino acid Polymorphisms (SAPs) account for the majority of human inherited diseases. It is important to distinguish the deleterious SAPs from neutral ones. Most traditional computational methods to classify SAPs are based on sequential or structural features. However, these features cannot fully explain the association between a SAP and the observed pathophysiological phenotype. We believe the better rationale for deleterious SAP prediction should be: If a SAP lies in the protein with important functions and it can change the protein sequence and structure severely, it is more likely related to disease. So we established a method to predict deleterious SAPs based on both protein interaction network and traditional hybrid properties. Each SAP is represented by 472 features that include sequential features, structural features and network features. Maximum Relevance Minimum Redundancy (mRMR) method and Incremental Feature Selection (IFS) were applied to obtain the optimal feature set and the prediction model was Nearest Neighbor Algorithm (NNA). In jackknife cross-validation, 83.27% of SAPs were correctly predicted when the optimized 263 features were used. The optimized predictor with 263 features was also tested in an independent dataset and the accuracy was still 80.00%. In contrast, SIFT, a widely used predictor of deleterious SAPs based on sequential features, has a prediction accuracy of 71.05% on the same dataset. In our study, network features were found to be most important for accurate prediction and can significantly improve the prediction performance. Our results suggest that the protein interaction context could provide important clues to help better illustrate SAP's functional association. This research will facilitate the post genome-wide association studies.Tao HuangPing WangZhi-Qiang YeHeng XuZhisong HeKai-Yan FengLele HuWeiren CuiKai WangXiao DongLu XieXiangyin KongYu-Dong CaiYixue LiPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 5, Iss 7, p e11900 (2010)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Tao Huang
Ping Wang
Zhi-Qiang Ye
Heng Xu
Zhisong He
Kai-Yan Feng
Lele Hu
Weiren Cui
Kai Wang
Xiao Dong
Lu Xie
Xiangyin Kong
Yu-Dong Cai
Yixue Li
Prediction of deleterious non-synonymous SNPs based on protein interaction network and hybrid properties.
description Non-synonymous SNPs (nsSNPs), also known as Single Amino acid Polymorphisms (SAPs) account for the majority of human inherited diseases. It is important to distinguish the deleterious SAPs from neutral ones. Most traditional computational methods to classify SAPs are based on sequential or structural features. However, these features cannot fully explain the association between a SAP and the observed pathophysiological phenotype. We believe the better rationale for deleterious SAP prediction should be: If a SAP lies in the protein with important functions and it can change the protein sequence and structure severely, it is more likely related to disease. So we established a method to predict deleterious SAPs based on both protein interaction network and traditional hybrid properties. Each SAP is represented by 472 features that include sequential features, structural features and network features. Maximum Relevance Minimum Redundancy (mRMR) method and Incremental Feature Selection (IFS) were applied to obtain the optimal feature set and the prediction model was Nearest Neighbor Algorithm (NNA). In jackknife cross-validation, 83.27% of SAPs were correctly predicted when the optimized 263 features were used. The optimized predictor with 263 features was also tested in an independent dataset and the accuracy was still 80.00%. In contrast, SIFT, a widely used predictor of deleterious SAPs based on sequential features, has a prediction accuracy of 71.05% on the same dataset. In our study, network features were found to be most important for accurate prediction and can significantly improve the prediction performance. Our results suggest that the protein interaction context could provide important clues to help better illustrate SAP's functional association. This research will facilitate the post genome-wide association studies.
format article
author Tao Huang
Ping Wang
Zhi-Qiang Ye
Heng Xu
Zhisong He
Kai-Yan Feng
Lele Hu
Weiren Cui
Kai Wang
Xiao Dong
Lu Xie
Xiangyin Kong
Yu-Dong Cai
Yixue Li
author_facet Tao Huang
Ping Wang
Zhi-Qiang Ye
Heng Xu
Zhisong He
Kai-Yan Feng
Lele Hu
Weiren Cui
Kai Wang
Xiao Dong
Lu Xie
Xiangyin Kong
Yu-Dong Cai
Yixue Li
author_sort Tao Huang
title Prediction of deleterious non-synonymous SNPs based on protein interaction network and hybrid properties.
title_short Prediction of deleterious non-synonymous SNPs based on protein interaction network and hybrid properties.
title_full Prediction of deleterious non-synonymous SNPs based on protein interaction network and hybrid properties.
title_fullStr Prediction of deleterious non-synonymous SNPs based on protein interaction network and hybrid properties.
title_full_unstemmed Prediction of deleterious non-synonymous SNPs based on protein interaction network and hybrid properties.
title_sort prediction of deleterious non-synonymous snps based on protein interaction network and hybrid properties.
publisher Public Library of Science (PLoS)
publishDate 2010
url https://doaj.org/article/9d419a84d9284c4ba304fe00b716ff45
work_keys_str_mv AT taohuang predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT pingwang predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT zhiqiangye predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT hengxu predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT zhisonghe predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT kaiyanfeng predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT lelehu predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT weirencui predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT kaiwang predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT xiaodong predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT luxie predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT xiangyinkong predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT yudongcai predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT yixueli predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
_version_ 1718424432736731136