Revealing missing parts of the interactome via link prediction.

Protein interaction networks (PINs) are often used to "learn" new biological function from their topology. Since current PINs are noisy, their computational de-noising via link prediction (LP) could improve the learning accuracy. LP uses the existing PIN topology to predict missing and spu...

Bibliographic Details
Main authors: Yuriy Hulovatyy, Ryan W Solava, Tijana Milenković
Format: article
Language: EN
Published: Public Library of Science (PLoS) 2014
Subjects:
R
Q
Online access: https://doaj.org/article/58dfd8fdee8747f69354e650b9749c63
id oai:doaj.org-article:58dfd8fdee8747f69354e650b9749c63
record_format dspace
spelling oai:doaj.org-article:58dfd8fdee8747f69354e650b9749c63
2021-11-18T08:30:06Z
Revealing missing parts of the interactome via link prediction.
1932-6203
10.1371/journal.pone.0090073
https://doaj.org/article/58dfd8fdee8747f69354e650b9749c63
2014-01-01T00:00:00Z
https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/24594900/pdf/?tool=EBI
https://doaj.org/toc/1932-6203
Yuriy Hulovatyy
Ryan W Solava
Tijana Milenković
Public Library of Science (PLoS)
article
Medicine
R
Science
Q
EN
PLoS ONE, Vol 9, Iss 3, p e90073 (2014)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
description Protein interaction networks (PINs) are often used to "learn" new biological function from their topology. Since current PINs are noisy, their computational de-noising via link prediction (LP) could improve the learning accuracy. LP uses the existing PIN topology to predict missing and spurious links. Many existing LP methods rely on shared immediate neighborhoods of the nodes to be linked. As such, they have limitations. Thus, to comprehensively study which topological properties of nodes in PINs dictate whether the nodes should be linked, we introduce novel sensitive LP measures that are expected to overcome the limitations of the existing methods. We systematically evaluate the new and existing LP measures by introducing "synthetic" noise into PINs and measuring how accurately the measures reconstruct the original PINs. Also, we use the LP measures to de-noise the original PINs, and we measure the biological correctness of the de-noised PINs with respect to functional enrichment of the predicted interactions. Our main findings are: 1) LP measures that favor nodes which are both "topologically similar" and have large shared extended neighborhoods are superior; 2) using more network topology often, though not always, improves LP accuracy; and 3) LP improves the biological correctness of the PINs, and we validate a significant portion of the predicted interactions in independent, external PIN data sources. Ultimately, we focus less on identifying a superior method than on showing that LP improves the biological correctness of PINs, which is its ultimate goal in computational biology. However, we note that our new methods outperform each of the existing ones with respect to at least one evaluation criterion. Alarmingly, we find that the different criteria often disagree in identifying the best method(s), which has important implications for LP communities in any domain, including social networks.
format article
author Yuriy Hulovatyy
Ryan W Solava
Tijana Milenković
title Revealing missing parts of the interactome via link prediction.
publisher Public Library of Science (PLoS)
publishDate 2014
url https://doaj.org/article/58dfd8fdee8747f69354e650b9749c63