A systems biology approach to transcription factor binding site prediction.

<h4>Background</h4>The elucidation of mammalian transcriptional regulatory networks holds great promise for both basic and translational research and remains one the greatest challenges to systems biology. Recent reverse engineering methods deduce regulatory interactions from large-scale...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Xiang Zhou, Pavel Sumazin, Presha Rajbhandari, Andrea Califano
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2010
Materias:
R
Q
Acceso en línea:https://doaj.org/article/0a9d9f5eff2e495a88dbe571bff9d6b5
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:0a9d9f5eff2e495a88dbe571bff9d6b5
record_format dspace
spelling oai:doaj.org-article:0a9d9f5eff2e495a88dbe571bff9d6b52021-11-25T06:25:01ZA systems biology approach to transcription factor binding site prediction.1932-620310.1371/journal.pone.0009878https://doaj.org/article/0a9d9f5eff2e495a88dbe571bff9d6b52010-03-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/20360861/?tool=EBIhttps://doaj.org/toc/1932-6203<h4>Background</h4>The elucidation of mammalian transcriptional regulatory networks holds great promise for both basic and translational research and remains one the greatest challenges to systems biology. Recent reverse engineering methods deduce regulatory interactions from large-scale mRNA expression profiles and cross-species conserved regulatory regions in DNA. Technical challenges faced by these methods include distinguishing between direct and indirect interactions, associating transcription regulators with predicted transcription factor binding sites (TFBSs), identifying non-linearly conserved binding sites across species, and providing realistic accuracy estimates.<h4>Methodology/principal findings</h4>We address these challenges by closely integrating proven methods for regulatory network reverse engineering from mRNA expression data, linearly and non-linearly conserved regulatory region discovery, and TFBS evaluation and discovery. Using an extensive test set of high-likelihood interactions, which we collected in order to provide realistic prediction-accuracy estimates, we show that a careful integration of these methods leads to significant improvements in prediction accuracy. To verify our methods, we biochemically validated TFBS predictions made for both transcription factors (TFs) and co-factors; we validated binding site predictions made using a known E2F1 DNA-binding motif on E2F1 predicted promoter targets, known E2F1 and JUND motifs on JUND predicted promoter targets, and a de novo discovered motif for BCL6 on BCL6 predicted promoter targets. Finally, to demonstrate accuracy of prediction using an external dataset, we showed that sites matching predicted motifs for ZNF263 are significantly enriched in recent ZNF263 ChIP-seq data.<h4>Conclusions/significance</h4>Using an integrative framework, we were able to address technical challenges faced by state of the art network reverse engineering methods, leading to significant improvement in direct-interaction detection and TFBS-discovery accuracy. We estimated the accuracy of our framework on a human B-cell specific test set, which may help guide future methodological development.Xiang ZhouPavel SumazinPresha RajbhandariAndrea CalifanoPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 5, Iss 3, p e9878 (2010)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Xiang Zhou
Pavel Sumazin
Presha Rajbhandari
Andrea Califano
A systems biology approach to transcription factor binding site prediction.
description <h4>Background</h4>The elucidation of mammalian transcriptional regulatory networks holds great promise for both basic and translational research and remains one the greatest challenges to systems biology. Recent reverse engineering methods deduce regulatory interactions from large-scale mRNA expression profiles and cross-species conserved regulatory regions in DNA. Technical challenges faced by these methods include distinguishing between direct and indirect interactions, associating transcription regulators with predicted transcription factor binding sites (TFBSs), identifying non-linearly conserved binding sites across species, and providing realistic accuracy estimates.<h4>Methodology/principal findings</h4>We address these challenges by closely integrating proven methods for regulatory network reverse engineering from mRNA expression data, linearly and non-linearly conserved regulatory region discovery, and TFBS evaluation and discovery. Using an extensive test set of high-likelihood interactions, which we collected in order to provide realistic prediction-accuracy estimates, we show that a careful integration of these methods leads to significant improvements in prediction accuracy. To verify our methods, we biochemically validated TFBS predictions made for both transcription factors (TFs) and co-factors; we validated binding site predictions made using a known E2F1 DNA-binding motif on E2F1 predicted promoter targets, known E2F1 and JUND motifs on JUND predicted promoter targets, and a de novo discovered motif for BCL6 on BCL6 predicted promoter targets. Finally, to demonstrate accuracy of prediction using an external dataset, we showed that sites matching predicted motifs for ZNF263 are significantly enriched in recent ZNF263 ChIP-seq data.<h4>Conclusions/significance</h4>Using an integrative framework, we were able to address technical challenges faced by state of the art network reverse engineering methods, leading to significant improvement in direct-interaction detection and TFBS-discovery accuracy. We estimated the accuracy of our framework on a human B-cell specific test set, which may help guide future methodological development.
format article
author Xiang Zhou
Pavel Sumazin
Presha Rajbhandari
Andrea Califano
author_facet Xiang Zhou
Pavel Sumazin
Presha Rajbhandari
Andrea Califano
author_sort Xiang Zhou
title A systems biology approach to transcription factor binding site prediction.
title_short A systems biology approach to transcription factor binding site prediction.
title_full A systems biology approach to transcription factor binding site prediction.
title_fullStr A systems biology approach to transcription factor binding site prediction.
title_full_unstemmed A systems biology approach to transcription factor binding site prediction.
title_sort systems biology approach to transcription factor binding site prediction.
publisher Public Library of Science (PLoS)
publishDate 2010
url https://doaj.org/article/0a9d9f5eff2e495a88dbe571bff9d6b5
work_keys_str_mv AT xiangzhou asystemsbiologyapproachtotranscriptionfactorbindingsiteprediction
AT pavelsumazin asystemsbiologyapproachtotranscriptionfactorbindingsiteprediction
AT presharajbhandari asystemsbiologyapproachtotranscriptionfactorbindingsiteprediction
AT andreacalifano asystemsbiologyapproachtotranscriptionfactorbindingsiteprediction
AT xiangzhou systemsbiologyapproachtotranscriptionfactorbindingsiteprediction
AT pavelsumazin systemsbiologyapproachtotranscriptionfactorbindingsiteprediction
AT presharajbhandari systemsbiologyapproachtotranscriptionfactorbindingsiteprediction
AT andreacalifano systemsbiologyapproachtotranscriptionfactorbindingsiteprediction
_version_ 1718413723105755136