A systems biology approach to transcription factor binding site prediction.
<h4>Background</h4>The elucidation of mammalian transcriptional regulatory networks holds great promise for both basic and translational research and remains one the greatest challenges to systems biology. Recent reverse engineering methods deduce regulatory interactions from large-scale...
Guardado en:
Autores principales: | , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Public Library of Science (PLoS)
2010
|
Materias: | |
Acceso en línea: | https://doaj.org/article/0a9d9f5eff2e495a88dbe571bff9d6b5 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:0a9d9f5eff2e495a88dbe571bff9d6b5 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:0a9d9f5eff2e495a88dbe571bff9d6b52021-11-25T06:25:01ZA systems biology approach to transcription factor binding site prediction.1932-620310.1371/journal.pone.0009878https://doaj.org/article/0a9d9f5eff2e495a88dbe571bff9d6b52010-03-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/20360861/?tool=EBIhttps://doaj.org/toc/1932-6203<h4>Background</h4>The elucidation of mammalian transcriptional regulatory networks holds great promise for both basic and translational research and remains one the greatest challenges to systems biology. Recent reverse engineering methods deduce regulatory interactions from large-scale mRNA expression profiles and cross-species conserved regulatory regions in DNA. Technical challenges faced by these methods include distinguishing between direct and indirect interactions, associating transcription regulators with predicted transcription factor binding sites (TFBSs), identifying non-linearly conserved binding sites across species, and providing realistic accuracy estimates.<h4>Methodology/principal findings</h4>We address these challenges by closely integrating proven methods for regulatory network reverse engineering from mRNA expression data, linearly and non-linearly conserved regulatory region discovery, and TFBS evaluation and discovery. Using an extensive test set of high-likelihood interactions, which we collected in order to provide realistic prediction-accuracy estimates, we show that a careful integration of these methods leads to significant improvements in prediction accuracy. To verify our methods, we biochemically validated TFBS predictions made for both transcription factors (TFs) and co-factors; we validated binding site predictions made using a known E2F1 DNA-binding motif on E2F1 predicted promoter targets, known E2F1 and JUND motifs on JUND predicted promoter targets, and a de novo discovered motif for BCL6 on BCL6 predicted promoter targets. Finally, to demonstrate accuracy of prediction using an external dataset, we showed that sites matching predicted motifs for ZNF263 are significantly enriched in recent ZNF263 ChIP-seq data.<h4>Conclusions/significance</h4>Using an integrative framework, we were able to address technical challenges faced by state of the art network reverse engineering methods, leading to significant improvement in direct-interaction detection and TFBS-discovery accuracy. We estimated the accuracy of our framework on a human B-cell specific test set, which may help guide future methodological development.Xiang ZhouPavel SumazinPresha RajbhandariAndrea CalifanoPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 5, Iss 3, p e9878 (2010) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Medicine R Science Q |
spellingShingle |
Medicine R Science Q Xiang Zhou Pavel Sumazin Presha Rajbhandari Andrea Califano A systems biology approach to transcription factor binding site prediction. |
description |
<h4>Background</h4>The elucidation of mammalian transcriptional regulatory networks holds great promise for both basic and translational research and remains one the greatest challenges to systems biology. Recent reverse engineering methods deduce regulatory interactions from large-scale mRNA expression profiles and cross-species conserved regulatory regions in DNA. Technical challenges faced by these methods include distinguishing between direct and indirect interactions, associating transcription regulators with predicted transcription factor binding sites (TFBSs), identifying non-linearly conserved binding sites across species, and providing realistic accuracy estimates.<h4>Methodology/principal findings</h4>We address these challenges by closely integrating proven methods for regulatory network reverse engineering from mRNA expression data, linearly and non-linearly conserved regulatory region discovery, and TFBS evaluation and discovery. Using an extensive test set of high-likelihood interactions, which we collected in order to provide realistic prediction-accuracy estimates, we show that a careful integration of these methods leads to significant improvements in prediction accuracy. To verify our methods, we biochemically validated TFBS predictions made for both transcription factors (TFs) and co-factors; we validated binding site predictions made using a known E2F1 DNA-binding motif on E2F1 predicted promoter targets, known E2F1 and JUND motifs on JUND predicted promoter targets, and a de novo discovered motif for BCL6 on BCL6 predicted promoter targets. Finally, to demonstrate accuracy of prediction using an external dataset, we showed that sites matching predicted motifs for ZNF263 are significantly enriched in recent ZNF263 ChIP-seq data.<h4>Conclusions/significance</h4>Using an integrative framework, we were able to address technical challenges faced by state of the art network reverse engineering methods, leading to significant improvement in direct-interaction detection and TFBS-discovery accuracy. We estimated the accuracy of our framework on a human B-cell specific test set, which may help guide future methodological development. |
format |
article |
author |
Xiang Zhou Pavel Sumazin Presha Rajbhandari Andrea Califano |
author_facet |
Xiang Zhou Pavel Sumazin Presha Rajbhandari Andrea Califano |
author_sort |
Xiang Zhou |
title |
A systems biology approach to transcription factor binding site prediction. |
title_short |
A systems biology approach to transcription factor binding site prediction. |
title_full |
A systems biology approach to transcription factor binding site prediction. |
title_fullStr |
A systems biology approach to transcription factor binding site prediction. |
title_full_unstemmed |
A systems biology approach to transcription factor binding site prediction. |
title_sort |
systems biology approach to transcription factor binding site prediction. |
publisher |
Public Library of Science (PLoS) |
publishDate |
2010 |
url |
https://doaj.org/article/0a9d9f5eff2e495a88dbe571bff9d6b5 |
work_keys_str_mv |
AT xiangzhou asystemsbiologyapproachtotranscriptionfactorbindingsiteprediction AT pavelsumazin asystemsbiologyapproachtotranscriptionfactorbindingsiteprediction AT presharajbhandari asystemsbiologyapproachtotranscriptionfactorbindingsiteprediction AT andreacalifano asystemsbiologyapproachtotranscriptionfactorbindingsiteprediction AT xiangzhou systemsbiologyapproachtotranscriptionfactorbindingsiteprediction AT pavelsumazin systemsbiologyapproachtotranscriptionfactorbindingsiteprediction AT presharajbhandari systemsbiologyapproachtotranscriptionfactorbindingsiteprediction AT andreacalifano systemsbiologyapproachtotranscriptionfactorbindingsiteprediction |
_version_ |
1718413723105755136 |