Genome-wide binding analysis of 195 DNA binding proteins reveals "reservoir" promoters and human specific SVA-repeat family regulation.

A key aspect in defining cell state is the complex choreography of DNA binding events in a given cell type, which in turn establishes a cell-specific gene-expression program. Here we wanted to take a deep analysis of DNA binding events and transcriptional output of a single cell state (K562 cells)....

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Michael J Smallegan, Soraya Shehata, Savannah F Spradlin, Alison Swearingen, Graycen Wheeler, Arpan Das, Giulia Corbet, Benjamin Nebenfuehr, Daniel Ahrens, Devin Tauber, Shelby Lennon, Kevin Choi, Thao Huynh, Tom Wieser, Kristen Schneider, Michael Bradshaw, Joel Basken, Maria Lai, Timothy Read, Matt Hynes-Grace, Dan Timmons, Jon Demasi, John L Rinn
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/3581be7dbf384cbd93c22a41d69897a4
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:3581be7dbf384cbd93c22a41d69897a4
record_format dspace
spelling oai:doaj.org-article:3581be7dbf384cbd93c22a41d69897a42021-12-02T20:05:18ZGenome-wide binding analysis of 195 DNA binding proteins reveals "reservoir" promoters and human specific SVA-repeat family regulation.1932-620310.1371/journal.pone.0237055https://doaj.org/article/3581be7dbf384cbd93c22a41d69897a42021-01-01T00:00:00Zhttps://doi.org/10.1371/journal.pone.0237055https://doaj.org/toc/1932-6203A key aspect in defining cell state is the complex choreography of DNA binding events in a given cell type, which in turn establishes a cell-specific gene-expression program. Here we wanted to take a deep analysis of DNA binding events and transcriptional output of a single cell state (K562 cells). To this end we re-analyzed 195 DNA binding proteins contained in ENCODE data. We used standardized analysis pipelines, containerization, and literate programming with R Markdown for reproducibility and rigor. Our approach validated many findings from previous independent studies, underscoring the importance of ENCODE's goals in providing these reproducible data resources. We also had several new findings including: (i) 1,362 promoters, which we refer to as 'reservoirs,' that are defined by having up to 111 different DNA binding-proteins localized on one promoter, yet do not have any expression of steady-state RNA (ii) Reservoirs do not overlap super-enhancer annotations and distinct have distinct properties from super-enhancers. (iii) The human specific SVA repeat element may have been co-opted for enhancer regulation and is highly transcribed in PRO-seq and RNA-seq. Collectively, this study performed by the students of a CU Boulder computational biology class (BCHM 5631 -Spring 2020) demonstrates the value of reproducible findings and how resources like ENCODE that prioritize data standards can foster new findings with existing data in a didactic environment.Michael J SmalleganSoraya ShehataSavannah F SpradlinAlison SwearingenGraycen WheelerArpan DasGiulia CorbetBenjamin NebenfuehrDaniel AhrensDevin TauberShelby LennonKevin ChoiThao HuynhTom WieserKristen SchneiderMichael BradshawJoel BaskenMaria LaiTimothy ReadMatt Hynes-GraceDan TimmonsJon DemasiJohn L RinnPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 16, Iss 6, p e0237055 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Michael J Smallegan
Soraya Shehata
Savannah F Spradlin
Alison Swearingen
Graycen Wheeler
Arpan Das
Giulia Corbet
Benjamin Nebenfuehr
Daniel Ahrens
Devin Tauber
Shelby Lennon
Kevin Choi
Thao Huynh
Tom Wieser
Kristen Schneider
Michael Bradshaw
Joel Basken
Maria Lai
Timothy Read
Matt Hynes-Grace
Dan Timmons
Jon Demasi
John L Rinn
Genome-wide binding analysis of 195 DNA binding proteins reveals "reservoir" promoters and human specific SVA-repeat family regulation.
description A key aspect in defining cell state is the complex choreography of DNA binding events in a given cell type, which in turn establishes a cell-specific gene-expression program. Here we wanted to take a deep analysis of DNA binding events and transcriptional output of a single cell state (K562 cells). To this end we re-analyzed 195 DNA binding proteins contained in ENCODE data. We used standardized analysis pipelines, containerization, and literate programming with R Markdown for reproducibility and rigor. Our approach validated many findings from previous independent studies, underscoring the importance of ENCODE's goals in providing these reproducible data resources. We also had several new findings including: (i) 1,362 promoters, which we refer to as 'reservoirs,' that are defined by having up to 111 different DNA binding-proteins localized on one promoter, yet do not have any expression of steady-state RNA (ii) Reservoirs do not overlap super-enhancer annotations and distinct have distinct properties from super-enhancers. (iii) The human specific SVA repeat element may have been co-opted for enhancer regulation and is highly transcribed in PRO-seq and RNA-seq. Collectively, this study performed by the students of a CU Boulder computational biology class (BCHM 5631 -Spring 2020) demonstrates the value of reproducible findings and how resources like ENCODE that prioritize data standards can foster new findings with existing data in a didactic environment.
format article
author Michael J Smallegan
Soraya Shehata
Savannah F Spradlin
Alison Swearingen
Graycen Wheeler
Arpan Das
Giulia Corbet
Benjamin Nebenfuehr
Daniel Ahrens
Devin Tauber
Shelby Lennon
Kevin Choi
Thao Huynh
Tom Wieser
Kristen Schneider
Michael Bradshaw
Joel Basken
Maria Lai
Timothy Read
Matt Hynes-Grace
Dan Timmons
Jon Demasi
John L Rinn
author_facet Michael J Smallegan
Soraya Shehata
Savannah F Spradlin
Alison Swearingen
Graycen Wheeler
Arpan Das
Giulia Corbet
Benjamin Nebenfuehr
Daniel Ahrens
Devin Tauber
Shelby Lennon
Kevin Choi
Thao Huynh
Tom Wieser
Kristen Schneider
Michael Bradshaw
Joel Basken
Maria Lai
Timothy Read
Matt Hynes-Grace
Dan Timmons
Jon Demasi
John L Rinn
author_sort Michael J Smallegan
title Genome-wide binding analysis of 195 DNA binding proteins reveals "reservoir" promoters and human specific SVA-repeat family regulation.
title_short Genome-wide binding analysis of 195 DNA binding proteins reveals "reservoir" promoters and human specific SVA-repeat family regulation.
title_full Genome-wide binding analysis of 195 DNA binding proteins reveals "reservoir" promoters and human specific SVA-repeat family regulation.
title_fullStr Genome-wide binding analysis of 195 DNA binding proteins reveals "reservoir" promoters and human specific SVA-repeat family regulation.
title_full_unstemmed Genome-wide binding analysis of 195 DNA binding proteins reveals "reservoir" promoters and human specific SVA-repeat family regulation.
title_sort genome-wide binding analysis of 195 dna binding proteins reveals "reservoir" promoters and human specific sva-repeat family regulation.
publisher Public Library of Science (PLoS)
publishDate 2021
url https://doaj.org/article/3581be7dbf384cbd93c22a41d69897a4
work_keys_str_mv AT michaeljsmallegan genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT sorayashehata genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT savannahfspradlin genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT alisonswearingen genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT graycenwheeler genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT arpandas genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT giuliacorbet genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT benjaminnebenfuehr genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT danielahrens genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT devintauber genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT shelbylennon genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT kevinchoi genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT thaohuynh genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT tomwieser genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT kristenschneider genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT michaelbradshaw genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT joelbasken genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT marialai genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT timothyread genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT matthynesgrace genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT dantimmons genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT jondemasi genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
AT johnlrinn genomewidebindinganalysisof195dnabindingproteinsrevealsreservoirpromotersandhumanspecificsvarepeatfamilyregulation
_version_ 1718375472757211136