Origin Replication Complex Binding, Nucleosome Depletion Patterns, and a Primary Sequence Motif Can Predict Origins of Replication in a Genome with Epigenetic Centromeres
ABSTRACT Origins of DNA replication are key genetic elements, yet their identification remains elusive in most organisms. In previous work, we found that centromeres contain origins of replication (ORIs) that are determined epigenetically in the pathogenic yeast Candida albicans. In this study, we u...
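Main authors: | Hung-Ji Tsai, Joshua A. Baller, Ivan Liachko, Amnon Koren, Laura S. Burrack, Meleah A. Hickman, Mathuravani A. Thevandavakkam, Laura N. Rusche, Judith Berman |
---|---|
Format: | article |
Language: | EN |
Published: | American Society for Microbiology, 2014 |
Subjects: | Microbiology; QR1-502 |
Online access: | https://doaj.org/article/70e68d4e98f34ef9b64f1dcf46320fb4 |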
id |
oai:doaj.org-article:70e68d4e98f34ef9b64f1dcf46320fb4 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:70e68d4e98f34ef9b64f1dcf46320fb4; 2021-11-15T15:45:55Z; Origin Replication Complex Binding, Nucleosome Depletion Patterns, and a Primary Sequence Motif Can Predict Origins of Replication in a Genome with Epigenetic Centromeres; 10.1128/mBio.01703-14; 2150-7511; https://doaj.org/article/70e68d4e98f34ef9b64f1dcf46320fb4; 2014-10-01T00:00:00Z; https://journals.asm.org/doi/10.1128/mBio.01703-14; https://doaj.org/toc/2150-7511; Hung-Ji Tsai; Joshua A. Baller; Ivan Liachko; Amnon Koren; Laura S. Burrack; Meleah A. Hickman; Mathuravani A. Thevandavakkam; Laura N. Rusche; Judith Berman; American Society for Microbiology; article; Microbiology; QR1-502; EN; mBio, Vol 5, Iss 5 (2014) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Microbiology; QR1-502 |
description |
ABSTRACT Origins of DNA replication are key genetic elements, yet their identification remains elusive in most organisms. In previous work, we found that centromeres contain origins of replication (ORIs) that are determined epigenetically in the pathogenic yeast Candida albicans. In this study, we used origin recognition complex (ORC) binding and nucleosome occupancy patterns in Saccharomyces cerevisiae and Kluyveromyces lactis to train a machine learning algorithm to predict the position of active arm (noncentromeric) origins in the C. albicans genome. The model identified bona fide active origins as determined by the presence of replication intermediates on nondenaturing two-dimensional (2D) gels. Importantly, these origins function at their native chromosomal loci and also as autonomously replicating sequences (ARSs) on a linear plasmid. A “mini-ARS screen” identified at least one and often two ARS regions of ≥100 bp within each bona fide origin. Furthermore, a 15-bp AC-rich consensus motif was associated with the predicted origins and conferred autonomous replicating activity to the mini-ARSs. Thus, while centromeres and the origins associated with them are epigenetic, arm origins are dependent upon critical DNA features, such as a binding site for ORC and a propensity for nucleosome exclusion.
IMPORTANCE DNA replication machinery is highly conserved, yet the definition of exactly what specifies a replication origin differs in different species. Here, we utilized computational genomics to predict origin locations in Candida albicans by combining locations of binding sites for the conserved origin replication complex, necessary for replication initiation, together with chromatin organization patterns. We identified predicted sequences that exhibited bona fide origin function and developed a linear plasmid assay to delimit the DNA fragments necessary for origin function. Additionally, we found that a short AC-rich motif, which is enriched in predicted origins, is required for origin function. Thus, we demonstrated a new machine learning paradigm for identification of potential origins from a genome with no prior information. Furthermore, this work suggests that C. albicans has two different types of origins: “hard-wired” arm origins that rely upon specific sequence motifs and “epigenetic” centromeric origins that are recruited to kinetochores in a sequence-independent manner. |
format |
article |
author |
Hung-Ji Tsai; Joshua A. Baller; Ivan Liachko; Amnon Koren; Laura S. Burrack; Meleah A. Hickman; Mathuravani A. Thevandavakkam; Laura N. Rusche; Judith Berman |
author_sort |
Hung-Ji Tsai |
title |
Origin Replication Complex Binding, Nucleosome Depletion Patterns, and a Primary Sequence Motif Can Predict Origins of Replication in a Genome with Epigenetic Centromeres |
publisher |
American Society for Microbiology |
publishDate |
2014 |
url |
https://doaj.org/article/70e68d4e98f34ef9b64f1dcf46320fb4 |
_version_ |
1718427524644470784 |