Properties and distribution of pure GA-sequences of mammalian genomes.
The article describes DNA sequences of mammalian genomes that are longer than 50 bases, but consist exclusively of G's and A's ('pure GA-sequences'). Although their frequency of incidence should be 10(-16) or smaller, the chromosomes of human, chimpanzee, dog, cat, rat, and mouse...
Guardado en:
Autor principal: | |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Public Library of Science (PLoS)
2008
|
Materias: | |
Acceso en línea: | https://doaj.org/article/9c44b754b453487e8c4fe53d179d1196 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:9c44b754b453487e8c4fe53d179d1196 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:9c44b754b453487e8c4fe53d179d11962021-11-25T06:18:22ZProperties and distribution of pure GA-sequences of mammalian genomes.1932-620310.1371/journal.pone.0003818https://doaj.org/article/9c44b754b453487e8c4fe53d179d11962008-01-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/19043592/pdf/?tool=EBIhttps://doaj.org/toc/1932-6203The article describes DNA sequences of mammalian genomes that are longer than 50 bases, but consist exclusively of G's and A's ('pure GA-sequences'). Although their frequency of incidence should be 10(-16) or smaller, the chromosomes of human, chimpanzee, dog, cat, rat, and mouse contained many tens of thousands of them ubiquitously located along the chromosomes with a species-dependent density, reaching sizes of up to 1300 [b]. With the exception of a small number of poly-A-, poly-G-, poly-GA-, and poly-GAAA-sequences (combined <0.5%), all pure GA-sequences of the mammals tested were unique individuals, contained several repeated short GA-containing motifs, and shared a common hexa-nucleotide spectrum. At most 2% of the human GA-sequences were transcribed into mRNAs; all others were not coding for proteins. Although this could have made them less subject to natural selection, they contained many [corrected] times fewer point mutations than one should expect from the genome at large. As to the presence of other sequences with similarly restricted base contents, there were approximately as many pure TC-sequences as pure GA-sequences, but many fewer pure AC-, TA, and TG-sequences. There were practically no pure GC-sequences. The functions of pure GA-sequences are not known. Supported by a number of observations related to heat shock phenomena, the article speculates that they serve as genomic sign posts which may help guide polymerases and transcription factors to their proper targets, and/or as spatial linkers that help generate the 3-dimensional organization of chromatin.Guenter Albrecht-BuehlerPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 3, Iss 11, p e3818 (2008) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Medicine R Science Q |
spellingShingle |
Medicine R Science Q Guenter Albrecht-Buehler Properties and distribution of pure GA-sequences of mammalian genomes. |
description |
The article describes DNA sequences of mammalian genomes that are longer than 50 bases, but consist exclusively of G's and A's ('pure GA-sequences'). Although their frequency of incidence should be 10(-16) or smaller, the chromosomes of human, chimpanzee, dog, cat, rat, and mouse contained many tens of thousands of them ubiquitously located along the chromosomes with a species-dependent density, reaching sizes of up to 1300 [b]. With the exception of a small number of poly-A-, poly-G-, poly-GA-, and poly-GAAA-sequences (combined <0.5%), all pure GA-sequences of the mammals tested were unique individuals, contained several repeated short GA-containing motifs, and shared a common hexa-nucleotide spectrum. At most 2% of the human GA-sequences were transcribed into mRNAs; all others were not coding for proteins. Although this could have made them less subject to natural selection, they contained many [corrected] times fewer point mutations than one should expect from the genome at large. As to the presence of other sequences with similarly restricted base contents, there were approximately as many pure TC-sequences as pure GA-sequences, but many fewer pure AC-, TA, and TG-sequences. There were practically no pure GC-sequences. The functions of pure GA-sequences are not known. Supported by a number of observations related to heat shock phenomena, the article speculates that they serve as genomic sign posts which may help guide polymerases and transcription factors to their proper targets, and/or as spatial linkers that help generate the 3-dimensional organization of chromatin. |
format |
article |
author |
Guenter Albrecht-Buehler |
author_facet |
Guenter Albrecht-Buehler |
author_sort |
Guenter Albrecht-Buehler |
title |
Properties and distribution of pure GA-sequences of mammalian genomes. |
title_short |
Properties and distribution of pure GA-sequences of mammalian genomes. |
title_full |
Properties and distribution of pure GA-sequences of mammalian genomes. |
title_fullStr |
Properties and distribution of pure GA-sequences of mammalian genomes. |
title_full_unstemmed |
Properties and distribution of pure GA-sequences of mammalian genomes. |
title_sort |
properties and distribution of pure ga-sequences of mammalian genomes. |
publisher |
Public Library of Science (PLoS) |
publishDate |
2008 |
url |
https://doaj.org/article/9c44b754b453487e8c4fe53d179d1196 |
work_keys_str_mv |
AT guenteralbrechtbuehler propertiesanddistributionofpuregasequencesofmammaliangenomes |
_version_ |
1718413940007895040 |