Variation in DNA substitution rates among lineages erroneously inferred from simulated clock-like data.

<h4>Background</h4>The observation of variation in substitution rates among lineages has led to (1) a general rejection of the molecular clock model, and (2) the suggestion that a number of biological characteristics of organisms can cause rate variation. Accurate estimates of rate varia...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Rachel S Schwartz, Rachel Lockridge Mueller
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2010
Materias:
R
Q
Acceso en línea:https://doaj.org/article/a3085f1c1b8047cc9383846339127880
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:a3085f1c1b8047cc9383846339127880
record_format dspace
spelling oai:doaj.org-article:a3085f1c1b8047cc93838463391278802021-11-25T06:25:30ZVariation in DNA substitution rates among lineages erroneously inferred from simulated clock-like data.1932-620310.1371/journal.pone.0009649https://doaj.org/article/a3085f1c1b8047cc93838463391278802010-03-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/20300176/pdf/?tool=EBIhttps://doaj.org/toc/1932-6203<h4>Background</h4>The observation of variation in substitution rates among lineages has led to (1) a general rejection of the molecular clock model, and (2) the suggestion that a number of biological characteristics of organisms can cause rate variation. Accurate estimates of rate variation, and thus accurate inferences regarding the causes of rate variation, depend on accurate estimates of substitution rates. However, theory suggests that even when the substitution process is clock-like, variable numbers of substitutions can occur among lineages because the substitution process is stochastic. Furthermore, substitution rates along lineages can be misestimated, particularly when multiple substitutions occur at some sites. Although these potential causes of error in rate estimation are well understood in theory, such error has not been examined in detail; consequently, empirical studies that estimate rate variation among lineages have been unable to determine whether their results could be impacted by estimation error.<h4>Methodology/principal findings</h4>To evaluate the extent to which error in rate estimation could erroneously suggest rate variation among lineages, we examined rate variation estimated for datasets simulated under a molecular clock on trees with equal and variable branch lengths. Thus, any apparent rate variation in these datasets reflects error in rate estimation rather than true differences in the underlying substitution process. We observed substantial rate variation among lineages in our simulations; however, we did not observe rate variation when average substitution rates were compared between different clades.<h4>Conclusions/significance</h4>Our results confirm previous theoretical work suggesting that observations of among lineage rate variation in empirical data may be due to the stochastic substitution process and error in the estimation of substitution rates, rather than true differences in the underlying substitution process among lineages. However, conclusions regarding rate variation drawn from rates averaged across multiple branches are likely due to real, systematic variation in rates between groups.Rachel S SchwartzRachel Lockridge MuellerPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 5, Iss 3, p e9649 (2010)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Rachel S Schwartz
Rachel Lockridge Mueller
Variation in DNA substitution rates among lineages erroneously inferred from simulated clock-like data.
description <h4>Background</h4>The observation of variation in substitution rates among lineages has led to (1) a general rejection of the molecular clock model, and (2) the suggestion that a number of biological characteristics of organisms can cause rate variation. Accurate estimates of rate variation, and thus accurate inferences regarding the causes of rate variation, depend on accurate estimates of substitution rates. However, theory suggests that even when the substitution process is clock-like, variable numbers of substitutions can occur among lineages because the substitution process is stochastic. Furthermore, substitution rates along lineages can be misestimated, particularly when multiple substitutions occur at some sites. Although these potential causes of error in rate estimation are well understood in theory, such error has not been examined in detail; consequently, empirical studies that estimate rate variation among lineages have been unable to determine whether their results could be impacted by estimation error.<h4>Methodology/principal findings</h4>To evaluate the extent to which error in rate estimation could erroneously suggest rate variation among lineages, we examined rate variation estimated for datasets simulated under a molecular clock on trees with equal and variable branch lengths. Thus, any apparent rate variation in these datasets reflects error in rate estimation rather than true differences in the underlying substitution process. We observed substantial rate variation among lineages in our simulations; however, we did not observe rate variation when average substitution rates were compared between different clades.<h4>Conclusions/significance</h4>Our results confirm previous theoretical work suggesting that observations of among lineage rate variation in empirical data may be due to the stochastic substitution process and error in the estimation of substitution rates, rather than true differences in the underlying substitution process among lineages. However, conclusions regarding rate variation drawn from rates averaged across multiple branches are likely due to real, systematic variation in rates between groups.
format article
author Rachel S Schwartz
Rachel Lockridge Mueller
author_facet Rachel S Schwartz
Rachel Lockridge Mueller
author_sort Rachel S Schwartz
title Variation in DNA substitution rates among lineages erroneously inferred from simulated clock-like data.
title_short Variation in DNA substitution rates among lineages erroneously inferred from simulated clock-like data.
title_full Variation in DNA substitution rates among lineages erroneously inferred from simulated clock-like data.
title_fullStr Variation in DNA substitution rates among lineages erroneously inferred from simulated clock-like data.
title_full_unstemmed Variation in DNA substitution rates among lineages erroneously inferred from simulated clock-like data.
title_sort variation in dna substitution rates among lineages erroneously inferred from simulated clock-like data.
publisher Public Library of Science (PLoS)
publishDate 2010
url https://doaj.org/article/a3085f1c1b8047cc9383846339127880
work_keys_str_mv AT rachelsschwartz variationindnasubstitutionratesamonglineageserroneouslyinferredfromsimulatedclocklikedata
AT rachellockridgemueller variationindnasubstitutionratesamonglineageserroneouslyinferredfromsimulatedclocklikedata
_version_ 1718413760821985280