Phylogenetic detection of recombination with a Bayesian prior on the distance between trees.

Genomic regions participating in recombination events may support distinct topologies, and phylogenetic analyses should incorporate this heterogeneity. Existing phylogenetic methods for recombination detection are challenged by the enormous number of possible topologies, even for a moderate number o...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Leonardo de Oliveira Martins, Elcio Leal, Hirohisa Kishino
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2008
Materias:
R
Q
Acceso en línea:https://doaj.org/article/7f0244832be94742acc3719600b2151d
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:7f0244832be94742acc3719600b2151d
record_format dspace
spelling oai:doaj.org-article:7f0244832be94742acc3719600b2151d2021-11-25T06:11:42ZPhylogenetic detection of recombination with a Bayesian prior on the distance between trees.1932-620310.1371/journal.pone.0002651https://doaj.org/article/7f0244832be94742acc3719600b2151d2008-07-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/18612422/pdf/?tool=EBIhttps://doaj.org/toc/1932-6203Genomic regions participating in recombination events may support distinct topologies, and phylogenetic analyses should incorporate this heterogeneity. Existing phylogenetic methods for recombination detection are challenged by the enormous number of possible topologies, even for a moderate number of taxa. If, however, the detection analysis is conducted independently between each putative recombinant sequence and a set of reference parentals, potential recombinations between the recombinants are neglected. In this context, a recombination hotspot can be inferred in phylogenetic analyses if we observe several consecutive breakpoints. We developed a distance measure between unrooted topologies that closely resembles the number of recombinations. By introducing a prior distribution on these recombination distances, a Bayesian hierarchical model was devised to detect phylogenetic inconsistencies occurring due to recombinations. This model relaxes the assumption of known parental sequences, still common in HIV analysis, allowing the entire dataset to be analyzed at once. On simulated datasets with up to 16 taxa, our method correctly detected recombination breakpoints and the number of recombination events for each breakpoint. The procedure is robust to rate and transitionratiotransversion heterogeneities for simulations with and without recombination. This recombination distance is related to recombination hotspots. Applying this procedure to a genomic HIV-1 dataset, we found evidence for hotspots and de novo recombination.Leonardo de Oliveira MartinsElcio LealHirohisa KishinoPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 3, Iss 7, p e2651 (2008)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Leonardo de Oliveira Martins
Elcio Leal
Hirohisa Kishino
Phylogenetic detection of recombination with a Bayesian prior on the distance between trees.
description Genomic regions participating in recombination events may support distinct topologies, and phylogenetic analyses should incorporate this heterogeneity. Existing phylogenetic methods for recombination detection are challenged by the enormous number of possible topologies, even for a moderate number of taxa. If, however, the detection analysis is conducted independently between each putative recombinant sequence and a set of reference parentals, potential recombinations between the recombinants are neglected. In this context, a recombination hotspot can be inferred in phylogenetic analyses if we observe several consecutive breakpoints. We developed a distance measure between unrooted topologies that closely resembles the number of recombinations. By introducing a prior distribution on these recombination distances, a Bayesian hierarchical model was devised to detect phylogenetic inconsistencies occurring due to recombinations. This model relaxes the assumption of known parental sequences, still common in HIV analysis, allowing the entire dataset to be analyzed at once. On simulated datasets with up to 16 taxa, our method correctly detected recombination breakpoints and the number of recombination events for each breakpoint. The procedure is robust to rate and transitionratiotransversion heterogeneities for simulations with and without recombination. This recombination distance is related to recombination hotspots. Applying this procedure to a genomic HIV-1 dataset, we found evidence for hotspots and de novo recombination.
format article
author Leonardo de Oliveira Martins
Elcio Leal
Hirohisa Kishino
author_facet Leonardo de Oliveira Martins
Elcio Leal
Hirohisa Kishino
author_sort Leonardo de Oliveira Martins
title Phylogenetic detection of recombination with a Bayesian prior on the distance between trees.
title_short Phylogenetic detection of recombination with a Bayesian prior on the distance between trees.
title_full Phylogenetic detection of recombination with a Bayesian prior on the distance between trees.
title_fullStr Phylogenetic detection of recombination with a Bayesian prior on the distance between trees.
title_full_unstemmed Phylogenetic detection of recombination with a Bayesian prior on the distance between trees.
title_sort phylogenetic detection of recombination with a bayesian prior on the distance between trees.
publisher Public Library of Science (PLoS)
publishDate 2008
url https://doaj.org/article/7f0244832be94742acc3719600b2151d
work_keys_str_mv AT leonardodeoliveiramartins phylogeneticdetectionofrecombinationwithabayesianprioronthedistancebetweentrees
AT elcioleal phylogeneticdetectionofrecombinationwithabayesianprioronthedistancebetweentrees
AT hirohisakishino phylogeneticdetectionofrecombinationwithabayesianprioronthedistancebetweentrees
_version_ 1718414052060823552