MrBayes tgMC³: a tight GPU implementation of MrBayes.
MrBayes is model-based phylogenetic inference tool using Bayesian statistics. However, model-based assessment of phylogenetic trees adds to the computational burden of tree-searching, and so poses significant computational challenges. Graphics Processing Units (GPUs) have been proposed as high perfo...
Guardado en:
Autores principales: | , , , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Public Library of Science (PLoS)
2013
|
Materias: | |
Acceso en línea: | https://doaj.org/article/611d83443fe24bd9ab2c92e695121f1b |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:611d83443fe24bd9ab2c92e695121f1b |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:611d83443fe24bd9ab2c92e695121f1b2021-11-18T07:49:55ZMrBayes tgMC³: a tight GPU implementation of MrBayes.1932-620310.1371/journal.pone.0060667https://doaj.org/article/611d83443fe24bd9ab2c92e695121f1b2013-01-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23593277/?tool=EBIhttps://doaj.org/toc/1932-6203MrBayes is model-based phylogenetic inference tool using Bayesian statistics. However, model-based assessment of phylogenetic trees adds to the computational burden of tree-searching, and so poses significant computational challenges. Graphics Processing Units (GPUs) have been proposed as high performance, low cost acceleration platforms and several parallelized versions of the Metropolis Coupled Markov Chain Mote Carlo (MC(3)) algorithm in MrBayes have been presented that can run on GPUs. However, some bottlenecks decrease the efficiency of these implementations. To address these bottlenecks, we propose a tight GPU MC(3) (tgMC(3)) algorithm. tgMC(3) implements a different architecture from the one-to-one acceleration architecture employed in previously proposed methods. It merges multiply discrete GPU kernels according to the data dependency and hence decreases the number of kernels launched and the complexity of data transfer. We implemented tgMC(3) and made performance comparisons with an earlier proposed algorithm, nMC(3), and also with MrBayes MC(3) under serial and multiply concurrent CPU processes. All of the methods were benchmarked on the same computing node from DEGIMA. Experiments indicate that the tgMC(3) method outstrips nMC(3) (v1.0) with speedup factors from 2.1 to 2.7×. In addition, tgMC(3) outperforms the serial MrBayes MC(3) by a factor of 6 to 30× when using a single GTX480 card, whereas a speedup factor of around 51× can be achieved by using two GTX 480 cards on relatively long sequences. Moreover, tgMC(3) was compared with MrBayes accelerated by BEAGLE, and achieved speedup factors from 3.7 to 5.7×. The reported performance improvement of tgMC(3) is significant and appears to scale well with increasing dataset sizes. In addition, the strategy proposed in tgMC(3) could benefit the acceleration of other Bayesian-based phylogenetic analysis methods using GPUs.Cheng LingTsuyoshi HamadaJianing BaiXianbin LiDouglas ChestersWeimin ZhengWeifeng ShiPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 8, Iss 4, p e60667 (2013) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Medicine R Science Q |
spellingShingle |
Medicine R Science Q Cheng Ling Tsuyoshi Hamada Jianing Bai Xianbin Li Douglas Chesters Weimin Zheng Weifeng Shi MrBayes tgMC³: a tight GPU implementation of MrBayes. |
description |
MrBayes is model-based phylogenetic inference tool using Bayesian statistics. However, model-based assessment of phylogenetic trees adds to the computational burden of tree-searching, and so poses significant computational challenges. Graphics Processing Units (GPUs) have been proposed as high performance, low cost acceleration platforms and several parallelized versions of the Metropolis Coupled Markov Chain Mote Carlo (MC(3)) algorithm in MrBayes have been presented that can run on GPUs. However, some bottlenecks decrease the efficiency of these implementations. To address these bottlenecks, we propose a tight GPU MC(3) (tgMC(3)) algorithm. tgMC(3) implements a different architecture from the one-to-one acceleration architecture employed in previously proposed methods. It merges multiply discrete GPU kernels according to the data dependency and hence decreases the number of kernels launched and the complexity of data transfer. We implemented tgMC(3) and made performance comparisons with an earlier proposed algorithm, nMC(3), and also with MrBayes MC(3) under serial and multiply concurrent CPU processes. All of the methods were benchmarked on the same computing node from DEGIMA. Experiments indicate that the tgMC(3) method outstrips nMC(3) (v1.0) with speedup factors from 2.1 to 2.7×. In addition, tgMC(3) outperforms the serial MrBayes MC(3) by a factor of 6 to 30× when using a single GTX480 card, whereas a speedup factor of around 51× can be achieved by using two GTX 480 cards on relatively long sequences. Moreover, tgMC(3) was compared with MrBayes accelerated by BEAGLE, and achieved speedup factors from 3.7 to 5.7×. The reported performance improvement of tgMC(3) is significant and appears to scale well with increasing dataset sizes. In addition, the strategy proposed in tgMC(3) could benefit the acceleration of other Bayesian-based phylogenetic analysis methods using GPUs. |
format |
article |
author |
Cheng Ling Tsuyoshi Hamada Jianing Bai Xianbin Li Douglas Chesters Weimin Zheng Weifeng Shi |
author_facet |
Cheng Ling Tsuyoshi Hamada Jianing Bai Xianbin Li Douglas Chesters Weimin Zheng Weifeng Shi |
author_sort |
Cheng Ling |
title |
MrBayes tgMC³: a tight GPU implementation of MrBayes. |
title_short |
MrBayes tgMC³: a tight GPU implementation of MrBayes. |
title_full |
MrBayes tgMC³: a tight GPU implementation of MrBayes. |
title_fullStr |
MrBayes tgMC³: a tight GPU implementation of MrBayes. |
title_full_unstemmed |
MrBayes tgMC³: a tight GPU implementation of MrBayes. |
title_sort |
mrbayes tgmc³: a tight gpu implementation of mrbayes. |
publisher |
Public Library of Science (PLoS) |
publishDate |
2013 |
url |
https://doaj.org/article/611d83443fe24bd9ab2c92e695121f1b |
work_keys_str_mv |
AT chengling mrbayestgmc3atightgpuimplementationofmrbayes AT tsuyoshihamada mrbayestgmc3atightgpuimplementationofmrbayes AT jianingbai mrbayestgmc3atightgpuimplementationofmrbayes AT xianbinli mrbayestgmc3atightgpuimplementationofmrbayes AT douglaschesters mrbayestgmc3atightgpuimplementationofmrbayes AT weiminzheng mrbayestgmc3atightgpuimplementationofmrbayes AT weifengshi mrbayestgmc3atightgpuimplementationofmrbayes |
_version_ |
1718422913967718400 |