Trellis for efficient data and task management in the VA Million Veteran Program
Abstract Biomedical studies have become larger in size and yielded large quantities of data, yet efficient data processing remains a challenge. Here we present Trellis, a cloud-based data and task management framework that completely automates the process from data ingestion to result presentation,...
Guardado en:
Autores principales: | , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Nature Portfolio
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/977e68558af542b39e3e97a8f93a32b7 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:977e68558af542b39e3e97a8f93a32b7 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:977e68558af542b39e3e97a8f93a32b72021-12-05T12:12:11ZTrellis for efficient data and task management in the VA Million Veteran Program10.1038/s41598-021-02569-52045-2322https://doaj.org/article/977e68558af542b39e3e97a8f93a32b72021-12-01T00:00:00Zhttps://doi.org/10.1038/s41598-021-02569-5https://doaj.org/toc/2045-2322Abstract Biomedical studies have become larger in size and yielded large quantities of data, yet efficient data processing remains a challenge. Here we present Trellis, a cloud-based data and task management framework that completely automates the process from data ingestion to result presentation, while tracking data lineage, facilitating information query, and supporting fault-tolerance and scalability. Using a graph database to coordinate the state of the data processing workflows and a scalable microservice architecture to perform bioinformatics tasks, Trellis has enabled efficient variant calling on 100,000 human genomes collected in the VA Million Veteran Program.Paul Billing RossJina SongPhilip S. TsaoCuiping PanNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 11, Iss 1, Pp 1-12 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Medicine R Science Q |
spellingShingle |
Medicine R Science Q Paul Billing Ross Jina Song Philip S. Tsao Cuiping Pan Trellis for efficient data and task management in the VA Million Veteran Program |
description |
Abstract Biomedical studies have become larger in size and yielded large quantities of data, yet efficient data processing remains a challenge. Here we present Trellis, a cloud-based data and task management framework that completely automates the process from data ingestion to result presentation, while tracking data lineage, facilitating information query, and supporting fault-tolerance and scalability. Using a graph database to coordinate the state of the data processing workflows and a scalable microservice architecture to perform bioinformatics tasks, Trellis has enabled efficient variant calling on 100,000 human genomes collected in the VA Million Veteran Program. |
format |
article |
author |
Paul Billing Ross Jina Song Philip S. Tsao Cuiping Pan |
author_facet |
Paul Billing Ross Jina Song Philip S. Tsao Cuiping Pan |
author_sort |
Paul Billing Ross |
title |
Trellis for efficient data and task management in the VA Million Veteran Program |
title_short |
Trellis for efficient data and task management in the VA Million Veteran Program |
title_full |
Trellis for efficient data and task management in the VA Million Veteran Program |
title_fullStr |
Trellis for efficient data and task management in the VA Million Veteran Program |
title_full_unstemmed |
Trellis for efficient data and task management in the VA Million Veteran Program |
title_sort |
trellis for efficient data and task management in the va million veteran program |
publisher |
Nature Portfolio |
publishDate |
2021 |
url |
https://doaj.org/article/977e68558af542b39e3e97a8f93a32b7 |
work_keys_str_mv |
AT paulbillingross trellisforefficientdataandtaskmanagementinthevamillionveteranprogram AT jinasong trellisforefficientdataandtaskmanagementinthevamillionveteranprogram AT philipstsao trellisforefficientdataandtaskmanagementinthevamillionveteranprogram AT cuipingpan trellisforefficientdataandtaskmanagementinthevamillionveteranprogram |
_version_ |
1718372145985224704 |