Trellis for efficient data and task management in the VA Million Veteran Program

Abstract Biomedical studies have become larger in size and yielded large quantities of data, yet efficient data processing remains a challenge. Here we present Trellis, a cloud-based data and task management framework that completely automates the process from data ingestion to result presentation,...

Full description

Saved in:
Bibliographic Details
Main Authors: Paul Billing Ross, Jina Song, Philip S. Tsao, Cuiping Pan
Format: article
Language:EN
Published: Nature Portfolio 2021
Subjects:
R
Q
Online Access:https://doaj.org/article/977e68558af542b39e3e97a8f93a32b7
Tags: Add Tag
No Tags, Be the first to tag this record!
id oai:doaj.org-article:977e68558af542b39e3e97a8f93a32b7
record_format dspace
spelling oai:doaj.org-article:977e68558af542b39e3e97a8f93a32b72021-12-05T12:12:11ZTrellis for efficient data and task management in the VA Million Veteran Program10.1038/s41598-021-02569-52045-2322https://doaj.org/article/977e68558af542b39e3e97a8f93a32b72021-12-01T00:00:00Zhttps://doi.org/10.1038/s41598-021-02569-5https://doaj.org/toc/2045-2322Abstract Biomedical studies have become larger in size and yielded large quantities of data, yet efficient data processing remains a challenge. Here we present Trellis, a cloud-based data and task management framework that completely automates the process from data ingestion to result presentation, while tracking data lineage, facilitating information query, and supporting fault-tolerance and scalability. Using a graph database to coordinate the state of the data processing workflows and a scalable microservice architecture to perform bioinformatics tasks, Trellis has enabled efficient variant calling on 100,000 human genomes collected in the VA Million Veteran Program.Paul Billing RossJina SongPhilip S. TsaoCuiping PanNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 11, Iss 1, Pp 1-12 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Paul Billing Ross
Jina Song
Philip S. Tsao
Cuiping Pan
Trellis for efficient data and task management in the VA Million Veteran Program
description Abstract Biomedical studies have become larger in size and yielded large quantities of data, yet efficient data processing remains a challenge. Here we present Trellis, a cloud-based data and task management framework that completely automates the process from data ingestion to result presentation, while tracking data lineage, facilitating information query, and supporting fault-tolerance and scalability. Using a graph database to coordinate the state of the data processing workflows and a scalable microservice architecture to perform bioinformatics tasks, Trellis has enabled efficient variant calling on 100,000 human genomes collected in the VA Million Veteran Program.
format article
author Paul Billing Ross
Jina Song
Philip S. Tsao
Cuiping Pan
author_facet Paul Billing Ross
Jina Song
Philip S. Tsao
Cuiping Pan
author_sort Paul Billing Ross
title Trellis for efficient data and task management in the VA Million Veteran Program
title_short Trellis for efficient data and task management in the VA Million Veteran Program
title_full Trellis for efficient data and task management in the VA Million Veteran Program
title_fullStr Trellis for efficient data and task management in the VA Million Veteran Program
title_full_unstemmed Trellis for efficient data and task management in the VA Million Veteran Program
title_sort trellis for efficient data and task management in the va million veteran program
publisher Nature Portfolio
publishDate 2021
url https://doaj.org/article/977e68558af542b39e3e97a8f93a32b7
work_keys_str_mv AT paulbillingross trellisforefficientdataandtaskmanagementinthevamillionveteranprogram
AT jinasong trellisforefficientdataandtaskmanagementinthevamillionveteranprogram
AT philipstsao trellisforefficientdataandtaskmanagementinthevamillionveteranprogram
AT cuipingpan trellisforefficientdataandtaskmanagementinthevamillionveteranprogram
_version_ 1718372145985224704