Perspectives on automated composition of workflows in the life sciences [version 1; peer review: 2 approved]

Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and impleme...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Anna-Lena Lamprecht, Magnus Palmblad, Jon Ison, Veit Schwämmle, Mohammad Sadnan Al Manir, Ilkay Altintas, Christopher J. O. Baker, Ammar Ben Hadj Amor, Salvador Capella-Gutierrez, Paulos Charonyktakis, Michael R. Crusoe, Yolanda Gil, Carole Goble, Timothy J. Griffin, Paul Groth, Hans Ienasescu, Pratik Jagtap, Matúš Kalaš, Vedran Kasalica, Alireza Khanteymoori, Tobias Kuhn, Hailiang Mei, Hervé Ménager, Steffen Möller, Robin A. Richardson, Vincent Robert, Stian Soiland-Reyes, Robert Stevens, Szoke Szaniszlo, Suzan Verberne, Aswin Verhoeven, Katherine Wolstencroft
Formato: article
Lenguaje:EN
Publicado: F1000 Research Ltd 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/e9b280176ab24646b8b7d023b327e100
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:e9b280176ab24646b8b7d023b327e100
record_format dspace
spelling oai:doaj.org-article:e9b280176ab24646b8b7d023b327e1002021-11-08T10:53:47ZPerspectives on automated composition of workflows in the life sciences [version 1; peer review: 2 approved]2046-140210.12688/f1000research.54159.1https://doaj.org/article/e9b280176ab24646b8b7d023b327e1002021-09-01T00:00:00Zhttps://f1000research.com/articles/10-897/v1https://doaj.org/toc/2046-1402Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and implementation. Recent technological advances have returned the long-standing vision of automated workflow composition into focus. This article summarizes a recent Lorentz Center workshop dedicated to automated composition of workflows in the life sciences. We survey previous initiatives to automate the composition process, and discuss the current state of the art and future perspectives. We start by drawing the “big picture” of the scientific workflow development life cycle, before surveying and discussing current methods, technologies and practices for semantic domain modelling, automation in workflow development, and workflow assessment. Finally, we derive a roadmap of individual and community-based actions to work toward the vision of automated workflow development in the forthcoming years. A central outcome of the workshop is a general description of the workflow life cycle in six stages: 1) scientific question or hypothesis, 2) conceptual workflow, 3) abstract workflow, 4) concrete workflow, 5) production workflow, and 6) scientific results. The transitions between stages are facilitated by diverse tools and methods, usually incorporating domain knowledge in some form. Formal semantic domain modelling is hard and often a bottleneck for the application of semantic technologies. However, life science communities have made considerable progress here in recent years and are continuously improving, renewing interest in the application of semantic technologies for workflow exploration, composition and instantiation. Combined with systematic benchmarking with reference data and large-scale deployment of production-stage workflows, such technologies enable a more systematic process of workflow development than we know today. We believe that this can lead to more robust, reusable, and sustainable workflows in the future.Anna-Lena LamprechtMagnus PalmbladJon IsonVeit SchwämmleMohammad Sadnan Al ManirIlkay AltintasChristopher J. O. BakerAmmar Ben Hadj AmorSalvador Capella-GutierrezPaulos CharonyktakisMichael R. CrusoeYolanda GilCarole GobleTimothy J. GriffinPaul GrothHans IenasescuPratik JagtapMatúš KalašVedran KasalicaAlireza KhanteymooriTobias KuhnHailiang MeiHervé MénagerSteffen MöllerRobin A. RichardsonVincent RobertStian Soiland-ReyesRobert StevensSzoke SzaniszloSuzan VerberneAswin VerhoevenKatherine WolstencroftF1000 Research LtdarticleMedicineRScienceQENF1000Research, Vol 10 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Anna-Lena Lamprecht
Magnus Palmblad
Jon Ison
Veit Schwämmle
Mohammad Sadnan Al Manir
Ilkay Altintas
Christopher J. O. Baker
Ammar Ben Hadj Amor
Salvador Capella-Gutierrez
Paulos Charonyktakis
Michael R. Crusoe
Yolanda Gil
Carole Goble
Timothy J. Griffin
Paul Groth
Hans Ienasescu
Pratik Jagtap
Matúš Kalaš
Vedran Kasalica
Alireza Khanteymoori
Tobias Kuhn
Hailiang Mei
Hervé Ménager
Steffen Möller
Robin A. Richardson
Vincent Robert
Stian Soiland-Reyes
Robert Stevens
Szoke Szaniszlo
Suzan Verberne
Aswin Verhoeven
Katherine Wolstencroft
Perspectives on automated composition of workflows in the life sciences [version 1; peer review: 2 approved]
description Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and implementation. Recent technological advances have returned the long-standing vision of automated workflow composition into focus. This article summarizes a recent Lorentz Center workshop dedicated to automated composition of workflows in the life sciences. We survey previous initiatives to automate the composition process, and discuss the current state of the art and future perspectives. We start by drawing the “big picture” of the scientific workflow development life cycle, before surveying and discussing current methods, technologies and practices for semantic domain modelling, automation in workflow development, and workflow assessment. Finally, we derive a roadmap of individual and community-based actions to work toward the vision of automated workflow development in the forthcoming years. A central outcome of the workshop is a general description of the workflow life cycle in six stages: 1) scientific question or hypothesis, 2) conceptual workflow, 3) abstract workflow, 4) concrete workflow, 5) production workflow, and 6) scientific results. The transitions between stages are facilitated by diverse tools and methods, usually incorporating domain knowledge in some form. Formal semantic domain modelling is hard and often a bottleneck for the application of semantic technologies. However, life science communities have made considerable progress here in recent years and are continuously improving, renewing interest in the application of semantic technologies for workflow exploration, composition and instantiation. Combined with systematic benchmarking with reference data and large-scale deployment of production-stage workflows, such technologies enable a more systematic process of workflow development than we know today. We believe that this can lead to more robust, reusable, and sustainable workflows in the future.
format article
author Anna-Lena Lamprecht
Magnus Palmblad
Jon Ison
Veit Schwämmle
Mohammad Sadnan Al Manir
Ilkay Altintas
Christopher J. O. Baker
Ammar Ben Hadj Amor
Salvador Capella-Gutierrez
Paulos Charonyktakis
Michael R. Crusoe
Yolanda Gil
Carole Goble
Timothy J. Griffin
Paul Groth
Hans Ienasescu
Pratik Jagtap
Matúš Kalaš
Vedran Kasalica
Alireza Khanteymoori
Tobias Kuhn
Hailiang Mei
Hervé Ménager
Steffen Möller
Robin A. Richardson
Vincent Robert
Stian Soiland-Reyes
Robert Stevens
Szoke Szaniszlo
Suzan Verberne
Aswin Verhoeven
Katherine Wolstencroft
author_facet Anna-Lena Lamprecht
Magnus Palmblad
Jon Ison
Veit Schwämmle
Mohammad Sadnan Al Manir
Ilkay Altintas
Christopher J. O. Baker
Ammar Ben Hadj Amor
Salvador Capella-Gutierrez
Paulos Charonyktakis
Michael R. Crusoe
Yolanda Gil
Carole Goble
Timothy J. Griffin
Paul Groth
Hans Ienasescu
Pratik Jagtap
Matúš Kalaš
Vedran Kasalica
Alireza Khanteymoori
Tobias Kuhn
Hailiang Mei
Hervé Ménager
Steffen Möller
Robin A. Richardson
Vincent Robert
Stian Soiland-Reyes
Robert Stevens
Szoke Szaniszlo
Suzan Verberne
Aswin Verhoeven
Katherine Wolstencroft
author_sort Anna-Lena Lamprecht
title Perspectives on automated composition of workflows in the life sciences [version 1; peer review: 2 approved]
title_short Perspectives on automated composition of workflows in the life sciences [version 1; peer review: 2 approved]
title_full Perspectives on automated composition of workflows in the life sciences [version 1; peer review: 2 approved]
title_fullStr Perspectives on automated composition of workflows in the life sciences [version 1; peer review: 2 approved]
title_full_unstemmed Perspectives on automated composition of workflows in the life sciences [version 1; peer review: 2 approved]
title_sort perspectives on automated composition of workflows in the life sciences [version 1; peer review: 2 approved]
publisher F1000 Research Ltd
publishDate 2021
url https://doaj.org/article/e9b280176ab24646b8b7d023b327e100
work_keys_str_mv AT annalenalamprecht perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT magnuspalmblad perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT jonison perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT veitschwammle perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT mohammadsadnanalmanir perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT ilkayaltintas perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT christopherjobaker perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT ammarbenhadjamor perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT salvadorcapellagutierrez perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT pauloscharonyktakis perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT michaelrcrusoe perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT yolandagil perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT carolegoble perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT timothyjgriffin perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT paulgroth perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT hansienasescu perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT pratikjagtap perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT matuskalas perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT vedrankasalica perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT alirezakhanteymoori perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT tobiaskuhn perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT hailiangmei perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT hervemenager perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT steffenmoller perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT robinarichardson perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT vincentrobert perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT stiansoilandreyes perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT robertstevens perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT szokeszaniszlo perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT suzanverberne perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT aswinverhoeven perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
AT katherinewolstencroft perspectivesonautomatedcompositionofworkflowsinthelifesciencesversion1peerreview2approved
_version_ 1718442576845996032