Biological data annotation via a human-augmenting AI-based labeling system
Abstract Biology has become a prime area for the deployment of deep learning and artificial intelligence (AI), enabled largely by the massive data sets that the field can generate. Key to most AI tasks is the availability of a sufficiently large, labeled data set with which to train AI models. In th...
Guardado en:
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Nature Portfolio
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/d6164252a0d94935879eacf682d4bdaf |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:d6164252a0d94935879eacf682d4bdaf |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:d6164252a0d94935879eacf682d4bdaf2021-12-02T17:13:14ZBiological data annotation via a human-augmenting AI-based labeling system10.1038/s41746-021-00520-62398-6352https://doaj.org/article/d6164252a0d94935879eacf682d4bdaf2021-10-01T00:00:00Zhttps://doi.org/10.1038/s41746-021-00520-6https://doaj.org/toc/2398-6352Abstract Biology has become a prime area for the deployment of deep learning and artificial intelligence (AI), enabled largely by the massive data sets that the field can generate. Key to most AI tasks is the availability of a sufficiently large, labeled data set with which to train AI models. In the context of microscopy, it is easy to generate image data sets containing millions of cells and structures. However, it is challenging to obtain large-scale high-quality annotations for AI models. Here, we present HALS (Human-Augmenting Labeling System), a human-in-the-loop data labeling AI, which begins uninitialized and learns annotations from a human, in real-time. Using a multi-part AI composed of three deep learning models, HALS learns from just a few examples and immediately decreases the workload of the annotator, while increasing the quality of their annotations. Using a highly repetitive use-case—annotating cell types—and running experiments with seven pathologists—experts at the microscopic analysis of biological specimens—we demonstrate a manual work reduction of 90.60%, and an average data-quality boost of 4.34%, measured across four use-cases and two tissue stain types.Douwe van der WalIny JhunIsraa LakloukJeff NirschlLara RicherRebecca RojanskyTalent ThepareeJoshua WheelerJörg SanderFelix FengOsama MohamadSilvio SavareseRichard SocherAndre EstevaNature PortfolioarticleComputer applications to medicine. Medical informaticsR858-859.7ENnpj Digital Medicine, Vol 4, Iss 1, Pp 1-7 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Computer applications to medicine. Medical informatics R858-859.7 |
spellingShingle |
Computer applications to medicine. Medical informatics R858-859.7 Douwe van der Wal Iny Jhun Israa Laklouk Jeff Nirschl Lara Richer Rebecca Rojansky Talent Theparee Joshua Wheeler Jörg Sander Felix Feng Osama Mohamad Silvio Savarese Richard Socher Andre Esteva Biological data annotation via a human-augmenting AI-based labeling system |
description |
Abstract Biology has become a prime area for the deployment of deep learning and artificial intelligence (AI), enabled largely by the massive data sets that the field can generate. Key to most AI tasks is the availability of a sufficiently large, labeled data set with which to train AI models. In the context of microscopy, it is easy to generate image data sets containing millions of cells and structures. However, it is challenging to obtain large-scale high-quality annotations for AI models. Here, we present HALS (Human-Augmenting Labeling System), a human-in-the-loop data labeling AI, which begins uninitialized and learns annotations from a human, in real-time. Using a multi-part AI composed of three deep learning models, HALS learns from just a few examples and immediately decreases the workload of the annotator, while increasing the quality of their annotations. Using a highly repetitive use-case—annotating cell types—and running experiments with seven pathologists—experts at the microscopic analysis of biological specimens—we demonstrate a manual work reduction of 90.60%, and an average data-quality boost of 4.34%, measured across four use-cases and two tissue stain types. |
format |
article |
author |
Douwe van der Wal Iny Jhun Israa Laklouk Jeff Nirschl Lara Richer Rebecca Rojansky Talent Theparee Joshua Wheeler Jörg Sander Felix Feng Osama Mohamad Silvio Savarese Richard Socher Andre Esteva |
author_facet |
Douwe van der Wal Iny Jhun Israa Laklouk Jeff Nirschl Lara Richer Rebecca Rojansky Talent Theparee Joshua Wheeler Jörg Sander Felix Feng Osama Mohamad Silvio Savarese Richard Socher Andre Esteva |
author_sort |
Douwe van der Wal |
title |
Biological data annotation via a human-augmenting AI-based labeling system |
title_short |
Biological data annotation via a human-augmenting AI-based labeling system |
title_full |
Biological data annotation via a human-augmenting AI-based labeling system |
title_fullStr |
Biological data annotation via a human-augmenting AI-based labeling system |
title_full_unstemmed |
Biological data annotation via a human-augmenting AI-based labeling system |
title_sort |
biological data annotation via a human-augmenting ai-based labeling system |
publisher |
Nature Portfolio |
publishDate |
2021 |
url |
https://doaj.org/article/d6164252a0d94935879eacf682d4bdaf |
work_keys_str_mv |
AT douwevanderwal biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT inyjhun biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT israalaklouk biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT jeffnirschl biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT lararicher biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT rebeccarojansky biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT talenttheparee biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT joshuawheeler biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT jorgsander biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT felixfeng biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT osamamohamad biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT silviosavarese biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT richardsocher biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem AT andreesteva biologicaldataannotationviaahumanaugmentingaibasedlabelingsystem |
_version_ |
1718381409325809664 |