Harnessing clinical annotations to improve deep learning performance in prostate segmentation.

<h4>Purpose</h4>Developing large-scale datasets with research-quality annotations is challenging due to the high cost of refining clinically generated markup into high precision annotations. We evaluated the direct use of a large dataset with only clinically generated annotations in deve...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Karthik V Sarma, Alex G Raman, Nikhil J Dhinagar, Alan M Priester, Stephanie Harmon, Thomas Sanford, Sherif Mehralivand, Baris Turkbey, Leonard S Marks, Steven S Raman, William Speier, Corey W Arnold
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/614ca6a1ac7743129fe5e2c5dec4ac65
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:614ca6a1ac7743129fe5e2c5dec4ac65
record_format dspace
spelling oai:doaj.org-article:614ca6a1ac7743129fe5e2c5dec4ac652021-12-02T20:09:57ZHarnessing clinical annotations to improve deep learning performance in prostate segmentation.1932-620310.1371/journal.pone.0253829https://doaj.org/article/614ca6a1ac7743129fe5e2c5dec4ac652021-01-01T00:00:00Zhttps://doi.org/10.1371/journal.pone.0253829https://doaj.org/toc/1932-6203<h4>Purpose</h4>Developing large-scale datasets with research-quality annotations is challenging due to the high cost of refining clinically generated markup into high precision annotations. We evaluated the direct use of a large dataset with only clinically generated annotations in development of high-performance segmentation models for small research-quality challenge datasets.<h4>Materials and methods</h4>We used a large retrospective dataset from our institution comprised of 1,620 clinically generated segmentations, and two challenge datasets (PROMISE12: 50 patients, ProstateX-2: 99 patients). We trained a 3D U-Net convolutional neural network (CNN) segmentation model using our entire dataset, and used that model as a template to train models on the challenge datasets. We also trained versions of the template model using ablated proportions of our dataset, and evaluated the relative benefit of those templates for the final models. Finally, we trained a version of the template model using an out-of-domain brain cancer dataset, and evaluated the relevant benefit of that template for the final models. We used five-fold cross-validation (CV) for all training and evaluation across our entire dataset.<h4>Results</h4>Our model achieves state-of-the-art performance on our large dataset (mean overall Dice 0.916, average Hausdorff distance 0.135 across CV folds). Using this model as a pre-trained template for refining on two external datasets significantly enhanced performance (30% and 49% enhancement in Dice scores respectively). Mean overall Dice and mean average Hausdorff distance were 0.912 and 0.15 for the ProstateX-2 dataset, and 0.852 and 0.581 for the PROMISE12 dataset. Using even small quantities of data to train the template enhanced performance, with significant improvements using 5% or more of the data.<h4>Conclusion</h4>We trained a state-of-the-art model using unrefined clinical prostate annotations and found that its use as a template model significantly improved performance in other prostate segmentation tasks, even when trained with only 5% of the original dataset.Karthik V SarmaAlex G RamanNikhil J DhinagarAlan M PriesterStephanie HarmonThomas SanfordSherif MehralivandBaris TurkbeyLeonard S MarksSteven S RamanWilliam SpeierCorey W ArnoldPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 16, Iss 6, p e0253829 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Karthik V Sarma
Alex G Raman
Nikhil J Dhinagar
Alan M Priester
Stephanie Harmon
Thomas Sanford
Sherif Mehralivand
Baris Turkbey
Leonard S Marks
Steven S Raman
William Speier
Corey W Arnold
Harnessing clinical annotations to improve deep learning performance in prostate segmentation.
description <h4>Purpose</h4>Developing large-scale datasets with research-quality annotations is challenging due to the high cost of refining clinically generated markup into high precision annotations. We evaluated the direct use of a large dataset with only clinically generated annotations in development of high-performance segmentation models for small research-quality challenge datasets.<h4>Materials and methods</h4>We used a large retrospective dataset from our institution comprised of 1,620 clinically generated segmentations, and two challenge datasets (PROMISE12: 50 patients, ProstateX-2: 99 patients). We trained a 3D U-Net convolutional neural network (CNN) segmentation model using our entire dataset, and used that model as a template to train models on the challenge datasets. We also trained versions of the template model using ablated proportions of our dataset, and evaluated the relative benefit of those templates for the final models. Finally, we trained a version of the template model using an out-of-domain brain cancer dataset, and evaluated the relevant benefit of that template for the final models. We used five-fold cross-validation (CV) for all training and evaluation across our entire dataset.<h4>Results</h4>Our model achieves state-of-the-art performance on our large dataset (mean overall Dice 0.916, average Hausdorff distance 0.135 across CV folds). Using this model as a pre-trained template for refining on two external datasets significantly enhanced performance (30% and 49% enhancement in Dice scores respectively). Mean overall Dice and mean average Hausdorff distance were 0.912 and 0.15 for the ProstateX-2 dataset, and 0.852 and 0.581 for the PROMISE12 dataset. Using even small quantities of data to train the template enhanced performance, with significant improvements using 5% or more of the data.<h4>Conclusion</h4>We trained a state-of-the-art model using unrefined clinical prostate annotations and found that its use as a template model significantly improved performance in other prostate segmentation tasks, even when trained with only 5% of the original dataset.
format article
author Karthik V Sarma
Alex G Raman
Nikhil J Dhinagar
Alan M Priester
Stephanie Harmon
Thomas Sanford
Sherif Mehralivand
Baris Turkbey
Leonard S Marks
Steven S Raman
William Speier
Corey W Arnold
author_facet Karthik V Sarma
Alex G Raman
Nikhil J Dhinagar
Alan M Priester
Stephanie Harmon
Thomas Sanford
Sherif Mehralivand
Baris Turkbey
Leonard S Marks
Steven S Raman
William Speier
Corey W Arnold
author_sort Karthik V Sarma
title Harnessing clinical annotations to improve deep learning performance in prostate segmentation.
title_short Harnessing clinical annotations to improve deep learning performance in prostate segmentation.
title_full Harnessing clinical annotations to improve deep learning performance in prostate segmentation.
title_fullStr Harnessing clinical annotations to improve deep learning performance in prostate segmentation.
title_full_unstemmed Harnessing clinical annotations to improve deep learning performance in prostate segmentation.
title_sort harnessing clinical annotations to improve deep learning performance in prostate segmentation.
publisher Public Library of Science (PLoS)
publishDate 2021
url https://doaj.org/article/614ca6a1ac7743129fe5e2c5dec4ac65
work_keys_str_mv AT karthikvsarma harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation
AT alexgraman harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation
AT nikhiljdhinagar harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation
AT alanmpriester harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation
AT stephanieharmon harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation
AT thomassanford harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation
AT sherifmehralivand harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation
AT baristurkbey harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation
AT leonardsmarks harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation
AT stevensraman harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation
AT williamspeier harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation
AT coreywarnold harnessingclinicalannotationstoimprovedeeplearningperformanceinprostatesegmentation
_version_ 1718375064042209280