Leveraging network analysis to evaluate biomedical named entity recognition tools

Abstract The ever-growing availability of biomedical text sources has resulted in a boost in clinical studies based on their exploitation. Biomedical named-entity recognition (bio-NER) techniques have evolved remarkably in recent years and their application in research is increasingly successful. St...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Eduardo P. García del Valle, Gerardo Lagunes García, Lucía Prieto Santamaría, Massimiliano Zanin, Ernestina Menasalvas Ruiz, Alejandro Rodríguez-González
Formato: article
Lenguaje:EN
Publicado: Nature Portfolio 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/1a59a4ae3e8a47658867a982cc8edfda
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:1a59a4ae3e8a47658867a982cc8edfda
record_format dspace
spelling oai:doaj.org-article:1a59a4ae3e8a47658867a982cc8edfda2021-12-02T16:10:38ZLeveraging network analysis to evaluate biomedical named entity recognition tools10.1038/s41598-021-93018-w2045-2322https://doaj.org/article/1a59a4ae3e8a47658867a982cc8edfda2021-06-01T00:00:00Zhttps://doi.org/10.1038/s41598-021-93018-whttps://doaj.org/toc/2045-2322Abstract The ever-growing availability of biomedical text sources has resulted in a boost in clinical studies based on their exploitation. Biomedical named-entity recognition (bio-NER) techniques have evolved remarkably in recent years and their application in research is increasingly successful. Still, the disparity of tools and the limited available validation resources are barriers preventing a wider diffusion, especially within clinical practice. We here propose the use of omics data and network analysis as an alternative for the assessment of bio-NER tools. Specifically, our method introduces quality criteria based on edge overlap and community detection. The application of these criteria to four bio-NER solutions yielded comparable results to strategies based on annotated corpora, without suffering from their limitations. Our approach can constitute a guide both for the selection of the best bio-NER tool given a specific task, and for the creation and validation of novel approaches.Eduardo P. García del ValleGerardo Lagunes GarcíaLucía Prieto SantamaríaMassimiliano ZaninErnestina Menasalvas RuizAlejandro Rodríguez-GonzálezNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 11, Iss 1, Pp 1-10 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Eduardo P. García del Valle
Gerardo Lagunes García
Lucía Prieto Santamaría
Massimiliano Zanin
Ernestina Menasalvas Ruiz
Alejandro Rodríguez-González
Leveraging network analysis to evaluate biomedical named entity recognition tools
description Abstract The ever-growing availability of biomedical text sources has resulted in a boost in clinical studies based on their exploitation. Biomedical named-entity recognition (bio-NER) techniques have evolved remarkably in recent years and their application in research is increasingly successful. Still, the disparity of tools and the limited available validation resources are barriers preventing a wider diffusion, especially within clinical practice. We here propose the use of omics data and network analysis as an alternative for the assessment of bio-NER tools. Specifically, our method introduces quality criteria based on edge overlap and community detection. The application of these criteria to four bio-NER solutions yielded comparable results to strategies based on annotated corpora, without suffering from their limitations. Our approach can constitute a guide both for the selection of the best bio-NER tool given a specific task, and for the creation and validation of novel approaches.
format article
author Eduardo P. García del Valle
Gerardo Lagunes García
Lucía Prieto Santamaría
Massimiliano Zanin
Ernestina Menasalvas Ruiz
Alejandro Rodríguez-González
author_facet Eduardo P. García del Valle
Gerardo Lagunes García
Lucía Prieto Santamaría
Massimiliano Zanin
Ernestina Menasalvas Ruiz
Alejandro Rodríguez-González
author_sort Eduardo P. García del Valle
title Leveraging network analysis to evaluate biomedical named entity recognition tools
title_short Leveraging network analysis to evaluate biomedical named entity recognition tools
title_full Leveraging network analysis to evaluate biomedical named entity recognition tools
title_fullStr Leveraging network analysis to evaluate biomedical named entity recognition tools
title_full_unstemmed Leveraging network analysis to evaluate biomedical named entity recognition tools
title_sort leveraging network analysis to evaluate biomedical named entity recognition tools
publisher Nature Portfolio
publishDate 2021
url https://doaj.org/article/1a59a4ae3e8a47658867a982cc8edfda
work_keys_str_mv AT eduardopgarciadelvalle leveragingnetworkanalysistoevaluatebiomedicalnamedentityrecognitiontools
AT gerardolagunesgarcia leveragingnetworkanalysistoevaluatebiomedicalnamedentityrecognitiontools
AT luciaprietosantamaria leveragingnetworkanalysistoevaluatebiomedicalnamedentityrecognitiontools
AT massimilianozanin leveragingnetworkanalysistoevaluatebiomedicalnamedentityrecognitiontools
AT ernestinamenasalvasruiz leveragingnetworkanalysistoevaluatebiomedicalnamedentityrecognitiontools
AT alejandrorodriguezgonzalez leveragingnetworkanalysistoevaluatebiomedicalnamedentityrecognitiontools
_version_ 1718384394447618048