Enhancing Korean Named Entity Recognition With Linguistic Tokenization Strategies

Tokenization is a significant primary step for the training of the Pre-trained Language Model (PLM), which alleviates the challenging Out-of-Vocabulary problem in the area of Natural Language Processing. As tokenization strategies can change linguistic understanding, it is essential to consider the...

Bibliographic Details
Main Authors: Gyeongmin Kim, Junyoung Son, Jinsung Kim, Hyunhee Lee, Heuiseok Lim
Format: article
Language: EN
Published: IEEE 2021
Online Access: https://doaj.org/article/d665db0beed8491ba11d763eda19afbd