Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse

We present a dataset of named entities in three languages: Medieval Latin, Middle High German and Old Norse. The dataset, containing proper nouns of persons and places, was originally created to extract characters from three related medieval texts. Since the annotation is on low-resource pre-modern...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Clément Besnier, William Mattingly
Formato: article
Lenguaje:EN
Publicado: Ubiquity Press 2021
Materias:
P
Acceso en línea:https://doaj.org/article/fc7dd6c06883435b8a8e9bdfa2c2fdef
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:fc7dd6c06883435b8a8e9bdfa2c2fdef
record_format dspace
spelling oai:doaj.org-article:fc7dd6c06883435b8a8e9bdfa2c2fdef2021-11-08T08:11:28ZNamed-Entity Dataset for Medieval Latin, Middle High German and Old Norse2059-481X10.5334/johd.36https://doaj.org/article/fc7dd6c06883435b8a8e9bdfa2c2fdef2021-10-01T00:00:00Zhttps://openhumanitiesdata.metajnl.com/articles/36https://doaj.org/toc/2059-481XWe present a dataset of named entities in three languages: Medieval Latin, Middle High German and Old Norse. The dataset, containing proper nouns of persons and places, was originally created to extract characters from three related medieval texts. Since the annotation is on low-resource pre-modern languages, they may be important to build named-entity recognition tools for languages with little data and high linguistic variation.Clément BesnierWilliam MattinglyUbiquity Pressarticlenamed-entity recognitionnatural language processinglatinmedieval latinmiddle high germanold norseHistory of scholarship and learning. The humanitiesAZ20-999Language and LiteraturePENJournal of Open Humanities Data, Vol 7 (2021)
institution DOAJ
collection DOAJ
language EN
topic named-entity recognition
natural language processing
latin
medieval latin
middle high german
old norse
History of scholarship and learning. The humanities
AZ20-999
Language and Literature
P
spellingShingle named-entity recognition
natural language processing
latin
medieval latin
middle high german
old norse
History of scholarship and learning. The humanities
AZ20-999
Language and Literature
P
Clément Besnier
William Mattingly
Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse
description We present a dataset of named entities in three languages: Medieval Latin, Middle High German and Old Norse. The dataset, containing proper nouns of persons and places, was originally created to extract characters from three related medieval texts. Since the annotation is on low-resource pre-modern languages, they may be important to build named-entity recognition tools for languages with little data and high linguistic variation.
format article
author Clément Besnier
William Mattingly
author_facet Clément Besnier
William Mattingly
author_sort Clément Besnier
title Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse
title_short Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse
title_full Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse
title_fullStr Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse
title_full_unstemmed Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse
title_sort named-entity dataset for medieval latin, middle high german and old norse
publisher Ubiquity Press
publishDate 2021
url https://doaj.org/article/fc7dd6c06883435b8a8e9bdfa2c2fdef
work_keys_str_mv AT clementbesnier namedentitydatasetformedievallatinmiddlehighgermanandoldnorse
AT williammattingly namedentitydatasetformedievallatinmiddlehighgermanandoldnorse
_version_ 1718442871931011072