Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse
We present a dataset of named entities in three languages: Medieval Latin, Middle High German and Old Norse. The dataset, containing proper nouns of persons and places, was originally created to extract characters from three related medieval texts. Since the annotation is on low-resource pre-modern...
Guardado en:
Autores principales: | , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Ubiquity Press
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/fc7dd6c06883435b8a8e9bdfa2c2fdef |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:fc7dd6c06883435b8a8e9bdfa2c2fdef |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:fc7dd6c06883435b8a8e9bdfa2c2fdef2021-11-08T08:11:28ZNamed-Entity Dataset for Medieval Latin, Middle High German and Old Norse2059-481X10.5334/johd.36https://doaj.org/article/fc7dd6c06883435b8a8e9bdfa2c2fdef2021-10-01T00:00:00Zhttps://openhumanitiesdata.metajnl.com/articles/36https://doaj.org/toc/2059-481XWe present a dataset of named entities in three languages: Medieval Latin, Middle High German and Old Norse. The dataset, containing proper nouns of persons and places, was originally created to extract characters from three related medieval texts. Since the annotation is on low-resource pre-modern languages, they may be important to build named-entity recognition tools for languages with little data and high linguistic variation.Clément BesnierWilliam MattinglyUbiquity Pressarticlenamed-entity recognitionnatural language processinglatinmedieval latinmiddle high germanold norseHistory of scholarship and learning. The humanitiesAZ20-999Language and LiteraturePENJournal of Open Humanities Data, Vol 7 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
named-entity recognition natural language processing latin medieval latin middle high german old norse History of scholarship and learning. The humanities AZ20-999 Language and Literature P |
spellingShingle |
named-entity recognition natural language processing latin medieval latin middle high german old norse History of scholarship and learning. The humanities AZ20-999 Language and Literature P Clément Besnier William Mattingly Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse |
description |
We present a dataset of named entities in three languages: Medieval Latin, Middle High German and Old Norse. The dataset, containing proper nouns of persons and places, was originally created to extract characters from three related medieval texts. Since the annotation is on low-resource pre-modern languages, they may be important to build named-entity recognition tools for languages with little data and high linguistic variation. |
format |
article |
author |
Clément Besnier William Mattingly |
author_facet |
Clément Besnier William Mattingly |
author_sort |
Clément Besnier |
title |
Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse |
title_short |
Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse |
title_full |
Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse |
title_fullStr |
Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse |
title_full_unstemmed |
Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse |
title_sort |
named-entity dataset for medieval latin, middle high german and old norse |
publisher |
Ubiquity Press |
publishDate |
2021 |
url |
https://doaj.org/article/fc7dd6c06883435b8a8e9bdfa2c2fdef |
work_keys_str_mv |
AT clementbesnier namedentitydatasetformedievallatinmiddlehighgermanandoldnorse AT williammattingly namedentitydatasetformedievallatinmiddlehighgermanandoldnorse |
_version_ |
1718442871931011072 |