Building and Comparing Lemma Embeddings for Latin. Classical Latin versus Thomas Aquinas
This paper presents a new set of lemma embeddings for the Latin language. Embeddings are trained on a manually annotated corpus of texts belonging to the Classical era: different models, architectures and dimensions are tested and evaluated using a novel benchmark for the synonym selection task. In...
Guardado en:
Autores principales: | , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Accademia University Press
2020
|
Materias: | |
Acceso en línea: | https://doaj.org/article/7c5afd58eef44c698580f902978bf4f8 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:7c5afd58eef44c698580f902978bf4f8 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:7c5afd58eef44c698580f902978bf4f82021-12-02T09:52:31ZBuilding and Comparing Lemma Embeddings for Latin. Classical Latin versus Thomas Aquinas2499-455310.4000/ijcol.624https://doaj.org/article/7c5afd58eef44c698580f902978bf4f82020-06-01T00:00:00Zhttp://journals.openedition.org/ijcol/624https://doaj.org/toc/2499-4553This paper presents a new set of lemma embeddings for the Latin language. Embeddings are trained on a manually annotated corpus of texts belonging to the Classical era: different models, architectures and dimensions are tested and evaluated using a novel benchmark for the synonym selection task. In addition, we release vectors pre-trained on the “Opera Maiora” by Thomas Aquinas, thus providing a resource to analyze Latin in a diachronic perspective. The embeddings built upon the two training corpora are compared to each other to support diachronic lexical studies. The words showing the highest usage change between the two corpora are reported and a selection of them is discussed.Rachele SprugnoliGiovanni MorettiMarco PassarottiAccademia University PressarticleSocial SciencesHComputational linguistics. Natural language processingP98-98.5ENIJCoL, Vol 6, Iss 1, Pp 29-45 (2020) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Social Sciences H Computational linguistics. Natural language processing P98-98.5 |
spellingShingle |
Social Sciences H Computational linguistics. Natural language processing P98-98.5 Rachele Sprugnoli Giovanni Moretti Marco Passarotti Building and Comparing Lemma Embeddings for Latin. Classical Latin versus Thomas Aquinas |
description |
This paper presents a new set of lemma embeddings for the Latin language. Embeddings are trained on a manually annotated corpus of texts belonging to the Classical era: different models, architectures and dimensions are tested and evaluated using a novel benchmark for the synonym selection task. In addition, we release vectors pre-trained on the “Opera Maiora” by Thomas Aquinas, thus providing a resource to analyze Latin in a diachronic perspective. The embeddings built upon the two training corpora are compared to each other to support diachronic lexical studies. The words showing the highest usage change between the two corpora are reported and a selection of them is discussed. |
format |
article |
author |
Rachele Sprugnoli Giovanni Moretti Marco Passarotti |
author_facet |
Rachele Sprugnoli Giovanni Moretti Marco Passarotti |
author_sort |
Rachele Sprugnoli |
title |
Building and Comparing Lemma Embeddings for Latin. Classical Latin versus Thomas Aquinas |
title_short |
Building and Comparing Lemma Embeddings for Latin. Classical Latin versus Thomas Aquinas |
title_full |
Building and Comparing Lemma Embeddings for Latin. Classical Latin versus Thomas Aquinas |
title_fullStr |
Building and Comparing Lemma Embeddings for Latin. Classical Latin versus Thomas Aquinas |
title_full_unstemmed |
Building and Comparing Lemma Embeddings for Latin. Classical Latin versus Thomas Aquinas |
title_sort |
building and comparing lemma embeddings for latin. classical latin versus thomas aquinas |
publisher |
Accademia University Press |
publishDate |
2020 |
url |
https://doaj.org/article/7c5afd58eef44c698580f902978bf4f8 |
work_keys_str_mv |
AT rachelesprugnoli buildingandcomparinglemmaembeddingsforlatinclassicallatinversusthomasaquinas AT giovannimoretti buildingandcomparinglemmaembeddingsforlatinclassicallatinversusthomasaquinas AT marcopassarotti buildingandcomparinglemmaembeddingsforlatinclassicallatinversusthomasaquinas |
_version_ |
1718397953441267712 |