Embeddings-based detection of word use variation in Italian newspapers

We study how words are used differently in two Italian newspapers at opposite ends of the political spectrum by training embeddings on one newspaper’s corpus, updating the weights on the second one, and observing vector shifts. We run two types of analysis, one top-down, based on a preselection of f...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Michele Cafagna, Lorenzo De Mattei, Malvina Nissim
Formato: article
Lenguaje:EN
Publicado: Accademia University Press 2020
Materias:
H
Acceso en línea:https://doaj.org/article/1c1d181040114638a9dbaaffc724a82c
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:1c1d181040114638a9dbaaffc724a82c
record_format dspace
spelling oai:doaj.org-article:1c1d181040114638a9dbaaffc724a82c2021-12-02T09:52:20ZEmbeddings-based detection of word use variation in Italian newspapers2499-455310.4000/ijcol.703https://doaj.org/article/1c1d181040114638a9dbaaffc724a82c2020-12-01T00:00:00Zhttp://journals.openedition.org/ijcol/703https://doaj.org/toc/2499-4553We study how words are used differently in two Italian newspapers at opposite ends of the political spectrum by training embeddings on one newspaper’s corpus, updating the weights on the second one, and observing vector shifts. We run two types of analysis, one top-down, based on a preselection of frequent words in both newspapers, and one bottom-up, on the basis of a combination of the observed shifts and relative and absolute frequency. The analysis is specific to this data, but the method can serve as a blueprint for similar studies.Michele CafagnaLorenzo De MatteiMalvina NissimAccademia University PressarticleSocial SciencesHComputational linguistics. Natural language processingP98-98.5ENIJCoL, Vol 6, Iss 2, Pp 9-22 (2020)
institution DOAJ
collection DOAJ
language EN
topic Social Sciences
H
Computational linguistics. Natural language processing
P98-98.5
spellingShingle Social Sciences
H
Computational linguistics. Natural language processing
P98-98.5
Michele Cafagna
Lorenzo De Mattei
Malvina Nissim
Embeddings-based detection of word use variation in Italian newspapers
description We study how words are used differently in two Italian newspapers at opposite ends of the political spectrum by training embeddings on one newspaper’s corpus, updating the weights on the second one, and observing vector shifts. We run two types of analysis, one top-down, based on a preselection of frequent words in both newspapers, and one bottom-up, on the basis of a combination of the observed shifts and relative and absolute frequency. The analysis is specific to this data, but the method can serve as a blueprint for similar studies.
format article
author Michele Cafagna
Lorenzo De Mattei
Malvina Nissim
author_facet Michele Cafagna
Lorenzo De Mattei
Malvina Nissim
author_sort Michele Cafagna
title Embeddings-based detection of word use variation in Italian newspapers
title_short Embeddings-based detection of word use variation in Italian newspapers
title_full Embeddings-based detection of word use variation in Italian newspapers
title_fullStr Embeddings-based detection of word use variation in Italian newspapers
title_full_unstemmed Embeddings-based detection of word use variation in Italian newspapers
title_sort embeddings-based detection of word use variation in italian newspapers
publisher Accademia University Press
publishDate 2020
url https://doaj.org/article/1c1d181040114638a9dbaaffc724a82c
work_keys_str_mv AT michelecafagna embeddingsbaseddetectionofwordusevariationinitaliannewspapers
AT lorenzodemattei embeddingsbaseddetectionofwordusevariationinitaliannewspapers
AT malvinanissim embeddingsbaseddetectionofwordusevariationinitaliannewspapers
_version_ 1718397931299536896