FinLex: An effective use of word embeddings for financial lexicon generation

We present a simple and effective methodology for the generation of lexicons (word lists) that may be used in natural language scoring applications. In particular, in the finance industry, word lists have become ubiquitous for sentiment scoring. These have been derived from dictionaries such as the...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Sanjiv R. Das, Michele Donini, Muhammad Bilal Zafar, John He, Krishnaram Kenthapadi
Formato: article
Lenguaje:EN
Publicado: KeAi Communications Co., Ltd. 2022
Materias:
Acceso en línea:https://doaj.org/article/a9e7fbaa425b46cd8cd98d2d853264f8
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
Descripción
Sumario:We present a simple and effective methodology for the generation of lexicons (word lists) that may be used in natural language scoring applications. In particular, in the finance industry, word lists have become ubiquitous for sentiment scoring. These have been derived from dictionaries such as the Harvard Inquirer and require manual curation. Here, we present an automated approach to the curation of lexicons, which makes automatic preparation of any word list immediate. We show that our automated word lists deliver comparable performance to traditional lexicons on machine learning classification tasks. This new approach will enable finance academics and practitioners to create and deploy new word lists in addition to the few traditional ones in a facile manner.