Deep Learning of Inflection and the Cell-Filling Problem

Machine learning offers two basic strategies for morphology induction: lexical segmentation and surface word relation. The first approach assumes that words can be segmented into morphemes. Inferring a novel inflected form requires identification of morphemic constituents and a strategy for their re...

Description complète

Enregistré dans:
Détails bibliographiques
Auteurs principaux: Franco Alberto Cardillo, Marcello Ferro, Claudia Marzi, Vito Pirrelli
Format: article
Langue:EN
Publié: Accademia University Press 2018
Sujets:
H
Accès en ligne:https://doaj.org/article/ea47fdee18ca49449c2938cac6bdd45f
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
Description
Résumé:Machine learning offers two basic strategies for morphology induction: lexical segmentation and surface word relation. The first approach assumes that words can be segmented into morphemes. Inferring a novel inflected form requires identification of morphemic constituents and a strategy for their recombination. The second approach dispenses with segmentation: lexical representations form part of a network of associatively related inflected forms. Production of a novel form consists in filling in one empty node in the network. Here, we present the results of a task of word inflection by a recurrent LSTM network that learns to fill in paradigm cells of incomplete verb paradigms. Although the task does not require morpheme segmentation, we show that accuracy in carrying out the inflection task is a function of the model’s sensitivity to paradigm distribution and morphological structure.