CLJun 21, 2023

Morphological Inflection with Phonological Features

arXiv:2306.12581v1222 citationsh-index: 30
Originality Synthesis-oriented
AI Analysis

This work addresses challenges in morphological tasks for low-resource languages, but it is incremental as it shows limited gains over existing approaches.

The paper tackled the problem of improving morphological inflection models by incorporating subcharacter phonological features, finding that the methods yielded comparable results to grapheme-based baselines with minor improvements in some languages.

Recent years have brought great advances into solving morphological tasks, mostly due to powerful neural models applied to various tasks as (re)inflection and analysis. Yet, such morphological tasks cannot be considered solved, especially when little training data is available or when generalizing to previously unseen lemmas. This work explores effects on performance obtained through various ways in which morphological models get access to subcharacter phonological features that are the targets of morphological processes. We design two methods to achieve this goal: one that leaves models as is but manipulates the data to include features instead of characters, and another that manipulates models to take phonological features into account when building representations for phonemes. We elicit phonemic data from standard graphemic data using language-specific grammars for languages with shallow grapheme-to-phoneme mapping, and we experiment with two reinflection models over eight languages. Our results show that our methods yield comparable results to the grapheme-based baseline overall, with minor improvements in some of the languages. All in all, we conclude that patterns in character distributions are likely to allow models to infer the underlying phonological characteristics, even when phonemes are not explicitly represented.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes