CLAISep 14, 2021

A Three Step Training Approach with Data Augmentation for Morphological Inflection

arXiv:2109.07006v11 citations
Originality Synthesis-oriented
AI Analysis

This work addresses morphological inflection for linguists and NLP researchers, but it is incremental as it builds on existing methods with minor improvements.

The authors tackled morphological inflection across diverse languages using a three-step training approach with data augmentation on an LSTM encoder-decoder model, achieving results that outperformed the only other submission but remained worse than a Transformer baseline.

We present the BME submission for the SIGMORPHON 2021 Task 0 Part 1, Generalization Across Typologically Diverse Languages shared task. We use an LSTM encoder-decoder model with three step training that is first trained on all languages, then fine-tuned on each language families and finally finetuned on individual languages. We use a different type of data augmentation technique in the first two steps. Our system outperformed the only other submission. Although it remains worse than the Transformer baseline released by the organizers, our model is simpler and our data augmentation techniques are easily applicable to new languages. We perform ablation studies and show that the augmentation techniques and the three training steps often help but sometimes have a negative effect.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes