CL AISep 14, 2021

A Three Step Training Approach with Data Augmentation for Morphological Inflection

Gabor Szolnok, Botond Barta, Dorina Lakatos, Judit Acs

arXiv:2109.07006v10.51 citations

Originality Synthesis-oriented

AI Analysis

This work addresses morphological inflection for linguists and NLP researchers, but it is incremental as it builds on existing methods with minor improvements.

The authors tackled morphological inflection across diverse languages using a three-step training approach with data augmentation on an LSTM encoder-decoder model, achieving results that outperformed the only other submission but remained worse than a Transformer baseline.

We present the BME submission for the SIGMORPHON 2021 Task 0 Part 1, Generalization Across Typologically Diverse Languages shared task. We use an LSTM encoder-decoder model with three step training that is first trained on all languages, then fine-tuned on each language families and finally finetuned on individual languages. We use a different type of data augmentation technique in the first two steps. Our system outperformed the only other submission. Although it remains worse than the Transformer baseline released by the organizers, our model is simpler and our data augmentation techniques are easily applicable to new languages. We perform ablation studies and show that the augmentation techniques and the three training steps often help but sometimes have a negative effect.

View on arXiv PDF

Similar