CLJul 29, 2017

Curriculum Learning and Minibatch Bucketing in Neural Machine Translation

arXiv:1707.09533v11132 citations
Originality Synthesis-oriented
AI Analysis

This work addresses training efficiency and performance in neural machine translation, but it is incremental as it builds on existing curriculum learning concepts with minor gains.

The study investigated the impact of sentence ordering strategies, specifically minibatch homogeneity and curriculum learning, on neural machine translation training, finding that minibatch homogeneity had no effect while certain curricula led to a small improvement over the baseline in English-to-Czech experiments.

We examine the effects of particular orderings of sentence pairs on the on-line training of neural machine translation (NMT). We focus on two types of such orderings: (1) ensuring that each minibatch contains sentences similar in some aspect and (2) gradual inclusion of some sentence types as the training progresses (so called "curriculum learning"). In our English-to-Czech experiments, the internal homogeneity of minibatches has no effect on the training but some of our "curricula" achieve a small improvement over the baseline.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes