CLLGNov 21, 2019

Minimizing the Bag-of-Ngrams Difference for Non-Autoregressive Neural Machine Translation

arXiv:1911.09320v195 citations
Originality Incremental advance
AI Analysis

This work addresses fluency issues in non-autoregressive translation, which is important for speeding up machine translation systems, but it is incremental as it modifies the training objective within an existing framework.

The paper tackled the problem of weak correlation between word-level cross-entropy loss and translation quality in Non-Autoregressive Neural Machine Translation (NAT), which leads to fluency errors, by proposing a Bag-of-Ngrams (BoN) difference training objective. The result was a significant improvement, outperforming the NAT baseline by about 5.0 BLEU scores on WMT14 En↔De and about 2.5 BLEU scores on WMT16 En↔Ro.

Non-Autoregressive Neural Machine Translation (NAT) achieves significant decoding speedup through generating target words independently and simultaneously. However, in the context of non-autoregressive translation, the word-level cross-entropy loss cannot model the target-side sequential dependency properly, leading to its weak correlation with the translation quality. As a result, NAT tends to generate influent translations with over-translation and under-translation errors. In this paper, we propose to train NAT to minimize the Bag-of-Ngrams (BoN) difference between the model output and the reference sentence. The bag-of-ngrams training objective is differentiable and can be efficiently calculated, which encourages NAT to capture the target-side sequential dependency and correlates well with the translation quality. We validate our approach on three translation tasks and show that our approach largely outperforms the NAT baseline by about 5.0 BLEU scores on WMT14 En$\leftrightarrow$De and about 2.5 BLEU scores on WMT16 En$\leftrightarrow$Ro.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes