CLMay 13, 2018

Bag-of-Words as Target for Neural Machine Translation

arXiv:1805.04871v11137 citations
Originality Incremental advance
AI Analysis

This addresses the issue of translation diversity for machine translation systems, though it is incremental as it builds on existing methods.

The paper tackles the problem of neural machine translation models punishing multiple correct translations by proposing an approach that uses both sentences and bag-of-words as targets during training, resulting in a BLEU score improvement of 4.55 over strong baselines on a Chinese-English dataset.

A sentence can be translated into more than one correct sentences. However, most of the existing neural machine translation models only use one of the correct translations as the targets, and the other correct sentences are punished as the incorrect sentences in the training stage. Since most of the correct translations for one sentence share the similar bag-of-words, it is possible to distinguish the correct translations from the incorrect ones by the bag-of-words. In this paper, we propose an approach that uses both the sentences and the bag-of-words as targets in the training stage, in order to encourage the model to generate the potentially correct sentences that are not appeared in the training set. We evaluate our model on a Chinese-English translation dataset, and experiments show our model outperforms the strong baselines by the BLEU score of 4.55.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes