CLSep 8, 2021

Mixup Decoding for Diverse Machine Translation

arXiv:2109.03402v2663 citations
AI Analysis

This addresses the need for varied translations in machine translation systems, offering a novel approach without additional training, though it is incremental in building on mixup training concepts.

The paper tackles the problem of generating diverse translations in machine translation by proposing MixDiversity, a method that uses linear interpolation with sampled sentence pairs during decoding, resulting in substantial performance improvements over previous methods on datasets like WMT'16 en-ro, WMT'14 en-de, and WMT'17 zh-en.

Diverse machine translation aims at generating various target language translations for a given source language sentence. Leveraging the linear relationship in the sentence latent space introduced by the mixup training, we propose a novel method, MixDiversity, to generate different translations for the input sentence by linearly interpolating it with different sentence pairs sampled from the training corpus when decoding. To further improve the faithfulness and diversity of the translations, we propose two simple but effective approaches to select diverse sentence pairs in the training corpus and adjust the interpolation weight for each pair correspondingly. Moreover, by controlling the interpolation weight, our method can achieve the trade-off between faithfulness and diversity without any additional training, which is required in most of the previous methods. Experiments on WMT'16 en-ro, WMT'14 en-de, and WMT'17 zh-en are conducted to show that our method substantially outperforms all previous diverse machine translation methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes