CLOct 8, 2022

ngram-OAXE: Phrase-Based Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation

arXiv:2210.03999v1584 citationsh-index: 17
Originality Incremental advance
AI Analysis

This is an incremental improvement for researchers in machine translation, addressing a specific bottleneck in non-autoregressive models.

The paper tackled the multimodality problem in non-autoregressive machine translation by extending the OAXE loss to allow reordering only between n-gram phrases, resulting in improved translation fluency and phrase modeling across various benchmarks.

Recently, a new training oaxe loss has proven effective to ameliorate the effect of multimodality for non-autoregressive translation (NAT), which removes the penalty of word order errors in the standard cross-entropy loss. Starting from the intuition that reordering generally occurs between phrases, we extend oaxe by only allowing reordering between ngram phrases and still requiring a strict match of word order within the phrases. Extensive experiments on NAT benchmarks across language pairs and data scales demonstrate the effectiveness and universality of our approach. %Further analyses show that the proposed ngram-oaxe alleviates the multimodality problem with a better modeling of phrase translation. Further analyses show that ngram-oaxe indeed improves the translation of ngram phrases, and produces more fluent translation with a better modeling of sentence structure.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes