CLOct 12, 2021

Doubly-Trained Adversarial Data Augmentation for Neural Machine Translation

arXiv:2110.05691v1621 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses robustness issues in neural machine translation systems, which is an incremental improvement for the machine translation community.

The paper tackles the problem of neural machine translation models being vulnerable to noisy inputs by generating adversarial augmentation samples that attack the model while preserving source-side semantic meaning, resulting in improved model robustness across three language pairs and two evaluation metrics.

Neural Machine Translation (NMT) models are known to suffer from noisy inputs. To make models robust, we generate adversarial augmentation samples that attack the model and preserve the source-side semantic meaning at the same time. To generate such samples, we propose a doubly-trained architecture that pairs two NMT models of opposite translation directions with a joint loss function, which combines the target-side attack and the source-side semantic similarity constraint. The results from our experiments across three different language pairs and two evaluation metrics show that these adversarial samples improve the model robustness.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes