CLJan 6, 2022

PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation

arXiv:2201.02009v2581 citations
AI Analysis

This work addresses robustness issues in neural machine translation for users relying on accurate translations, though it is incremental as it builds on existing gradient-based methods.

The paper tackles the fragility of neural machine translation models to noisy inputs by proposing a phrase-level adversarial example generation framework, which improves translation performance and robustness across three benchmarks.

While end-to-end neural machine translation (NMT) has achieved impressive progress, noisy input usually leads models to become fragile and unstable. Generating adversarial examples as the augmented data has been proved to be useful to alleviate this problem. Existing methods for adversarial example generation (AEG) are word-level or character-level, which ignore the ubiquitous phrase structure. In this paper, we propose a Phrase-level Adversarial Example Generation (PAEG) framework to enhance the robustness of the translation model. Our method further improves the gradient-based word-level AEG method by adopting a phrase-level substitution strategy. We verify our method on three benchmarks, including LDC Chinese-English, IWSLT14 German-English, and WMT14 English-German tasks. Experimental results demonstrate that our approach significantly improves translation performance and robustness to noise compared to previous strong baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes