CLAug 22, 2016

Context Gates for Neural Machine Translation

arXiv:1608.06043v3147 citations
AI Analysis

This addresses the issue of balancing adequacy and fluency in NMT for translation tasks, representing an incremental improvement.

The paper tackles the problem of inadequate translations in neural machine translation by proposing context gates that dynamically control the influence of source and target contexts during word generation, resulting in a +2.3 BLEU point improvement over a standard attention-based system.

In neural machine translation (NMT), generation of a target word depends on both source and target contexts. We find that source contexts have a direct impact on the adequacy of a translation while target contexts affect the fluency. Intuitively, generation of a content word should rely more on the source context and generation of a functional word should rely more on the target context. Due to the lack of effective control over the influence from source and target contexts, conventional NMT tends to yield fluent but inadequate translations. To address this problem, we propose context gates which dynamically control the ratios at which source and target contexts contribute to the generation of target words. In this way, we can enhance both the adequacy and fluency of NMT with more careful control of the information flow from contexts. Experiments show that our approach significantly improves upon a standard attention-based NMT system by +2.3 BLEU points.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes