CLMay 25, 2018

Context-Aware Neural Machine Translation Learns Anaphora Resolution

arXiv:1805.10163v11199 citations
Originality Incremental advance
AI Analysis

This addresses translation coherence and pronoun accuracy for users of neural machine translation systems, but it is incremental as it builds on existing context-aware approaches.

The paper tackled the problem of machine translation ignoring extra-sentential context, which can cause errors in ambiguous cases like pronouns, and introduced a context-aware model that improved overall BLEU scores by +0.7 over a context-agnostic version and +0.6 over simple concatenation.

Standard machine translation systems process sentences in isolation and hence ignore extra-sentential information, even though extended context can both prevent mistakes in ambiguous cases and improve translation coherence. We introduce a context-aware neural machine translation model designed in such way that the flow of information from the extended context to the translation model can be controlled and analyzed. We experiment with an English-Russian subtitles dataset, and observe that much of what is captured by our model deals with improving pronoun translation. We measure correspondences between induced attention distributions and coreference relations and observe that the model implicitly captures anaphora. It is consistent with gains for sentences where pronouns need to be gendered in translation. Beside improvements in anaphoric cases, the model also improves in overall BLEU, both over its context-agnostic version (+0.7) and over simple concatenation of the context and source sentences (+0.6).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes