ML CL LGApr 17, 2017

Does Neural Machine Translation Benefit from Larger Context?

Sebastien Jean, Stanislas Lauly, Orhan Firat, Kyunghyun Cho

arXiv:1704.05135v123.8153 citations

Originality Synthesis-oriented

AI Analysis

This addresses the problem of context modeling in machine translation for researchers, showing incremental benefits that are data-dependent.

The study investigated whether neural machine translation benefits from modeling surrounding text, finding that it improves translation quality and pronoun prediction on small corpora, but the advantage diminishes with larger training data.

We propose a neural machine translation architecture that models the surrounding text in addition to the source sentence. These models lead to better performance, both in terms of general translation quality and pronoun prediction, when trained on small corpora, although this improvement largely disappears when trained with a larger corpus. We also discover that attention-based neural machine translation is well suited for pronoun prediction and compares favorably with other approaches that were specifically designed for this task.

View on arXiv PDF

Similar