CLJun 8, 2018

Multilingual Neural Machine Translation with Task-Specific Attention

Graeme Blackwood, Miguel Ballesteros, Todd Ward

arXiv:1806.03280v132.31124 citations

Originality Incremental advance

AI Analysis

This work addresses translation quality improvements for multilingual NMT systems, particularly in low-resource settings, but is incremental as it builds on existing attention mechanisms.

The paper tackled the problem of improving multilingual neural machine translation by proposing task-specific attention models, which achieved consistent gains in translation quality across all translation directions, including low-resource zero-shot scenarios, on the Europarl corpus.

Multilingual machine translation addresses the task of translating between multiple source and target languages. We propose task-specific attention models, a simple but effective technique for improving the quality of sequence-to-sequence neural multilingual translation. Our approach seeks to retain as much of the parameter sharing generalization of NMT models as possible, while still allowing for language-specific specialization of the attention model to a particular language-pair or task. Our experiments on four languages of the Europarl corpus show that using a target-specific model of attention provides consistent gains in translation quality for all possible translation directions, compared to a model in which all parameters are shared. We observe improved translation quality even in the (extreme) low-resource zero-shot translation directions for which the model never saw explicitly paired parallel data.

View on arXiv PDF

Similar