CLApr 21, 2019

Dynamic Past and Future for Neural Machine Translation

Zaixiang Zheng, Shujian Huang, Zhaopeng Tu, Xin-Yu Dai, Jiajun Chen

arXiv:1904.09646v230.21008 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses translation quality for NLP applications, but it is incremental as it builds on existing routing mechanisms.

The paper tackles the problem of improving neural machine translation by explicitly modeling translated and untranslated content using a guided dynamic routing mechanism, resulting in substantial improvements over RNMT and Transformer models by producing more adequate translations.

Previous studies have shown that neural machine translation (NMT) models can benefit from explicitly modeling translated (Past) and untranslated (Future) to groups of translated and untranslated contents through parts-to-wholes assignment. The assignment is learned through a novel variant of routing-by-agreement mechanism (Sabour et al., 2017), namely {\em Guided Dynamic Routing}, where the translating status at each decoding step {\em guides} the routing process to assign each source word to its associated group (i.e., translated or untranslated content) represented by a capsule, enabling translation to be made from holistic context. Experiments show that our approach achieves substantial improvements over both RNMT and Transformer by producing more adequate translations. Extensive analysis demonstrates that our method is highly interpretable, which is able to recognize the translated and untranslated contents as expected.

View on arXiv PDF Code

Similar