CLAILGSep 12, 2017

Refining Source Representations with Relation Networks for Neural Machine Translation

arXiv:1709.03980v321 citations
Originality Incremental advance
AI Analysis

This work addresses translation quality issues for NMT users, but it is incremental as it builds on existing encoder-decoder frameworks.

The paper tackles the problem of neural machine translation (NMT) suffering from forgetting old information and ignoring word relationships by introducing relation networks to refine source representations, resulting in significant performance improvements on Chinese-to-English datasets.

Although neural machine translation (NMT) with the encoder-decoder framework has achieved great success in recent times, it still suffers from some drawbacks: RNNs tend to forget old information which is often useful and the encoder only operates through words without considering word relationship. To solve these problems, we introduce a relation networks (RN) into NMT to refine the encoding representations of the source. In our method, the RN first augments the representation of each source word with its neighbors and reasons all the possible pairwise relations between them. Then the source representations and all the relations are fed to the attention module and the decoder together, keeping the main encoder-decoder architecture unchanged. Experiments on two Chinese-to-English data sets in different scales both show that our method can outperform the competitive baselines significantly.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes