CLNEApr 21, 2017

Attention Strategies for Multi-Source Sequence-to-Sequence Learning

arXiv:1704.06567v1191 citations
Originality Incremental advance
AI Analysis

This addresses a relatively unexplored area for tasks involving multiple source languages or modalities, but it is incremental as it builds on existing attention techniques.

The paper tackled the problem of modeling attention in multi-source sequence-to-sequence learning, proposing flat and hierarchical combination methods, and achieved competitive results on WMT16 Multimodal Translation and Automatic Post-editing tasks.

Modeling attention in neural multi-source sequence-to-sequence learning remains a relatively unexplored area, despite its usefulness in tasks that incorporate multiple source languages or modalities. We propose two novel approaches to combine the outputs of attention mechanisms over each source sequence, flat and hierarchical. We compare the proposed methods with existing techniques and present results of systematic evaluation of those methods on the WMT16 Multimodal Translation and Automatic Post-editing tasks. We show that the proposed methods achieve competitive results on both tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes