CLSep 7, 2021

Infusing Future Information into Monotonic Attention Through Language Models

arXiv:2109.03121v12 citations
AI Analysis

This work addresses performance issues in simultaneous translation for speech-to-text tasks, representing an incremental improvement over existing monotonic attention methods.

The paper tackles the problem of poor read/write decisions in simultaneous neural machine translation due to insufficient information, proposing a framework that uses an external language model to aid monotonic attention, which improves the quality-latency trade-off over state-of-the-art methods on MuST-C English-German and English-French tasks.

Simultaneous neural machine translation(SNMT) models start emitting the target sequence before they have processed the source sequence. The recent adaptive policies for SNMT use monotonic attention to perform read/write decisions based on the partial source and target sequences. The lack of sufficient information might cause the monotonic attention to take poor read/write decisions, which in turn negatively affects the performance of the SNMT model. On the other hand, human translators make better read/write decisions since they can anticipate the immediate future words using linguistic information and domain knowledge.Motivated by human translators, in this work, we propose a framework to aid monotonic attention with an external language model to improve its decisions.We conduct experiments on the MuST-C English-German and English-French speech-to-text translation tasks to show the effectiveness of the proposed framework.The proposed SNMT method improves the quality-latency trade-off over the state-of-the-art monotonic multihead attention.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes