CL AIMar 17, 2022

Gaussian Multi-head Attention for Simultaneous Machine Translation

arXiv:2203.09072v132.1647 citationsh-index: 29Has Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of balancing translation accuracy and delay in real-time streaming scenarios, representing an incremental improvement over existing methods.

The paper tackles the problem of simultaneous machine translation by proposing a new policy that explicitly models alignment between source and target words to control translation quality and latency, achieving improved trade-offs on En-Vi and De-En tasks.

Simultaneous machine translation (SiMT) outputs translation while receiving the streaming source inputs, and hence needs a policy to determine where to start translating. The alignment between target and source words often implies the most informative source word for each target word, and hence provides the unified control over translation quality and latency, but unfortunately the existing SiMT methods do not explicitly model the alignment to perform the control. In this paper, we propose Gaussian Multi-head Attention (GMA) to develop a new SiMT policy by modeling alignment and translation in a unified manner. For SiMT policy, GMA models the aligned source position of each target word, and accordingly waits until its aligned position to start translating. To integrate the learning of alignment into the translation model, a Gaussian distribution centered on predicted aligned position is introduced as an alignment-related prior, which cooperates with translation-related soft attention to determine the final attention. Experiments on En-Vi and De-En tasks show that our method outperforms strong baselines on the trade-off between translation and latency.

View on arXiv PDF Code

Similar