CLAIDec 6, 2022

Neural Machine Translation with Contrastive Translation Memories

Peking U
arXiv:2212.03140v1298 citationsh-index: 52
Originality Highly original
AI Analysis

This work addresses inefficiencies in machine translation for users by introducing a novel retrieval and encoding approach, though it is incremental as it builds on existing retrieval-augmented models.

The paper tackles the problem of redundancy in retrieval-augmented neural machine translation by proposing a method that uses contrastive translation memories to maximize information gain, resulting in improvements over strong baselines on benchmark datasets.

Retrieval-augmented Neural Machine Translation models have been successful in many translation scenarios. Different from previous works that make use of mutually similar but redundant translation memories~(TMs), we propose a new retrieval-augmented NMT to model contrastively retrieved translation memories that are holistically similar to the source sentence while individually contrastive to each other providing maximal information gains in three phases. First, in TM retrieval phase, we adopt a contrastive retrieval algorithm to avoid redundancy and uninformativeness of similar translation pieces. Second, in memory encoding stage, given a set of TMs we propose a novel Hierarchical Group Attention module to gather both local context of each TM and global context of the whole TM set. Finally, in training phase, a Multi-TM contrastive learning objective is introduced to learn salient feature of each TM with respect to target sentence. Experimental results show that our framework obtains improvements over strong baselines on the benchmark datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes