CLAIMay 24, 2021

Neural Machine Translation with Monolingual Translation Memory

arXiv:2105.11269v2729 citations
Originality Highly original
AI Analysis

This work addresses translation quality enhancement for machine translation systems, particularly benefiting low-resource and domain adaptation scenarios, though it is incremental relative to prior TM-augmented methods.

The paper tackles the problem of improving neural machine translation by proposing a framework that uses monolingual translation memory with cross-lingual retrieval, achieving substantial improvements and outperforming bilingual TM-augmented baselines.

Prior work has proved that Translation memory (TM) can boost the performance of Neural Machine Translation (NMT). In contrast to existing work that uses bilingual corpus as TM and employs source-side similarity search for memory retrieval, we propose a new framework that uses monolingual memory and performs learnable memory retrieval in a cross-lingual manner. Our framework has unique advantages. First, the cross-lingual memory retriever allows abundant monolingual data to be TM. Second, the memory retriever and NMT model can be jointly optimized for the ultimate translation goal. Experiments show that the proposed method obtains substantial improvements. Remarkably, it even outperforms strong TM-augmented NMT baselines using bilingual TM. Owning to the ability to leverage monolingual data, our model also demonstrates effectiveness in low-resource and domain adaptation scenarios.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes