CLApr 26, 2022

Efficient Machine Translation Domain Adaptation

arXiv:2204.12608v1644 citationsh-index: 12
Originality Incremental advance
AI Analysis

This work addresses efficiency issues in domain adaptation for machine translation, which is incremental as it builds on existing retrieval-based methods to reduce computational costs.

The paper tackles the problem of slow retrieval-augmented machine translation models for domain adaptation by introducing methods to speed up nearest neighbor retrieval, including a caching strategy, resulting in improved runtimes while maintaining translation quality across domains.

Machine translation models struggle when translating out-of-domain text, which makes domain adaptation a topic of critical importance. However, most domain adaptation methods focus on fine-tuning or training the entire or part of the model on every new domain, which can be costly. On the other hand, semi-parametric models have been shown to successfully perform domain adaptation by retrieving examples from an in-domain datastore (Khandelwal et al., 2021). A drawback of these retrieval-augmented models, however, is that they tend to be substantially slower. In this paper, we explore several approaches to speed up nearest neighbor machine translation. We adapt the methods recently proposed by He et al. (2021) for language modeling, and introduce a simple but effective caching strategy that avoids performing retrieval when similar contexts have been seen before. Translation quality and runtimes for several domains show the effectiveness of the proposed solutions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes