CL
Dec 15, 2021

Improving Both Domain Robustness and Domain Adaptability in Machine Translation

arXiv:2112.08288v3582 citations
Has Code
Originality: Incremental advance
AI Analysis

This work addresses domain adaptation challenges in machine translation, offering incremental improvements for researchers and practitioners in NLP.

The paper tackles domain adaptation in neural machine translation by proposing RMLNMT, a meta-learning framework that improves both domain robustness and adaptability, achieving enhanced performance on seen and unseen domains in English→German and English→Chinese translation.

We consider two problems of NMT domain adaptation using meta-learning. First, we want to reach domain robustness, i.e., we want to reach high quality on both domains seen in the training data and unseen domains. Second, we want our systems to be adaptive, i.e., making it possible to finetune systems with just hundreds of in-domain parallel sentences. We study the domain adaptability of meta-learning when improving the domain robustness of the model. In this paper, we propose a novel approach, RMLNMT (Robust Meta-Learning Framework for Neural Machine Translation Domain Adaptation), which improves the robustness of existing meta-learning models. More specifically, we show how to use a domain classifier in curriculum learning and we integrate the word-level domain mixing model into the meta-learning framework with a balanced sampling strategy. Experiments on English→German and English→Chinese translation show that RMLNMT improves in terms of both domain robustness and domain adaptability in seen and unseen domains. Our source code is available at https://github.com/lavine-lmu/RMLNMT.
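The abstract mentions a balanced sampling strategy but does not spell it out. As a rough illustration only, the sketch below shows one common reading of the idea: drawing the same number of sentence pairs from every domain when assembling a training batch, so that large domains do not dominate. The function name `balanced_sample`, the `(domain, sentence_pair)` tuple layout, and the choice to oversample small domains with replacement are all assumptions made for this example and are not taken from the RMLNMT code.

```python
import random
from collections import defaultdict


def balanced_sample(examples, per_domain, seed=0):
    """Draw the same number of examples from every domain.

    `examples` is a list of (domain, sentence_pair) tuples (hypothetical layout).
    Domains with fewer than `per_domain` examples are sampled with
    replacement; this oversampling rule is an assumption of the sketch.
    """
    rng = random.Random(seed)

    # Group sentence pairs by their domain label.
    by_domain = defaultdict(list)
    for domain, pair in examples:
        by_domain[domain].append(pair)

    batch = []
    for domain in sorted(by_domain):  # sorted for a reproducible order
        pool = by_domain[domain]
        if len(pool) >= per_domain:
            picked = rng.sample(pool, per_domain)     # without replacement
        else:
            picked = rng.choices(pool, k=per_domain)  # with replacement
        batch.extend((domain, pair) for pair in picked)
    return batch
```

With, say, ten medical pairs and two legal pairs, `balanced_sample(examples, per_domain=4)` would return four entries for each domain. How the paper combines such sampling with the domain classifier and the word-level domain mixing model is described in the paper and its repository, not here.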

Code Implementations: 1 repo
