CLOct 23, 2022

Additive Interventions Yield Robust Multi-Domain Machine Translation Models

Microsoft
arXiv:2210.12727v1290 citationsh-index: 40
Originality Incremental advance
AI Analysis

This work addresses domain adaptation challenges in machine translation, offering a robust solution for scenarios with uncertain domain labels, though it is incremental in nature.

The paper tackled the problem of domain adaptation in neural machine translation by comparing additive interventions with tag-based methods, finding that interventions are robust to domain label errors and that single-domain fine-tuning's advantage diminishes with larger training data.

Additive interventions are a recently-proposed mechanism for controlling target-side attributes in neural machine translation. In contrast to tag-based approaches which manipulate the raw source sequence, interventions work by directly modulating the encoder representation of all tokens in the sequence. We examine the role of additive interventions in a large-scale multi-domain machine translation setting and compare its performance in various inference scenarios. We find that while the performance difference is small between intervention-based systems and tag-based systems when the domain label matches the test domain, intervention-based systems are robust to label error, making them an attractive choice under label uncertainty. Further, we find that the superiority of single-domain fine-tuning comes under question when training data size is scaled, contradicting previous findings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes