CLOct 23, 2022

Additive Interventions Yield Robust Multi-Domain Machine Translation Models

Microsoft

arXiv:2210.12727v123.9290 citationsh-index: 40

Originality Incremental advance

AI Analysis

This work addresses domain adaptation challenges in machine translation, offering a robust solution for scenarios with uncertain domain labels, though it is incremental in nature.

The paper tackled the problem of domain adaptation in neural machine translation by comparing additive interventions with tag-based methods, finding that interventions are robust to domain label errors and that single-domain fine-tuning's advantage diminishes with larger training data.

Additive interventions are a recently-proposed mechanism for controlling target-side attributes in neural machine translation. In contrast to tag-based approaches which manipulate the raw source sequence, interventions work by directly modulating the encoder representation of all tokens in the sequence. We examine the role of additive interventions in a large-scale multi-domain machine translation setting and compare its performance in various inference scenarios. We find that while the performance difference is small between intervention-based systems and tag-based systems when the domain label matches the test domain, intervention-based systems are robust to label error, making them an attractive choice under label uncertainty. Further, we find that the superiority of single-domain fine-tuning comes under question when training data size is scaled, contradicting previous findings.

View on arXiv PDF

Similar