CLOct 20, 2022

Can Domains Be Transferred Across Languages in Multi-Domain Multilingual Neural Machine Translation?

Thuy-Trang Vu, Shahram Khadivi, Xuanli He, Dinh Phung, Gholamreza Haffari

arXiv:2210.11628v123.9292 citationsh-index: 44

Originality Incremental advance

AI Analysis

This work addresses the challenge of incomplete data in multi-domain multilingual translation, which is incremental as it builds on existing multilingual and multi-domain NMT methods.

The paper tackles the problem of transferring domain information across languages in multi-domain multilingual neural machine translation, particularly when in-domain data is missing for some language pairs, and shows that their approach boosts zero-shot translation performance by up to +10 BLEU gains and aids generalization to missing domains.

Previous works mostly focus on either multilingual or multi-domain aspects of neural machine translation (NMT). This paper investigates whether the domain information can be transferred across languages on the composition of multi-domain and multilingual NMT, particularly for the incomplete data condition where in-domain bitext is missing for some language pairs. Our results in the curated leave-one-domain-out experiments show that multi-domain multilingual (MDML) NMT can boost zero-shot translation performance up to +10 gains on BLEU, as well as aid the generalisation of multi-domain NMT to the missing domain. We also explore strategies for effective integration of multilingual and multi-domain NMT, including language and domain tag combination and auxiliary task training. We find that learning domain-aware representations and adding target-language tags to the encoder leads to effective MDML-NMT.

View on arXiv PDF

Similar