CLLGOct 16, 2023

Towards a Better Understanding of Variations in Zero-Shot Neural Machine Translation Performance

arXiv:2310.10385v2137 citationsh-index: 17Has Code
Originality Incremental advance
AI Analysis

This work addresses the problem of inconsistent zero-shot translation for multilingual NMT researchers, providing insights and a benchmark, but it is incremental as it builds on prior studies of zero-shot performance.

The paper investigates the high variations in zero-shot neural machine translation performance across 1,560 language directions, identifying target-side translation capability, vocabulary overlap, and linguistic properties as key factors, with target-side quality being the most influential.

Multilingual Neural Machine Translation (MNMT) facilitates knowledge sharing but often suffers from poor zero-shot (ZS) translation qualities. While prior work has explored the causes of overall low ZS performance, our work introduces a fresh perspective: the presence of high variations in ZS performance. This suggests that MNMT does not uniformly exhibit poor ZS capability; instead, certain translation directions yield reasonable results. Through systematic experimentation involving 1,560 language directions spanning 40 languages, we identify three key factors contributing to high variations in ZS NMT performance: 1) target side translation capability 2) vocabulary overlap 3) linguistic properties. Our findings highlight that the target side translation quality is the most influential factor, with vocabulary overlap consistently impacting ZS performance. Additionally, linguistic properties, such as language family and writing system, play a role, particularly with smaller models. Furthermore, we suggest that the off-target issue is a symptom of inadequate ZS performance, emphasizing that zero-shot translation challenges extend beyond addressing the off-target problem. We release the data and models serving as a benchmark to study zero-shot for future research at https://github.com/Smu-Tan/ZS-NMT-Variations

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes