CLSep 30, 2022

Language-Family Adapters for Low-Resource Multilingual Neural Machine Translation

Microsoft
arXiv:2209.15236v3268 citationsh-index: 31
Originality Incremental advance
AI Analysis

This provides a more efficient method for low-resource machine translation, though it is incremental as it builds on existing adapter techniques.

The paper tackles the problem of negative interference in parameter-efficient multilingual neural machine translation by proposing language-family adapters, which outperform baselines with higher translation scores on average for 17 low-resource languages from English.

Large multilingual models trained with self-supervision achieve state-of-the-art results in a wide range of natural language processing tasks. Self-supervised pretrained models are often fine-tuned on parallel data from one or multiple language pairs for machine translation. Multilingual fine-tuning improves performance on low-resource languages but requires modifying the entire model and can be prohibitively expensive. Training a new adapter on each language pair or training a single adapter on all language pairs without updating the pretrained model has been proposed as a parameter-efficient alternative. However, the former does not permit any sharing between languages, while the latter shares parameters for all languages and is susceptible to negative interference. In this paper, we propose training language-family adapters on top of mBART-50 to facilitate cross-lingual transfer. Our approach outperforms related baselines, yielding higher translation scores on average when translating from English to 17 different low-resource languages. We also show that language-family adapters provide an effective method to translate to languages unseen during pretraining.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes