CLMar 29, 2024

An Efficient Approach for Studying Cross-Lingual Transfer in Multilingual Language Models

arXiv:2403.20088v126 citationsh-index: 13Has CodeMRL
Originality Incremental advance
AI Analysis

This work addresses the need to optimize language selection for cross-lingual transfer in NLP applications, though it is incremental as it builds on existing adapter-based methods.

The paper tackles the problem of understanding cross-lingual transfer in multilingual language models by proposing an efficient method to study how transfer languages affect zero-shot performance on target languages, finding that some languages are beneficial or detrimental depending on the target and that unseen languages consistently benefit from transfer.

The capacity and effectiveness of pre-trained multilingual models (MLMs) for zero-shot cross-lingual transfer is well established. However, phenomena of positive or negative transfer, and the effect of language choice still need to be fully understood, especially in the complex setting of massively multilingual LMs. We propose an \textit{efficient} method to study transfer language influence in zero-shot performance on another target language. Unlike previous work, our approach disentangles downstream tasks from language, using dedicated adapter units. Our findings suggest that some languages do not largely affect others, while some languages, especially ones unseen during pre-training, can be extremely beneficial or detrimental for different target languages. We find that no transfer language is beneficial for all target languages. We do, curiously, observe languages previously unseen by MLMs consistently benefit from transfer from almost any language. We additionally use our modular approach to quantify negative interference efficiently and categorize languages accordingly. Furthermore, we provide a list of promising transfer-target language configurations that consistently lead to target language performance improvements. Code and data are publicly available: https://github.com/ffaisal93/neg_inf

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes