CLJan 31, 2024

The Impact of Language Adapters in Cross-Lingual Transfer for NLU

arXiv:2402.00149v1107 citationsh-index: 4MOOMIN
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of efficient cross-lingual adaptation for NLU tasks, but it is incremental as it builds on existing modular deep learning approaches.

The paper investigates the role of language adapters in zero-shot cross-lingual transfer for natural language understanding, finding that their impact is inconsistent across tasks, languages, and models, with source-language adapters sometimes performing equivalently or better, and removal having only a weak negative effect.

Modular deep learning has been proposed for the efficient adaption of pre-trained models to new tasks, domains and languages. In particular, combining language adapters with task adapters has shown potential where no supervised data exists for a language. In this paper, we explore the role of language adapters in zero-shot cross-lingual transfer for natural language understanding (NLU) benchmarks. We study the effect of including a target-language adapter in detailed ablation studies with two multilingual models and three multilingual datasets. Our results show that the effect of target-language adapters is highly inconsistent across tasks, languages and models. Retaining the source-language adapter instead often leads to an equivalent, and sometimes to a better, performance. Removing the language adapter after training has only a weak negative effect, indicating that the language adapters do not have a strong impact on the predictions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes