CL · Jun 2, 2021

Lightweight Adapter Tuning for Multilingual Speech Translation

arXiv:2106.01463v2725 citations
AI Analysis

This work addresses the need for parameter-efficient adaptation in multilingual speech translation, though it is incremental: it applies existing adapter methods to a new domain.

The paper tackles the problem of efficiently adapting multilingual speech translation models to specific language pairs. It does so with lightweight adapter modules, achieving results competitive with full fine-tuning while training significantly fewer parameters.

Adapter modules were recently introduced as an efficient alternative to fine-tuning in NLP. Adapter tuning consists of freezing the pretrained parameters of a model and injecting lightweight modules between layers, which adds only a small number of task-specific trainable parameters. While adapter tuning has been investigated for multilingual neural machine translation, this paper proposes a comprehensive analysis of adapters for multilingual speech translation (ST). Starting from different pre-trained models (a multilingual ST model trained on parallel data or a multilingual BART (mBART) trained on non-parallel multilingual data), we show that adapters can be used to: (a) efficiently specialize ST to specific language pairs with a low extra cost in terms of parameters, and (b) transfer from an automatic speech recognition (ASR) task and an mBART pre-trained model to a multilingual ST task. Experiments show that adapter tuning offers results competitive with full fine-tuning, while being much more parameter-efficient.
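To make the "freeze the backbone, inject small modules" idea concrete, here is a minimal pure-Python sketch of one common adapter design: a bottleneck with a down-projection, a ReLU, an up-projection, and a residual connection. The abstract does not specify the adapter architecture, so this structure, the function names (`adapter_forward`, `adapter_params`, `trainable_fraction`), and every size used below are illustrative assumptions, not the paper's configuration.

```python
# Sketch of a bottleneck adapter (assumed design, not taken from the paper).
# x: hidden vector of size d; w_down: b x d matrix; w_up: d x b matrix.

def matvec(w, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(wij * xj for wij, xj in zip(row, x)) for row in w]

def adapter_forward(x, w_down, b_down, w_up, b_up):
    """Return x + W_up * relu(W_down * x + b_down) + b_up (residual adapter)."""
    h = [max(0.0, v + b) for v, b in zip(matvec(w_down, x), b_down)]
    delta = [v + b for v, b in zip(matvec(w_up, h), b_up)]
    return [xi + di for xi, di in zip(x, delta)]

def adapter_params(d_model, bottleneck):
    """Trainable parameters in one adapter: two projections plus their biases."""
    down = d_model * bottleneck + bottleneck
    up = bottleneck * d_model + d_model
    return down + up

def trainable_fraction(backbone_params, n_layers, d_model, bottleneck):
    """Share of all parameters that is trained when the backbone stays frozen
    and one adapter is added per layer."""
    added = n_layers * adapter_params(d_model, bottleneck)
    return added / (backbone_params + added)

if __name__ == "__main__":
    # Hypothetical sizes chosen only for illustration.
    print(adapter_params(d_model=512, bottleneck=64))
    print(trainable_fraction(76_000_000, n_layers=12, d_model=512, bottleneck=64))
```

With a zero-initialized up-projection the adapter starts as the identity function, so inserting it does not disturb the frozen model before training begins. Under the hypothetical sizes above, the adapters account for roughly 1% of all parameters, which is the kind of saving the abstract refers to as parameter efficiency.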

Code Implementations: 2 repos