ASCLJun 12, 2024

Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR

arXiv:2406.07842v13 citations
Originality Incremental advance
AI Analysis

It addresses the challenge of efficiently adding new languages to multilingual ASR systems, which is incremental as it builds on existing pre-trained models like Whisper.

This paper tackles the problem of integrating new languages into a pre-trained multilingual ASR system with limited data for existing languages, achieving minimized performance degradation for existing languages and enabling language-agnostic operation through a dual-pipeline with LoRA and decoder selection.

This paper addresses challenges in integrating new languages into a pre-trained multilingual automatic speech recognition (mASR) system, particularly in scenarios where training data for existing languages is limited or unavailable. The proposed method employs a dual-pipeline with low-rank adaptation (LoRA). It maintains two data flow pipelines-one for existing languages and another for new languages. The primary pipeline follows the standard flow through the pre-trained parameters of mASR, while the secondary pipeline additionally utilizes language-specific parameters represented by LoRA and a separate output decoder module. Importantly, the proposed approach minimizes the performance degradation of existing languages and enables a language-agnostic operation mode, facilitated by a decoder selection strategy. We validate the effectiveness of the proposed method by extending the pre-trained Whisper model to 19 new languages from the FLEURS dataset

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes