CLMar 15

Task Arithmetic with Support Languages for Low-Resource ASR

arXiv:2601.0703870.0h-index: 2

AI Analysis

This addresses the challenge of developing ASR systems for low-resource languages with scant data, though it is incremental as it builds on existing task arithmetic methods.

The paper tackled the problem of low-resource automatic speech recognition by using task arithmetic to combine models trained on high-resource languages with those for low-resource languages, resulting in word error rate improvements of up to 10% across 23 languages.

The development of resource-constrained approaches to automatic speech recognition (ASR) is of great interest due to its broad applicability to many low-resource languages for which there is scant usable data. Existing approaches to many low-resource natural language processing tasks leverage additional data from higher-resource languages that are closely related to a target low-resource language. One increasingly popular approach uses task arithmetic to combine models trained on different tasks to create a model for a task where there is little to no training data. In this paper, we consider training on a particular language to be a task, and we generate task vectors by fine-tuning variants of the Whisper ASR system. For pairs of high- and low-resource languages, we merge task vectors via a linear combination which is optimized on the downstream word error rate on the low-resource target language's validation set. Across 23 low-resource target languages for which we evaluate this technique, we find consistent word error rate improvements of up to 10% compared to a baseline without our approach.

View on arXiv PDF

Similar