CL LGDec 16, 2021

Prompt Tuning GPT-2 language model for parameter-efficient domain adaptation of ASR systems

Saket Dingliwal, Ashish Shenoy, Sravan Bodapati, Ankur Gandhe, Ravi Teja Gadde, Katrin Kirchhoff

arXiv:2112.08718v30.54 citations

Originality Incremental advance

AI Analysis

This enables efficient domain adaptation for ASR systems in industrial applications, though it is incremental as it builds on existing prompt tuning methods.

The paper tackles the problem of adapting ASR systems to new domains with minimal overhead by introducing domain-prompts, which train a small set of domain embeddings to prime a Transformer-based LM, achieving 7-13% WER reduction using only 0.02% of parameters updated.

Automatic Speech Recognition (ASR) systems have found their use in numerous industrial applications in very diverse domains creating a need to adapt to new domains with small memory and deployment overhead. In this work, we introduce domain-prompts, a methodology that involves training a small number of domain embedding parameters to prime a Transformer-based Language Model (LM) to a particular domain. Using this domain-adapted LM for rescoring ASR hypotheses can achieve 7-13% WER reduction for a new domain with just 1000 unlabeled textual domain-specific sentences. This improvement is comparable or even better than fully fine-tuned models even though just 0.02% of the parameters of the base LM are updated. Additionally, our method is deployment-friendly as the learnt domain embeddings are prefixed to the input to the model rather than changing the base model architecture. Therefore, our method is an ideal choice for on-the-fly adaptation of LMs used in ASR systems to progressively scale it to new domains.

View on arXiv PDF

Similar