CL · Feb 8, 2021

Effects of Layer Freezing on Transferring a Speech Recognition System to Under-resourced Languages

arXiv:2102.04097v2671 citations
AI Analysis

This research provides insights into effective transfer learning strategies for automatic speech recognition in under-resourced languages, benefiting researchers and developers working on low-resource ASR.

This paper investigates how layer freezing affects the transfer of speech recognition models to under-resourced languages. The authors transfer a pre-trained DeepSpeech model to German and Swiss German datasets, compare this with training from scratch, and find that freezing even a single layer significantly improves results.

In this paper, we investigate the effect of layer freezing on the effectiveness of model transfer in the area of automatic speech recognition. We experiment with Mozilla's DeepSpeech architecture on German and Swiss German speech datasets and compare training from scratch with transferring a pre-trained model. We compare different layer freezing schemes and find that freezing even a single layer significantly improves results.
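To make the idea of layer freezing concrete, here is a minimal, framework-free sketch in Python. It is not the authors' code and does not use DeepSpeech. The `Layer` class, the three-layer toy model, its weights, the gradients, and the choice to freeze the first layers are all illustrative assumptions; the abstract does not say which layers were frozen. The sketch only shows the mechanism: frozen layers keep their pre-trained weights while the remaining layers are updated on the new language.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Layer:
    """Toy layer (hypothetical): a name, a flat weight list, a trainable flag."""
    name: str
    weights: List[float]
    trainable: bool = True


def freeze_first_layers(layers: List[Layer], num_frozen: int) -> None:
    """Freeze the first `num_frozen` layers; leave the rest trainable."""
    if not 0 <= num_frozen <= len(layers):
        raise ValueError("num_frozen must be between 0 and len(layers)")
    for index, layer in enumerate(layers):
        layer.trainable = index >= num_frozen


def sgd_step(layers: List[Layer], grads: Dict[str, List[float]], lr: float) -> None:
    """Apply one plain SGD update, skipping every frozen layer."""
    for layer in layers:
        if not layer.trainable:
            continue  # frozen: keep the pre-trained weights untouched
        layer.weights = [w - lr * g for w, g in zip(layer.weights, grads[layer.name])]


# Stand-in for weights loaded from a pre-trained model (values are made up).
model = [
    Layer("layer_1", [0.5, -0.2]),
    Layer("layer_2", [0.1, 0.3]),
    Layer("layer_3", [-0.4, 0.8]),
]

# Freeze only one layer, then take one update step with made-up gradients.
freeze_first_layers(model, num_frozen=1)
grads = {layer.name: [1.0, 1.0] for layer in model}
sgd_step(model, grads, lr=0.1)

for layer in model:
    print(layer.name, "trainable" if layer.trainable else "frozen", layer.weights)
```

Deep learning frameworks usually do this either by excluding the frozen layers' parameters from the optimizer or by marking them as non-trainable, which has the same effect as the `continue` in `sgd_step`.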

Code Implementations: 1 repo