CLLGSDASMay 11, 2022

Improved Meta Learning for Low Resource Speech Recognition

arXiv:2205.06182v128 citationsh-index: 11
Originality Incremental advance
AI Analysis

This work addresses training challenges in meta learning for low-resource ASR, offering an incremental improvement over MAML.

The paper tackles training instabilities and slower convergence in model-agnostic meta learning (MAML) for low-resource speech recognition by introducing a multi-step loss (MSL) framework, resulting in improved accuracy and stability across multiple languages.

We propose a new meta learning based framework for low resource speech recognition that improves the previous model agnostic meta learning (MAML) approach. The MAML is a simple yet powerful meta learning approach. However, the MAML presents some core deficiencies such as training instabilities and slower convergence speed. To address these issues, we adopt multi-step loss (MSL). The MSL aims to calculate losses at every step of the inner loop of MAML and then combines them with a weighted importance vector. The importance vector ensures that the loss at the last step has more importance than the previous steps. Our empirical evaluation shows that MSL significantly improves the stability of the training procedure and it thus also improves the accuracy of the overall system. Our proposed system outperforms MAML based low resource ASR system on various languages in terms of character error rates and stable training behavior.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes