CLSDASJul 16, 2021

A Comparison of Methods for OOV-word Recognition on a New Public Dataset

arXiv:2107.08091v16 citationsHas Code
AI Analysis

This work addresses a common problem in speech recognition systems, though it appears incremental as it builds on existing hybrid ASR and subword approaches.

The authors tackled the problem of recognizing out-of-vocabulary (OOV) words in automatic speech recognition by creating test sets from the CommonVoice dataset and evaluating subword models and WFST modifications. They reported very large improvements in OOV-word recognition and released the data and code.

A common problem for automatic speech recognition systems is how to recognize words that they did not see during training. Currently there is no established method of evaluating different techniques for tackling this problem. We propose using the CommonVoice dataset to create test sets for multiple languages which have a high out-of-vocabulary (OOV) ratio relative to a training set and release a new tool for calculating relevant performance metrics. We then evaluate, within the context of a hybrid ASR system, how much better subword models are at recognizing OOVs, and how much benefit one can get from incorporating OOV-word information into an existing system by modifying WFSTs. Additionally, we propose a new method for modifying a subword-based language model so as to better recognize OOV-words. We showcase very large improvements in OOV-word recognition and make both the data and code available.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes