CL LG NE ASOct 30, 2015

Prediction-Adaptation-Correction Recurrent Neural Networks for Low-Resource Language Speech Recognition

Yu Zhang, Ekapol Chuangsuwanich, James Glass, Dong Yu

arXiv:1510.08985v12.21 citations

Originality Incremental advance

AI Analysis

This addresses speech recognition for low-resource languages, which is an incremental improvement over existing neural network methods.

The paper tackles low-resource speech recognition by using prediction-adaptation-correction recurrent neural networks (PAC-RNNs), which outperform state-of-the-art DNNs and LSTMs on IARPA-Babel tasks, with transfer learning from similar languages further improving performance.

In this paper, we investigate the use of prediction-adaptation-correction recurrent neural networks (PAC-RNNs) for low-resource speech recognition. A PAC-RNN is comprised of a pair of neural networks in which a {\it correction} network uses auxiliary information given by a {\it prediction} network to help estimate the state probability. The information from the correction network is also used by the prediction network in a recurrent loop. Our model outperforms other state-of-the-art neural networks (DNNs, LSTMs) on IARPA-Babel tasks. Moreover, transfer learning from a language that is similar to the target language can help improve performance further.

View on arXiv PDF

Similar