CLNEAug 17, 2013

Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition

arXiv:1308.3785v160 citations
Originality Synthesis-oriented
AI Analysis

This work addresses speech recognition for Bangla digits, which is an incremental application of existing methods to a new language domain.

The paper tackled isolated Bangla speech recognition for digits using a back-propagation neural network with MFCC features, achieving a recognition rate of about 96.332% for known speakers and 92% for unknown speakers.

This paper is concerned with the development of Back-propagation Neural Network for Bangla Speech Recognition. In this paper, ten bangla digits were recorded from ten speakers and have been recognized. The features of these speech digits were extracted by the method of Mel Frequency Cepstral Coefficient (MFCC) analysis. The mfcc features of five speakers were used to train the network with Back propagation algorithm. The mfcc features of ten bangla digit speeches, from 0 to 9, of another five speakers were used to test the system. All the methods and algorithms used in this research were implemented using the features of Turbo C and C++ languages. From our investigation it is seen that the developed system can successfully encode and analyze the mfcc features of the speech signal to recognition. The developed system achieved recognition rate about 96.332% for known speakers (i.e., speaker dependent) and 92% for unknown speakers (i.e., speaker independent).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes