CLMay 13, 2015

Feature selection using Fisher's ratio technique for automatic speech recognition

arXiv:1505.03239v116 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement for speech recognition systems, potentially reducing computational cost.

The paper tackled the problem of improving automatic speech recognition by using Fisher's ratio to select a subset of MFCC coefficients, achieving comparable classification accuracy with fewer features.

Automatic Speech Recognition involves mainly two steps; feature extraction and classification . Mel Frequency Cepstral Coefficient is used as one of the prominent feature extraction techniques in ASR. Usually, the set of all 12 MFCC coefficients is used as the feature vector in the classification step. But the question is whether the same or improved classification accuracy can be achieved by using a subset of 12 MFCC as feature vector. In this paper, Fisher's ratio technique is used for selecting a subset of 12 MFCC coefficients that contribute more in discriminating a pattern. The selected coefficients are used in classification with Hidden Markov Model algorithm. The classification accuracies that we get by using 12 coefficients and by using the selected coefficients are compared.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes