LG CL SD ASNov 21, 2017

Multiple-Instance, Cascaded Classification for Keyword Spotting in Narrow-Band Audio

Ahmad AbdulKader, Kareem Nassar, Mohamed El-Geish, Daniel Galvez, Chetan Patil

arXiv:1711.08058v20.74 citations

Originality Incremental advance

AI Analysis

This work addresses keyword spotting for computationally-constrained devices in challenging audio environments, representing an incremental improvement.

The paper tackled keyword spotting in narrow-band audio under non-IID conditions by proposing a cascaded classifier model that incorporates DNNs, multiple-feature representations, and multiple-instance learning, achieving a false negative rate of 6% at an hourly false positive rate of 0.75.

We propose using cascaded classifiers for a keyword spotting (KWS) task on narrow-band (NB), 8kHz audio acquired in non-IID environments -- a more challenging task than most state-of-the-art KWS systems face. We present a model that incorporates Deep Neural Networks (DNNs), cascading, multiple-feature representations, and multiple-instance learning. The cascaded classifiers handle the task's class imbalance and reduce power consumption on computationally-constrained devices via early termination. The KWS system achieves a false negative rate of 6% at an hourly false positive rate of 0.75

View on arXiv PDF

Similar