ASSDSep 17, 2020

Utterance-level Intent Recognition from Keywords

arXiv:2009.08064v1
AI Analysis

This enables low-power, always-on intent detection for hardware applications, though it is incremental as it builds on existing keyword-based methods with new feature fusion.

The paper tackles wake-on-intent classification for resource-constrained platforms by using keyword sequences and multiple input features, achieving noise-robust intent recognition across domains like in-car communications.

This paper focuses on wake on intent (WOI) techniques for platforms with limited compute and memory. Our approach of utterance-level intent classification is based on a sequence of keywords in the utterance instead of a single fixed key phrase. The keyword sequence is transformed into four types of input features, namely acoustics, phones, word2vec and speech2vec for individual intent learning and then fused decision making. If a wake intent is detected, it will trigger the power-costly ASR afterwards. The system is trained and tested on a newly collected internal dataset in Intel called AMIE, which will be reported in this paper for the first time. It is demonstrated that our novel technique with the representation of the key-phrases successfully achieved a noise robust intent classification in different domains including in-car human-machine communications. The wake on intent system will be low-power and low-complexity, which makes it suitable for always on operations in real life hardware-based applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes