CLJun 20, 2017

Effective Spoken Language Labeling with Deep Recurrent Neural Networks

arXiv:1706.06896v11 citations
Originality Incremental advance
AI Analysis

This work improves SLU for spoken dialog systems, though it is incremental as it builds on existing RNN ideas.

The paper tackles spoken language understanding (SLU) as a labeling problem by proposing novel deep recurrent neural network (RNN) architectures, achieving state-of-the-art results on the ATIS and MEDIA corpora.

Understanding spoken language is a highly complex problem, which can be decomposed into several simpler tasks. In this paper, we focus on Spoken Language Understanding (SLU), the module of spoken dialog systems responsible for extracting a semantic interpretation from the user utterance. The task is treated as a labeling problem. In the past, SLU has been performed with a wide variety of probabilistic models. The rise of neural networks, in the last couple of years, has opened new interesting research directions in this domain. Recurrent Neural Networks (RNNs) in particular are able not only to represent several pieces of information as embeddings but also, thanks to their recurrent architecture, to encode as embeddings relatively long contexts. Such long contexts are in general out of reach for models previously used for SLU. In this paper we propose novel RNNs architectures for SLU which outperform previous ones. Starting from a published idea as base block, we design new deep RNNs achieving state-of-the-art results on two widely used corpora for SLU: ATIS (Air Traveling Information System), in English, and MEDIA (Hotel information and reservation in France), in French.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes