CLJun 10, 2019

Automated Curriculum Learning for Turn-level Spoken Language Understanding with Weak Supervision

arXiv:1906.04291v11 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of reducing annotation effort and improving scalability for spoken language understanding systems in commercial applications, though it is incremental as it builds on existing weak supervision and curriculum learning methods.

The paper tackles the problem of turn-level spoken language understanding for tasks like voice ordering by proposing an end-to-end statistical model with weak supervision, which performs comparably to a deployed pipelined system while saving development labor by over an order of magnitude and improving test accuracy by 7.8% with RBSMA and an additional 5% with automated curriculum learning.

We propose a learning approach for turn-level spoken language understanding, which facilitates a user to speak one or more utterances compositionally in a turn for completing a task (e.g., voice ordering). A typical pipelined approach for these understanding tasks requires non-trivial annotation effort for developing its multiple components. Also, the pipeline is difficult to port to a new domain or scale up. To address these problems, we propose an end-to-end statistical model with weak supervision. We employ randomized beam search with memory augmentation (RBSMA) to solve complicated problems for which long promising trajectories are usually difficult to explore. Furthermore, considering the diversity of problem complexity, we explore automated curriculum learning (CL) for weak supervision to accelerate exploration and learning. We evaluate the proposed approach on real-world user logs of a commercial voice ordering system. Results demonstrate that when trained on a small number of end-to-end annotated sessions collected with low cost, our model performs comparably to the deployed pipelined system, saving the development labor over an order of magnitude. The RBSMA algorithm improves the test set accuracy by 7.8% relative compared to the standard beam search. Automated CL leads to better generalization and further improves the test set accuracy by 5% relative.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes