CLAINov 28, 2018

Few-Shot Generalization Across Dialogue Tasks

arXiv:1811.11707v134 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of few-shot generalization for dialogue systems, enabling more efficient adaptation to new tasks, though it is incremental in nature.

The paper tackles the problem of extending dialogue managers to new domains by introducing the Recurrent Embedding Dialogue Policy (REDP), which embeds actions and states in a shared vector space and outperforms an LSTM baseline in handling uncooperative user behavior and transferring expertise across tasks.

Machine-learning based dialogue managers are able to learn complex behaviors in order to complete a task, but it is not straightforward to extend their capabilities to new domains. We investigate different policies' ability to handle uncooperative user behavior, and how well expertise in completing one task (such as restaurant reservations) can be reapplied when learning a new one (e.g. booking a hotel). We introduce the Recurrent Embedding Dialogue Policy (REDP), which embeds system actions and dialogue states in the same vector space. REDP contains a memory component and attention mechanism based on a modified Neural Turing Machine, and significantly outperforms a baseline LSTM classifier on this task. We also show that both our architecture and baseline solve the bAbI dialogue task, achieving 100% test accuracy.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes