CL AI HC
Dec 28, 2017

Toward Continual Learning for Conversational Agents

arXiv:1712.09943v3
41 citations
Originality: Incremental advance
AI Analysis

This work addresses data efficiency for conversational AI systems, particularly in customer support, but it is incremental: it adapts existing continual learning methods to a new domain.

The paper tackles the problem of data inefficiency in neural conversation models by applying continual learning to enable conversational agents to accumulate skills across tasks, achieving a 30% reduction in required training data while maintaining performance.

While end-to-end neural conversation models have led to promising advances in reducing hand-crafted features and errors induced by the traditional complex system architecture, they typically require an enormous amount of data due to the lack of modularity. Previous studies adopted a hybrid approach with knowledge-based components either to abstract out domain-specific information or to augment data to cover more diverse patterns. On the contrary, we propose to directly address the problem using recent developments in the space of continual learning for neural models. Specifically, we adopt a domain-independent neural conversational model and introduce a novel neural continual learning algorithm that allows a conversational agent to accumulate skills across different tasks in a data-efficient way. To the best of our knowledge, this is the first work that applies continual learning to conversation systems. We verified the efficacy of our method through a conversational skill transfer from either synthetic dialogs or human-human dialogs to human-computer conversations in a customer support domain.

