CLSep 12, 2019

Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset

Abhinav Rastogi, Xiaoxue Zang, Srinivas Sunkara, Raghav Gupta, Pranav Khaitan

arXiv:1909.05855v221.2721 citations

Originality Highly original

AI Analysis

This work addresses the problem of supporting numerous and overlapping services in virtual assistants for developers, though it is incremental as it builds upon existing task-oriented dialogue systems.

The authors tackled the challenge of building scalable multi-domain conversational agents by introducing the Schema-Guided Dialogue (SGD) dataset, which contains over 16k conversations across 16 domains, and proposed a schema-guided paradigm that enables zero-shot generalization to new APIs without additional training data.

Virtual assistants such as Google Assistant, Alexa and Siri provide a conversational interface to a large number of services and APIs spanning multiple domains. Such systems need to support an ever-increasing number of services with possibly overlapping functionality. Furthermore, some of these services have little to no training data available. Existing public datasets for task-oriented dialogue do not sufficiently capture these challenges since they cover few domains and assume a single static ontology per domain. In this work, we introduce the the Schema-Guided Dialogue (SGD) dataset, containing over 16k multi-domain conversations spanning 16 domains. Our dataset exceeds the existing task-oriented dialogue corpora in scale, while also highlighting the challenges associated with building large-scale virtual assistants. It provides a challenging testbed for a number of tasks including language understanding, slot filling, dialogue state tracking and response generation. Along the same lines, we present a schema-guided paradigm for task-oriented dialogue, in which predictions are made over a dynamic set of intents and slots, provided as input, using their natural language descriptions. This allows a single dialogue system to easily support a large number of services and facilitates simple integration of new services without requiring additional training data. Building upon the proposed paradigm, we release a model for dialogue state tracking capable of zero-shot generalization to new APIs, while remaining competitive in the regular setting.

View on arXiv PDF

Similar