CL AI MLMay 17, 2018

Neural User Simulation for Corpus-based Policy Optimisation for Spoken Dialogue Systems

Florian Kreyssig, Inigo Casanueva, Pawel Budzianowski, Milica Gasic

arXiv:1805.06966v14.786 citations

Originality Incremental advance

AI Analysis

This addresses the need for more effective offline training tools for task-oriented spoken dialogue systems, though it is incremental as it builds on existing simulator concepts.

The paper tackled the problem of limited diversity and semantic output in user simulators for training dialogue systems by introducing a Neural User Simulator (NUS) that learns from a corpus and generates natural language, resulting in outperforming the Agenda-Based User Simulator (ABUS) in policy training and real-user tests.

User Simulators are one of the major tools that enable offline training of task-oriented dialogue systems. For this task the Agenda-Based User Simulator (ABUS) is often used. The ABUS is based on hand-crafted rules and its output is in semantic form. Issues arise from both properties such as limited diversity and the inability to interface a text-level belief tracker. This paper introduces the Neural User Simulator (NUS) whose behaviour is learned from a corpus and which generates natural language, hence needing a less labelled dataset than simulators generating a semantic output. In comparison to much of the past work on this topic, which evaluates user simulators on corpus-based metrics, we use the NUS to train the policy of a reinforcement learning based Spoken Dialogue System. The NUS is compared to the ABUS by evaluating the policies that were trained using the simulators. Cross-model evaluation is performed i.e. training on one simulator and testing on the other. Furthermore, the trained policies are tested on real users. In both evaluation tasks the NUS outperformed the ABUS.

View on arXiv PDF

Similar