LG CLAug 13, 2015

Learning from Real Users: Rating Dialogue Success with Neural Networks for Reinforcement Learning in Spoken Dialogue Systems

Pei-Hao Su, David Vandyke, Milica Gasic, Dongho Kim, Nikola Mrksic, Tsung-Hsien Wen, Steve Young

arXiv:1508.03386v114.670 citations

Originality Incremental advance

AI Analysis

This work addresses a key bottleneck for deploying spoken dialogue systems in real-world settings where user tasks are unknown, though it is incremental as it builds on existing simulation-based training methods.

The paper tackles the problem of training spoken dialogue systems without prior knowledge of user tasks by introducing two neural network models that rate dialogue success based on turn-level features, achieving performance comparable to a baseline system that uses task knowledge.

To train a statistical spoken dialogue system (SDS) it is essential that an accurate method for measuring task success is available. To date training has relied on presenting a task to either simulated or paid users and inferring the dialogue's success by observing whether this presented task was achieved or not. Our aim however is to be able to learn from real users acting under their own volition, in which case it is non-trivial to rate the success as any prior knowledge of the task is simply unavailable. User feedback may be utilised but has been found to be inconsistent. Hence, here we present two neural network models that evaluate a sequence of turn-level features to rate the success of a dialogue. Importantly these models make no use of any prior knowledge of the user's task. The models are trained on dialogues generated by a simulated user and the best model is then used to train a policy on-line which is shown to perform at least as well as a baseline system using prior knowledge of the user's task. We note that the models should also be of interest for evaluating SDS and for monitoring a dialogue in rule-based SDS.

View on arXiv PDF

Similar