CLAILGNov 8, 2019

Investigation of Error Simulation Techniques for Learning Dialog Policies for Conversational Error Recovery

arXiv:1911.03378v116 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of expensive data collection for conversational AI by enhancing error simulation techniques, though it is incremental as it builds on existing methods.

The paper reviewed and compared existing error simulation methods for training dialog policies in speech-based virtual assistants, improving a text-level method with confidence score prediction and out-of-vocabulary word mapping, and explored their impact on learning a clarification dialog policy for error recovery.

Training dialog policies for speech-based virtual assistants requires a plethora of conversational data. The data collection phase is often expensive and time consuming due to human involvement. To address this issue, a common solution is to build user simulators for data generation. For the successful deployment of the trained policies into real world domains, it is vital that the user simulator mimics realistic conditions. In particular, speech-based assistants are heavily affected by automatic speech recognition and language understanding errors, hence the user simulator should be able to simulate similar errors. In this paper, we review the existing error simulation methods that induce errors at audio, phoneme, text, or semantic level; and conduct detailed comparisons between the audio-level and text-level methods. In the process, we improve the existing text-level method by introducing confidence score prediction and out-of-vocabulary word mapping. We also explore the impact of audio-level and text-level methods on learning a simple clarification dialog policy to recover from errors to provide insight on future improvement for both approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes