CL AI AS SPNov 3, 2023

Are cascade dialogue state tracking models speaking out of turn in spoken dialogues?

Lucas Druart, Léo Jacqmin, Benoît Favre, Lina Maria Rojas-Barahona, Valentin Vielzeuf

arXiv:2311.04922v10.5h-index: 19

Originality Synthesis-oriented

AI Analysis

This addresses error analysis for spoken dialogue systems, which is incremental as it builds on existing models by identifying specific error patterns.

The paper analyzed errors in state-of-the-art dialogue state tracking models in spoken dialogues, identifying that errors on non-categorical slots are critical for bridging the gap between spoken and chat-based systems, and explored solutions to improve transcriptions and help models correct these errors.

In Task-Oriented Dialogue (TOD) systems, correctly updating the system's understanding of the user's needs is key to a smooth interaction. Traditionally TOD systems are composed of several modules that interact with one another. While each of these components is the focus of active research communities, their behavior in interaction can be overlooked. This paper proposes a comprehensive analysis of the errors of state of the art systems in complex settings such as Dialogue State Tracking which highly depends on the dialogue context. Based on spoken MultiWoz, we identify that errors on non-categorical slots' values are essential to address in order to bridge the gap between spoken and chat-based dialogue systems. We explore potential solutions to improve transcriptions and help dialogue state tracking generative models correct such errors.

View on arXiv PDF

Similar