CLOct 11, 2022

Graph Neural Network Policies and Imitation Learning for Multi-Domain Task-Oriented Dialogues

Thibault Cordier, Tanguy Urvoy, Fabrice Lefèvre, Lina M. Rojas-Barahona

arXiv:2210.05252v130.9582 citationsh-index: 20

Originality Incremental advance

AI Analysis

This work addresses the problem of handling complex, multi-domain dialogues for task-oriented systems, representing an incremental improvement in dialogue management.

The paper tackled the challenge of managing multi-domain task-oriented dialogues by developing structured policies using graph neural networks and imitation learning, achieving improved performance over standard policies.

Task-oriented dialogue systems are designed to achieve specific goals while conversing with humans. In practice, they may have to handle simultaneously several domains and tasks. The dialogue manager must therefore be able to take into account domain changes and plan over different domains/tasks in order to deal with multidomain dialogues. However, learning with reinforcement in such context becomes difficult because the state-action dimension is larger while the reward signal remains scarce. Our experimental results suggest that structured policies based on graph neural networks combined with different degrees of imitation learning can effectively handle multi-domain dialogues. The reported experiments underline the benefit of structured policies over standard policies.

View on arXiv PDF

Similar