CLOct 11, 2022

Graph Neural Network Policies and Imitation Learning for Multi-Domain Task-Oriented Dialogues

arXiv:2210.05252v1582 citationsh-index: 20
Originality Incremental advance
AI Analysis

This work addresses the problem of handling complex, multi-domain dialogues for task-oriented systems, representing an incremental improvement in dialogue management.

The paper tackled the challenge of managing multi-domain task-oriented dialogues by developing structured policies using graph neural networks and imitation learning, achieving improved performance over standard policies.

Task-oriented dialogue systems are designed to achieve specific goals while conversing with humans. In practice, they may have to handle simultaneously several domains and tasks. The dialogue manager must therefore be able to take into account domain changes and plan over different domains/tasks in order to deal with multidomain dialogues. However, learning with reinforcement in such context becomes difficult because the state-action dimension is larger while the reward signal remains scarce. Our experimental results suggest that structured policies based on graph neural networks combined with different degrees of imitation learning can effectively handle multi-domain dialogues. The reported experiments underline the benefit of structured policies over standard policies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes