CLLGJul 2, 2022

A Multi-Task BERT Model for Schema-Guided Dialogue State Tracking

arXiv:2207.00828v18 citationsh-index: 43
Originality Incremental advance
AI Analysis

This work addresses the need for more efficient and robust dialogue state tracking in task-oriented systems, offering a practical improvement over existing methods.

The paper tackles the problem of improving computational efficiency and performance in schema-guided dialogue state tracking by proposing a single multi-task BERT model that jointly handles intent prediction, requested slot prediction, and slot filling, with an efficient encoding method. Evaluation on the SGD dataset shows it outperforms the baseline SGP-DST by a large margin and performs competitively with state-of-the-art while being significantly more efficient.

Task-oriented dialogue systems often employ a Dialogue State Tracker (DST) to successfully complete conversations. Recent state-of-the-art DST implementations rely on schemata of diverse services to improve model robustness and handle zero-shot generalization to new domains [1], however such methods [2, 3] typically require multiple large scale transformer models and long input sequences to perform well. We propose a single multi-task BERT-based model that jointly solves the three DST tasks of intent prediction, requested slot prediction and slot filling. Moreover, we propose an efficient and parsimonious encoding of the dialogue history and service schemata that is shown to further improve performance. Evaluation on the SGD dataset shows that our approach outperforms the baseline SGP-DST by a large margin and performs well compared to the state-of-the-art, while being significantly more computationally efficient. Extensive ablation studies are performed to examine the contributing factors to the success of our model.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes