CLAIOct 8, 2019

Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking

arXiv:1910.03544v41070 citations
Originality Incremental advance
AI Analysis

This work addresses a core challenge in task-oriented dialog systems for improving accuracy and robustness in slot-value prediction, representing an incremental advancement over existing BERT-based methods.

The paper tackles the problem of slot-value prediction in multi-domain dialog state tracking by proposing a dual-strategy model that jointly handles categorical and non-categorical slots using a BERT-style reading comprehension model, achieving state-of-the-art performance in noisy settings and robust results across different dataset versions.

Dialog state tracking (DST) is a core component in task-oriented dialog systems. Existing approaches for DST mainly fall into one of two categories, namely, ontology-based and ontology-free methods. An ontology-based method selects a value from a candidate-value list for each target slot, while an ontology-free method extracts spans from dialog contexts. Recent work introduced a BERT-based model to strike a balance between the two methods by pre-defining categorical and non-categorical slots. However, it is not clear enough which slots are better handled by either of the two slot types, and the way to use the pre-trained model has not been well investigated. In this paper, we propose a simple yet effective dual-strategy model for DST, by adapting a single BERT-style reading comprehension model to jointly handle both the categorical and non-categorical slots. Our experiments on the MultiWOZ datasets show that our method significantly outperforms the BERT-based counterpart, finding that the key is a deep interaction between the domain-slot and context information. When evaluated on noisy (MultiWOZ 2.0) and cleaner (MultiWOZ 2.1) settings, our method performs competitively and robustly across the two different settings. Our method sets the new state of the art in the noisy setting, while performing more robustly than the best model in the cleaner setting. We also conduct a comprehensive error analysis on the dataset, including the effects of the dual strategy for each slot, to facilitate future research.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes