CLJan 28, 2021

Attention Guided Dialogue State Tracking with Sparse Supervision

arXiv:2101.11958v14 citations
Originality Incremental advance
AI Analysis

This reduces data acquisition costs for DST in domains like customer service, though it is incremental as it builds on existing encoder-decoder models.

The paper tackles the problem of expensive turn-level annotations for Dialogue State Tracking (DST) by using sparse action logs from call centers as weak supervision, achieving competitive performance to fully supervised baselines on MultiWOZ and Schema Guided Dialogue datasets.

Existing approaches to Dialogue State Tracking (DST) rely on turn level dialogue state annotations, which are expensive to acquire in large scale. In call centers, for tasks like managing bookings or subscriptions, the user goal can be associated with actions (e.g.~API calls) issued by customer service agents. These action logs are available in large volumes and can be utilized for learning dialogue states. However, unlike turn-level annotations, such logged actions are only available sparsely across the dialogue, providing only a form of weak supervision for DST models. To efficiently learn DST with sparse labels, we extend a state-of-the-art encoder-decoder model. The model learns a slot-aware representation of dialogue history, which focuses on relevant turns to guide the decoder. We present results on two public multi-domain DST datasets (MultiWOZ and Schema Guided Dialogue) in both settings i.e. training with turn-level and with sparse supervision. The proposed approach improves over baseline in both settings. More importantly, our model trained with sparse supervision is competitive in performance to fully supervised baselines, while being more data and cost efficient.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes