CLSep 10, 2021

Zero-Shot Dialogue State Tracking via Cross-Task Transfer

arXiv:2109.04655v1672 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of handling diverse task-oriented dialogue domains without in-domain data for researchers and practitioners in conversational AI, representing an incremental advance in transfer learning methods.

The paper tackles zero-shot dialogue state tracking by transferring knowledge from question answering corpora, achieving substantial improvements on MultiWoz and better generalization on unseen domains compared to fully trained baselines.

Zero-shot transfer learning for dialogue state tracking (DST) enables us to handle a variety of task-oriented dialogue domains without the expense of collecting in-domain data. In this work, we propose to transfer the \textit{cross-task} knowledge from general question answering (QA) corpora for the zero-shot DST task. Specifically, we propose TransferQA, a transferable generative QA model that seamlessly combines extractive QA and multi-choice QA via a text-to-text transformer framework, and tracks both categorical slots and non-categorical slots in DST. In addition, we introduce two effective ways to construct unanswerable questions, namely, negative question sampling and context truncation, which enable our model to handle "none" value slots in the zero-shot DST setting. The extensive experiments show that our approaches substantially improve the existing zero-shot and few-shot results on MultiWoz. Moreover, compared to the fully trained baseline on the Schema-Guided Dialogue dataset, our approach shows better generalization ability in unseen domains.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes