CLSep 10, 2021

Zero-Shot Dialogue State Tracking via Cross-Task Transfer

Zhaojiang Lin, Bing Liu, Andrea Madotto, Seungwhan Moon, Paul Crook, Zhenpeng Zhou, Zhiguang Wang, Zhou Yu, Eunjoon Cho, Rajen Subba, Pascale Fung

arXiv:2109.04655v131.2672 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the problem of handling diverse task-oriented dialogue domains without in-domain data for researchers and practitioners in conversational AI, representing an incremental advance in transfer learning methods.

The paper tackles zero-shot dialogue state tracking by transferring knowledge from question answering corpora, achieving substantial improvements on MultiWoz and better generalization on unseen domains compared to fully trained baselines.

Zero-shot transfer learning for dialogue state tracking (DST) enables us to handle a variety of task-oriented dialogue domains without the expense of collecting in-domain data. In this work, we propose to transfer the \textit{cross-task} knowledge from general question answering (QA) corpora for the zero-shot DST task. Specifically, we propose TransferQA, a transferable generative QA model that seamlessly combines extractive QA and multi-choice QA via a text-to-text transformer framework, and tracks both categorical slots and non-categorical slots in DST. In addition, we introduce two effective ways to construct unanswerable questions, namely, negative question sampling and context truncation, which enable our model to handle "none" value slots in the zero-shot DST setting. The extensive experiments show that our approaches substantially improve the existing zero-shot and few-shot results on MultiWoz. Moreover, compared to the fully trained baseline on the Schema-Guided Dialogue dataset, our approach shows better generalization ability in unseen domains.

View on arXiv PDF Code

Similar