CLAIApr 3, 2022

Task2Dial: A Novel Task and Dataset for Commonsense enhanced Task-based Dialogue Grounded in Documents

arXiv:2204.01061v19 citationsh-index: 16
Originality Incremental advance
AI Analysis

This addresses the need for more realistic dialogue systems in AI by providing a dataset that reduces template-like utterances, though it is incremental as it builds on existing document-grounded dialogue tasks.

The paper tackles the problem of generating natural and varied task-based dialogues grounded in documents by introducing the Task2Dial dataset, which includes commonsense-enhanced interactions with an average of 18.15 turns and 19.79 tokens per turn, showing more lexical richness than existing datasets.

This paper proposes a novel task on commonsense-enhanced task-based dialogue grounded in documents and describes the Task2Dial dataset, a novel dataset of document-grounded task-based dialogues, where an Information Giver (IG) provides instructions (by consulting a document) to an Information Follower (IF), so that the latter can successfully complete the task. In this unique setting, the IF can ask clarification questions which may not be grounded in the underlying document and require commonsense knowledge to be answered. The Task2Dial dataset poses new challenges: (1) its human reference texts show more lexical richness and variation than other document-grounded dialogue datasets; (2) generating from this set requires paraphrasing as instructional responses might have been modified from the underlying document; (3) requires commonsense knowledge, since questions might not necessarily be grounded in the document; (4) generating requires planning based on context, as task steps need to be provided in order. The Task2Dial dataset contains dialogues with an average $18.15$ number of turns and 19.79 tokens per turn, as compared to 12.94 and 12 respectively in existing datasets. As such, learning from this dataset promises more natural, varied and less template-like system utterances.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes