CLSep 14, 2021

Commonsense-Focused Dialogues for Response Generation: An Empirical Study

Pei Zhou, Karthik Gopalakrishnan, Behnam Hedayatnia, Seokhwan Kim, Jay Pujara, Xiang Ren, Yang Liu, Dilek Hakkani-Tur

arXiv:2109.06427v231.5702 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the problem of generating more commonsense-aware responses in dialogue systems, which is incremental as it builds on existing datasets and methods.

The paper tackled the lack of explicit commonsense focus in dialogue datasets by auto-extracting commonsensical dialogues from existing data and collecting a new dataset of 25K dialogues for social commonsense, finding that models trained on these datasets produce responses with more commonsense than baselines.

Smooth and effective communication requires the ability to perform latent or explicit commonsense inference. Prior commonsense reasoning benchmarks (such as SocialIQA and CommonsenseQA) mainly focus on the discriminative task of choosing the right answer from a set of candidates, and do not involve interactive language generation as in dialogue. Moreover, existing dialogue datasets do not explicitly focus on exhibiting commonsense as a facet. In this paper, we present an empirical study of commonsense in dialogue response generation. We first auto-extract commonsensical dialogues from existing dialogue datasets by leveraging ConceptNet, a commonsense knowledge graph. Furthermore, building on social contexts/situations in SocialIQA, we collect a new dialogue dataset with 25K dialogues aimed at exhibiting social commonsense in an interactive setting. We evaluate response generation models trained using these datasets and find that models trained on both extracted and our collected data produce responses that consistently exhibit more commonsense than baselines. Finally we propose an approach for automatic evaluation of commonsense that relies on features derived from ConceptNet and pre-trained language and dialog models, and show reasonable correlation with human evaluation of responses' commonsense quality. We are releasing a subset of our collected data, Commonsense-Dialogues, containing about 11K dialogs.

View on arXiv PDF Code

Similar