CL AIJul 20, 2016

Neural Contextual Conversation Learning with Labeled Question-Answering Pairs

Kun Xiong, Anqi Cui, Zefeng Zhang, Ming Li

arXiv:1607.05809v14.210 citations

Originality Incremental advance

AI Analysis

This addresses the issue of safe or generic responses in chatbots for users seeking more engaging conversations, though it is incremental as it builds on existing seq2seq models.

The paper tackled the problem of neural conversational models producing generic responses by proposing an end-to-end approach with memory mechanisms to incorporate context, resulting in a model with contextual attention that outperformed state-of-the-art seq2seq models on perplexity tests and generated diverse, robust responses.

Neural conversational models tend to produce generic or safe responses in different contexts, e.g., reply \textit{"Of course"} to narrative statements or \textit{"I don't know"} to questions. In this paper, we propose an end-to-end approach to avoid such problem in neural generative models. Additional memory mechanisms have been introduced to standard sequence-to-sequence (seq2seq) models, so that context can be considered while generating sentences. Three seq2seq models, which memorize a fix-sized contextual vector from hidden input, hidden input/output and a gated contextual attention structure respectively, have been trained and tested on a dataset of labeled question-answering pairs in Chinese. The model with contextual attention outperforms others including the state-of-the-art seq2seq models on perplexity test. The novel contextual model generates diverse and robust responses, and is able to carry out conversations on a wide range of topics appropriately.

View on arXiv PDF

Similar