CLAIJul 20, 2016

Neural Contextual Conversation Learning with Labeled Question-Answering Pairs

arXiv:1607.05809v110 citations
Originality Incremental advance
AI Analysis

This addresses the issue of safe or generic responses in chatbots for users seeking more engaging conversations, though it is incremental as it builds on existing seq2seq models.

The paper tackled the problem of neural conversational models producing generic responses by proposing an end-to-end approach with memory mechanisms to incorporate context, resulting in a model with contextual attention that outperformed state-of-the-art seq2seq models on perplexity tests and generated diverse, robust responses.

Neural conversational models tend to produce generic or safe responses in different contexts, e.g., reply \textit{"Of course"} to narrative statements or \textit{"I don't know"} to questions. In this paper, we propose an end-to-end approach to avoid such problem in neural generative models. Additional memory mechanisms have been introduced to standard sequence-to-sequence (seq2seq) models, so that context can be considered while generating sentences. Three seq2seq models, which memorize a fix-sized contextual vector from hidden input, hidden input/output and a gated contextual attention structure respectively, have been trained and tested on a dataset of labeled question-answering pairs in Chinese. The model with contextual attention outperforms others including the state-of-the-art seq2seq models on perplexity test. The novel contextual model generates diverse and robust responses, and is able to carry out conversations on a wide range of topics appropriately.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes