CLNov 30, 2017

Neural Response Generation with Dynamic Vocabularies

arXiv:1711.11191v175 citations
Originality Highly original
AI Analysis

This work provides a more efficient and higher-quality response generation method for open-domain chatbots, benefiting developers and users of conversational AI systems.

This paper addresses the problem of generating responses in open-domain chatbots, where existing methods use a fixed vocabulary, leading to generic responses and high decoding costs. The proposed Dynamic Vocabulary Sequence-to-Sequence (DVS2S) model allows each input to have its own small, dynamically allocated vocabulary, resulting in significantly improved response quality and a 40% reduction in decoding time compared to the most efficient baseline.

We study response generation for open domain conversation in chatbots. Existing methods assume that words in responses are generated from an identical vocabulary regardless of their inputs, which not only makes them vulnerable to generic patterns and irrelevant noise, but also causes a high cost in decoding. We propose a dynamic vocabulary sequence-to-sequence (DVS2S) model which allows each input to possess their own vocabulary in decoding. In training, vocabulary construction and response generation are jointly learned by maximizing a lower bound of the true objective with a Monte Carlo sampling method. In inference, the model dynamically allocates a small vocabulary for an input with the word prediction model, and conducts decoding only with the small vocabulary. Because of the dynamic vocabulary mechanism, DVS2S eludes many generic patterns and irrelevant words in generation, and enjoys efficient decoding at the same time. Experimental results on both automatic metrics and human annotations show that DVS2S can significantly outperform state-of-the-art methods in terms of response quality, but only requires 60% decoding time compared to the most efficient baseline.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes