CLSep 15, 2020

Multi-Referenced Training for Dialogue Response Generation

arXiv:2009.07117v2697 citationsHas Code
AI Analysis

This work addresses the challenge of generating diverse responses in dialogue systems, which is incremental by building on existing variational models and data augmentation techniques.

The paper tackles the problem of learning one-to-many relations in open-domain dialogue response generation by addressing the gap between real-world and single-referenced data distributions, resulting in significant improvements in automated and human evaluations.

In open-domain dialogue response generation, a dialogue context can be continued with diverse responses, and the dialogue models should capture such one-to-many relations. In this work, we first analyze the training objective of dialogue models from the view of Kullback-Leibler divergence (KLD) and show that the gap between the real world probability distribution and the single-referenced data's probability distribution prevents the model from learning the one-to-many relations efficiently. Then we explore approaches to multi-referenced training in two aspects. Data-wise, we generate diverse pseudo references from a powerful pretrained model to build multi-referenced data that provides a better approximation of the real-world distribution. Model-wise, we propose to equip variational models with an expressive prior, named linear Gaussian model (LGM). Experimental results of automated evaluation and human evaluation show that the methods yield significant improvements over baselines. We will release our code and data in https://github.com/ZHAOTING/dialog-processing.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes