CLAINov 22, 2021

DLVGen: A Dual Latent Variable Approach to Personalized Dialogue Generation

arXiv:2111.11363v17 citations
Originality Highly original
AI Analysis

This addresses a practical limitation in real-world dialogue systems where persona data is scarce, offering a novel approach for more adaptable conversational AI.

The paper tackles the problem of generating personalized dialogue without needing persona information or corresponding dialogue examples, proposing DLVGen which models latent distributions for responses and persona, and achieves diverse and accurate persona incorporation in responses.

The generation of personalized dialogue is vital to natural and human-like conversation. Typically, personalized dialogue generation models involve conditioning the generated response on the dialogue history and a representation of the persona/personality of the interlocutor. As it is impractical to obtain the persona/personality representations for every interlocutor, recent works have explored the possibility of generating personalized dialogue by finetuning the model with dialogue examples corresponding to a given persona instead. However, in real-world implementations, a sufficient number of corresponding dialogue examples are also rarely available. Hence, in this paper, we propose a Dual Latent Variable Generator (DLVGen) capable of generating personalized dialogue in the absence of any persona/personality information or any corresponding dialogue examples. Unlike prior work, DLVGen models the latent distribution over potential responses as well as the latent distribution over the agent's potential persona. During inference, latent variables are sampled from both distributions and fed into the decoder. Empirical results show that DLVGen is capable of generating diverse responses which accurately incorporate the agent's persona.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes