CLAIMay 22, 2023

Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization

arXiv:2305.12782v1230 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses a specific bottleneck in developing more reliable conversational agents, though it is incremental as it builds on existing pre-trained models.

The authors tackled the problem of order sensitivity in personalized dialogue generation, where varying the order of persona sentences causes significant performance fluctuations (29.4% on GPT2 and 83.2% on BART), and proposed a model-agnostic framework that improves robustness and consistency.

Generating persona consistent dialogue response is important for developing an intelligent conversational agent. Recent works typically fine-tune large-scale pre-trained models on this task by concatenating persona texts and dialogue history as a single input sequence to generate the target response. While simple and effective, our analysis shows that this popular practice is seriously affected by order sensitivity where different input orders of persona sentences significantly impact the quality and consistency of generated response, resulting in severe performance fluctuations (i.e., 29.4% on GPT2 and 83.2% on BART). To mitigate the order sensitivity problem, we propose a model-agnostic framework, ORder Insensitive Generation (ORIG), which enables dialogue models to learn robust representation under different persona orders and improve the consistency of response generation. Experiments on the Persona-Chat dataset justify the effectiveness and superiority of our method with two dominant pre-trained models (GPT2 and BART).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes