CLAIApr 16, 2020

Paraphrase Augmented Task-Oriented Dialog Generation

arXiv:2004.07462v21025 citations
AI Analysis

This work addresses data scarcity in task-oriented dialog generation, offering a practical solution for real-world applications, though it is incremental as it builds on existing models.

The paper tackles the problem of limited high-quality dialog data for neural generative models by proposing a paraphrase augmented response generation (PARG) framework that jointly trains paraphrase and response generation models, improving state-of-the-art models on CamRest676 and MultiWOZ datasets, especially in low-resource settings.

Neural generative models have achieved promising performance on dialog generation tasks if given a huge data set. However, the lack of high-quality dialog data and the expensive data annotation process greatly limit their application in real-world settings. We propose a paraphrase augmented response generation (PARG) framework that jointly trains a paraphrase model and a response generation model to improve the dialog generation performance. We also design a method to automatically construct paraphrase training data set based on dialog state and dialog act labels. PARG is applicable to various dialog generation models, such as TSCP (Lei et al., 2018) and DAMD (Zhang et al., 2019). Experimental results show that the proposed framework improves these state-of-the-art dialog models further on CamRest676 and MultiWOZ. PARG also significantly outperforms other data augmentation methods in dialog generation tasks, especially under low resource settings.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes