CLMay 19, 2022

Self-training with Two-phase Self-augmentation for Few-shot Dialogue Generation

arXiv:2205.09661v2290 citationsh-index: 15
AI Analysis

This work addresses the high annotation cost for few-shot dialogue generation, offering an incremental improvement in self-training techniques for domain-specific applications.

The paper tackles the problem of limited training examples for response generation in task-oriented dialogue systems by proposing a two-phase self-augmentation method to generate high-quality pseudo-labeled data, resulting in improved performance over existing self-training methods on benchmark datasets like FewShotWOZ and FewShotSGD.

In task-oriented dialogue systems, response generation from meaning representations (MRs) often suffers from limited training examples, due to the high cost of annotating MR-to-Text pairs. Previous works on self-training leverage fine-tuned conversational models to automatically generate pseudo-labeled MR-to-Text pairs for further fine-tuning. However, some self-augmented data may be noisy or uninformative for the model to learn from. In this work, we propose a two-phase self-augmentation procedure to generate high-quality pseudo-labeled MR-to-Text pairs: the first phase selects the most informative MRs based on model's prediction uncertainty; with the selected MRs, the second phase generates accurate responses by aggregating multiple perturbed latent representations from each MR. Empirical experiments on two benchmark datasets, FewShotWOZ and FewShotSGD, show that our method generally outperforms existing self-training methods on both automatic and human evaluations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes