CLAug 28, 2021

Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems

arXiv:2108.12589v1670 citations
Originality Incremental advance
AI Analysis

This work addresses the high labeling cost in task-oriented dialog systems, offering an incremental improvement for few-shot learning scenarios.

The paper tackles the problem of training task-oriented dialog systems with limited labeled data by proposing a self-training approach that uses unlabeled dialog data to improve pre-trained models, achieving consistent performance gains across four downstream tasks.

As the labeling cost for different modules in task-oriented dialog (ToD) systems is expensive, a major challenge is to train different modules with the least amount of labeled data. Recently, large-scale pre-trained language models, have shown promising results for few-shot learning in ToD. In this paper, we devise a self-training approach to utilize the abundant unlabeled dialog data to further improve state-of-the-art pre-trained models in few-shot learning scenarios for ToD systems. Specifically, we propose a self-training approach that iteratively labels the most confident unlabeled data to train a stronger Student model. Moreover, a new text augmentation technique (GradAug) is proposed to better train the Student by replacing non-crucial tokens using a masked language model. We conduct extensive experiments and present analyses on four downstream tasks in ToD, including intent classification, dialog state tracking, dialog act prediction, and response selection. Empirical results demonstrate that the proposed self-training approach consistently improves state-of-the-art pre-trained models (BERT, ToD-BERT) when only a small number of labeled data are available.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes