CLMay 20, 2022

Robust Task-Oriented Dialogue Generation with Contrastive Pre-training and Adversarial Filtering

arXiv:2205.10363v1292 citationsh-index: 36
Originality Incremental advance
AI Analysis

This addresses robustness issues in task-oriented dialogue systems for NLP applications, though it is an incremental improvement on existing contrastive learning methods.

The paper investigated whether task-oriented dialogue datasets contain data artifacts that allow models to exploit spurious correlations, finding that state-of-the-art models performed similarly when trained only on frequent phrases versus full data. The authors proposed a contrastive learning framework with adversarial filtering to encourage learning generalizable patterns, reporting exceptional performance in cross-domain/dataset and adversarial generalization experiments.

Data artifacts incentivize machine learning models to learn non-transferable generalizations by taking advantage of shortcuts in the data, and there is growing evidence that data artifacts play a role for the strong results that deep learning models achieve in recent natural language processing benchmarks. In this paper, we focus on task-oriented dialogue and investigate whether popular datasets such as MultiWOZ contain such data artifacts. We found that by only keeping frequent phrases in the training examples, state-of-the-art models perform similarly compared to the variant trained with full data, suggesting they exploit these spurious correlations to solve the task. Motivated by this, we propose a contrastive learning based framework to encourage the model to ignore these cues and focus on learning generalisable patterns. We also experiment with adversarial filtering to remove "easy" training instances so that the model would focus on learning from the "harder" instances. We conduct a number of generalization experiments -- e.g., cross-domain/dataset and adversarial tests -- to assess the robustness of our approach and found that it works exceptionally well.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes