CLApr 23, 2024

Unsupervised End-to-End Task-Oriented Dialogue with LLMs: The Power of the Noisy Channel

arXiv:2404.15219v225 citationsh-index: 5EMNLP
Originality Highly original
AI Analysis

This addresses the costly and error-prone annotation process for dialogue systems, offering an unsupervised solution that could reduce reliance on expert annotations.

The paper tackles the problem of building task-oriented dialogue systems without turn-level annotations by using only unlabeled dialogues and an API schema, achieving more than double the dialogue success rate of a GPT-3.5 baseline on the MultiWOZ benchmark.

Training task-oriented dialogue systems typically requires turn-level annotations for interacting with their APIs: e.g. a dialogue state and the system actions taken at each step. These annotations can be costly to produce, error-prone, and require both domain and annotation expertise. With advances in LLMs, we hypothesize that unlabeled data and a schema definition are sufficient for building a working task-oriented dialogue system, completely unsupervised. We consider a novel unsupervised setting of only (1) a well-defined API schema (2) a set of unlabeled dialogues between a user and agent. We propose an innovative approach using expectation-maximization (EM) that infers turn-level annotations as latent variables using a noisy channel model to build an end-to-end dialogue agent. Evaluating our approach on the MultiWOZ benchmark, our method more than doubles the dialogue success rate of a strong GPT-3.5 baseline.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes