CLAIOct 9, 2022

Controllable Dialogue Simulation with In-Context Learning

arXiv:2210.04185v4309 citationsh-index: 34Has Code
Originality Highly original
AI Analysis

This addresses the cost and time issues in building dialogue systems for researchers and developers, offering an incremental improvement over existing data augmentation methods.

The paper tackles the problem of expensive and time-consuming crowdsourcing for dialogue dataset creation by proposing Dialogic, a method using large language model in-context learning to automate generation, which in experiments on MultiWOZ achieved better performance with as few as 85 seed dialogues in low-resource settings.

Building dialogue systems requires a large corpus of annotated dialogues. Such datasets are usually created via crowdsourcing, which is expensive and time-consuming. In this paper, we propose \textsc{Dialogic}, a novel dialogue simulation method based on large language model in-context learning to automate dataset creation. Seeded with a few annotated dialogues, \textsc{Dialogic} automatically selects in-context examples for demonstration and prompts GPT-3 to generate new dialogues and annotations in a controllable way. Our method can rapidly expand a small set of dialogue data with minimum or zero \textit{human involvement} and \textit{parameter update} and is thus much more cost-efficient and time-saving than crowdsourcing. Experimental results on the MultiWOZ dataset demonstrate that training a model on the simulated dialogues leads to even better performance than using the same amount of human-generated dialogues under the challenging low-resource settings, with as few as 85 dialogues as a seed. When enough data is available, our method can still serve as an effective data augmentation method. Human evaluation results also show that our simulated dialogues have near-human fluency and annotation accuracy. The code and data are available at \textbf{\url{https://github.com/Leezekun/dialogic}}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes