CL, AI, Aug 25, 2023

ChatGPT as Data Augmentation for Compositional Generalization: A Case Study in Open Intent Detection

arXiv:2308.13517v1, 193 citations, h-index: 45, Has Code
Originality: Synthesis-oriented
AI Analysis

This paper addresses the problem of handling new language combinations in open intent detection for natural language understanding, though it appears incremental because it applies an existing tool (ChatGPT) to a new domain.

The paper tackles compositional generalization in open intent detection by using ChatGPT for data augmentation, demonstrating that this approach improves model performance and outperforms existing techniques on multiple benchmarks.

Open intent detection, a crucial aspect of natural language understanding, involves the identification of previously unseen intents in user-generated text. Despite the progress made in this field, challenges persist in handling new combinations of language components, which is essential for compositional generalization. In this paper, we present a case study exploring the use of ChatGPT as a data augmentation technique to enhance compositional generalization in open intent detection tasks. We begin by discussing the limitations of existing benchmarks in evaluating this problem, highlighting the need for constructing datasets for addressing compositional generalization in open intent detection tasks. By incorporating synthetic data generated by ChatGPT into the training process, we demonstrate that our approach can effectively improve model performance. Rigorous evaluation of multiple benchmarks reveals that our method outperforms existing techniques and significantly enhances open intent detection capabilities. Our findings underscore the potential of large language models like ChatGPT for data augmentation in natural language understanding tasks.
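The augmentation step described in the abstract can be pictured with a minimal sketch. It is not the authors' implementation: the abstract does not specify prompts, filtering, or mixing ratios, so everything below is an assumption. `augment_training_set` and `toy_generator` are hypothetical names, and `toy_generator` is a template-based stand-in for a call to a language model such as ChatGPT. The sketch only shows the general shape: generate variants of each labelled utterance, keep the label, drop duplicates, and merge the result with the original training data.

```python
from typing import Callable, Iterable

Example = tuple[str, str]  # (utterance, intent label)


def augment_training_set(
    examples: Iterable[Example],
    generate_variants: Callable[[str, str, int], list[str]],
    n_per_example: int = 2,
) -> list[Example]:
    """Return the original examples followed by deduplicated synthetic variants.

    `generate_variants` is a hypothetical hook: given an utterance, its intent
    label, and a count, it returns reworded utterances for the same intent.
    """
    augmented: list[Example] = []
    seen: set[tuple[str, str]] = set()

    def add(utterance: str, intent: str) -> None:
        # Deduplicate on a case-insensitive, whitespace-trimmed key.
        cleaned = utterance.strip()
        key = (cleaned.lower(), intent)
        if cleaned and key not in seen:
            seen.add(key)
            augmented.append((cleaned, intent))

    originals = list(examples)
    # Original data goes first so it is never displaced by a synthetic copy.
    for utterance, intent in originals:
        add(utterance, intent)
    for utterance, intent in originals:
        for variant in generate_variants(utterance, intent, n_per_example):
            add(variant, intent)
    return augmented


def toy_generator(utterance: str, intent: str, n: int) -> list[str]:
    # Placeholder for a language-model call; it only applies fixed templates.
    templates = ["please {u}", "could you {u}", "i need to {u}"]
    return [t.format(u=utterance) for t in templates[:n]]


train = [("book a flight", "book_flight"), ("check my balance", "check_balance")]
augmented = augment_training_set(train, toy_generator, n_per_example=2)
```

In practice the generator would wrap a model call, and the synthetic examples would typically be checked for label consistency before training; both steps are left out here because the abstract gives no detail on them.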

Code Implementations: 1 repo