CLJul 21, 2024

Training Zero-Shot Generalizable End-to-End Task-Oriented Dialog System Without Turn-level Dialog Annotations

arXiv:2407.15055v21.0h-index: 8

Originality Incremental advance

AI Analysis

This addresses the scalability and cost issues in developing task-oriented dialogue systems for applications requiring natural language interactions, though it is incremental as it builds on existing LLM methods.

The paper tackles the problem of training task-oriented dialogue systems without expensive turn-level annotations by using multi-task instruction fine-tuning with large language models, achieving generalization to unseen domains and outperforming state-of-the-art annotated models and ChatGPT.

Task-oriented dialogue (TOD) systems enable users to achieve their goals through natural language interactions. Traditionally, these systems have relied on turn-level manually annotated metadata, such as dialogue states and policy annotations, which are expensive, time-consuming, and often inconsistent or error-prone. This dependence limits the potential to leverage vast amounts of readily available conversational data for training TOD systems. Additionally, a critical challenge in TOD system design is determining when and how to access and integrate information from external sources. Current approaches typically expect this information to be provided alongside the dialogue context, rather than learning to identify and retrieve it autonomously. While pre-trained large language models (LLMs) have been used to develop TOD systems, their potential to train such systems without laborious annotations remains largely unexplored. This work employs multi-task instruction fine-tuning to create more efficient and scalable TOD systems that can effectively leverage natural language conversational data without manual annotations, while autonomously managing external information retrieval. Our extensive experimental evaluations, using three diverse TOD datasets and three LLMs of varying sizes, demonstrate that our approach can generalize to new, unseen domains. Notably, our approach outperforms both state-of-the-art models trained on annotated data and billion-scale parameter off-the-shelf ChatGPT models.

View on arXiv PDF

Similar