CLAISep 27, 2022

Information Extraction and Human-Robot Dialogue towards Real-life Tasks: A Baseline Study with the MobileCS Dataset

arXiv:2209.13464v2292 citationsh-index: 16
AI Analysis

This work addresses the need for more realistic and noisy dialogue data in task-oriented dialogue systems, though it is incremental as it provides baselines rather than novel methods.

The paper tackles the problem of building task-oriented dialogue systems for real-life tasks by presenting baseline studies on information extraction and dialogue system construction using the MobileCS dataset, which consists of real-world customer service transcripts, achieving baseline results that serve as a starting point for future research.

Recently, there have merged a class of task-oriented dialogue (TOD) datasets collected through Wizard-of-Oz simulated games. However, the Wizard-of-Oz data are in fact simulated data and thus are fundamentally different from real-life conversations, which are more noisy and casual. Recently, the SereTOD challenge is organized and releases the MobileCS dataset, which consists of real-world dialog transcripts between real users and customer-service staffs from China Mobile. Based on the MobileCS dataset, the SereTOD challenge has two tasks, not only evaluating the construction of the dialogue system itself, but also examining information extraction from dialog transcripts, which is crucial for building the knowledge base for TOD. This paper mainly presents a baseline study of the two tasks with the MobileCS dataset. We introduce how the two baselines are constructed, the problems encountered, and the results. We anticipate that the baselines can facilitate exciting future research to build human-robot dialogue systems for real-life tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes