CL AISep 27, 2022

Information Extraction and Human-Robot Dialogue towards Real-life Tasks: A Baseline Study with the MobileCS Dataset

Hong Liu, Hao Peng, Zhijian Ou, Juanzi Li, Yi Huang, Junlan Feng

arXiv:2209.13464v224.0292 citationsh-index: 16Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses the need for more realistic and noisy dialogue data in task-oriented dialogue systems, though it is incremental as it provides baselines rather than novel methods.

The paper tackles the problem of building task-oriented dialogue systems for real-life tasks by presenting baseline studies on information extraction and dialogue system construction using the MobileCS dataset, which consists of real-world customer service transcripts, achieving baseline results that serve as a starting point for future research.

Recently, there have merged a class of task-oriented dialogue (TOD) datasets collected through Wizard-of-Oz simulated games. However, the Wizard-of-Oz data are in fact simulated data and thus are fundamentally different from real-life conversations, which are more noisy and casual. Recently, the SereTOD challenge is organized and releases the MobileCS dataset, which consists of real-world dialog transcripts between real users and customer-service staffs from China Mobile. Based on the MobileCS dataset, the SereTOD challenge has two tasks, not only evaluating the construction of the dialogue system itself, but also examining information extraction from dialog transcripts, which is crucial for building the knowledge base for TOD. This paper mainly presents a baseline study of the two tasks with the MobileCS dataset. We introduce how the two baselines are constructed, the problems encountered, and the results. We anticipate that the baselines can facilitate exciting future research to build human-robot dialogue systems for real-life tasks.

View on arXiv PDF Code

Similar