CLSep 29, 2017

The First Evaluation of Chinese Human-Computer Dialogue Technology

arXiv:1709.10217v228 citations
AI Analysis

It addresses the need for standardized benchmarks in Chinese dialogue systems, primarily for industry and researchers, but is incremental as it applies existing methods to new data.

This paper presents the first evaluation of Chinese human-computer dialogue technology, detailing an evaluation scheme with tasks like user intent classification and online testing, and publishes results to show current performance levels.

In this paper, we introduce the first evaluation of Chinese human-computer dialogue technology. We detail the evaluation scheme, tasks, metrics and how to collect and annotate the data for training, developing and test. The evaluation includes two tasks, namely user intent classification and online testing of task-oriented dialogue. To consider the different sources of the data for training and developing, the first task can also be divided into two sub tasks. Both the two tasks are coming from the real problems when using the applications developed by industry. The evaluation data is provided by the iFLYTEK Corporation. Meanwhile, in this paper, we publish the evaluation results to present the current performance of the participants in the two tasks of Chinese human-computer dialogue technology. Moreover, we analyze the existing problems of human-computer dialogue as well as the evaluation scheme itself.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes