CLAISep 10, 2021

An Evaluation Dataset and Strategy for Building Robust Multi-turn Response Selection Model

arXiv:2109.04834v1661 citations
Originality Synthesis-oriented
AI Analysis

This work addresses robustness issues in Korean conversational AI models, though it is incremental as it builds on existing datasets and methods.

The study identified weaknesses in Korean multi-turn response selection models, such as reliance on superficial patterns, and released an adversarial dataset to evaluate these issues, proposing a strategy to enhance model robustness.

Multi-turn response selection models have recently shown comparable performance to humans in several benchmark datasets. However, in the real environment, these models often have weaknesses, such as making incorrect predictions based heavily on superficial patterns without a comprehensive understanding of the context. For example, these models often give a high score to the wrong response candidate containing several keywords related to the context but using the inconsistent tense. In this study, we analyze the weaknesses of the open-domain Korean Multi-turn response selection models and publish an adversarial dataset to evaluate these weaknesses. We also suggest a strategy to build a robust model in this adversarial environment.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes