CL AISep 10, 2021

An Evaluation Dataset and Strategy for Building Robust Multi-turn Response Selection Model

Kijong Han, Seojin Lee, Wooin Lee, Joosung Lee, Dong-hun Lee

arXiv:2109.04834v130.7661 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This work addresses robustness issues in Korean conversational AI models, though it is incremental as it builds on existing datasets and methods.

The study identified weaknesses in Korean multi-turn response selection models, such as reliance on superficial patterns, and released an adversarial dataset to evaluate these issues, proposing a strategy to enhance model robustness.

Multi-turn response selection models have recently shown comparable performance to humans in several benchmark datasets. However, in the real environment, these models often have weaknesses, such as making incorrect predictions based heavily on superficial patterns without a comprehensive understanding of the context. For example, these models often give a high score to the wrong response candidate containing several keywords related to the context but using the inconsistent tense. In this study, we analyze the weaknesses of the open-domain Korean Multi-turn response selection models and publish an adversarial dataset to evaluate these weaknesses. We also suggest a strategy to build a robust model in this adversarial environment.

View on arXiv PDF Code

Similar