CLApr 6, 2020

The World is Not Binary: Learning to Rank with Grayscale Data for Dialogue Response Selection

arXiv:2004.02421v41002 citations
AI Analysis

This work addresses the problem of improving retrieval-based conversation systems for developers and researchers by providing a more fine-grained ranking method, though it is incremental as it builds on existing models.

The paper tackles the sub-optimal binary classification approach in dialogue response selection by automatically constructing grayscale data using off-the-shelf models, resulting in significant and consistent performance improvements across three benchmark datasets and four state-of-the-art matching models.

Response selection plays a vital role in building retrieval-based conversation systems. Despite that response selection is naturally a learning-to-rank problem, most prior works take a point-wise view and train binary classifiers for this task: each response candidate is labeled either relevant (one) or irrelevant (zero). On the one hand, this formalization can be sub-optimal due to its ignorance of the diversity of response quality. On the other hand, annotating grayscale data for learning-to-rank can be prohibitively expensive and challenging. In this work, we show that grayscale data can be automatically constructed without human effort. Our method employs off-the-shelf response retrieval models and response generation models as automatic grayscale data generators. With the constructed grayscale data, we propose multi-level ranking objectives for training, which can (1) teach a matching model to capture more fine-grained context-response relevance difference and (2) reduce the train-test discrepancy in terms of distractor strength. Our method is simple, effective, and universal. Experiments on three benchmark datasets and four state-of-the-art matching models show that the proposed approach brings significant and consistent performance improvements.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes