CLOct 11, 2015

A Diversity-Promoting Objective Function for Neural Conversation Models

arXiv:1510.03055v32721 citations
Originality Incremental advance
AI Analysis

This addresses the issue of dull and repetitive outputs in conversational AI, making interactions more engaging for users, though it is an incremental improvement over existing methods.

The authors tackled the problem of neural conversation models generating generic responses by proposing Maximum Mutual Information (MMI) as an alternative objective function, resulting in more diverse and appropriate responses with gains in BLEU scores and human evaluations.

Sequence-to-sequence neural network models for generation of conversational responses tend to generate safe, commonplace responses (e.g., "I don't know") regardless of the input. We suggest that the traditional objective function, i.e., the likelihood of output (response) given input (message) is unsuited to response generation tasks. Instead we propose using Maximum Mutual Information (MMI) as the objective function in neural models. Experimental results demonstrate that the proposed MMI models produce more diverse, interesting, and appropriate responses, yielding substantive gains in BLEU scores on two conversational datasets and in human evaluations.

Code Implementations14 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes