CLAIApr 11, 2018

Achieving Fluency and Coherency in Task-oriented Dialog

arXiv:1804.03799v11 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of incoherent and inaccurate dialog agents in real-world customer support scenarios, though it is incremental as it builds on existing methods.

The paper tackled the problem of generating both fluent natural language responses and accurate external actions in task-oriented dialog systems, showing that a hybrid model combining nearest neighbor and Seq2Seq methods achieved a 78% relative improvement in fluency scores and a 130% improvement in accuracy of external calls on customer support data.

We consider real world task-oriented dialog settings, where agents need to generate both fluent natural language responses and correct external actions like database queries and updates. We demonstrate that, when applied to customer support chat transcripts, Sequence to Sequence (Seq2Seq) models often generate short, incoherent and ungrammatical natural language responses that are dominated by words that occur with high frequency in the training data. These phenomena do not arise in synthetic datasets such as bAbI, where we show Seq2Seq models are nearly perfect. We develop techniques to learn embeddings that succinctly capture relevant information from the dialog history, and demonstrate that nearest neighbor based approaches in this learned neural embedding space generate more fluent responses. However, we see that these methods are not able to accurately predict when to execute an external action. We show how to combine nearest neighbor and Seq2Seq methods in a hybrid model, where nearest neighbor is used to generate fluent responses and Seq2Seq type models ensure dialog coherency and generate accurate external actions. We show that this approach is well suited for customer support scenarios, where agents' responses are typically script-driven, and correct external actions are critically important. The hybrid model on the customer support data achieves a 78% relative improvement in fluency scores, and a 130% improvement in accuracy of external calls.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes