CLMay 17, 2016

Incorporating Loose-Structured Knowledge into Conversation Modeling via Recall-Gate LSTM

arXiv:1605.05110v266 citations
Originality Incremental advance
AI Analysis

This work addresses improving chat-bot dialog ability for automated systems, but it is incremental as it builds on existing LSTM methods with a novel gate.

The paper tackled conversation modeling by incorporating loose-structured domain knowledge into an LSTM via a Recall gate, resulting in promising performance on context-oriented response selection tasks across two datasets.

Modeling human conversations is the essence for building satisfying chat-bots with multi-turn dialog ability. Conversation modeling will notably benefit from domain knowledge since the relationships between sentences can be clarified due to semantic hints introduced by knowledge. In this paper, a deep neural network is proposed to incorporate background knowledge for conversation modeling. Through a specially designed Recall gate, domain knowledge can be transformed into the extra global memory of Long Short-Term Memory (LSTM), so as to enhance LSTM by cooperating with its local memory to capture the implicit semantic relevance between sentences within conversations. In addition, this paper introduces the loose structured domain knowledge base, which can be built with slight amount of manual work and easily adopted by the Recall gate. Our model is evaluated on the context-oriented response selecting task, and experimental results on both two datasets have shown that our approach is promising for modeling human conversations and building key components of automatic chatting systems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes