CLJan 31, 2024

Local and Global Contexts for Conversation

arXiv:2401.17588v1103 citationsh-index: 1Findings
Originality Incremental advance
AI Analysis

This addresses the challenge of context understanding in open-domain conversation, which is incremental as it builds on existing transformer-based methods.

The paper tackles the problem of capturing both local and global contexts in multi-turn dialogue for generating responses, and introduces LGCM, a hierarchical transformer model that outperforms existing conversation models on automatic metrics with significant margins.

The context in conversation is the dialog history crucial for multi-turn dialogue. Learning from the relevant contexts in dialog history for grounded conversation is a challenging problem. Local context is the most neighbor and more sensitive to the subsequent response, and global context is relevant to a whole conversation far beyond neighboring utterances. Currently, pretrained transformer models for conversation challenge capturing the correlation and connection between local and global contexts. We introduce a local and global conversation model (LGCM) for general-purpose conversation in open domain. It is a local-global hierarchical transformer model that excels at accurately discerning and assimilating the relevant contexts necessary for generating responses. It employs a local encoder to grasp the local context at the level of individual utterances and a global encoder to understand the broader context at the dialogue level. The seamless fusion of these locally and globally contextualized encodings ensures a comprehensive comprehension of the conversation. Experiments on popular datasets show that LGCM outperforms the existing conversation models on the performance of automatic metrics with significant margins.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes