CLFeb 19, 2024

Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conversations

arXiv:2402.11975v234 citationsHas Code
AI Analysis

This addresses the problem of dynamic, real-world interactions for users and developers of conversational AI systems, representing a novel approach rather than an incremental improvement.

The study tackled the challenges of memory database management and accurate retrieval in long-term conversations by introducing COMEDY, a framework that uses a single language model for memory generation, compression, and response generation, resulting in more nuanced and human-like conversational experiences compared to traditional methods.

Existing retrieval-based methods have made significant strides in maintaining long-term conversations. However, these approaches face challenges in memory database management and accurate memory retrieval, hindering their efficacy in dynamic, real-world interactions. This study introduces a novel framework, COmpressive Memory-Enhanced Dialogue sYstems (COMEDY), which eschews traditional retrieval modules and memory databases. Instead, COMEDY adopts a "One-for-All" approach, utilizing a single language model to manage memory generation, compression, and response generation. Central to this framework is the concept of compressive memory, which intergrates session-specific summaries, user-bot dynamics, and past events into a concise memory format. To support COMEDY, we curated a large-scale Chinese instruction-tuning dataset, Dolphin, derived from real user-chatbot interactions. Comparative evaluations demonstrate COMEDY's superiority over traditional retrieval-based methods in producing more nuanced and human-like conversational experiences. Our codes are available at https://github.com/nuochenpku/COMEDY.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes