MMAG: Mixed Memory-Augmented Generation for Large Language Models Applications
This addresses the need for more coherent and human-aligned language agents in conversational AI applications, though it appears incremental as it builds on existing memory concepts.
The paper tackles the problem of large language models struggling with relevance, personalization, and continuity in extended interactions by introducing the Mixed Memory-Augmented Generation (MMAG) pattern, a framework with five memory layers, and demonstrates its implementation in the Heero conversational agent, which improves engagement and retention.
Large Language Models (LLMs) excel at generating coherent text within a single prompt but fall short in sustaining relevance, personalization, and continuity across extended interactions. Human communication, however, relies on multiple forms of memory, from recalling past conversations to adapting to personal traits and situational context. This paper introduces the Mixed Memory-Augmented Generation (MMAG) pattern, a framework that organizes memory for LLM-based agents into five interacting layers: conversational, long-term user, episodic and event-linked, sensory and context-aware, and short-term working memory. Drawing inspiration from cognitive psychology, we map these layers to technical components and outline strategies for coordination, prioritization, and conflict resolution. We demonstrate the approach through its implementation in the Heero conversational agent, where encrypted long-term bios and conversational history already improve engagement and retention. We further discuss implementation concerns around storage, retrieval, privacy, and latency, and highlight open challenges. MMAG provides a foundation for building memory-rich language agents that are more coherent, proactive, and aligned with human needs.