AIMar 19

D-Mem: A Dual-Process Memory System for LLM Agents

arXiv:2603.1863151.82 citationsh-index: 4

AI Analysis

This addresses the issue of lossy abstraction in retrieval-based memory frameworks for autonomous agents, offering a solution for improved contextual understanding in tasks like conversational memory, though it appears incremental as it builds on existing retrieval methods.

The paper tackles the problem of high-fidelity memory access for long-horizon reasoning in LLM agents by introducing D-Mem, a dual-process memory system that combines lightweight vector retrieval with a full deliberation module, achieving an F1 score of 53.5 on the LoCoMo benchmark with GPT-4o-mini and recovering 96.7% of the full deliberation's performance at lower computational cost.

Driven by the development of persistent, self-adapting autonomous agents, equipping these systems with high-fidelity memory access for long-horizon reasoning has emerged as a critical requirement. However, prevalent retrieval-based memory frameworks often follow an incremental processing paradigm that continuously extracts and updates conversational memories into vector databases, relying on semantic retrieval when queried. While this approach is fast, it inherently relies on lossy abstraction, frequently missing contextually critical information and struggling to resolve queries that rely on fine-grained contextual understanding. To address this, we introduce D-Mem, a dual-process memory system. It retains lightweight vector retrieval for routine queries while establishing an exhaustive Full Deliberation module as a high-fidelity fallback. To achieve cognitive economy without sacrificing accuracy, D-Mem employs a Multi-dimensional Quality Gating policy to dynamically bridge these two processes. Experiments on the LoCoMo and RealTalk benchmarks using GPT-4o-mini and Qwen3-235B-Instruct demonstrate the efficacy of our approach. Notably, our Multi-dimensional Quality Gating policy achieves an F1 score of 53.5 on LoCoMo with GPT-4o-mini. This outperforms our static retrieval baseline, Mem0$^\ast$ (51.2), and recovers 96.7\% of the Full Deliberation's performance (55.3), while incurring significantly lower computational costs.

View on arXiv PDF

Similar