AIJan 12

Learning How to Remember: A Meta-Cognitive Management Method for Structured and Transferable Agent Memory

arXiv:2601.07470v1h-index: 28
Originality Highly original
AI Analysis

This addresses the challenge of structured and transferable memory for LLM agents in long-horizon decision-making tasks, representing a novel approach rather than an incremental improvement.

The paper tackles the problem of limited generalization and negative transfer in LLM agent memory by proposing MCMA, a method that learns memory abstraction as a skill, resulting in substantial improvements in performance, out-of-distribution generalization, and cross-task transfer across multiple benchmarks.

Large language model (LLM) agents increasingly rely on accumulated memory to solve long-horizon decision-making tasks. However, most existing approaches store memory in fixed representations and reuse it at a single or implicit level of abstraction, which limits generalization and often leads to negative transfer when distribution shift. This paper proposes the Meta-Cognitive Memory Abstraction method (MCMA), which treats memory abstraction as a learnable cognitive skill rather than a fixed design choice. MCMA decouples task execution from memory management by combining a frozen task model with a learned memory copilot. The memory copilot is trained using direct preference optimization, it determines how memories should be structured, abstracted, and reused. Memories are further organized into a hierarchy of abstraction levels, enabling selective reuse based on task similarity. When no memory is transferable, MCMA transfers the ability to abstract and manage memory by transferring the memory copilot. Experiments on ALFWorld, ScienceWorld, and BabyAI demonstrate substantial improvements in performance, out-of-distribution generalization, and cross-task transfer over several baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes