Living systematic review
Agent / long-term memory
Giving LLM agents persistent, retrievable memory across turns and sessions.
69 papers · 99 critique receipts · 540 benchmark results · updated Jun 18, 2026
Most-superseded baselines
Ranked by how many distinct papers critique or beat each method. These are the standard baselines newer work routinely measures against.
- 1Mem0· Mem0Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory
3 papers critique it · 15 beat it on benchmarks
- 3Zep· Mem0Zep: A Temporal Knowledge Graph Architecture for Agent Memory
1 papers critique it · 9 beat it on benchmarks
- 4Nemori· NemoriNemori: Self-Organizing Agent Memory Inspired by Cognitive Science
2 papers critique it · 5 beat it on benchmarks
- 5RAPTOR· RAPTORRAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
3 papers critique it · 4 beat it on benchmarks
- 6MemGPT· RAPTORMemGPT: Towards LLMs as Operating Systems
4 papers critique it · 3 beat it on benchmarks
- 7HippoRAG 2· RAPTORFrom RAG to Memory: Non-Parametric Continual Learning for Large Language Models
1 papers critique it · 5 beat it on benchmarks
- 9HippoRAG· RAPTORHippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models
4 papers critique it · 2 beat it on benchmarks
- 10LightMem· Mem0LightMem: Lightweight and Efficient Memory-Augmented Generation
1 papers critique it · 4 beat it on benchmarks
- 11Reflexion· ReflexionReflexion: Language Agents with Verbal Reinforcement Learning
3 papers critique it · 2 beat it on benchmarks
- 12Memory-R1· Mem0Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
4 papers critique it · 1 beat it on benchmarks
Sub-problems
Methods that compete on the same benchmarks cluster into distinct sub-problems.
RAPTOR · 25 methods
RAPTOR · MemGPT · HippoRAG 2 · HippoRAG · MemoryBank · GraphRAG
MIRIX · 3 methods
MIRIX · EviMem · MemBuilder
Memento · 3 methods
Generative Agents · 3 methods
The frontier
Recent methods not yet superseded in the knowledge base.
- Jun 9, 2026
- ConvMemory v2ConvMemory v2: A Recall-Preserving Top-10 Evidence Reranker for Conversational Memory RetrievalJun 9, 2026
- Jun 3, 2026
- Jun 2, 2026
- May 30, 2026
- May 30, 2026
- MemGuardMemGuard: Preventing Memory Contamination in Long-Term Memory-Augmented Large Language ModelsMay 27, 2026
- DeferMemDeferMem: Query-Time Evidence Distillation via Reinforcement Learning for Long-Term Memory QAMay 21, 2026
- May 20, 2026
- Visual Agentic Memory (VAM)Visual Agentic Memory: Enabling Online Long Video Understanding via Online Indexing, Hierarchical Memory, and Agentic RetrievalMay 15, 2026
- May 14, 2026
- May 11, 2026