Is MemoryOS superseded?

MemoryOS (Agent / long-term memory): superseded — cited as a baseline and beaten by newer methods. 2 paper(s) critique it, 4 beat it on benchmarks — #8 of 63 most-superseded. Sub-problem: cluster led by Mem0. Newer alternatives in the same sub-problem include REAL, MemPro, MemGuard, DeferMem, Memory-R2.

Method Drift›Agent / long-term memory

Superseded baseline#8 of 63 most-superseded

MemoryOS

Memory OS of AI Agent

Agent / long-term memory · first seen May 30, 2025

superseded — cited as a baseline and beaten by newer methods

2 papers critique it · 4 beat it on benchmarks

What papers say

Verbatim critique sentences, each from a paper that cites MemoryOS as a baseline.

“By treating outgoing connections as equally valid or using fixed graph-expansion rules, existing systems can fail to discriminate between highly relevant pathways and distracting noise, leading to degraded retrieval accuracy as memory grows.”
— HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution
“Blocking Latency: Achieving structural depth often comes at the cost of interactivity. Approaches like MemoryOS and synchronous graph builders typically require heavy LLM operations on the critical path. As noted in benchmarks wu2024longmemeval, such mechanisms incur prohibitive latency, rendering them impractical for real-time interaction.”
— MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents

Beaten on benchmarks

Head-to-head results where a newer method reports beating MemoryOS. Values are copied from the source paper's tables — verify against the cited paper.

HAGE beats MemoryOS · Overall [gpt-4o-mini]
0.739 vs 0.553
HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution
HAGE beats MemoryOS · Overall [Qwen2.5-3B]
0.548 vs 0.280
HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution
HAGE beats MemoryOS · LLM Score [GPT-4o-mini]
0.824 vs 0.592
HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution
HAGE beats MemoryOS · F1 [GPT-4o-mini]
0.678 vs 0.477
HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution
HAGE beats MemoryOS · LLM Score [Qwen2.5-3B]
0.527 vs 0.459
HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution
HAGE beats MemoryOS · F1 [Qwen2.5-3B]
0.429 vs 0.350
HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution
HAGE beats MemoryOS · Avg. Score [all]
0.739 vs 0.553
HAGE: Harnessing Agentic Memory via RL-Driven Weighted Graph Evolution
GAM beats MemoryOS · LoCoMo Temporal F1 [GPT-4o-mini]
56.15 vs 41.15
General Agentic Memory Via Deep Research
GAM beats MemoryOS · LoCoMo Temporal F1 [Qwen2.5-14b]
40.96 vs 32.24
General Agentic Memory Via Deep Research
MAGMA beats MemoryOS · Judge [Overall]
0.700 vs 0.553
MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents
MAGMA beats MemoryOS · Latency (s) [all methods]
1.47 vs 32.68
MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents
Memory-R2 beats MemoryOS · F1 [Multi-hop]
38.41 vs 29.55
Memory-R2: Fair Credit Assignment for Long-Horizon Memory-Augmented LLM Agents

Newer alternatives

Recent methods in the same sub-problem, not yet superseded in the knowledge base.