The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents

arXiv:2605.0806066.3
AI Analysis

For researchers and developers of multi-agent LLM systems, the paper shows that longer memory can destabilize cooperation, challenging the assumption that expanded context is always beneficial.

Expanding the history available to LLM agents degrades cooperation in multi-agent social dilemmas across 18 of 28 model-game settings, a phenomenon termed the memory curse. The breakdown is associated with eroding forward-looking intent rather than increased paranoia, and the trigger is memory content, not length alone.

Context window expansion is often treated as a straightforward capability upgrade for LLMs, but we find it systematically fails in multi-agent social dilemmas. Across 7 LLMs and 4 games over 500 rounds, expanding accessible history degrades cooperation in 18 of 28 model-game settings, a pattern we term the memory curse. We isolate the underlying mechanism through three analyses. First, lexical analysis of 378,000 reasoning traces associates this breakdown with eroding forward-looking intent rather than rising paranoia. We validate this using targeted fine-tuning as a cognitive probe: a LoRA adapter trained exclusively on forward-looking traces mitigates the decay and transfers zero-shot to distinct games. Second, memory sanitization holds prompt length fixed while replacing visible history with synthetic cooperative records, which restores cooperation substantially, proving the trigger is memory content, not length alone. Finally, ablating explicit Chain-of-Thought reasoning often reduces the collapse, showing that deliberation paradoxically amplifies the memory curse. Together, these results recast memory as an active determinant of multi-agent behavior: longer recall can either destabilize or support cooperation depending on the reasoning patterns it elicits.
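
The memory sanitization control described in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the `Round` record, the prompt format in `render_history`, and the `sanitize` helper are all hypothetical, and a two-move game with single-character moves ("C" for cooperate, "D" for defect) is assumed so that swapping records leaves the rendered prompt length unchanged.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Round:
    """One round of a two-player game (hypothetical record format)."""
    index: int
    my_move: str      # "C" (cooperate) or "D" (defect)
    other_move: str   # "C" or "D"


def render_history(history, window):
    """Render the most recent `window` rounds as prompt text."""
    visible = history[-window:] if window > 0 else []
    return "\n".join(
        f"Round {r.index}: you={r.my_move}, opponent={r.other_move}"
        for r in visible
    )


def sanitize(history):
    """Swap every record for a synthetic mutual-cooperation record.

    Round indices and the number of records are preserved, so the
    rendered prompt has the same length as the original; only the
    content of the visible history changes.
    """
    return [Round(r.index, "C", "C") for r in history]


# Illustrative data only, not results from the paper.
real = [Round(1, "C", "D"), Round(2, "D", "D"), Round(3, "D", "C")]
clean = sanitize(real)

real_prompt = render_history(real, window=3)
clean_prompt = render_history(clean, window=3)
```

Under these assumptions, `real_prompt` and `clean_prompt` have the same number of characters and lines, so any behavioral difference between agents shown one or the other would be attributable to what the history says rather than how long it is. A real experiment would need to control length in tokens rather than characters, which depends on the tokenizer in use.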

