AIOSSEApr 11

ClawVM: Harness-Managed Virtual Memory for Stateful Tool-Using LLM Agents

arXiv:2604.1035238.9h-index: 2
AI Analysis

For developers of LLM agent harnesses, ClawVM provides a principled solution to state management failures that plague current systems.

Stateful tool-using LLM agents suffer from recurring failures due to best-effort state management. ClawVM introduces a virtual memory layer that makes residency and durability deterministic and auditable, eliminating all policy-controllable faults when the minimum-fidelity set fits within the token budget, with median <50 microseconds overhead per turn.

Stateful tool-using LLM agents treat the context window as working memory, yet today's agent harnesses manage residency and durability as best-effort, causing recurring failures: lost state after compaction, bypassed flushes on reset, and destructive writeback. We present \textsc{ClawVM}, a virtual memory layer that manages state as typed pages with minimum-fidelity invariants, multi-resolution representations under a token budget, and validated writeback at every lifecycle boundary. Because the harness already assembles prompts, mediates tools, and observes lifecycle events, it is the natural enforcement point; placing the contract there makes residency and durability deterministic and auditable. Across synthetic workloads, 12 real-session traces, and adversarial stress tests, \textsc{ClawVM} eliminates all policy-controllable faults whenever the minimum-fidelity set fits within the token budget, confirmed by an offline oracle, and adds median <50 microseconds of policy-engine overhead per turn.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes