CL MAJun 3

RAMPART: Registry-based Agentic Memory with Priority-Aware Runtime Transformation

arXiv:2606.0462863.7

Predicted impact top 96% in CL · last 90 daysOriginality Highly original

AI Analysis

For LLM agent developers, RAMPART provides a programmable, permissioned memory system that improves task success and reduces costs without modifying model weights.

RAMPART introduces a compile-time memory model for LLM agents that uses a structured registry with composable primitives, achieving up to 67.8% prompt cost reduction and 83% success rate recovery via relevance gating, while block grouping lifts task success by tens of percentage points at hard positions.

RAMPART is a compile-time memory model and pure in-RAM block registry for LLM-based agents. Context assembly is a programmable runtime operation where content is compiled from a structured registry under explicit policy for ordering, inclusion, and eviction. Five composable primitives (promote, gate, write, evict, rollback) act on named addressable blocks before compilation at zero prompt-token cost. Provenance tags and non-evictable authorship flags implement a permissioned memory model with block-level ownership. Controlled probes with Qwen3-8B Q4 show that compile-time placement and the structural relationship between blocks and the task query affect task success, with the cliff falling at roughly the seventh block position when the task follows the registry and the twelfth when it precedes. Grouping the critical block with content-adjacent neighbours and promoting the group as a unit lifts task success by tens of percentage points at positions where single-block placement fails. Cross-model replication on Qwen2.5-7B, Llama-3.1-8B, Mistral-7B-v0.3, and Qwen3-14B shows the content-priming effect appears at the same absolute positions across families, with magnitude varying with model strength. Block grouping raises Mistral's mean pass rate roughly fivefold at the hardest registry size, and a smaller model with the intervention can outperform a larger model without it in the mid-registry zone. Relevance gating reduces prompt cost by 67.8\% while recovering 83% of the promoted-condition success rate. Schema eviction produces 0% invocations against 100% with the schema present, a property policy-based approaches cannot guarantee by construction. Shared-registry coordination reduces inter-agent communication to a method call at zero coordination token cost.

View on arXiv PDF

Similar