AI CL LG MADec 18, 2025

PAACE: A Plan-Aware Automated Agent Context Engineering Framework

arXiv:2512.16970v17.82 citationsh-index: 5

Originality Incremental advance

AI Analysis

This work addresses the challenge of context management for LLM agents in complex workflows, offering a novel solution that is incremental in building on prior compression methods.

The paper tackles the problem of managing rapidly expanding contexts in multi-step LLM agent workflows by introducing PAACE, a plan-aware automated context engineering framework, which improves agent correctness and reduces context load, achieving higher accuracy on benchmarks like AppWorld while lowering peak context and cumulative dependency.

Large Language Model (LLM) agents are increasingly deployed in complex, multi-step workflows involving planning, tool use, reflection, and interaction with external knowledge systems. These workflows generate rapidly expanding contexts that must be curated, transformed, and compressed to maintain fidelity, avoid attention dilution, and reduce inference cost. Prior work on summarization and query-aware compression largely ignores the multi-step, plan-aware nature of agentic reasoning. In this work, we introduce PAACE (Plan-Aware Automated Context Engineering), a unified framework for optimizing the evolving state of LLM agents through next-k-task relevance modeling, plan-structure analysis, instruction co-refinement, and function-preserving compression. PAACE comprises (1) PAACE-Syn, a large-scale generator of synthetic agent workflows annotated with stepwise compression supervision, and (2) PAACE-FT, a family of distilled, plan-aware compressors trained from successful teacher demonstrations. Experiments on long-horizon benchmarks (AppWorld, OfficeBench, and 8-Objective QA) demonstrate that PAACE consistently improves agent correctness while substantially reducing context load. On AppWorld, PAACE achieves higher accuracy than all baselines while lowering peak context and cumulative dependency. On OfficeBench and multi-hop QA, PAACE improves both accuracy and F1, achieving fewer steps, lower peak tokens, and reduced attention dependency. Distilled PAACE-FT retains 97 percent of the teacher's performance while reducing inference cost by over an order of magnitude, enabling practical deployment of plan-aware compression with compact models.

View on arXiv PDF

Similar