AIDec 17, 2025

SCOPE: Prompt Evolution for Enhancing Agent Effectiveness

arXiv:2512.15374v111 citationsh-index: 12Has Code
Originality Highly original
AI Analysis

This addresses a critical bottleneck for LLM agents in dynamic environments, offering a novel method to enhance effectiveness without human intervention.

The paper tackles the problem of LLM agents struggling with dynamic contexts due to static prompts, introducing SCOPE to automatically evolve prompts via online optimization, which improved task success rates from 14.23% to 38.64% on the HLE benchmark.

Large Language Model (LLM) agents are increasingly deployed in environments that generate massive, dynamic contexts. However, a critical bottleneck remains: while agents have access to this context, their static prompts lack the mechanisms to manage it effectively, leading to recurring Corrective and Enhancement failures. To address this capability gap, we introduce \textbf{SCOPE} (Self-evolving Context Optimization via Prompt Evolution). SCOPE frames context management as an \textit{online optimization} problem, synthesizing guidelines from execution traces to automatically evolve the agent's prompt. We propose a Dual-Stream mechanism that balances tactical specificity (resolving immediate errors) with strategic generality (evolving long-term principles). Furthermore, we introduce Perspective-Driven Exploration to maximize strategy coverage, increasing the likelihood that the agent has the correct strategy for any given task. Experiments on the HLE benchmark show that SCOPE improves task success rates from 14.23\% to 38.64\% without human intervention. We make our code publicly available at https://github.com/JarvisPei/SCOPE.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes