AIOct 17, 2025

Experience-Driven Exploration for Efficient API-Free AI Agents

arXiv:2510.15259v21 citationsh-index: 13
Originality Incremental advance
AI Analysis

This addresses the challenge of inefficient exploration for AI agents in GUI-based software without APIs, offering a domain-specific solution that is incremental over existing methods.

The paper tackled the problem of AI agents operating in API-free GUI environments, where they face inefficiency due to myopic decisions and trial-and-error exploration, by proposing KG-Agent, a framework that structures interactions into a knowledge graph and uses hybrid intrinsic rewards, resulting in significant improvements in exploration efficiency and strategic depth in environments like Civilization V and Slay the Spire.

Most existing software lacks accessible Application Programming Interfaces (APIs), requiring agents to operate solely through pixel-based Graphical User Interfaces (GUIs). In this API-free setting, large language model (LLM)-based agents face severe efficiency bottlenecks: limited to local visual experiences, they make myopic decisions and rely on inefficient trial-and-error, hindering both skill acquisition and long-term planning. To address these challenges, we propose KG-Agent, an experience-driven learning framework that structures an agent's raw pixel-level interactions into a persistent State-Action Knowledge Graph (SA-KG). KG-Agent overcomes inefficient exploration by linking functionally similar but visually distinct GUI states, forming a rich neighborhood of experience that enables the agent to generalize from a diverse set of historical strategies. To support long-horizon reasoning, we design a hybrid intrinsic reward mechanism based on the graph topology, combining a state value reward for exploiting known high-value pathways with a novelty reward that encourages targeted exploration. This approach decouples strategic planning from pure discovery, allowing the agent to effectively value setup actions with delayed gratification. We evaluate KG-Agent in two complex, open-ended GUI-based decision-making environments (Civilization V and Slay the Spire), demonstrating significant improvements in exploration efficiency and strategic depth over the state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes