CLNov 4, 2024

DynaSaur: Large Language Agents Beyond Predefined Actions

arXiv:2411.01747v325 citationsh-index: 17Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of restricted planning and acting capabilities for LLM agents in real-world, open-ended scenarios, representing a novel method for a known bottleneck.

The paper tackles the limitation of LLM agents relying on fixed, predefined action sets by proposing a framework that dynamically creates and composes actions using a general-purpose programming language, resulting in significantly improved flexibility and outperformance over prior methods in benchmarks.

Existing LLM agent systems typically select actions from a fixed and predefined set at every step. While this approach is effective in closed, narrowly scoped environments, it presents two major challenges for real-world, open-ended scenarios: (1) it significantly restricts the planning and acting capabilities of LLM agents, and (2) it requires substantial human effort to enumerate and implement all possible actions, which is impractical in complex environments with a vast number of potential actions. To address these limitations, we propose an LLM agent framework that can dynamically create and compose actions as needed. In this framework, the agent interacts with its environment by generating and executing programs written in a general-purpose programming language. Moreover, generated actions are accumulated over time for future reuse. Our extensive experiments across multiple benchmarks show that this framework significantly improves flexibility and outperforms prior methods that rely on a fixed action set. Notably, it enables LLM agents to adapt and recover in scenarios where predefined actions are insufficient or fail due to unforeseen edge cases. Our code can be found in https://github.com/adobe-research/dynasaur.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes