AIFeb 23

Learning to Rewrite Tool Descriptions for Reliable LLM-Agent Tool Use

arXiv:2602.20426v1h-index: 5
Originality Incremental advance
AI Analysis

This addresses a practical problem for developers of LLM agents by improving tool interface optimization without execution traces, though it is incremental as it builds on prior work on tool interfaces.

The paper tackles the bottleneck of human-oriented tool interfaces for LLM-based agents by proposing Trace-Free+, a curriculum learning framework that transfers supervision from trace-rich to trace-free settings, resulting in consistent gains on unseen tools, strong cross-domain generalization, and robustness with over 100 candidate tools.

The performance of LLM-based agents depends not only on the agent itself but also on the quality of the tool interfaces it consumes. While prior work has focused heavily on agent fine-tuning, tool interfaces-including natural language descriptions and parameter schemas-remain largely human-oriented and often become a bottleneck, especially when agents must select from large candidate tool sets. Existing approaches to improving tool interfaces rely on execution traces, which are frequently unavailable in cold-start or privacy-constrained settings, and typically optimize each tool independently, limiting scalability and generalization to unseen tools. We propose Trace-Free+, a curriculum learning framework that progressively transfers supervision from trace-rich settings to trace-free deployment, encouraging the model to abstract reusable interface-usage patterns and tool usage outcomes. To support this approach, we construct a large-scale dataset of high-quality tool interfaces using a structured workflow over a diverse collection of tools. Experiments on StableToolBench and RestBench show consistent gains on unseen tools, strong cross-domain generalization, and robustness as the number of candidate tools scales to over 100, demonstrating that tool interface optimization is a practical and deployable complement to agent fine-tuning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes