CLApr 13

HTAA: Enhancing LLM Planning via Hybrid Toolset Agentization & Adaptation

arXiv:2604.1091729.8h-index: 5
AI Analysis

For practitioners deploying LLMs with large tool sets, HTAA reduces manual validation effort and operational cost in production, but the gains are incremental over existing hierarchical methods.

HTAA introduces a hierarchical framework for LLM tool-use planning that encapsulates co-used tools into specialized agents and aligns the planner via trajectory-based training, achieving higher task success rates and reduced context overhead on benchmarks and a real-world ride-hailing platform.

Enabling large language models to scale and reliably use hundreds of tools is critical for real-world applications, yet challenging due to the inefficiency and error accumulation inherent in flat tool-calling architectures. To address this, we propose Hybrid Toolset Agentization & Adaptation (HTAA), a hierarchical framework for scalable tool-use planning. We propose a novel toolset agentization paradigm, which encapsulates frequently co-used tools into specialized agent tools, thereby reducing the planner's action space and mitigating redundancy. To ensure effective coordination, we design Asymmetric Planner Adaptation, a trajectory-based training paradigm that aligns the high-level planner with agent tools via backward reconstruction and forward refinement. To validate the performance of HTAA, we conduct experiments on a real-world internal dataset, InfoVerify, based on the POI validation workflow of China's largest online large-scale ride-hailing platform, featuring long-horizon executable tool trajectories. Experiments on InfoVerify and widely-used benchmarks show that HTAA consistently achieves higher task success rates, requires short tool calling trajectories, and significantly reduces context overhead compared to strong baselines. Furthermore, in a production deployment, HTAA substantially reduces manual validation effort and operational cost, demonstrating its practical efficacy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes