CL | Mar 5, 2024

Learning to Use Tools via Cooperative and Interactive Agents

Zhengliang Shi, Shen Gao, Xiuyi Chen, Yue Feng, Lingyong Yan, Haibo Shi, Dawei Yin, Pengjie Ren, Suzan Verberne, Zhaochun Ren

Baidu

arXiv:2403.03031v422.763 citations | h-index: 41 | Has Code | EMNLP

Originality: Incremental advance

AI Analysis

The paper addresses the challenge of making LLMs more effective as agents for practical tasks, though it appears incremental because it builds on existing tool learning methods.

The paper tackles the problem of performance degradation in tool learning for large language models (LLMs) due to inflexible pipelines and lack of specialization, proposing ConAgents, a framework with cooperative agents, which achieves up to 14% higher success rates in experiments.

Tool learning empowers large language models (LLMs) as agents to use external tools and extend their utility. Existing methods employ a single LLM-based agent to iteratively select and execute tools, then incorporate the execution results into the next action prediction. Despite their progress, these methods suffer from performance degradation when addressing practical tasks due to: (1) a pre-defined pipeline with restricted flexibility to calibrate incorrect actions, and (2) the difficulty of adapting a general LLM-based agent to perform a variety of specialized actions. To mitigate these problems, we propose ConAgents, a Cooperative and interactive Agents framework, which coordinates three specialized agents for tool selection, tool execution, and action calibration, respectively. ConAgents introduces two communication protocols to enable flexible cooperation among the agents. To effectively generalize ConAgents to open-source models, we also propose specialized action distillation, which enhances their ability to perform specialized actions in our framework. Our extensive experiments on three datasets show that LLMs equipped with ConAgents outperform baselines with substantial improvement (i.e., up to 14% higher success rate).
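The division of labor described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the two communication protocols, so the control flow below (select, execute, calibrate on failure, retry) is an assumption, and every name (`selection_agent`, `execution_agent`, `calibration_agent`, `run`, the `calculator` tool) is hypothetical. Plain functions stand in for the LLM calls.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Action:
    tool: str       # name of the tool to call
    argument: str   # raw argument string passed to the tool


def selection_agent(task: str, history: List[str]) -> Optional[Action]:
    """Stand-in for the tool-selection agent: pick the next tool, or None when done."""
    if history:
        return None  # assumption: one successful tool call solves this toy task
    return Action(tool="calculator", argument=task)


def execution_agent(action: Action, tools: Dict[str, Callable[[str], str]]) -> str:
    """Stand-in for the tool-execution agent: run the chosen tool."""
    return tools[action.tool](action.argument)


def calibration_agent(action: Action, error: Exception) -> Action:
    """Stand-in for the calibration agent: revise an action that failed."""
    # Toy repair rule: the calculator only understands '*' for multiplication.
    return Action(tool=action.tool, argument=action.argument.replace("x", "*"))


def calculator(expression: str) -> str:
    """Hypothetical tool: evaluates '<int> <op> <int>' for op in {'+', '*'}."""
    left, op, right = expression.split()
    a, b = int(left), int(right)
    if op == "+":
        return str(a + b)
    if op == "*":
        return str(a * b)
    raise ValueError(f"unsupported operator: {op}")


def run(task: str, tools: Dict[str, Callable[[str], str]], max_retries: int = 2) -> List[str]:
    """Coordinate the three agents until the selection agent reports completion."""
    history: List[str] = []
    while True:
        action = selection_agent(task, history)
        if action is None:
            return history
        for attempt in range(max_retries + 1):
            try:
                history.append(execution_agent(action, tools))
                break
            except ValueError as error:
                if attempt == max_retries:
                    raise
                # Hand the failed action to the calibration agent and retry.
                action = calibration_agent(action, error)
```

For example, `run("6 x 7", {"calculator": calculator})` fails on the first execution because the tool rejects `x`, is repaired by the calibration step, and succeeds on the retry. A single-agent pipeline would have to handle selection, execution, and repair within one prompt; separating them is the design choice the abstract motivates.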

