CL AIMar 21, 2025

Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models

Mengsong Wu, Tong Zhu, Han Han, Xiang Zhang, Wenbiao Shao, Wenliang Chen

arXiv:2503.16779v113.06 citationsh-index: 13Has Code

Originality Incremental advance

AI Analysis

This addresses the limitation of existing tool learning methods that require fine-tuning or inefficient prompts, allowing for more flexible tool usage in AI applications.

The paper tackles the problem of enabling frozen language models to use a large pool of unseen tools without fine-tuning, by introducing Chain-of-Tools for tool calling in chain-of-thought reasoning, and shows it outperforms baselines on benchmarks like GSM8K-XL and FuncQA.

Tool learning can further broaden the usage scenarios of large language models (LLMs). However most of the existing methods either need to finetune that the model can only use tools seen in the training data, or add tool demonstrations into the prompt with lower efficiency. In this paper, we present a new Tool Learning method Chain-of-Tools. It makes full use of the powerful semantic representation capability of frozen LLMs to finish tool calling in CoT reasoning with a huge and flexible tool pool which may contain unseen tools. Especially, to validate the effectiveness of our approach in the massive unseen tool scenario, we construct a new dataset SimpleToolQuestions. We conduct experiments on two numerical reasoning benchmarks (GSM8K-XL and FuncQA) and two knowledge-based question answering benchmarks (KAMEL and SimpleToolQuestions). Experimental results show that our approach performs better than the baseline. We also identify dimensions of the model output that are critical in tool selection, enhancing the model interpretability. Our code and data are available at: https://github.com/fairyshine/Chain-of-Tools .

View on arXiv PDF Code

Similar