CLAIMar 21, 2025

Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models

arXiv:2503.16779v16 citationsh-index: 13Has Code
Originality Incremental advance
AI Analysis

This addresses the limitation of existing tool learning methods that require fine-tuning or inefficient prompts, allowing for more flexible tool usage in AI applications.

The paper tackles the problem of enabling frozen language models to use a large pool of unseen tools without fine-tuning, by introducing Chain-of-Tools for tool calling in chain-of-thought reasoning, and shows it outperforms baselines on benchmarks like GSM8K-XL and FuncQA.

Tool learning can further broaden the usage scenarios of large language models (LLMs). However most of the existing methods either need to finetune that the model can only use tools seen in the training data, or add tool demonstrations into the prompt with lower efficiency. In this paper, we present a new Tool Learning method Chain-of-Tools. It makes full use of the powerful semantic representation capability of frozen LLMs to finish tool calling in CoT reasoning with a huge and flexible tool pool which may contain unseen tools. Especially, to validate the effectiveness of our approach in the massive unseen tool scenario, we construct a new dataset SimpleToolQuestions. We conduct experiments on two numerical reasoning benchmarks (GSM8K-XL and FuncQA) and two knowledge-based question answering benchmarks (KAMEL and SimpleToolQuestions). Experimental results show that our approach performs better than the baseline. We also identify dimensions of the model output that are critical in tool selection, enhancing the model interpretability. Our code and data are available at: https://github.com/fairyshine/Chain-of-Tools .

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes