CLFeb 21, 2024

TOOLVERIFIER: Generalization to New Tools via Self-Verification

Dheeraj Mekala, Jason Weston, Jack Lanchantin, Roberta Raileanu, Maria Lomeli, Jingbo Shang, Jane Dwivedi-Yu

Meta AI

arXiv:2402.14158v217.935 citationsh-index: 28Has CodeEMNLP

Originality Incremental advance

AI Analysis

This addresses the challenge of building general AI assistants by improving tool use generalization, though it is incremental as it builds on existing benchmarks and methods.

The paper tackles the problem of language models struggling to robustly use new tools from few demonstrations by introducing a self-verification method that asks contrastive questions during tool selection and parameter generation, resulting in an average improvement of 22% over few-shot baselines on 4 tasks with 17 unseen tools.

Teaching language models to use tools is an important milestone towards building general assistants, but remains an open problem. While there has been significant progress on learning to use specific tools via fine-tuning, language models still struggle with learning how to robustly use new tools from only a few demonstrations. In this work we introduce a self-verification method which distinguishes between close candidates by self-asking contrastive questions during (1) tool selection; and (2) parameter generation. We construct synthetic, high-quality, self-generated data for this goal using Llama-2 70B, which we intend to release publicly. Extensive experiments on 4 tasks from the ToolBench benchmark, consisting of 17 unseen tools, demonstrate an average improvement of 22% over few-shot baselines, even in scenarios where the distinctions between candidate tools are finely nuanced.

View on arXiv PDF Code

Similar