CL AI LG NEMay 7, 2024

Fleet of Agents: Coordinated Problem Solving with Large Language Models

Lars Klein, Nearchos Potamitis, Roland Aydin, Robert West, Caglar Gulcehre, Akhil Arora

arXiv:2405.06691v36.17 citationsh-index: 11Has CodeICML

Originality Incremental advance

AI Analysis

This work addresses the cost-quality trade-off in LLM-based reasoning frameworks, offering a practical solution for researchers and practitioners, though it appears incremental as it builds on existing agent and search methods.

The paper tackles the problem of balancing cost and quality in enhancing reasoning abilities of large language models (LLMs) by introducing Fleet of Agents (FoA), a framework that uses LLMs as agents for dynamic tree searches with genetic-type particle filtering, resulting in an average quality improvement of ~5% while requiring only ~40% of the cost of previous state-of-the-art methods.

While numerous frameworks have been developed to enhance the reasoning abilities of large language models (LLMs), there is a scarcity of methods that effectively balance the trade-off between cost and quality. In this paper, we introduce Fleet of Agents (FoA), a novel and intuitive yet principled framework utilizing LLMs as agents to navigate through dynamic tree searches, employing a genetic-type particle filtering approach. FoA spawns a multitude of agents, each exploring the search space autonomously, followed by a selection phase where resampling based on a heuristic value function optimizes the balance between exploration and exploitation. This mechanism enables dynamic branching, adapting the exploration strategy based on discovered solutions. We conduct extensive experiments on three benchmark tasks, ``Game of 24'', ``Mini-Crosswords'', and ``WebShop'', utilizing four different LLMs, ``GPT-3.5'', ``GPT-4'', ``LLaMA3.2-11B'', and ``LLaMA3.2-90B''. On average across all tasks and LLMs, FoA obtains a quality improvement of ~5% while requiring only ~40% of the cost of previous SOTA methods. Notably, our analyses reveal that (1) FoA achieves the best cost-quality trade-off among all benchmarked methods and (2) FoA + LLaMA3.2-11B surpasses the Llama3.2-90B model. FoA is publicly available at https://github.com/au-clan/FoA.

View on arXiv PDF Code

Similar