RO MA May 2

LLM-Foraging: Large Language Models for Decentralized Swarm Robot Foraging

Peihan Li, Joanna Gutierrez, Fabian Hernandez, Qi Lu, Lifeng Zhou

arXiv:2605.01461

AI Analysis

For swarm robotics, this work demonstrates that LLMs can serve as generalizable, training-free decision policies that transfer across varying team sizes, arena sizes, and resource distributions, addressing the brittleness of optimized policies.

LLM-Foraging uses a large language model as a tactical decision-maker in a decentralized swarm foraging controller, achieving higher resource collection and consistency than a GA-tuned baseline across 36 configurations without retraining.

Swarm foraging algorithms, such as the central-place foraging algorithm (CPFA), typically rely on offline parameter optimization using genetic algorithms (GA) or reinforcement learning, yielding policies tightly coupled to a specific combination of team size, arena size, and resource distribution. When deployment conditions change, performance degrades, and retraining is computationally expensive. We propose LLM-Foraging, a decentralized swarm controller that augments the CPFA state machine with a large language model (LLM) tactical decision-maker at three structured decision points, namely post-deposit, central-zone arrival, and search starvation. Each robot runs its own LLM client and queries it using only locally observable state, while the existing CPFA motion and sensing stack executes the selected action. Because the LLM serves as a general decision policy rather than parameters fitted to a single configuration, the controller is training-free at deployment and transfers across configurations without re-optimization. We evaluate LLM-Foraging in Gazebo with TurtleBot3 robots across 36 configurations spanning team sizes of 4 to 10 robots, arena sizes from 6x6 to 10x10 meters, and three resource distributions (clustered, power-law, random). LLM-Foraging collects more resources than the GA-tuned CPFA baseline across the evaluated configurations and does so more consistently, whereas the GA's single-configuration tuning does not transfer.
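The decision-point structure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The three decision points come from the abstract, but the action names, the fields of `LocalState`, the prompt wording, and the fallback rule are all hypothetical, and `query_llm` stands in for whatever per-robot LLM client is actually used.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Dict, List


class DecisionPoint(Enum):
    """The three decision points named in the abstract."""
    POST_DEPOSIT = "post_deposit"
    CENTRAL_ZONE_ARRIVAL = "central_zone_arrival"
    SEARCH_STARVATION = "search_starvation"


# Hypothetical action menus: the abstract does not list the actual actions.
ALLOWED_ACTIONS: Dict[DecisionPoint, List[str]] = {
    DecisionPoint.POST_DEPOSIT: ["return_to_last_site", "random_search"],
    DecisionPoint.CENTRAL_ZONE_ARRIVAL: ["depart_new_heading", "return_to_last_site"],
    DecisionPoint.SEARCH_STARVATION: ["keep_searching", "return_to_center"],
}


@dataclass
class LocalState:
    """Assumed locally observable quantities; field names are illustrative."""
    resources_collected: int
    seconds_since_last_find: float
    last_site_density: int


def build_prompt(point: DecisionPoint, state: LocalState, allowed: List[str]) -> str:
    """Format the robot's local state and the allowed actions as a text prompt."""
    return (
        f"Decision point: {point.value}\n"
        f"Resources collected: {state.resources_collected}\n"
        f"Seconds since last find: {state.seconds_since_last_find}\n"
        f"Resources seen near last site: {state.last_site_density}\n"
        f"Reply with exactly one of: {', '.join(allowed)}"
    )


def choose_action(
    point: DecisionPoint,
    state: LocalState,
    query_llm: Callable[[str], str],
) -> str:
    """Ask the LLM for an action and validate the reply against the menu.

    Falling back to the first allowed action on an invalid reply is an
    assumption of this sketch, not something the abstract specifies.
    """
    allowed = ALLOWED_ACTIONS[point]
    reply = query_llm(build_prompt(point, state, allowed)).strip().lower()
    return reply if reply in allowed else allowed[0]
```

In this sketch the LLM only selects among a fixed menu of actions, which mirrors the abstract's division of labor: the language model makes the tactical choice, and the existing motion and sensing stack carries it out. Passing `query_llm` as a callable keeps each robot's client independent, consistent with the decentralized design.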
