MAAILGSep 26, 2025

Multi-Agent Path Finding via Offline RL and LLM Collaboration

arXiv:2509.22130v1h-index: 29
Originality Incremental advance
AI Analysis

This addresses MAPF challenges for robotics and logistics, offering an incremental improvement over existing decentralized RL methods.

The paper tackles the problem of Multi-Agent Path Finding (MAPF) by proposing a decentralized planning framework using offline reinforcement learning and LLM collaboration, reducing training times from weeks to hours and improving adaptability in dynamic environments.

Multi-Agent Path Finding (MAPF) poses a significant and challenging problem critical for applications in robotics and logistics, particularly due to its combinatorial complexity and the partial observability inherent in realistic environments. Decentralized reinforcement learning methods commonly encounter two substantial difficulties: first, they often yield self-centered behaviors among agents, resulting in frequent collisions, and second, their reliance on complex communication modules leads to prolonged training times, sometimes spanning weeks. To address these challenges, we propose an efficient decentralized planning framework based on the Decision Transformer (DT), uniquely leveraging offline reinforcement learning to substantially reduce training durations from weeks to mere hours. Crucially, our approach effectively handles long-horizon credit assignment and significantly improves performance in scenarios with sparse and delayed rewards. Furthermore, to overcome adaptability limitations inherent in standard RL methods under dynamic environmental changes, we integrate a large language model (GPT-4o) to dynamically guide agent policies. Extensive experiments in both static and dynamically changing environments demonstrate that our DT-based approach, augmented briefly by GPT-4o, significantly enhances adaptability and performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes