ML LGMar 4, 2025

Decentralized Reinforcement Learning for Multi-Agent Multi-Resource Allocation via Dynamic Cluster Agreements

Antonio Marino, Esteban Restrepo, Claudio Pacchierotti, Paolo Robuffo Giordano

arXiv:2503.02437v110.34 citationsh-index: 48IEEE Robot Autom Lett

Originality Incremental advance

AI Analysis

This addresses decentralized multi-agent resource allocation, an incremental improvement over existing methods.

The paper tackles decentralized allocation of heterogeneous resources among multiple agents by proposing LGTC-IPPO, which integrates dynamic cluster consensus with IPPO to reduce reliance on global information. Experimental results show it achieves more stable rewards, better coordination, and robust performance as agents or resource types increase.

This paper addresses the challenge of allocating heterogeneous resources among multiple agents in a decentralized manner. Our proposed method, LGTC-IPPO, builds upon Independent Proximal Policy Optimization (IPPO) by integrating dynamic cluster consensus, a mechanism that allows agents to form and adapt local sub-teams based on resource demands. This decentralized coordination strategy reduces reliance on global information and enhances scalability. We evaluate LGTC-IPPO against standard multi-agent reinforcement learning baselines and a centralized expert solution across a range of team sizes and resource distributions. Experimental results demonstrate that LGTC-IPPO achieves more stable rewards, better coordination, and robust performance even as the number of agents or resource types increases. Additionally, we illustrate how dynamic clustering enables agents to reallocate resources efficiently also for scenarios with discharging resources.

View on arXiv PDF

Similar