LGJun 3

Enhancing the MADDPG Algorithm for Multi-Agent Learning via Action Inference and Importance Sampling

Marc Walden, Jason Liu, Shaashwath Sivakumar, Ryan Liu, Hamza Khan

arXiv:2606.0502169.5Has Code

Predicted impact top 26% in LG · last 90 daysOriginality Synthesis-oriented

AI Analysis

For researchers in multi-agent reinforcement learning, this work offers incremental improvements to a known algorithm.

The paper enhances the MADDPG algorithm with Action Inference and importance sampling, achieving improved learning stability and exploration efficiency on the Predator-Prey task.

We investigate multi-agent deep reinforcement learning and propose two enhancements to the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm. First, we introduce a novel Action Inference mechanism that enables each agent to predict other agents' intended actions, thereby improving the accuracy and stability of its own policy. Second, we apply an importance sampling strategy, using geometric distribution, in the replay buffer to prioritize more recent and informative experiences, which helps mitigate the non-stationarity inherent in multi-agent environments. We evaluate both modifications on the discrete-action Predator-Prey task provided by the PettingZoo library, a flexible Python interface for general multi-agent reinforcement learning benchmarks. Our results indicate that Action Inference is effective in improving learning stability and inter-agent cooperation and that importance sampling using geometric distribution can lead to significant improvements in exploration efficiency over standard MADDPG. Code available at https://github.com/shaashwathsivakumar/MARL_Proj

View on arXiv PDF Code

Similar