AIDec 5, 2022

E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance

Stanford
arXiv:2212.02064v111 citationsh-index: 77
Originality Highly original
AI Analysis

This addresses the problem of inefficient cooperation and planning in multi-agent systems for researchers and practitioners in MARL, representing a novel method rather than an incremental improvement.

The paper tackles the challenge of enabling multiple agents to efficiently cooperate on complex, long-horizon tasks in multi-agent reinforcement learning by proposing E-MAPP, a framework that uses parallel programs to guide agents, resulting in outperforming baselines in completion rate, time efficiency, and zero-shot generalization.

A critical challenge in multi-agent reinforcement learning(MARL) is for multiple agents to efficiently accomplish complex, long-horizon tasks. The agents often have difficulties in cooperating on common goals, dividing complex tasks, and planning through several stages to make progress. We propose to address these challenges by guiding agents with programs designed for parallelization, since programs as a representation contain rich structural and semantic information, and are widely used as abstractions for long-horizon tasks. Specifically, we introduce Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance(E-MAPP), a novel framework that leverages parallel programs to guide multiple agents to efficiently accomplish goals that require planning over $10+$ stages. E-MAPP integrates the structural information from a parallel program, promotes the cooperative behaviors grounded in program semantics, and improves the time efficiency via a task allocator. We conduct extensive experiments on a series of challenging, long-horizon cooperative tasks in the Overcooked environment. Results show that E-MAPP outperforms strong baselines in terms of the completion rate, time efficiency, and zero-shot generalization ability by a large margin.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes