AIMAJan 7, 2021

Attention Actor-Critic algorithm for Multi-Agent Constrained Co-operative Reinforcement Learning

arXiv:2101.02349v122 citations
Originality Incremental advance
AI Analysis

This work provides an incremental improvement for multi-agent reinforcement learning practitioners who need to handle action constraints in co-operative settings.

This paper addresses multi-agent co-operative reinforcement learning where agents must optimize a common goal while satisfying action constraints. The authors extend an attention-based Actor-Critic algorithm to this constrained setting by incorporating different attention modes for goal optimization and constraint satisfaction, which they show improves performance on benchmark environments.

In this work, we consider the problem of computing optimal actions for Reinforcement Learning (RL) agents in a co-operative setting, where the objective is to optimize a common goal. However, in many real-life applications, in addition to optimizing the goal, the agents are required to satisfy certain constraints specified on their actions. Under this setting, the objective of the agents is to not only learn the actions that optimize the common objective but also meet the specified constraints. In recent times, the Actor-Critic algorithm with an attention mechanism has been successfully applied to obtain optimal actions for RL agents in multi-agent environments. In this work, we extend this algorithm to the constrained multi-agent RL setting. The idea here is that optimizing the common goal and satisfying the constraints may require different modes of attention. By incorporating different attention modes, the agents can select useful information required for optimizing the objective and satisfying the constraints separately, thereby yielding better actions. Through experiments on benchmark multi-agent environments, we show the effectiveness of our proposed algorithm.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes