ROAILGMAOct 5, 2021

Influencing Towards Stable Multi-Agent Interactions

arXiv:2110.08229v141 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of learning in dynamic multi-agent settings, offering a novel approach to reduce instability, though it appears incremental as it builds on existing methods for influence and representation learning.

The paper tackles the problem of non-stationarity in multi-agent learning by proposing an algorithm that proactively influences other agents to stabilize their strategies, improving task efficiency in simulated environments like autonomous driving and robotic manipulation.

Learning in multi-agent environments is difficult due to the non-stationarity introduced by an opponent's or partner's changing behaviors. Instead of reactively adapting to the other agent's (opponent or partner) behavior, we propose an algorithm to proactively influence the other agent's strategy to stabilize -- which can restrain the non-stationarity caused by the other agent. We learn a low-dimensional latent representation of the other agent's strategy and the dynamics of how the latent strategy evolves with respect to our robot's behavior. With this learned dynamics model, we can define an unsupervised stability reward to train our robot to deliberately influence the other agent to stabilize towards a single strategy. We demonstrate the effectiveness of stabilizing in improving efficiency of maximizing the task reward in a variety of simulated environments, including autonomous driving, emergent communication, and robotic manipulation. We show qualitative results on our website: https://sites.google.com/view/stable-marl/.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes