MAAIJun 3, 2024

Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents

arXiv:2406.01641v35 citationsHas Code
AI Analysis

This addresses the challenge of achieving cooperation in multi-agent systems for applications like robotics or game theory, but it is incremental as it builds on opponent shaping methods with a more practical approach.

The paper tackles the problem of promoting cooperation among self-interested AI agents in social dilemmas, where naive reinforcement learning often leads to poor outcomes, by introducing Reciprocators that intrinsically motivate agents to reciprocate opponents' influence, resulting in improved cooperation during simultaneous learning.

Cooperation between self-interested individuals is a widespread phenomenon in the natural world, but remains elusive in interactions between artificially intelligent agents. Instead, naive reinforcement learning algorithms typically converge to Pareto-dominated outcomes in even the simplest of social dilemmas. An emerging literature on opponent shaping has demonstrated the ability to reach prosocial outcomes by influencing the learning of other agents. However, such methods differentiate through the learning step of other agents or optimize for meta-game dynamics, which rely on privileged access to opponents' learning algorithms or exponential sample complexity, respectively. To provide a learning rule-agnostic and sample-efficient alternative, we introduce Reciprocators, reinforcement learning agents which are intrinsically motivated to reciprocate the influence of opponents' actions on their returns. This approach seeks to modify other agents' $Q$-values by increasing their return following beneficial actions (with respect to the Reciprocator) and decreasing it after detrimental actions, guiding them towards mutually beneficial actions without directly differentiating through a model of their policy. We show that Reciprocators can be used to promote cooperation in temporally extended social dilemmas during simultaneous learning. Our code is available at https://github.com/johnlyzhou/reciprocator/.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes