AIJul 21, 2024

Explaining Decisions of Agents in Mixed-Motive Games

arXiv:2407.15255v34 citationsh-index: 8
Originality Incremental advance
AI Analysis

This addresses the challenge of understanding AI agents in complex social dilemmas for users and researchers, but it is incremental as it builds on existing explainable AI methods for cooperative environments.

The paper tackled the problem of explaining agent decision-making in mixed-motive games involving both cooperation and competition, by designing new explanation methods and demonstrating their effectiveness in games like no-press Diplomacy and a prisoner's dilemma variant, showing usefulness for humans.

In recent years, agents have become capable of communicating seamlessly via natural language and navigating in environments that involve cooperation and competition, a fact that can introduce social dilemmas. Due to the interleaving of cooperation and competition, understanding agents' decision-making in such environments is challenging, and humans can benefit from obtaining explanations. However, such environments and scenarios have rarely been explored in the context of explainable AI. While some explanation methods for cooperative environments can be applied in mixed-motive setups, they do not address inter-agent competition, cheap-talk, or implicit communication by actions. In this work, we design explanation methods to address these issues. Then, we proceed to establish generality and demonstrate the applicability of the methods to three games with vastly different properties. Lastly, we demonstrate the effectiveness and usefulness of the methods for humans in two mixed-motive games. The first is a challenging 7-player game called no-press Diplomacy. The second is a 3-player game inspired by the prisoner's dilemma, featuring communication in natural language.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes