MALGFeb 23, 2024

Open Ad Hoc Teamwork with Cooperative Game Theory

arXiv:2402.15259v57 citationsh-index: 11Has CodeICML
Originality Incremental advance
AI Analysis

This work addresses the challenge of flexible multi-agent collaboration in dynamic environments, representing an incremental advance over existing graph-based methods.

The paper tackles the problem of open ad hoc teamwork, where an agent must collaborate with a changing number of teammates without prior coordination, by establishing a new theory based on cooperative game theory to explain joint Q-value representations and proposing the CIAO algorithm, which demonstrates improved performance in experiments.

Ad hoc teamwork poses a challenging problem, requiring the design of an agent to collaborate with teammates without prior coordination or joint training. Open ad hoc teamwork (OAHT) further complicates this challenge by considering environments with a changing number of teammates, referred to as open teams. One promising solution in practice to this problem is leveraging the generalizability of graph neural networks to handle an unrestricted number of agents with various agent-types, named graph-based policy learning (GPL). However, its joint Q-value representation over a coordination graph lacks convincing explanations. In this paper, we establish a new theory to understand the representation of the joint Q-value for OAHT and its learning paradigm, through the lens of cooperative game theory. Building on our theory, we propose a novel algorithm named CIAO, based on GPL's framework, with additional provable implementation tricks that can facilitate learning. The demos of experimental results are available on https://sites.google.com/view/ciao2024, and the code of experiments is published on https://github.com/hsvgbkhgbv/CIAO.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes