LGOct 17, 2024

On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow

Tonghan Wang, Heng Dong, Yanchen Jiang, David C. Parkes, Milind Tambe

HarvardTsinghua

arXiv:2410.13953v310.44 citationsh-index: 14AAMAS

Originality Incremental advance

AI Analysis

This work addresses the problem of partial observability in multi-agent systems for researchers in AI and robotics, offering a theoretical framework with error bounds and convergence guarantees, though it appears incremental by building on existing diffusion model approaches.

The paper tackles the challenge of reconstructing global states from local action-observation histories in decentralized POMDPs with partial observability, using diffusion models to show that in collectively observable settings, agents share a unique fixed point corresponding to the true state, and proposes a composite diffusion process with theoretical convergence guarantees.

Multiagent systems grapple with partial observability (PO), and the decentralized POMDP (Dec-POMDP) model highlights the fundamental nature of this challenge. Whereas recent approaches to addressing PO have appealed to deep learning models, providing a rigorous understanding of how these models and their approximation errors affect agents' handling of PO and their interactions remain a challenge. In addressing this challenge, we investigate reconstructing global states from local action-observation histories in Dec-POMDPs using diffusion models. We first find that diffusion models conditioned on local history represent possible states as stable fixed points. In collectively observable (CO) Dec-POMDPs, individual diffusion models conditioned on agents' local histories share a unique fixed point corresponding to the global state, while in non-CO settings, shared fixed points yield a distribution of possible states given joint history. We further find that, with deep learning approximation errors, fixed points can deviate from true states and the deviation is negatively correlated to the Jacobian rank. Inspired by this low-rank property, we bound a deviation by constructing a surrogate linear regression model that approximates the local behavior of a diffusion model. With this bound, we propose a \emph{composite diffusion process} iterating over agents with theoretical convergence guarantees to the true state.

View on arXiv PDF

Similar