MA · AI · LG · Jun 18, 2025

RecBayes: Recurrent Bayesian Ad Hoc Teamwork in Large Partially Observable Domains

arXiv:2506.15756v1 · h-index: 33
Originality: Highly original
AI Analysis

The paper addresses the problem of deploying agents into unknown teams in large, partially observable multi-agent environments. Its solution drops the stricter requirements of prior methods.

The paper tackles ad hoc teamwork under partial observability by proposing RecBayes, a recurrent Bayesian classifier that identifies known teams and tasks from observations alone without requiring environment states or teammate actions. Results show it handles domains scaled up to 1M states and 2^125 observations effectively.

This paper proposes RecBayes, a novel approach for ad hoc teamwork under partial observability, a setting where agents are deployed on the fly into environments where pre-existing teams operate. RecBayes never requires, at any stage, access to the states of the environment or the actions of its teammates. We show that by relying on a recurrent Bayesian classifier trained using past experiences, an ad hoc agent is effectively able to identify known teams and the tasks being performed from observations alone. Recent approaches such as PO-GPL (Gu et al., 2021) and FEAT (Rahman et al., 2023) require, at some stage, fully observable states of the environment, the actions of teammates, or both, while approaches such as ATPO (Ribeiro et al., 2023) require environments small enough to be modelled tabularly (up to 4.8K states and 1.7K observations in that work). In contrast, we show that RecBayes handles arbitrarily large spaces while never relying on either states or teammates' actions. Our results in benchmark domains from the multi-agent systems literature, adapted for partial observability and scaled up to 1M states and 2^125 observations, show that RecBayes is effective at identifying known teams and the tasks being performed from partial observations alone and, as a result, is able to assist the teams in solving the tasks effectively.
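The idea of identifying a known team and task from observations alone can be illustrated with a generic recursive Bayesian belief update over a finite set of candidate team/task models. The sketch below is not the RecBayes architecture: the abstract describes a recurrent classifier trained on past experiences, whereas this example assumes the per-model observation likelihoods are simply given. The names `update_belief` and `identify`, and the numbers used, are hypothetical and chosen only for illustration.

```python
def update_belief(belief, likelihoods):
    """One Bayes step: posterior is proportional to prior times likelihood."""
    unnormalized = [b * l for b, l in zip(belief, likelihoods)]
    total = sum(unnormalized)
    if total == 0.0:
        # The observation is impossible under every candidate model:
        # keep the previous belief rather than dividing by zero.
        return list(belief)
    return [u / total for u in unnormalized]


def identify(prior, likelihood_seq):
    """Fold a sequence of per-step likelihood vectors into a final belief."""
    belief = list(prior)
    for likelihoods in likelihood_seq:
        belief = update_belief(belief, likelihoods)
    return belief


# Two hypothetical team/task models with a uniform prior.
prior = [0.5, 0.5]
# Assumed likelihood of each observation under each model (illustrative values):
# both observations are more probable under model 0 than under model 1.
likelihood_seq = [[0.8, 0.2], [0.8, 0.2]]

posterior = identify(prior, likelihood_seq)
most_likely = max(range(len(posterior)), key=posterior.__getitem__)
print(posterior, most_likely)
```

In this toy setting the belief concentrates on model 0 as consistent observations accumulate. The update never consults environment states or teammate actions, only observation likelihoods, which mirrors the information constraint described in the abstract.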
