RO AIMay 25

Acting on the Unseen: Communication-Free Collaborative Filtering for Decentralized Multi-Robot Task Allocation

Alexander Apartsin, Yigal Meshulam, Yehudit Aperstein

arXiv:2605.2558423.8

Predicted impact top 58% in RO · last 90 daysOriginality Highly original

AI Analysis

This work addresses the practical problem of decentralized multi-robot task allocation under extreme information constraints, providing theoretical guarantees and empirical validation for a novel algorithm.

The paper introduces Zero-Knowledge MRTA, a decentralized multi-robot task allocation setting with no communication, no prior knowledge, and only partial noisy observations. They propose SwarmCF, an online low-rank collaborative filtering algorithm that enables robots to perform well on unseen tasks, achieving a per-robot sample complexity of Θ(d) versus Θ(n) for structure-free learners, and recovering about 80% of the centralized full-communication ceiling.

Multi-robot task allocation usually assumes some combination of communication, known task models, or a coordinator. We study the opposite extreme, a regime common in practice but overlooked in theory, which we name Zero-Knowledge MRTA (ZK-MRTA): a robot team with no prior knowledge (no task models, not even the latent rank), no communication (no messages, no parameter sharing, no coordinator), and only a partial and privately-noisy view of a public stream of teammates' outcomes. A hidden low-rank structure governs which robot suits which task, and there are far more tasks than rounds, so most (robot, task) pairs are never attempted. Yet each robot can act well on tasks it never attempted, and onboard new tasks, by running online low-rank collaborative filtering over the broadcast (SwarmCF). The advantage over any structure-free learner is categorical, not a constant factor: a structure-free learner is provably at the prior-mean error floor on unseen pairs. We prove a matching per-robot sample complexity (Θ(d) versus Θ(n), in the rank d and the task count n), an anytime (cumulative-reward) separation under task scarcity, and a deterministic condition under which decentralized recovery from the masked broadcast is exact (validated empirically). Experiments quantify the value of the broadcast, a positive scaling law (per-robot unseen-pair skill rises with team size), and the strongest masking-robustness and anytime profile among low-rank methods, recovering most (about 80% on earned skill) of a centralized full-communication ceiling, and holding under capacity-1 contention and in a robotics-grounded sensing instance.

View on arXiv PDF

Similar