MLLGNov 24, 2021

One More Step Towards Reality: Cooperative Bandits with Imperfect Communication

arXiv:2111.12482v129 citations
Originality Highly original
AI Analysis

This addresses the problem of large-scale decision-making in real-world distributed settings where communication is imperfect, representing an incremental advance over prior work focused on perfect communication.

The paper tackles cooperative bandit learning under imperfect communication scenarios, such as stochastic networks and adversarial corruptions, by proposing decentralized algorithms that achieve competitive performance with near-optimal group regret guarantees, and also presents an improved algorithm for perfect communication that outperforms existing state-of-the-art methods.

The cooperative bandit problem is increasingly becoming relevant due to its applications in large-scale decision-making. However, most research for this problem focuses exclusively on the setting with perfect communication, whereas in most real-world distributed settings, communication is often over stochastic networks, with arbitrary corruptions and delays. In this paper, we study cooperative bandit learning under three typical real-world communication scenarios, namely, (a) message-passing over stochastic time-varying networks, (b) instantaneous reward-sharing over a network with random delays, and (c) message-passing with adversarially corrupted rewards, including byzantine communication. For each of these environments, we propose decentralized algorithms that achieve competitive performance, along with near-optimal guarantees on the incurred group regret as well. Furthermore, in the setting with perfect communication, we present an improved delayed-update algorithm that outperforms the existing state-of-the-art on various network topologies. Finally, we present tight network-dependent minimax lower bounds on the group regret. Our proposed algorithms are straightforward to implement and obtain competitive empirical performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes