LGAIPRMLAug 26, 2024

Representative Arm Identification: A fixed confidence approach to identify cluster representatives

arXiv:2408.14195v12 citationsh-index: 4
Originality Incremental advance
AI Analysis

This addresses a general clustering problem in bandit theory, with applications in ranking and selection, but is incremental as it builds on existing confidence interval methods.

The paper tackles the representative arm identification problem in multi-armed bandits, aiming to identify a specified number of arms from each cluster with minimal arm pulls, and provides algorithms with sample complexity bounds that match a lower bound, showing superior performance in empirical tests.

We study the representative arm identification (RAI) problem in the multi-armed bandits (MAB) framework, wherein we have a collection of arms, each associated with an unknown reward distribution. An underlying instance is defined by a partitioning of the arms into clusters of predefined sizes, such that for any $j > i$, all arms in cluster $i$ have a larger mean reward than those in cluster $j$. The goal in RAI is to reliably identify a certain prespecified number of arms from each cluster, while using as few arm pulls as possible. The RAI problem covers as special cases several well-studied MAB problems such as identifying the best arm or any $M$ out of the top $K$, as well as both full and coarse ranking. We start by providing an instance-dependent lower bound on the sample complexity of any feasible algorithm for this setting. We then propose two algorithms, based on the idea of confidence intervals, and provide high probability upper bounds on their sample complexity, which orderwise match the lower bound. Finally, we do an empirical comparison of both algorithms along with an LUCB-type alternative on both synthetic and real-world datasets, and demonstrate the superior performance of our proposed schemes in most cases.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes