ML LGMay 22, 2024

Symmetric Linear Bandits with Hidden Symmetry

Nam Phuong Tran, The Anh Ta, Debmalya Mandal, Long Tran-Thanh

arXiv:2405.13899v25.51 citationsh-index: 14Has CodeNIPS

Originality Incremental advance

AI Analysis

This addresses the challenge of learning hidden symmetries in bandit settings for applications where sparsity assumptions may not hold, representing an incremental advance in structured bandit algorithms.

The paper tackles the problem of high-dimensional linear bandits with hidden symmetry, where the reward is invariant under transformations but the symmetry is unknown, by developing an algorithm based on model selection that achieves a regret bound of O(d_0^{2/3} T^{2/3} log(d)), and improves to O(d_0 sqrt(T log(d))) under an extra assumption.

High-dimensional linear bandits with low-dimensional structure have received considerable attention in recent studies due to their practical significance. The most common structure in the literature is sparsity. However, it may not be available in practice. Symmetry, where the reward is invariant under certain groups of transformations on the set of arms, is another important inductive bias in the high-dimensional case that covers many standard structures, including sparsity. In this work, we study high-dimensional symmetric linear bandits where the symmetry is hidden from the learner, and the correct symmetry needs to be learned in an online setting. We examine the structure of a collection of hidden symmetry and provide a method based on model selection within the collection of low-dimensional subspaces. Our algorithm achieves a regret bound of $ O(d_0^{2/3} T^{2/3} \log(d))$, where $d$ is the ambient dimension which is potentially very large, and $d_0$ is the dimension of the true low-dimensional subspace such that $d_0 \ll d$. With an extra assumption on well-separated models, we can further improve the regret to $ O(d_0\sqrt{T\log(d)} )$.

View on arXiv PDF Code

Similar