MLLGSep 5, 2019

An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits

arXiv:1909.02251v26 citations
Originality Incremental advance
AI Analysis

This work solves a critical practical problem in sequential decision-making for applications with clustered data, such as recommender systems, though it is incremental as it builds on existing algorithms with a new technique.

The paper addresses the poor performance of existing combinatorial linear semi-bandit algorithms in clustered feature vector scenarios, common in applications like recommender systems, by introducing an arm-wise randomization technique. The proposed algorithms, PC²UCB and Thompson sampling, outperform existing methods in empirical evaluations on artificial and real-world datasets, with theoretical analyses providing high probability asymptotic regret bounds.

Combinatorial linear semi-bandits (CLS) are widely applicable frameworks of sequential decision-making, in which a learner chooses a subset of arms from a given set of arms associated with feature vectors. Existing algorithms work poorly for the clustered case, in which the feature vectors form several large clusters. This shortcoming is critical in practice because it can be found in many applications, including recommender systems. In this paper, we clarify why such a shortcoming occurs, and we introduce a key technique of arm-wise randomization to overcome it. We propose two algorithms with this technique: the perturbed C${}^2$UCB (PC${}^2$UCB) and the Thompson sampling (TS). Our empirical evaluation with artificial and real-world datasets demonstrates that the proposed algorithms with the arm-wise randomization technique outperform the existing algorithms without this technique, especially for the clustered case. Our contributions also include theoretical analyses that provide high probability asymptotic regret bounds for our algorithms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes