MLLGJun 4

Adaptive Learning Rates with Surrogate Probability for Follow-the-Perturbed-Leader

arXiv:2606.0604354.6
AI Analysis

This work addresses the challenge of achieving adaptive, probability-dependent learning rates in computationally efficient FTPL, which previously lacked such adaptivity, benefiting online learning and bandit algorithm design.

The authors introduce surrogate probability functions to design adaptive learning rates for Follow-the-Perturbed-Leader (FTPL), achieving best-of-both-worlds guarantees for any shape parameter α>1 in bandit problems, generalizing prior results limited to α=2.

Follow-the-regularized-leader framework has shown effectiveness and flexibility in online learning problems, where the choice of learning rates are known to be crucial. Recently, adaptive learning rates defined in terms of the arm-selection probabilities, obtained by solving convex optimization, have achieved improved best-of-both-worlds (BOBW) guarantees in various bandit problems. In contrast, BOBW guarantees for its computationally efficient alternative, follow-the-perturbed-leader (FTPL), remain relatively limited since its optimization-free nature ironically makes the design of adaptive, probability-dependent learning rates non-trivial. To address this challenge, we propose an adaptive learning rate for FTPL by introducing surrogate probability functions that can be computed only from the available quantities, without requiring the exact probabilities. Based on these learning rates with surrogate functions, we provide the BOBW guarantee for FTPL with Pareto perturbations for any shape parameter $α>1$, generalizing prior results restricted to specific choices of $α=2$. We further show the BOBW guarantees for FTPL with adaptive learning rates in the bandit problem with expert advices. Our approach preserves the computational simplicity of FTPL while enabling probability-dependent adaptivity, and the surrogate-based methodology may be of independent interest in other algorithmic frameworks beyond FTPL and learning rate designs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes