LGMar 16

Robust and Computationally Efficient Linear Contextual Bandits under Adversarial Corruption and Heavy-Tailed Noise

arXiv:2603.1559624.7h-index: 6
AI Analysis

This work addresses robustness and efficiency challenges in bandit algorithms for applications like online advertising or recommendation systems, though it is incremental as it builds on existing methods for heavy-tailed noise and corruption.

The paper tackles the problem of linear contextual bandits under adversarial corruption and heavy-tailed noise, proposing a computationally efficient algorithm based on online mirror descent that reduces per-round computational cost from O(t log T) to O(1) and achieves sublinear regret without prior knowledge of noise or corruption parameters.

We study linear contextual bandits under adversarial corruption and heavy-tailed noise with finite $(1+ε)$-th moments for some $ε\in (0,1]$. Existing work that addresses both adversarial corruption and heavy-tailed noise relies on a finite variance (i.e., finite second-moment) assumption and suffers from computational inefficiency. We propose a computationally efficient algorithm based on online mirror descent that achieves robustness to both adversarial corruption and heavy-tailed noise. While the existing algorithm incurs $\mathcal{O}(t\log T)$ computational cost, our algorithm reduces this to $\mathcal{O}(1)$ per round. We establish an additive regret bound consisting of a term depending on the $(1+ε)$-moment bound of the noise and a term depending on the total amount of corruption. In particular, when $ε= 1$, our result recovers existing guarantees under finite-variance assumptions. When no corruption is present, it matches the best-known rates for linear contextual bandits with heavy-tailed noise. Moreover, the algorithm requires no prior knowledge of the noise moment bound or the total amount of corruption and still guarantees sublinear regret.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes