LGOCMLApr 27

A Modern Introduction to Online Learning

arXiv:1912.1321318.9537 citationsh-index: 42
Predicted impact top 45% in LG · last 90 daysOriginality Synthesis-oriented
AI Analysis

For researchers and students new to online learning, this book offers a clear, self-contained introduction to the field, though it is a pedagogical contribution rather than a novel research advance.

This book provides a modern introduction to online learning through the lens of online convex optimization, covering algorithms like Online Mirror Descent and Follow-The-Regularized-Leader, with a focus on parameter tuning, adaptive methods, and applications to non-convex losses, bandits, and advanced topics. It aims to present the material in an accessible yet mathematically rigorous manner.

In this book, I introduce the basic concepts of Online Learning through the modern view of Online Convex Optimization. Here, online learning refers to the framework of regret minimization under worst-case assumptions. I present first-order and second-order algorithms for online learning with convex losses, in Euclidean and non-Euclidean settings. All the algorithms are clearly presented as instantiation of Online Mirror Descent or Follow-The-Regularized-Leader and their variants. Particular attention is given to the issue of tuning the parameters of the algorithms and learning in unbounded domains, through adaptive and parameter-free online learning algorithms. Non-convex losses are addressed through convex surrogate losses and randomization. The bandit setting is also briefly discussed, touching on the problem of adversarial and stochastic multi-armed bandits. Finally, I also cover advanced topics, including black-box reductions, saddle-point optimization, sequential investment, and non-stationary forms of regret analysis. The book concludes with a selection of applications of online learning to domains far from it, such as generalization theory and concentration inequalities. I tried to maintain an informal, but mathematically serious, tone throughout the book. No prior knowledge of convex analysis is required. Moreover, all the included proofs have been carefully chosen to be as simple and as short as possible. This also means that sometimes I have added one or two additional assumptions, just to simplify the proofs.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes