LGApr 29, 2016

MetaGrad: Multiple Learning Rates in Online Learning

arXiv:1604.08740v3109 citations
Originality Incremental advance
AI Analysis

This work addresses the need for more versatile adaptive algorithms in online convex optimization, offering incremental improvements by extending adaptability beyond previous methods.

The authors tackled the problem of designing adaptive online learning methods that automatically achieve fast regret rates across diverse function classes without manual tuning, and introduced MetaGrad, which adapts to a broader range including exp-concave, strongly convex, and stochastic functions, achieving logarithmic regret in cases like the unregularized hinge loss with favorable data distributions.

In online convex optimization it is well known that certain subclasses of objective functions are much easier than arbitrary convex functions. We are interested in designing adaptive methods that can automatically get fast rates in as many such subclasses as possible, without any manual tuning. Previous adaptive methods are able to interpolate between strongly convex and general convex functions. We present a new method, MetaGrad, that adapts to a much broader class of functions, including exp-concave and strongly convex functions, but also various types of stochastic and non-stochastic functions without any curvature. For instance, MetaGrad can achieve logarithmic regret on the unregularized hinge loss, even though it has no curvature, if the data come from a favourable probability distribution. MetaGrad's main feature is that it simultaneously considers multiple learning rates. Unlike previous methods with provable regret guarantees, however, its learning rates are not monotonically decreasing over time and are not tuned based on a theoretically derived bound on the regret. Instead, they are weighted directly proportional to their empirical performance on the data using a tilted exponential weights master algorithm.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes