ML LGJul 13, 2020

Relaxing the I.I.D. Assumption: Adaptively Minimax Optimal Regret via Root-Entropic Regularization

Blair Bilodeau, Jeffrey Negrea, Daniel M. Roy

arXiv:2007.06552v39.011 citations

Originality Highly original

AI Analysis

This work addresses the challenge of robust online learning for scenarios where data distributions are not i.i.d. but vary within unknown constraints, providing a foundational advance for adaptive algorithms in machine learning.

The paper tackles the problem of prediction with expert advice under semi-adversarial data distributions that vary arbitrarily within unknown constraint sets, bridging i.i.d. and adversarial extremes. It achieves adaptively minimax optimal regret at all constraint levels using a novel root-entropic regularization scheme in the FTRL framework, with matching upper and lower bounds.

We consider prediction with expert advice when data are generated from distributions varying arbitrarily within an unknown constraint set. This semi-adversarial setting includes (at the extremes) the classical i.i.d. setting, when the unknown constraint set is restricted to be a singleton, and the unconstrained adversarial setting, when the constraint set is the set of all distributions. The Hedge algorithm -- long known to be minimax (rate) optimal in the adversarial regime -- was recently shown to be simultaneously minimax optimal for i.i.d. data. In this work, we propose to relax the i.i.d. assumption by seeking adaptivity at all levels of a natural ordering on constraint sets. We provide matching upper and lower bounds on the minimax regret at all levels, show that Hedge with deterministic learning rates is suboptimal outside of the extremes, and prove that one can adaptively obtain minimax regret at all levels. We achieve this optimal adaptivity using the follow-the-regularized-leader (FTRL) framework, with a novel adaptive regularization scheme that implicitly scales as the square root of the entropy of the current predictive distribution, rather than the entropy of the initial predictive distribution. Finally, we provide novel technical tools to study the statistical performance of FTRL along the semi-adversarial spectrum.

View on arXiv PDF

Similar