LG OCAug 7, 2022

Optimal Tracking in Prediction with Expert Advice

arXiv:2208.03708v13.31 citationsh-index: 29

Originality Highly original

AI Analysis

This work solves the problem of optimal tracking in online learning for researchers and practitioners, offering a foundational advancement with universal and adaptive guarantees.

The paper tackles the prediction with expert advice problem by developing an algorithm that achieves min-max optimal dynamic regret against time-varying combinations of expert decisions, with no prior information like time horizon or loss ranges, and provides universal guarantees with logarithmic complexity.

We study the prediction with expert advice setting, where the aim is to produce a decision by combining the decisions generated by a set of experts, e.g., independently running algorithms. We achieve the min-max optimal dynamic regret under the prediction with expert advice setting, i.e., we can compete against time-varying (not necessarily fixed) combinations of expert decisions in an optimal manner. Our end-algorithm is truly online with no prior information, such as the time horizon or loss ranges, which are commonly used by different algorithms in the literature. Both our regret guarantees and the min-max lower bounds are derived with the general consideration that the expert losses can have time-varying properties and are possibly unbounded. Our algorithm can be adapted for restrictive scenarios regarding both loss feedback and decision making. Our guarantees are universal, i.e., our end-algorithm can provide regret guarantee against any competitor sequence in a min-max optimal manner with logarithmic complexity. Note that, to our knowledge, for the prediction with expert advice problem, our algorithms are the first to produce such universally optimal, adaptive and truly online guarantees with no prior knowledge.

View on arXiv PDF

Similar