LG IT OC MLApr 12, 2018

Online convex optimization and no-regret learning: Algorithms, guarantees and applications

E. Veronica Belmega, Panayotis Mertikopoulos, Romain Negrel, Luca Sanguinetti

arXiv:1804.04529v18.745 citations

Originality Synthesis-oriented

AI Analysis

It offers a gentle introduction to online optimization tools for applications in signal processing, data mining, and wireless communications, but is incremental as it reviews existing methods.

This tutorial paper introduces online optimization and learning algorithms that achieve no-regret performance, approaching the performance of an ideal algorithm with full future knowledge, and provides examples from metric learning to wireless resource allocation.

Spurred by the enthusiasm surrounding the "Big Data" paradigm, the mathematical and algorithmic tools of online optimization have found widespread use in problems where the trade-off between data exploration and exploitation plays a predominant role. This trade-off is of particular importance to several branches and applications of signal processing, such as data mining, statistical inference, multimedia indexing and wireless communications (to name but a few). With this in mind, the aim of this tutorial paper is to provide a gentle introduction to online optimization and learning algorithms that are asymptotically optimal in hindsight - i.e., they approach the performance of a virtual algorithm with unlimited computational power and full knowledge of the future, a property known as no-regret. Particular attention is devoted to identifying the algorithms' theoretical performance guarantees and to establish links with classic optimization paradigms (both static and stochastic). To allow a better understanding of this toolbox, we provide several examples throughout the tutorial ranging from metric learning to wireless resource allocation problems.

View on arXiv PDF

Similar