LG OCJan 27, 2021

Adversaries in Online Learning Revisited: with applications in Robust Optimization and Adversarial training

arXiv:2101.11443v11.6h-index: 23

Originality Incremental advance

AI Analysis

It addresses confusion in applying online learning results to robust optimization and adversarial training, offering a general method for these problems, though it appears incremental as it builds on prior work like arXiv:1402.6361.

The paper clarifies the concept of adversaries in online learning, distinguishing between anticipative and non-anticipative types, and applies this to robust optimization and adversarial training using an imaginary play approach to obtain approximately robust solutions.

We revisit the concept of "adversary" in online learning, motivated by solving robust optimization and adversarial training using online learning methods. While one of the classical setups in online learning deals with the "adversarial" setup, it appears that this concept is used less rigorously, causing confusion in applying results and insights from online learning. Specifically, there are two fundamentally different types of adversaries, depending on whether the "adversary" is able to anticipate the exogenous randomness of the online learning algorithms. This is particularly relevant to robust optimization and adversarial training because the adversarial sequences are often anticipative, and many online learning algorithms do not achieve diminishing regret in such a case. We then apply this to solving robust optimization problems or (equivalently) adversarial training problems via online learning and establish a general approach for a large variety of problem classes using imaginary play. Here two players play against each other, the primal player playing the decisions and the dual player playing realizations of uncertain data. When the game terminates, the primal player has obtained an approximately robust solution. This meta-game allows for solving a large variety of robust optimization and multi-objective optimization problems and generalizes the approach of arXiv:1402.6361.

View on arXiv PDF

Similar