LG MLMay 23, 2018

Cleaning up the neighborhood: A full classification for adversarial partial monitoring

arXiv:1805.09247v130 citations

Originality Highly original

AI Analysis

This work provides a foundational classification for adversarial partial monitoring, impacting researchers in online learning and bandit theory.

The authors solved the open problem of classifying all finite adversarial partial monitoring games, completing the work started by Bartok et al. [2014], and introduced a new algorithm for a specific class of games with proven upper and lower regret bounds.

Partial monitoring is a generalization of the well-known multi-armed bandit framework where the loss is not directly observed by the learner. We complete the classification of finite adversarial partial monitoring to include all games, solving an open problem posed by Bartok et al. [2014]. Along the way we simplify and improve existing algorithms and correct errors in previous analyses. Our second contribution is a new algorithm for the class of games studied by Bartok [2013] where we prove upper and lower regret bounds that shed more light on the dependence of the regret on the game structure.

View on arXiv PDF

Similar