LG ML

Dec 10, 2014

Generalised Entropy MDPs and Minimax Regret

Emmanouil G. Androulakis, Christos Dimitrakakis

arXiv:1412.3276v1

2 citations

Originality: Incremental advance

AI Analysis

This paper addresses the challenge of prior specification in Bayesian decision-making, but its contribution appears incremental because it builds on existing bandit theory.

The paper tackles the problem of specifying prior beliefs in Bayesian methods by considering worst-case priors, which involves solving a stochastic zero-sum game. It extends results from bandit theory to discover minimax-Bayes policies and discusses their practicality.

Bayesian methods suffer from the problem of how to specify prior beliefs. One interesting idea is to consider worst-case priors. This requires solving a stochastic zero-sum game. In this paper, we extend well-known results from bandit theory in order to discover minimax-Bayes policies and discuss when they are practical.
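To make the idea of a worst-case prior concrete, here is a minimal toy sketch, not the paper's method. It assumes a hypothetical two-policy, two-model utility table (`UTILITY`) and a plain grid search over priors. The helper names `bayes_value` and `worst_case_prior` are illustrative only. Nature picks the prior that minimises the utility the decision maker can secure by best-responding, which is the zero-sum structure the abstract refers to.

```python
# Toy illustration only: the utility table is hypothetical, not taken from the paper.
# Rows are policies, columns are candidate models; entries are expected utilities.
UTILITY = [
    [1.0, 0.0],  # policy 0: good under model 0, bad under model 1
    [0.0, 1.0],  # policy 1: the reverse
]


def bayes_value(p, utility=UTILITY):
    """Best expected utility when model 0 has prior probability p."""
    return max(p * row[0] + (1.0 - p) * row[1] for row in utility)


def worst_case_prior(utility=UTILITY, steps=100):
    """Grid search for the prior that minimises the Bayes-optimal utility."""
    grid = [i / steps for i in range(steps + 1)]
    p_star = min(grid, key=lambda p: bayes_value(p, utility))
    return p_star, bayes_value(p_star, utility)


p_star, value = worst_case_prior()
print(f"worst-case prior on model 0: {p_star}, Bayes value: {value}")
```

For this symmetric table the worst-case prior is uniform: any other prior lets the decision maker favour the more likely model and earn more. In an MDP setting the policy and model spaces are far larger, so a grid search like this would not scale; it only illustrates the min-max structure.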
