LG MLJun 19, 2020

Open Problem: Model Selection for Contextual Bandits

Dylan J. Foster, Akshay Krishnamurthy, Haipeng Luo

arXiv:2006.10940v111.119 citations

Originality Synthesis-oriented

AI Analysis

This addresses a theoretical gap in contextual bandit algorithms for researchers, but it is an open problem with no incremental progress reported.

The paper investigates whether model selection guarantees, which adapt to the best hypothesis class complexity in statistical learning, can be extended to contextual bandit learning, but does not provide specific results or numbers.

In statistical learning, algorithms for model selection allow the learner to adapt to the complexity of the best hypothesis class in a sequence. We ask whether similar guarantees are possible for contextual bandit learning.

View on arXiv PDF

Similar