LGJul 8, 2017

Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces

arXiv:1707.02375v119 citations
Originality Highly original
AI Analysis

This work addresses the challenge of optimizing clinical treatments in large decision spaces, such as spinal cord injury therapies, by introducing a novel dueling bandits approach, though it is incremental in improving upon existing methods for structured spaces.

The paper tackles the problem of sequential decision making in large decision spaces with noisy comparative feedback, proposing the CorrDuel algorithm that leverages low-dimensional correlation structure to reduce cumulative regret, with experimental validation in simulations and a live clinical trial for spinal cord stimulation.

We consider sequential decision making under uncertainty, where the goal is to optimize over a large decision space using noisy comparative feedback. This problem can be formulated as a $K$-armed Dueling Bandits problem where $K$ is the total number of decisions. When $K$ is very large, existing dueling bandits algorithms suffer huge cumulative regret before converging on the optimal arm. This paper studies the dueling bandits problem with a large number of arms that exhibit a low-dimensional correlation structure. Our problem is motivated by a clinical decision making process in large decision space. We propose an efficient algorithm CorrDuel which optimizes the exploration/exploitation tradeoff in this large decision space of clinical treatments. More broadly, our approach can be applied to other sequential decision problems with large and structured decision spaces. We derive regret bounds, and evaluate performance in simulation experiments as well as on a live clinical trial of therapeutic spinal cord stimulation. To our knowledge, this marks the first time an online learning algorithm was applied towards spinal cord injury treatments. Our experimental results show the effectiveness and efficiency of our approach.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes