MLLGDec 11, 2023

RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions

arXiv:2312.06403v45 citationsh-index: 18NIPS
Originality Highly original
AI Analysis

This work addresses optimization problems for mobile health interventions, offering a robust solution to improve algorithm performance in real-world settings.

The paper tackles challenges like participant heterogeneity and nonstationarity in mobile health interventions by proposing RoME, a robust mixed-effects contextual bandit algorithm, which achieves robust regret bounds and demonstrates superior performance in simulations and off-policy evaluations.

Mobile health leverages personalized and contextually tailored interventions optimized through bandit and reinforcement learning algorithms. In practice, however, challenges such as participant heterogeneity, nonstationarity, and nonlinear relationships hinder algorithm performance. We propose RoME, a Robust Mixed-Effects contextual bandit algorithm that simultaneously addresses these challenges via (1) modeling the differential reward with user- and time-specific random effects, (2) network cohesion penalties, and (3) debiased machine learning for flexible estimation of baseline rewards. We establish a high-probability regret bound that depends solely on the dimension of the differential-reward model, enabling us to achieve robust regret bounds even when the baseline reward is highly complex. We demonstrate the superior performance of the RoME algorithm in a simulation and two off-policy evaluation studies.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes