LGAIJun 8, 2022

Designing Reinforcement Learning Algorithms for Digital Interventions: Pre-implementation Guidelines

Harvard
arXiv:2206.03944v345 citationsh-index: 59
Originality Synthesis-oriented
AI Analysis

This work addresses practical implementation issues for researchers and practitioners in mobile health and online education, but it is incremental as it adapts an existing framework to a new context.

The paper tackles the challenge of designing reinforcement learning algorithms for digital interventions by extending the PCS framework to guide algorithm development and simulation environment design, illustrated through the Oralytics mobile health study for improving tooth-brushing behaviors.

Online reinforcement learning (RL) algorithms are increasingly used to personalize digital interventions in the fields of mobile health and online education. Common challenges in designing and testing an RL algorithm in these settings include ensuring the RL algorithm can learn and run stably under real-time constraints, and accounting for the complexity of the environment, e.g., a lack of accurate mechanistic models for the user dynamics. To guide how one can tackle these challenges, we extend the PCS (Predictability, Computability, Stability) framework, a data science framework that incorporates best practices from machine learning and statistics in supervised learning (Yu and Kumbier, 2020), to the design of RL algorithms for the digital interventions setting. Further, we provide guidelines on how to design simulation environments, a crucial tool for evaluating RL candidate algorithms using the PCS framework. We illustrate the use of the PCS framework for designing an RL algorithm for Oralytics, a mobile health study aiming to improve users' tooth-brushing behaviors through the personalized delivery of intervention messages. Oralytics will go into the field in late 2022.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes