AILGAug 15, 2022

Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care

Harvard
arXiv:2208.07406v330 citationsh-index: 59
Originality Incremental advance
AI Analysis

This work addresses the problem of improving oral hygiene adherence for patients through personalized mobile interventions, but it is incremental as it focuses on reward design within an existing RL framework.

The authors tackled the challenge of designing a reward function for an online reinforcement learning algorithm to optimize mobile prompts for oral self-care, addressing delayed effects and noisy data, and developed a procedure for hyperparameter optimization using a simulation test bed.

Dental disease is one of the most common chronic diseases despite being largely preventable. However, professional advice on optimal oral hygiene practices is often forgotten or abandoned by patients. Therefore patients may benefit from timely and personalized encouragement to engage in oral self-care behaviors. In this paper, we develop an online reinforcement learning (RL) algorithm for use in optimizing the delivery of mobile-based prompts to encourage oral hygiene behaviors. One of the main challenges in developing such an algorithm is ensuring that the algorithm considers the impact of the current action on the effectiveness of future actions (i.e., delayed effects), especially when the algorithm has been made simple in order to run stably and autonomously in a constrained, real-world setting (i.e., highly noisy, sparse data). We address this challenge by designing a quality reward which maximizes the desired health outcome (i.e., high-quality brushing) while minimizing user burden. We also highlight a procedure for optimizing the hyperparameters of the reward by building a simulation environment test bed and evaluating candidates using the test bed. The RL algorithm discussed in this paper will be deployed in Oralytics, an oral self-care app that provides behavioral strategies to boost patient engagement in oral hygiene practices.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes