Katie Witkiewitz

2.3MEApr 25, 2017

Sufficient Markov Decision Processes with Alternating Deep Neural Networks

Longshaokan Wang, Eric B. Laber, Katie Witkiewitz

Advances in mobile computing technologies have made it possible to monitor and apply data-driven interventions across complex systems in real time. Markov decision processes (MDPs) are the primary model for sequential decision problems with a large or indefinite time horizon. Choosing a representation of the underlying decision process that is both Markov and low-dimensional is non-trivial. We propose a method for constructing a low-dimensional representation of the original decision process for which: 1. the MDP model holds; 2. a decision strategy that maximizes mean utility when applied to the low-dimensional representation also maximizes mean utility when applied to the original process. We use a deep neural network to define a class of potential process representations and estimate the process of lowest dimension within this class. The method is illustrated using data from a mobile study on heavy drinking and smoking among college students.

12.3MLJul 18, 2016

A Batch, Off-Policy, Actor-Critic Algorithm for Optimizing the Average Reward

S. A. Murphy, Y. Deng, E. B. Laber et al.

We develop an off-policy actor-critic algorithm for learning an optimal policy from a training set composed of data from multiple individuals. This algorithm is developed with a view towards its use in mobile health.

Katie Witkiewitz

2 Papers