LGMLJan 24, 2019

Dynamic Measurement Scheduling for Event Forecasting using Deep RL

arXiv:1901.09699v320 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of efficient and effective monitoring for critical care patients, representing an incremental improvement over existing heuristic or physician-based scheduling methods.

The paper tackles the problem of scheduling medical measurements for event forecasting under budget constraints by using deep reinforcement learning to jointly minimize cost and maximize predictive gain, achieving a 31% reduction in measurements or a 3x improvement in predictive gain compared to physicians in a real-world ICU task.

Imagine a patient in critical condition. What and when should be measured to forecast detrimental events, especially under the budget constraints? We answer this question by deep reinforcement learning (RL) that jointly minimizes the measurement cost and maximizes predictive gain, by scheduling strategically-timed measurements. We learn our policy to be dynamically dependent on the patient's health history. To scale our framework to exponentially large action space, we distribute our reward in a sequential setting that makes the learning easier. In our simulation, our policy outperforms heuristic-based scheduling with higher predictive gain and lower cost. In a real-world ICU mortality prediction task (MIMIC3), our policies reduce the total number of measurements by $31\%$ or improve predictive gain by a factor of $3$ as compared to physicians, under the off-policy policy evaluation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes