LG AIOct 27, 2020

Graph-based Reinforcement Learning for Active Learning in Real Time: An Application in Modeling River Networks

Xiaowei Jia, Beiyu Lin, Jacob Zwart, Jeffrey Sadler, Alison Appling, Samantha Oliver, Jordan Read

arXiv:2010.14000v22.31 citations

Originality Incremental advance

AI Analysis

This addresses the challenge of scarce labeled data in scientific domains like hydrology by enabling real-time decision-making for sensor deployment, though it is incremental as it adapts existing methods to a specific problem.

The paper tackles the problem of efficiently collecting labeled data for ML models in scientific applications by developing a real-time active learning method that selects query samples using spatial and temporal context in a reinforcement learning framework, achieving effective predictions of streamflow and water temperature in the Delaware River Basin with limited data collection budgets.

Effective training of advanced ML models requires large amounts of labeled data, which is often scarce in scientific problems given the substantial human labor and material cost to collect labeled data. This poses a challenge on determining when and where we should deploy measuring instruments (e.g., in-situ sensors) to collect labeled data efficiently. This problem differs from traditional pool-based active learning settings in that the labeling decisions have to be made immediately after we observe the input data that come in a time series. In this paper, we develop a real-time active learning method that uses the spatial and temporal contextual information to select representative query samples in a reinforcement learning framework. To reduce the need for large training data, we further propose to transfer the policy learned from simulation data which is generated by existing physics-based models. We demonstrate the effectiveness of the proposed method by predicting streamflow and water temperature in the Delaware River Basin given a limited budget for collecting labeled data. We further study the spatial and temporal distribution of selected samples to verify the ability of this method in selecting informative samples over space and time.

View on arXiv PDF

Similar