LGAIROMLOct 30, 2019

Safe Exploration for Interactive Machine Learning

arXiv:1910.13726v196 citations
Originality Incremental advance
AI Analysis

This addresses the need for safe exploration in real-world IML applications, such as Bayesian optimization and MDPs, by improving sample efficiency, though it is incremental as it builds on existing methods.

The paper tackles the problem of ensuring safety in Interactive Machine Learning (IML) by introducing a framework that makes any unsafe IML algorithm safe, focusing on learning safety only when necessary for the objective, and it outperforms other algorithms empirically.

In Interactive Machine Learning (IML), we iteratively make decisions and obtain noisy observations of an unknown function. While IML methods, e.g., Bayesian optimization and active learning, have been successful in applications, on real-world systems they must provably avoid unsafe decisions. To this end, safe IML algorithms must carefully learn about a priori unknown constraints without making unsafe decisions. Existing algorithms for this problem learn about the safety of all decisions to ensure convergence. This is sample-inefficient, as it explores decisions that are not relevant for the original IML objective. In this paper, we introduce a novel framework that renders any existing unsafe IML algorithm safe. Our method works as an add-on that takes suggested decisions as input and exploits regularity assumptions in terms of a Gaussian process prior in order to efficiently learn about their safety. As a result, we only explore the safe set when necessary for the IML problem. We apply our framework to safe Bayesian optimization and to safe exploration in deterministic Markov Decision Processes (MDP), which have been analyzed separately before. Our method outperforms other algorithms empirically.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes