LG AI RO MLOct 7, 2019

A Learnable Safety Measure

Steve Heim, Alexander von Rohr, Sebastian Trimpe, Alexander Badri-Spröwitz

arXiv:1910.02835v17.716 citationsh-index: 24Has Code

Originality Incremental advance

AI Analysis

This work addresses safety constraints in reinforcement learning for physical systems, which is an incremental improvement over existing methods that require extensive prior knowledge.

The paper tackles the problem of ensuring safety in learning-based control of physical systems by introducing a learnable safety measure that captures the relationship between system dynamics and failure states, and demonstrates that using this measure during learning can greatly reduce failures.

Failures are challenging for learning to control physical systems since they risk damage, time-consuming resets, and often provide little gradient information. Adding safety constraints to exploration typically requires a lot of prior knowledge and domain expertise. We present a safety measure which implicitly captures how the system dynamics relate to a set of failure states. Not only can this measure be used as a safety function, but also to directly compute the set of safe state-action pairs. Further, we show a model-free approach to learn this measure by active sampling using Gaussian processes. While safety can only be guaranteed after learning the safety measure, we show that failures can already be greatly reduced by using the estimated measure during learning.

View on arXiv PDF Code

Similar