LO LGJun 30, 2020

It's Time to Play Safe: Shield Synthesis for Timed Systems

Roderick Bloem, Peter Gjøl Jensen, Bettina Könighofer, Kim Guldstrand Larsen, Florian Lorber, Alexander Palmisano

arXiv:2006.16688v15.18 citations

Originality Incremental advance

AI Analysis

This addresses safety-critical issues in real-time systems like autonomous vehicles, though it appears incremental by extending shield synthesis to timed systems.

The paper tackles the problem of ensuring safety in real-time systems by synthesizing timed shields from timed automata specifications, presenting both pre-shields and post-shields that minimize interference, and demonstrates their application in a reinforcement learning setting for car platoon control with experimental results.

Erroneous behaviour in safety critical real-time systems may inflict serious consequences. In this paper, we show how to synthesize timed shields from timed safety properties given as timed automata. A timed shield enforces the safety of a running system while interfering with the system as little as possible. We present timed post-shields and timed pre-shields. A timed pre-shield is placed before the system and provides a set of safe outputs. This set restricts the choices of the system. A timed post-shield is implemented after the system. It monitors the system and corrects the system's output only if necessary. We further extend the timed post-shield construction to provide a guarantee on the recovery phase, i.e., the time between a specification violation and the point at which full control can be handed back to the system. In our experimental results, we use timed post-shields to ensure the safety in a reinforcement learning setting for controlling a platoon of cars, during the learning and execution phase, and study the effect.

View on arXiv PDF

Similar