It's Time to Play Safe: Shield Synthesis for Timed Systems
This addresses safety-critical issues in real-time systems like autonomous vehicles, though it appears incremental by extending shield synthesis to timed systems.
The paper tackles the problem of ensuring safety in real-time systems by synthesizing timed shields from timed automata specifications, presenting both pre-shields and post-shields that minimize interference, and demonstrates their application in a reinforcement learning setting for car platoon control with experimental results.
Erroneous behaviour in safety critical real-time systems may inflict serious consequences. In this paper, we show how to synthesize timed shields from timed safety properties given as timed automata. A timed shield enforces the safety of a running system while interfering with the system as little as possible. We present timed post-shields and timed pre-shields. A timed pre-shield is placed before the system and provides a set of safe outputs. This set restricts the choices of the system. A timed post-shield is implemented after the system. It monitors the system and corrects the system's output only if necessary. We further extend the timed post-shield construction to provide a guarantee on the recovery phase, i.e., the time between a specification violation and the point at which full control can be handed back to the system. In our experimental results, we use timed post-shields to ensure the safety in a reinforcement learning setting for controlling a platoon of cars, during the learning and execution phase, and study the effect.