SYLGApr 23, 2025

Learning Verifiable Control Policies Using Relaxed Verification

arXiv:2504.16879v1h-index: 2CDC
Originality Incremental advance
AI Analysis

This addresses safety verification for control systems, offering a method to ensure policies meet specifications during runtime, though it is incremental as it builds on existing verification techniques.

The paper tackles the problem of providing safety guarantees for learning-based control systems by integrating verification throughout training, resulting in policies that satisfy reach-avoid and invariance specifications on quadrotor and unicycle models.

To provide safety guarantees for learning-based control systems, recent work has developed formal verification methods to apply after training ends. However, if the trained policy does not meet the specifications, or there is conservatism in the verification algorithm, establishing these guarantees may not be possible. Instead, this work proposes to perform verification throughout training to ultimately aim for policies whose properties can be evaluated throughout runtime with lightweight, relaxed verification algorithms. The approach is to use differentiable reachability analysis and incorporate new components into the loss function. Numerical experiments on a quadrotor model and unicycle model highlight the ability of this approach to lead to learned control policies that satisfy desired reach-avoid and invariance specifications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes