LG AIApr 26

CAPSULE: Control-Theoretic Action Perturbations for Safe Uncertainty-Aware Reinforcement Learning

Rahul Narava, Siddharth Verma, Ojas Jain, Shashi Shekhar Jha, Mayank Shekhar Jha

arXiv:2604.2357612.1

Predicted impact top 87% in LG · last 90 daysOriginality Incremental advance

AI Analysis

Addresses safe exploration in high-dimensional systems with unknown dynamics, offering hard safety guarantees for RL agents.

Proposed a safe RL framework that learns a probabilistic control-affine model offline and uses it to construct control barrier functions with model uncertainty, enabling safe exploration with fewer safety violations while maintaining task performance comparable to baselines.

Ensuring safe exploration in high-dimensional systems with unknown dynamics remains a significant challenge. Existing safe reinforcement learning methods often provide safety guarantees only in expectation, which can still lead to safety violations. Control-theoretic approaches, in contrast, offer hard constraint-based safety guarantees but typically assume access to known system dynamics or require accurate estimation of control-affine models. In this paper, we propose a safe reinforcement learning framework that learns a probabilistic control-affine dynamics model in an offline setting. The learned model is leveraged to explicitly construct control barrier functions (CBFs) that incorporate model uncertainty to provide conservative safety constraints. These CBF constraints are enforced through an online constraint-based action correction mechanism, enabling safe exploration without overly restricting task performance. Empirical evaluations on nonlinear, complex continuous-control benchmarks demonstrate that our approach achieves returns comparable to those of existing baselines while significantly reducing safety violations.

View on arXiv PDF

Similar