ROMay 20

Reinforcement Learning for Risk Adaptation via Differentiable CVaR Barrier Functions

Xinyi Wang, Taekyung Kim, Bardh Hoxha, Georgios Fainekos, Dimitra Panagou

arXiv:2605.2125736.4

Predicted impact top 55% in RO · last 90 daysOriginality Incremental advance

AI Analysis

For robots navigating crowded environments with uncertain obstacle motions, this work provides a context-aware adaptation method that balances safety and efficiency, outperforming existing optimization-based, RL-based, and integrated approaches.

The paper proposes an end-to-end risk adaptation framework for crowd navigation under obstacle-motion uncertainty, combining reinforcement learning with differentiable CVaR barrier functions. The method achieves the strongest overall performance in safety, efficiency, and generalization across varying obstacle densities and robot models.

Planning through crowded environments under uncertain obstacle motions remains difficult, as stochastic interactions often induce overly conservative behavior or reduced efficiency. To address this challenge, we propose an end-to-end risk adaptation framework for crowd navigation under obstacle-motion uncertainty modeled by a Gaussian mixture model. The framework combines reinforcement learning~(RL) with a differentiable quadratic-program safety layer based on Conditional Value-at-Risk~(CVaR) barrier functions, jointly learning nominal control input, risk level, and safety margin and enforcing explicit probabilistic safety constraints. This design enables context-aware adaptation, promoting efficient behavior while invoking caution only when necessary. We conduct extensive evaluations in dynamic, uncertain, and crowded environments across varying obstacle densities and robot models, and further assess generalization under three out-of-distribution cases. Comparisons across optimization-based, RL-based, and integrated RL and optimization methods are provided, and the proposed method is shown to deliver the strongest overall performance in safety, efficiency, and generalization under uncertainty.

View on arXiv PDF

Similar