LGAIROFeb 27, 2025

Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning

arXiv:2502.19652v116 citationsh-index: 56Has CodeICLR
Originality Synthesis-oriented
AI Analysis

This provides a tool for the robust RL community to assess and develop algorithms, but it is incremental as it builds on existing RL benchmarks.

The authors tackled the lack of standardized benchmarks for robust reinforcement learning by introducing Robust-Gymnasium, a unified modular benchmark with over sixty task environments, and they benchmarked existing algorithms to uncover significant deficiencies.

Driven by inherent uncertainty and the sim-to-real gap, robust reinforcement learning (RL) seeks to improve resilience against the complexity and variability in agent-environment sequential interactions. Despite the existence of a large number of RL benchmarks, there is a lack of standardized benchmarks for robust RL. Current robust RL policies often focus on a specific type of uncertainty and are evaluated in distinct, one-off environments. In this work, we introduce Robust-Gymnasium, a unified modular benchmark designed for robust RL that supports a wide variety of disruptions across all key RL components-agents' observed state and reward, agents' actions, and the environment. Offering over sixty diverse task environments spanning control and robotics, safe RL, and multi-agent RL, it provides an open-source and user-friendly tool for the community to assess current methods and foster the development of robust RL algorithms. In addition, we benchmark existing standard and robust RL algorithms within this framework, uncovering significant deficiencies in each and offering new insights.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes