LGAIMay 22

Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control

arXiv:2605.2341511.9Has Code
Predicted impact top 42% in LG · last 90 daysOriginality Incremental advance
AI Analysis

Addresses the underexplored problem of leveraging reflection symmetry in state-based RL to improve sample efficiency for continuous control tasks.

Reflex exploits reflection symmetry in state-based continuous control to improve sample efficiency and performance, achieving superior results over standard PPO and SAC baselines on OpenAI Gym and DeepMind Control benchmarks.

Reinforcement learning has long struggled with poor sample efficiency. One promising approach to mitigate this problem is leveraging group-invariant Markov Decision Processes ($G$-invariant MDPs). Existing works in this direction have primarily focused on image-based RL and rotational symmetry such as $\mathrm{SO(2)}$, leaving state-based RL and reflection symmetry largely underexplored. In this work, we focus on state-based continuous control tasks and exploit reflection symmetry by introducing Reflex, a paradigm that seamlessly integrates with both on-policy and off-policy RL algorithms. We formalize two types of reflection-axial reflection and bilateral reflection, and characterize their corresponding transformations. Building on a theoretical analysis of symmetry-preserving optimal value functions and policies, Reflex integrates reflection symmetry into policy learning through principled symmetry regularization mechanisms. We integrate Reflex with PPO and SAC, and evaluate it on a suite of OpenAI Gym and DeepMind Control benchmarks, demonstrating superior performance over standard baselines while improving sample efficiency. Our code is available at https://github.com/TonyStark042/Reflex.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes