ROMar 11

RL-Augmented MPC for Non-Gaited Legged and Hybrid Locomotion

arXiv:2603.10878v18.3h-index: 18Has Code
Predicted impact top 45% in RO · last 90 daysOriginality Incremental advance
AI Analysis

This addresses locomotion control for heavy legged robots, enabling acyclic gaits and adaptation, though it is incremental as it builds on existing RL and MPC methods.

The paper tackles the challenge of non-gaited legged and hybrid locomotion by proposing a hierarchical RL-augmented MPC architecture that offloads contact timing to RL, achieving zero-shot sim-to-real transfer on a 120 kg robot without domain randomization.

We propose a contact-explicit hierarchical architecture coupling Reinforcement Learning (RL) and Model Predictive Control (MPC), where a high-level RL agent provides gait and navigation commands to a low-level locomotion MPC. This offloads the combinatorial burden of contact timing from the MPC by learning acyclic gaits through trial and error in simulation. We show that only a minimal set of rewards and limited tuning are required to obtain effective policies. We validate the architecture in simulation across robotic platforms spanning 50 kg to 120 kg and different MPC implementations, observing the emergence of acyclic gaits and timing adaptations in flat-terrain legged and hybrid locomotion, and further demonstrating extensibility to non-flat terrains. Across all platforms, we achieve zero-shot sim-to-sim transfer without domain randomization, and we further demonstrate zero-shot sim-to-real transfer without domain randomization on Centauro, our 120 kg wheeled-legged humanoid robot. We make our software framework and evaluation results publicly available at https://github.com/AndrePatri/AugMPC.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes