ROAILGSYNov 1, 2022

CPG-RL: Learning Central Pattern Generators for Quadruped Locomotion

arXiv:2211.00458v1123 citationsh-index: 73
Originality Incremental advance
AI Analysis

This addresses robust locomotion for quadruped robots, with incremental improvements in combining CPGs with DRL for enhanced disturbance resistance.

The paper tackled robust quadruped locomotion by integrating central pattern generators (CPGs) into deep reinforcement learning, achieving sim-to-real transfer with a Unitree A1 robot that handled a 13.75 kg load (115% of nominal mass) without domain randomization.

In this letter, we present a method for integrating central pattern generators (CPGs), i.e. systems of coupled oscillators, into the deep reinforcement learning (DRL) framework to produce robust and omnidirectional quadruped locomotion. The agent learns to directly modulate the intrinsic oscillator setpoints (amplitude and frequency) and coordinate rhythmic behavior among different oscillators. This approach also allows the use of DRL to explore questions related to neuroscience, namely the role of descending pathways, interoscillator couplings, and sensory feedback in gait generation. We train our policies in simulation and perform a sim-to-real transfer to the Unitree A1 quadruped, where we observe robust behavior to disturbances unseen during training, most notably to a dynamically added 13.75 kg load representing 115% of the nominal quadruped mass. We test several different observation spaces based on proprioceptive sensing and show that our framework is deployable with no domain randomization and very little feedback, where along with the oscillator states, it is possible to provide only contact booleans in the observation space. Video results can be found at https://youtu.be/xqXHLzLsEV4.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes