ROAILGAug 12, 2025

Unsupervised Skill Discovery as Exploration for Learning Agile Locomotion

arXiv:2508.08982v17 citationsh-index: 2
Originality Incremental advance
AI Analysis

This addresses the problem of reducing human effort in robot learning for agile locomotion, though it appears incremental as it builds on existing exploration and skill discovery methods.

The paper tackled the challenge of enabling legged robots to learn agile locomotion without extensive human engineering by proposing SDAX, a framework that uses unsupervised skill discovery to acquire diverse skills for overcoming obstacles, resulting in successful deployment on real quadrupedal robots with behaviors like crawling, climbing, and jumping off walls.

Exploration is crucial for enabling legged robots to learn agile locomotion behaviors that can overcome diverse obstacles. However, such exploration is inherently challenging, and we often rely on extensive reward engineering, expert demonstrations, or curriculum learning - all of which limit generalizability. In this work, we propose Skill Discovery as Exploration (SDAX), a novel learning framework that significantly reduces human engineering effort. SDAX leverages unsupervised skill discovery to autonomously acquire a diverse repertoire of skills for overcoming obstacles. To dynamically regulate the level of exploration during training, SDAX employs a bi-level optimization process that autonomously adjusts the degree of exploration. We demonstrate that SDAX enables quadrupedal robots to acquire highly agile behaviors including crawling, climbing, leaping, and executing complex maneuvers such as jumping off vertical walls. Finally, we deploy the learned policy on real hardware, validating its successful transfer to the real world.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes