LGAICVROMLSep 12, 2019

Hierarchical Foresight: Self-Supervised Learning of Long-Horizon Tasks via Visual Subgoal Generation

arXiv:1909.05829v1153 citations
Originality Incremental advance
AI Analysis

This addresses a key limitation in robot learning for long-horizon tasks, offering a self-supervised approach that improves planning efficiency and scalability, though it is incremental as it builds on existing video prediction and planning methods.

The paper tackles the problem of enabling robots to plan over long horizons for vision-based tasks by proposing a hierarchical visual foresight framework that generates subgoal images to decompose tasks into easier segments, resulting in nearly a 200% performance improvement over baseline methods in simulated manipulation tasks.

Video prediction models combined with planning algorithms have shown promise in enabling robots to learn to perform many vision-based tasks through only self-supervision, reaching novel goals in cluttered scenes with unseen objects. However, due to the compounding uncertainty in long horizon video prediction and poor scalability of sampling-based planning optimizers, one significant limitation of these approaches is the ability to plan over long horizons to reach distant goals. To that end, we propose a framework for subgoal generation and planning, hierarchical visual foresight (HVF), which generates subgoal images conditioned on a goal image, and uses them for planning. The subgoal images are directly optimized to decompose the task into easy to plan segments, and as a result, we observe that the method naturally identifies semantically meaningful states as subgoals. Across three out of four simulated vision-based manipulation tasks, we find that our method achieves nearly a 200% performance improvement over planning without subgoals and model-free RL approaches. Further, our experiments illustrate that our approach extends to real, cluttered visual scenes. Project page: https://sites.google.com/stanford.edu/hvf

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes