LGROJun 11, 2021

Keyframe-Focused Visual Imitation Learning

arXiv:2106.06452v129 citations
Originality Incremental advance
AI Analysis

This addresses a long-standing issue in behavioral cloning for driving by enabling effective imitation from observation histories, though it is incremental as it builds on prior keyframe-based approaches.

The paper tackles the problem of imitation learning in partially observable settings by upweighting demonstration keyframes at expert action changepoints, resulting in consistent performance improvements on image-based Gym MuJoCo tasks and effective imitation in the CARLA driving simulator.

Imitation learning trains control policies by mimicking pre-recorded expert demonstrations. In partially observable settings, imitation policies must rely on observation histories, but many seemingly paradoxical results show better performance for policies that only access the most recent observation. Recent solutions ranging from causal graph learning to deep information bottlenecks have shown promising results, but failed to scale to realistic settings such as visual imitation. We propose a solution that outperforms these prior approaches by upweighting demonstration keyframes corresponding to expert action changepoints. This simple approach easily scales to complex visual imitation settings. Our experimental results demonstrate consistent performance improvements over all baselines on image-based Gym MuJoCo continuous control tasks. Finally, on the CARLA photorealistic vision-based urban driving simulator, we resolve a long-standing issue in behavioral cloning for driving by demonstrating effective imitation from observation histories. Supplementary materials and code at: \url{https://tinyurl.com/imitation-keyframes}.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes