LGAICVROSep 30, 2024

M2Distill: Multi-Modal Distillation for Lifelong Imitation Learning

arXiv:2410.00064v310 citationsh-index: 3
Originality Incremental advance
AI Analysis

This addresses scalability and forgetting issues in lifelong learning for robotics, though it appears incremental as it builds on distillation and multi-modal approaches.

The paper tackles the problem of catastrophic forgetting in lifelong imitation learning for manipulation tasks by introducing M2Distill, a multi-modal distillation method that preserves consistent latent spaces across vision, language, and action distributions, and it outperforms prior state-of-the-art methods on the LIBERO benchmark suites.

Lifelong imitation learning for manipulation tasks poses significant challenges due to distribution shifts that occur in incremental learning steps. Existing methods often focus on unsupervised skill discovery to construct an ever-growing skill library or distillation from multiple policies, which can lead to scalability issues as diverse manipulation tasks are continually introduced and may fail to ensure a consistent latent space throughout the learning process, leading to catastrophic forgetting of previously learned skills. In this paper, we introduce M2Distill, a multi-modal distillation-based method for lifelong imitation learning focusing on preserving consistent latent space across vision, language, and action distributions throughout the learning process. By regulating the shifts in latent representations across different modalities from previous to current steps, and reducing discrepancies in Gaussian Mixture Model (GMM) policies between consecutive learning steps, we ensure that the learned policy retains its ability to perform previously learned tasks while seamlessly integrating new skills. Extensive evaluations on the LIBERO lifelong imitation learning benchmark suites, including LIBERO-OBJECT, LIBERO-GOAL, and LIBERO-SPATIAL, demonstrate that our method consistently outperforms prior state-of-the-art methods across all evaluated metrics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes