LGAISep 29, 2025

Data-Efficient Multitask DAgger

arXiv:2509.25466v1h-index: 14
Originality Incremental advance
AI Analysis

This addresses the data efficiency challenge for multitask robot learning, offering an incremental improvement over existing imitation learning methods.

The paper tackles the problem of training generalist robot policies that require extensive expert data by proposing a Data-Efficient multitask DAgger framework, which increases task success rates using substantially fewer expert demonstrations and shows better zero-shot transfer to real robots than baseline methods.

Generalist robot policies that can perform many tasks typically require extensive expert data or simulations for training. In this work, we propose a novel Data-Efficient multitask DAgger framework that distills a single multitask policy from multiple task-specific expert policies. Our approach significantly increases the overall task success rate by actively focusing on tasks where the multitask policy underperforms. The core of our method is a performance-aware scheduling strategy that tracks how much each task's learning process benefits from the amount of data, using a Kalman filter-based estimator to robustly decide how to allocate additional demonstrations across tasks. We validate our approach on MetaWorld, as well as a suite of diverse drawer-opening tasks in IsaacLab. The resulting policy attains high performance across all tasks while using substantially fewer expert demonstrations, and the visual policy learned with our method in simulation shows better performance than naive DAgger and Behavior Cloning when transferring zero-shot to a real robot without using real data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes