LGAIROFeb 9, 2024

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

arXiv:2402.06187v411 citationsh-index: 12Has CodeICML
Originality Incremental advance
AI Analysis

This work addresses the challenge of efficient few-shot learning for sequential decision-making tasks, which is incremental as it builds on existing temporal action contrastive learning methods.

The paper tackles the problem of improving few-shot policy learning efficiency in sequential decision-making tasks by introducing Premier-TACO, a multitask feature representation learning approach that pretrains on offline datasets and fine-tunes with minimal demonstrations, achieving significant enhancements in few-shot imitation learning across diverse continuous control benchmarks.

We present Premier-TACO, a multitask feature representation learning approach designed to improve few-shot policy learning efficiency in sequential decision-making tasks. Premier-TACO leverages a subset of multitask offline datasets for pretraining a general feature representation, which captures critical environmental dynamics and is fine-tuned using minimal expert demonstrations. It advances the temporal action contrastive learning (TACO) objective, known for state-of-the-art results in visual control tasks, by incorporating a novel negative example sampling strategy. This strategy is crucial in significantly boosting TACO's computational efficiency, making large-scale multitask offline pretraining feasible. Our extensive empirical evaluation in a diverse set of continuous control benchmarks including Deepmind Control Suite, MetaWorld, and LIBERO demonstrate Premier-TACO's effectiveness in pretraining visual representations, significantly enhancing few-shot imitation learning of novel tasks. Our code, pretraining data, as well as pretrained model checkpoints will be released at https://github.com/PremierTACO/premier-taco. Our project webpage is at https://premiertaco.github.io.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes