LGAIMay 4, 2023

Masked Trajectory Models for Prediction, Representation, and Control

arXiv:2305.02968v157 citationsHas Code
Originality Highly original
AI Analysis

This provides a unified approach for prediction, representation, and control in robotics and AI, though it is incremental as it builds on masked modeling techniques.

The paper tackles sequential decision making by introducing Masked Trajectory Models (MTM), a generic self-supervised method that learns versatile networks capable of performing tasks like forward dynamics, inverse dynamics, and offline RL with the same weights, matching or outperforming specialized networks in continuous control tasks and accelerating RL learning speed.

We introduce Masked Trajectory Models (MTM) as a generic abstraction for sequential decision making. MTM takes a trajectory, such as a state-action sequence, and aims to reconstruct the trajectory conditioned on random subsets of the same trajectory. By training with a highly randomized masking pattern, MTM learns versatile networks that can take on different roles or capabilities, by simply choosing appropriate masks at inference time. For example, the same MTM network can be used as a forward dynamics model, inverse dynamics model, or even an offline RL agent. Through extensive experiments in several continuous control tasks, we show that the same MTM network -- i.e. same weights -- can match or outperform specialized networks trained for the aforementioned capabilities. Additionally, we find that state representations learned by MTM can significantly accelerate the learning speed of traditional RL algorithms. Finally, in offline RL benchmarks, we find that MTM is competitive with specialized offline RL algorithms, despite MTM being a generic self-supervised learning method without any explicit RL components. Code is available at https://github.com/facebookresearch/mtm

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes