LG AIJan 16, 2025

On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression

Zichang Ge, Changyu Chen, Arunesh Sinha, Pradeep Varakantham

arXiv:2501.09327v24.11 citationsh-index: 6Has CodeAAMAS

Originality Incremental advance

AI Analysis

This work addresses the need for generalizable trajectory representations across domains such as autonomous driving and healthcare, though it appears incremental by building on embedding models like CLIP and BERT.

The paper tackles the problem of learning from state-action trajectories in sequential decision-making tasks like autonomous driving and robotics, proposing a novel embedding method that captures skills and competencies without reward labels, and experimental results show it outperforms traditional approaches in tasks like imitation, classification, clustering, and regression.

In real-world sequential decision making tasks like autonomous driving, robotics, and healthcare, learning from observed state-action trajectories is critical for tasks like imitation, classification, and clustering. For example, self-driving cars must replicate human driving behaviors, while robots and healthcare systems benefit from modeling decision sequences, whether or not they come from expert data. Existing trajectory encoding methods often focus on specific tasks or rely on reward signals, limiting their ability to generalize across domains and tasks. Inspired by the success of embedding models like CLIP and BERT in static domains, we propose a novel method for embedding state-action trajectories into a latent space that captures the skills and competencies in the dynamic underlying decision-making processes. This method operates without the need for reward labels, enabling better generalization across diverse domains and tasks. Our contributions are threefold: (1) We introduce a trajectory embedding approach that captures multiple abilities from state-action data. (2) The learned embeddings exhibit strong representational power across downstream tasks, including imitation, classification, clustering, and regression. (3) The embeddings demonstrate unique properties, such as controlling agent behaviors in IQ-Learn and an additive structure in the latent space. Experimental results confirm that our method outperforms traditional approaches, offering more flexible and powerful trajectory representations for various applications. Our code is available at https://github.com/Erasmo1015/vte.

View on arXiv PDF Code

Similar