CV AI LG ROSep 28, 2023

MotionLM: Multi-Agent Motion Forecasting as Language Modeling

Ari Seff, Brian Cera, Dian Chen, Mason Ng, Aurick Zhou, Nigamaa Nayakanti, Khaled S. Refaat, Rami Al-Rfou, Benjamin Sapp

arXiv:2309.16534v133.3201 citationsh-index: 27

Originality Highly original

AI Analysis

This addresses the critical problem of safe planning in autonomous vehicles by improving interactive motion prediction, though it is an incremental advance using a novel method for a known bottleneck.

The paper tackles multi-agent motion forecasting for autonomous vehicles by representing trajectories as discrete tokens and framing it as a language modeling task, achieving state-of-the-art performance with a 1st-place ranking on the Waymo Open Motion Dataset interactive challenge leaderboard.

Reliable forecasting of the future behavior of road agents is a critical component to safe planning in autonomous vehicles. Here, we represent continuous trajectories as sequences of discrete motion tokens and cast multi-agent motion prediction as a language modeling task over this domain. Our model, MotionLM, provides several advantages: First, it does not require anchors or explicit latent variable optimization to learn multimodal distributions. Instead, we leverage a single standard language modeling objective, maximizing the average log probability over sequence tokens. Second, our approach bypasses post-hoc interaction heuristics where individual agent trajectory generation is conducted prior to interactive scoring. Instead, MotionLM produces joint distributions over interactive agent futures in a single autoregressive decoding process. In addition, the model's sequential factorization enables temporally causal conditional rollouts. The proposed approach establishes new state-of-the-art performance for multi-agent motion prediction on the Waymo Open Motion Dataset, ranking 1st on the interactive challenge leaderboard.

View on arXiv PDF

Similar