CVAILGROSep 28, 2023

MotionLM: Multi-Agent Motion Forecasting as Language Modeling

arXiv:2309.16534v1196 citationsh-index: 27
Originality Highly original
AI Analysis

This addresses the critical problem of safe planning in autonomous vehicles by improving interactive motion prediction, though it is an incremental advance using a novel method for a known bottleneck.

The paper tackles multi-agent motion forecasting for autonomous vehicles by representing trajectories as discrete tokens and framing it as a language modeling task, achieving state-of-the-art performance with a 1st-place ranking on the Waymo Open Motion Dataset interactive challenge leaderboard.

Reliable forecasting of the future behavior of road agents is a critical component to safe planning in autonomous vehicles. Here, we represent continuous trajectories as sequences of discrete motion tokens and cast multi-agent motion prediction as a language modeling task over this domain. Our model, MotionLM, provides several advantages: First, it does not require anchors or explicit latent variable optimization to learn multimodal distributions. Instead, we leverage a single standard language modeling objective, maximizing the average log probability over sequence tokens. Second, our approach bypasses post-hoc interaction heuristics where individual agent trajectory generation is conducted prior to interactive scoring. Instead, MotionLM produces joint distributions over interactive agent futures in a single autoregressive decoding process. In addition, the model's sequential factorization enables temporally causal conditional rollouts. The proposed approach establishes new state-of-the-art performance for multi-agent motion prediction on the Waymo Open Motion Dataset, ranking 1st on the interactive challenge leaderboard.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes