CVROAug 2, 2024

SceneMotion: From Agent-Centric Embeddings to Scene-Wide Forecasts

arXiv:2408.01537v38 citationsh-index: 7Has Code
Originality Incremental advance
AI Analysis

This work addresses motion forecasting for self-driving vehicles, which is incremental as it builds on existing attention-based models with a novel latent context module.

The paper tackles the problem of forecasting scene-wide motion modes for multiple traffic agents in self-driving contexts, achieving competitive performance in the Waymo Open Interaction Prediction Challenge.

Self-driving vehicles rely on multimodal motion forecasts to effectively interact with their environment and plan safe maneuvers. We introduce SceneMotion, an attention-based model for forecasting scene-wide motion modes of multiple traffic agents. Our model transforms local agent-centric embeddings into scene-wide forecasts using a novel latent context module. This module learns a scene-wide latent space from multiple agent-centric embeddings, enabling joint forecasting and interaction modeling. The competitive performance in the Waymo Open Interaction Prediction Challenge demonstrates the effectiveness of our approach. Moreover, we cluster future waypoints in time and space to quantify the interaction between agents. We merge all modes and analyze each mode independently to determine which clusters are resolved through interaction or result in conflict. Our implementation is available at: https://github.com/kit-mrt/future-motion

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes