ROAILGJul 7, 2025

Beyond Features: How Dataset Design Influences Multi-Agent Trajectory Prediction Performance

arXiv:2507.05098v1h-index: 23
Originality Synthesis-oriented
AI Analysis

This work addresses dataset design challenges for autonomous navigation researchers, but it is incremental as it builds on existing benchmarks and methods.

The study investigated how dataset design affects multi-agent trajectory prediction accuracy, finding that adding extra map and agent features did not improve performance over baseline features, and conducted cross-dataset and geographic transfer experiments.

Accurate trajectory prediction is critical for safe autonomous navigation, yet the impact of dataset design on model performance remains understudied. This work systematically examines how feature selection, cross-dataset transfer, and geographic diversity influence trajectory prediction accuracy in multi-agent settings. We evaluate a state-of-the-art model using our novel L4 Motion Forecasting dataset based on our own data recordings in Germany and the US. This includes enhanced map and agent features. We compare our dataset to the US-centric Argoverse 2 benchmark. First, we find that incorporating supplementary map and agent features unique to our dataset, yields no measurable improvement over baseline features, demonstrating that modern architectures do not need extensive feature sets for optimal performance. The limited features of public datasets are sufficient to capture convoluted interactions without added complexity. Second, we perform cross-dataset experiments to evaluate how effective domain knowledge can be transferred between datasets. Third, we group our dataset by country and check the knowledge transfer between different driving cultures.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes