CVAIFeb 26, 2025

EMT: A Visual Multi-Task Benchmark Dataset for Autonomous Driving

arXiv:2502.19260v41 citationsh-index: 36Has Code
Originality Synthesis-oriented
AI Analysis

This dataset addresses the need for region-specific multi-task evaluation in autonomous driving, though it is incremental as it builds on existing benchmarking frameworks.

The paper introduces the EMT dataset, a visual multi-task benchmark for autonomous driving with over 30,000 frames and 570,000 annotated bounding boxes from Gulf region traffic, supporting tracking, trajectory forecasting, and intention prediction tasks.

This paper introduces the Emirates Multi-Task (EMT) dataset, designed to support multi-task benchmarking within a unified framework. It comprises over 30,000 frames from a dash-camera perspective and 570,000 annotated bounding boxes, covering approximately 150 kilometers of driving routes that reflect the distinctive road topology, congestion patterns, and driving behavior of Gulf region traffic. The dataset supports three primary tasks: tracking, trajectory forecasting, and intention prediction. Each benchmark is accompanied by corresponding evaluations: (1) multi-agent tracking experiments addressing multi-class scenarios and occlusion handling; (2) trajectory forecasting evaluation using deep sequential and interaction-aware models; and (3) intention prediction experiments based on observed trajectories. The dataset is publicly available at https://avlab.io/emt-dataset, with pre-processing scripts and evaluation models at https://github.com/AV-Lab/emt-dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes