EMT: A Visual Multi-Task Benchmark Dataset for Autonomous Driving
This dataset addresses the need for region-specific multi-task evaluation in autonomous driving, though the contribution is incremental because it builds on existing benchmarking frameworks.
The paper introduces the EMT dataset, a visual multi-task benchmark for autonomous driving with over 30,000 frames and 570,000 annotated bounding boxes from Gulf region traffic, supporting tracking, trajectory forecasting, and intention prediction tasks.
This paper introduces the Emirates Multi-Task (EMT) dataset, designed to support multi-task benchmarking within a unified framework. It comprises over 30,000 frames from a dash-camera perspective and 570,000 annotated bounding boxes, covering approximately 150 kilometers of driving routes that reflect the distinctive road topology, congestion patterns, and driving behavior of Gulf region traffic. The dataset supports three primary tasks: tracking, trajectory forecasting, and intention prediction. Each benchmark is accompanied by corresponding evaluations: (1) multi-agent tracking experiments addressing multi-class scenarios and occlusion handling; (2) trajectory forecasting evaluation using deep sequential and interaction-aware models; and (3) intention prediction experiments based on observed trajectories. The dataset is publicly available at https://avlab.io/emt-dataset, with pre-processing scripts and evaluation models at https://github.com/AV-Lab/emt-dataset.
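The abstract does not state which metrics the trajectory forecasting benchmark reports. As an illustration only, the sketch below computes average and final displacement error (ADE and FDE), two metrics widely used for this task; whether EMT uses them is an assumption. The function name `displacement_errors` and the sample coordinates are hypothetical and are not taken from the EMT repository.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float]


def displacement_errors(
    predicted: Sequence[Point], ground_truth: Sequence[Point]
) -> Tuple[float, float]:
    """Return (ADE, FDE) for a single agent's trajectory.

    ADE is the mean Euclidean distance over all future time steps;
    FDE is the Euclidean distance at the last time step.
    Assumption: both trajectories use the same coordinate frame and sampling.
    """
    if len(predicted) != len(ground_truth) or not predicted:
        raise ValueError("trajectories must be non-empty and of equal length")

    # Per-step Euclidean distance between prediction and ground truth.
    distances = [
        math.hypot(px - gx, py - gy)
        for (px, py), (gx, gy) in zip(predicted, ground_truth)
    ]
    ade = sum(distances) / len(distances)
    fde = distances[-1]
    return ade, fde


# Hypothetical toy data: the prediction goes straight while the agent drifts.
predicted = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
ground_truth = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)]
ade, fde = displacement_errors(predicted, ground_truth)
print(f"ADE={ade:.2f}, FDE={fde:.2f}")
```

For real use, the same computation would be averaged over all agents and sequences, and the repository's own evaluation scripts should be preferred over this sketch.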