UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles
This addresses the problem of limited 3D understanding in UAV applications for researchers and developers, though it is incremental as it builds on existing 2D benchmarks by extending them to 3D.
The authors tackled the lack of 3D perception benchmarks for UAVs by introducing UAV3D, a large-scale dataset with 1,000 scenes and 20 frames each for 3D object detection and tracking tasks, enabling research in both single and collaborative UAV scenarios.
Unmanned Aerial Vehicles (UAVs), equipped with cameras, are employed in numerous applications, including aerial photography, surveillance, and agriculture. In these applications, robust object detection and tracking are essential for the effective deployment of UAVs. However, existing benchmarks for UAV applications are mainly designed for traditional 2D perception tasks, restricting the development of real-world applications that require a 3D understanding of the environment. Furthermore, despite recent advancements in single-UAV perception, limited views of a single UAV platform significantly constrain its perception capabilities over long distances or in occluded areas. To address these challenges, we introduce UAV3D, a benchmark designed to advance research in both 3D and collaborative 3D perception tasks with UAVs. UAV3D comprises 1,000 scenes, each of which has 20 frames with fully annotated 3D bounding boxes on vehicles. We provide the benchmark for four 3D perception tasks: single-UAV 3D object detection, single-UAV object tracking, collaborative-UAV 3D object detection, and collaborative-UAV object tracking. Our dataset and code are available at https://huiyegit.github.io/UAV3D_Benchmark/.