CVAIIVOct 11, 2021

UrbanNet: Leveraging Urban Maps for Long Range 3D Object Detection

arXiv:2110.05561v112 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses a cost-sensitive challenge for traffic monitoring applications, but it appears incremental as it builds on existing components like 2D detectors and 3D descriptors.

The paper tackles the problem of precise 3D object detection from monocular images for traffic monitoring by proposing UrbanNet, a modular architecture that uses urban maps to achieve accurate detection at long range, even with object rotation and non-flat ground, as evaluated on a novel synthetic dataset.

Relying on monocular image data for precise 3D object detection remains an open problem, whose solution has broad implications for cost-sensitive applications such as traffic monitoring. We present UrbanNet, a modular architecture for long range monocular 3D object detection with static cameras. Our proposed system combines commonly available urban maps along with a mature 2D object detector and an efficient 3D object descriptor to accomplish accurate detection at long range even when objects are rotated along any of their three axes. We evaluate UrbanNet on a novel challenging synthetic dataset and highlight the advantages of its design for traffic detection in roads with changing slope, where the flat ground approximation does not hold. Data and code are available at https://github.com/TRAILab/UrbanNet

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes