CVMar 29, 2021

Monocular 3D Vehicle Detection Using Uncalibrated Traffic Cameras through Homography

arXiv:2103.15293v232 citations
AI Analysis

This addresses the problem of 3D vehicle detection for traffic monitoring systems, offering an incremental improvement by adapting existing methods to uncalibrated cameras.

The paper tackles monocular 3D vehicle detection from uncalibrated traffic cameras by using homography estimation and a dual-view network with tailed rotated bounding boxes, achieving generalization to new camera setups without training on them.

This paper proposes a method to extract the position and pose of vehicles in the 3D world from a single traffic camera. Most previous monocular 3D vehicle detection algorithms focused on cameras on vehicles from the perspective of a driver, and assumed known intrinsic and extrinsic calibration. On the contrary, this paper focuses on the same task using uncalibrated monocular traffic cameras. We observe that the homography between the road plane and the image plane is essential to 3D vehicle detection and the data synthesis for this task, and the homography can be estimated without the camera intrinsics and extrinsics. We conduct 3D vehicle detection by estimating the rotated bounding boxes (r-boxes) in the bird's eye view (BEV) images generated from inverse perspective mapping. We propose a new regression target called tailed r-box and a dual-view network architecture which boosts the detection accuracy on warped BEV images. Experiments show that the proposed method can generalize to new camera and environment setups despite not seeing imaged from them during training.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes