CVLGJul 23, 2020

Neural Geometric Parser for Single Image Camera Calibration

arXiv:2007.11855v225 citations
Originality Incremental advance
AI Analysis

This addresses camera calibration from single images, a key problem in computer vision for applications like 3D reconstruction, but is incremental as it builds on prior neural and geometric approaches.

The paper tackles single image camera calibration for man-made scenes by proposing a neural geometric parser that combines semantic and geometric cues, achieving significantly higher accuracy than existing state-of-the-art techniques in experiments on indoor and outdoor scenes.

We propose a neural geometric parser learning single image camera calibration for man-made scenes. Unlike previous neural approaches that rely only on semantic cues obtained from neural networks, our approach considers both semantic and geometric cues, resulting in significant accuracy improvement. The proposed framework consists of two networks. Using line segments of an image as geometric cues, the first network estimates the zenith vanishing point and generates several candidates consisting of the camera rotation and focal length. The second network evaluates each candidate based on the given image and the geometric cues, where prior knowledge of man-made scenes is used for the evaluation. With the supervision of datasets consisting of the horizontal line and focal length of the images, our networks can be trained to estimate the same camera parameters. Based on the Manhattan world assumption, we can further estimate the camera rotation and focal length in a weakly supervised manner. The experimental results reveal that the performance of our neural approach is significantly higher than that of existing state-of-the-art camera calibration techniques for single images of indoor and outdoor scenes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes