CVJun 2, 2023

Two-View Geometry Scoring Without Correspondences

arXiv:2306.01596v116 citationsh-index: 40
Originality Highly original
AI Analysis

This addresses a bottleneck in computer vision for applications like 3D reconstruction by improving pose estimation when correspondences are scarce or unreliable.

The paper tackles the problem of camera pose estimation in two-view geometry by proposing the Fundamental Scoring Network (FSNet), which scores image pairs and fundamental matrices without relying on correspondences, and shows that combining it with MAGSAC++ achieves state-of-the-art results.

Camera pose estimation for two-view geometry traditionally relies on RANSAC. Normally, a multitude of image correspondences leads to a pool of proposed hypotheses, which are then scored to find a winning model. The inlier count is generally regarded as a reliable indicator of "consensus". We examine this scoring heuristic, and find that it favors disappointing models under certain circumstances. As a remedy, we propose the Fundamental Scoring Network (FSNet), which infers a score for a pair of overlapping images and any proposed fundamental matrix. It does not rely on sparse correspondences, but rather embodies a two-view geometry model through an epipolar attention mechanism that predicts the pose error of the two images. FSNet can be incorporated into traditional RANSAC loops. We evaluate FSNet on fundamental and essential matrix estimation on indoor and outdoor datasets, and establish that FSNet can successfully identify good poses for pairs of images with few or unreliable correspondences. Besides, we show that naively combining FSNet with MAGSAC++ scoring approach achieves state of the art results.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes