CVMar 21

SupScene: Scene-Structured Overlap Supervision for Image Retrieval in Unconstrained SfM

arXiv:2601.1193036.9h-index: 4Has Code
AI Analysis

This work addresses the need for geometrically matchable image retrieval in SfM, offering a more effective training paradigm that outperforms existing methods on multiple benchmarks.

SupScene introduces a scene-structured training framework for image retrieval in unconstrained SfM that leverages overlap graphs to supervise pairwise relations, achieving significant improvements in retrieval performance and downstream SfM completeness.

Image retrieval is a critical step for reducing the quadratic cost of image matching in unconstrained Structure-from-Motion (SfM). Unlike generic image retrieval, however, the relevant goal of SfM is to identify geometrically matchable image pairs rather than merely semantically similar images. Prevailing methods are largely trained under anchor-centric tuple guidance, which organizes the training around isolated tuples and under-utilizes the dense, graded overlap structure naturally established within a SfM scene. In this work, we present SupScene, a scene-structured training framework that samples connected local subgraphs from SfM overlap graphs and jointly supervises all valid within-subgraph pairwise relations. To explicitly align the trained descriptor with geometric co-visibility, we further introduce an overlap-ordered objective that combines multi-similarity optimization with a continuous relative-overlap ranking term. In addition, the proposed framework is instantiated with a lightweight Structural Context Probe Pooling (SCPP) head that aggregates complementary structural responses into a compact global descriptor. Extensive experimental results on multiple benchmarks demonstrate that our method can significantly improve overall retrieval performance and enhance the completeness of downstream SfM reconstructions. Code and models are available at https://github.com/Suxilan/SupScene.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes