CV · May 13, 2015

Leveraging Image based Prior for Visual Place Recognition

arXiv:1505.03205v2 · 1 citation
Originality: Synthesis-oriented
AI Analysis

This paper addresses the problem of efficient and cost-effective place recognition for robotics and mapping applications. It appears incremental, since it builds on existing descriptor methods and mainly contributes a new data source.

The paper tackles visual place recognition by proposing a novel scene descriptor that uses raw image libraries like Google StreetView and Flickr instead of vector-quantized features, resulting in a compact and discriminative descriptor based on mined landmarks.

In this study, we propose a novel scene descriptor for visual place recognition. Unlike popular bag-of-words scene descriptors, which rely on a library of vector-quantized visual features, our proposed descriptor is based on a library of raw image data, such as publicly available photo collections from Google StreetView and Flickr. The library images need not be associated with spatial information regarding the viewpoint and orientation of the scene. As a result, these images are cheaper to obtain than the database images; in addition, they are readily available. Our proposed descriptor directly mines the image library to discover landmarks (i.e., image patches) that suitably match an input query/database image. The discovered landmarks are then compactly described by their pose and shape (i.e., library image ID and bounding boxes) and used as a compact, discriminative scene descriptor for the input image. We evaluate the effectiveness of our scene description framework by comparing its performance to that of previous approaches.
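
To make the descriptor format in the abstract concrete, here is a minimal Python sketch. It assumes the landmark mining step has already run, so each image is represented as a list of (library image ID, bounding box) pairs. The names `Landmark`, `iou`, `descriptor_similarity`, and `rank_database` are hypothetical, and the matching rule (same library image ID, scored by bounding-box overlap) is an illustrative assumption, not the comparison method described in the paper.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# Bounding box as (x_min, y_min, x_max, y_max) in library-image pixels.
Box = Tuple[float, float, float, float]


@dataclass(frozen=True)
class Landmark:
    """One mined landmark: which library image it came from, and where."""
    library_image_id: int
    box: Box


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def descriptor_similarity(query: List[Landmark], database: List[Landmark]) -> float:
    """Assumed scoring rule (not from the paper): for each query landmark,
    take the best box overlap among database landmarks that point at the
    same library image, then average over the query landmarks."""
    if not query:
        return 0.0
    total = 0.0
    for q in query:
        total += max(
            (iou(q.box, d.box) for d in database
             if d.library_image_id == q.library_image_id),
            default=0.0,
        )
    return total / len(query)


def rank_database(query: List[Landmark],
                  descriptors: Dict[str, List[Landmark]]) -> List[str]:
    """Return database image keys sorted from most to least similar."""
    return sorted(
        descriptors,
        key=lambda key: descriptor_similarity(query, descriptors[key]),
        reverse=True,
    )


# Toy data: two query landmarks and three database images.
query = [Landmark(7, (0, 0, 10, 10)), Landmark(3, (5, 5, 15, 15))]
db = {
    "place_a": [Landmark(7, (0, 0, 10, 10)), Landmark(3, (5, 5, 15, 15))],
    "place_b": [Landmark(7, (5, 0, 15, 10))],
    "place_c": [Landmark(9, (0, 0, 10, 10))],
}
ranking = rank_database(query, db)
```

In this toy setup, `place_a` shares both landmarks with the query, `place_b` shares one library image with a partially overlapping box, and `place_c` shares none, so the ranking follows that order. The point of the sketch is only that each landmark costs one integer ID plus four box coordinates, which is what makes such a descriptor compact.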

