IRCVFeb 21, 2017

Differential Geometric Retrieval of Deep Features

arXiv:1702.06383v23 citations
Originality Synthesis-oriented
AI Analysis

This work addresses image retrieval for commercial applications, but it is incremental as it focuses on comparing existing metrics rather than introducing a new method.

The paper tackled the problem of content-based image retrieval by comparing distance metrics for ranking deep features, finding that a Wasserstein-based metric, though computationally expensive, outperforms information-theoretic comparisons.

Comparing images to recommend items from an image-inventory is a subject of continued interest. Added with the scalability of deep-learning architectures the once `manual' job of hand-crafting features have been largely alleviated, and images can be compared according to features generated from a deep convolutional neural network. In this paper, we compare distance metrics (and divergences) to rank features generated from a neural network, for content-based image retrieval. Specifically, after modelling individual images using approximations of mixture models or sparse covariance estimators, we resort to their information-theoretic and Riemann geometric comparisons. We show that using approximations of mixture models enable us to compute a distance measure based on the Wasserstein metric that requires less effort than other computationally intensive optimal transport plans; finally, an affine invariant metric is used to compare the optimal transport metric to its Riemann geometric counterpart -- we conclude that although expensive, retrieval metric based on Wasserstein geometry is more suitable than information theoretic comparison of images. In short, we combine GPU scalability in learning deep feature vectors with statistically efficient metrics that we foresee being utilised in a commercial setting.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes