CVNov 4, 2019

Deep Heterogeneous Hashing for Face Video Retrieval

arXiv:1911.01048v119 citations
Originality Synthesis-oriented
AI Analysis

This work addresses a domain-specific problem for applications in face video retrieval, but it is incremental as it builds on existing hashing and manifold techniques.

The paper tackles the problem of retrieving face videos using a face image query by addressing the challenge of matching heterogeneous data representations (Euclidean vectors for images and Riemannian manifolds for videos) through an end-to-end deep hashing method. It achieves competitive performance on three datasets compared to existing hashing methods.

Retrieving videos of a particular person with face image as a query via hashing technique has many important applications. While face images are typically represented as vectors in Euclidean space, characterizing face videos with some robust set modeling techniques (e.g. covariance matrices as exploited in this study, which reside on Riemannian manifold), has recently shown appealing advantages. This hence results in a thorny heterogeneous spaces matching problem. Moreover, hashing with handcrafted features as done in many existing works is clearly inadequate to achieve desirable performance for this task. To address such problems, we present an end-to-end Deep Heterogeneous Hashing (DHH) method that integrates three stages including image feature learning, video modeling, and heterogeneous hashing in a single framework, to learn unified binary codes for both face images and videos. To tackle the key challenge of hashing on the manifold, a well-studied Riemannian kernel mapping is employed to project data (i.e. covariance matrices) into Euclidean space and thus enables to embed the two heterogeneous representations into a common Hamming space, where both intra-space discriminability and inter-space compatibility are considered. To perform network optimization, the gradient of the kernel mapping is innovatively derived via structured matrix backpropagation in a theoretically principled way. Experiments on three challenging datasets show that our method achieves quite competitive performance compared with existing hashing methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes