IRJul 26, 2017

Discrete Latent Factor Model for Cross-Modal Hashing

arXiv:1707.08322v1145 citations
Originality Incremental advance
AI Analysis

This addresses the problem of efficient and accurate cross-modal similarity search for multimedia applications, representing an incremental improvement over existing discrete methods.

The paper tackles the trade-off between accuracy and training efficiency in cross-modal hashing by proposing DLFH, a discrete method that directly learns binary hash codes, achieving significantly better accuracy than existing methods while maintaining training time comparable to faster relaxation-based continuous methods.

Due to its storage and retrieval efficiency, cross-modal hashing~(CMH) has been widely used for cross-modal similarity search in multimedia applications. According to the training strategy, existing CMH methods can be mainly divided into two categories: relaxation-based continuous methods and discrete methods. In general, the training of relaxation-based continuous methods is faster than discrete methods, but the accuracy of relaxation-based continuous methods is not satisfactory. On the contrary, the accuracy of discrete methods is typically better than relaxation-based continuous methods, but the training of discrete methods is time-consuming. In this paper, we propose a novel CMH method, called discrete latent factor model based cross-modal hashing~(DLFH), for cross modal similarity search. DLFH is a discrete method which can directly learn the binary hash codes for CMH. At the same time, the training of DLFH is efficient. Experiments on real datasets show that DLFH can achieve significantly better accuracy than existing methods, and the training time of DLFH is comparable to that of relaxation-based continuous methods which are much faster than existing discrete methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes