IRMar 2, 2018

Learning Decorrelated Hashing Codes for Multimodal Retrieval

arXiv:1803.00682v2
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in multimodal hashing for social network data retrieval, offering an incremental improvement over existing methods.

The paper tackles the problem of correlation among hashing bits in multimodal retrieval, which slows performance improvement with longer codes, by proposing a minimum correlation regularization method that decorrelates hashing codes, resulting in superior retrieval performance especially as code length increases.

In social networks, heterogeneous multimedia data correlate to each other, such as videos and their corresponding tags in YouTube and image-text pairs in Facebook. Nearest neighbor retrieval across multiple modalities on large data sets becomes a hot yet challenging problem. Hashing is expected to be an efficient solution, since it represents data as binary codes. As the bit-wise XOR operations can be fast handled, the retrieval time is greatly reduced. Few existing multimodal hashing methods consider the correlation among hashing bits. The correlation has negative impact on hashing codes. When the hashing code length becomes longer, the retrieval performance improvement becomes slower. In this paper, we propose a minimum correlation regularization (MCR) for multimodal hashing. First, the sigmoid function is used to embed the data matrices. Then, the MCR is applied on the output of sigmoid function. As the output of sigmoid function approximates a binary code matrix, the proposed MCR can efficiently decorrelate the hashing codes. Experiments show the superiority of the proposed method becomes greater as the code length increases.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes