LGMLMay 11, 2019

Ranking-based Deep Cross-modal Hashing

arXiv:1905.04450v182 citations
Originality Incremental advance
AI Analysis

This work improves cross-modal retrieval efficiency for applications like multimedia search, but it is incremental as it builds on existing hashing methods with enhancements.

The paper tackles the problem of cross-modal hashing for multi-modal data retrieval by addressing limitations in feature compatibility and handling complex ranking semantics, resulting in a method that outperforms baselines and achieves state-of-the-art performance in experiments.

Cross-modal hashing has been receiving increasing interests for its low storage cost and fast query speed in multi-modal data retrievals. However, most existing hashing methods are based on hand-crafted or raw level features of objects, which may not be optimally compatible with the coding process. Besides, these hashing methods are mainly designed to handle simple pairwise similarity. The complex multilevel ranking semantic structure of instances associated with multiple labels has not been well explored yet. In this paper, we propose a ranking-based deep cross-modal hashing approach (RDCMH). RDCMH firstly uses the feature and label information of data to derive a semi-supervised semantic ranking list. Next, to expand the semantic representation power of hand-crafted features, RDCMH integrates the semantic ranking information into deep cross-modal hashing and jointly optimizes the compatible parameters of deep feature representations and of hashing functions. Experiments on real multi-modal datasets show that RDCMH outperforms other competitive baselines and achieves the state-of-the-art performance in cross-modal retrieval applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes