CVIRDec 12, 2024

Nested Hash Layer: A Plug-and-play Module for Multiple-length Hash Code Learning

arXiv:2412.08922v21 citationsh-index: 40
AI Analysis

This addresses the need for efficient and effective hash code generation in large-scale image retrieval, though it is incremental as it builds on existing deep supervised hashing models.

The paper tackles the trade-off between efficiency and effectiveness in deep supervised hashing for image retrieval by proposing the Nested Hash Layer, a plug-and-play module that generates multiple-length hash codes simultaneously, resulting in a 5-8x training speed improvement and a 3.4% average performance boost.

Deep supervised hashing is essential for efficient storage and search in large-scale image retrieval. Traditional deep supervised hashing models generate single-length hash codes, but this creates a trade-off between efficiency and effectiveness for different code lengths. To find the optimal length for a task, multiple models must be trained, increasing time and computation. Furthermore, relationships between hash codes of different lengths are often ignored. To address these issues, we propose the Nested Hash Layer (NHL), a plug-and-play module for deep supervised hashing models. NHL generates hash codes of multiple lengths simultaneously in a nested structure. To resolve optimization conflicts from multiple learning objectives, we introduce a dominance-aware dynamic weighting strategy to adjust gradients. Additionally, we propose a long-short cascade self-distillation method, where long hash codes guide the learning of shorter ones, improving overall code quality. Experiments indicate that the NHL achieves an overall training speed improvement of approximately 5 to 8 times across various deep supervised hashing models and enhances the average performance of these models by about 3.4%.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes