CVLGNov 20, 2019

SSAH: Semi-supervised Adversarial Deep Hashing with Self-paced Hard Sample Generation

arXiv:1911.08688v136 citations
Originality Incremental advance
AI Analysis

This addresses the data scarcity issue in practical web media search applications, representing an incremental advancement over existing GAN-based semi-supervised hashing methods.

The paper tackles the problem of limited labeled data in deep hashing for web media search by proposing SSAH, a semi-supervised method that integrates adversarial hard sample generation and hashing learning, resulting in significant improvements over state-of-the-art models on widely-used and fine-grained datasets.

Deep hashing methods have been proved to be effective and efficient for large-scale Web media search. The success of these data-driven methods largely depends on collecting sufficient labeled data, which is usually a crucial limitation in practical cases. The current solutions to this issue utilize Generative Adversarial Network (GAN) to augment data in semi-supervised learning. However, existing GAN-based methods treat image generations and hashing learning as two isolated processes, leading to generation ineffectiveness. Besides, most works fail to exploit the semantic information in unlabeled data. In this paper, we propose a novel Semi-supervised Self-pace Adversarial Hashing method, named SSAH to solve the above problems in a unified framework. The SSAH method consists of an adversarial network (A-Net) and a hashing network (H-Net). To improve the quality of generative images, first, the A-Net learns hard samples with multi-scale occlusions and multi-angle rotated deformations which compete against the learning of accurate hashing codes. Second, we design a novel self-paced hard generation policy to gradually increase the hashing difficulty of generated samples. To make use of the semantic information in unlabeled ones, we propose a semi-supervised consistent loss. The experimental results show that our method can significantly improve state-of-the-art models on both the widely-used hashing datasets and fine-grained datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes