CVAug 26, 2019

Embarrassingly Simple Binary Representation Learning

arXiv:1908.09573v126 citations
AI Analysis

This work addresses the need for simpler and more effective hashing models in machine learning, though it appears incremental as it builds on existing paradigms.

The authors tackled the problem of binary representation learning by proposing a simple classification-based model that avoids complex optimization or auxiliary components, achieving state-of-the-art performance on benchmarks like CIFAR-10, NUS-WIDE, and ImageNet.

Recent binary representation learning models usually require sophisticated binary optimization, similarity measure or even generative models as auxiliaries. However, one may wonder whether these non-trivial components are needed to formulate practical and effective hashing models. In this paper, we answer the above question by proposing an embarrassingly simple approach to binary representation learning. With a simple classification objective, our model only incorporates two additional fully-connected layers onto the top of an arbitrary backbone network, whilst complying with the binary constraints during training. The proposed model lower-bounds the Information Bottleneck (IB) between data samples and their semantics, and can be related to many recent `learning to hash' paradigms. We show that, when properly designed, even such a simple network can generate effective binary codes, by fully exploring data semantics without any held-out alternating updating steps or auxiliary models. Experiments are conducted on conventional large-scale benchmarks, i.e., CIFAR-10, NUS-WIDE, and ImageNet, where the proposed simple model outperforms the state-of-the-art methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes