MLCVLGApr 18, 2017

Unsupervised Learning by Predicting Noise

arXiv:1704.05310v1302 citations
Originality Incremental advance
AI Analysis

It addresses the need for unsupervised learning in computer vision, offering a domain-agnostic solution that is incremental over existing unsupervised methods.

The paper tackles the problem of training deep networks without supervision by introducing a framework that aligns deep features to fixed noise targets, avoiding trivial solutions and feature collapse. The method scales to millions of images and achieves performance on par with state-of-the-art unsupervised methods on ImageNet and Pascal VOC.

Convolutional neural networks provide visual features that perform remarkably well in many computer vision applications. However, training these networks requires significant amounts of supervision. This paper introduces a generic framework to train deep networks, end-to-end, with no supervision. We propose to fix a set of target representations, called Noise As Targets (NAT), and to constrain the deep features to align to them. This domain agnostic approach avoids the standard unsupervised learning issues of trivial solutions and collapsing of features. Thanks to a stochastic batch reassignment strategy and a separable square loss function, it scales to millions of images. The proposed approach produces representations that perform on par with state-of-the-art unsupervised methods on ImageNet and Pascal VOC.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes