CVNov 17, 2019

Unsupervised Visual Representation Learning with Increasing Object Shape Bias

arXiv:1911.07272v2
Originality Incremental advance
AI Analysis

This addresses the bottleneck of annotation costs in computer vision research, though it appears incremental as it builds on existing contrastive predictive coding methods.

The paper tackles the problem of high annotation costs in computer vision by proposing an unsupervised learning method based on contrastive predictive coding, which achieves state-of-the-art performance with non-annotated images and enables unlimited training samples for potential universal pre-trained models.

(Very early draft)Traditional supervised learning keeps pushing convolution neural network(CNN) achieving state-of-art performance. However, lack of large-scale annotation data is always a big problem due to the high cost of it, even ImageNet dataset is over-fitted by complex models now. The success of unsupervised learning method represented by the Bert model in natural language processing(NLP) field shows its great potential. And it makes that unlimited training samples becomes possible and the great universal generalization ability changes NLP research direction directly. In this article, we purpose a novel unsupervised learning method based on contrastive predictive coding. Under that, we are able to train model with any non-annotation images and improve model's performance to reach state-of-art performance at the same level of model complexity. Beside that, since the number of training images could be unlimited amplification, an universal large-scale pre-trained computer vision model is possible in the future.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes