CVDec 21, 2018

Learning from Web Data: the Benefit of Unsupervised Object Localization

arXiv:1812.09232v16 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of reducing annotation costs for fine-grained image classification by leveraging web data, but it is incremental as it builds on existing noisy label correction approaches.

The paper tackles the problem of learning from noisy web data for fine-grained image classification by introducing an unsupervised object localization method to reduce distribution gaps between web and standard datasets, achieving favorable performance against state-of-the-art methods in experiments.

Annotating a large number of training images is very time-consuming. In this background, this paper focuses on learning from easy-to-acquire web data and utilizes the learned model for fine-grained image classification in labeled datasets. Currently, the performance gain from training with web data is incremental, like a common saying "better than nothing, but not by much". Conventionally, the community looks to correcting the noisy web labels to select informative samples. In this work, we first systematically study the built-in gap between the web and standard datasets, i.e. different data distributions between the two kinds of data. Then, in addition to using web labels, we present an unsupervised object localization method, which provides critical insights into the object density and scale in web images. Specifically, we design two constraints on web data to substantially reduce the difference of data distributions for the web and standard datasets. First, we present a method to control the scale, localization and number of objects in the detected region. Second, we propose to select the regions containing objects that are consistent with the web tag. Based on the two constraints, we are able to process web images to reduce the gap, and the processed web data is used to better assist the standard dataset to train CNNs. Experiments on several fine-grained image classification datasets confirm that our method performs favorably against the state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes