CVNov 6, 2016

Deep Label Distribution Learning with Label Ambiguity

arXiv:1611.01731v2487 citations
Originality Incremental advance
AI Analysis

This addresses label ambiguity in domains like age and pose estimation for computer vision, offering a method to prevent over-fitting with small datasets, though it is incremental as it builds on existing ConvNet frameworks.

The paper tackles the problem of visual recognition tasks where precise labels are hard to obtain by converting labels into distributions and learning them with deep ConvNets, resulting in significantly better results than state-of-the-art methods for age and head pose estimation, with improvements in multi-label classification and semantic segmentation.

Convolutional Neural Networks (ConvNets) have achieved excellent recognition performance in various visual recognition tasks. A large labeled training set is one of the most important factors for its success. However, it is difficult to collect sufficient training images with precise labels in some domains such as apparent age estimation, head pose estimation, multi-label classification and semantic segmentation. Fortunately, there is ambiguous information among labels, which makes these tasks different from traditional classification. Based on this observation, we convert the label of each image into a discrete label distribution, and learn the label distribution by minimizing a Kullback-Leibler divergence between the predicted and ground-truth label distributions using deep ConvNets. The proposed DLDL (Deep Label Distribution Learning) method effectively utilizes the label ambiguity in both feature learning and classifier learning, which help prevent the network from over-fitting even when the training set is small. Experimental results show that the proposed approach produces significantly better results than state-of-the-art methods for age estimation and head pose estimation. At the same time, it also improves recognition performance for multi-label classification and semantic segmentation tasks.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes