CVJul 15, 2019

AugLabel: Exploiting Word Representations to Augment Labels for Face Attribute Classification

arXiv:1907.06757v15 citations
Originality Incremental advance
AI Analysis

This addresses data scarcity in computer vision for face attribute classification, offering an incremental improvement over existing augmentation methods.

The paper tackles the problem of limited annotated data for face attribute classification by proposing a method to augment labels using word2vec representations, which improved baseline performance and reduced the need for annotated data by up to 50% on CelebA and LFWA datasets.

Augmenting data in image space (eg. flipping, cropping etc) and activation space (eg. dropout) are being widely used to regularise deep neural networks and have been successfully applied on several computer vision tasks. Unlike previous works, which are mostly focused on doing augmentation in the aforementioned domains, we propose to do augmentation in label space. In this paper, we present a novel method to generate fixed dimensional labels with continuous values for images by exploiting the word2vec representations of the existing categorical labels. We then append these representations with existing categorical labels and train the model. We validated our idea on two challenging face attribute classification data sets viz. CelebA and LFWA. Our extensive experiments show that the augmented labels improve the performance of the competitive deep learning baseline and reduce the need of annotated real data up to 50%, while attaining a performance similar to the state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes