CVLGAug 19, 2020

Attribute Prototype Network for Zero-Shot Learning

arXiv:2008.08290v4377 citations
AI Analysis

This addresses the problem of transferring knowledge to unseen classes in computer vision, with incremental improvements in attribute-based methods.

The paper tackled zero-shot learning by proposing a framework that integrates attribute localization into image representations, achieving new state-of-the-art results on three benchmarks.

From the beginning of zero-shot learning research, visual attributes have been shown to play an important role. In order to better transfer attribute-based knowledge from known to unknown classes, we argue that an image representation with integrated attribute localization ability would be beneficial for zero-shot learning. To this end, we propose a novel zero-shot representation learning framework that jointly learns discriminative global and local features using only class-level attributes. While a visual-semantic embedding layer learns global features, local features are learned through an attribute prototype network that simultaneously regresses and decorrelates attributes from intermediate features. We show that our locality augmented image representations achieve a new state-of-the-art on three zero-shot learning benchmarks. As an additional benefit, our model points to the visual evidence of the attributes in an image, e.g. for the CUB dataset, confirming the improved attribute localization ability of our image representation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes