CVNov 12, 2018

Generative Dual Adversarial Network for Generalized Zero-shot Learning

arXiv:1811.04857v4233 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of classifying images from both seen and unseen classes for computer vision researchers, representing an incremental improvement over existing generative methods.

The paper tackles generalized zero-shot learning by proposing a unified framework that integrates visual-to-semantic mapping, semantic-to-visual mapping, and metric learning, achieving higher accuracy on both seen and unseen classes compared to state-of-the-art models.

This paper studies the problem of generalized zero-shot learning which requires the model to train on image-label pairs from some seen classes and test on the task of classifying new images from both seen and unseen classes. Most previous models try to learn a fixed one-directional mapping between visual and semantic space, while some recently proposed generative methods try to generate image features for unseen classes so that the zero-shot learning problem becomes a traditional fully-supervised classification problem. In this paper, we propose a novel model that provides a unified framework for three different approaches: visual-> semantic mapping, semantic->visual mapping, and metric learning. Specifically, our proposed model consists of a feature generator that can generate various visual features given class embeddings as input, a regressor that maps each visual feature back to its corresponding class embedding, and a discriminator that learns to evaluate the closeness of an image feature and a class embedding. All three components are trained under the combination of cyclic consistency loss and dual adversarial loss. Experimental results show that our model not only preserves higher accuracy in classifying images from seen classes, but also performs better than existing state-of-the-art models in in classifying images from unseen classes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes