CVAILGApr 10, 2017

Semantically Consistent Regularization for Zero-Shot Recognition

arXiv:1704.03039v1141 citations
Originality Incremental advance
AI Analysis

This addresses zero-shot learning for computer vision, offering an incremental improvement by combining existing approaches with new regularization techniques.

The paper tackles the problem of zero-shot recognition by exploiting the complementarity between two forms of semantic supervision, proposing a CNN framework with semantic constraints to improve transfer ability. It achieves significant improvements over state-of-the-art on several datasets.

The role of semantics in zero-shot learning is considered. The effectiveness of previous approaches is analyzed according to the form of supervision provided. While some learn semantics independently, others only supervise the semantic subspace explained by training classes. Thus, the former is able to constrain the whole space but lacks the ability to model semantic correlations. The latter addresses this issue but leaves part of the semantic space unsupervised. This complementarity is exploited in a new convolutional neural network (CNN) framework, which proposes the use of semantics as constraints for recognition.Although a CNN trained for classification has no transfer ability, this can be encouraged by learning an hidden semantic layer together with a semantic code for classification. Two forms of semantic constraints are then introduced. The first is a loss-based regularizer that introduces a generalization constraint on each semantic predictor. The second is a codeword regularizer that favors semantic-to-class mappings consistent with prior semantic knowledge while allowing these to be learned from data. Significant improvements over the state-of-the-art are achieved on several datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes