LGCVMLMar 15, 2019

Zero Shot Learning with the Isoperimetric Loss

arXiv:1903.06781v25 citations
Originality Highly original
AI Analysis

This addresses the problem of transferring knowledge to unknown classes in computer vision, offering a novel regularization approach that improves performance in zero-shot learning tasks.

The authors tackled zero-shot learning by introducing the isoperimetric loss as a regularization criterion to learn mappings from visual to semantic embeddings, achieving state-of-the-art performance on benchmarks like AwA and CUB with fully automated methods.

We introduce the isoperimetric loss as a regularization criterion for learning the map from a visual representation to a semantic embedding, to be used to transfer knowledge to unknown classes in a zero-shot learning setting. We use a pre-trained deep neural network model as a visual representation of image data, a Word2Vec embedding of class labels, and linear maps between the visual and semantic embedding spaces. However, the spaces themselves are not linear, and we postulate the sample embedding to be populated by noisy samples near otherwise smooth manifolds. We exploit the graph structure defined by the sample points to regularize the estimates of the manifolds by inferring the graph connectivity using a generalization of the isoperimetric inequalities from Riemannian geometry to graphs. Surprisingly, this regularization alone, paired with the simplest baseline model, outperforms the state-of-the-art among fully automated methods in zero-shot learning benchmarks such as AwA and CUB. This improvement is achieved solely by learning the structure of the underlying spaces by imposing regularity.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes