LGMLFeb 11, 2020

Compositional Embeddings for Multi-Label One-Shot Learning

arXiv:2002.04193v516 citations
AI Analysis

This addresses multi-label object recognition with weak supervision, but it is incremental as it builds on prior embedding methods.

The paper tackles multi-label one-shot learning by introducing compositional embedding models that infer sets of classes from images, outperforming existing methods on OmniGlot, Open Images, and COCO datasets.

We present a compositional embedding framework that infers not just a single class per input image, but a set of classes, in the setting of one-shot learning. Specifically, we propose and evaluate several novel models consisting of (1) an embedding function f trained jointly with a "composition" function g that computes set union operations between the classes encoded in two embedding vectors; and (2) embedding f trained jointly with a "query" function h that computes whether the classes encoded in one embedding subsume the classes encoded in another embedding. In contrast to prior work, these models must both perceive the classes associated with the input examples and encode the relationships between different class label sets, and they are trained using only weak one-shot supervision consisting of the label-set relationships among training examples. Experiments on the OmniGlot, Open Images, and COCO datasets show that the proposed compositional embedding models outperform existing embedding methods. Our compositional embedding models have applications to multi-label object recognition for both one-shot and supervised learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes