CVMay 2, 2021

Subspace Representation Learning for Few-shot Image Classification

arXiv:2105.00379v26 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of few-shot image classification for computer vision researchers, offering an incremental improvement by extending metric learning from vector to subspace representation.

The paper tackles few-shot image classification by proposing a subspace representation learning framework that uses weighted subspace distance and template subspaces to aggregate K-shot information, achieving competitive or superior performance on MiniImageNet, TieredImageNet, and CUB datasets.

In this paper, we propose a subspace representation learning (SRL) framework to tackle few-shot image classification tasks. It exploits a subspace in local CNN feature space to represent an image, and measures the similarity between two images according to a weighted subspace distance (WSD). When K images are available for each class, we develop two types of template subspaces to aggregate K-shot information: the prototypical subspace (PS) and the discriminative subspace (DS). Based on the SRL framework, we extend metric learning based techniques from vector to subspace representation. While most previous works adopted global vector representation, using subspace representation can effectively preserve the spatial structure, and diversity within an image. We demonstrate the effectiveness of the SRL framework on three public benchmark datasets: MiniImageNet, TieredImageNet and Caltech-UCSD Birds-200-2011 (CUB), and the experimental results illustrate competitive/superior performance of our method compared to the previous state-of-the-art.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes