CVNov 17, 2018

RelationNet2: Deep Comparison Columns for Few-Shot Learning

arXiv:1811.07100v318 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of scaling visual recognition to new classes with limited data for researchers in computer vision, representing an incremental improvement over existing metric-based methods.

The paper tackles few-shot learning by proposing a deep comparison network that uses non-linear feature comparisons at multiple levels and represents images as distributions to reduce overfitting, achieving excellent performance on miniImageNet and tieredImageNet benchmarks.

Few-shot deep learning is a topical challenge area for scaling visual recognition to open ended growth of unseen new classes with limited labeled examples. A promising approach is based on metric learning, which trains a deep embedding to support image similarity matching. Our insight is that effective general purpose matching requires non-linear comparison of features at multiple abstraction levels. We thus propose a new deep comparison network comprised of embedding and relation modules that learn multiple non-linear distance metrics based on different levels of features simultaneously. Furthermore, to reduce over-fitting and enable the use of deeper embeddings, we represent images as distributions rather than vectors via learning parameterized Gaussian noise regularization. The resulting network achieves excellent performance on both miniImageNet and tieredImageNet.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes