CVMar 21, 2021

Multi-level Metric Learning for Few-shot Image Recognition

arXiv:2103.11383v426 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the problem of limited data in image recognition for AI applications, offering an incremental improvement over existing methods.

The paper tackles few-shot image recognition by proposing a multi-level metric learning method that combines pixel-level, part-level, and global-level features to improve classification, achieving state-of-the-art results on popular datasets.

Few-shot learning is devoted to training a model on few samples. Most of these approaches learn a model based on a pixel-level or global-level feature representation. However, using global features may lose local information, and using pixel-level features may lose the contextual semantics of the image. Moreover, such works can only measure the relations between them on a single level, which is not comprehensive and effective. And if query images can simultaneously be well classified via three distinct level similarity metrics, the query images within a class can be more tightly distributed in a smaller feature space, generating more discriminative feature maps. Motivated by this, we propose a novel Part-level Embedding Adaptation with Graph (PEAG) method to generate task-specific features. Moreover, a Multi-level Metric Learning (MML) method is proposed, which not only calculates the pixel-level similarity but also considers the similarity of part-level features and global-level features. Extensive experiments on popular few-shot image recognition datasets prove the effectiveness of our method compared with the state-of-the-art methods. Our code is available at \url{https://github.com/chenhaoxing/M2L}.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes