CVMar 17, 2022

Multi-similarity based Hyperrelation Network for few-shot segmentation

arXiv:2203.09550v512 citationsh-index: 9
Originality Incremental advance
AI Analysis

This addresses the problem of segmenting objects from unseen categories with limited supervision for computer vision applications, representing an incremental improvement over existing methods.

The paper tackles few-shot semantic segmentation by proposing a Multi-similarity Hyperrelation Network (MSHNet) that combines Generative Prototype Similarity and cosine similarity to establish robust semantic relationships between support and query images, achieving new state-of-the-art performances on Pascal-5i and COCO-20i datasets for 1-shot and 5-shot tasks.

Few-shot semantic segmentation aims at recognizing the object regions of unseen categories with only a few annotated examples as supervision. The key to few-shot segmentation is to establish a robust semantic relationship between the support and query images and to prevent overfitting. In this paper, we propose an effective Multi-similarity Hyperrelation Network (MSHNet) to tackle the few-shot semantic segmentation problem. In MSHNet, we propose a new Generative Prototype Similarity (GPS), which together with cosine similarity can establish a strong semantic relation between the support and query images. The locally generated prototype similarity based on global feature is logically complementary to the global cosine similarity based on local feature, and the relationship between the query image and the supported image can be expressed more comprehensively by using the two similarities simultaneously. In addition, we propose a Symmetric Merging Block (SMB) in MSHNet to efficiently merge multi-layer, multi-shot and multi-similarity hyperrelational features. MSHNet is built on the basis of similarity rather than specific category features, which can achieve more general unity and effectively reduce overfitting. On two benchmark semantic segmentation datasets Pascal-5i and COCO-20i, MSHNet achieves new state-of-the-art performances on 1-shot and 5-shot semantic segmentation tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes