CVNov 18, 2024

FCC: Fully Connected Correlation for Few-Shot Segmentation

arXiv:2411.11917v11 citationsh-index: 19
Originality Incremental advance
AI Analysis

This work addresses the challenge of segmenting objects with limited labeled data, which is crucial for applications in computer vision, but it is incremental as it builds on existing correlation-based methods.

The paper tackles the problem of few-shot segmentation by proposing FCC, a method that integrates pixel-level correlations across all layers of a Vision Transformer encoder to capture target-specific patterns, achieving state-of-the-art performance on benchmarks like PASCAL and COCO.

Few-shot segmentation (FSS) aims to segment the target object in a query image using only a small set of support images and masks. Therefore, having strong prior information for the target object using the support set is essential for guiding the initial training of FSS, which leads to the success of few-shot segmentation in challenging cases, such as when the target object shows considerable variation in appearance, texture, or scale across the support and query images. Previous methods have tried to obtain prior information by creating correlation maps from pixel-level correlation on final-layer or same-layer features. However, we found these approaches can offer limited and partial information when advanced models like Vision Transformers are used as the backbone. Vision Transformer encoders have a multi-layer structure with identical shapes in their intermediate layers. Leveraging the feature comparison from all layers in the encoder can enhance the performance of few-shot segmentation. We introduce FCC (Fully Connected Correlation) to integrate pixel-level correlations between support and query features, capturing associations that reveal target-specific patterns and correspondences in both same-layers and cross-layers. FCC captures previously inaccessible target information, effectively addressing the limitations of support mask. Our approach consistently demonstrates state-of-the-art performance on PASCAL, COCO, and domain shift tests. We conducted an ablation study and cross-layer correlation analysis to validate FCC's core methodology. These findings reveal the effectiveness of FCC in enhancing prior information and overall model performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes