CVAug 14, 2020

BriNet: Towards Bridging the Intra-class and Inter-class Gaps in One-Shot Segmentation

arXiv:2008.06226v155 citations
Originality Incremental advance
AI Analysis

This work addresses the generalization challenge in few-shot segmentation for computer vision, offering incremental improvements over existing methods.

The paper tackles the intra-class and inter-class gaps in one-shot segmentation by proposing BriNet, which introduces an Information Exchange Module and a multi-path fine-grained strategy, achieving new state-of-the-art results on PASCAL VOC and MSCOCO datasets.

Few-shot segmentation focuses on the generalization of models to segment unseen object instances with limited training samples. Although tremendous improvements have been achieved, existing methods are still constrained by two factors. (1) The information interaction between query and support images is not adequate, leaving intra-class gap. (2) The object categories at the training and inference stages have no overlap, leaving the inter-class gap. Thus, we propose a framework, BriNet, to bridge these gaps. First, more information interactions are encouraged between the extracted features of the query and support images, i.e., using an Information Exchange Module to emphasize the common objects. Furthermore, to precisely localize the query objects, we design a multi-path fine-grained strategy which is able to make better use of the support feature representations. Second, a new online refinement strategy is proposed to help the trained model adapt to unseen classes, achieved by switching the roles of the query and the support images at the inference stage. The effectiveness of our framework is demonstrated by experimental results, which outperforms other competitive methods and leads to a new state-of-the-art on both PASCAL VOC and MSCOCO dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes