CVAug 10, 2020

Prototype Mixture Models for Few-shot Semantic Segmentation

arXiv:2008.03898v2470 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of segmenting objects with varying appearances in few-shot learning, offering a domain-specific incremental improvement.

The paper tackles the problem of semantic ambiguity in few-shot segmentation by proposing prototype mixture models (PMMs) that use multiple prototypes to represent diverse image regions, resulting in a 5.82% improvement in 5-shot segmentation performance on MS-COCO.

Few-shot segmentation is challenging because objects within the support and query images could significantly differ in appearance and pose. Using a single prototype acquired directly from the support image to segment the query image causes semantic ambiguity. In this paper, we propose prototype mixture models (PMMs), which correlate diverse image regions with multiple prototypes to enforce the prototype-based semantic representation. Estimated by an Expectation-Maximization algorithm, PMMs incorporate rich channel-wised and spatial semantics from limited support images. Utilized as representations as well as classifiers, PMMs fully leverage the semantics to activate objects in the query image while depressing background regions in a duplex manner. Extensive experiments on Pascal VOC and MS-COCO datasets show that PMMs significantly improve upon state-of-the-arts. Particularly, PMMs improve 5-shot segmentation performance on MS-COCO by up to 5.82\% with only a moderate cost for model size and inference speed.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes