CVAISep 3, 2025

Unsupervised Instance Segmentation with Superpixels

arXiv:2509.05352v12 citationsh-index: 2Pattern Recognition
Originality Incremental advance
AI Analysis

This addresses the annotation bottleneck for computer vision applications like robotics and autonomous driving, offering an incremental improvement over existing unsupervised methods.

The paper tackles the problem of costly human annotations in instance segmentation by proposing an unsupervised framework that uses superpixels and self-training, achieving state-of-the-art results on public datasets.

Instance segmentation is essential for numerous computer vision applications, including robotics, human-computer interaction, and autonomous driving. Currently, popular models bring impressive performance in instance segmentation by training with a large number of human annotations, which are costly to collect. For this reason, we present a new framework that efficiently and effectively segments objects without the need for human annotations. Firstly, a MultiCut algorithm is applied to self-supervised features for coarse mask segmentation. Then, a mask filter is employed to obtain high-quality coarse masks. To train the segmentation network, we compute a novel superpixel-guided mask loss, comprising hard loss and soft loss, with high-quality coarse masks and superpixels segmented from low-level image features. Lastly, a self-training process with a new adaptive loss is proposed to improve the quality of predicted masks. We conduct experiments on public datasets in instance segmentation and object detection to demonstrate the effectiveness of the proposed framework. The results show that the proposed framework outperforms previous state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes