CVLGApr 10, 2019

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations

arXiv:1904.05044v3644 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of reducing annotation costs for instance segmentation in computer vision, though it is incremental as it builds on weakly supervised methods.

The paper tackles the problem of instance segmentation using only image-level class labels as supervision, by generating pseudo instance segmentation labels via IRNet, which identifies seed areas and propagates them with boundary detection, achieving state-of-the-art performance on the PASCAL VOC 2012 dataset and surpassing some models with stronger supervision.

This paper presents a novel approach for learning instance segmentation with image-level class labels as supervision. Our approach generates pseudo instance segmentation labels of training images, which are used to train a fully supervised model. For generating the pseudo labels, we first identify confident seed areas of object classes from attention maps of an image classification model, and propagate them to discover the entire instance areas with accurate boundaries. To this end, we propose IRNet, which estimates rough areas of individual instances and detects boundaries between different object classes. It thus enables to assign instance labels to the seeds and to propagate them within the boundaries so that the entire areas of instances can be estimated accurately. Furthermore, IRNet is trained with inter-pixel relations on the attention maps, thus no extra supervision is required. Our method with IRNet achieves an outstanding performance on the PASCAL VOC 2012 dataset, surpassing not only previous state-of-the-art trained with the same level of supervision, but also some of previous models relying on stronger supervision.

Code Implementations8 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes