CVOct 5, 2021

Scaling up instance annotation via label propagation

arXiv:2110.02277v113 citations
Originality Incremental advance
AI Analysis

This addresses the time-consuming and costly process of manual segmentation annotation for building large datasets, offering a scalable solution for computer vision applications.

The paper tackles the problem of efficiently annotating object segmentation masks at scale by proposing a scheme that uses hierarchical clustering and label propagation, achieving a 76x reduction in annotation time and producing 1M masks in 290 hours with quality comparable to manual annotation.

Manually annotating object segmentation masks is very time-consuming. While interactive segmentation methods offer a more efficient alternative, they become unaffordable at a large scale because the cost grows linearly with the number of annotated masks. In this paper, we propose a highly efficient annotation scheme for building large datasets with object segmentation masks. At a large scale, images contain many object instances with similar appearance. We exploit these similarities by using hierarchical clustering on mask predictions made by a segmentation model. We propose a scheme that efficiently searches through the hierarchy of clusters and selects which clusters to annotate. Humans manually verify only a few masks per cluster, and the labels are propagated to the whole cluster. Through a large-scale experiment to populate 1M unlabeled images with object segmentation masks for 80 object classes, we show that (1) we obtain 1M object segmentation masks with an total annotation time of only 290 hours; (2) we reduce annotation time by 76x compared to manual annotation; (3) the segmentation quality of our masks is on par with those from manually annotated datasets. Code, data, and models are available online.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes