CV AIAug 22, 2020

ScribbleBox: Interactive Annotation Framework for Video Object Segmentation

Bowen Chen, Huan Ling, Xiaohui Zeng, Gao Jun, Ziyue Xu, Sanja Fidler

arXiv:2008.09721v19.126 citations

Originality Incremental advance

AI Analysis

This addresses the efficiency issue for researchers and practitioners in video object segmentation by providing a faster annotation tool, though it is incremental as it builds on existing interactive methods.

The paper tackles the time-consuming problem of manually labeling video datasets for segmentation by introducing ScribbleBox, an interactive framework that splits annotation into tracked boxes and masks, achieving 88.92% J&F on DAVIS2017 with 9.14 clicks per box track and 4 frames of scribble annotation.

Manually labeling video datasets for segmentation tasks is extremely time consuming. In this paper, we introduce ScribbleBox, a novel interactive framework for annotating object instances with masks in videos. In particular, we split annotation into two steps: annotating objects with tracked boxes, and labeling masks inside these tracks. We introduce automation and interaction in both steps. Box tracks are annotated efficiently by approximating the trajectory using a parametric curve with a small number of control points which the annotator can interactively correct. Our approach tolerates a modest amount of noise in the box placements, thus typically only a few clicks are needed to annotate tracked boxes to a sufficient accuracy. Segmentation masks are corrected via scribbles which are efficiently propagated through time. We show significant performance gains in annotation efficiency over past work. We show that our ScribbleBox approach reaches 88.92% J&F on DAVIS2017 with 9.14 clicks per box track, and 4 frames of scribble annotation.

View on arXiv PDF

Similar