CVApr 6, 2017

Semantically-Guided Video Object Segmentation

Sergi Caelles, Yuhua Chen, Jordi Pont-Tuset, Luc Van Gool

arXiv:1704.01926v23.813 citations

Originality Incremental advance

AI Analysis

This addresses the problem of segmenting objects in videos with appearance changes for computer vision applications, representing an incremental advance.

The paper tackles semi-supervised video object segmentation by introducing a semantic prior to guide appearance models, improving state-of-the-art results on two datasets with metrics like running in half a second per frame.

This paper tackles the problem of semi-supervised video object segmentation, that is, segmenting an object in a sequence given its mask in the first frame. One of the main challenges in this scenario is the change of appearance of the objects of interest. Their semantics, on the other hand, do not vary. This paper investigates how to take advantage of such invariance via the introduction of a semantic prior that guides the appearance model. Specifically, given the segmentation mask of the first frame of a sequence, we estimate the semantics of the object of interest, and propagate that knowledge throughout the sequence to improve the results based on an appearance model. We present Semantically-Guided Video Object Segmentation (SGV), which improves results over previous state of the art on two different datasets using a variety of evaluation metrics, while running in half a second per frame.

View on arXiv PDF

Similar