CVApr 6, 2017

Semantically-Guided Video Object Segmentation

arXiv:1704.01926v213 citations
Originality Incremental advance
AI Analysis

This addresses the problem of segmenting objects in videos with appearance changes for computer vision applications, representing an incremental advance.

The paper tackles semi-supervised video object segmentation by introducing a semantic prior to guide appearance models, improving state-of-the-art results on two datasets with metrics like running in half a second per frame.

This paper tackles the problem of semi-supervised video object segmentation, that is, segmenting an object in a sequence given its mask in the first frame. One of the main challenges in this scenario is the change of appearance of the objects of interest. Their semantics, on the other hand, do not vary. This paper investigates how to take advantage of such invariance via the introduction of a semantic prior that guides the appearance model. Specifically, given the segmentation mask of the first frame of a sequence, we estimate the semantics of the object of interest, and propagate that knowledge throughout the sequence to improve the results based on an appearance model. We present Semantically-Guided Video Object Segmentation (SGV), which improves results over previous state of the art on two different datasets using a variety of evaluation metrics, while running in half a second per frame.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes