CVJan 25, 2023

Flow-guided Semi-supervised Video Object Segmentation

Yushan Zhang, Andreas Robinson, Maria Magnusson, Michael Felsberg

arXiv:2301.10492v12.81 citationsh-index: 55

Originality Incremental advance

AI Analysis

This work addresses video object segmentation for computer vision applications, but it is incremental as it builds on existing encoder-decoder approaches with a novel integration method.

The paper tackles semi-supervised video object segmentation by integrating optical flow with an attention mechanism, achieving state-of-the-art performance on DAVIS 2017 and YouTube-VOS 2019 datasets with strong performance gains.

We propose an optical flow-guided approach for semi-supervised video object segmentation. Optical flow is usually exploited as additional guidance information in unsupervised video object segmentation. However, its relevance in semi-supervised video object segmentation has not been fully explored. In this work, we follow an encoder-decoder approach to address the segmentation task. A model to extract the combined information from optical flow and the image is proposed, which is then used as input to the target model and the decoder network. Unlike previous methods where concatenation is used to integrate information from image data and optical flow, a simple yet effective attention mechanism is exploited in our work. Experiments on DAVIS 2017 and YouTube-VOS 2019 show that by integrating the information extracted from optical flow into the original image branch results in a strong performance gain and our method achieves state-of-the-art performance.

View on arXiv PDF

Similar