CV · Aug 3, 2022

Per-Clip Video Object Segmentation

arXiv:2208.01924v1 · 63 citations · h-index: 32
Originality: Incremental advance
AI Analysis

This work improves accuracy and efficiency for video object segmentation, offering flexibility in speed-accuracy trade-offs, though it is incremental over existing memory-based methods.

The paper tackles video object segmentation by proposing a per-clip inference scheme that processes consecutive frames together, achieving state-of-the-art performance with 84.6% on both YouTube-VOS 2018 and 2019 val, and 91.9% and 86.1% on DAVIS 2016 and 2017 val, respectively.

Recently, memory-based approaches have shown promising results on semi-supervised video object segmentation. These methods predict object masks frame-by-frame with the help of a frequently updated memory of the previous mask. Different from this per-frame inference, we investigate an alternative perspective by treating video object segmentation as clip-wise mask propagation. In this per-clip inference scheme, we update the memory at an interval and simultaneously process a set of consecutive frames (i.e., a clip) between the memory updates. The scheme provides two potential benefits: an accuracy gain from clip-level optimization and an efficiency gain from parallel computation over multiple frames. To this end, we propose a new method tailored for per-clip inference. Specifically, we first introduce a clip-wise operation to refine the features based on intra-clip correlation. In addition, we employ a progressive matching mechanism for efficient information-passing within a clip. With the synergy of the two modules and a newly proposed per-clip-based training, our network achieves state-of-the-art performance on YouTube-VOS 2018/2019 val (84.6% and 84.6%) and DAVIS 2016/2017 val (91.9% and 86.1%). Furthermore, our model offers a favorable speed-accuracy trade-off across varying memory update intervals, which gives considerable flexibility.
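The per-clip scheduling described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the names `split_into_clips`, `per_clip_inference`, and `toy_segment_clip` are hypothetical, frames and masks are plain strings standing in for tensors, and storing only the last frame of each clip in memory is an assumption made for illustration. The sketch only shows the control flow: the memory is updated once per clip, and all frames of a clip are handed to the segmentation step together.

```python
from typing import Callable, List, Sequence, Tuple

Frame = str  # stand-in for an image tensor (assumption for illustration)
Mask = str   # stand-in for a predicted object mask (assumption)
Memory = List[Tuple[Frame, Mask]]


def split_into_clips(num_frames: int, clip_len: int) -> List[range]:
    """Group frame indices 1..num_frames-1 into consecutive clips.

    Frame 0 carries the given annotation, so it is never predicted.
    clip_len plays the role of the memory update interval.
    """
    if clip_len < 1:
        raise ValueError("clip_len must be >= 1")
    return [range(start, min(start + clip_len, num_frames))
            for start in range(1, num_frames, clip_len)]


def per_clip_inference(
    frames: Sequence[Frame],
    first_mask: Mask,
    clip_len: int,
    segment_clip: Callable[[List[Frame], Memory], List[Mask]],
) -> Tuple[List[Mask], int]:
    """Propagate masks clip by clip; return all masks and the memory size."""
    memory: Memory = [(frames[0], first_mask)]
    masks: List[Mask] = [first_mask]
    for clip in split_into_clips(len(frames), clip_len):
        clip_frames = [frames[i] for i in clip]
        # All frames of the clip are processed in one call, which is where
        # a real model could batch them for parallel computation.
        clip_masks = segment_clip(clip_frames, memory)
        masks.extend(clip_masks)
        # Memory is updated once per clip rather than once per frame.
        # Keeping only the clip's last frame is an illustrative assumption.
        memory.append((clip_frames[-1], clip_masks[-1]))
    return masks, len(memory)


def toy_segment_clip(clip_frames: List[Frame], memory: Memory) -> List[Mask]:
    """Hypothetical placeholder for a segmentation network."""
    return [f"mask({frame})" for frame in clip_frames]


if __name__ == "__main__":
    video = [f"f{i}" for i in range(7)]
    masks, memory_size = per_clip_inference(video, "m0", 3, toy_segment_clip)
    print(masks)
    print(memory_size)
```

Setting `clip_len=1` reduces the loop to per-frame inference with a memory update after every frame, while larger values trade fewer memory updates for longer clips, which mirrors the speed-accuracy knob the abstract mentions.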

Code Implementations: 1 repo