CVApr 13, 2021

Crossover Learning for Fast Online Video Instance Segmentation

arXiv:2104.05970v1125 citations
AI Analysis

This work addresses the need for efficient and accurate video instance segmentation, which is crucial for video understanding applications, but it is incremental as it builds on existing online methods.

The authors tackled the problem of fast online video instance segmentation by proposing CrossVIS, which uses a crossover learning scheme for temporal modeling without extra parameters, achieving state-of-the-art performance among online methods on benchmarks like YouTube-VIS-2019, OVIS, and YouTube-VIS-2021 with a good latency-accuracy trade-off.

Modeling temporal visual context across frames is critical for video instance segmentation (VIS) and other video understanding tasks. In this paper, we propose a fast online VIS model named CrossVIS. For temporal information modeling in VIS, we present a novel crossover learning scheme that uses the instance feature in the current frame to pixel-wisely localize the same instance in other frames. Different from previous schemes, crossover learning does not require any additional network parameters for feature enhancement. By integrating with the instance segmentation loss, crossover learning enables efficient cross-frame instance-to-pixel relation learning and brings cost-free improvement during inference. Besides, a global balanced instance embedding branch is proposed for more accurate and more stable online instance association. We conduct extensive experiments on three challenging VIS benchmarks, \ie, YouTube-VIS-2019, OVIS, and YouTube-VIS-2021 to evaluate our methods. To our knowledge, CrossVIS achieves state-of-the-art performance among all online VIS methods and shows a decent trade-off between latency and accuracy. Code will be available to facilitate future research.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes