CVJan 6, 2022

EM-driven unsupervised learning for efficient motion segmentation

arXiv:2201.02074v351 citations
AI Analysis

This addresses the problem of motion segmentation for computer vision applications without manual annotation, though it is incremental as it builds on existing EM and CNN frameworks.

The paper tackles unsupervised motion segmentation from optical flow by proposing a CNN-based method that uses Expectation-Maximization to design a loss function without ground-truth, achieving strong performance on benchmarks like DAVIS2016 and FBMS59 while being fast at test time.

In this paper, we present a CNN-based fully unsupervised method for motion segmentation from optical flow. We assume that the input optical flow can be represented as a piecewise set of parametric motion models, typically, affine or quadratic motion models. The core idea of our work is to leverage the Expectation-Maximization (EM) framework in order to design in a well-founded manner a loss function and a training procedure of our motion segmentation neural network that does not require either ground-truth or manual annotation. However, in contrast to the classical iterative EM, once the network is trained, we can provide a segmentation for any unseen optical flow field in a single inference step and without estimating any motion models. We investigate different loss functions including robust ones and propose a novel efficient data augmentation technique on the optical flow field, applicable to any network taking optical flow as input. In addition, our method is able by design to segment multiple motions. Our motion segmentation network was tested on four benchmarks, DAVIS2016, SegTrackV2, FBMS59, and MoCA, and performed very well, while being fast at test time.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes