CVJan 6, 2022

EM-driven unsupervised learning for efficient motion segmentation

Etienne Meunier, Anaïs Badoual, Patrick Bouthemy

arXiv:2201.02074v317.052 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the problem of motion segmentation for computer vision applications without manual annotation, though it is incremental as it builds on existing EM and CNN frameworks.

The paper tackles unsupervised motion segmentation from optical flow by proposing a CNN-based method that uses Expectation-Maximization to design a loss function without ground-truth, achieving strong performance on benchmarks like DAVIS2016 and FBMS59 while being fast at test time.

In this paper, we present a CNN-based fully unsupervised method for motion segmentation from optical flow. We assume that the input optical flow can be represented as a piecewise set of parametric motion models, typically, affine or quadratic motion models. The core idea of our work is to leverage the Expectation-Maximization (EM) framework in order to design in a well-founded manner a loss function and a training procedure of our motion segmentation neural network that does not require either ground-truth or manual annotation. However, in contrast to the classical iterative EM, once the network is trained, we can provide a segmentation for any unseen optical flow field in a single inference step and without estimating any motion models. We investigate different loss functions including robust ones and propose a novel efficient data augmentation technique on the optical flow field, applicable to any network taking optical flow as input. In addition, our method is able by design to segment multiple motions. Our motion segmentation network was tested on four benchmarks, DAVIS2016, SegTrackV2, FBMS59, and MoCA, and performed very well, while being fast at test time.

View on arXiv PDF Code

Similar