CVFeb 11, 2019

Towards Segmenting Anything That Moves

arXiv:1902.03715v495 citations
Originality Incremental advance
AI Analysis

This work addresses the need for generic object segmentation in applications like robotics, though it builds incrementally on spatio-temporal grouping methods.

The paper tackles the problem of segmenting moving objects without category restrictions by proposing a learning-based approach that combines motion and appearance cues, achieving state-of-the-art performance on the FBMS dataset and outperforming methods on unseen categories in new benchmarks.

Detecting and segmenting individual objects, regardless of their category, is crucial for many applications such as action detection or robotic interaction. While this problem has been well-studied under the classic formulation of spatio-temporal grouping, state-of-the-art approaches do not make use of learning-based methods. To bridge this gap, we propose a simple learning-based approach for spatio-temporal grouping. Our approach leverages motion cues from optical flow as a bottom-up signal for separating objects from each other. Motion cues are then combined with appearance cues that provide a generic objectness prior for capturing the full extent of objects. We show that our approach outperforms all prior work on the benchmark FBMS dataset. One potential worry with learning-based methods is that they might overfit to the particular type of objects that they have been trained on. To address this concern, we propose two new benchmarks for generic, moving object detection, and show that our model matches top-down methods on common categories, while significantly out-performing both top-down and bottom-up methods on never-before-seen categories.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes