CVApr 4, 2022

Object Permanence Emerges in a Random Walk along Memory

Pavel Tokmakov, Allan Jabri, Jie Li, Adrien Gaidon

arXiv:2204.01784v213.228 citationsh-index: 22

Originality Incremental advance

AI Analysis

This addresses the challenge of localizing occluded objects in computer vision, offering a broadly applicable method with minimal supervision, though it appears incremental as it builds on existing self-supervised techniques.

The paper tackles the problem of learning object permanence under occlusion without direct supervision, proposing a self-supervised objective based on temporal coherence of memory, and shows that the resulting model outperforms existing approaches on multiple datasets.

This paper proposes a self-supervised objective for learning representations that localize objects under occlusion - a property known as object permanence. A central question is the choice of learning signal in cases of total occlusion. Rather than directly supervising the locations of invisible objects, we propose a self-supervised objective that requires neither human annotation, nor assumptions about object dynamics. We show that object permanence can emerge by optimizing for temporal coherence of memory: we fit a Markov walk along a space-time graph of memories, where the states in each time step are non-Markovian features from a sequence encoder. This leads to a memory representation that stores occluded objects and predicts their motion, to better localize them. The resulting model outperforms existing approaches on several datasets of increasing complexity and realism, despite requiring minimal supervision, and hence being broadly applicable.

View on arXiv PDF

Similar