CVApr 4, 2022

Object Permanence Emerges in a Random Walk along Memory

arXiv:2204.01784v228 citationsh-index: 22
AI Analysis

This addresses the challenge of localizing occluded objects in computer vision, offering a broadly applicable method with minimal supervision, though it appears incremental as it builds on existing self-supervised techniques.

The paper tackles the problem of learning object permanence under occlusion without direct supervision, proposing a self-supervised objective based on temporal coherence of memory, and shows that the resulting model outperforms existing approaches on multiple datasets.

This paper proposes a self-supervised objective for learning representations that localize objects under occlusion - a property known as object permanence. A central question is the choice of learning signal in cases of total occlusion. Rather than directly supervising the locations of invisible objects, we propose a self-supervised objective that requires neither human annotation, nor assumptions about object dynamics. We show that object permanence can emerge by optimizing for temporal coherence of memory: we fit a Markov walk along a space-time graph of memories, where the states in each time step are non-Markovian features from a sequence encoder. This leads to a memory representation that stores occluded objects and predicts their motion, to better localize them. The resulting model outperforms existing approaches on several datasets of increasing complexity and realism, despite requiring minimal supervision, and hence being broadly applicable.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes