LGAINEFeb 28, 2018

Relational Neural Expectation Maximization: Unsupervised Discovery of Objects and their Interactions

arXiv:1802.10353v1300 citations
Originality Incremental advance
AI Analysis

It addresses the problem of learning common-sense physical reasoning without supervision for intelligent agents, representing an incremental advance in unsupervised neural methods.

The paper tackles unsupervised learning of object discovery and physical interaction modeling from raw visual images, demonstrating superior performance on bouncing ball videos and the ability to handle occlusion and extrapolate to scenes with varying object counts.

Common-sense physical reasoning is an essential ingredient for any intelligent agent operating in the real-world. For example, it can be used to simulate the environment, or to infer the state of parts of the world that are currently unobserved. In order to match real-world conditions this causal knowledge must be learned without access to supervised data. To address this problem we present a novel method that learns to discover objects and model their physical interactions from raw visual images in a purely \emph{unsupervised} fashion. It incorporates prior knowledge about the compositional nature of human perception to factor interactions between object-pairs and learn efficiently. On videos of bouncing balls we show the superior modelling capabilities of our method compared to other unsupervised neural approaches that do not incorporate such prior knowledge. We demonstrate its ability to handle occlusion and show that it can extrapolate learned knowledge to scenes with different numbers of objects.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes