LGCVFeb 16, 2017

Discovering objects and their relations from entangled scene representations

arXiv:1702.05068v177 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of structured scene understanding for AI systems, offering a general-purpose method that could enhance tasks requiring object relation reasoning, though it appears incremental as it builds on existing neural network and autoencoder frameworks.

The paper tackles the problem of learning object relations from entangled scene representations by introducing relation networks (RNs), a neural network architecture for object-relation reasoning, and shows that RNs can factorize objects from scene descriptions and deep image representations, with applications in one-shot learning.

Our world can be succinctly and compactly described as structured scenes of objects and relations. A typical room, for example, contains salient objects such as tables, chairs and books, and these objects typically relate to each other by their underlying causes and semantics. This gives rise to correlated features, such as position, function and shape. Humans exploit knowledge of objects and their relations for learning a wide spectrum of tasks, and more generally when learning the structure underlying observed data. In this work, we introduce relation networks (RNs) - a general purpose neural network architecture for object-relation reasoning. We show that RNs are capable of learning object relations from scene description data. Furthermore, we show that RNs can act as a bottleneck that induces the factorization of objects from entangled scene description inputs, and from distributed deep representations of scene images provided by a variational autoencoder. The model can also be used in conjunction with differentiable memory mechanisms for implicit relation discovery in one-shot learning tasks. Our results suggest that relation networks are a potentially powerful architecture for solving a variety of problems that require object relation reasoning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes