LGCVROMLJul 24, 2019

Higher-Order Function Networks for Learning Composable 3D Object Representations

arXiv:1907.10388v223 citations
Originality Incremental advance
AI Analysis

This addresses the problem of efficient 3D object learning for computer vision applications, but it appears incremental as it builds on existing neural representation methods.

The paper tackles the problem of 3D object representation by encoding geometry into a small mapping network, achieving reconstruction accuracy equal to or exceeding state-of-the-art methods with orders of magnitude fewer parameters, such as 7,000 parameters versus millions.

We present a new approach to 3D object representation where a neural network encodes the geometry of an object directly into the weights and biases of a second 'mapping' network. This mapping network can be used to reconstruct an object by applying its encoded transformation to points randomly sampled from a simple geometric space, such as the unit sphere. We study the effectiveness of our method through various experiments on subsets of the ShapeNet dataset. We find that the proposed approach can reconstruct encoded objects with accuracy equal to or exceeding state-of-the-art methods with orders of magnitude fewer parameters. Our smallest mapping network has only about 7000 parameters and shows reconstruction quality on par with state-of-the-art object decoder architectures with millions of parameters. Further experiments on feature mixing through the composition of learned functions show that the encoding captures a meaningful subspace of objects.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes