CVApr 2, 2020

Learning Unsupervised Hierarchical Part Decomposition of 3D Objects from a Single RGB Image

arXiv:2004.01176v1130 citations
AI Analysis

This addresses the challenge of understanding 3D object structure for computer vision applications, representing a novel method for a known bottleneck.

The paper tackles the problem of jointly recovering 3D object geometry and hierarchical part decomposition from a single RGB image without part-level supervision, achieving results that facilitate reasoning about 3D geometry on ShapeNet and D-FAUST datasets.

Humans perceive the 3D world as a set of distinct objects that are characterized by various low-level (geometry, reflectance) and high-level (connectivity, adjacency, symmetry) properties. Recent methods based on convolutional neural networks (CNNs) demonstrated impressive progress in 3D reconstruction, even when using a single 2D image as input. However, the majority of these methods focuses on recovering the local 3D geometry of an object without considering its part-based decomposition or relations between parts. We address this challenging problem by proposing a novel formulation that allows to jointly recover the geometry of a 3D object as a set of primitives as well as their latent hierarchical structure without part-level supervision. Our model recovers the higher level structural decomposition of various objects in the form of a binary tree of primitives, where simple parts are represented with fewer primitives and more complex parts are modeled with more components. Our experiments on the ShapeNet and D-FAUST datasets demonstrate that considering the organization of parts indeed facilitates reasoning about 3D geometry.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes