CVCGLGMar 19, 2020

Local Implicit Grid Representations for 3D Scenes

arXiv:2003.08981v1612 citations
Originality Highly original
AI Analysis

This addresses the problem of reconstructing complex indoor 3D scenes from partial data for applications in computer vision and graphics, representing a novel method for a known bottleneck.

The paper tackles the lack of shape priors for indoor 3D scenes by introducing Local Implicit Grid Representations, a scalable representation that uses an autoencoder to learn embeddings of local shape crops and optimizes latent codes on a grid for reconstruction, demonstrating significantly better results in 3D surface reconstruction from sparse point observations.

Shape priors learned from data are commonly used to reconstruct 3D objects from partial or noisy data. Yet no such shape priors are available for indoor scenes, since typical 3D autoencoders cannot handle their scale, complexity, or diversity. In this paper, we introduce Local Implicit Grid Representations, a new 3D shape representation designed for scalability and generality. The motivating idea is that most 3D surfaces share geometric details at some scale -- i.e., at a scale smaller than an entire object and larger than a small patch. We train an autoencoder to learn an embedding of local crops of 3D shapes at that size. Then, we use the decoder as a component in a shape optimization that solves for a set of latent codes on a regular grid of overlapping crops such that an interpolation of the decoded local shapes matches a partial or noisy observation. We demonstrate the value of this proposed approach for 3D surface reconstruction from sparse point observations, showing significantly better results than alternative approaches.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes