CVJan 27, 2020

Deep NRSfM++: Towards Unsupervised 2D-3D Lifting in the Wild

arXiv:2001.10090v216 citations
AI Analysis

This work improves 3D reconstruction from 2D landmarks for computer vision applications, but it is incremental as it builds on existing learning-based NRSfM methods.

The paper tackles the problem of unsupervised 2D-3D lifting from images by addressing limitations in learning-based non-rigid structure from motion (NRSfM), such as handling perspective cameras and missing points, and achieves state-of-the-art performance on large-scale benchmarks.

The recovery of 3D shape and pose from 2D landmarks stemming from a large ensemble of images can be viewed as a non-rigid structure from motion (NRSfM) problem. Classical NRSfM approaches, however, are problematic as they rely on heuristic priors on the 3D structure (e.g. low rank) that do not scale well to large datasets. Learning-based methods are showing the potential to reconstruct a much broader set of 3D structures than classical methods -- dramatically expanding the importance of NRSfM to atemporal unsupervised 2D to 3D lifting. Hitherto, these learning approaches have not been able to effectively model perspective cameras or handle missing/occluded points -- limiting their applicability to in-the-wild datasets. In this paper, we present a generalized strategy for improving learning-based NRSfM methods to tackle the above issues. Our approach, Deep NRSfM++, achieves state-of-the-art performance across numerous large-scale benchmarks, outperforming both classical and learning-based 2D-3D lifting methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes