CVMar 27, 2016

3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions

arXiv:1603.08182v31187 citations
Originality Incremental advance
AI Analysis

This addresses the problem of 3D reconstruction and object alignment for robotics and computer vision applications, representing a novel method for a known bottleneck.

The paper tackles the challenge of matching local geometric features on noisy, low-resolution, and incomplete 3D scan data by introducing 3DMatch, a data-driven model that learns a local volumetric patch descriptor, which consistently outperforms other state-of-the-art approaches by a significant margin.

Matching local geometric features on real-world depth images is a challenging task due to the noisy, low-resolution, and incomplete nature of 3D scan data. These difficulties limit the performance of current state-of-art methods, which are typically based on histograms over geometric properties. In this paper, we present 3DMatch, a data-driven model that learns a local volumetric patch descriptor for establishing correspondences between partial 3D data. To amass training data for our model, we propose a self-supervised feature learning method that leverages the millions of correspondence labels found in existing RGB-D reconstructions. Experiments show that our descriptor is not only able to match local geometry in new scenes for reconstruction, but also generalize to different tasks and spatial scales (e.g. instance-level object model alignment for the Amazon Picking Challenge, and mesh surface correspondence). Results show that 3DMatch consistently outperforms other state-of-the-art approaches by a significant margin. Code, data, benchmarks, and pre-trained models are available online at http://3dmatch.cs.princeton.edu

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes