CVAug 18, 2020

Category Level Object Pose Estimation via Neural Analysis-by-Synthesis

arXiv:2008.08145v1152 citations
Originality Incremental advance
AI Analysis

This work addresses object pose estimation for robotics and computer vision by enabling category-level pose recovery without instance-specific models, representing an incremental improvement over existing analysis-by-synthesis frameworks.

The paper tackles the problem of object pose estimation by combining gradient-based fitting with a neural image synthesis module that implicitly represents entire object categories, eliminating the need for explicit CAD models per instance. The method recovers object orientation with high accuracy from 2D images and achieves full 6DOF pose recovery when depth measurements are provided.

Many object pose estimation algorithms rely on the analysis-by-synthesis framework which requires explicit representations of individual object instances. In this paper we combine a gradient-based fitting procedure with a parametric neural image synthesis module that is capable of implicitly representing the appearance, shape and pose of entire object categories, thus rendering the need for explicit CAD models per object instance unnecessary. The image synthesis network is designed to efficiently span the pose configuration space so that model capacity can be used to capture the shape and local appearance (i.e., texture) variations jointly. At inference time the synthesized images are compared to the target via an appearance based loss and the error signal is backpropagated through the network to the input parameters. Keeping the network parameters fixed, this allows for iterative optimization of the object pose, shape and appearance in a joint manner and we experimentally show that the method can recover orientation of objects with high accuracy from 2D images alone. When provided with depth measurements, to overcome scale ambiguities, the method can accurately recover the full 6DOF pose successfully.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes