CVMay 10, 2017

Learning 3D Object Categories by Looking Around Them

arXiv:1705.03951v388 citations
Originality Incremental advance
AI Analysis

This addresses the need for scalable 3D object recognition in robotics or AR/VR, though it is incremental as it builds on existing unsupervised and 3D learning methods.

The paper tackles the problem of learning 3D object categories without manual annotations by using videos from moving viewpoints, achieving state-of-the-art results on benchmarks.

Traditional approaches for learning 3D object categories use either synthetic data or manual supervision. In this paper, we propose a method which does not require manual annotations and is instead cued by observing objects from a moving vantage point. Our system builds on two innovations: a Siamese viewpoint factorization network that robustly aligns different videos together without explicitly comparing 3D shapes; and a 3D shape completion network that can extract the full shape of an object from partial observations. We also demonstrate the benefits of configuring networks to perform probabilistic predictions as well as of geometry-aware data augmentation schemes. We obtain state-of-the-art results on publicly-available benchmarks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes