CV ROMar 15, 2019

Generate What You Can't See - a View-dependent Image Generation

Karol Piaskowski, Rafal Staszak, Dominik Belter

arXiv:1903.06814v10.95 citations

Originality Incremental advance

AI Analysis

This addresses the time-consuming and often impossible task of acquiring full environmental knowledge for autonomous robots, though it is incremental as it builds on existing deep learning techniques.

The paper tackles the problem of robots needing to fully scan environments to understand objects by proposing a method that generates images of an object from various viewpoints using a single input RGB image, achieving results on both synthetic and real datasets.

In order to operate autonomously, a robot should explore the environment and build a model of each of the surrounding objects. A common approach is to carefully scan the whole workspace. This is time-consuming. It is also often impossible to reach all the viewpoints required to acquire full knowledge about the environment. Humans can perform shape completion of occluded objects by relying on past experience. Therefore, we propose a method that generates images of an object from various viewpoints using a single input RGB image. A deep neural network is trained to imagine the object appearance from many viewpoints. We present the whole pipeline, which takes a single RGB image as input and returns a sequence of RGB and depth images of the object. The method utilizes a CNN-based object detector to extract the object from the natural scene. Then, the proposed network generates a set of RGB and depth images. We show the results both on a synthetic dataset and on real images.

View on arXiv PDF

Similar