CV LG ML
May 22, 2019

Real-time Approximate Bayesian Computation for Scene Understanding

Javier Felip, Nilesh Ahuja, David Gómez-Gutiérrez, Omesh Tickoo, Vikash Mansinghka

arXiv:1905.13307v1 0.9

Originality: Incremental advance

AI Analysis

This work addresses real-time inference for scene understanding in applications such as robotics and autonomous systems. Its contribution is incremental: it combines existing methods with new speed-up techniques.

The paper tackles scene understanding problems, such as predicting human actions and inferring object poses, using Approximate Bayesian Computation with generative models built from simulation software. It reports performance and accuracy measurements indicating that real-time inference is feasible.

Consider scene understanding problems such as predicting where a person is probably reaching, or inferring the pose of 3D objects from depth images, or inferring the probable street crossings of pedestrians at a busy intersection. This paper shows how to solve these problems using Approximate Bayesian Computation. The underlying generative models are built from realistic simulation software, wrapped in a Bayesian error model for the gap between simulation outputs and real data. The simulators are drawn from off-the-shelf computer graphics, video game, and traffic simulation code. The paper introduces two techniques for speeding up inference that can be used separately or in combination. The first is to train neural surrogates of the simulators, using a simple form of domain randomization to make the surrogates more robust to the gap between the simulation and reality. The second is to adaptively discretize the latent variables using a Tree-pyramid approach adapted from computer graphics. This paper also shows performance and accuracy measurements on real-world problems, establishing that it is feasible to solve these problems in real-time.
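The core idea of the abstract, inference by simulating candidate latent states and keeping those whose output is close to the observation, can be illustrated with a minimal rejection-ABC sketch. This is not the paper's implementation: `toy_simulator`, the uniform prior, the scalar observation, and the hard tolerance `epsilon` (standing in for the Bayesian error model) are all assumptions chosen to keep the example short.

```python
import random


def toy_simulator(theta, rng):
    """Hypothetical stand-in for a graphics or traffic simulator.

    Maps a scalar latent state (e.g. an object position) to a scalar
    observation (e.g. a depth reading) with a little simulator noise.
    """
    return 2.0 * theta + 1.0 + rng.gauss(0.0, 0.05)


def abc_rejection(observed, n_samples, epsilon, rng):
    """Rejection ABC: keep prior draws whose simulation lands near the data."""
    accepted = []
    for _ in range(n_samples):
        theta = rng.uniform(-1.0, 1.0)         # draw a latent from the prior
        simulated = toy_simulator(theta, rng)  # run the forward simulator
        # Hard tolerance as a crude substitute for a probabilistic error model.
        if abs(simulated - observed) <= epsilon:
            accepted.append(theta)
    return accepted


rng = random.Random(0)
true_theta = 0.3
observed = 2.0 * true_theta + 1.0  # noiseless observation for the demo

posterior = abc_rejection(observed, n_samples=20000, epsilon=0.1, rng=rng)
posterior_mean = sum(posterior) / len(posterior)
print(f"accepted {len(posterior)} samples, posterior mean ~ {posterior_mean:.3f}")
```

In this toy setting each loop iteration calls the simulator once, which is the cost that the neural surrogates and adaptive discretization described in the abstract are meant to reduce.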
