CV LG MLOct 28, 2019

Geometry-Aware Neural Rendering

Josh Tobin, OpenAI Robotics, Pieter Abbeel

OpenAI

arXiv:1911.04554v112.425 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses a bottleneck in 3D scene understanding for robotics and computer vision, offering an incremental advancement in neural rendering efficiency.

The paper tackles the challenge of scaling neural rendering to complex, high-dimensional scenes by proposing Epipolar Cross Attention (ECA), an efficient geometry-aware attention mechanism that reduces comparisons from O(n^2) to O(n) per dimension, and demonstrates significant improvements in performance on Generative Query Networks using new simulated datasets.

Understanding the 3-dimensional structure of the world is a core challenge in computer vision and robotics. Neural rendering approaches learn an implicit 3D model by predicting what a camera would see from an arbitrary viewpoint. We extend existing neural rendering to more complex, higher dimensional scenes than previously possible. We propose Epipolar Cross Attention (ECA), an attention mechanism that leverages the geometry of the scene to perform efficient non-local operations, requiring only $O(n)$ comparisons per spatial dimension instead of $O(n^2)$. We introduce three new simulated datasets inspired by real-world robotics and demonstrate that ECA significantly improves the quantitative and qualitative performance of Generative Query Networks (GQN).

View on arXiv PDF Code

Similar