CVLGMLOct 28, 2019

Geometry-Aware Neural Rendering

OpenAI
arXiv:1911.04554v125 citations
Originality Incremental advance
AI Analysis

This work addresses a bottleneck in 3D scene understanding for robotics and computer vision, offering an incremental advancement in neural rendering efficiency.

The paper tackles the challenge of scaling neural rendering to complex, high-dimensional scenes by proposing Epipolar Cross Attention (ECA), an efficient geometry-aware attention mechanism that reduces comparisons from O(n^2) to O(n) per dimension, and demonstrates significant improvements in performance on Generative Query Networks using new simulated datasets.

Understanding the 3-dimensional structure of the world is a core challenge in computer vision and robotics. Neural rendering approaches learn an implicit 3D model by predicting what a camera would see from an arbitrary viewpoint. We extend existing neural rendering to more complex, higher dimensional scenes than previously possible. We propose Epipolar Cross Attention (ECA), an attention mechanism that leverages the geometry of the scene to perform efficient non-local operations, requiring only $O(n)$ comparisons per spatial dimension instead of $O(n^2)$. We introduce three new simulated datasets inspired by real-world robotics and demonstrate that ECA significantly improves the quantitative and qualitative performance of Generative Query Networks (GQN).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes