CVGRMMOct 28, 2022

NeRFPlayer: A Streamable Dynamic Scene Representation with Decomposed Neural Radiance Fields

ETH Zurich
arXiv:2210.15947v2215 citationsh-index: 94
AI Analysis

This enables free visual exploration in VR for dynamic scenes captured with minimal camera setups, representing an incremental improvement over existing methods.

The paper tackles the problem of efficiently reconstructing and rendering dynamic 4D scenes from few or single RGB cameras, achieving comparable or superior rendering quality and speed to state-of-the-art methods with reconstruction in 10 seconds per frame and interactive rendering.

Visually exploring in a real-world 4D spatiotemporal space freely in VR has been a long-term quest. The task is especially appealing when only a few or even single RGB cameras are used for capturing the dynamic scene. To this end, we present an efficient framework capable of fast reconstruction, compact modeling, and streamable rendering. First, we propose to decompose the 4D spatiotemporal space according to temporal characteristics. Points in the 4D space are associated with probabilities of belonging to three categories: static, deforming, and new areas. Each area is represented and regularized by a separate neural field. Second, we propose a hybrid representations based feature streaming scheme for efficiently modeling the neural fields. Our approach, coined NeRFPlayer, is evaluated on dynamic scenes captured by single hand-held cameras and multi-camera arrays, achieving comparable or superior rendering performance in terms of quality and speed comparable to recent state-of-the-art methods, achieving reconstruction in 10 seconds per frame and interactive rendering.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes