CVJun 23, 2024

LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control

arXiv:2406.16038v310 citations
Originality Incremental advance
AI Analysis

This work advances interactive scene reconstruction for applications in robotics or virtual reality by scaling object-level reconstruction to complex scenes, though it appears incremental as it builds on existing radiance field methods.

The paper tackles the challenge of inaccurate interactive motion recovery in complex scenes by proposing LiveScene, a scene-level language-embedded interactive radiance field that efficiently reconstructs and controls multiple objects, demonstrating significant superiority in novel view synthesis, interactive scene control, and language grounding performance.

This paper scales object-level reconstruction to complex scenes, advancing interactive scene reconstruction. We introduce two datasets, OmniSim and InterReal, featuring 28 scenes with multiple interactive objects. To tackle the challenge of inaccurate interactive motion recovery in complex scenes, we propose LiveScene, a scene-level language-embedded interactive radiance field that efficiently reconstructs and controls multiple objects. By decomposing the interactive scene into local deformable fields, LiveScene enables separate reconstruction of individual object motions, reducing memory consumption. Additionally, our interaction-aware language embedding localizes individual interactive objects, allowing for arbitrary control using natural language. Our approach demonstrates significant superiority in novel view synthesis, interactive scene control, and language grounding performance through extensive experiments. Project page: https://livescenes.github.io.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes