CVDec 2, 2021

Recognizing Scenes from Novel Viewpoints

arXiv:2112.01520v18 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of 3D scene understanding for AI agents, allowing efficient interaction with scenes, but it is incremental as it builds on existing implicit 3D representation methods.

The paper tackles the problem of enabling AI agents to recognize scenes from novel viewpoints using only a few input images, achieving this by segmenting scenes into semantic categories without access to target view images.

Humans can perceive scenes in 3D from a handful of 2D views. For AI agents, the ability to recognize a scene from any viewpoint given only a few images enables them to efficiently interact with the scene and its objects. In this work, we attempt to endow machines with this ability. We propose a model which takes as input a few RGB images of a new scene and recognizes the scene from novel viewpoints by segmenting it into semantic categories. All this without access to the RGB images from those views. We pair 2D scene recognition with an implicit 3D representation and learn from multi-view 2D annotations of hundreds of scenes without any 3D supervision beyond camera poses. We experiment on challenging datasets and demonstrate our model's ability to jointly capture semantics and geometry of novel scenes with diverse layouts, object types and shapes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes