CVJul 23, 2020

End-to-End Optimization of Scene Layout

arXiv:2007.11744v182 citations
Originality Incremental advance
AI Analysis

This work addresses the need for flexible scene layout generation from various inputs like sentences or images, offering incremental improvements in control and optimization.

The paper tackles the problem of generating diverse scene layouts that satisfy relationships specified in scene graphs, achieving higher accuracy and diversity in conditional scene synthesis compared to existing methods.

We propose an end-to-end variational generative model for scene layout synthesis conditioned on scene graphs. Unlike unconditional scene layout generation, we use scene graphs as an abstract but general representation to guide the synthesis of diverse scene layouts that satisfy relationships included in the scene graph. This gives rise to more flexible control over the synthesis process, allowing various forms of inputs such as scene layouts extracted from sentences or inferred from a single color image. Using our conditional layout synthesizer, we can generate various layouts that share the same structure of the input example. In addition to this conditional generation design, we also integrate a differentiable rendering module that enables layout refinement using only 2D projections of the scene. Given a depth and a semantics map, the differentiable rendering module enables optimizing over the synthesized layout to fit the given input in an analysis-by-synthesis fashion. Experiments suggest that our model achieves higher accuracy and diversity in conditional scene synthesis and allows exemplar-based scene generation from various input forms.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes