CVDec 17, 2020

SceneFormer: Indoor Scene Generation with Transformers

arXiv:2012.09793v2207 citations
AI Analysis

This work provides a more efficient and flexible method for generating realistic indoor scenes, which is beneficial for applications in virtual reality, interior design, and game development.

This paper addresses indoor scene generation by creating sequences of objects, their locations, and orientations conditioned on room layouts. The model achieves faster scene generation (1.48 seconds per scene, 20% faster than FastSynth) with similar or improved realism, with user studies showing preference over FastSynth scenes 53.9% and 56.7% of the time for bedrooms and living rooms, respectively.

We address the task of indoor scene generation by generating a sequence of objects, along with their locations and orientations conditioned on a room layout. Large-scale indoor scene datasets allow us to extract patterns from user-designed indoor scenes, and generate new scenes based on these patterns. Existing methods rely on the 2D or 3D appearance of these scenes in addition to object positions, and make assumptions about the possible relations between objects. In contrast, we do not use any appearance information, and implicitly learn object relations using the self-attention mechanism of transformers. We show that our model design leads to faster scene generation with similar or improved levels of realism compared to previous methods. Our method is also flexible, as it can be conditioned not only on the room layout but also on text descriptions of the room, using only the cross-attention mechanism of transformers. Our user study shows that our generated scenes are preferred to the state-of-the-art FastSynth scenes 53.9% and 56.7% of the time for bedroom and living room scenes, respectively. At the same time, we generate a scene in 1.48 seconds on average, 20% faster than FastSynth.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes