AI | CV | Nov 24, 2022

PuzzleFusion: Unleashing the Power of Diffusion Models for Spatial Puzzle Solving

arXiv:2211.13785v3 | 27 citations | h-index: 40
Originality: Incremental advance
AI Analysis

The paper addresses spatial puzzle solving, with applications such as room layout alignment. The advance appears incremental, since it applies existing diffusion models to a new task.

The paper tackles spatial puzzle solving tasks like jigsaw puzzles and room arrangement by proposing PuzzleFusion, an end-to-end neural architecture based on Diffusion Models, which outperforms competing methods by significant margins in evaluations.

This paper presents an end-to-end neural architecture based on Diffusion Models for spatial puzzle solving, particularly jigsaw puzzle and room arrangement tasks. In the latter task, for instance, the proposed system "PuzzleFusion" takes a set of room layouts as polygonal curves in the top-down view and aligns the room layout pieces by estimating their 2D translations and rotations, akin to solving a jigsaw puzzle of room layouts. A surprising discovery of the paper is that the simple use of a Diffusion Model effectively solves these challenging spatial puzzle tasks as a conditional generation process. To enable learning of an end-to-end neural system, the paper introduces new datasets with ground-truth arrangements: 1) the 2D Voronoi jigsaw dataset, a synthetic one where pieces are generated by the Voronoi diagram of a 2D point set; and 2) the MagicPlan dataset, a real one offered by MagicPlan from its production pipeline, where pieces are room layouts constructed by real-estate consumers using an augmented reality app. The qualitative and quantitative evaluations demonstrate that the proposed approach outperforms the competing methods by significant margins in all the tasks.
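To make the "2D translations and rotations" concrete, here is a minimal sketch of how one estimated pose could be applied to one polygonal piece. It is not the paper's code: the function name `transform_piece`, the example room, and the convention of rotating about the origin before translating are all assumptions chosen for illustration.

```python
import math

def transform_piece(polygon, theta, tx, ty):
    """Rotate a polygon by theta radians about the origin, then translate by (tx, ty).

    `polygon` is a list of (x, y) vertices. The rotate-then-translate order
    is an assumption of this sketch, not something the abstract specifies.
    """
    c, s = math.cos(theta), math.sin(theta)
    return [(c * x - s * y + tx, s * x + c * y + ty) for x, y in polygon]

# Hypothetical room layout: a unit square in the top-down view.
room = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]

# Place the piece with a 90-degree rotation and a shift of 2 units along x.
placed = transform_piece(room, math.pi / 2, 2.0, 0.0)
```

Under this reading, a solver such as the one described would output one `(theta, tx, ty)` triple per piece, and a full arrangement is obtained by transforming every piece with its own triple.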

