CVAIFeb 20, 2025

FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models

arXiv:2502.14940v11 citationsh-index: 37Has CodeISPRS Ann Photogramm Remote Sens Spat Inf Sci
Originality Incremental advance
AI Analysis

This work addresses a domain-specific challenge in robotics, geoinformatics, and computer vision for creating high-detail 3D building models, but it is incremental as it builds on existing diffusion models and inpainting techniques.

The paper tackles the problem of incomplete 2D conflict maps for building facades due to laser scanning obstacles by introducing FacaDiffy, a method that uses a personalized Stable Diffusion model to inpaint unseen parts, resulting in a 22% increase in detection rate for 3D semantic building reconstruction.

High-detail semantic 3D building models are frequently utilized in robotics, geoinformatics, and computer vision. One key aspect of creating such models is employing 2D conflict maps that detect openings' locations in building facades. Yet, in reality, these maps are often incomplete due to obstacles encountered during laser scanning. To address this challenge, we introduce FacaDiffy, a novel method for inpainting unseen facade parts by completing conflict maps with a personalized Stable Diffusion model. Specifically, we first propose a deterministic ray analysis approach to derive 2D conflict maps from existing 3D building models and corresponding laser scanning point clouds. Furthermore, we facilitate the inpainting of unseen facade objects into these 2D conflict maps by leveraging the potential of personalizing a Stable Diffusion model. To complement the scarcity of real-world training data, we also develop a scalable pipeline to produce synthetic conflict maps using random city model generators and annotated facade images. Extensive experiments demonstrate that FacaDiffy achieves state-of-the-art performance in conflict map completion compared to various inpainting baselines and increases the detection rate by $22\%$ when applying the completed conflict maps for high-definition 3D semantic building reconstruction. The code is be publicly available in the corresponding GitHub repository: https://github.com/ThomasFroech/InpaintingofUnseenFacadeObjects

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes