CVAIGRAug 19, 2024

Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering

arXiv:2408.09702v128 citationsh-index: 23
Originality Incremental advance
AI Analysis

This addresses the challenge of realistic object insertion for applications in augmented reality and visual effects, representing an incremental improvement over existing methods.

The paper tackles the problem of inserting virtual objects into real-world images with consistent lighting and details by proposing a diffusion-guided inverse rendering method, achieving photorealistic composition in single frames or videos.

The correct insertion of virtual objects in images of real-world scenes requires a deep understanding of the scene's lighting, geometry and materials, as well as the image formation process. While recent large-scale diffusion models have shown strong generative and inpainting capabilities, we find that current models do not sufficiently "understand" the scene shown in a single picture to generate consistent lighting effects (shadows, bright reflections, etc.) while preserving the identity and details of the composited object. We propose using a personalized large diffusion model as guidance to a physically based inverse rendering process. Our method recovers scene lighting and tone-mapping parameters, allowing the photorealistic composition of arbitrary virtual objects in single frames or videos of indoor or outdoor scenes. Our physically based pipeline further enables automatic materials and tone-mapping refinement.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes