CVJun 26, 2025

DeOcc-1-to-3: 3D De-Occlusion from a Single Image via Self-Supervised Multi-View Diffusion

arXiv:2506.21544v211 citationsh-index: 8
Originality Incremental advance
AI Analysis

This addresses the challenge of 3D reconstruction under real-world occlusions for computer vision applications, representing an incremental improvement by adapting existing diffusion models to handle occlusions.

The paper tackles the problem of reconstructing 3D objects from a single occluded image by proposing DeOcc-1-to-3, an end-to-end framework that synthesizes six structurally consistent novel views, enabling reliable 3D reconstruction without prior inpainting or manual annotations, and it introduces a benchmark for occlusion-aware reconstruction.

Reconstructing 3D objects from a single image remains challenging, especially under real-world occlusions. While recent diffusion-based view synthesis models can generate consistent novel views from a single RGB image, they typically assume fully visible inputs and fail when parts of the object are occluded, resulting in degraded 3D reconstruction quality. We propose DeOcc-1-to-3, an end-to-end framework for occlusion-aware multi-view generation that synthesizes six structurally consistent novel views directly from a single occluded image, enabling reliable 3D reconstruction without prior inpainting or manual annotations. Our self-supervised training pipeline leverages occluded-unoccluded image pairs and pseudo-ground-truth views to teach the model structure-aware completion and view consistency. Without modifying the original architecture, we fully fine-tune the view synthesis model to jointly learn completion and multi-view generation. Additionally, we introduce the first benchmark for occlusion-aware reconstruction, covering diverse occlusion levels, object categories, and masking patterns, providing a standardized protocol for future evaluation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes