CVSep 18, 2024

Depth Estimation Based on 3D Gaussian Splatting Siamese Defocus

arXiv:2409.12323v21 citationsh-index: 3
Originality Incremental advance
AI Analysis

This addresses a practical limitation in Depth from Defocus methods for real-world applications, offering an incremental improvement by eliminating the need for All-In-Focus images.

The paper tackles the problem of monocular depth estimation from defocused images by proposing a self-supervised framework based on 3D Gaussian splatting and Siamese networks, which predicts defocus maps and depth without requiring All-In-Focus images, and demonstrates effectiveness on synthetic and real datasets.

Depth estimation is a fundamental task in 3D geometry. While stereo depth estimation can be achieved through triangulation methods, it is not as straightforward for monocular methods, which require the integration of global and local information. The Depth from Defocus (DFD) method utilizes camera lens models and parameters to recover depth information from blurred images and has been proven to perform well. However, these methods rely on All-In-Focus (AIF) images for depth estimation, which is nearly impossible to obtain in real-world applications. To address this issue, we propose a self-supervised framework based on 3D Gaussian splatting and Siamese networks. By learning the blur levels at different focal distances of the same scene in the focal stack, the framework predicts the defocus map and Circle of Confusion (CoC) from a single defocused image, using the defocus map as input to DepthNet for monocular depth estimation. The 3D Gaussian splatting model renders defocused images using the predicted CoC, and the differences between these and the real defocused images provide additional supervision signals for the Siamese Defocus self-supervised network. This framework has been validated on both artificially synthesized and real blurred datasets. Subsequent quantitative and visualization experiments demonstrate that our proposed framework is highly effective as a DFD method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes