IVCVJun 1

Depth from Dual Differential Defocus and Stereo Consensus

arXiv:2606.0290628.7
Predicted impact top 40% in IV · last 90 daysOriginality Incremental advance
AI Analysis

Enables compact passive binocular rangefinders with substantially smaller form factors for accurate depth estimation in extended working range.

The paper introduces D^3S Consensus, a physics-based algorithm combining depth-from-defocus and stereo to achieve accurate depth estimation beyond the depth-of-field. It achieves 1-cm mean absolute error over 0.3-1.64 m with a 4 mm baseline, outperforming commercial stereo cameras with larger form factors.

We introduce D^3S Consensus, a physics-based, closed-form algorithm that unifies depth-from-defocus (DfD) and stereo to achieve highly accurate depth estimation throughout an extended working range beyond the depth-of-field (DoF) of cameras. Given a pair of dual-defocus stereo images, the method estimates an overdetermined set of depth using a novel DfD theory, Dual Differential Defocus (D^3), and (S)tereo in a coupled fashion. It then picks the most confident depth prediction from the set by enforcing consensus between these physically independent cues to reject unreliable estimates. Analysis shows that D^3S achieves a comparable working range under the same error tolerance with 10x smaller baseline than previous triangulation-based depth estimation systems. This enables compact passive binocular rangefinders with substantially smaller form factors than conventional stereo and DfD designs. We demonstrate the first D^3S prototype with only 4 mm baseline and 12 mm EFL. It generates up to 900 x 1800-pixel depth maps with 1-cm mean absolute error over 0.3-1.64 m from a snapshot acquisition. This has surpassed the reported accuracy of certain commercially available stereo cameras with much larger form factors.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes