CV · May 29

αDepth: Learning Single-Pass Soft Boundary Decomposition for Stereo Conversion

arXiv:2606.00386 · 81.0 · h-index: 16
Predicted impact: top 27% in CV · last 90 days · Originality: Highly original
AI Analysis

This work addresses the challenge of modeling soft boundaries in stereo conversion for computer vision applications, offering an automated solution that outperforms existing methods.

αDepth introduces a layered representation that decomposes soft boundaries for high-fidelity stereo conversion, achieving state-of-the-art performance by eliminating background bleeding and structural distortions.

Accurately modeling soft boundaries, e.g., hair and defocus blur, is a fundamental challenge in stereo conversion due to the ambiguous blending of foreground and background. Existing depth models primarily predict single-layer depth, leading to ambiguity in depth correspondence at soft boundaries. While matting techniques can capture opacity for layered modeling, they often struggle in complex scenes with multiple targets and usually require user intervention. This paper introduces αDepth, a layered representation that decomposes soft boundaries for high-fidelity stereo conversion. Specifically, we first resolve mixed color and depth ambiguity by estimating layered color and depth values at soft boundaries. Considering complex multi-target scenes, we design a Circular Alpha Representation (CAR) that shifts the paradigm from global target extraction to local boundary decomposition. Unlike prior matting methods restricted to a single foreground/background, CAR enables efficient scene-level inference without manual guidance. Extensive evaluations demonstrate that αDepth achieves state-of-the-art performance in stereo conversion, eliminating background bleeding and structural distortions at soft boundaries.
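To make the ambiguity at soft boundaries concrete, the sketch below compares two ways of synthesizing a second view on a one-dimensional scanline. It is a toy illustration only, not the αDepth method or its Circular Alpha Representation: the function names, the integer disparities, and the six-pixel scene are all hypothetical. It assumes the standard compositing model I = αF + (1 − α)B, where a mixed pixel blends a foreground color F and a background color B with opacity α.

```python
# Toy 1-D illustration (hypothetical, not the paper's implementation):
# why warping a mixed pixel with a single depth causes background bleeding,
# and how warping foreground and background layers separately avoids it.

def composite(alpha, fg, bg):
    """Standard alpha compositing for one pixel: I = alpha*F + (1 - alpha)*B."""
    return alpha * fg + (1.0 - alpha) * bg

def shift(row, disparity, fill):
    """Shift a scanline right by an integer disparity, padding with `fill`."""
    out = [fill] * len(row)
    for x, value in enumerate(row):
        if 0 <= x + disparity < len(row):
            out[x + disparity] = value
    return out

def single_layer_view(image, disparity):
    """Forward-warp mixed pixels using one disparity per pixel.

    Pixels with larger disparity (nearer) are drawn last so they win.
    Disoccluded pixels simply keep their source value (naive hole filling).
    """
    out = list(image)
    for x in sorted(range(len(image)), key=lambda i: disparity[i]):
        target = x + disparity[x]
        if 0 <= target < len(image):
            out[target] = image[x]
    return out

def layered_view(alpha, fg, bg, d_fg, d_bg):
    """Warp each layer with its own disparity, then composite again."""
    alpha_w = shift(alpha, d_fg, 0.0)
    fg_w = shift(fg, d_fg, 0.0)
    bg_w = shift(bg, d_bg, 0.0)
    return [composite(a, f, b) for a, f, b in zip(alpha_w, fg_w, bg_w)]

# Scene: dark background (0.0) on the left, bright background (1.0) on the right.
bg = [0.0, 0.0, 0.0, 1.0, 1.0, 1.0]
fg = [0.5] * 6                            # foreground color (e.g. a hair strand)
alpha = [0.0, 0.0, 0.5, 0.0, 0.0, 0.0]    # one half-transparent pixel at x = 2

image = [composite(a, f, b) for a, f, b in zip(alpha, fg, bg)]

# The foreground moves by 1 pixel between views; the background does not move.
single = single_layer_view(image, disparity=[0, 0, 1, 0, 0, 0])
layered = layered_view(alpha, fg, bg, d_fg=1, d_bg=0)

print(single)   # [0.0, 0.0, 0.25, 0.25, 1.0, 1.0]
print(layered)  # [0.0, 0.0, 0.0, 0.75, 1.0, 1.0]
```

In the single-layer result, the pixel at x = 3 keeps the value 0.25, which still contains the dark background it was originally blended with, even though it now sits over the bright background. The layered result recomposites the shifted foreground over the correct background and yields 0.75. This is the kind of background bleeding that a layered color and depth estimate at soft boundaries is meant to remove.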

