CVAIGRNov 21, 2023

Intrinsic Image Decomposition via Ordinal Shading

arXiv:2311.12792v174 citationsh-index: 11
Originality Incremental advance
AI Analysis

This work addresses a fundamental mid-level vision problem crucial for inverse rendering and computational photography, offering incremental improvements for researchers and practitioners in these fields.

The paper tackles the under-constrained problem of intrinsic image decomposition by proposing a method that uses dense ordinal shading and a two-network approach to generate high-resolution decompositions, achieving state-of-the-art results in qualitative and quantitative analyses and enabling practical editing tasks like recoloring and relighting.

Intrinsic decomposition is a fundamental mid-level vision problem that plays a crucial role in various inverse rendering and computational photography pipelines. Generating highly accurate intrinsic decompositions is an inherently under-constrained task that requires precisely estimating continuous-valued shading and albedo. In this work, we achieve high-resolution intrinsic decomposition by breaking the problem into two parts. First, we present a dense ordinal shading formulation using a shift- and scale-invariant loss in order to estimate ordinal shading cues without restricting the predictions to obey the intrinsic model. We then combine low- and high-resolution ordinal estimations using a second network to generate a shading estimate with both global coherency and local details. We encourage the model to learn an accurate decomposition by computing losses on the estimated shading as well as the albedo implied by the intrinsic model. We develop a straightforward method for generating dense pseudo ground truth using our model's predictions and multi-illumination data, enabling generalization to in-the-wild imagery. We present an exhaustive qualitative and quantitative analysis of our predicted intrinsic components against state-of-the-art methods. Finally, we demonstrate the real-world applicability of our estimations by performing otherwise difficult editing tasks such as recoloring and relighting.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes