CVMar 29

Poppy: Polarization-based Plug-and-Play Guidance for Enhancing Monocular Normal Estimation

arXiv:2603.2789155.7h-index: 9
Predicted impact top 63% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For computer vision practitioners, Poppy provides a training-free method to improve normal estimation on challenging surfaces using polarization, without retraining.

Monocular normal estimators fail on reflective, textureless, and dark surfaces. Poppy uses single-shot polarization to refine normals from any frozen RGB backbone at test time, reducing mean angular error by 23-26% on synthetic and 6-16% on real data.

Monocular surface normal estimators trained on large-scale RGB-normal data often perform poorly in the edge cases of reflective, textureless, and dark surfaces. Polarization encodes surface orientation independently of texture and albedo, offering a physics-based complement for these cases. Existing polarization methods, however, require multi-view capture or specialized training data, limiting generalization. We introduce Poppy, a training-free framework that refines normals from any frozen RGB backbone using single-shot polarization measurements at test time. Keeping backbone weights frozen, Poppy optimizes per-pixel offsets to the input RGB and output normal along with a learned reflectance decomposition. A differentiable rendering layer converts the refined normals into polarization predictions and penalizes mismatches with the observed signal. Across seven benchmarks and three backbone architectures (diffusion, flow, and feed-forward), Poppy reduces mean angular error by 23-26% on synthetic data and 6-16% on real data. These results show that guiding learned RGB-based normal estimators with polarization cues at test time refines normals on challenging surfaces without retraining.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes