CVMar 29

Poppy: Polarization-based Plug-and-Play Guidance for Enhancing Monocular Normal Estimation

Irene Kim, Sai Tanmay Reddy Chakkera, Alexandros Graikos, Dimitris Samaras, Akshat Dave

arXiv:2603.2789155.7h-index: 9

Predicted impact top 63% in CV · last 90 daysOriginality Incremental advance

AI Analysis

For computer vision practitioners, Poppy provides a training-free method to improve normal estimation on challenging surfaces using polarization, without retraining.

Monocular normal estimators fail on reflective, textureless, and dark surfaces. Poppy uses single-shot polarization to refine normals from any frozen RGB backbone at test time, reducing mean angular error by 23-26% on synthetic and 6-16% on real data.

Monocular surface normal estimators trained on large-scale RGB-normal data often perform poorly in the edge cases of reflective, textureless, and dark surfaces. Polarization encodes surface orientation independently of texture and albedo, offering a physics-based complement for these cases. Existing polarization methods, however, require multi-view capture or specialized training data, limiting generalization. We introduce Poppy, a training-free framework that refines normals from any frozen RGB backbone using single-shot polarization measurements at test time. Keeping backbone weights frozen, Poppy optimizes per-pixel offsets to the input RGB and output normal along with a learned reflectance decomposition. A differentiable rendering layer converts the refined normals into polarization predictions and penalizes mismatches with the observed signal. Across seven benchmarks and three backbone architectures (diffusion, flow, and feed-forward), Poppy reduces mean angular error by 23-26% on synthetic data and 6-16% on real data. These results show that guiding learned RGB-based normal estimators with polarization cues at test time refines normals on challenging surfaces without retraining.

View on arXiv PDF

Similar