CVMar 20

High-fidelity Multi-view Normal Integration with Scale-encoded Neural Surface Representation

arXiv:2603.2033753.5h-index: 6
AI Analysis

This addresses surface reconstruction issues in computer vision for applications like 3D modeling, but it is incremental as it builds on prior multi-view normal integration techniques.

The paper tackles the problem of multi-view normal inconsistency causing blurring in surface reconstruction by proposing a scale-encoded neural surface representation that incorporates pixel coverage area, resulting in high-fidelity reconstruction that outperforms existing methods.

Previous multi-view normal integration methods typically sample a single ray per pixel, without considering the spatial area covered by each pixel, which varies with camera intrinsics and the camera-to-object distance. Consequently, when the target object is captured at different distances, the normals at corresponding pixels may differ across views. This multi-view surface normal inconsistency results in the blurring of high-frequency details in the reconstructed surface. To address this issue, we propose a scale-encoded neural surface representation that incorporates the pixel coverage area into the neural representation. By associating each 3D point with a spatial scale and calculating its normal from a hybrid grid-based encoding, our method effectively represents multi-scale surface normals captured at varying distances. Furthermore, to enable scale-aware surface reconstruction, we introduce a mesh extraction module that assigns an optimal local scale to each vertex based on the training observations. Experimental results demonstrate that our approach consistently yields high-fidelity surface reconstruction from normals observed at varying distances, outperforming existing multi-view normal integration methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes