CVMay 30, 2023

ReTR: Modeling Rendering Via Transformer for Generalizable Neural Surface Reconstruction

arXiv:2305.18832v232 citationsHas Code
Originality Highly original
AI Analysis

This addresses limitations in neural surface reconstruction for 3D vision applications, representing a novel method for a known bottleneck.

The paper tackles the problem of inaccurate surface reconstruction in generalizable neural methods by redesigning the rendering process with a transformer framework, resulting in improved reconstruction quality and generalization ability that outperforms state-of-the-art approaches.

Generalizable neural surface reconstruction techniques have attracted great attention in recent years. However, they encounter limitations of low confidence depth distribution and inaccurate surface reasoning due to the oversimplified volume rendering process employed. In this paper, we present Reconstruction TRansformer (ReTR), a novel framework that leverages the transformer architecture to redesign the rendering process, enabling complex render interaction modeling. It introduces a learnable $\textit{meta-ray token}$ and utilizes the cross-attention mechanism to simulate the interaction of rendering process with sampled points and render the observed color. Meanwhile, by operating within a high-dimensional feature space rather than the color space, ReTR mitigates sensitivity to projected colors in source views. Such improvements result in accurate surface assessment with high confidence. We demonstrate the effectiveness of our approach on various datasets, showcasing how our method outperforms the current state-of-the-art approaches in terms of reconstruction quality and generalization ability. $\textit{Our code is available at }$ https://github.com/YixunLiang/ReTR.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes