CVMar 23, 2023

Semantic Ray: Learning a Generalizable Semantic Field with Cross-Reprojection Attention

arXiv:2303.13014v146 citationsh-index: 57Has Code
Originality Incremental advance
AI Analysis

This addresses the need for efficient and accurate semantic understanding in neural radiance fields, which is incremental over prior work like Semantic-NeRF.

The paper tackles the problem of learning a generalizable semantic radiance field from multiple scenes, proposing Semantic Ray (S-Ray) to exploit semantic information from multi-view reprojections, and it shows strong generalization to unseen scenes.

In this paper, we aim to learn a semantic radiance field from multiple scenes that is accurate, efficient and generalizable. While most existing NeRFs target at the tasks of neural scene rendering, image synthesis and multi-view reconstruction, there are a few attempts such as Semantic-NeRF that explore to learn high-level semantic understanding with the NeRF structure. However, Semantic-NeRF simultaneously learns color and semantic label from a single ray with multiple heads, where the single ray fails to provide rich semantic information. As a result, Semantic NeRF relies on positional encoding and needs to train one specific model for each scene. To address this, we propose Semantic Ray (S-Ray) to fully exploit semantic information along the ray direction from its multi-view reprojections. As directly performing dense attention over multi-view reprojected rays would suffer from heavy computational cost, we design a Cross-Reprojection Attention module with consecutive intra-view radial and cross-view sparse attentions, which decomposes contextual information along reprojected rays and cross multiple views and then collects dense connections by stacking the modules. Experiments show that our S-Ray is able to learn from multiple scenes, and it presents strong generalization ability to adapt to unseen scenes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes