CVLGApr 5, 2022

SE(3)-Equivariant Attention Networks for Shape Reconstruction in Function Space

arXiv:2204.02394v240 citationsh-index: 77
Originality Highly original
AI Analysis

This addresses the problem of scalable and accurate 3D reconstruction for computer vision and robotics, with incremental improvements in equivariance and local modeling.

The paper tackles 3D shape reconstruction from unoriented point clouds by proposing an SE(3)-equivariant attention network that operates directly on irregular point clouds, outperforming previous methods and enabling reconstruction of novel scenes with multiple objects in random poses without performance loss.

We propose a method for 3D shape reconstruction from unoriented point clouds. Our method consists of a novel SE(3)-equivariant coordinate-based network (TF-ONet), that parametrizes the occupancy field of the shape and respects the inherent symmetries of the problem. In contrast to previous shape reconstruction methods that align the input to a regular grid, we operate directly on the irregular point cloud. Our architecture leverages equivariant attention layers that operate on local tokens. This mechanism enables local shape modelling, a crucial property for scalability to large scenes. Given an unoriented, sparse, noisy point cloud as input, we produce equivariant features for each point. These serve as keys and values for the subsequent equivariant cross-attention blocks that parametrize the occupancy field. By querying an arbitrary point in space, we predict its occupancy score. We show that our method outperforms previous SO(3)-equivariant methods, as well as non-equivariant methods trained on SO(3)-augmented datasets. More importantly, local modelling together with SE(3)-equivariance create an ideal setting for SE(3) scene reconstruction. We show that by training only on single, aligned objects and without any pre-segmentation, we can reconstruct novel scenes containing arbitrarily many objects in random poses without any performance loss.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes