CVFeb 13, 2022

Hierarchical Point Cloud Encoding and Decoding with Lightweight Self-Attention based Model

arXiv:2202.06407v1
Originality Incremental advance
AI Analysis

This addresses the need for efficient point cloud processing in applications like classification and reconstruction, though it appears incremental as it builds on hierarchical pipelines with self-attention.

The paper tackles the problem of representation learning for point cloud data by proposing SA-CNN, a hierarchical and lightweight self-attention based encoding and decoding architecture, achieving state-of-the-art or comparable performance in benchmarks while reducing model complexity by several orders of magnitude.

In this paper we present SA-CNN, a hierarchical and lightweight self-attention based encoding and decoding architecture for representation learning of point cloud data. The proposed SA-CNN introduces convolution and transposed convolution stacks to capture and generate contextual information among unordered 3D points. Following conventional hierarchical pipeline, the encoding process extracts feature in local-to-global manner, while the decoding process generates feature and point cloud in coarse-to-fine, multi-resolution stages. We demonstrate that SA-CNN is capable of a wide range of applications, namely classification, part segmentation, reconstruction, shape retrieval, and unsupervised classification. While achieving the state-of-the-art or comparable performance in the benchmarks, SA-CNN maintains its model complexity several order of magnitude lower than the others. In term of qualitative results, we visualize the multi-stage point cloud reconstructions and latent walks on rigid objects as well as deformable non-rigid human and robot models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes