CVDec 11, 2023

TULIP: Transformer for Upsampling of LiDAR Point Clouds

arXiv:2312.06733v424 citationsh-index: 7CVPR
Originality Incremental advance
AI Analysis

This addresses a key challenge in perception for robots and autonomous vehicles, though it is incremental as it builds on existing range image-based approaches.

The paper tackles the problem of upsampling sparse LiDAR point clouds for robot and autonomous vehicle perception by proposing TULIP, a Swin-Transformer-based method that modifies patch and window geometries to better handle range images, resulting in outperforming state-of-the-art methods on three datasets.

LiDAR Upsampling is a challenging task for the perception systems of robots and autonomous vehicles, due to the sparse and irregular structure of large-scale scene contexts. Recent works propose to solve this problem by converting LiDAR data from 3D Euclidean space into an image super-resolution problem in 2D image space. Although their methods can generate high-resolution range images with fine-grained details, the resulting 3D point clouds often blur out details and predict invalid points. In this paper, we propose TULIP, a new method to reconstruct high-resolution LiDAR point clouds from low-resolution LiDAR input. We also follow a range image-based approach but specifically modify the patch and window geometries of a Swin-Transformer-based network to better fit the characteristics of range images. We conducted several experiments on three public real-world and simulated datasets. TULIP outperforms state-of-the-art methods in all relevant metrics and generates robust and more realistic point clouds than prior works.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes