CVDec 11, 2023

TULIP: Transformer for Upsampling of LiDAR Point Clouds

Bin Yang, Patrick Pfreundschuh, Roland Siegwart, Marco Hutter, Peyman Moghadam, Vaishakh Patil

arXiv:2312.06733v411.624 citationsh-index: 7Has CodeCVPR

Originality Incremental advance

AI Analysis

This addresses a key challenge in perception for robots and autonomous vehicles, though it is incremental as it builds on existing range image-based approaches.

The paper tackles the problem of upsampling sparse LiDAR point clouds for robot and autonomous vehicle perception by proposing TULIP, a Swin-Transformer-based method that modifies patch and window geometries to better handle range images, resulting in outperforming state-of-the-art methods on three datasets.

LiDAR Upsampling is a challenging task for the perception systems of robots and autonomous vehicles, due to the sparse and irregular structure of large-scale scene contexts. Recent works propose to solve this problem by converting LiDAR data from 3D Euclidean space into an image super-resolution problem in 2D image space. Although their methods can generate high-resolution range images with fine-grained details, the resulting 3D point clouds often blur out details and predict invalid points. In this paper, we propose TULIP, a new method to reconstruct high-resolution LiDAR point clouds from low-resolution LiDAR input. We also follow a range image-based approach but specifically modify the patch and window geometries of a Swin-Transformer-based network to better fit the characteristics of range images. We conducted several experiments on three public real-world and simulated datasets. TULIP outperforms state-of-the-art methods in all relevant metrics and generates robust and more realistic point clouds than prior works.

View on arXiv PDF Code

Similar