CVAIAug 3, 2023

LiDAR-Camera Panoptic Segmentation via Geometry-Consistent and Semantic-Aware Alignment

arXiv:2308.01686v226 citationsh-index: 25Has Code
AI Analysis

This work addresses the challenging problem of sensor fusion for autonomous driving perception, offering a novel method but with incremental gains in a specific domain.

The paper tackles 3D panoptic segmentation by fusing LiDAR and camera data, proposing a three-stage network (LCPS) that improves performance by about 6.9% PQ over LiDAR-only baselines on the NuScenes dataset.

3D panoptic segmentation is a challenging perception task that requires both semantic segmentation and instance segmentation. In this task, we notice that images could provide rich texture, color, and discriminative information, which can complement LiDAR data for evident performance improvement, but their fusion remains a challenging problem. To this end, we propose LCPS, the first LiDAR-Camera Panoptic Segmentation network. In our approach, we conduct LiDAR-Camera fusion in three stages: 1) an Asynchronous Compensation Pixel Alignment (ACPA) module that calibrates the coordinate misalignment caused by asynchronous problems between sensors; 2) a Semantic-Aware Region Alignment (SARA) module that extends the one-to-one point-pixel mapping to one-to-many semantic relations; 3) a Point-to-Voxel feature Propagation (PVP) module that integrates both geometric and semantic fusion information for the entire point cloud. Our fusion strategy improves about 6.9% PQ performance over the LiDAR-only baseline on NuScenes dataset. Extensive quantitative and qualitative experiments further demonstrate the effectiveness of our novel framework. The code will be released at https://github.com/zhangzw12319/lcps.git.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes