CVAug 17, 2021

An Evaluation of RGB and LiDAR Fusion for Semantic Segmentation

arXiv:2108.07661v11 citations
Originality Synthesis-oriented
AI Analysis

This addresses the sensor fusion problem for autonomous vehicles, but it is incremental as it builds on existing methods with modest gains.

The paper investigated whether fusing RGB and LiDAR data improves semantic segmentation for autonomous driving, finding that mid-level fusion achieved the highest improvement of 2.7% mIoU over base models.

LiDARs and cameras are the two main sensors that are planned to be included in many announced autonomous vehicles prototypes. Each of the two provides a unique form of data from a different perspective to the surrounding environment. In this paper, we explore and attempt to answer the question: is there an added benefit by fusing those two forms of data for the purpose of semantic segmentation within the context of autonomous driving? We also attempt to show at which level does said fusion prove to be the most useful. We evaluated our algorithms on the publicly available SemanticKITTI dataset. All fusion models show improvements over the base model, with the mid-level fusion showing the highest improvement of 2.7% in terms of mean Intersection over Union (mIoU) metric.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes