IVCVAug 15, 2019

A Multimodal Vision Sensor for Autonomous Driving

arXiv:1908.05649v117 citations
AI Analysis

This addresses the need for robust sensing in autonomous vehicles, but it is incremental as it combines existing camera types with data fusion techniques.

The paper tackles the problem of environment perception for autonomous driving by developing a multimodal vision sensor that integrates stereo, polarization, and panoramic cameras, resulting in capabilities like depth measurement, polarization analysis, and 360-degree capture, with demonstrated applications such as water hazard detection and semantic segmentation.

This paper describes a multimodal vision sensor that integrates three types of cameras, including a stereo camera, a polarization camera and a panoramic camera. Each sensor provides a specific dimension of information: the stereo camera measures depth per pixel, the polarization obtains the degree of polarization, and the panoramic camera captures a 360-degree landscape. Data fusion and advanced environment perception could be built upon the combination of sensors. Designed especially for autonomous driving, this vision sensor is shipped with a robust semantic segmentation network. In addition, we demonstrate how cross-modal enhancement could be achieved by registering the color image and the polarization image. An example of water hazard detection is given. To prove the multimodal vision sensor's compatibility with different devices, a brief runtime performance analysis is carried out.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes