ROCVSep 28, 2023

MEM: Multi-Modal Elevation Mapping for Robotics and Learning

arXiv:2309.16818v132 citationsh-index: 21Has Code
Originality Incremental advance
AI Analysis

This work addresses the need for richer environmental representations in robotics, though it is incremental as it builds on existing mapping frameworks.

The authors tackled the limitation of geometric-only elevation maps by extending a 2.5D robot-centric framework to fuse multi-modal data from point clouds or images, enabling real-time applications like line detection, human detection, and colorization on various robots.

Elevation maps are commonly used to represent the environment of mobile robots and are instrumental for locomotion and navigation tasks. However, pure geometric information is insufficient for many field applications that require appearance or semantic information, which limits their applicability to other platforms or domains. In this work, we extend a 2.5D robot-centric elevation mapping framework by fusing multi-modal information from multiple sources into a popular map representation. The framework allows inputting data contained in point clouds or images in a unified manner. To manage the different nature of the data, we also present a set of fusion algorithms that can be selected based on the information type and user requirements. Our system is designed to run on the GPU, making it real-time capable for various robotic and learning tasks. We demonstrate the capabilities of our framework by deploying it on multiple robots with varying sensor configurations and showcasing a range of applications that utilize multi-modal layers, including line detection, human detection, and colorization.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes