RO CVSep 28, 2023

MEM: Multi-Modal Elevation Mapping for Robotics and Learning

Gian Erni, Jonas Frey, Takahiro Miki, Matias Mattamala, Marco Hutter

arXiv:2309.16818v113.232 citationsh-index: 21Has Code

Originality Incremental advance

AI Analysis

This work addresses the need for richer environmental representations in robotics, though it is incremental as it builds on existing mapping frameworks.

The authors tackled the limitation of geometric-only elevation maps by extending a 2.5D robot-centric framework to fuse multi-modal data from point clouds or images, enabling real-time applications like line detection, human detection, and colorization on various robots.

Elevation maps are commonly used to represent the environment of mobile robots and are instrumental for locomotion and navigation tasks. However, pure geometric information is insufficient for many field applications that require appearance or semantic information, which limits their applicability to other platforms or domains. In this work, we extend a 2.5D robot-centric elevation mapping framework by fusing multi-modal information from multiple sources into a popular map representation. The framework allows inputting data contained in point clouds or images in a unified manner. To manage the different nature of the data, we also present a set of fusion algorithms that can be selected based on the information type and user requirements. Our system is designed to run on the GPU, making it real-time capable for various robotic and learning tasks. We demonstrate the capabilities of our framework by deploying it on multiple robots with varying sensor configurations and showcasing a range of applications that utilize multi-modal layers, including line detection, human detection, and colorization.

View on arXiv PDF Code

Similar