ROAICVLGJul 18, 2017

Choosing Smartly: Adaptive Multimodal Fusion for Object Detection in Changing Environments

arXiv:1707.05733v2119 citations
Originality Incremental advance
AI Analysis

This work addresses sensor noise challenges for autonomous robots in changing environments, representing an incremental improvement in multimodal fusion methods.

The paper tackles object detection in dynamic environments by proposing an adaptive multimodal fusion approach that learns to weight sensor predictions online, demonstrating adaptation to harsh lighting changes and severe motion blur in robot experiments.

Object detection is an essential task for autonomous robots operating in dynamic and changing environments. A robot should be able to detect objects in the presence of sensor noise that can be induced by changing lighting conditions for cameras and false depth readings for range sensors, especially RGB-D cameras. To tackle these challenges, we propose a novel adaptive fusion approach for object detection that learns weighting the predictions of different sensor modalities in an online manner. Our approach is based on a mixture of convolutional neural network (CNN) experts and incorporates multiple modalities including appearance, depth and motion. We test our method in extensive robot experiments, in which we detect people in a combined indoor and outdoor scenario from RGB-D data, and we demonstrate that our method can adapt to harsh lighting changes and severe camera motion blur. Furthermore, we present a new RGB-D dataset for people detection in mixed in- and outdoor environments, recorded with a mobile robot. Code, pretrained models and dataset are available at http://adaptivefusion.cs.uni-freiburg.de

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes