CVJun 5, 2018

Recurrent Convolutional Fusion for RGB-D Object Recognition

Mohammad Reza Loghmani, Mirco Planamente, Barbara Caputo, Markus Vincze

arXiv:1806.01673v39.635 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the challenge of synergistic RGB-D data fusion for improved object recognition in machine vision, representing an incremental advancement.

The paper tackles the problem of effectively combining RGB and depth data for object recognition by introducing a novel end-to-end architecture called recurrent convolutional fusion (RCFusion), which significantly outperforms state-of-the-art approaches on two popular datasets.

Providing machines with the ability to recognize objects like humans has always been one of the primary goals of machine vision. The introduction of RGB-D cameras has paved the way for a significant leap forward in this direction thanks to the rich information provided by these sensors. However, the machine vision community still lacks an effective method to synergically use the RGB and depth data to improve object recognition. In order to take a step in this direction, we introduce a novel end-to-end architecture for RGB-D object recognition called recurrent convolutional fusion (RCFusion). Our method generates compact and highly discriminative multi-modal features by combining complementary RGB and depth information representing different levels of abstraction. Extensive experiments on two popular datasets, RGB-D Object Dataset and JHUIT-50, show that RCFusion significantly outperforms state-of-the-art approaches in both the object categorization and instance recognition tasks.

View on arXiv PDF Code

Similar