CVLGMLJul 31, 2018

A recurrent multi-scale approach to RBG-D Object Recognition

arXiv:1808.01357v3
Originality Incremental advance
AI Analysis

This work addresses object recognition for robotics using RGB-D cameras, representing an incremental improvement over existing methods.

The authors tackled RGB-D object recognition by developing RCFusion, an end-to-end architecture that fuses RGB and depth information at multiple abstraction levels, achieving state-of-the-art results on the RGB-D Object Dataset and JHUIT-50 datasets.

Technological development aims to produce generations of increasingly efficient robots able to perform complex tasks. This requires considerable efforts, from the scientific community, to find new algorithms that solve computer vision problems, such as object recognition. The diffusion of RGB-D cameras directed the study towards the research of new architectures able to exploit the RGB and Depth information. The project that is developed in this thesis concerns the realization of a new end-to-end architecture for the recognition of RGB-D objects called RCFusion. Our method generates compact and highly discriminative multi-modal features by combining complementary RGB and depth information representing different levels of abstraction. We evaluate our method on standard object recognition datasets, RGB-D Object Dataset and JHUIT-50. The experiments performed show that our method outperforms the existing approaches and establishes new state-of-the-art results for both datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes