CVMay 25, 2021

Review on Indoor RGB-D Semantic Segmentation with Deep Convolutional Neural Networks

arXiv:2105.11925v114 citations
Originality Synthesis-oriented
AI Analysis

It addresses the problem of improving semantic segmentation accuracy in indoor environments by integrating depth data, but it is incremental as it is a review paper.

This paper reviews the field of RGB-D indoor semantic segmentation using deep convolutional neural networks, summarizing datasets, methods, state-of-the-art performance, and future challenges.

Many research works focus on leveraging the complementary geometric information of indoor depth sensors in vision tasks performed by deep convolutional neural networks, notably semantic segmentation. These works deal with a specific vision task known as "RGB-D Indoor Semantic Segmentation". The challenges and resulting solutions of this task differ from its standard RGB counterpart. This results in a new active research topic. The objective of this paper is to introduce the field of Deep Convolutional Neural Networks for RGB-D Indoor Semantic Segmentation. This review presents the most popular public datasets, proposes a categorization of the strategies employed by recent contributions, evaluates the performance of the current state-of-the-art, and discusses the remaining challenges and promising directions for future works.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes