CVAIOct 30, 2019

Multi Modal Semantic Segmentation using Synthetic Data

arXiv:1910.13676v14 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of data scarcity for robotics applications like autonomous driving, though it is incremental as it builds on existing synthetic data methods.

The paper tackles the problem of 3D semantic segmentation without labeled real-world data by generating synthetic scenes using CARLA and training a deep neural network to learn geometric and texture cues, achieving effective classification on real-world datasets.

Semantic understanding of scenes in three-dimensional space (3D) is a quintessential part of robotics oriented applications such as autonomous driving as it provides geometric cues such as size, orientation and true distance of separation to objects which are crucial for taking mission critical decisions. As a first step, in this work we investigate the possibility of semantically classifying different parts of a given scene in 3D by learning the underlying geometric context in addition to the texture cues BUT in the absence of labelled real-world datasets. To this end we generate a large number of synthetic scenes, their pixel-wise labels and corresponding 3D representations using CARLA software framework. We then build a deep neural network that learns underlying category specific 3D representation and texture cues from color information of the rendered synthetic scenes. Further on we apply the learned model on different real world datasets to evaluate its performance. Our preliminary investigation of results show that the neural network is able to learn the geometric context from synthetic scenes and effectively apply this knowledge to classify each point of a 3D representation of a scene in real-world.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes