CVDec 22, 2017

The ParallelEye Dataset: Constructing Large-Scale Artificial Scenes for Traffic Vision Research

arXiv:1712.08394v113 citations
Originality Synthesis-oriented
AI Analysis

This provides a scalable solution for researchers in traffic vision to generate diverse and accurately labeled datasets, though it is incremental as it builds on existing virtual dataset methods.

The authors tackled the problem of time-consuming and error-prone manual collection of traffic vision datasets by constructing large-scale artificial scenes using virtual reality and computer graphics, resulting in a photorealistic virtual dataset called ParallelEye with low modeling time and high accuracy labeling.

Video image datasets are playing an essential role in design and evaluation of traffic vision algorithms. Nevertheless, a longstanding inconvenience concerning image datasets is that manually collecting and annotating large-scale diversified datasets from real scenes is time-consuming and prone to error. For that virtual datasets have begun to function as a proxy of real datasets. In this paper, we propose to construct large-scale artificial scenes for traffic vision research and generate a new virtual dataset called "ParallelEye". First of all, the street map data is used to build 3D scene model of Zhongguancun Area, Beijing. Then, the computer graphics, virtual reality, and rule modeling technologies are utilized to synthesize large-scale, realistic virtual urban traffic scenes, in which the fidelity and geography match the real world well. Furthermore, the Unity3D platform is used to render the artificial scenes and generate accurate ground-truth labels, e.g., semantic/instance segmentation, object bounding box, object tracking, optical flow, and depth. The environmental conditions in artificial scenes can be controlled completely. As a result, we present a viable implementation pipeline for constructing large-scale artificial scenes for traffic vision research. The experimental results demonstrate that this pipeline is able to generate photorealistic virtual datasets with low modeling time and high accuracy labeling.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes