CVDec 6, 2024

UniScene: Unified Occupancy-centric Driving Scene Generation

arXiv:2412.05435v294 citationsh-index: 11CVPR
Originality Incremental advance
AI Analysis

This work addresses the need for controllable and annotated training data in autonomous driving, offering a novel approach that is incremental by building on existing scene generation methods.

The paper tackles the problem of generating diverse and high-fidelity training data for autonomous driving by introducing UniScene, a unified framework that produces semantic occupancy, video, and LiDAR data, outperforming previous state-of-the-art methods in all three generation tasks.

Generating high-fidelity, controllable, and annotated training data is critical for autonomous driving. Existing methods typically generate a single data form directly from a coarse scene layout, which not only fails to output rich data forms required for diverse downstream tasks but also struggles to model the direct layout-to-data distribution. In this paper, we introduce UniScene, the first unified framework for generating three key data forms - semantic occupancy, video, and LiDAR - in driving scenes. UniScene employs a progressive generation process that decomposes the complex task of scene generation into two hierarchical steps: (a) first generating semantic occupancy from a customized scene layout as a meta scene representation rich in both semantic and geometric information, and then (b) conditioned on occupancy, generating video and LiDAR data, respectively, with two novel transfer strategies of Gaussian-based Joint Rendering and Prior-guided Sparse Modeling. This occupancy-centric approach reduces the generation burden, especially for intricate scenes, while providing detailed intermediate representations for the subsequent generation stages. Extensive experiments demonstrate that UniScene outperforms previous SOTAs in the occupancy, video, and LiDAR generation, which also indeed benefits downstream driving tasks. Project page: https://arlo0o.github.io/uniscene/

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes