CVAILGRODec 13, 2024

GaussianAD: Gaussian-Centric End-to-End Autonomous Driving

arXiv:2412.10371v130 citationsh-index: 22Has Code
Originality Incremental advance
AI Analysis

This work addresses the efficiency and representation challenges in autonomous driving systems, offering a novel method that is incremental in improving scene modeling for real-world applications.

The paper tackles the trade-off between comprehensiveness and efficiency in vision-based autonomous driving by proposing a Gaussian-centric end-to-end framework that uses 3D semantic Gaussians for scene representation, achieving verified effectiveness on tasks like motion planning and occupancy prediction on the nuScenes dataset.

Vision-based autonomous driving shows great potential due to its satisfactory performance and low costs. Most existing methods adopt dense representations (e.g., bird's eye view) or sparse representations (e.g., instance boxes) for decision-making, which suffer from the trade-off between comprehensiveness and efficiency. This paper explores a Gaussian-centric end-to-end autonomous driving (GaussianAD) framework and exploits 3D semantic Gaussians to extensively yet sparsely describe the scene. We initialize the scene with uniform 3D Gaussians and use surrounding-view images to progressively refine them to obtain the 3D Gaussian scene representation. We then use sparse convolutions to efficiently perform 3D perception (e.g., 3D detection, semantic map construction). We predict 3D flows for the Gaussians with dynamic semantics and plan the ego trajectory accordingly with an objective of future scene forecasting. Our GaussianAD can be trained in an end-to-end manner with optional perception labels when available. Extensive experiments on the widely used nuScenes dataset verify the effectiveness of our end-to-end GaussianAD on various tasks including motion planning, 3D occupancy prediction, and 4D occupancy forecasting. Code: https://github.com/wzzheng/GaussianAD.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes