CVAILGRODec 11, 2024

GPD-1: Generative Pre-training for Driving

arXiv:2412.08643v11 citationsh-index: 22Has Code
Originality Incremental advance
AI Analysis

This addresses the need for integrated scene modeling in autonomous driving, though it is incremental as it builds on existing transformer and tokenization methods.

The paper tackles the problem of modeling driving scenario evolutions for autonomous driving by proposing GPD-1, a unified generative pre-training model that accomplishes multiple tasks like scene generation and motion planning without fine-tuning, achieving successful generalization across various tasks on the nuPlan dataset.

Modeling the evolutions of driving scenarios is important for the evaluation and decision-making of autonomous driving systems. Most existing methods focus on one aspect of scene evolution such as map generation, motion prediction, and trajectory planning. In this paper, we propose a unified Generative Pre-training for Driving (GPD-1) model to accomplish all these tasks altogether without additional fine-tuning. We represent each scene with ego, agent, and map tokens and formulate autonomous driving as a unified token generation problem. We adopt the autoregressive transformer architecture and use a scene-level attention mask to enable intra-scene bi-directional interactions. For the ego and agent tokens, we propose a hierarchical positional tokenizer to effectively encode both 2D positions and headings. For the map tokens, we train a map vector-quantized autoencoder to efficiently compress ego-centric semantic maps into discrete tokens. We pre-train our GPD-1 on the large-scale nuPlan dataset and conduct extensive experiments to evaluate its effectiveness. With different prompts, our GPD-1 successfully generalizes to various tasks without finetuning, including scene generation, traffic simulation, closed-loop simulation, map prediction, and motion planning. Code: https://github.com/wzzheng/GPD.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes