ROMay 30

DriveAnchor: Progressive Anchor-based Flow Learning for Autonomous Driving Planning

Limin Yan, Haoyun Tang, Yutao Qiu, Hongqing Liu, Haoyu Xu

arXiv:2606.0051940.5

AI Analysis

For autonomous driving planning, DriveAnchor provides a practical, composable pipeline that enhances safety and controllability while maintaining real-time performance, validated through real-world vehicle testing.

DriveAnchor introduces a three-stage planning framework for autonomous driving that replaces unstructured Gaussian priors with a vocabulary of 2,398 trajectory shapes, achieving an 89% reduction in near-range collision rates and a 32% improvement in mean reward without sacrificing imitation accuracy, with 2.06 ms inference on NVIDIA Drive Orin.

We present DriveAnchor, a three-stage framework for autonomous driving planning that achieves behavioral diversity, controllability, and safety in a composable pipeline. Demonstration Flow Pretraining replaces the unstructured Gaussian prior with a vocabulary of 2,398 trajectory shapes constructed by farthest-point sampling, structurally grounding behavioral diversity in vocabulary coverage. Guided Flow Post-training jointly post-trains an Energy Field module with flow matching (FM), conditioning the Energy Field on static road geometry alone, to relocate anchors toward user-specified corridor polygons before flow generation, adding controllability without differentiable guidance; after Stage 2, new corridor presets require only Energy Field updates, not FM retraining. Reward-Refined Flow Fine-tuning applies zeroth-order reinforcement learning to align each anchor's output with collision-avoidance objectives: because the flow-matching model is a deterministic feedforward network in single-step mode, each anchor uniquely determines the output trajectory, reducing reward optimization to a direction search in anchor space without log-likelihood computation or ODE-to-SDE conversion. Evaluated on approximately 2 million held-out driving scenarios, DriveAnchor reduces near-range collision rates by 89% and improves mean reward by 32% without degradation in imitation accuracy, with 2.06 ms inference on NVIDIA Drive Orin. DriveAnchor has been validated through real-world vehicle testing, confirming its practicality for production deployment.

View on arXiv PDF

Similar