CVAIGRLGApr 20

Asset Harvester: Extracting 3D Assets from Autonomous Driving Logs for Simulation

arXiv:2604.1846898.1h-index: 23
Predicted impact top 4% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For autonomous vehicle developers, it addresses the bottleneck of generating simulation-ready 3D assets from sparse in-the-wild data, enabling more realistic closed-loop simulation.

Asset Harvester extracts complete 3D object assets from sparse, real-world autonomous driving logs for simulation, enabling scalable conversion of AV object observations into reusable 3D assets.

Closed-loop simulation is a core component of autonomous vehicle (AV) development, enabling scalable testing, training, and safety validation before real-world deployment. Neural scene reconstruction converts driving logs into interactive 3D environments for simulation, but it does not produce complete 3D object assets required for agent manipulation and large-viewpoint novel-view synthesis. To address this challenge, we present Asset Harvester, an image-to-3D model and end-to-end pipeline that converts sparse, in-the-wild object observations from real driving logs into complete, simulation-ready assets. Rather than relying on a single model component, we developed a system-level design for real-world AV data that combines large-scale curation of object-centric training tuples, geometry-aware preprocessing across heterogeneous sensors, and a robust training recipe that couples sparse-view-conditioned multiview generation with 3D Gaussian lifting. Within this system, SparseViewDiT is explicitly designed to address limited-angle views and other real-world data challenges. Together with hybrid data curation, augmentation, and self-distillation, this system enables scalable conversion of sparse AV object observations into reusable 3D assets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes