CVJun 12, 2025

QuadricFormer: Scene as Superquadrics for 3D Semantic Occupancy Prediction

arXiv:2506.10977v115 citationsh-index: 22Has Code
Originality Highly original
AI Analysis

This addresses the need for efficient and accurate 3D perception in autonomous driving systems, representing an incremental improvement over existing object-centric methods.

The paper tackles the problem of inefficient 3D occupancy prediction in autonomous driving by proposing QuadricFormer, which uses geometrically expressive superquadrics as scene primitives instead of dense voxels or sparse Gaussians, achieving state-of-the-art performance on the nuScenes dataset with superior efficiency.

3D occupancy prediction is crucial for robust autonomous driving systems as it enables comprehensive perception of environmental structures and semantics. Most existing methods employ dense voxel-based scene representations, ignoring the sparsity of driving scenes and resulting in inefficiency. Recent works explore object-centric representations based on sparse Gaussians, but their ellipsoidal shape prior limits the modeling of diverse structures. In real-world driving scenes, objects exhibit rich geometries (e.g., cuboids, cylinders, and irregular shapes), necessitating excessive ellipsoidal Gaussians densely packed for accurate modeling, which leads to inefficient representations. To address this, we propose to use geometrically expressive superquadrics as scene primitives, enabling efficient representation of complex structures with fewer primitives through their inherent shape diversity. We develop a probabilistic superquadric mixture model, which interprets each superquadric as an occupancy probability distribution with a corresponding geometry prior, and calculates semantics through probabilistic mixture. Building on this, we present QuadricFormer, a superquadric-based model for efficient 3D occupancy prediction, and introduce a pruning-and-splitting module to further enhance modeling efficiency by concentrating superquadrics in occupied regions. Extensive experiments on the nuScenes dataset demonstrate that QuadricFormer achieves state-of-the-art performance while maintaining superior efficiency.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes