CVFeb 7, 2025

GaussRender: Learning 3D Occupancy with Gaussian Rendering

arXiv:2502.05040v316 citationsh-index: 29Has Code
Originality Incremental advance
AI Analysis

This addresses geometric fidelity issues in 3D scene understanding for autonomous driving, representing an incremental improvement over existing methods.

The paper tackles the problem of visual inconsistencies in 3D occupancy prediction for autonomous driving by proposing GaussRender, a module that enforces projective consistency through differentiable rendering with Gaussian splatting, achieving state-of-the-art results on benchmarks like SurroundOcc-nuScenes with significant improvements in surface-sensitive metrics such as RayIoU.

Understanding the 3D geometry and semantics of driving scenes is critical for safe autonomous driving. Recent advances in 3D occupancy prediction have improved scene representation but often suffer from visual inconsistencies, leading to floating artifacts and poor surface localization. Existing voxel-wise losses (e.g., cross-entropy) fail to enforce visible geometric coherence. In this paper, we propose GaussRender, a module that improves 3D occupancy learning by enforcing projective consistency. Our key idea is to project both predicted and ground-truth 3D occupancy into 2D camera views, where we apply supervision. Our method penalizes 3D configurations that produce inconsistent 2D projections, thereby enforcing a more coherent 3D structure. To achieve this efficiently, we leverage differentiable rendering with Gaussian splatting. GaussRender seamlessly integrates with existing architectures while maintaining efficiency and requiring no inference-time modifications. Extensive evaluations on multiple benchmarks (SurroundOcc-nuScenes, Occ3D-nuScenes, SSCBench-KITTI360) demonstrate that GaussRender significantly improves geometric fidelity across various 3D occupancy models (TPVFormer, SurroundOcc, Symphonies), achieving state-of-the-art results, particularly on surface-sensitive metrics such as RayIoU. The code is open-sourced at https://github.com/valeoai/GaussRender.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes