CVAIROOct 21, 2024

WildOcc: A Benchmark for Off-Road 3D Semantic Occupancy Prediction

arXiv:2410.15792v29 citationsh-index: 11
Originality Synthesis-oriented
AI Analysis

This addresses a gap in autonomous driving research for off-road environments, providing a new benchmark and method, but it is incremental as it extends existing on-road techniques to off-road settings.

The paper tackles the lack of datasets and benchmarks for 3D semantic occupancy prediction in off-road environments by introducing WildOcc, the first benchmark with dense occupancy annotations, and proposes a multi-modal framework that fuses spatio-temporal information and uses cross-modality distillation, achieving improved reconstruction results as indicated by the method.

3D semantic occupancy prediction is an essential part of autonomous driving, focusing on capturing the geometric details of scenes. Off-road environments are rich in geometric information, therefore it is suitable for 3D semantic occupancy prediction tasks to reconstruct such scenes. However, most of researches concentrate on on-road environments, and few methods are designed for off-road 3D semantic occupancy prediction due to the lack of relevant datasets and benchmarks. In response to this gap, we introduce WildOcc, to our knowledge, the first benchmark to provide dense occupancy annotations for off-road 3D semantic occupancy prediction tasks. A ground truth generation pipeline is proposed in this paper, which employs a coarse-to-fine reconstruction to achieve a more realistic result. Moreover, we introduce a multi-modal 3D semantic occupancy prediction framework, which fuses spatio-temporal information from multi-frame images and point clouds at voxel level. In addition, a cross-modality distillation function is introduced, which transfers geometric knowledge from point clouds to image features.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes