CVAIROJun 23, 2025

OC-SOP: Enhancing Vision-Based 3D Semantic Occupancy Prediction by Object-Centric Awareness

arXiv:2506.18798v23 citationsh-index: 5SMC
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in vision-based 3D perception for autonomous driving, offering incremental improvements.

The paper tackles the problem of inaccurate semantic occupancy prediction for dynamic foreground objects in autonomous driving by proposing OC-SOP, which integrates object-centric cues, achieving state-of-the-art performance on SemanticKITTI.

Autonomous driving perception faces significant challenges due to occlusions and incomplete scene data in the environment. To overcome these issues, the task of semantic occupancy prediction (SOP) is proposed, which aims to jointly infer both the geometry and semantic labels of a scene from images. However, conventional camera-based methods typically treat all categories equally and primarily rely on local features, leading to suboptimal predictions, especially for dynamic foreground objects. To address this, we propose Object-Centric SOP (OC-SOP), a framework that integrates high-level object-centric cues extracted via a detection branch into the semantic occupancy prediction pipeline. This object-centric integration significantly enhances the prediction accuracy for foreground objects and achieves state-of-the-art performance among all categories on SemanticKITTI.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes