CVFeb 20, 2025

Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance

arXiv:2502.14520v24 citationsh-index: 9
AI Analysis

This addresses the need for accurate and reliable scene perception in autonomous driving by improving temporal consistency, though it is incremental as it builds on existing SSC methods with novel components.

The paper tackles the problem of 3D Semantic Scene Completion (SSC) in autonomous driving by proposing FlowScene, a method that uses optical flow guidance to integrate motion and contextual cues, achieving state-of-the-art performance on benchmarks like SemanticKITTI and SSCBench-KITTI-360.

3D Semantic Scene Completion (SSC) provides comprehensive scene geometry and semantics for autonomous driving perception, which is crucial for enabling accurate and reliable decision-making. However, existing SSC methods are limited to capturing sparse information from the current frame or naively stacking multi-frame temporal features, thereby failing to acquire effective scene context. These approaches ignore critical motion dynamics and struggle to achieve temporal consistency. To address the above challenges, we propose a novel temporal SSC method FlowScene: Learning Temporal 3D Semantic Scene Completion via Optical Flow Guidance. By leveraging optical flow, FlowScene can integrate motion, different viewpoints, occlusions, and other contextual cues, thereby significantly improving the accuracy of 3D scene completion. Specifically, our framework introduces two key components: (1) a Flow-Guided Temporal Aggregation module that aligns and aggregates temporal features using optical flow, capturing motion-aware context and deformable structures; and (2) an Occlusion-Guided Voxel Refinement module that injects occlusion masks and temporally aggregated features into 3D voxel space, adaptively refining voxel representations for explicit geometric modeling. Experimental results demonstrate that FlowScene achieves state-of-the-art performance on the SemanticKITTI and SSCBench-KITTI-360 benchmarks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes