CVJul 26, 2025

FROSS: Faster-than-Real-Time Online 3D Semantic Scene Graph Generation from RGB-D Images

arXiv:2507.19993v25 citationsh-index: 5Has Code
Originality Incremental advance
AI Analysis

This addresses the need for efficient, real-time scene understanding in open-world applications, representing an incremental improvement over existing methods.

The paper tackles the problem of generating 3D semantic scene graphs from RGB-D images, which is computationally intensive and non-incremental, by proposing FROSS, a method that achieves faster-than-real-time performance with superior results on benchmarks like ReplicaSSG and 3DSSG.

The ability to abstract complex 3D environments into simplified and structured representations is crucial across various domains. 3D semantic scene graphs (SSGs) achieve this by representing objects as nodes and their interrelationships as edges, facilitating high-level scene understanding. Existing methods for 3D SSG generation, however, face significant challenges, including high computational demands and non-incremental processing that hinder their suitability for real-time open-world applications. To address this issue, we propose FROSS (Faster-than-Real-Time Online 3D Semantic Scene Graph Generation), an innovative approach for online and faster-than-real-time 3D SSG generation that leverages the direct lifting of 2D scene graphs to 3D space and represents objects as 3D Gaussian distributions. This framework eliminates the dependency on precise and computationally-intensive point cloud processing. Furthermore, we extend the Replica dataset with inter-object relationship annotations, creating the ReplicaSSG dataset for comprehensive evaluation of FROSS. The experimental results from evaluations on ReplicaSSG and 3DSSG datasets show that FROSS can achieve superior performance while operating significantly faster than prior 3D SSG generation methods. Our implementation and dataset are publicly available at https://github.com/Howardkhh/FROSS.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes