CVApr 4

FunFact: Building Probabilistic Functional 3D Scene Graphs via Factor-Graph Reasoning

arXiv:2604.0369619.01 citationsh-index: 16
Predicted impact top 45% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For 3D scene understanding researchers, it provides a holistic probabilistic framework that outperforms pairwise methods on functional relation inference.

FunFact constructs probabilistic open-vocabulary functional 3D scene graphs from RGB-D images using factor-graph reasoning, improving relation discovery recall and reducing calibration error for ambiguous relations compared to prior methods.

Recent work in 3D scene understanding is moving beyond purely spatial analysis toward functional scene understanding. However, existing methods often consider functional relationships between object pairs in isolation, failing to capture the scene-wide interdependence that humans use to resolve ambiguity. We introduce FunFact, a framework for constructing probabilistic open-vocabulary functional 3D scene graphs from posed RGB-D images. FunFact first builds an object- and part-centric 3D map and uses foundation models to propose semantically plausible functional relations. These candidates are converted into factor graph variables and constrained by both LLM-derived common-sense priors and geometric priors. This formulation enables joint probabilistic inference over all functional edges and their marginals, yielding substantially better calibrated confidence scores. To benchmark this setting, we introduce FunThor, a synthetic dataset based on AI2-THOR with part-level geometry and rule-based functional annotations. Experiments on SceneFun3D, FunGraph3D, and FunThor show that FunFact improves node and relation discovery recall and significantly reduces calibration error for ambiguous relations, highlighting the benefits of holistic probabilistic modeling for functional scene understanding. See our project page at https://funfact-scenegraph.github.io/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes