CVNov 20, 2025

Click2Graph: Interactive Panoptic Video Scene Graphs from a Single Click

arXiv:2511.15948v1
Originality Incremental advance
AI Analysis

This enables controllable and interpretable video scene understanding for applications requiring structured visual analysis, though it is incremental in combining existing prompting and reasoning techniques.

The paper tackles the problem of generating panoptic video scene graphs without human guidance by introducing Click2Graph, an interactive framework that uses a single user cue to segment, track, and predict relational triplets, establishing a strong foundation on the OpenPVSG benchmark.

State-of-the-art Video Scene Graph Generation (VSGG) systems provide structured visual understanding but operate as closed, feed-forward pipelines with no ability to incorporate human guidance. In contrast, promptable segmentation models such as SAM2 enable precise user interaction but lack semantic or relational reasoning. We introduce Click2Graph, the first interactive framework for Panoptic Video Scene Graph Generation (PVSG) that unifies visual prompting with spatial, temporal, and semantic understanding. From a single user cue, such as a click or bounding box, Click2Graph segments and tracks the subject across time, autonomously discovers interacting objects, and predicts <subject, object, predicate> triplets to form a temporally consistent scene graph. Our framework introduces two key components: a Dynamic Interaction Discovery Module that generates subject-conditioned object prompts, and a Semantic Classification Head that performs joint entity and predicate reasoning. Experiments on the OpenPVSG benchmark demonstrate that Click2Graph establishes a strong foundation for user-guided PVSG, showing how human prompting can be combined with panoptic grounding and relational inference to enable controllable and interpretable video scene understanding.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes