CV | AI | LG | RO | Mar 23, 2025

SG-Tailor: Inter-Object Commonsense Relationship Reasoning for Scene Graph Manipulation

arXiv:2503.18988v1 | 2 citations | h-index: 18 | Has Code
Originality: Highly original
AI Analysis

This paper addresses scene graph manipulation for content generation and robotic manipulation, a task the authors describe as previously untouched.

The paper tackles the manipulation of scene graphs by adding nodes or modifying edges, which is computationally intractable because even a single edge modification can trigger conflicts through the graph's interdependencies. It introduces SG-Tailor, an autoregressive model that predicts conflict-free relationships between nodes and, according to the authors' experiments, outperforms competing methods by a large margin.

Scene graphs capture complex relationships among objects, serving as strong priors for content generation and manipulation. Yet reasonably manipulating scene graphs, whether by adding nodes or modifying edges, remains a challenging and untouched task. Tasks such as adding a node to the graph or reasoning about a node's relationships with all others are computationally intractable, as even a single edge modification can trigger conflicts due to the intricate interdependencies within the graph. To address these challenges, we introduce SG-Tailor, an autoregressive model that predicts the conflict-free relationship between any two nodes. SG-Tailor not only infers inter-object relationships, including generating commonsense edges for newly added nodes, but also resolves conflicts arising from edge modifications to produce coherent, manipulated graphs for downstream tasks. For node addition, the model queries the target node and other nodes from the graph to predict the appropriate relationships. For edge modification, SG-Tailor employs a Cut-And-Stitch strategy to resolve the conflicts and globally adjust the graph. Extensive experiments demonstrate that SG-Tailor outperforms competing methods by a large margin and can be seamlessly integrated as a plug-in module for scene generation and robotic manipulation tasks.
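
The node-addition step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the triple representation, the `add_node` function, and the rule-based `toy_predict` stand-in for the learned model are all hypothetical names introduced here. The only idea taken from the abstract is that the new node is queried against each existing node, and each prediction is conditioned on the graph built so far.

```python
from typing import Callable, List, Optional, Tuple

# A scene graph as a list of (subject, predicate, object) triples (an assumed encoding).
Triple = Tuple[str, str, str]
Predictor = Callable[[List[Triple], str, str], Optional[str]]


def add_node(graph: List[Triple], new_node: str, predict: Predictor) -> List[Triple]:
    """Query the new node against every existing node, one pair at a time.

    Each call to `predict` sees the edges added so far, which loosely mimics
    autoregressive conditioning. `predict` is a hypothetical stand-in for a
    learned relationship model.
    """
    nodes = sorted({n for s, _, o in graph for n in (s, o)})
    extended = list(graph)  # leave the input graph untouched
    for other in nodes:
        relation = predict(extended, new_node, other)
        if relation is not None:  # None means "no edge between this pair"
            extended.append((new_node, relation, other))
    return extended


def toy_predict(graph: List[Triple], subj: str, obj: str) -> Optional[str]:
    """Hypothetical hand-written rule, used only to make the sketch runnable."""
    if subj == "lamp" and obj == "table":
        return "standing on"
    return None


scene = [("chair", "left of", "table")]
print(add_node(scene, "lamp", toy_predict))
```

In a real system the predictor would be a trained model rather than a hand-written rule. The sketch shows only the querying loop, not the Cut-And-Stitch strategy, whose details are not given in the abstract.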

Code Implementations: 1 repo
