CVAIJul 22, 2024

Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation

arXiv:2407.15396v211 citationsh-index: 9
Originality Incremental advance
AI Analysis

This addresses a specific bias issue in scene graph generation for computer vision applications, representing an incremental improvement.

The paper tackles the problem of biased predictions in scene graph generation due to overlooking semantic diversity in predicates, and proposes a model-agnostic framework that improves performance on existing models.

The scene graph generation (SGG) task involves detecting objects within an image and predicting predicates that represent the relationships between the objects. However, in SGG benchmark datasets, each subject-object pair is annotated with a single predicate even though a single predicate may exhibit diverse semantics (i.e., semantic diversity), existing SGG models are trained to predict the one and only predicate for each pair. This in turn results in the SGG models to overlook the semantic diversity that may exist in a predicate, thus leading to biased predictions. In this paper, we propose a novel model-agnostic Semantic Diversity-aware Prototype-based Learning (DPL) framework that enables unbiased predictions based on the understanding of the semantic diversity of predicates. Specifically, DPL learns the regions in the semantic space covered by each predicate to distinguish among the various different semantics that a single predicate can represent. Extensive experiments demonstrate that our proposed model-agnostic DPL framework brings significant performance improvement on existing SGG models, and also effectively understands the semantic diversity of predicates.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes