CVMar 17, 2022

Biasing Like Human: A Cognitive Bias Framework for Scene Graph Generation

arXiv:2203.09160v12 citationsh-index: 70
Originality Highly original
AI Analysis

This work addresses the challenge of generating more informative and accurate scene graphs for computer vision applications, representing an incremental improvement by integrating cognitive biases into existing models.

The paper tackles the problem of scene graph generation being biased towards frequent relations due to ambiguous visual features and dataset annotations, by proposing a cognitive bias framework that incorporates linguistic knowledge to guide vision-based representations, achieving new state-of-the-art performance on the Visual Genome dataset with minimal parameter increase.

Scene graph generation is a sophisticated task because there is no specific recognition pattern (e.g., "looking at" and "near" have no conspicuous difference concerning vision, whereas "near" could occur between entities with different morphology). Thus some scene graph generation methods are trapped into most frequent relation predictions caused by capricious visual features and trivial dataset annotations. Therefore, recent works emphasized the "unbiased" approaches to balance predictions for a more informative scene graph. However, human's quick and accurate judgments over relations between numerous objects should be attributed to "bias" (i.e., experience and linguistic knowledge) rather than pure vision. To enhance the model capability, inspired by the "cognitive bias" mechanism, we propose a novel 3-paradigms framework that simulates how humans incorporate the label linguistic features as guidance of vision-based representations to better mine hidden relation patterns and alleviate noisy visual propagation. Our framework is model-agnostic to any scene graph model. Comprehensive experiments prove our framework outperforms baseline modules in several metrics with minimum parameters increment and achieves new SOTA performance on Visual Genome dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes