CVApr 1, 2019

Scene Graph Generation with External Knowledge and Image Reconstruction

arXiv:1904.00560v1318 citations
Originality Incremental advance
AI Analysis

This addresses reliability issues in scene graph generation for image understanding, though it appears incremental as it builds on existing methods with new regularization techniques.

The paper tackles dataset bias and noisy annotations in scene graph generation by incorporating external commonsense knowledge and an image reconstruction loss, achieving state-of-the-art performance on Visual Relationship Detection and Visual Genome datasets.

Scene graph generation has received growing attention with the advancements in image understanding tasks such as object detection, attributes and relationship prediction,~\etc. However, existing datasets are biased in terms of object and relationship labels, or often come with noisy and missing annotations, which makes the development of a reliable scene graph prediction model very challenging. In this paper, we propose a novel scene graph generation algorithm with external knowledge and image reconstruction loss to overcome these dataset issues. In particular, we extract commonsense knowledge from the external knowledge base to refine object and phrase features for improving generalizability in scene graph generation. To address the bias of noisy object annotations, we introduce an auxiliary image reconstruction path to regularize the scene graph generation network. Extensive experiments show that our framework can generate better scene graphs, achieving the state-of-the-art performance on two benchmark datasets: Visual Relationship Detection and Visual Genome datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes