CVFeb 3, 2015

Data-Driven Scene Understanding with Adaptively Retrieved Exemplars

arXiv:1502.00749v19 citations
AI Analysis

This work addresses the problem of reducing annotation effort for scene understanding in computer vision, though it is incremental as it builds on existing data-driven methods.

The paper tackles semantic scene understanding without pixelwise annotation by retrieving exemplars from a tagged database and propagating labels via a novel EM-based framework, achieving superior performance in semantic segmentation and image annotation on two public databases.

This article investigates a data-driven approach for semantically scene understanding, without pixelwise annotation and classifier training. Our framework parses a target image with two steps: (i) retrieving its exemplars (i.e. references) from an image database, where all images are unsegmented but annotated with tags; (ii) recovering its pixel labels by propagating semantics from the references. We present a novel framework making the two steps mutually conditional and bootstrapped under the probabilistic Expectation-Maximization (EM) formulation. In the first step, the references are selected by jointly matching their appearances with the target as well as the semantics (i.e. the assigned labels of the target and the references). We process the second step via a combinatorial graphical representation, in which the vertices are superpixels extracted from the target and its selected references. Then we derive the potentials of assigning labels to one vertex of the target, which depend upon the graph edges that connect the vertex to its spatial neighbors of the target and to its similar vertices of the references. Besides, the proposed framework can be naturally applied to perform image annotation on new test images. In the experiments, we validate our approach on two public databases, and demonstrate superior performances over the state-of-the-art methods in both semantic segmentation and image annotation tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes