CVJan 22, 2017

Greedy Structure Learning of Hierarchical Compositional Models

arXiv:1701.06171v45 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of building interpretable object models for computer vision, though it is incremental as it builds on existing hierarchical compositional models.

The authors tackled the problem of learning hierarchical generative models from images with background clutter without strong prior assumptions or segmented data, achieving competitive results in object classification on a standard dataset.

In this work, we consider the problem of learning a hierarchical generative model of an object from a set of images which show examples of the object in the presence of variable background clutter. Existing approaches to this problem are limited by making strong a-priori assumptions about the object's geometric structure and require segmented training data for learning. In this paper, we propose a novel framework for learning hierarchical compositional models (HCMs) which do not suffer from the mentioned limitations. We present a generalized formulation of HCMs and describe a greedy structure learning framework that consists of two phases: Bottom-up part learning and top-down model composition. Our framework integrates the foreground-background segmentation problem into the structure learning task via a background model. As a result, we can jointly optimize for the number of layers in the hierarchy, the number of parts per layer and a foreground-background segmentation based on class labels only. We show that the learned HCMs are semantically meaningful and achieve competitive results when compared to other generative object models at object classification on a standard transfer learning dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes