CV · Mar 25, 2014

A Tiered Move-making Algorithm for General Non-submodular Pairwise Energies

Vibhav Vineet, Jonathan Warrell, Philip H. S. Torr

arXiv:1403.6275v1 · 1 citation

Originality: Incremental advance

AI Analysis

This paper addresses a bottleneck in computer vision tasks that require complex pairwise terms, offering an incremental improvement over existing move-making methods.

The paper tackles the minimization of energy functions with non-submodular pairwise terms in computer vision. It proposes a tiered move-making algorithm that achieves better accuracy and energy values than alpha-expansion, loopy belief propagation, and quadratic pseudo-boolean optimization, and that is competitive with TRWS on benchmarks such as Pascal VOC-11 segmentation.

Many problems in computer vision can be modelled as energy minimization problems in a Markov Random Field (MRF) or Conditional Random Field (CRF) framework. Graph-cut-based $\alpha$-expansion is a standard move-making method for minimizing energy functions with submodular pairwise terms. However, certain problems require more complex pairwise terms for which $\alpha$-expansion is generally not applicable. In this paper, we propose an iterative tiered move-making algorithm that can handle general pairwise terms. Each move to the next configuration is based on the current labeling and an optimal tiered move, where each tiered move requires one application of the dynamic-programming-based tiered labeling method introduced by Felzenszwalb et al. \cite{tiered_cvpr_felzenszwalbV10}. The algorithm converges to a local minimum for any general pairwise potential, and we give a theoretical analysis of its properties, characterizing the situations in which good performance can be expected. We first evaluate our method on an object-class segmentation problem using the Pascal VOC-11 segmentation dataset, where we learn general pairwise terms. We further evaluate the algorithm on other benchmark labeling problems, including stereo, image segmentation, image stitching, and image denoising. Our method consistently achieves better accuracy and lower energy values than $\alpha$-expansion, loopy belief propagation (LBP), and quadratic pseudo-boolean optimization (QPBO), and is competitive with TRWS.
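To make the abstract's distinction concrete, the following minimal Python sketch evaluates a pairwise MRF energy and tests whether a pairwise term satisfies the standard condition under which every $\alpha$-expansion move is a submodular binary problem. It is an illustration only and does not implement the paper's tiered move. The function names, the toy costs, and the use of a single pairwise table shared by all edges are assumptions made for the example.

```python
import itertools

def energy(labels, unary, edges, pairwise):
    """Pairwise MRF energy: unary costs plus pairwise costs summed over edges.

    Assumption for this sketch: one pairwise table is shared by all edges.
    """
    total = sum(unary[i][labels[i]] for i in range(len(labels)))
    total += sum(pairwise[labels[i]][labels[j]] for i, j in edges)
    return total

def expansion_applicable(pairwise):
    """Check V(a,a) + V(b,c) <= V(b,a) + V(a,c) for all labels a, b, c.

    When this holds, each alpha-expansion move reduces to a submodular
    binary problem that a graph cut can solve exactly.
    """
    n = len(pairwise)
    return all(
        pairwise[a][a] + pairwise[b][c] <= pairwise[b][a] + pairwise[a][c]
        for a, b, c in itertools.product(range(n), repeat=3)
    )

# Toy data (hypothetical): a 3-node chain with 2 labels.
unary = [[0, 2], [1, 0], [3, 1]]
edges = [(0, 1), (1, 2)]

potts = [[0, 1], [1, 0]]      # penalizes neighbours that disagree
repulsive = [[1, 0], [0, 1]]  # penalizes neighbours that agree

print(energy([0, 1, 1], unary, edges, potts))
print(expansion_applicable(potts))
print(expansion_applicable(repulsive))
```

The Potts term passes the check, while the repulsive term fails it. The second case is the kind of general pairwise potential the abstract has in mind when it says $\alpha$-expansion is generally not applicable.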
