CVDec 24, 2015

Truncated Max-of-Convex Models

arXiv:1512.07815v2
Originality Incremental advance
AI Analysis

This work addresses a problem in computer vision by enhancing random field models for better image labeling, though it appears incremental as it builds directly on existing TCM frameworks.

The authors tackled the limitation of truncated convex models (TCM) in capturing high-order image statistics by proposing truncated max-of-convex models (TMCM), a generalization to high-order random fields, and demonstrated improved performance over pairwise TCM on synthetic and real datasets.

Truncated convex models (TCM) are a special case of pairwise random fields that have been widely used in computer vision. However, by restricting the order of the potentials to be at most two, they fail to capture useful image statistics. We propose a natural generalization of TCM to high-order random fields, which we call truncated max-of-convex models (TMCM). The energy function of TMCM consistsof two types of potentials: (i) unary potential, which has no restriction on its form; and (ii) clique potential, which is the sum of the m largest truncated convex distances over all label pairs in a clique. The use of a convex distance function encourages smoothness, while truncation allows for discontinuities in the labeling. By using m > 1, TMCM provides robustness towards errors in the definition of the cliques. In order to minimize the energy function of a TMCM over all possible labelings, we design an efficient st-MINCUT based range expansion algorithm. We prove the accuracy of our algorithm by establishing strong multiplicative bounds for several special cases of interest. Using synthetic and standard real data sets, we demonstrate the benefit of our high-order TMCM over pairwise TCM, as well as the benefit of our range expansion algorithm over other st-MINCUT based approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes