CVNov 29, 2016

Efficient Linear Programming for Dense CRFs

arXiv:1611.09718v27 citations
AI Analysis

This work addresses a computational bottleneck for researchers and practitioners in computer vision, offering a faster method for dense CRFs, though it is incremental as it builds on prior LP approaches.

The paper tackles the slow speed of linear programming (LP) relaxation for minimizing dense CRF energies in semantic segmentation, introducing an efficient algorithm that outperforms existing baselines on standard datasets.

The fully connected conditional random field (CRF) with Gaussian pairwise potentials has proven popular and effective for multi-class semantic segmentation. While the energy of a dense CRF can be minimized accurately using a linear programming (LP) relaxation, the state-of-the-art algorithm is too slow to be useful in practice. To alleviate this deficiency, we introduce an efficient LP minimization algorithm for dense CRFs. To this end, we develop a proximal minimization framework, where the dual of each proximal problem is optimized via block coordinate descent. We show that each block of variables can be efficiently optimized. Specifically, for one block, the problem decomposes into significantly smaller subproblems, each of which is defined over a single pixel. For the other block, the problem is optimized via conditional gradient descent. This has two advantages: 1) the conditional gradient can be computed in a time linear in the number of pixels and labels; and 2) the optimal step size can be computed analytically. Our experiments on standard datasets provide compelling evidence that our approach outperforms all existing baselines including the previous LP based approach for dense CRFs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes