CVLGNov 23, 2015

Constrained Structured Regression with Convolutional Neural Networks

arXiv:1511.07497v141 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of improving regression accuracy in computer vision by integrating constraints, offering a domain-specific advancement for tasks like intrinsic image decomposition.

The paper tackles the lack of probability estimates in continuous regression tasks by introducing a regression framework that models output distributions and incorporates physical or modeling constraints, demonstrating significant state-of-the-art improvements on intrinsic image decomposition.

Convolutional Neural Networks (CNNs) have recently emerged as the dominant model in computer vision. If provided with enough training data, they predict almost any visual quantity. In a discrete setting, such as classification, CNNs are not only able to predict a label but often predict a confidence in the form of a probability distribution over the output space. In continuous regression tasks, such a probability estimate is often lacking. We present a regression framework which models the output distribution of neural networks. This output distribution allows us to infer the most likely labeling following a set of physical or modeling constraints. These constraints capture the intricate interplay between different input and output variables, and complement the output of a CNN. However, they may not hold everywhere. Our setup further allows to learn a confidence with which a constraint holds, in the form of a distribution of the constrain satisfaction. We evaluate our approach on the problem of intrinsic image decomposition, and show that constrained structured regression significantly increases the state-of-the-art.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes