CVOct 8, 2020

Estimation of Camera Response Function using Prediction Consistency and Gradual Refinement with an Extension to Deep Learning

arXiv:2010.04009v29 citations
AI Analysis

This addresses a domain-specific problem in computer vision for researchers and practitioners needing accurate CRF estimation from real-world images, but it is incremental as it builds on prior methods like EdgeCRF and CRFNet.

The paper tackles the problem of camera response function (CRF) estimation from a single image, where existing methods fail on general real images due to noise or limited training data, and introduces a non-deep-learning method using prediction consistency and gradual refinement that outperforms existing single-image methods for daytime and nighttime images, with a deep learning extension improving generalization.

Most existing methods for CRF estimation from a single image fail to handle general real images. For instance, EdgeCRF based on colour patches extracted from edges works effectively only when the presence of noise is insignificant, which is not the case for many real images; and, CRFNet, a recent method based on fully supervised deep learning works only for the CRFs that are in the training data, and hence fail to deal with other possible CRFs beyond the training data. To address these problems, we introduce a non-deep-learning method using prediction consistency and gradual refinement. First, we rely more on the patches of the input image that provide more consistent predictions. If the predictions from a patch are more consistent, it means that the patch is likely to be less affected by noise or any inferior colour combinations, and hence, it can be more reliable for CRF estimation. Second, we employ a gradual refinement scheme in which we start from a simple CRF model to generate a result which is more robust to noise but less accurate, and then we gradually increase the model's complexity to improve the result. This is because a simple model, while being less accurate, overfits less to noise than a complex model does. Our experiments show that our method outperforms the existing single-image methods for daytime and nighttime real images. We further propose a more efficient deep learning extension that performs test-time training (based on unsupervised losses) on the test input image. This provides our method better generalization performance than CRFNet making it more practically applicable for CRF estimation for general real images.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes