CV · IV · Jun 3, 2022

Learning sRGB-to-Raw-RGB De-rendering with Content-Aware Metadata

arXiv:2206.01813v1 · 32 citations · h-index: 62
Originality: Incremental advance
AI Analysis

This work addresses the need for linear raw-RGB images in computer vision tasks, offering an incremental improvement over prior raw reconstruction techniques.

The paper tackles the problem of reconstructing raw-RGB images from sRGB images by jointly learning sampling and reconstruction, resulting in improved raw reconstructions compared to existing methods.

Most camera images are rendered and saved in the standard RGB (sRGB) format by the camera's hardware. Due to the in-camera photo-finishing routines, nonlinear sRGB images are undesirable for computer vision tasks that assume a direct relationship between pixel values and scene radiance. For such applications, linear raw-RGB sensor images are preferred. Saving images in their raw-RGB format is still uncommon due to the large storage requirement and lack of support by many imaging applications. Several "raw reconstruction" methods have been proposed that utilize specialized metadata sampled from the raw-RGB image at capture time and embedded in the sRGB image. This metadata is used to parameterize a mapping function to de-render the sRGB image back to its original raw-RGB format when needed. Existing raw reconstruction methods rely on simple sampling strategies and global mapping to perform the de-rendering. This paper shows how to improve the de-rendering results by jointly learning sampling and reconstruction. Our experiments show that our learned sampling can adapt to the image content to produce better raw reconstructions than existing methods. We also describe an online fine-tuning strategy for the reconstruction network to improve results further.
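The baseline that the abstract contrasts against (simple sampling plus a global mapping) can be illustrated with a minimal sketch. This is not the paper's learned, content-aware method. The toy rendering pipeline, the uniform grid step, the second-order polynomial mapping, and all function names below are assumptions chosen only for illustration.

```python
import numpy as np

def render_toy_srgb(raw):
    """Toy stand-in for in-camera rendering: colour matrix, then gamma (assumed)."""
    ccm = np.array([[1.2, -0.1, -0.1],
                    [-0.1, 1.2, -0.1],
                    [-0.1, -0.1, 1.2]])
    return np.clip(raw @ ccm.T, 0.0, 1.0) ** (1.0 / 2.2)

def uniform_grid(height, width, step):
    """Pixel coordinates on a regular grid: a simple, content-agnostic sampler."""
    rows, cols = np.meshgrid(np.arange(step // 2, height, step),
                             np.arange(step // 2, width, step), indexing="ij")
    return rows.ravel(), cols.ravel()

def poly_features(rgb):
    """Second-order polynomial expansion of N x 3 RGB values (hypothetical choice)."""
    r, g, b = rgb[:, 0], rgb[:, 1], rgb[:, 2]
    return np.stack([r, g, b, r * r, g * g, b * b, r * g, r * b, g * b,
                     np.ones_like(r)], axis=1)

def fit_global_mapping(srgb, raw_samples, rows, cols):
    """Least-squares fit of one global sRGB -> raw mapping from the sampled metadata."""
    features = poly_features(srgb[rows, cols])
    coeffs, *_ = np.linalg.lstsq(features, raw_samples, rcond=None)
    return coeffs  # shape (10, 3): one column per raw channel

def derender(srgb, coeffs):
    """Apply the fitted global mapping to every pixel of the sRGB image."""
    height, width, _ = srgb.shape
    flat = poly_features(srgb.reshape(-1, 3)) @ coeffs
    return flat.reshape(height, width, 3)

def rmse(a, b):
    return float(np.sqrt(np.mean((a - b) ** 2)))

# Synthetic example: a random "raw" image and its toy sRGB rendering.
rng = np.random.default_rng(0)
raw = rng.uniform(0.2, 0.8, size=(64, 64, 3))
srgb = render_toy_srgb(raw)

# At "capture time": sample raw values on the grid and keep them as metadata.
rows, cols = uniform_grid(64, 64, step=8)
metadata = raw[rows, cols]

# Later: fit the mapping from the metadata and de-render the whole image.
coeffs = fit_global_mapping(srgb, metadata, rows, cols)
raw_hat = derender(srgb, coeffs)

print("RMSE, sRGB used as raw:", rmse(srgb, raw))
print("RMSE, de-rendered:", rmse(raw_hat, raw))
```

Here the stored metadata is only 64 raw-RGB triplets for a 64 x 64 image. According to the abstract, the paper replaces the fixed grid with a learned sampler that adapts to image content, and the global fit with a reconstruction network trained jointly with that sampler.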

