IVCVNov 12, 2019

Merging-ISP: Multi-Exposure High Dynamic Range Image Signal Processing

arXiv:1911.04762v214 citations
AI Analysis

This addresses error propagation in HDR imaging for digital cameras, though it is an incremental improvement over existing deep learning methods.

The paper tackles the problem of reconstructing high dynamic range (HDR) images from multiple raw exposures by proposing Merging-ISP, a deep neural network that jointly handles demosaicing, alignment, and merging, resulting in over 1 dB PSNR improvement over state-of-the-art cascaded pipelines.

High dynamic range (HDR) imaging combines multiple images with different exposure times into a single high-quality image. The image signal processing pipeline (ISP) is a core component in digital cameras to perform these operations. It includes demosaicing of raw color filter array (CFA) data at different exposure times, alignment of the exposures, conversion to HDR domain, and exposure merging into an HDR image. Traditionally, such pipelines cascade algorithms that address these individual subtasks. However, cascaded designs suffer from error propagation, since simply combining multiple steps is not necessarily optimal for the entire imaging task. This paper proposes a multi-exposure HDR image signal processing pipeline (Merging-ISP) to jointly solve all these subtasks. Our pipeline is modeled by a deep neural network architecture. As such, it is end-to-end trainable, circumvents the use of hand-crafted and potentially complex algorithms, and mitigates error propagation. Merging-ISP enables direct reconstructions of HDR images of dynamic scenes from multiple raw CFA images with different exposures. We compare Merging-ISP against several state-of-the-art cascaded pipelines. The proposed method provides HDR reconstructions of high perceptual quality and it quantitatively outperforms competing ISPs by more than 1 dB in terms of PSNR.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes