CVMay 3, 2021

Multi-modal Bifurcated Network for Depth Guided Image Relighting

arXiv:2105.00690v214 citations
Originality Incremental advance
AI Analysis

This work addresses illumination adjustment in images for computer vision applications, representing an incremental improvement with a novel network design.

The paper tackles image relighting by generating new images with specified illumination using depth maps, achieving first place in SSIM and PMS metrics on the NTIRE 2021 challenge.

Image relighting aims to recalibrate the illumination setting in an image. In this paper, we propose a deep learning-based method called multi-modal bifurcated network (MBNet) for depth guided image relighting. That is, given an image and the corresponding depth maps, a new image with the given illuminant angle and color temperature is generated by our network. This model extracts the image and the depth features by the bifurcated network in the encoder. To use the two features effectively, we adopt the dynamic dilated pyramid modules in the decoder. Moreover, to increase the variety of training data, we propose a novel data process pipeline to increase the number of the training data. Experiments conducted on the VIDIT dataset show that the proposed solution obtains the \textbf{1}$^{st}$ place in terms of SSIM and PMS in the NTIRE 2021 Depth Guide One-to-one Relighting Challenge.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes