IVCVMar 4, 2024

A Spatio-temporal Aligned SUNet Model for Low-light Video Enhancement

arXiv:2403.02408v312 citationsh-index: 23ICIP
AI Analysis

This addresses the problem of degraded computer vision performance due to low-light distortions, offering a domain-specific solution for video enhancement.

The paper tackles low-light video enhancement by proposing a Spatio-Temporal Aligned SUNet (STA-SUNet) model, which achieves the highest PSNR and SSIM values across three test datasets, particularly excelling in extreme low-light conditions.

Distortions caused by low-light conditions are not only visually unpleasant but also degrade the performance of computer vision tasks. The restoration and enhancement have proven to be highly beneficial. However, there are only a limited number of enhancement methods explicitly designed for videos acquired in low-light conditions. We propose a Spatio-Temporal Aligned SUNet (STA-SUNet) model using a Swin Transformer as a backbone to capture low light video features and exploit their spatio-temporal correlations. The STA-SUNet model is trained on a novel, fully registered dataset (BVI), which comprises dynamic scenes captured under varying light conditions. It is further analysed comparatively against various other models over three test datasets. The model demonstrates superior adaptivity across all datasets, obtaining the highest PSNR and SSIM values. It is particularly effective in extreme low-light conditions, yielding fairly good visualisation results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes