Enhanced Self-Supervised Multi-Image Super-Resolution for Camera Array Images

Yating Chen, Feng Huang, Xianyu Wu, Jing Wu, Ying Shen

arXiv:2604.0681646.4h-index: 7

Predicted impact top 22% in OPTICS · last 90 daysOriginality Highly original

AI Analysis

This work addresses image restoration challenges in camera array imaging, offering a novel approach that improves detail recovery, but it is incremental as it builds on existing self-supervised and transformer-based techniques.

The paper tackles the problem of multi-image super-resolution (MISR) for camera array images, which suffer from degradation and occlusion issues, by proposing a self-supervised learning framework that combines Multi-to-Single and Multi-to-Multi methods and a dual Transformer network, achieving superior results on synthetic and real-world datasets.

Conventional multi-image super-resolution (MISR) methods, such as burst and video SR, rely on sequential frames from a single camera. Consequently, they suffer from complex image degradation and severe occlusion, increasing the difficulty of accurate image restoration. In contrast, multi-aperture camera-array imaging captures spatially distributed views with sampling offsets forming a stable disk-like distribution, which enhances the non-redundancy of observed data. Existing MISR algorithms fail to fully exploit these unique properties. Supervised MISR methods tend to overfit the degradation patterns in training data, and current self-supervised learning (SSL) techniques struggle to recover fine-grained details. To address these issues, this paper thoroughly investigates the strengths, limitations and applicability boundaries of multi-image-to-single-image (Multi-to-Single) and multi-image-to-multi-image (Multi-to-Multi) SSL methods. We propose the Multi-to-Single-Guided Multi-to-Multi SSL framework that combines the advantages of Multi-to-Single and Multi-to-Multi to generate visually appealing and high-fidelity images rich in texture details. The Multi-to-Single-Guided Multi-to-Multi SSL framework provides a new paradigm for integrating deep neural network with classical physics-based variational methods. To enhance the ability of MISR network to recover high-frequency details from aliased artifacts, this paper proposes a novel camera-array SR network called dual Transformer suitable for SSL. Experiments on synthetic and real-world datasets demonstrate the superiority of the proposed method.

View on arXiv PDF

Similar