IVCVJul 31, 2025

Learning Arbitrary-Scale RAW Image Downscaling with Wavelet-based Recurrent Reconstruction

arXiv:2507.23219v1h-index: 9Has CodeMM
Originality Incremental advance
AI Analysis

This addresses efficient storage and transmission of high-resolution images for photography and imaging applications, but it is incremental as it builds on existing downscaling methods by focusing on RAW data.

The paper tackles the problem of downscaling RAW images, which lack specialized frameworks, by proposing a wavelet-based recurrent reconstruction framework that preserves structural and textural integrity, and it outperforms state-of-the-art methods quantitatively and visually.

Image downscaling is critical for efficient storage and transmission of high-resolution (HR) images. Existing learning-based methods focus on performing downscaling within the sRGB domain, which typically suffers from blurred details and unexpected artifacts. RAW images, with their unprocessed photonic information, offer greater flexibility but lack specialized downscaling frameworks. In this paper, we propose a wavelet-based recurrent reconstruction framework that leverages the information lossless attribute of wavelet transformation to fulfill the arbitrary-scale RAW image downscaling in a coarse-to-fine manner, in which the Low-Frequency Arbitrary-Scale Downscaling Module (LASDM) and the High-Frequency Prediction Module (HFPM) are proposed to preserve structural and textural integrity of the reconstructed low-resolution (LR) RAW images, alongside an energy-maximization loss to align high-frequency energy between HR and LR domain. Furthermore, we introduce the Realistic Non-Integer RAW Downscaling (Real-NIRD) dataset, featuring a non-integer downscaling factor of 1.3$\times$, and incorporate it with publicly available datasets with integer factors (2$\times$, 3$\times$, 4$\times$) for comprehensive benchmarking arbitrary-scale image downscaling purposes. Extensive experiments demonstrate that our method outperforms existing state-of-the-art competitors both quantitatively and visually. The code and dataset will be released at https://github.com/RenYangSCU/ASRD.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes