IVCVDec 31, 2023

Compressing Deep Image Super-resolution Models

arXiv:2401.00523v210 citationsh-index: 13PCS
AI Analysis

This work addresses the practical deployment limitations of super-resolution models for applications requiring efficiency, though it is incremental as it builds on existing networks.

The paper tackles the problem of large model sizes and slow inference speeds in deep image super-resolution by introducing a three-stage compression workflow, achieving 89% and 96% reductions in model size and FLOPs for SwinIR and EDSR while maintaining competitive performance.

Deep learning techniques have been applied in the context of image super-resolution (SR), achieving remarkable advances in terms of reconstruction performance. Existing techniques typically employ highly complex model structures which result in large model sizes and slow inference speeds. This often leads to high energy consumption and restricts their adoption for practical applications. To address this issue, this work employs a three-stage workflow for compressing deep SR models which significantly reduces their memory requirement. Restoration performance has been maintained through teacher-student knowledge distillation using a newly designed distillation loss. We have applied this approach to two popular image super-resolution networks, SwinIR and EDSR, to demonstrate its effectiveness. The resulting compact models, SwinIRmini and EDSRmini, attain an 89% and 96% reduction in both model size and floating-point operations (FLOPs) respectively, compared to their original versions. They also retain competitive super-resolution performance compared to their original models and other commonly used SR approaches. The source code and pre-trained models for these two lightweight SR approaches are released at https://pikapi22.github.io/CDISM/.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes