CVIVMar 22, 2023

SCALES: Boost Binary Neural Network for Image Super-Resolution with Efficient Scalings

arXiv:2303.12270v27 citationsh-index: 99
Originality Incremental advance
AI Analysis

This work addresses efficient deployment of SR models on resource-constrained devices, offering incremental improvements in BNN performance for a specific domain.

The paper tackles the performance gap in binary neural networks (BNNs) for image super-resolution (SR) by proposing SCALES, a binarization method that captures activation variations, resulting in a 0.2dB improvement over prior art for CNN-based networks and over 1dB for Transformer-based networks.

Deep neural networks for image super-resolution (SR) have demonstrated superior performance. However, the large memory and computation consumption hinders their deployment on resource-constrained devices. Binary neural networks (BNNs), which quantize the floating point weights and activations to 1-bit can significantly reduce the cost. Although BNNs for image classification have made great progress these days, existing BNNs for SR still suffer from a large performance gap between the FP SR networks. To this end, we observe the activation distribution in SR networks and find much larger pixel-to-pixel, channel-to-channel, layer-to-layer, and image-to-image variation in the activation distribution than image classification networks. However, existing BNNs for SR fail to capture these variations that contain rich information for image reconstruction, leading to inferior performance. To address this problem, we propose SCALES, a binarization method for SR networks that consists of the layer-wise scaling factor, the spatial re-scaling method, and the channel-wise re-scaling method, capturing the layer-wise, pixel-wise, and channel-wise variations efficiently in an input-dependent manner. We evaluate our method across different network architectures and datasets. For CNN-based SR networks, our binarization method SCALES outperforms the prior art method by 0.2dB with fewer parameters and operations. With SCALES, we achieve the first accurate binary Transformer-based SR network, improving PSNR by more than 1dB compared to the baseline method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes