IV CR CVSep 5, 2022

Uformer-ICS: A U-Shaped Transformer for Image Compressive Sensing Service

Kuiyuan Zhang, Zhongyun Hua, Yuanman Li, Yushu Zhang, Yicong Zhou

arXiv:2209.01763v24.88 citationsh-index: 18

Originality Incremental advance

AI Analysis

This work addresses the problem of efficient image acquisition and reconstruction for service computing applications, offering an incremental improvement over existing deep learning-based compressive sensing methods.

The paper tackles the challenge of accurate image reconstruction from compressive sensing measurements at low sampling rates by proposing Uformer-ICS, a U-shaped transformer that integrates adaptive sampling and multi-channel projection modules, achieving significantly better performance than state-of-the-art deep learning methods.

Many service computing applications require real-time dataset collection from multiple devices, necessitating efficient sampling techniques to reduce bandwidth and storage pressure. Compressive sensing (CS) has found wide-ranging applications in image acquisition and reconstruction. Recently, numerous deep-learning methods have been introduced for CS tasks. However, the accurate reconstruction of images from measurements remains a significant challenge, especially at low sampling rates. In this paper, we propose Uformer-ICS as a novel U-shaped transformer for image CS tasks by introducing inner characteristics of CS into transformer architecture. To utilize the uneven sparsity distribution of image blocks, we design an adaptive sampling architecture that allocates measurement resources based on the estimated block sparsity, allowing the compressed results to retain maximum information from the original image. Additionally, we introduce a multi-channel projection (MCP) module inspired by traditional CS optimization methods. By integrating the MCP module into the transformer blocks, we construct projection-based transformer blocks, and then form a symmetrical reconstruction model using these blocks and residual convolutional blocks. Therefore, our reconstruction model can simultaneously utilize the local features and long-range dependencies of image, and the prior projection knowledge of CS theory. Experimental results demonstrate its significantly better reconstruction performance than state-of-the-art deep learning-based CS methods.

View on arXiv PDF

Similar