CVJan 23, 2022

Wavelet-Attention CNN for Image Classification

arXiv:2201.09271v197 citations
AI Analysis

This addresses image classification accuracy for computer vision researchers, but it is incremental as it builds on existing CNN backbones like MobileNetV2.

The paper tackled the problem of noise weakening convolutional feature statistics in image classification by proposing a Wavelet-Attention CNN that uses Discrete Wavelet Transform to apply attention only in the high-frequency domain, resulting in improvements of 1.26% Top-1 accuracy on CIFAR-10 and 1.54% on CIFAR-100.

The feature learning methods based on convolutional neural network (CNN) have successfully produced tremendous achievements in image classification tasks. However, the inherent noise and some other factors may weaken the effectiveness of the convolutional feature statistics. In this paper, we investigate Discrete Wavelet Transform (DWT) in the frequency domain and design a new Wavelet-Attention (WA) block to only implement attention in the high-frequency domain. Based on this, we propose a Wavelet-Attention convolutional neural network (WA-CNN) for image classification. Specifically, WA-CNN decomposes the feature maps into low-frequency and high-frequency components for storing the structures of the basic objects, as well as the detailed information and noise, respectively. Then, the WA block is leveraged to capture the detailed information in the high-frequency domain with different attention factors but reserves the basic object structures in the low-frequency domain. Experimental results on CIFAR-10 and CIFAR-100 datasets show that our proposed WA-CNN achieves significant improvements in classification accuracy compared to other related networks. Specifically, based on MobileNetV2 backbones, WA-CNN achieves 1.26% Top-1 accuracy improvement on the CIFAR-10 benchmark and 1.54% Top-1 accuracy improvement on the CIFAR-100 benchmark.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes