LGAIIVSPNAJul 3, 2024

SFC: Achieve Accurate Fast Convolution under Low-precision Arithmetic

arXiv:2407.02913v13 citationsh-index: 11
Originality Highly original
AI Analysis

This work addresses the problem of efficient quantized convolution for deep learning practitioners, offering a novel method that improves upon existing fast convolution techniques in low-precision settings.

The paper tackles the conflict between fast convolution algorithms and model quantization by proposing SFC, a new algebra transform that extends DFT with symbolic computing to reduce precision requirements, achieving a 3.68x multiplication reduction for 3x3 convolution while maintaining accuracy.

Fast convolution algorithms, including Winograd and FFT, can efficiently accelerate convolution operations in deep models. However, these algorithms depend on high-precision arithmetic to maintain inference accuracy, which conflicts with the model quantization. To resolve this conflict and further improve the efficiency of quantized convolution, we proposes SFC, a new algebra transform for fast convolution by extending the Discrete Fourier Transform (DFT) with symbolic computing, in which only additions are required to perform the transformation at specific transform points, avoiding the calculation of irrational number and reducing the requirement for precision. Additionally, we enhance convolution efficiency by introducing correction terms to convert invalid circular convolution outputs of the Fourier method into effective ones. The numerical error analysis is presented for the first time in this type of work and proves that our algorithms can provide a 3.68x multiplication reduction for 3x3 convolution, while the Winograd algorithm only achieves a 2.25x reduction with similarly low numerical errors. Experiments carried out on benchmarks and FPGA show that our new algorithms can further improve the computation efficiency of quantized models while maintaining accuracy, surpassing both the quantization-alone method and existing works on fast convolution quantization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes