LGAug 15, 2022

Combining Gradients and Probabilities for Heterogeneous Approximation of Neural Networks

arXiv:2208.07265v19 citationsh-index: 21Has Code
Originality Incremental advance
AI Analysis

This addresses energy efficiency for hardware implementations of neural networks, presenting an incremental improvement by combining existing techniques like additive Gaussian noise with novel training methods.

This work tackles the problem of reducing energy consumption in neural network computations by using heterogeneous approximate multipliers, achieving energy reductions of 70-79% for ResNet on CIFAR-10 with less than 1% accuracy loss and 53% for VGG16 on Tiny ImageNet with a 0.5% drop in Top-5 accuracy.

This work explores the search for heterogeneous approximate multiplier configurations for neural networks that produce high accuracy and low energy consumption. We discuss the validity of additive Gaussian noise added to accurate neural network computations as a surrogate model for behavioral simulation of approximate multipliers. The continuous and differentiable properties of the solution space spanned by the additive Gaussian noise model are used as a heuristic that generates meaningful estimates of layer robustness without the need for combinatorial optimization techniques. Instead, the amount of noise injected into the accurate computations is learned during network training using backpropagation. A probabilistic model of the multiplier error is presented to bridge the gap between the domains; the model estimates the standard deviation of the approximate multiplier error, connecting solutions in the additive Gaussian noise space to actual hardware instances. Our experiments show that the combination of heterogeneous approximation and neural network retraining reduces the energy consumption for multiplications by 70% to 79% for different ResNet variants on the CIFAR-10 dataset with a Top-1 accuracy loss below one percentage point. For the more complex Tiny ImageNet task, our VGG16 model achieves a 53 % reduction in energy consumption with a drop in Top-5 accuracy of 0.5 percentage points. We further demonstrate that our error model can predict the parameters of an approximate multiplier in the context of the commonly used additive Gaussian noise (AGN) model with high accuracy. Our software implementation is available under https://github.com/etrommer/agn-approx.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes