NE AIMar 24, 2023

PowerPruning: Selecting Weights and Activations for Power-Efficient Neural Network Acceleration

Richard Petri, Grace Li Zhang, Yiran Chen, Ulf Schlichtmann, Bing Li

arXiv:2303.13997v211.914 citationsh-index: 24

Originality Incremental advance

AI Analysis

This addresses power efficiency for deploying DNNs on edge devices, representing an incremental improvement over existing pruning techniques.

The paper tackles the problem of high power consumption in deep neural networks on edge devices by proposing PowerPruning, a method that selects weights and activations to reduce power in MAC operations, achieving up to 78.3% power reduction with minimal accuracy loss.

Deep neural networks (DNNs) have been successfully applied in various fields. A major challenge of deploying DNNs, especially on edge devices, is power consumption, due to the large number of multiply-and-accumulate (MAC) operations. To address this challenge, we propose PowerPruning, a novel method to reduce power consumption in digital neural network accelerators by selecting weights that lead to less power consumption in MAC operations. In addition, the timing characteristics of the selected weights together with all activation transitions are evaluated. The weights and activations that lead to small delays are further selected. Consequently, the maximum delay of the sensitized circuit paths in the MAC units is reduced even without modifying MAC units, which thus allows a flexible scaling of supply voltage to reduce power consumption further. Together with retraining, the proposed method can reduce power consumption of DNNs on hardware by up to 78.3% with only a slight accuracy loss.

View on arXiv PDF

Similar