NEARLGSep 23, 2020

Procrustes: a Dataflow and Accelerator for Sparse Deep Neural Network Training

arXiv:2009.10976v157 citations
Originality Incremental advance
AI Analysis

This addresses the problem of high energy and time costs for training pruned models in machine learning, offering a domain-specific solution for efficient sparse DNN training.

The paper tackles the inefficiency of existing accelerators for sparse deep neural network training by proposing Procrustes, a co-designed dataflow and accelerator that adapts algorithms and hardware to support sparse training, resulting in up to 4x speedup and 3.26x less energy consumption while maintaining accuracy.

The success of DNN pruning has led to the development of energy-efficient inference accelerators that support pruned models with sparse weight and activation tensors. Because the memory layouts and dataflows in these architectures are optimized for the access patterns during $\mathit{inference}$, however, they do not efficiently support the emerging sparse $\mathit{training}$ techniques. In this paper, we demonstrate (a) that accelerating sparse training requires a co-design approach where algorithms are adapted to suit the constraints of hardware, and (b) that hardware for sparse DNN training must tackle constraints that do not arise in inference accelerators. As proof of concept, we adapt a sparse training algorithm to be amenable to hardware acceleration; we then develop dataflow, data layout, and load-balancing techniques to accelerate it. The resulting system is a sparse DNN training accelerator that produces pruned models with the same accuracy as dense models without first training, then pruning, and finally retraining, a dense model. Compared to training the equivalent unpruned models using a state-of-the-art DNN accelerator without sparse training support, Procrustes consumes up to 3.26$\times$ less energy and offers up to 4$\times$ speedup across a range of models, while pruning weights by an order of magnitude and maintaining unpruned accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes