ARDCLGJun 15, 2021

S2Engine: A Novel Systolic Architecture for Sparse Convolutional Neural Networks

arXiv:2106.07894v1
Originality Incremental advance
AI Analysis

This work addresses the problem of computational and memory inefficiencies in CNN acceleration for hardware designers, offering a domain-specific improvement in systolic architectures for sparse neural networks.

The paper tackles the challenge of inefficient execution of sparse convolutional neural networks (CNNs) on traditional systolic architectures by proposing S2Engine, a novel systolic architecture that fully exploits sparsity with compressed data transmission and dynamic data selection, achieving about 3.2× speed improvement and about 3.0× energy efficiency gain compared to a naive systolic array.

Convolutional neural networks (CNNs) have achieved great success in performing cognitive tasks. However, execution of CNNs requires a large amount of computing resources and generates heavy memory traffic, which imposes a severe challenge on computing system design. Through optimizing parallel executions and data reuse in convolution, systolic architecture demonstrates great advantages in accelerating CNN computations. However, regular internal data transmission path in traditional systolic architecture prevents the systolic architecture from completely leveraging the benefits introduced by neural network sparsity. Deployment of fine-grained sparsity on the existing systolic architectures is greatly hindered by the incurred computational overheads. In this work, we propose S2Engine $-$ a novel systolic architecture that can fully exploit the sparsity in CNNs with maximized data reuse. S2Engine transmits compressed data internally and allows each processing element to dynamically select an aligned data from the compressed dataflow in convolution. Compared to the naive systolic array, S2Engine achieves about $3.2\times$ and about $3.0\times$ improvements on speed and energy efficiency, respectively.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes