LGARNESep 9, 2021

SONIC: A Sparse Neural Network Inference Accelerator with Silicon Photonics for Energy-Efficient Deep Learning

arXiv:2109.04459v125 citations
AI Analysis

This work addresses the problem of energy-efficient deep learning inference for resource-constrained platforms, representing a novel hardware-software co-design approach.

The paper tackles the challenge of deploying sparse neural networks on resource-constrained platforms by proposing SONIC, a silicon photonics-based accelerator, which achieves up to 5.8x better performance-per-watt and 8.4x lower energy-per-bit compared to state-of-the-art sparse electronic accelerators.

Sparse neural networks can greatly facilitate the deployment of neural networks on resource-constrained platforms as they offer compact model sizes while retaining inference accuracy. Because of the sparsity in parameter matrices, sparse neural networks can, in principle, be exploited in accelerator architectures for improved energy-efficiency and latency. However, to realize these improvements in practice, there is a need to explore sparsity-aware hardware-software co-design. In this paper, we propose a novel silicon photonics-based sparse neural network inference accelerator called SONIC. Our experimental analysis shows that SONIC can achieve up to 5.8x better performance-per-watt and 8.4x lower energy-per-bit than state-of-the-art sparse electronic neural network accelerators; and up to 13.8x better performance-per-watt and 27.6x lower energy-per-bit than the best known photonic neural network accelerators.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes