AR · ET · LG · Jun 27, 2023

A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms

arXiv:2306.15552v3 · 128 citations · h-index: 107
Originality: Synthesis-oriented
AI Analysis

The paper provides a comprehensive overview for researchers and practitioners in HPC and deep learning. Its contribution is incremental because it is a survey that synthesizes existing work rather than presenting new research.

This survey summarizes recent developments in deep learning hardware accelerators for heterogeneous HPC platforms, covering a wide range of technologies including GPUs, TPUs, FPGAs, ASICs, and emerging approaches like quantum-based and photonic accelerators.

Recent trends in deep learning (DL) have made hardware accelerators essential for various high-performance computing (HPC) applications, including image classification, computer vision, and speech recognition. This survey summarizes and classifies the most recent developments in DL accelerators, focusing on their role in meeting the performance demands of HPC applications. We explore cutting-edge approaches to DL acceleration, covering not only GPU- and TPU-based platforms but also specialized hardware such as FPGA- and ASIC-based accelerators, Neural Processing Units, open hardware RISC-V-based accelerators, and co-processors. This survey also describes accelerators leveraging emerging memory technologies and computing paradigms, including 3D-stacked Processor-In-Memory, non-volatile memories like Resistive RAM and Phase Change Memories used for in-memory computing, as well as Neuromorphic Processing Units, and Multi-Chip Module-based accelerators. Furthermore, we provide insights into emerging quantum-based accelerators and photonics. Finally, this survey categorizes the most influential architectures and technologies from recent years, offering readers a comprehensive perspective on the rapidly evolving field of deep learning acceleration.

