AR ET LGJul 11, 2024

OPIMA: Optical Processing-In-Memory for Convolutional Neural Network Acceleration

Febin Sunny, Amin Shafiee, Abhishek Balasubramaniam, Mahdi Nikdast, Sudeep Pasricha

arXiv:2407.08205v12.36 citationsh-index: 36

Originality Highly original

AI Analysis

This addresses the problem of high latency and energy consumption in ML accelerators for researchers and engineers, offering a novel optical approach that is not incremental but a significant advancement over existing methods.

The paper tackles the bottleneck of data movement in deep neural networks by proposing OPIMA, an optical processing-in-memory accelerator for convolutional neural networks, achieving 2.98x higher throughput and 137x better energy efficiency compared to prior work.

Recent advances in machine learning (ML) have spotlighted the pressing need for computing architectures that bridge the gap between memory bandwidth and processing power. The advent of deep neural networks has pushed traditional Von Neumann architectures to their limits due to the high latency and energy consumption costs associated with data movement between the processor and memory for these workloads. One of the solutions to overcome this bottleneck is to perform computation within the main memory through processing-in-memory (PIM), thereby limiting data movement and the costs associated with it. However, DRAM-based PIM struggles to achieve high throughput and energy efficiency due to internal data movement bottlenecks and the need for frequent refresh operations. In this work, we introduce OPIMA, a PIM-based ML accelerator, architected within an optical main memory. OPIMA has been designed to leverage the inherent massive parallelism within main memory while performing high-speed, low-energy optical computation to accelerate ML models based on convolutional neural networks. We present a comprehensive analysis of OPIMA to guide design choices and operational mechanisms. Additionally, we evaluate the performance and energy consumption of OPIMA, comparing it with conventional electronic computing systems and emerging photonic PIM architectures. The experimental results show that OPIMA can achieve 2.98x higher throughput and 137x better energy efficiency than the best-known prior work.

View on arXiv PDF

Similar