ARApr 20

AccelCIM: Systematic Dataflow Exploration for SRAM Compute-in-Memory Accelerator

arXiv:2604.1769287.9h-index: 7
AI Analysis

This work addresses the dataflow design bottleneck for SRAM CIM accelerators, which is critical for efficient deployment of large DNN models.

AccelCIM introduces a systematic dataflow exploration framework for SRAM CIM accelerators, addressing data movement overhead for large DNN models. It provides practical insights for LLM applications through extensive design space exploration and rigorous evaluation.

SRAM-based compute-in-memory (CIM) offers high computational density and energy efficiency for deep neural network (DNN) accelerators, but its limited capacity causes on/off-chip data movement overhead for large DNN models. Existing CIM accelerator studies typically assume that DNN models fit entirely on-chip, leaving efficient dataflow design largely untapped. This paper introduces AccelCIM, a systematic dataflow exploration framework for SRAM CIM accelerator, which addresses two key limitations of prior work. (1) It formulates a systematic dataflow design space spanning CIM macro configurations and macro-array organizations. (2) It introduces rigorous design evaluation using cycle-accurate architectural simulation and post-layout PPA analysis. We conduct an extensive design space exploration and apply AccelCIM to representative LLM applications, providing practical insights for the principled design of CIM accelerators.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes