AR | LG | Apr 22, 2025

COBRA: Algorithm-Architecture Co-optimized Binary Transformer Accelerator for Edge Inference

arXiv:2504.16269v2 | 4 citations | h-index: 4 | 2025 IEEE/ACM International Conference On Computer Aided Design (ICCAD)
Originality: Incremental advance
AI Analysis

COBRA enables efficient deployment of transformer models on edge platforms for local, secure inference, addressing a domain-specific bottleneck.

The paper tackles the inefficiency of binary transformers on edge hardware by introducing COBRA, an algorithm-architecture co-optimized accelerator that achieves up to 3,894.7 GOPS throughput and 448.7 GOPS/Watt energy efficiency, with a 311x energy efficiency improvement over GPUs and 3.5x throughput improvement over state-of-the-art binary accelerators.

Transformer-based models have demonstrated superior performance in various fields, including natural language processing and computer vision. However, their enormous model size and high demands in computation, memory, and communication limit their deployment on edge platforms for local, secure inference. Binary transformers offer a compact, low-complexity solution for edge deployment with reduced bandwidth needs and acceptable accuracy. However, existing binary transformers perform inefficiently on current hardware due to the lack of binary-specific optimizations. To address this, we introduce COBRA, an algorithm-architecture co-optimized binary Transformer accelerator for edge computing. COBRA features a real 1-bit binary multiplication unit, enabling matrix operations with -1, 0, and +1 values, surpassing ternary methods. With further hardware-friendly optimizations in the attention block, COBRA achieves up to 3,894.7 GOPS throughput and 448.7 GOPS/Watt energy efficiency on edge FPGAs, delivering a 311x energy efficiency improvement over GPUs and a 3.5x throughput improvement over the state-of-the-art binary accelerator, with only negligible inference accuracy degradation.
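
The abstract does not describe how the 1-bit multiplication unit is built. As background only, here is a minimal Python sketch of the generic XNOR-popcount trick that binary networks commonly use for dot products over {-1, +1}, plus one illustrative way to handle 0 entries with a separate mask bit. The function names (`pack_signs`, `binary_dot`, `masked_binary_dot`) and the mask encoding are assumptions made for this example and are not taken from the COBRA paper.

```python
def popcount(word):
    """Count the set bits in a non-negative integer."""
    return bin(word).count("1")


def pack_signs(values):
    """Pack a sequence into an int: bit i is 1 when values[i] == +1.

    Entries equal to -1 or 0 leave the bit cleared; zeros are tracked
    separately by pack_nonzero (an assumption of this sketch).
    """
    word = 0
    for i, v in enumerate(values):
        if v not in (-1, 0, 1):
            raise ValueError("expected only -1, 0, or +1")
        if v == 1:
            word |= 1 << i
    return word


def pack_nonzero(values):
    """Pack a mask: bit i is 1 when values[i] is nonzero."""
    word = 0
    for i, v in enumerate(values):
        if v != 0:
            word |= 1 << i
    return word


def binary_dot(a_bits, b_bits, n):
    """Dot product of two length-n {-1, +1} vectors given as packed bits.

    XNOR marks positions where the signs agree (product +1); every other
    position contributes -1, so the sum is matches - (n - matches).
    """
    mask = (1 << n) - 1
    agree = ~(a_bits ^ b_bits) & mask
    return 2 * popcount(agree) - n


def masked_binary_dot(a_bits, a_nonzero, b_bits, n):
    """Dot product where a is in {-1, 0, +1} and b is in {-1, +1}.

    Positions with a zero in `a` are dropped by the mask before counting.
    """
    valid = a_nonzero & ((1 << n) - 1)
    agree = ~(a_bits ^ b_bits) & valid
    return 2 * popcount(agree) - popcount(valid)


if __name__ == "__main__":
    a = [1, -1, 1, 1]
    b = [1, 1, -1, 1]
    print(binary_dot(pack_signs(a), pack_signs(b), len(a)))

    t = [1, 0, -1, 1]  # a vector that also contains a zero
    print(masked_binary_dot(pack_signs(t), pack_nonzero(t), pack_signs(b), len(t)))
```

The point of the sketch is that a multiply-accumulate over sign values reduces to a bitwise operation and a bit count, which is why binary models map well onto FPGA logic; the accelerator's actual datapath may differ.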
