ARCVLGIVMay 9, 2022

Hardware-Robust In-RRAM-Computing for Object Detection

arXiv:2205.03996v114 citationsh-index: 37
Originality Incremental advance
AI Analysis

This work addresses hardware robustness issues in in-RRAM computing accelerators for object detection, representing an incremental improvement over prior methods limited to simpler tasks.

The paper tackled the challenge of device variation and nonideal effects in in-RRAM computing for object detection by proposing a joint hardware-software optimization strategy, achieving only a 3.85% mAP drop compared to catastrophic failure in naive designs.

In-memory computing is becoming a popular architecture for deep-learning hardware accelerators recently due to its highly parallel computing, low power, and low area cost. However, in-RRAM computing (IRC) suffered from large device variation and numerous nonideal effects in hardware. Although previous approaches including these effects in model training successfully improved variation tolerance, they only considered part of the nonideal effects and relatively simple classification tasks. This paper proposes a joint hardware and software optimization strategy to design a hardware-robust IRC macro for object detection. We lower the cell current by using a low word-line voltage to enable a complete convolution calculation in one operation that minimizes the impact of nonlinear addition. We also implement ternary weight mapping and remove batch normalization for better tolerance against device variation, sense amplifier variation, and IR drop problem. An extra bias is included to overcome the limitation of the current sensing range. The proposed approach has been successfully applied to a complex object detection task with only 3.85\% mAP drop, whereas a naive design suffers catastrophic failure under these nonideal effects.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes