LGARFeb 20, 2021

An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks

arXiv:2102.10423v2146 citations
AI Analysis

This work addresses the need for efficient hardware/software co-design for edge devices, though it is incremental as it focuses on evaluation and modeling of existing accelerators.

The paper evaluates three classes of Edge TPU accelerators across 423,000 convolutional neural networks, providing microarchitectural insights and developing learned models that enable millisecond-level performance estimation as an alternative to slow simulators.

Edge TPUs are a domain of accelerators for low-power, edge devices and are widely used in various Google products such as Coral and Pixel devices. In this paper, we first discuss the major microarchitectural details of Edge TPUs. Then, we extensively evaluate three classes of Edge TPUs, covering different computing ecosystems, that are either currently deployed in Google products or are the product pipeline, across 423K unique convolutional neural networks. Building upon this extensive study, we discuss critical and interpretable microarchitectural insights about the studied classes of Edge TPUs. Mainly, we discuss how Edge TPU accelerators perform across convolutional neural networks with different structures. Finally, we present our ongoing efforts in developing high-accuracy learned machine learning models to estimate the major performance metrics of accelerators such as latency and energy consumption. These learned models enable significantly faster (in the order of milliseconds) evaluations of accelerators as an alternative to time-consuming cycle-accurate simulators and establish an exciting opportunity for rapid hard-ware/software co-design.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes