AR LG NEJul 11, 2018

Medusa: A Scalable Interconnect for Many-Port DNN Accelerators and Wide DRAM Controller Interfaces

Yongming Shen, Tianchu Ji, Michael Ferdman, Peter Milder

arXiv:1807.04013v12.33 citations

Originality Incremental advance

AI Analysis

This addresses a resource efficiency problem for FPGA-based DNN accelerator designers, offering incremental improvements in interconnect design.

The paper tackled the mismatch between many narrow ports in DNN accelerators and wide buses in FPGA DRAM controllers, which consumes significant FPGA resources, by designing Medusa, an optimized interconnect that reduces LUT and FF use by 4.7x and 6.0x and improves frequency by 1.8x.

To cope with the increasing demand and computational intensity of deep neural networks (DNNs), industry and academia have turned to accelerator technologies. In particular, FPGAs have been shown to provide a good balance between performance and energy efficiency for accelerating DNNs. While significant research has focused on how to build efficient layer processors, the computational building blocks of DNN accelerators, relatively little attention has been paid to the on-chip interconnects that sit between the layer processors and the FPGA's DRAM controller. We observe a disparity between DNN accelerator interfaces, which tend to comprise many narrow ports, and FPGA DRAM controller interfaces, which tend to be wide buses. This mismatch causes traditional interconnects to consume significant FPGA resources. To address this problem, we designed Medusa: an optimized FPGA memory interconnect which transposes data in the interconnect fabric, tailoring the interconnect to the needs of DNN layer processors. Compared to a traditional FPGA interconnect, our design can reduce LUT and FF use by 4.7x and 6.0x, and improves frequency by 1.8x.

View on arXiv PDF

Similar