Graphcore C2 Card performance for image-based deep learning application: A Report
This provides performance data for hardware acceleration in image-based deep learning, but it is an incremental evaluation report.
The authors benchmarked Graphcore's IPU processors on deep vision models like ResNeXt for inference, reporting observed latency, throughput, and energy efficiency.
Recently, Graphcore has introduced an IPU Processor for accelerating machine learning applications. The architecture of the processor has been designed to achieve state of the art performance on current machine intelligence models for both training and inference. In this paper, we report on a benchmark in which we have evaluated the performance of IPU processors on deep neural networks for inference. We focus on deep vision models such as ResNeXt. We report the observed latency, throughput and energy efficiency.