ARApr 24

Exploiting pre-optimized kernels with polyhedral transformations for CGRA compilation

arXiv:2604.2229746.2h-index: 11
Predicted impact top 35% in AR · last 90 daysOriginality Incremental advance
AI Analysis

For CGRA compilers, this work addresses the bottleneck of mapping multi-dimensional kernels like mmul, which previously led to suboptimal performance.

The paper introduces a specialized matrix-matrix multiplication (mmul) kernel schedule for Coarse-Grained Reconfigurable Arrays (CGRAs) and a compilation methodology using polyhedral transformations to expose hidden mmul operations, achieving speedups up to 9.1x across benchmarks.

Modern computing workloads commonly involve matrix-matrix multiplication (mmul) as a core computing pattern. Coarse-Grained Reconfigurable Arrays (CGRAs) can flexibly and efficiently support it, since they combine operation-level reconfigurability and high energy efficiency. However, mapping computational kernels that include mmul with state-of-the-art compilation strategies often leads to suboptimal results, since its multi-dimensional structure hampers the uncovering of its inherent parallelism and, ultimately, runtime performance. Here, we take a different position: we introduce a specialized mmul CGRA kernel schedule, parametrizable across different CGRA sizes. Then, we describe a novel compilation methodology that adapts program representations to effectively leverage it, employing polyhedral transformations to analyze complex computational patterns and expose hidden mmul operations through loop reordering and splitting. The identified patterns are then substituted with optimized assembly, while the remaining program sections are compiled independently. CGRA configurations are then generated, encompassing pre-compiled and compiled parts. Our strategy maximizes resource utilization and ultimately run-time performance, even when mmul is not directly apparent in the source code. The experimental results show speedups up to 9.1x across different benchmarks that contain hidden mmuls and CGRA instances of various sizes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes