ARMay 15

ICP: Exploiting Instruction Correlation for Prefetching Irregular Memory Accesses

arXiv:2605.1564518.9
Predicted impact top 72% in AR · last 90 daysOriginality Highly original
AI Analysis

For computer architects, ICP provides a highly efficient solution for prefetching irregular memory accesses with minimal storage overhead.

ICP introduces a hardware prefetcher that uses instruction-level correlations instead of address correlations to handle irregular memory accesses, achieving 14.0% speedup over Triangel and 6.0% over DMP with only 2.1 KB storage.

Irregular memory accesses pose challenges for effective and efficient data prefetching. While temporal prefetchers have recently shown promise for irregular memory access patterns, their effectiveness fundamentally depends on temporal address recurrence and large metadata storage. When memory addresses exhibit weak or no recurrence, as in indirect memory accesses, temporal prefetchers achieve limited performance gains while incurring substantial storage overhead. This paper proposes Instruction-Correlation Prefetching (ICP), a new hardware prefetching mechanism that exploits instruction-level correlations rather than memory-address correlations to handle irregular memory accesses. ICP observes that although memory addresses may not repeat, the instructions generating them often recur with stable data-dependency relationships. By learning these persistent instruction correlations, ICP speculatively computes and prefetches future irregular accesses using the execution results of their correlated predecessors. Across irregular SPEC CPU and GAP benchmarks, ICP outperforms the state-of-the-art temporal prefetcher Triangel by 14.0% and the indirect prefetcher DMP by 6.0%, while requiring only 2.1 KB of hardware storage, over three orders of magnitude smaller than temporal prefetchers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes