PFDCLGMar 14, 2025

LLMPerf: GPU Performance Modeling meets Large Language Models

arXiv:2503.11244v13 citationsh-index: 5MASCOTS
Originality Incremental advance
AI Analysis

This addresses the problem of automating performance prediction for GPU programs, which is incremental as it applies existing LLMs to a new domain.

The paper tackles performance modeling for GPGPU programs by using Large Language Models (LLMs) as estimators, achieving a mean absolute percentage error of 24.25% on a generated validation set and 46.1% on public OpenCL programs.

Performance modeling, a pivotal domain in program cost analysis, currently relies on manually crafted models constrained by various program and hardware limitations, especially in the intricate landscape of GPGPU. Meanwhile, Large Language Models (LLMs) have demonstrated their effectiveness in addressing diverse programming challenges. Our work establishes a connection between LLMs and performance modeling, employing the LLM as a performance estimator. Through experimental exploration with carefully designed large-scale OpenCL datasets, we highlight the potential capability as well as the main difficulties of using LLMs in handling performance modeling tasks for OpenCL device source programs. As the first study for this line of work, our LLM-based performance model achieves a mean absolute percentage error of $24.25\%$ for a large-scale generated validation set. On a set of publicly available OpenCL programs, our model achieves a mean absolute percentage error of $46.1\%$.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes