AR · LG · SE · May 29

KernelCraft: Benchmarking for Agentic Close-to-Metal Kernel Generation on Emerging Hardware

Cambridge · Tsinghua
arXiv:2603.08721 · 32.23 citations · h-index: 21
Predicted impact: top 2% in AR · last 90 days
Originality: Highly original
AI Analysis

This work addresses the manual crafting of low-level kernels for emerging hardware, a time-consuming and error-prone process for hardware developers. Automating that step could accelerate the accelerator chip development cycle.

The paper introduces KernelCraft, a benchmark to evaluate LLM agents' ability to generate and optimize low-level kernels for new AI accelerators with novel instruction set architectures. The strongest agents were able to generate functionally correct kernels for unseen ISAs within a few refinement steps and produced optimized kernels that matched or outperformed compiler baselines.

New AI accelerators with novel instruction set architectures (ISAs) often require developers to manually craft low-level kernels, a time-consuming and error-prone process that does not scale across hardware targets. This delays emerging hardware platforms from reaching the market. While prior LLM-based code generation has shown promise in mature GPU ecosystems, it remains unclear whether agentic LLM systems can quickly produce valid and efficient kernels for emerging hardware with new ISAs. We present KernelCraft: the first benchmark for evaluating an LLM agent's ability to generate and optimize low-level kernels for customized accelerators through a function-calling, feedback-driven workflow. We evaluate agent performance across three emerging accelerators on more than 20 machine-learning tasks, each with five diverse task configurations. Across four leading reasoning models, the strongest agents generate functionally correct kernels for unseen ISAs within a few refinement steps and produce optimized kernels that match or outperform compiler baselines. These results demonstrate KernelCraft's potential to accelerate the accelerator chip development cycle. KernelCraft is available at https://kernelcraft-cam.github.io/.
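The abstract describes a feedback-driven workflow in which an agent proposes a kernel, receives feedback, and refines it over a few steps. The following is a minimal sketch of such a loop, not KernelCraft's actual implementation: the names `Feedback`, `refine_kernel`, `toy_propose`, and `toy_check` are hypothetical, the "ISA" is a made-up three-instruction toy, and the checker is a stand-in for whatever simulator or test harness a real benchmark would call.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class Feedback:
    """Result of checking one candidate kernel (hypothetical structure)."""
    correct: bool
    message: str


def refine_kernel(
    propose: Callable[[Optional[Feedback]], str],
    check: Callable[[str], Feedback],
    max_steps: int = 5,
) -> Tuple[Optional[str], int]:
    """Run a propose/check loop until a kernel passes or the budget runs out.

    Returns (kernel, steps_used) on success, or (None, max_steps) on failure.
    """
    feedback: Optional[Feedback] = None
    for step in range(1, max_steps + 1):
        kernel = propose(feedback)   # agent sees the previous feedback, if any
        feedback = check(kernel)     # e.g. a functional-correctness test
        if feedback.correct:
            return kernel, step
    return None, max_steps


# Toy stand-ins for an LLM agent and a correctness checker (assumptions).
def toy_propose(feedback: Optional[Feedback]) -> str:
    if feedback is None:
        return "LOAD r0\nADD r0, r1"           # first attempt forgets the store
    return "LOAD r0\nADD r0, r1\nSTORE r0"     # revised after feedback


def toy_check(kernel: str) -> Feedback:
    if "STORE" in kernel:
        return Feedback(True, "ok")
    return Feedback(False, "result is never written back")


kernel, steps = refine_kernel(toy_propose, toy_check)
print(steps)  # number of refinement steps used
```

In this sketch the step budget plays the role of the "few refinement steps" mentioned above. A real setup would replace `toy_propose` with a model call and `toy_check` with execution on the target accelerator or its simulator.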

