SE AI LG PF
Jun 7, 2024

LLM-Vectorizer: LLM-based Verified Loop Vectorizer

arXiv:2406.04693v1
34 citations
Originality: Incremental advance
AI Analysis

This paper addresses the challenge of auto-vectorization for high-performance computing applications. It offers a novel method to improve code performance, though the advance is incremental: it combines LLMs with existing verification tools.

The paper tackles the problem of compilers missing vectorization opportunities by using large language models (LLMs) to generate vectorized code from scalar programs, achieving run-time speedups of 1.1x to 9.4x over state-of-the-art compilers. It also checks correctness with Alive2, which verifies 38.2% of the vectorizations on the TSVC benchmark as correct.

Vectorization is a powerful optimization technique that significantly boosts the performance of high-performance computing applications operating on large data arrays. Despite decades of research on auto-vectorization, compilers frequently miss opportunities to vectorize code. On the other hand, writing vectorized code manually using compiler intrinsics is still a complex, error-prone task that demands deep knowledge of specific architectures and compilers. In this paper, we evaluate the potential of large language models (LLMs) to generate vectorized (Single Instruction, Multiple Data) code from scalar programs that process individual array elements. We propose a novel multi-agent approach, organized as a finite-state machine, that harnesses LLMs and test-based feedback to generate vectorized code. Our findings indicate that LLMs are capable of producing high-performance vectorized code, with run-time speedups ranging from 1.1x to 9.4x compared to state-of-the-art compilers such as the Intel Compiler, GCC, and Clang. To verify the correctness of vectorized code, we use Alive2, a leading bounded translation validation tool for LLVM IR. We describe a few domain-specific techniques to improve the scalability of Alive2 on our benchmark dataset. Overall, our approach is able to verify 38.2% of vectorizations as correct on the TSVC benchmark dataset.
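The generate-and-test loop described in the abstract can be illustrated with a small sketch. This is not the paper's implementation: the state names, `stub_generator`, `WIDTH`, and both candidate functions are hypothetical stand-ins, plain Python lists take the place of SIMD registers, and the LLM call is replaced by a stub that returns a corrected candidate once it receives test feedback. Formal verification with Alive2 is not modeled here; only the test-based feedback step is.

```python
import random
from enum import Enum, auto

WIDTH = 4  # hypothetical SIMD width (lanes per vector); an assumption for illustration


def scalar_add(a, b):
    """Reference scalar loop: one array element per iteration."""
    return [a[i] + b[i] for i in range(len(a))]


def candidate_no_tail(a, b):
    """Buggy chunked candidate: processes full WIDTH-sized chunks, drops the remainder."""
    out = [0] * len(a)
    full = len(a) - len(a) % WIDTH
    for i in range(0, full, WIDTH):
        out[i:i + WIDTH] = [x + y for x, y in zip(a[i:i + WIDTH], b[i:i + WIDTH])]
    return out


def candidate_with_tail(a, b):
    """Corrected candidate: chunked body plus a scalar loop for the leftover elements."""
    out = candidate_no_tail(a, b)
    for i in range(len(a) - len(a) % WIDTH, len(a)):
        out[i] = a[i] + b[i]
    return out


class State(Enum):
    GENERATE = auto()
    TEST = auto()
    DONE = auto()
    FAILED = auto()


def run_tests(candidate, seed=0):
    """Compare candidate and scalar outputs on random inputs; return feedback or None."""
    rng = random.Random(seed)
    for n in (1, 4, 7, 16, 33):  # includes lengths that are not multiples of WIDTH
        a = [rng.randint(1, 100) for _ in range(n)]
        b = [rng.randint(1, 100) for _ in range(n)]
        if candidate(a, b) != scalar_add(a, b):
            return f"output mismatch for array length {n}"
    return None


def vectorize(generator, max_attempts=3):
    """Drive the state machine: generate, test, feed failures back, stop on success."""
    state, feedback, candidate, attempts = State.GENERATE, None, None, 0
    while state not in (State.DONE, State.FAILED):
        if state is State.GENERATE:
            if attempts == max_attempts:
                state = State.FAILED
            else:
                candidate = generator(feedback)
                attempts += 1
                state = State.TEST
        elif state is State.TEST:
            feedback = run_tests(candidate)
            state = State.DONE if feedback is None else State.GENERATE
    return (candidate if state is State.DONE else None), attempts


def stub_generator(feedback):
    """Stand-in for an LLM call: returns the fixed candidate once it sees feedback."""
    return candidate_no_tail if feedback is None else candidate_with_tail


if __name__ == "__main__":
    accepted, attempts = vectorize(stub_generator)
    print(accepted.__name__, attempts)
```

Passing tests only shows agreement on the sampled inputs, which is why the paper pairs this kind of feedback with a separate verification step.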

