SE AI LG PFMar 25, 2025

VecTrans: Enhancing Compiler Auto-Vectorization through LLM-Assisted Code Transformations

Zhongchun Zheng, Kan Wu, Long Cheng, Lu Li, Rodrigo C. O. Rocha, Tianyi Liu, Wei Wei, Jianjiang Zeng, Xianwei Zhang, Yaoqing Gao

arXiv:2503.19449v39.84 citationsh-index: 2

Originality Highly original

AI Analysis

This addresses a fundamental compiler optimization bottleneck for developers and systems requiring high-performance computing, representing a significant but incremental improvement over existing methods.

The paper tackles the problem of compiler auto-vectorization struggling with intricate code patterns by introducing VecTrans, a framework that uses LLMs to refactor code for better vectorization. Experimental results show it achieves a geomean speedup of 1.77x and successfully vectorizes 24 of 51 previously unvectorizable test cases.

Auto-vectorization is a fundamental optimization for modern compilers to exploit SIMD parallelism. However, state-of-the-art approaches still struggle to handle intricate code patterns, often requiring manual hints or domain-specific expertise. Large language models (LLMs), with their ability to capture intricate patterns, provide a promising solution, yet their effective application in compiler optimizations remains an open challenge due to issues such as hallucinations and a lack of domain-specific reasoning. In this paper, we present VecTrans, a novel framework that leverages LLMs to enhance compiler-based code vectorization. VecTrans first employs compiler analysis to identify potentially vectorizable code regions. It then utilizes an LLM to refactor these regions into patterns that are more amenable to the compilers auto-vectorization. To ensure semantic correctness, VecTrans further integrates a hybrid validation mechanism at the intermediate representation (IR) level. With the above efforts, VecTrans combines the adaptability of LLMs with the precision of compiler vectorization, thereby effectively opening up the vectorization opportunities. experimental results show that among all TSVC functions unvectorizable by GCC, ICC, Clang, and BiSheng Compiler, VecTrans achieves an geomean speedup of 1.77x and successfully vectorizes 24 of 51 test cases. This marks a significant advancement over state-of-the-art approaches while maintaining a cost efficiency of $0.012 per function optimization for LLM API usage.

View on arXiv PDF

Similar