LG AI PLMay 9, 2024

Mirage: A Multi-Level Superoptimizer for Tensor Programs

Mengdi Wu, Xinhao Cheng, Shengyu Liu, Chunan Shi, Jianan Ji, Kit Ao, Praveen Velliengiri, Xupeng Miao, Oded Padon, Zhihao Jia

arXiv:2405.05751v349 citationsHas CodeOSDI

AI Analysis

This addresses the challenge of efficiently optimizing tensor programs for deep learning on GPUs, representing a novel method rather than an incremental improvement.

The paper tackles the problem of optimizing tensor programs for GPUs by introducing Mirage, a multi-level superoptimizer that uses a uniform representation called μGraphs to combine algebraic, schedule, and kernel transformations, resulting in performance improvements of up to 3.3× over existing methods for widely used DNNs.

We introduce Mirage, the first multi-level superoptimizer for tensor programs. A key idea in Mirage is $μ$Graphs, a uniform representation of tensor programs at the kernel, thread block, and thread levels of the GPU compute hierarchy. $μ$Graphs enable Mirage to discover novel optimizations that combine algebraic transformations, schedule transformations, and generation of new custom kernels. To navigate the large search space, Mirage introduces a pruning technique based on abstraction that significantly reduces the search space and provides a certain optimality guarantee. To ensure that the optimized $μ$Graph is equivalent to the input program, Mirage introduces a probabilistic equivalence verification procedure with strong theoretical guarantees. Our evaluation shows that Mirage outperforms existing approaches by up to 3.3$\times$ even for DNNs that are widely used and heavily optimized. Mirage is publicly available at https://github.com/mirage-project/mirage.

View on arXiv PDF Code

Similar