PLLGDec 18, 2024

Pattern Matching in AI Compilers and its Formalization (Extended Version)

arXiv:2412.13398v1h-index: 14
Originality Incremental advance
AI Analysis

This work addresses the challenge of building reliable and understandable optimization tools for machine learning compilers, though it is incremental as it formalizes an existing system.

The authors tackled the complexity of pattern matching in AI compilers by developing PyPM, a Python-based DSL for optimization passes, and formalized its pattern language with a core calculus, proving equivalence between declarative and algorithmic semantics using Coq.

PyPM is a Python-based domain specific language (DSL) for building rewrite-based optimization passes on machine learning computation graphs. Users define individual optimizations by writing (a) patterns that match subgraphs of a computation graph and (b) corresponding rules which replace a matched subgraph with an optimized kernel. PyPM is distinguished from the many other DSLs for defining rewriting passes by its complex and novel pattern language which borrows concepts from logic programming. PyPM patterns can be recursive, nondeterminstic, and can require checking domain-specific constraints such as the shapes of tensors. The PyPM implementation is thus similarly complicated, consisting of thousands of lines of C++ code. In this paper, we present our work on building PyPM, as well as formalizing and distilling and this complexity to an understandable mathematical core. We have developed a formal core calculus expressing the main operations of the PyPM pattern language. We define both a declarative semantics - describing which patterns match which terms - and an algorithmic semantics - an idealized version of the PyPM pattern interpreter - and prove their equivalence. The development is fully mechanized in the Coq proof assistant.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes