AR CL CR DS PL · Apr 20

Enabling AI ASICs for Zero Knowledge Proof

Jianming Tong, Jingtian Dang, Simon Langowski, Tianhao Huang, Asra Ali, Jeremy Kun, Jevin Jiang, Srinivas Devadas, Tushar Krishna

arXiv:2604.17808 · 52.31 citations · h-index: 7 · Has Code

Predicted impact: top 28% in AR · last 90 days · Originality: Highly original

AI Analysis

For ZKP prover efficiency, MORPH enables AI ASICs to accelerate costly ZKP kernels, addressing a critical bottleneck in practical ZKP deployment.

MORPH is the first framework to reformulate ZKP kernels (MSM and NTT) for AI ASICs such as TPUs. On TPUv6e8, it achieves up to 10x higher NTT throughput than GZKP and comparable MSM throughput.
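To illustrate why an NTT is a natural candidate for matrix hardware: a length-n NTT is a matrix-vector product with a matrix of powers of an n-th root of unity. The sketch below is plain Python and is not MORPH's implementation; the prime `P`, the length `N`, and every function name are assumptions chosen for illustration.

```python
# Toy parameters (assumed for illustration, not taken from the paper).
P = 12289   # small NTT-friendly prime: P - 1 = 2**12 * 3
N = 8       # transform length, a power of two dividing P - 1


def find_root_of_unity(n, p):
    """Return an element of multiplicative order exactly n modulo the prime p."""
    for g in range(2, p):
        w = pow(g, (p - 1) // n, p)      # w**n == 1 by Fermat's little theorem
        if pow(w, n // 2, p) == p - 1:   # w**(n/2) == -1 means the order is exactly n
            return w
    raise ValueError("no root of unity found")


def ntt_matrix(n, p):
    """Build the n x n matrix with entries w**(i*j) mod p; also return w."""
    w = find_root_of_unity(n, p)
    return [[pow(w, i * j, p) for j in range(n)] for i in range(n)], w


def matvec_mod(mat, vec, p):
    """Matrix-vector product; each dot product is accumulated fully, then reduced once."""
    return [sum(m * v for m, v in zip(row, vec)) % p for row in mat]


def intt(y, w, p):
    """Inverse transform: same matrix shape, using w**-1 and a final scaling by n**-1."""
    n = len(y)
    w_inv = pow(w, -1, p)   # modular inverse, Python 3.8+
    n_inv = pow(n, -1, p)
    mat = [[pow(w_inv, i * j, p) for j in range(n)] for i in range(n)]
    return [(v * n_inv) % p for v in matvec_mod(mat, y, p)]


if __name__ == "__main__":
    W, w = ntt_matrix(N, P)
    x = [1, 2, 3, 4, 5, 6, 7, 8]
    y = matvec_mod(W, x, P)
    print(intt(y, w, P) == x)
```

This dense form costs O(n^2) multiplications, so it only shows the matrix structure; multi-step NTTs such as the 3/5-step variants mentioned in the abstract factor a long transform into batches of shorter ones.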

Zero-knowledge proof (ZKP) provers remain costly because the computationally intensive multi-scalar multiplication (MSM) and number-theoretic transform (NTT) kernels dominate runtime. AI ASICs such as TPUs provide massive matrix throughput and state-of-the-art energy efficiency. We present MORPH, the first framework that reformulates ZKP kernels to match AI-ASIC execution. We introduce Big-T complexity, a hardware-aware complexity model that exposes heterogeneous bottlenecks and layout-transformation costs ignored by Big-O. Guided by this analysis, (1) at the arithmetic level, MORPH develops an MXU-centric extended-RNS lazy reduction that converts high-precision modular arithmetic into dense low-precision GEMMs, eliminating all carry chains, and (2) at the dataflow level, MORPH constructs a unified-sharding, layout-stationary TPU Pippenger MSM and an optimized 3/5-step NTT that avoid on-TPU shuffles, minimizing costly memory reorganization. Implemented in JAX, MORPH enables TPUv6e8 to achieve up to 10x higher throughput on NTT than GZKP and comparable throughput on MSM. Our code: https://github.com/EfficientPPML/MORPH.
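The idea behind carry-free arithmetic with lazy reduction can be sketched with a textbook residue number system (RNS). This is not MORPH's extended-RNS scheme: the basis `MODULI`, the toy field modulus `P`, and all function names are assumptions for illustration. Each value is stored as small residues, multiply-accumulates run independently per residue channel with no carries between channels, and the reduction modulo `P` happens once at the end.

```python
from math import prod

# Toy parameters (assumed for illustration, not taken from the paper).
P = 12289                       # small prime field modulus
MODULI = [251, 253, 255, 256]   # pairwise-coprime basis; every residue fits in 8 bits
M = prod(MODULI)                # dynamic range of the basis


def to_rns(x):
    """Represent x by its residues modulo each basis element."""
    return [x % m for m in MODULI]


def from_rns(residues):
    """Chinese-remainder reconstruction of the unique integer in [0, M)."""
    total = 0
    for r, m in zip(residues, MODULI):
        m_hat = M // m
        total += r * m_hat * pow(m_hat, -1, m)   # modular inverse, Python 3.8+
    return total % M


def rns_dot(a, b):
    """Dot product per channel: small-integer multiply-accumulate, no cross-channel carries."""
    a_res = [to_rns(x) for x in a]
    b_res = [to_rns(x) for x in b]
    out = []
    for k, m in enumerate(MODULI):
        acc = sum(ar[k] * br[k] for ar, br in zip(a_res, b_res))
        out.append(acc % m)
    return out


def dot_mod_p(a, b):
    """Exact dot product modulo P, with a single (lazy) reduction at the end."""
    # The unreduced result must stay below M, or reconstruction would wrap around.
    assert len(a) * (P - 1) ** 2 < M, "RNS range too small for this vector length"
    return from_rns(rns_dot(a, b)) % P


if __name__ == "__main__":
    a = [12288, 1, 2, 3, 4, 5, 6, 7]
    b = [12288, 9, 8, 7, 6, 5, 4, 3]
    print(dot_mod_p(a, b) == sum(x * y for x, y in zip(a, b)) % P)
```

Because each channel is an independent small-integer dot product, a batch of such products has the shape of a low-precision matrix multiplication, which is the kind of workload a matrix unit is built for. The range check in `dot_mod_p` is the price of laziness: the basis must be wide enough to hold the unreduced sum.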

