LGPLDec 16, 2021

A Static Analyzer for Detecting Tensor Shape Errors in Deep Neural Network Training Code

arXiv:2112.09037v111 citations
Originality Incremental advance
AI Analysis

This addresses a critical issue for deep learning practitioners by preventing costly training failures due to tensor mismatches, though it is an incremental improvement in static analysis tools.

The authors tackled the problem of tensor shape errors in PyTorch code by developing PyTea, a static analyzer that detects these errors automatically, successfully identifying them in real-world projects within seconds.

We present an automatic static analyzer PyTea that detects tensor-shape errors in PyTorch code. The tensor-shape error is critical in the deep neural net code; much of the training cost and intermediate results are to be lost once a tensor shape mismatch occurs in the midst of the training phase. Given the input PyTorch source, PyTea statically traces every possible execution path, collects tensor shape constraints required by the tensor operation sequence of the path, and decides if the constraints are unsatisfiable (hence a shape error can occur). PyTea's scalability and precision hinges on the characteristics of real-world PyTorch applications: the number of execution paths after PyTea's conservative pruning rarely explodes and loops are simple enough to be circumscribed by our symbolic abstraction. We tested PyTea against the projects in the official PyTorch repository and some tensor-error code questioned in the StackOverflow. PyTea successfully detects tensor shape errors in these codes, each within a few seconds.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes