LGLOOct 26, 2024

Revisiting Differential Verification: Equivalence Verification with Confidence

arXiv:2410.20207v24 citationsh-index: 3TACAS
Originality Incremental advance
AI Analysis

This work addresses the need for efficient equivalence verification in neural network pruning, which is crucial for safety-critical applications like high-energy physics, though it builds incrementally on existing differential verification and confidence-based methods.

The paper tackles the problem of verifying equivalence between original and pruned neural networks by proposing a novel abstract domain for differential verification, achieving median speedups over 300x compared to state-of-the-art verifiers on benchmarks including particle jet classification at CERN's LHC.

When validated neural networks (NNs) are pruned (and retrained) before deployment, it is desirable to prove that the new NN behaves equivalently to the (original) reference NN. To this end, our paper revisits the idea of differential verification which performs reasoning on differences between NNs: On the one hand, our paper proposes a novel abstract domain for differential verification admitting more efficient reasoning about equivalence. On the other hand, we investigate empirically and theoretically which equivalence properties are (not) efficiently solved using differential reasoning. Based on the gained insights, and following a recent line of work on confidence-based verification, we propose a novel equivalence property that is amenable to Differential Verification while providing guarantees for large parts of the input space instead of small-scale guarantees constructed w.r.t. predetermined input points. We implement our approach in a new tool called VeryDiff and perform an extensive evaluation on numerous old and new benchmark families, including new pruned NNs for particle jet classification in the context of CERN's LHC where we observe median speedups >300x over the State-of-the-Art verifier alpha,beta-CROWN.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes