LOAIPLMay 17, 2024

A Certified Proof Checker for Deep Neural Network Verification in Imandra

arXiv:2405.10611v211 citationsh-index: 17ITP
Originality Incremental advance
AI Analysis

This work addresses trust issues in DNN verification for safety-critical applications, though it is incremental as it builds on existing Marabou certificate checking.

The paper tackles the problem of trust in deep neural network (DNN) verifiers by implementing a certified proof checker for Marabou certificates in Imandra, resulting in full proof of correctness and stronger independent guarantees for verification results.

Recent advances in the verification of deep neural networks (DNNs) have opened the way for a broader usage of DNN verification technology in many application areas, including safety-critical ones. However, DNN verifiers are themselves complex programs that have been shown to be susceptible to errors and numerical imprecision; this, in turn, has raised the question of trust in DNN verifiers. One prominent attempt to address this issue is enhancing DNN verifiers with the capability of producing certificates of their results that are subject to independent algorithmic checking. While formulations of Marabou certificate checking already exist on top of the state-of-the-art DNN verifier Marabou, they are implemented in C++, and that code itself raises the question of trust (e.g., in the precision of floating point calculations or guarantees for implementation soundness). Here, we present an alternative implementation of the Marabou certificate checking in Imandra -- an industrial functional programming language and an interactive theorem prover (ITP) -- that allows us to obtain full proof of certificate correctness. The significance of the result is two-fold. Firstly, it gives stronger independent guarantees for Marabou proofs. Secondly, it opens the way for the wider adoption of DNN verifiers in interactive theorem proving in the same way as many ITPs already incorporate SMT solvers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes