AIJul 31, 2025

Solution-aware vs global ReLU selection: partial MILP strikes back for DNN verification

arXiv:2507.23197v1h-index: 17ATVA
Originality Incremental advance
AI Analysis

This work addresses the computational bottleneck in neural network verification for safety-critical applications, representing an incremental improvement over previous methods.

The paper tackles the problem of efficiently verifying deep neural networks by proposing a solution-aware ReLU scoring method to select critical ReLU variables for partial MILP calls, reducing binary variables by around 6 times and decreasing undecided instances by up to 40% to low levels (8-15%) while maintaining reasonable runtimes.

To handle complex instances, we revisit a divide-and-conquer approach to break down the complexity: instead of few complex BaB calls, we rely on many small {\em partial} MILP calls. The crucial step is to select very few but very important ReLUs to treat using (costly) binary variables. The previous attempts were suboptimal in that respect. To select these important ReLU variables, we propose a novel {\em solution-aware} ReLU scoring ({\sf SAS}), as well as adapt the BaB-SR and BaB-FSB branching functions as {\em global} ReLU scoring ({\sf GS}) functions. We compare them theoretically as well as experimentally, and {\sf SAS} is more efficient at selecting a set of variables to open using binary variables. Compared with previous attempts, SAS reduces the number of binary variables by around 6 times, while maintaining the same level of accuracy. Implemented in {\em Hybrid MILP}, calling first $α,β$-CROWN with a short time-out to solve easier instances, and then partial MILP, produces a very accurate yet efficient verifier, reducing by up to $40\%$ the number of undecided instances to low levels ($8-15\%$), while keeping a reasonable runtime ($46s-417s$ on average per instance), even for fairly large CNNs with 2 million parameters.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes