LGAICRFeb 5, 2024

Verifiable evaluations of machine learning models using zkSNARKs

MIT
arXiv:2402.02675v226 citationsh-index: 13
AI Analysis

This addresses the need for transparency in evaluating private models for end-users, though it is incremental as it builds on existing zkSNARK technology.

The paper tackles the problem of verifying benchmark results for closed-source machine learning models by introducing a method using zkSNARKs to create zero-knowledge proofs of model outputs, enabling verifiable attestations of performance or fairness metrics without revealing private weights.

In a world of increasing closed-source commercial machine learning models, model evaluations from developers must be taken at face value. These benchmark results-whether over task accuracy, bias evaluations, or safety checks-are traditionally impossible to verify by a model end-user without the costly or impossible process of re-performing the benchmark on black-box model outputs. This work presents a method of verifiable model evaluation using model inference through zkSNARKs. The resulting zero-knowledge computational proofs of model outputs over datasets can be packaged into verifiable evaluation attestations showing that models with fixed private weights achieve stated performance or fairness metrics over public inputs. We present a flexible proving system that enables verifiable attestations to be performed on any standard neural network model with varying compute requirements. For the first time, we demonstrate this across a sample of real-world models and highlight key challenges and design solutions. This presents a new transparency paradigm in the verifiable evaluation of private models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes