LGFeb 26, 2025

Verde: Verification via Refereed Delegation for Machine Learning Programs

Arasu Arun, Adam St. Arnaud, Alexey Titov, Brian Wilcox, Viktor Kolobaric, Marc Brinkmann, Oguzhan Ersoy, Ben Fielding, Joseph Bonneau

arXiv:2502.19405v115.78 citationsh-index: 9

Originality Incremental advance

AI Analysis

This addresses security and reliability issues for clients using third-party compute services, such as for LLM inference or training, though it is incremental in applying cryptographic concepts to ML.

The paper tackles the problem of ensuring correctness when delegating machine learning programs to untrusted compute providers by adapting refereed delegation, achieving strong guarantees for clients with practical overheads.

Machine learning programs, such as those performing inference, fine-tuning, and training of LLMs, are commonly delegated to untrusted compute providers. To provide correctness guarantees for the client, we propose adapting the cryptographic notion of refereed delegation to the machine learning setting. This approach enables a computationally limited client to delegate a program to multiple untrusted compute providers, with a guarantee of obtaining the correct result if at least one of them is honest. Refereed delegation of ML programs poses two technical hurdles: (1) an arbitration protocol to resolve disputes when compute providers disagree on the output, and (2) the ability to bitwise reproduce ML programs across different hardware setups, For (1), we design Verde, a dispute arbitration protocol that efficiently handles the large scale and graph-based computational model of modern ML programs. For (2), we build RepOps (Reproducible Operators), a library that eliminates hardware "non-determinism" by controlling the order of floating point operations performed on all hardware. Our implementation shows that refereed delegation achieves both strong guarantees for clients and practical overheads for compute providers.

View on arXiv PDF

Similar