CYAICRFeb 5, 2025

Enabling External Scrutiny of AI Systems with Privacy-Enhancing Technologies

arXiv:2502.05219v1
Originality Incremental advance
AI Analysis

This addresses the challenge for AI governance and policymakers in providing transparency while protecting security and privacy, though it is incremental as it builds on existing PETs.

The article tackles the problem of enabling external scrutiny of AI systems without compromising sensitive information by using privacy-enhancing technologies (PETs), showcasing real-world case studies like the Christchurch Call and UK AI Safety Institute.

This article describes how technical infrastructure developed by the nonprofit OpenMined enables external scrutiny of AI systems without compromising sensitive information. Independent external scrutiny of AI systems provides crucial transparency into AI development, so it should be an integral component of any approach to AI governance. In practice, external researchers have struggled to gain access to AI systems because of AI companies' legitimate concerns about security, privacy, and intellectual property. But now, privacy-enhancing technologies (PETs) have reached a new level of maturity: end-to-end technical infrastructure developed by OpenMined combines several PETs into various setups that enable privacy-preserving audits of AI systems. We showcase two case studies where this infrastructure has been deployed in real-world governance scenarios: "Understanding Social Media Recommendation Algorithms with the Christchurch Call" and "Evaluating Frontier Models with the UK AI Safety Institute." We describe types of scrutiny of AI systems that could be facilitated by current setups and OpenMined's proposed future setups. We conclude that these innovative approaches deserve further exploration and support from the AI governance community. Interested policymakers can focus on empowering researchers on a legal level.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes