LGCRMar 3, 2021

A Modified Drake Equation for Assessing Adversarial Risk to Machine Learning Models

arXiv:2103.02718v21 citations
AI Analysis

This work addresses the need for risk evaluation in the machine learning industry, though it appears incremental as it adapts an existing formalism from another field.

The authors tackled the problem of quantifying adversarial risk for deployed machine learning models by proposing a modified Drake Equation to estimate the number of potentially successful attacks, aiming to provide a semi-quantitative benchmark for risk assessment.

Machine learning models present a risk of adversarial attack when deployed in production. Quantifying the contributing factors and uncertainties using empirical measures could assist the industry with assessing the risk of downloading and deploying common model types. This work proposes modifying the traditional Drake Equation's formalism to estimate the number of potentially successful adversarial attacks on a deployed model. The Drake Equation is famously used for parameterizing uncertainties and it has been used in many research fields outside of its original intentions to estimate the number of radio-capable extra-terrestrial civilizations. While previous work has outlined methods for discovering vulnerabilities in public model architectures, the proposed equation seeks to provide a semi-quantitative benchmark for evaluating and estimating the potential risk factors for adversarial attacks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes