MLLGMar 16, 2022

Context-Aware Drift Detection

arXiv:2203.08644v222 citationsh-index: 8
Originality Incremental advance
AI Analysis

This addresses drift detection for ML systems in real-world scenarios where contextual factors vary, offering a more general approach than existing methods.

The paper tackles the problem of detecting distributional drift in machine learning systems when data batches are not i.i.d. due to changing contexts, by developing a framework based on conditional distributional treatment effects, and demonstrates its effectiveness in detecting drift in subpopulations and on ImageNet-scale vision problems.

When monitoring machine learning systems, two-sample tests of homogeneity form the foundation upon which existing approaches to drift detection build. They are used to test for evidence that the distribution underlying recent deployment data differs from that underlying the historical reference data. Often, however, various factors such as time-induced correlation mean that batches of recent deployment data are not expected to form an i.i.d. sample from the historical data distribution. Instead we may wish to test for differences in the distributions conditional on \textit{context} that is permitted to change. To facilitate this we borrow machinery from the causal inference domain to develop a more general drift detection framework built upon a foundation of two-sample tests for conditional distributional treatment effects. We recommend a particular instantiation of the framework based on maximum conditional mean discrepancies. We then provide an empirical study demonstrating its effectiveness for various drift detection problems of practical interest, such as detecting drift in the distributions underlying subpopulations of data in a manner that is insensitive to their respective prevalences. The study additionally demonstrates applicability to ImageNet-scale vision problems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes