CGLGDec 15, 2018

Mapper Comparison with Wasserstein Metrics

arXiv:1812.06232v1
Originality Incremental advance
AI Analysis

This addresses the understudied issue of model drift for Mapper graphs in Topological Data Analysis, which is incremental as it adapts existing techniques to a specific domain.

The paper tackles the problem of evaluating model drift for Mapper graphs in unsupervised learning by developing the Network Augmented Wasserstein Distance, an optimal transport-based metric, and demonstrates its value by transforming model drift analysis into an anomaly detection problem over dynamic graphs.

The challenge of describing model drift is an open question in unsupervised learning. It can be difficult to evaluate at what point an unsupervised model has deviated beyond what would be expected from a different sample from the same population. This is particularly true for models without a probabilistic interpretation. One such family of techniques, Topological Data Analysis, and the Mapper algorithm in particular, has found use in a variety of fields, but describing model drift for Mapper graphs is an understudied area as even existing techniques for measuring distances between related constructs like graphs or simplicial complexes fail to account for the fact that Mapper graphs represent a combination of topological, metric, and density information. In this paper, we develop an optimal transport based metric which we call the Network Augmented Wasserstein Distance for evaluating distances between Mapper graphs and demonstrate the value of the metric for model drift analysis by using the metric to transform the model drift problem into an anomaly detection problem over dynamic graphs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes