CGApr 16

Towards an Optimal Bound for the Interleaving Distance on Mapper Graphs

arXiv:2504.038654.72 citationsh-index: 2
Predicted impact top 64% in CG · last 90 daysOriginality Incremental advance
AI Analysis

This work provides a computationally tractable bound for comparing mapper graphs, addressing a known NP-hard problem, but the approach is incremental as it builds on existing loss functions and ILP techniques.

The paper introduces the first framework for bounding the interleaving distance on mapper graphs, using integer linear programming to determine existence of interleavings and minimize assignment loss. The method is evaluated on small examples and benchmark datasets, showing utility for classification tasks.

Mapper graphs are widely used tools in topological data analysis and visualization. They can be understood as discrete approximations of Reeb graphs, providing insight into the shape and connectivity of complex data. Given a high-dimensional point cloud together with a real-valued function defined on it, a mapper graph summarizes the induced topological structure: each node represents a local neighborhood, and edges connect nodes whose corresponding neighborhoods overlap. Our focus is the interleaving distance for mapper graphs, arising as a discretized analogue of the interleaving distance for Reeb graphs-a quantity known to be NP-hard to compute. This distance measures how similar two mapper graphs are by quantifying how much they must be ``stretched'' to be made comparable. Recent work introduced a loss function that gives an upper bound on this distance. The loss evaluates how far a given collection of maps, called an assignment, is from being a true interleaving. Importantly, it is computationally tractable, offering a practical way to bound the distance, however the quality of the bound is dependent on the choice of assignment. In this paper, we develop the first framework for bounding the interleaving distance on mapper graphs. We present the bound in two ways: first, by formulating an integer linear program (ILP) that determines whether an $n$-interleaving exists for a given $n$; and second, by constructing an ILP that identifies an assignment with minimal loss for that $n$. We also evaluate the method on small examples where the interleaving distance is known, and on benchmark and simulated datasets, demonstrating the utility of the approach for classification tasks based on mapper graphs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes