AT LGJul 31, 2019

Topological Machine Learning with Persistence Indicator Functions

Bastian Rieck, Filip Sadlo, Heike Leitte

arXiv:1907.13496v112.647 citations

Originality Incremental advance

AI Analysis

This work addresses the problem of efficiently applying topological methods in machine learning for researchers and practitioners dealing with complex structured data, representing an incremental advancement by providing a new computational tool.

The paper tackles the challenge of integrating topological data analysis with kernel-based machine learning by introducing persistence indicator functions (PIFs), which summarize persistence diagrams and enable linear-time computation and kernel-based similarity measures, demonstrating applications in confidence set estimation and classification of complex structured data.

Techniques from computational topology, in particular persistent homology, are becoming increasingly relevant for data analysis. Their stable metrics permit the use of many distance-based data analysis methods, such as multidimensional scaling, while providing a firm theoretical ground. Many modern machine learning algorithms, however, are based on kernels. This paper presents persistence indicator functions (PIFs), which summarize persistence diagrams, i.e., feature descriptors in topological data analysis. PIFs can be calculated and compared in linear time and have many beneficial properties, such as the availability of a kernel-based similarity measure. We demonstrate their usage in common data analysis scenarios, such as confidence set estimation and classification of complex structured data.

View on arXiv PDF

Similar