LG MLMar 7, 2025

Interpretable Visualizations of Data Spaces for Classification Problems

Christian Jorgensen, Arthur Y. Lin, Rhushil Vasavada, Rose K. Cersonsky

arXiv:2503.05861v33 citationsh-index: 2Machine Learning: Science and Technology

Originality Incremental advance

AI Analysis

It addresses the challenge of interpreting classification models for researchers and practitioners, though it is incremental as it builds on existing visualization methods.

The paper tackles the problem of visualizing decision boundaries in classification models by proposing a hybrid supervised-unsupervised technique, resulting in a human-interpretable map demonstrated on chemical neurotoxicity data.

How do classification models "see" our data? Based on their success in delineating behaviors, there must be some lens through which it is easy to see the boundary between classes; however, our current set of visualization techniques makes this prospect difficult. In this work, we propose a hybrid supervised-unsupervised technique distinctly suited to visualizing the decision boundaries determined by classification problems. This method provides a human-interpretable map that can be analyzed qualitatively and quantitatively, which we demonstrate through visualizing and interpreting a decision boundary for chemical neurotoxicity. While we discuss this method in the context of chemistry-driven problems, its application can be generalized across subfields for "unboxing" the operations of machine-learning classification models.

View on arXiv PDF

Similar