HCLGJul 25, 2024

iNNspector: Visual, Interactive Deep Model Debugging

arXiv:2407.17998v12 citationsh-index: 5
Originality Synthesis-oriented
AI Analysis

This addresses the challenge for deep learning developers and data analysts in systematically analyzing complex model data to streamline debugging, though it is incremental as it builds on existing visualization and debugging concepts.

The paper tackles the problem of debugging deep learning models by proposing iNNspector, a system that provides interactive visualizations of model data across multiple abstraction levels, which was evaluated through real-world use-cases and a user study showing effectiveness and usability.

Deep learning model design, development, and debugging is a process driven by best practices, guidelines, trial-and-error, and the personal experiences of model developers. At multiple stages of this process, performance and internal model data can be logged and made available. However, due to the sheer complexity and scale of this data and process, model developers often resort to evaluating their model performance based on abstract metrics like accuracy and loss. We argue that a structured analysis of data along the model's architecture and at multiple abstraction levels can considerably streamline the debugging process. Such a systematic analysis can further connect the developer's design choices to their impacts on the model behavior, facilitating the understanding, diagnosis, and refinement of deep learning models. Hence, in this paper, we (1) contribute a conceptual framework structuring the data space of deep learning experiments. Our framework, grounded in literature analysis and requirements interviews, captures design dimensions and proposes mechanisms to make this data explorable and tractable. To operationalize our framework in a ready-to-use application, we (2) present the iNNspector system. iNNspector enables tracking of deep learning experiments and provides interactive visualizations of the data on all levels of abstraction from multiple models to individual neurons. Finally, we (3) evaluate our approach with three real-world use-cases and a user study with deep learning developers and data analysts, proving its effectiveness and usability.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes