CVAIHCMay 6, 2025

VISLIX: An XAI Framework for Validating Vision Models with Slice Discovery and Analysis

arXiv:2505.03132v17 citationsh-index: 6Computer graphics forum (Print)
Originality Incremental advance
AI Analysis

This addresses the problem of rigorous model evaluation for domain experts in safety-critical domains like autonomous driving, though it is incremental as it builds on existing data slicing methods.

The authors tackled the challenge of validating vision models by introducing VISLIX, a visual analytics framework that uses foundation models to analyze data slices without requiring metadata, automatically generating natural language insights and enabling interactive hypothesis testing, with evaluation showing effectiveness in object detection validation.

Real-world machine learning models require rigorous evaluation before deployment, especially in safety-critical domains like autonomous driving and surveillance. The evaluation of machine learning models often focuses on data slices, which are subsets of the data that share a set of characteristics. Data slice finding automatically identifies conditions or data subgroups where models underperform, aiding developers in mitigating performance issues. Despite its popularity and effectiveness, data slicing for vision model validation faces several challenges. First, data slicing often needs additional image metadata or visual concepts, and falls short in certain computer vision tasks, such as object detection. Second, understanding data slices is a labor-intensive and mentally demanding process that heavily relies on the expert's domain knowledge. Third, data slicing lacks a human-in-the-loop solution that allows experts to form hypothesis and test them interactively. To overcome these limitations and better support the machine learning operations lifecycle, we introduce VISLIX, a novel visual analytics framework that employs state-of-the-art foundation models to help domain experts analyze slices in computer vision models. Our approach does not require image metadata or visual concepts, automatically generates natural language insights, and allows users to test data slice hypothesis interactively. We evaluate VISLIX with an expert study and three use cases, that demonstrate the effectiveness of our tool in providing comprehensive insights for validating object detection models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes