CVNov 19, 2025

FunnyNodules: A Customizable Medical Dataset Tailored for Evaluating Explainable AI

arXiv:2511.15481v21 citationsh-index: 4
Originality Synthesis-oriented
AI Analysis

This provides a customizable tool for developing and benchmarking explainable AI methods in medical image analysis, though it is incremental as it focuses on synthetic data generation rather than real-world applications.

The authors tackled the scarcity of medical image datasets with reasoning annotations for explainable AI by introducing FunnyNodules, a fully parameterized synthetic dataset that generates lung nodule-like shapes with controllable attributes, enabling systematic evaluation of attribute-based reasoning in models.

Densely annotated medical image datasets that capture not only diagnostic labels but also the underlying reasoning behind these diagnoses are scarce. Such reasoning-related annotations are essential for developing and evaluating explainable AI (xAI) models that reason similarly to radiologists: making correct predictions for the right reasons. To address this gap, we introduce FunnyNodules, a fully parameterized synthetic dataset designed for systematic analysis of attribute-based reasoning in medical AI models. The dataset generates abstract, lung nodule-like shapes with controllable visual attributes such as roundness, margin sharpness, and spiculation. Target class is derived from a predefined attribute combination, allowing full control over the decision rule that links attributes to the diagnostic class. We demonstrate how FunnyNodules can be used in model-agnostic evaluations to assess whether models learn correct attribute-target relations, to interpret over- or underperformance in attribute prediction, and to analyze attention alignment with attribute-specific regions of interest. The framework is fully customizable, supporting variations in dataset complexity, target definitions, class balance, and beyond. With complete ground truth information, FunnyNodules provides a versatile foundation for developing, benchmarking, and conducting in-depth analyses of explainable AI methods in medical image analysis.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes