LGAIApr 26, 2021

Weakly Supervised Multi-task Learning for Concept-based Explainability

arXiv:2104.12459v112 citations
AI Analysis

This work addresses the need for interpretable AI in critical domains like fraud detection and medical diagnosis, offering a practical solution for domain experts, though it is incremental in leveraging multi-task learning with noisy labels.

The paper tackles the problem of generating faithful concept-based explanations for ML-aided decision-making tasks like fraud detection, where domain experts prefer high-level explanations over low-level features, by proposing a weakly supervised multi-task learning approach that uses expert rules to generate noisy concept labels and combines them with few golden labels, resulting in improvements of 9.26% in explainability and 417.8% in decision task performance.

In ML-aided decision-making tasks, such as fraud detection or medical diagnosis, the human-in-the-loop, usually a domain-expert without technical ML knowledge, prefers high-level concept-based explanations instead of low-level explanations based on model features. To obtain faithful concept-based explanations, we leverage multi-task learning to train a neural network that jointly learns to predict a decision task based on the predictions of a precedent explainability task (i.e., multi-label concepts). There are two main challenges to overcome: concept label scarcity and the joint learning. To address both, we propose to: i) use expert rules to generate a large dataset of noisy concept labels, and ii) apply two distinct multi-task learning strategies combining noisy and golden labels. We compare these strategies with a fully supervised approach in a real-world fraud detection application with few golden labels available for the explainability task. With improvements of 9.26% and of 417.8% at the explainability and decision tasks, respectively, our results show it is possible to improve performance at both tasks by combining labels of heterogeneous quality.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes