ML · LG · Dec 19, 2021

RELAX: Representation Learning Explainability

arXiv:2112.10161v2 · 19 citations
Originality: Highly original
AI Analysis

This work addresses the problem of understanding what influences learned representations, a question relevant to machine learning researchers and practitioners, and is a novel contribution to explainability.

The paper tackles the lack of explainability in representation learning by proposing RELAX, the first attribution-based method for explaining learned representations, which outperforms the gradient-based baseline and models the uncertainty in its explanations.

Despite the significant improvements that representation learning via self-supervision has led to when learning from unlabeled data, no methods exist that explain what influences the learned representation. We address this need through our proposed approach, RELAX, which is the first approach for attribution-based explanations of representations. Our approach can also model the uncertainty in its explanations, which is essential to produce trustworthy explanations. RELAX explains representations by measuring similarities in the representation space between an input and masked out versions of itself, providing intuitive explanations and significantly outperforming the gradient-based baseline. We provide theoretical interpretations of RELAX and conduct a novel analysis of feature extractors trained using supervised and unsupervised learning, providing insights into different learning strategies. Finally, we illustrate the usability of RELAX in multi-view clustering and highlight that incorporating uncertainty can be essential for providing low-complexity explanations, taking a crucial step towards explaining representations.
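The masking idea in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes cosine similarity as the similarity measure, independent random binary masks, a mask-weighted average of similarities as the importance score, and a mask-weighted squared deviation as the uncertainty. The names `relax_sketch`, `toy_encoder`, `num_masks`, and `keep_prob` are hypothetical, and the estimator used in the paper may differ in its details.

```python
import numpy as np


def cosine_similarity(a, b):
    """Cosine similarity between two 1-D vectors (small epsilon avoids 0/0)."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))


def relax_sketch(x, encoder, num_masks=500, keep_prob=0.5, seed=0):
    """Score each input element by how similar the representation stays
    when that element is kept by a random mask (assumed estimator)."""
    rng = np.random.default_rng(seed)
    h = encoder(x)  # representation of the unmasked input

    # Random binary masks with the same shape as the input (assumption).
    masks = (rng.random((num_masks,) + x.shape) < keep_prob).astype(float)

    # Similarity between the original and each masked representation.
    sims = np.array([cosine_similarity(h, encoder(x * m)) for m in masks])
    weights = sims.reshape((-1,) + (1,) * x.ndim)  # broadcast over input dims

    # Importance: average similarity, counted only where the element was kept.
    importance = (weights * masks).mean(axis=0)
    # Uncertainty: mask-weighted squared deviation of similarity from importance.
    uncertainty = (((weights - importance) ** 2) * masks).mean(axis=0)
    return importance, uncertainty


def toy_encoder(x):
    """Hypothetical stand-in for a learned feature extractor: its output
    depends only on the element at position (1, 2)."""
    v = x[1, 2]
    return np.array([v, 1.0 - v])


x = np.ones((4, 4))
importance, uncertainty = relax_sketch(x, toy_encoder)
top = tuple(int(i) for i in np.unravel_index(importance.argmax(), importance.shape))
print(top)
```

Because the toy encoder looks at a single element, masking that element changes the representation completely while masking any other element changes nothing, so the importance map should peak at that position. A real use would replace `toy_encoder` with a trained feature extractor and `x` with an image.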

Code Implementations: 1 repo