CLAIOct 25, 2023

This Reads Like That: Deep Learning for Interpretable Natural Language Processing

arXiv:2310.17010v1131 citationsh-index: 5
Originality Incremental advance
AI Analysis

This work addresses the need for interpretable AI in NLP, offering incremental improvements over existing prototype-based approaches for tasks like text classification.

The paper tackles the problem of interpretable natural language processing by extending prototype learning to NLP, introducing a weighted similarity measure and a post-hoc explainability mechanism, resulting in improved predictive performance on AG News and RT Polarity datasets and enhanced faithfulness of explanations compared to prior methods.

Prototype learning, a popular machine learning method designed for inherently interpretable decisions, leverages similarities to learned prototypes for classifying new data. While it is mainly applied in computer vision, in this work, we build upon prior research and further explore the extension of prototypical networks to natural language processing. We introduce a learned weighted similarity measure that enhances the similarity computation by focusing on informative dimensions of pre-trained sentence embeddings. Additionally, we propose a post-hoc explainability mechanism that extracts prediction-relevant words from both the prototype and input sentences. Finally, we empirically demonstrate that our proposed method not only improves predictive performance on the AG News and RT Polarity datasets over a previous prototype-based approach, but also improves the faithfulness of explanations compared to rationale-based recurrent convolutions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes