CL AIOct 25, 2023

This Reads Like That: Deep Learning for Interpretable Natural Language Processing

Claudio Fanconi, Moritz Vandenhirtz, Severin Husmann, Julia E. Vogt

arXiv:2310.17010v121.2131 citationsh-index: 5Has Code

Originality Incremental advance

AI Analysis

This work addresses the need for interpretable AI in NLP, offering incremental improvements over existing prototype-based approaches for tasks like text classification.

The paper tackles the problem of interpretable natural language processing by extending prototype learning to NLP, introducing a weighted similarity measure and a post-hoc explainability mechanism, resulting in improved predictive performance on AG News and RT Polarity datasets and enhanced faithfulness of explanations compared to prior methods.

Prototype learning, a popular machine learning method designed for inherently interpretable decisions, leverages similarities to learned prototypes for classifying new data. While it is mainly applied in computer vision, in this work, we build upon prior research and further explore the extension of prototypical networks to natural language processing. We introduce a learned weighted similarity measure that enhances the similarity computation by focusing on informative dimensions of pre-trained sentence embeddings. Additionally, we propose a post-hoc explainability mechanism that extracts prediction-relevant words from both the prototype and input sentences. Finally, we empirically demonstrate that our proposed method not only improves predictive performance on the AG News and RT Polarity datasets over a previous prototype-based approach, but also improves the faithfulness of explanations compared to rationale-based recurrent convolutions.

View on arXiv PDF Code

Similar