CLLGSep 9, 2021

SPECTRA: Sparse Structured Text Rationalization

arXiv:2109.04552v1671 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of improving rationale extraction for NLP tasks like classification and inference, offering a more stable and tunable alternative to stochastic methods, though it is incremental in nature.

The paper tackles the challenge of deterministic extraction of structured rationales in selective rationalization by introducing a unified framework using constrained inference on a factor graph, which outperforms previous methods in performance and plausibility of rationales.

Selective rationalization aims to produce decisions along with rationales (e.g., text highlights or word alignments between two sentences). Commonly, rationales are modeled as stochastic binary masks, requiring sampling-based gradient estimators, which complicates training and requires careful hyperparameter tuning. Sparse attention mechanisms are a deterministic alternative, but they lack a way to regularize the rationale extraction (e.g., to control the sparsity of a text highlight or the number of alignments). In this paper, we present a unified framework for deterministic extraction of structured explanations via constrained inference on a factor graph, forming a differentiable layer. Our approach greatly eases training and rationale regularization, generally outperforming previous work on what comes to performance and plausibility of the extracted rationales. We further provide a comparative study of stochastic and deterministic methods for rationale extraction for classification and natural language inference tasks, jointly assessing their predictive power, quality of the explanations, and model variability.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes