IRJun 29, 2018

Posthoc Interpretability of Learning to Rank Models using Secondary Training Data

arXiv:1806.11330v147 citations
Originality Synthesis-oriented
AI Analysis

This addresses the need for algorithmic transparency in ranking systems, which is incremental as it applies existing interpretability techniques to a specific domain.

The paper tackled the problem of understanding decisions made by learning-to-rank models by proposing a post-hoc, model-agnostic interpretability method using secondary training data and tree-based models, reporting results on datasets with 30k queries that show faithful interpretable rankers can be learned in certain settings.

Predictive models are omnipresent in automated and assisted decision making scenarios. But for the most part they are used as black boxes which output a prediction without understanding partially or even completely how different features influence the model prediction avoiding algorithmic transparency. Rankings are ordering over items encoding implicit comparisons typically learned using a family of features using learning-to-rank models. In this paper we focus on how best we can understand the decisions made by a ranker in a post-hoc model agnostic manner. We operate on the notion of interpretability based on explainability of rankings over an interpretable feature space. Furthermore we train a tree based model (inherently interpretable) using labels from the ranker, called secondary training data to provide explanations. Consequently, we attempt to study how well does a subset of features, potentially interpretable, explain the full model under different training sizes and algorithms. We do experiments on the learning to rank datasets with 30k queries and report results that serve show in certain settings we can learn a faithful interpretable ranker.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes