IRJun 29, 2018

Posthoc Interpretability of Learning to Rank Models using Secondary Training Data

arXiv:1806.11330v118.147 citations

Originality Synthesis-oriented

AI Analysis

This addresses the need for algorithmic transparency in ranking systems, which is incremental as it applies existing interpretability techniques to a specific domain.

The paper tackled the problem of understanding decisions made by learning-to-rank models by proposing a post-hoc, model-agnostic interpretability method using secondary training data and tree-based models, reporting results on datasets with 30k queries that show faithful interpretable rankers can be learned in certain settings.

Predictive models are omnipresent in automated and assisted decision making scenarios. But for the most part they are used as black boxes which output a prediction without understanding partially or even completely how different features influence the model prediction avoiding algorithmic transparency. Rankings are ordering over items encoding implicit comparisons typically learned using a family of features using learning-to-rank models. In this paper we focus on how best we can understand the decisions made by a ranker in a post-hoc model agnostic manner. We operate on the notion of interpretability based on explainability of rankings over an interpretable feature space. Furthermore we train a tree based model (inherently interpretable) using labels from the ranker, called secondary training data to provide explanations. Consequently, we attempt to study how well does a subset of features, potentially interpretable, explain the full model under different training sizes and algorithms. We do experiments on the learning to rank datasets with 30k queries and report results that serve show in certain settings we can learn a faithful interpretable ranker.

View on arXiv PDF

Similar