MLLGSTDec 4, 2017

Statistical Inference for Incomplete Ranking Data: The Case of Rank-Dependent Coarsening

arXiv:1712.01158v117 citations
Originality Incremental advance
AI Analysis

This work addresses statistical inference for ranking data in scenarios with incomplete samples, which is a problem for researchers and practitioners in fields like recommendation systems or social choice, but it is incremental as it builds on existing rank aggregation methods by adding a coarsening model.

The paper tackles the problem of rank aggregation from incomplete ranking data by modeling the coarsening process that generates incomplete rankings from full ones, specifically using rank-dependent coarsening. It studies the performance of rank aggregation methods, including predictive accuracy and consistency in recovering a target ranking despite bias from unknown coarsening, with results based on a Plackett-Luce distribution and pairwise preferences.

We consider the problem of statistical inference for ranking data, specifically rank aggregation, under the assumption that samples are incomplete in the sense of not comprising all choice alternatives. In contrast to most existing methods, we explicitly model the process of turning a full ranking into an incomplete one, which we call the coarsening process. To this end, we propose the concept of rank-dependent coarsening, which assumes that incomplete rankings are produced by projecting a full ranking to a random subset of ranks. For a concrete instantiation of our model, in which full rankings are drawn from a Plackett-Luce distribution and observations take the form of pairwise preferences, we study the performance of various rank aggregation methods. In addition to predictive accuracy in the finite sample setting, we address the theoretical question of consistency, by which we mean the ability to recover a target ranking when the sample size goes to infinity, despite a potential bias in the observations caused by the (unknown) coarsening.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes