LGHCMay 19, 2021

DumbleDR: Predicting User Preferences of Dimensionality Reduction Projection Quality

arXiv:2105.09275v1
Originality Incremental advance
AI Analysis

This addresses the challenge for researchers and analysts in choosing projections, though it is incremental as it builds on existing qualitative evaluation practices.

The paper tackles the problem of evaluating 2D dimensionality reduction projections by proposing a quantitative method based on human perception, using a study to derive quality metrics that rank projections and quantify subjectivity.

A plethora of dimensionality reduction techniques have emerged over the past decades, leaving researchers and analysts with a wide variety of choices for reducing their data, all the more so given some techniques come with additional parametrization (e.g. t-SNE, UMAP, etc.). Recent studies are showing that people often use dimensionality reduction as a black-box regardless of the specific properties the method itself preserves. Hence, evaluating and comparing 2D projections is usually qualitatively decided, by setting projections side-by-side and letting human judgment decide which projection is the best. In this work, we propose a quantitative way of evaluating projections, that nonetheless places human perception at the center. We run a comparative study, where we ask people to select 'good' and 'misleading' views between scatterplots of low-level projections of image datasets, simulating the way people usually select projections. We use the study data as labels for a set of quality metrics whose purpose is to discover and quantify what exactly people are looking for when deciding between projections. With this proxy for human judgments, we use it to rank projections on new datasets, explain why they are relevant, and quantify the degree of subjectivity in projections selected.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes