MLLGMEMar 15, 2022

Comparing Two Samples Through Stochastic Dominance: A Graphical Approach

arXiv:2203.07889v45 citationsh-index: 17
Originality Incremental advance
AI Analysis

This provides a more nuanced comparison tool for researchers in fields like optimization and reinforcement learning, though it is incremental as it builds on existing stochastic dominance concepts.

The paper tackles the problem of comparing two samples from non-deterministic measurements, such as stochastic algorithms, by proposing a graphical approach based on stochastic dominance. It introduces a dominance measure and graphical method that reveal additional insights missed by traditional methods like expected values or statistical tests, as demonstrated in a re-evaluation of an existing study.

Non-deterministic measurements are common in real-world scenarios: the performance of a stochastic optimization algorithm or the total reward of a reinforcement learning agent in a chaotic environment are just two examples in which unpredictable outcomes are common. These measures can be modeled as random variables and compared among each other via their expected values or more sophisticated tools such as null hypothesis statistical tests. In this paper, we propose an alternative framework to visually compare two samples according to their estimated cumulative distribution functions. First, we introduce a dominance measure for two random variables that quantifies the proportion in which the cumulative distribution function of one of the random variables stochastically dominates the other one. Then, we present a graphical method that decomposes in quantiles i) the proposed dominance measure and ii) the probability that one of the random variables takes lower values than the other. With illustrative purposes, we re-evaluate the experimentation of an already published work with the proposed methodology and we show that additional conclusions (missed by the rest of the methods) can be inferred. Additionally, the software package RVCompare was created as a convenient way of applying and experimenting with the proposed framework.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes