LGSTMLJun 18, 2012

Copula-based Kernel Dependency Measures

arXiv:1206.4682v192 citations
Originality Incremental advance
AI Analysis

This provides a robust dependence measure for applications like feature selection and distribution embedding, though it is an incremental extension of existing methods.

The paper tackles the problem of measuring dependence between random variables by extending Maximum Mean Discrepancy to copulas, resulting in a measure that is invariant to marginal transformations, consistent, robust to outliers, and uses only rank statistics.

The paper presents a new copula based method for measuring dependence between random variables. Our approach extends the Maximum Mean Discrepancy to the copula of the joint distribution. We prove that this approach has several advantageous properties. Similarly to Shannon mutual information, the proposed dependence measure is invariant to any strictly increasing transformation of the marginal variables. This is important in many applications, for example in feature selection. The estimator is consistent, robust to outliers, and uses rank statistics only. We derive upper bounds on the convergence rate and propose independence tests too. We illustrate the theoretical contributions through a series of experiments in feature selection and low-dimensional embedding of distributions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes