The Generalised Kernel Covariance Measure

Luca Bergen, Dino Sejdinovic, Vanessa Didelez

arXiv:2604.0372126.0h-index: 5

Predicted impact top 63% in ML · last 90 daysOriginality Incremental advance

AI Analysis

For practitioners needing reliable conditional independence testing, GKCM offers a flexible, computationally efficient alternative that maintains theoretical guarantees.

The paper proposes the Generalised Kernel Covariance Measure (GKCM), a kernel-based conditional independence test that works with any regression model, overcoming the computational and calibration issues of existing kernel ridge regression methods. In simulations, GKCM with tree-based regressors achieves better type I error control and competitive or superior power compared to state-of-the-art tests.

We consider the problem of conditional independence (CI) testing and adopt a kernel-based approach. Kernel-based CI tests embed variables in reproducing kernel Hilbert spaces, regress their embeddings on the conditioning variables, and test the resulting residuals for marginal independence. This approach yields tests that are sensitive to a broad range of conditional dependencies. Existing methods, however, rely heavily on kernel ridge regression, which is computationally expensive when properly tuned and yields poorly calibrated tests when left untuned, which limits their practical usefulness. We propose the Generalised Kernel Covariance Measure (GKCM), a regression-model-agnostic kernel-based CI test that accommodates a broad class of regression estimators. Building on the Generalised Hilbertian Covariance Measure framework (Lundborg et al., 2022), we characterise conditions under which GKCM satisfies uniform asymptotic level guarantees. In simulations, GKCM paired with tree-based regression models frequently outperforms state-of-the-art CI tests across a diverse range of data-generating processes, achieving better type I error control and competitive or superior power.

View on arXiv PDF

Similar