LG MLMay 21, 2012

Conditional mean embeddings as regressors - supplementary

Steffen Grünewälder, Guy Lever, Luca Baldassarre, Sam Patterson, Arthur Gretton, Massimilano Pontil

arXiv:1205.4656v2163 citations

AI Analysis

This work provides a theoretical foundation and improved convergence rates for learning conditional distributions, with applications in reinforcement learning, though it is incremental in linking existing methods.

The paper establishes an equivalence between RKHS embeddings of conditional distributions and vector-valued regressors, enabling the derivation of a sparse embedding algorithm and achieving nearly optimal minimax convergence rates of O(log(n)/n), compared to the previous state-of-the-art O(n^{-1/4}).

We demonstrate an equivalence between reproducing kernel Hilbert space (RKHS) embeddings of conditional distributions and vector-valued regressors. This connection introduces a natural regularized loss function which the RKHS embeddings minimise, providing an intuitive understanding of the embeddings and a justification for their use. Furthermore, the equivalence allows the application of vector-valued regression methods and results to the problem of learning conditional distributions. Using this link we derive a sparse version of the embedding by considering alternative formulations. Further, by applying convergence results for vector-valued regression to the embedding problem we derive minimax convergence rates which are O(\log(n)/n) -- compared to current state of the art rates of O(n^{-1/4}) -- and are valid under milder and more intuitive assumptions. These minimax upper rates coincide with lower rates up to a logarithmic factor, showing that the embedding method achieves nearly optimal rates. We study our sparse embedding algorithm in a reinforcement learning task where the algorithm shows significant improvement in sparsity over an incomplete Cholesky decomposition.

View on arXiv PDF

Similar