LGJun 16, 2016

Learning Infinite-Layer Networks: Without the Kernel Trick

arXiv:1606.05316v27 citations
Originality Incremental advance
AI Analysis

This work addresses a theoretical bottleneck in machine learning by enabling kernel methods without exact kernel computations, which is incremental but broadens applicability in scenarios where kernels are hard to compute.

The authors tackled the problem of learning Infinite-Layer Networks without relying on the kernel trick, by developing an online algorithm that uses random features for kernel estimation. They showed that this method achieves sample complexity comparable to kernel-based approaches, demonstrating that the kernel trick is not necessary for efficient learning.

Infinite--Layer Networks (ILN) have recently been proposed as an architecture that mimics neural networks while enjoying some of the advantages of kernel methods. ILN are networks that integrate over infinitely many nodes within a single hidden layer. It has been demonstrated by several authors that the problem of learning ILN can be reduced to the kernel trick, implying that whenever a certain integral can be computed analytically they are efficiently learnable. In this work we give an online algorithm for ILN, which avoids the kernel trick assumption. More generally and of independent interest, we show that kernel methods in general can be exploited even when the kernel cannot be efficiently computed but can only be estimated via sampling. We provide a regret analysis for our algorithm, showing that it matches the sample complexity of methods which have access to kernel values. Thus, our method is the first to demonstrate that the kernel trick is not necessary as such, and random features suffice to obtain comparable performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes