MLLGAug 22, 2017

Learning Combinations of Sigmoids Through Gradient Estimation

arXiv:1708.06678v24 citations
Originality Incremental advance
AI Analysis

This provides a theoretical foundation for a specific class of hidden variable models, though it is incremental as it focuses on a toy model.

The paper tackles learning parameters of regression models with hidden variables by estimating gradients at random points and clustering them to estimate hidden unit parameters, proving non-asymptotic sample bounds for a toy model of linear combinations of sigmoids.

We develop a new approach to learn the parameters of regression models with hidden variables. In a nutshell, we estimate the gradient of the regression function at a set of random points, and cluster the estimated gradients. The centers of the clusters are used as estimates for the parameters of hidden units. We justify this approach by studying a toy model, whereby the regression function is a linear combination of sigmoids. We prove that indeed the estimated gradients concentrate around the parameter vectors of the hidden units, and provide non-asymptotic bounds on the number of required samples. To the best of our knowledge, no comparable guarantees have been proven for linear combinations of sigmoids.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes