ML LGAug 22, 2017

Learning Combinations of Sigmoids Through Gradient Estimation

arXiv:1708.06678v24.14 citations

Originality Incremental advance

AI Analysis

This provides a theoretical foundation for a specific class of hidden variable models, though it is incremental as it focuses on a toy model.

The paper tackles learning parameters of regression models with hidden variables by estimating gradients at random points and clustering them to estimate hidden unit parameters, proving non-asymptotic sample bounds for a toy model of linear combinations of sigmoids.

We develop a new approach to learn the parameters of regression models with hidden variables. In a nutshell, we estimate the gradient of the regression function at a set of random points, and cluster the estimated gradients. The centers of the clusters are used as estimates for the parameters of hidden units. We justify this approach by studying a toy model, whereby the regression function is a linear combination of sigmoids. We prove that indeed the estimated gradients concentrate around the parameter vectors of the hidden units, and provide non-asymptotic bounds on the number of required samples. To the best of our knowledge, no comparable guarantees have been proven for linear combinations of sigmoids.

View on arXiv PDF

Similar