OC LG MLSep 6, 2014

Global Convergence of Online Limited Memory BFGS

arXiv:1409.2045v1171 citations

AI Analysis

This provides a more efficient optimization method for large-scale machine learning applications, though it is incremental as it builds on existing quasi-Newton techniques.

The paper tackles the problem of optimizing stochastic objectives in large-scale machine learning by establishing global convergence for an online limited memory BFGS method, showing it reduces convergence time compared to stochastic gradient descent and lowers storage and computation versus other online quasi-Newton methods in experiments on support vector machines and a search engine advertising problem.

Global convergence of an online (stochastic) limited memory version of the Broyden-Fletcher- Goldfarb-Shanno (BFGS) quasi-Newton method for solving optimization problems with stochastic objectives that arise in large scale machine learning is established. Lower and upper bounds on the Hessian eigenvalues of the sample functions are shown to suffice to guarantee that the curvature approximation matrices have bounded determinants and traces, which, in turn, permits establishing convergence to optimal arguments with probability 1. Numerical experiments on support vector machines with synthetic data showcase reductions in convergence time relative to stochastic gradient descent algorithms as well as reductions in storage and computation relative to other online quasi-Newton methods. Experimental evaluation on a search engine advertising problem corroborates that these advantages also manifest in practical applications.

View on arXiv PDF

Similar