MLLGOCOct 12, 2018

Safe Grid Search with Optimal Complexity

arXiv:1810.05471v312 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of hyperparameter tuning for practitioners, offering an automated method with theoretical guarantees, though it is incremental as it refines existing path approximation techniques.

The paper tackles the problem of tuning regularization parameters in machine learning estimators by developing a unified framework that approximates the regularization path with complexity O(1/√[d]ε) for uniformly convex losses and O(1/√ε) for Generalized Self-Concordant functions, and provides a practical algorithm with global convergence guarantees for hyperparameter tuning.

Popular machine learning estimators involve regularization parameters that can be challenging to tune, and standard strategies rely on grid search for this task. In this paper, we revisit the techniques of approximating the regularization path up to predefined tolerance $ε$ in a unified framework and show that its complexity is $O(1/\sqrt[d]ε)$ for uniformly convex loss of order $d \geq 2$ and $O(1/\sqrtε)$ for Generalized Self-Concordant functions. This framework encompasses least-squares but also logistic regression, a case that as far as we know was not handled as precisely in previous works. We leverage our technique to provide refined bounds on the validation error as well as a practical algorithm for hyperparameter tuning. The latter has global convergence guarantee when targeting a prescribed accuracy on the validation set. Last but not least, our approach helps relieving the practitioner from the (often neglected) task of selecting a stopping criterion when optimizing over the training set: our method automatically calibrates this criterion based on the targeted accuracy on the validation set.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes