DCLGDATA-ANJan 13, 2023

Hyperparameter Optimization as a Service on INFN Cloud

arXiv:2301.05522v31 citationsh-index: 16
Originality Synthesis-oriented
AI Analysis

This provides a solution for researchers needing secure, multi-site hyperparameter optimization, but it is incremental as it builds on existing tools like Optuna.

The paper tackles the challenge of coordinating hyperparameter optimization across distributed resources by introducing Hopaas, a service on INFN Cloud that uses REST APIs and Bayesian techniques, which boosted development of parameterizations for LHCb experiment simulation.

The simplest and often most effective way of parallelizing the training of complex machine learning models is to execute several training instances on multiple machines, scanning the hyperparameter space to optimize the underlying statistical model and the learning procedure. Often, such a meta-learning procedure is limited by the ability of accessing securely a common database organizing the knowledge of the previous and ongoing trials. Exploiting opportunistic GPUs provided in different environments represents a further challenge when designing such optimization campaigns. In this contribution, we discuss how a set of REST APIs can be used to access a dedicated service based on INFN Cloud to monitor and coordinate multiple training instances, with gradient-less optimization techniques, via simple HTTP requests. The service, called Hopaas (Hyperparameter OPtimization As A Service), is made of a web interface and sets of APIs implemented with a FastAPI backend running through Uvicorn and NGINX in a virtual instance of INFN Cloud. The optimization algorithms are currently based on Bayesian techniques as provided by Optuna. A Python frontend is also made available for quick prototyping. We present applications to hyperparameter optimization campaigns performed by combining private, INFN Cloud, and CINECA resources. Such multi-node multi-site optimization studies have given a significant boost to the development of a set of parameterizations for the ultra-fast simulation of the LHCb experiment.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes