LG AISep 7, 2023

Interactive Hyperparameter Optimization in Multi-Objective Problems via Preference Learning

Joseph Giovanelli, Alexander Tornede, Tanja Tornede, Marius Lindauer

arXiv:2309.03581v310.79 citationsh-index: 12Has Code

Originality Incremental advance

AI Analysis

This work addresses the problem of simplifying hyperparameter optimization for users in multi-objective machine learning, particularly in domains like environmental impact, by reducing reliance on expert knowledge, though it is incremental as it builds on existing HPO and preference learning methods.

The paper tackles the challenge of hyperparameter optimization in multi-objective machine learning by proposing an interactive approach that uses preference learning to automatically learn a quality indicator from user comparisons, eliminating the need for users to pre-select an indicator. In experiments on environmental impact, it shows substantially better Pareto fronts compared to using a wrong pre-selected indicator and performs comparably when the correct indicator is chosen.

Hyperparameter optimization (HPO) is important to leverage the full potential of machine learning (ML). In practice, users are often interested in multi-objective (MO) problems, i.e., optimizing potentially conflicting objectives, like accuracy and energy consumption. To tackle this, the vast majority of MO-ML algorithms return a Pareto front of non-dominated machine learning models to the user. Optimizing the hyperparameters of such algorithms is non-trivial as evaluating a hyperparameter configuration entails evaluating the quality of the resulting Pareto front. In literature, there are known indicators that assess the quality of a Pareto front (e.g., hypervolume, R2) by quantifying different properties (e.g., volume, proximity to a reference point). However, choosing the indicator that leads to the desired Pareto front might be a hard task for a user. In this paper, we propose a human-centered interactive HPO approach tailored towards multi-objective ML leveraging preference learning to extract desiderata from users that guide the optimization. Instead of relying on the user guessing the most suitable indicator for their needs, our approach automatically learns an appropriate indicator. Concretely, we leverage pairwise comparisons of distinct Pareto fronts to learn such an appropriate quality indicator. Then, we optimize the hyperparameters of the underlying MO-ML algorithm towards this learned indicator using a state-of-the-art HPO approach. In an experimental study targeting the environmental impact of ML, we demonstrate that our approach leads to substantially better Pareto fronts compared to optimizing based on a wrong indicator pre-selected by the user, and performs comparable in the case of an advanced user knowing which indicator to pick.

View on arXiv PDF Code

Similar