LG, AI, CV | Oct 11, 2024

Efficient Hyperparameter Importance Assessment for CNNs

arXiv:2410.08920v1 | 4 citations | h-index: 2
Originality: Incremental advance
AI Analysis

This work gives machine learning practitioners a way to decide which CNN hyperparameters to tune first. The advance is incremental: it adapts existing hyperparameter importance assessment (HIA) methodologies to deep learning.

The paper addresses hyperparameter selection in CNNs by applying the N-RReliefF algorithm to quantify importance weights. From over ten thousand models trained across ten image classification datasets, it identifies the five most important hyperparameters among those investigated: number of convolutional layers, learning rate, dropout rate, optimizer, and epoch.

Hyperparameter selection is an essential aspect of the machine learning pipeline, profoundly impacting models' robustness, stability, and generalization capabilities. Given the complex hyperparameter spaces associated with Neural Networks and the constraints of computational resources and time, optimizing all hyperparameters becomes impractical. In this context, leveraging hyperparameter importance assessment (HIA) can provide valuable guidance by narrowing down the search space. This enables machine learning practitioners to focus their optimization efforts on the hyperparameters with the most significant impact on model performance while conserving time and resources. This paper aims to quantify the importance weights of some hyperparameters in Convolutional Neural Networks (CNNs) with an algorithm called N-RReliefF, laying the groundwork for applying HIA methodologies in the Deep Learning field. We conduct an extensive study by training over ten thousand CNN models across ten popular image classification datasets, thereby acquiring a comprehensive dataset containing hyperparameter configuration instances and their corresponding performance metrics. It is demonstrated that among the investigated hyperparameters, the top five important hyperparameters of the CNN model are the number of convolutional layers, learning rate, dropout rate, optimizer and epoch.
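To make the idea of importance weights concrete, here is a minimal sketch of a simplified RReliefF-style estimate computed from a table of hyperparameter configurations and their scores. It is not the paper's implementation. The uniform 1/k neighbour weighting, the L1 distance, and the final clip-and-rescale step in `normalise` are assumptions (the last one is only a guess at what the "N" in N-RReliefF might stand for). The column names and the data are synthetic and purely illustrative.

```python
import numpy as np


def rrelieff_weights(X, y, k=10):
    """Simplified RReliefF: one weight per column of X for a numeric target y."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, p = X.shape
    k = min(k, n - 1)

    # Min-max scale so every per-feature and target difference lies in [0, 1].
    x_span = X.max(axis=0) - X.min(axis=0)
    x_span[x_span == 0] = 1.0
    Xs = (X - X.min(axis=0)) / x_span
    y_span = y.max() - y.min()
    ys = (y - y.min()) / (y_span if y_span > 0 else 1.0)

    n_dc = 0.0              # accumulated difference in the target
    n_da = np.zeros(p)      # accumulated difference per feature
    n_dcda = np.zeros(p)    # accumulated joint difference (target and feature)

    for i in range(n):
        dist = np.abs(Xs - Xs[i]).sum(axis=1)   # L1 distance to every instance
        dist[i] = np.inf                         # exclude the instance itself
        nn = np.argsort(dist)[:k]                # indices of the k nearest neighbours

        d_a = np.abs(Xs[nn] - Xs[i])             # shape (k, p)
        d_c = np.abs(ys[nn] - ys[i])             # shape (k,)

        # Assumption: each neighbour contributes equally (weight 1/k).
        n_dc += d_c.sum() / k
        n_da += d_a.sum(axis=0) / k
        n_dcda += (d_c[:, None] * d_a).sum(axis=0) / k

    if n_dc == 0 or n_dc == n:
        raise ValueError("target has no usable variation among neighbours")

    # RReliefF weight: W[A] = N_dC&dA[A] / N_dC - (N_dA[A] - N_dC&dA[A]) / (m - N_dC)
    return n_dcda / n_dc - (n_da - n_dcda) / (n - n_dc)


def normalise(weights):
    """Assumed normalisation: clip negative weights to zero, rescale to sum to 1."""
    w = np.clip(weights, 0.0, None)
    total = w.sum()
    return w / total if total > 0 else np.full_like(w, 1.0 / len(w))


# Synthetic example: only the first column influences the score.
rng = np.random.default_rng(0)
names = ["learning_rate", "dropout_rate", "batch_size"]  # hypothetical labels
X = rng.uniform(size=(300, 3))
y = 3.0 * X[:, 0] + rng.normal(scale=0.01, size=300)

importance = normalise(rrelieff_weights(X, y, k=10))
for name, w in zip(names, importance):
    print(f"{name}: {w:.3f}")
```

In this toy setting the first column should receive the largest share of the weight, since it alone drives the score. In a study like the one described, each row of `X` would be one trained configuration and `y` its measured performance.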

