LGAIApr 9, 2025

Hyperparameter Optimisation with Practical Interpretability and Explanation Methods in Probabilistic Curriculum Learning

arXiv:2504.06683v1h-index: 3
Originality Incremental advance
AI Analysis

This work addresses the computationally demanding problem of hyperparameter tuning for researchers and practitioners in reinforcement learning, though it is incremental as it builds on existing methods like TPE and SHAP.

The paper tackled the challenge of hyperparameter optimisation in probabilistic curriculum learning for reinforcement learning by analyzing hyperparameter interactions and introducing a SHAP-based interpretability method, resulting in improved optimisation efficiency and clearer insights into hyperparameter impacts.

Hyperparameter optimisation (HPO) is crucial for achieving strong performance in reinforcement learning (RL), as RL algorithms are inherently sensitive to hyperparameter settings. Probabilistic Curriculum Learning (PCL) is a curriculum learning strategy designed to improve RL performance by structuring the agent's learning process, yet effective hyperparameter tuning remains challenging and computationally demanding. In this paper, we provide an empirical analysis of hyperparameter interactions and their effects on the performance of a PCL algorithm within standard RL tasks, including point-maze navigation and DC motor control. Using the AlgOS framework integrated with Optuna's Tree-Structured Parzen Estimator (TPE), we present strategies to refine hyperparameter search spaces, enhancing optimisation efficiency. Additionally, we introduce a novel SHAP-based interpretability approach tailored specifically for analysing hyperparameter impacts, offering clear insights into how individual hyperparameters and their interactions influence RL performance. Our work contributes practical guidelines and interpretability tools that significantly improve the effectiveness and computational feasibility of hyperparameter optimisation in reinforcement learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes