LGAIPFAug 18, 2025

Hierarchical Evaluation Function: A Multi-Metric Approach for Optimizing Demand Forecasting Models

arXiv:2508.13057v5Expert syst appl
Originality Incremental advance
AI Analysis

This incremental improvement addresses demand forecasting in competitive business environments by enhancing model selection and forecast accuracy.

The paper tackled the problem of demand forecasting models being biased by single-metric optimization by proposing the Hierarchical Evaluation Function (HEF), a multi-metric framework integrating R2, RMSE, and MAE, which delivered superior results compared to unimetric approaches on benchmark datasets like Walmart, M3, M4, and M5.

Demand forecasting in competitive and uncertain business environments requires models that can integrate multiple evaluation perspectives, rather than being restricted to hyperparameter optimization through a single metric. This traditional approach tends to prioritize one error indicator, which can bias results when metrics provide contradictory signals. In this context, the Hierarchical Evaluation Function (HEF) is proposed as a multi-metric framework for hyperparameter optimization that integrates explanatory power (R2), sensitivity to extreme errors (RMSE), and average accuracy (MAE). The performance of HEF was assessed using four widely recognized benchmark datasets in the forecasting domain: the Walmart, M3, M4, and M5 datasets. Prediction models were optimized through Grid Search, Particle Swarm Optimization (PSO), and Optuna, and statistical analyses based on difference-of-proportions tests confirmed that HEF delivers superior results compared to a unimetric reference function, regardless of the optimizer employed, with particular relevance for heterogeneous monthly time series (M3) and highly granular daily demand scenarios (M5). The findings demonstrate that HEF improves stability, generalization, and robustness at a low computational cost, consolidating its role as a reliable evaluation framework that enhances model selection, enables more accurate demand forecasts, and supports decision-making in dynamic and competitive business environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes