A self-calibrating method for heavy tailed data modelling. Application in neuroscience and finance
Provides a parametric modeling approach for multi-component heavy-tailed data in neuroscience and finance, but the method is domain-specific and incremental.
The paper introduces a self-calibrating method for modeling multi-component heavy-tailed data using a three-component hybrid distribution (Gaussian, exponential, Generalized Pareto). The method is validated on simulated data and applied to neuroscience and finance datasets, showing practical advantages over standard Extreme Value Theory approaches.
Modelling non-homogeneous and multi-component data is a problem that challenges scientific researchers in several fields. In general, it is not possible to find a simple and closed form probabilistic model to describe such data. That is why one often resorts to non-parametric approaches. However, when the multiple components are separable, parametric modelling becomes again tractable. In this study, we propose a self-calibrating method to model multi-component data that exhibit heavy tails. We introduce a three-component hybrid distribution: a Gaussian distribution is linked to a Generalized Pareto one via an exponential distribution that bridges the gap between mean and tail behaviors. An unsupervised algorithm is then developed for estimating the parameters of this model. We study analytically and numerically its convergence. The effectiveness of the self-calibrating method is tested on simulated data, before applying it to real data from neuroscience and finance, respectively. A comparison with other standard Extreme Value Theory approaches confirms the relevance and the practical advantage of this new method.