NCSTAT-MECHMLMay 24, 2016

Semiparametric energy-based probabilistic models

arXiv:1605.07371v14 citations
Originality Incremental advance
AI Analysis

This provides a more flexible modeling approach for datasets with broad probability ranges, potentially benefiting fields like neuroscience and machine learning, though it is an incremental extension of existing energy-based models.

The paper tackles the problem of modeling complex probability distributions by generalizing energy-based models to use an arbitrary decreasing function mapping energies to probabilities, which is learned from data. As a result, when applied to neural activity data from retinal ganglion cells, the model accurately captures distributions with large dynamic ranges, attributed to its ability to handle globally coupled latent variables.

Probabilistic models can be defined by an energy function, where the probability of each state is proportional to the exponential of the state's negative energy. This paper considers a generalization of energy-based models in which the probability of a state is proportional to an arbitrary positive, strictly decreasing, and twice differentiable function of the state's energy. The precise shape of the nonlinear map from energies to unnormalized probabilities has to be learned from data together with the parameters of the energy function. As a case study we show that the above generalization of a fully visible Boltzmann machine yields an accurate model of neural activity of retinal ganglion cells. We attribute this success to the model's ability to easily capture distributions whose probabilities span a large dynamic range, a possible consequence of latent variables that globally couple the system. Similar features have recently been observed in many datasets, suggesting that our new method has wide applicability.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes