LG MLMar 10, 2025

Learning Energy-Based Models by Self-normalising the Likelihood

Hugo Senetaire, Paul Jeha, Pierre-Alexandre Mattei, Jes Frellsen

arXiv:2503.07021v12 citationsh-index: 23

Originality Incremental advance

AI Analysis

This addresses the problem of intractable normalization in EBMs for machine learning practitioners, offering a simpler and more effective method, though it appears incremental as it builds on existing likelihood-based approaches.

The paper tackles the challenge of training energy-based models (EBMs) with maximum likelihood by proposing a self-normalised log-likelihood (SNL) objective that avoids expensive MCMC sampling, and results show it outperforms existing techniques on density estimation and regression tasks.

Training an energy-based model (EBM) with maximum likelihood is challenging due to the intractable normalisation constant. Traditional methods rely on expensive Markov chain Monte Carlo (MCMC) sampling to estimate the gradient of logartihm of the normalisation constant. We propose a novel objective called self-normalised log-likelihood (SNL) that introduces a single additional learnable parameter representing the normalisation constant compared to the regular log-likelihood. SNL is a lower bound of the log-likelihood, and its optimum corresponds to both the maximum likelihood estimate of the model parameters and the normalisation constant. We show that the SNL objective is concave in the model parameters for exponential family distributions. Unlike the regular log-likelihood, the SNL can be directly optimised using stochastic gradient techniques by sampling from a crude proposal distribution. We validate the effectiveness of our proposed method on various density estimation tasks as well as EBMs for regression. Our results show that the proposed method, while simpler to implement and tune, outperforms existing techniques.

View on arXiv PDF

Similar