Pseudo-likelihood produces associative memories able to generalize, even for asymmetric couplings

arXiv:2507.05147v15 citationsh-index: 2
Originality Highly original
AI Analysis

This provides a principled mechanism for memory and generalization in probabilistic models, addressing a fundamental bottleneck in machine learning and AI with potential applications across domains like computer science, physics, and biology.

The authors tackled the intractability of partition functions in energy-based probabilistic models by using pseudo-likelihood, showing that in the zero-temperature limit, networks trained this way implement associative memories with larger basins of attraction than classical Hopfield rules for small training sets, and generalize to structured datasets like MNIST and proteins as training examples increase.

Energy-based probabilistic models learned by maximizing the likelihood of the data are limited by the intractability of the partition function. A widely used workaround is to maximize the pseudo-likelihood, which replaces the global normalization with tractable local normalizations. Here we show that, in the zero-temperature limit, a network trained to maximize pseudo-likelihood naturally implements an associative memory: if the training set is small, patterns become fixed-point attractors whose basins of attraction exceed those of any classical Hopfield rule. We explain quantitatively this effect on uncorrelated random patterns. Moreover, we show that, for different structured datasets coming from computer science (random feature model, MNIST), physics (spin glasses) and biology (proteins), as the number of training examples increases the learned network goes beyond memorization, developing meaningful attractors with non-trivial correlations with test examples, thus showing the ability to generalize. Our results therefore reveal pseudo-likelihood works both as an efficient inference tool and as a principled mechanism for memory and generalization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes