LGNov 13, 2024

Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval

arXiv:2411.08590v55 citationsh-index: 4Has Code
Originality Incremental advance
AI Analysis

This work provides a unified framework for associative memory retrieval, which is incremental as it builds upon existing Hopfield network variants to offer broader applicability and new insights.

The authors tackled the problem of unifying and extending associative memory models like Hopfield networks by introducing Hopfield-Fenchel-Young networks, a framework that generalizes energy functions using Fenchel-Young losses, enabling sparse transformations and structured retrieval, and validated it on tasks such as image retrieval and text rationalization with demonstrated effectiveness.

Associative memory models, such as Hopfield networks and their modern variants, have garnered renewed interest due to advancements in memory capacity and connections with self-attention in transformers. In this work, we introduce a unified framework-Hopfield-Fenchel-Young networks-which generalizes these models to a broader family of energy functions. Our energies are formulated as the difference between two Fenchel-Young losses: one, parameterized by a generalized entropy, defines the Hopfield scoring mechanism, while the other applies a post-transformation to the Hopfield output. By utilizing Tsallis and norm entropies, we derive end-to-end differentiable update rules that enable sparse transformations, uncovering new connections between loss margins, sparsity, and exact retrieval of single memory patterns. We further extend this framework to structured Hopfield networks using the SparseMAP transformation, allowing the retrieval of pattern associations rather than a single pattern. Our framework unifies and extends traditional and modern Hopfield networks and provides an energy minimization perspective for widely used post-transformations like $\ell_2$-normalization and layer normalization-all through suitable choices of Fenchel-Young losses and by using convex analysis as a building block. Finally, we validate our Hopfield-Fenchel-Young networks on diverse memory recall tasks, including free and sequential recall. Experiments on simulated data, image retrieval, multiple instance learning, and text rationalization demonstrate the effectiveness of our approach.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes