LGAINEJan 4, 2025

Bridge the Inference Gaps of Neural Processes via Expectation Maximization

Tsinghua
arXiv:2501.03264v121 citationsh-index: 28Has CodeICLR
Originality Incremental advance
AI Analysis

This work addresses inference suboptimality in neural processes, a family of models for learning distributions over functions, offering an incremental improvement for practitioners in meta-learning and function approximation.

The paper tackles the under-fitting and suboptimal performance of neural processes by proposing a surrogate objective within an expectation maximization framework, resulting in the Self-normalized Importance weighted Neural Process (SI-NP), which shows competitive performance and can achieve state-of-the-art results when combined with structural biases like attention modules.

The neural process (NP) is a family of computationally efficient models for learning distributions over functions. However, it suffers from under-fitting and shows suboptimal performance in practice. Researchers have primarily focused on incorporating diverse structural inductive biases, \textit{e.g.} attention or convolution, in modeling. The topic of inference suboptimality and an analysis of the NP from the optimization objective perspective has hardly been studied in earlier work. To fix this issue, we propose a surrogate objective of the target log-likelihood of the meta dataset within the expectation maximization framework. The resulting model, referred to as the Self-normalized Importance weighted Neural Process (SI-NP), can learn a more accurate functional prior and has an improvement guarantee concerning the target log-likelihood. Experimental results show the competitive performance of SI-NP over other NPs objectives and illustrate that structural inductive biases, such as attention modules, can also augment our method to achieve SOTA performance. Our code is available at \url{https://github.com/hhq123gogogo/SI_NPs}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes