LGNov 8, 2024

Streaming Bayes GFlowNets

Tiago da Silva, Daniel Augusto de Souza, Diego Mesquita

arXiv:2411.05899v19.26 citationsh-index: 4NIPS

Originality Incremental advance

AI Analysis

This work addresses the challenge of incremental Bayesian updates for discrete spaces, which is important for applications like preference learning and phylogenetics, though it is incremental as it builds on existing GFlowNets.

The paper tackles the problem of streaming Bayesian inference over discrete parameter spaces, where existing variational inference methods are intractable, by proposing streaming Bayes GFlowNets (SB-GFlowNets) that update posteriors efficiently with new data. The result is a method that is significantly faster than retraining from scratch, as demonstrated in case studies like linear preference learning and phylogenetic inference.

Bayes' rule naturally allows for inference refinement in a streaming fashion, without the need to recompute posteriors from scratch whenever new data arrives. In principle, Bayesian streaming is straightforward: we update our prior with the available data and use the resulting posterior as a prior when processing the next data chunk. In practice, however, this recipe entails i) approximating an intractable posterior at each time step; and ii) encapsulating results appropriately to allow for posterior propagation. For continuous state spaces, variational inference (VI) is particularly convenient due to its scalability and the tractability of variational posteriors. For discrete state spaces, however, state-of-the-art VI results in analytically intractable approximations that are ill-suited for streaming settings. To enable streaming Bayesian inference over discrete parameter spaces, we propose streaming Bayes GFlowNets (abbreviated as SB-GFlowNets) by leveraging the recently proposed GFlowNets -- a powerful class of amortized samplers for discrete compositional objects. Notably, SB-GFlowNet approximates the initial posterior using a standard GFlowNet and subsequently updates it using a tailored procedure that requires only the newly observed data. Our case studies in linear preference learning and phylogenetic inference showcase the effectiveness of SB-GFlowNets in sampling from an unnormalized posterior in a streaming setting. As expected, we also observe that SB-GFlowNets is significantly faster than repeatedly training a GFlowNet from scratch to sample from the full posterior.

View on arXiv PDF

Similar