LGMay 11, 2023

Towards Understanding and Improving GFlowNet Training

arXiv:2305.07170v182 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of making GFlowNets more efficient and effective for sampling discrete objects in domains like biochemical design, representing an incremental improvement with specific gains.

The paper tackled the problem of understanding and improving the training of Generative Flow Networks (GFlowNets) under practical resource limits, introducing evaluation strategies and methods like prioritized replay and a guided trajectory balance objective, which substantially improved sample efficiency on biochemical design tasks.

Generative flow networks (GFlowNets) are a family of algorithms that learn a generative policy to sample discrete objects $x$ with non-negative reward $R(x)$. Learning objectives guarantee the GFlowNet samples $x$ from the target distribution $p^*(x) \propto R(x)$ when loss is globally minimized over all states or trajectories, but it is unclear how well they perform with practical limits on training resources. We introduce an efficient evaluation strategy to compare the learned sampling distribution to the target reward distribution. As flows can be underdetermined given training data, we clarify the importance of learned flows to generalization and matching $p^*(x)$ in practice. We investigate how to learn better flows, and propose (i) prioritized replay training of high-reward $x$, (ii) relative edge flow policy parametrization, and (iii) a novel guided trajectory balance objective, and show how it can solve a substructure credit assignment problem. We substantially improve sample efficiency on biochemical design tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes