LGJan 15

Discrete Feynman-Kac Correctors

arXiv:2601.10403v17 citationsh-index: 56
Originality Incremental advance
AI Analysis

This work addresses the need for flexible sample control in discrete diffusion models for researchers and practitioners in machine learning, offering a novel inference-time method that is incremental but impactful for specific domains.

The paper tackles the problem of controlling the distribution of generated samples in discrete diffusion models, proposing a framework that enables inference-time control for tasks like annealing, combining multiple processes, and reward integration without additional training, achieving improvements in applications such as code generation and protein sequence design.

Discrete diffusion models have recently emerged as a promising alternative to the autoregressive approach for generating discrete sequences. Sample generation via gradual denoising or demasking processes allows them to capture hierarchical non-sequential interdependencies in the data. These custom processes, however, do not assume a flexible control over the distribution of generated samples. We propose Discrete Feynman-Kac Correctors, a framework that allows for controlling the generated distribution of discrete masked diffusion models at inference time. We derive Sequential Monte Carlo (SMC) algorithms that, given a trained discrete diffusion model, control the temperature of the sampled distribution (i.e. perform annealing), sample from the product of marginals of several diffusion processes (e.g. differently conditioned processes), and sample from the product of the marginal with an external reward function, producing likely samples from the target distribution that also have high reward. Notably, our framework does not require any training of additional models or fine-tuning of the original model. We illustrate the utility of our framework in several applications including: efficient sampling from the annealed Boltzmann distribution of the Ising model, improving the performance of language models for code generation and amortized learning, as well as reward-tilted protein sequence generation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes